Teknium's picture

Teknium

teknium

·

https://github.com/teknium1

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Long Context Pre-Training with Lighthouse Attention

upvoted a paper 9 days ago

Efficient Pre-Training with Token Superposition

published a model 3 months ago

NousResearch/moe-10b-a1b-8k-wsd-lr3e4-1t

View all activity

Organizations

upvoted a paper 7 days ago

Long Context Pre-Training with Lighthouse Attention

Paper • 2605.06554 • Published 16 days ago • 27

upvoted a paper 9 days ago

Efficient Pre-Training with Token Superposition

Paper • 2605.06546 • Published 16 days ago • 43

published a model 3 months ago

NousResearch/moe-10b-a1b-8k-wsd-lr3e4-1t

10B • Updated Apr 1 • 207 • 12

liked a dataset 3 months ago

NousResearch/openthoughts-tblite

Viewer • Updated Mar 4 • 100 • 1.04k • 8

updated a dataset 3 months ago

NousResearch/openthoughts-tblite

Viewer • Updated Mar 4 • 100 • 1.04k • 8

published a dataset 3 months ago

NousResearch/openthoughts-tblite

Viewer • Updated Mar 4 • 100 • 1.04k • 8

New activity in zai-org/GLM-4.7-Flash 3 months ago

Base model

#2 opened 4 months ago by

updated a dataset 3 months ago

NousResearch/terminal-bench-2

Viewer • Updated Feb 10 • 89 • 1.06k • 3

published a dataset 3 months ago

NousResearch/terminal-bench-2

Viewer • Updated Feb 10 • 89 • 1.06k • 3

updated a model 3 months ago

NousResearch/Kimi-K2-Thinking-Alternate-Tokenizer

Updated Feb 10 • 6

New activity in google/extended_amazon_2023_dataset 4 months ago

Does this dataset keep going private to public over and over

#2 opened 4 months ago by

updated 2 models 4 months ago

NousResearch/Kimi-K2-Thinking-Alternate-Tokenizer

Updated Feb 10 • 6

NousResearch/Kimi-K2-Thinking-Alternate-Tokenizer

Updated Feb 10 • 6