Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
1
13
2
Suchir Salhan
suchirsalhan
Follow
RanaGaber's profile picture
Moibe's profile picture
bbunzeck's profile picture
14 followers
·
32 following
https://www.suchirsalhan.com/
suchirsalhan
suchirsalhan
ssalhan
AI & ML interests
Multilinguality and Cognitively-Inspired AI. Tokenization, Pretraining, Interpretability & Alignment.
Recent Activity
updated
a model
about 1 hour ago
MultilingualUnigramLM/ft-langmap-qwen2_5-0_5b-fineweb100M-fin-original
published
a model
about 1 hour ago
MultilingualUnigramLM/ft-langmap-qwen2_5-0_5b-fineweb100M-fin-original
updated
a model
about 1 hour ago
MultilingualUnigramLM/ft-langmap-gemma3-1b-fineweb100M-hun-original
View all activity
Organizations
suchirsalhan
's datasets
9
Sort: Recently updated
suchirsalhan/kidalign-llama-filterable
Viewer
•
Updated
Apr 14
•
97.6k
•
132
suchirsalhan/kidalign-llama-3.1-8B-Instruct
Updated
Apr 14
•
32
suchirsalhan/babylm-detox
Viewer
•
Updated
Apr 8
•
11.6M
•
39
suchirsalhan/gptbert-tokenised
Updated
Jul 24, 2025
•
3
suchirsalhan/Phonemized-UD
Viewer
•
Updated
May 30, 2025
•
1.19M
•
81
suchirsalhan/BabyLM-Pretokenised
Viewer
•
Updated
Jan 31, 2025
•
1.64M
•
12
suchirsalhan/MAO-CHILDES
Viewer
•
Updated
Apr 11, 2024
•
3.81M
•
14
suchirsalhan/CLiMP
Preview
•
Updated
Apr 2, 2024
•
25
•
1
suchirsalhan/SLING
Viewer
•
Updated
Apr 2, 2024
•
40k
•
77