Nathan Habib's picture

Building on HF

Nathan Habib PRO

SaylorTwift

huggingface

·

AI & ML interests

Evals

Recent Activity

new activity about 19 hours ago

stepfun-ai/Step-3.7-Flash:Add SWE-bench Pro evaluation result

new activity about 19 hours ago

stepfun-ai/Step-3.7-Flash:Add HLE with tools evaluation result

liked a model about 19 hours ago

stepfun-ai/Step-3.7-Flash

View all activity

Organizations

upvoted a paper about 21 hours ago

LFM2 Technical Report

Paper • 2511.23404 • Published Nov 28, 2025 • 61

upvoted a changelog 1 day ago

Hugging Face Changelog

Filter Models page by Base Models only

1 day ago

• 51

upvoted a paper 5 days ago

CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?

Paper • 2605.16679 • Published 15 days ago • 53

upvoted a changelog 5 days ago

Hugging Face Changelog

Copy Repo Contents to Buckets Instantly

8 days ago

• 59

upvoted a changelog 8 days ago

Hugging Face Changelog

Filter Leaderboards by Model Size

10 days ago

• 104

upvoted a paper 11 days ago

Watch Before You Answer: Learning from Visually Grounded Post-Training

Paper • 2604.05117 • Published Apr 6 • 36

upvoted an article 18 days ago

Article

EMO: Pretraining mixture of experts for emergent modularity

allenai

•

22 days ago

• 38

upvoted an article 19 days ago

Article

Two Years of Local AI on a Laptop: When Open Models Outpaced Moore's Law

mishig

•

19 days ago

• 23

upvoted an article 23 days ago

Article

Introducing the agentic robotics appstore for 10,000 Reachy Minis

clem

•

24 days ago

• 35

upvoted a collection 24 days ago

MediaTech

Collection of public datasets from the French administration, chunked, vectorized and ready to use in AI projects. • 9 items • Updated Feb 4 • 10

upvoted a paper 25 days ago

COMPOSITE-Stem

Paper • 2604.09836 • Published Apr 10 • 3

upvoted a paper 26 days ago

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Paper • 2604.28139 • Published about 1 month ago • 42

upvoted a changelog about 1 month ago

Hugging Face Changelog

Spaces agents.md for your coding agents

Apr 17

• 325

upvoted an article about 1 month ago

Article

DeepSeek-V4: a million-token context that agents can actually use

burtenshaw

•

Apr 24

• 47

upvoted a changelog about 1 month ago

Hugging Face Changelog

Agent Traces on the Hub

Apr 7

• 138

upvoted an article about 1 month ago

Article

The PR you would have opened yourself

pcuenq, awni

•

Apr 16

• 72

upvoted a paper about 2 months ago

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published Apr 13 • 72

upvoted 2 articles about 2 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

+5

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 902

Article

Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents

ibm-granite

•

Mar 31

• 34

upvoted an article 2 months ago

Article

Liberate your OpenClaw

+6

clem, burtenshaw, pcuenq, jeffboudier, merve, nielsr, victor, mishig

•

Mar 27

• 46