Jie Wang

Everloom

AI & ML interests

Foundation Models for Robot Learning

Recent Activity

liked a model 12 days ago

pyannote/speaker-diarization-community-1

upvoted a collection 17 days ago

MolmoAct2-BimanualYAM Dataset

liked a dataset 17 days ago

RoboVerseOrg/roboverse_data

Organizations

None yet

upvoted a collection 17 days ago

MolmoAct2-BimanualYAM Dataset

Collection

Collection of the MolmoAct2-BimanualYAM Dataset • 740 items • Updated 17 days ago • 14

upvoted 3 papers 7 months ago

upvoted an article 7 months ago

Article

🕳️ Attention Sinks in LLMs for endless fluency

tomaarsen • Oct 9, 2023 • 37

upvoted a paper 9 months ago

AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies

Paper • 2508.08113 • Published Aug 11, 2025 • 11

upvoted an article 10 months ago

Article

Vision Language Models Explained

merve, edbeeching • Apr 11, 2024 • 531

upvoted a paper 11 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2, 2025 • 160

upvoted an article 11 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

merve, andsteing, pcuenq • May 14, 2024 • 287

upvoted a collection 11 months ago

PaliGemma FT Models

Collection

108 items • Updated Mar 12 • 35

upvoted a paper 11 months ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 73

upvoted an article about 1 year ago

Article

Mixture of Experts Explained

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.13k

upvoted a collection almost 2 years ago

PEFT papers

Collection

A collection of methods that have been implemented in the 🤗 PEFT library • 12 items • Updated Jan 30, 2024 • 32

upvoted a paper almost 2 years ago

BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation

Paper • 2407.17952 • Published Jul 25, 2024 • 32

upvoted an article almost 2 years ago

Article

💃Introducing the first LLM-based Motion understanding model: MotionLLM

EvanTHU • Jun 26, 2024 • 4
