Guille Pérez-Torró

guishe

AI & ML interests

Information Retrieval, Few-Shot Learning, Named Entity Recognition, Named Entity Disambiguation, Semantic Search, Aspect-based Sentiment Analysis

Recent Activity

upvoted an article 7 days ago

🛡️ Nemotron PII: Synthesized Data for Privacy-Preserving AI

upvoted a paper 7 days ago

Next-Embedding Prediction Makes Strong Vision Learners

upvoted a paper 17 days ago

LLM-JEPA: Large Language Models Meet Joint Embedding Predictive Architectures

View all activity

Organizations

upvoted an article 7 days ago

Article

🛡️ Nemotron PII: Synthesized Data for Privacy-Preserving AI

nvidia

•

Oct 28, 2025

• 35

upvoted a paper 7 days ago

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 90

upvoted a paper 17 days ago

LLM-JEPA: Large Language Models Meet Joint Embedding Predictive Architectures

Paper • 2509.14252 • Published Sep 11, 2025 • 7

liked a Space 17 days ago

JEPA Wiki

🧠

Explore the JEPA research knowledge base

upvoted a paper 20 days ago

JEPA-Reasoner: Decoupling Latent Reasoning from Token Generation

Paper • 2512.19171 • Published Dec 22, 2025 • 4

upvoted an article 30 days ago

Article

mmBERT: ModernBERT goes Multilingual

mmarone, orionweller, will-fleshman, eugene-yang, dlawrie, vandurme

•

Sep 9, 2025

• 147

upvoted a paper 2 months ago

Reinforcement Learning for Self-Improving Agent with Skill Library

Paper • 2512.17102 • Published Dec 18, 2025 • 42

liked a Space 2 months ago

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

📝

235

Explore synthetic data experiments on a virtual bookshelf

upvoted a paper 3 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 72

liked 3 models 4 months ago

liked a Space 5 months ago

Evaluation Guidebook

📝

320

Explore LLM benchmark trends over time

upvoted an article 6 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 293

upvoted an article 7 months ago

Article

Merge Large Language Models with mergekit

mlabonne

•

Jan 9, 2024

• 155

liked a Space 7 months ago

The Smol Training Playbook

📚

3.18k

The secrets to building world-class LLMs

upvoted an article 7 months ago

Article

Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text

isaacchung

•

Oct 20, 2025

• 38

upvoted an article 8 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

tomaarsen, Xenova, alvarobartt, ariG23498, pcuenq, sergiopaniego

•

Sep 4, 2025

• 274

liked a model 9 months ago

unsloth/Qwen3-4B-Instruct-2507-unsloth-bnb-4bit

Text Generation • Updated Aug 6, 2025 • 123k • 14

liked a model 10 months ago

unsloth/gpt-oss-20b-bnb-4bit

Text Generation • 21B • Updated Aug 6, 2025 • 31.4k • 14

Guille Pérez-Torró

AI & ML interests

Recent Activity

Organizations

guishe's activity

🛡️ Nemotron PII: Synthesized Data for Privacy-Preserving AI

JEPA Wiki

mmBERT: ModernBERT goes Multilingual

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

Evaluation Guidebook

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Merge Large Language Models with mergekit

The Smol Training Playbook

Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text

Welcome EmbeddingGemma, Google's new efficient embedding model