Gibran Iqbal

Jibbscript

AI & ML interests

None yet

Recent Activity

upvoted a paper about 14 hours ago

Large Language Models Explore by Latent Distilling

upvoted a paper about 14 hours ago

AgentSearchBench: A Benchmark for AI Agent Search in the Wild

reacted to MikeDoes's post with 🚀 1 day ago

🛡️ At Ai4Privacy, our goal is to empower researchers to build a safer AI ecosystem. Today, we're highlighting crucial research that does just that by exposing a new vulnerability. The paper "Forget to Flourish" details a new model poisoning technique. It's a reminder that as we fine-tune LLMs, our anonymization and privacy strategies must evolve to counter increasingly sophisticated threats. We're proud that the Ai4Privacy dataset was instrumental in this study. It served two key purposes: Provided a Realistic Testbed: It gave the researchers access to a diverse set of synthetic and realistic PII samples in a safe, controlled environment. Enabled Impactful Benchmarking: It allowed them to measure the actual effectiveness of their data extraction attack, proving it could compromise specific, high-value information. This work reinforces our belief that progress in AI security is a community effort. By providing robust tools for benchmarking, we can collectively identify weaknesses and build stronger, more resilient systems. A huge congratulations to the authors on this important contribution. 🔗 Read the full paper: https://arxiv.org/html/2408.17354v1 #OpenSource #DataPrivacy #LLM #Anonymization #AIsecurity #HuggingFace #Ai4Privacy #World's largest open privacy masking dataset

View all activity

Organizations

upvoted 2 papers about 14 hours ago

Large Language Models Explore by Latent Distilling

Paper • 2604.24927 • Published 5 days ago • 59

AgentSearchBench: A Benchmark for AI Agent Search in the Wild

Paper • 2604.22436 • Published 8 days ago • 12

upvoted a paper 3 days ago

Prism-Reranker: Beyond Relevance Scoring -- Jointly Producing Contributions and Evidence for Agentic Retrieval

Paper • 2604.23734 • Published 6 days ago • 3

upvoted a collection 3 days ago

Prism Reranker

Collection

5 items • Updated 6 days ago • 9

upvoted a collection 7 days ago

Entreprise PII Masking

Collection

5 items • Updated May 15, 2025 • 3

upvoted an article 7 days ago

Article

DeepSeek-V4: a million-token context that agents can actually use

8 days ago

•

upvoted an article 9 days ago

Article

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

16 days ago

•

upvoted 2 papers 9 days ago

DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation

Paper • 2604.14683 • Published 16 days ago • 35

ClawEnvKit: Automatic Environment Generation for Claw-Like Agents

Paper • 2604.18543 • Published 12 days ago • 27

upvoted a collection 9 days ago

GLiNER-PII

Collection

PII detection models developed in collaboration with Wordcab • 5 items • Updated Jan 29 • 23

upvoted an article 15 days ago

Article

The PR you would have opened yourself

16 days ago

•

upvoted 3 papers 16 days ago

Toward Autonomous Long-Horizon Engineering for ML Research

Paper • 2604.13018 • Published 18 days ago • 34

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 18 days ago • 87

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Paper • 2604.12627 • Published 18 days ago • 99

upvoted 5 papers 18 days ago

AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents

Paper • 2603.27490 • Published Mar 29 • 18

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Paper • 2604.08995 • Published 22 days ago • 48

upvoted a paper 19 days ago

Small Vision-Language Models are Smart Compressors for Long Video Understanding

Paper • 2604.08120 • Published 23 days ago • 20

Gibran Iqbal

AI & ML interests

Recent Activity

Organizations

Jibbscript's activity

DeepSeek-V4: a million-token context that agents can actually use

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

The PR you would have opened yourself