🧬 Carbon Collection Carbon 500M, 3B, 8B genomic models and GGUF variants for llama.cpp • 6 items • Updated 5 days ago • 34
view article Article Two Years of Local AI on a Laptop: When Open Models Outpaced Moore's Law mishig • 14 days ago • 23
view article Article Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality ibm-granite • 11 days ago • 30
view article Article Unlocking asynchronicity in continuous batching +1 ror, pcuenq, ariG23498 • 12 days ago • 55
ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning Paper • 2604.19254 • Published Apr 21 • 30
view article Article Community Evals: Because we're done trusting black-box leaderboards over the community +5 burtenshaw, SaylorTwift, kramp, merve, davanstrien, nielsr, julien-c • Feb 4 • 90
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 900
view article Article How I contributed a new model to the Transformers library using Codex nielsr • Mar 30 • 52
ObjectClear: Complete Object Removal via Object-Effect Attention Paper • 2505.22636 • Published May 28, 2025 • 5
MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator Paper • 2512.11782 • Published Dec 12, 2025 • 3
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 507
view article Article How to Use Multiple GPUs in Hugging Face Transformers: Device Map vs Tensor Parallelism ariG23498 • Feb 12 • 20