23 13

Хасан Шамилев

xyzzy77

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring

liked a model 3 days ago

ik-ram28/gemma-3-4b-sft-grpo-mod3-with-gt-checkpoint-970

liked a model 6 days ago

bohdan898979/lader

View all activity

Organizations

None yet

upvoted a paper 2 days ago

E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring

Paper • 2605.16882 • Published 9 days ago • 2

liked a model 3 days ago

ik-ram28/gemma-3-4b-sft-grpo-mod3-with-gt-checkpoint-970

Image-Text-to-Text • 4B • Updated 3 days ago • 24 • 1

liked a model 6 days ago

bohdan898979/lader

Updated 6 days ago • 1

upvoted a paper 13 days ago

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 18 days ago • 215

liked a dataset 17 days ago

robbyant/mdm_depth

Updated Apr 17 • 191k • 30

liked a dataset 23 days ago

siril-spcc/gaia

Updated Mar 2 • 330k • 12

liked a model about 1 month ago

HaaDeej/cineleum-controlnet-sdxl

Updated about 1 month ago • 1

liked a dataset about 1 month ago

GaryYang123/zh-meme-sft-8k

Viewer • Updated Apr 20 • 8.68k • 212 • 79

upvoted a paper about 1 month ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 246

liked a dataset about 1 month ago

uonlp/CulturaX

Viewer • Updated Dec 16, 2024 • 7.18B • 34.4k • 625

upvoted a paper about 1 month ago

PokeGym: A Visually-Driven Long-Horizon Benchmark for Vision-Language Models

Paper • 2604.08340 • Published Apr 9 • 8

liked a model about 1 month ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18, 2025 • 171k • • 2.49k

upvoted 2 papers about 1 month ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 326

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Paper • 2604.08546 • Published Apr 9 • 115

upvoted 4 papers about 2 months ago

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published Mar 27 • 364

NearID: Identity Representation Learning via Near-identity Distractors

Paper • 2604.01973 • Published Apr 2 • 32

Self-Distilled RLVR

Paper • 2604.03128 • Published Apr 3 • 176

MultiGen: Level-Design for Editable Multiplayer Worlds in Diffusion Game Engines

Paper • 2603.06679 • Published Mar 30 • 6

liked a model about 2 months ago

mradermacher/love-qwen3.5-27b-GGUF

27B • Updated Apr 3 • 96

liked a dataset about 2 months ago

yahma/alpaca-cleaned

Viewer • Updated Apr 10, 2023 • 51.8k • 31.8k • 827

Хасан Шамилев

AI & ML interests

Recent Activity

Organizations

xyzzy77's activity