Collection of Quantized Models for MoE
Krishna Teja Chitty-Venkata
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
updated a model about 8 hours ago: krishnateja95/Qwen3-8B-FP8-Dflash
published a model about 8 hours ago: krishnateja95/Qwen3-8B-FP8-Dflash
updated a model 1 day ago: krishnateja95/Qwen3-8B-Dflash