Pankaj Pandey
pankajpandey-dev
AI & ML interests
Natural Language Processing, Text Generation, Large Language Models, Quantization, Fine-Tuning, RLHF, Model Merging
Recent Activity
upvoted a collection about 2 hours ago
GGUF Quantizations upvoted a collection about 2 hours ago
๐ฎ๐ณ Hindi LLM Series repliedto their post about 5 hours ago
๐ฎ๐ณ Qwen3-4B Hindi Instruct v2 โ a Hindi LLM that runs on your own machine
Most strong Hindi-capable models are either huge or cloud-only. I wanted one that's small enough to run locally but actually follows instructions in Hindi โ so I fine-tuned Qwen3-4B on 10K Hindi instruction pairs and shipped it with a full GGUF quant ladder.
โ
Fine-tune (16-bit): huggingface.co/pankajpandey-dev/Qwen3-4B-Hindi-Instruct-v2
โ
GGUF (Q4/Q5/Q8): huggingface.co/pankajpandey-dev/Qwen3-4B-Hindi-Instruct-v2-GGUF
Runs in Ollama, llama.cpp, and LM Studio. The Q4_K_M is just 2.5 GB โ fits comfortably on a laptop, CPU or GPU.
Part of my Hindi LLM Series โ building openly-licensed Indic models for local and edge use. More coming (Gemma next). Feedback welcome ๐
#Hindi #IndicNLP #GGUF #LocalLLM #Qwen