Sem Karaman
Quantization model context length issue in LM Studio
So far my top choice
Wait, I should confirm if get_weather needs a location.
If I don't know the location, I can't provide it if it's a required parameter.
However, if I can't ask the user (because I'm in the middle of execution), I'll call it and see.
If it returns an error asking for location, I'll reply to the user asking for it.
But usually, these bots have a default location or use IP.
I'll proceed.

That trace is both syntactically and grammatically correct, and logically coherent. Furthermore, other models can tool call down to Q2 quants and below. So I don't think it's a quant-level issue. The problem is training: this model is better trained on natural-language grammar and logic than it is on tool calling.
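For reference, the kind of tool definition the trace above is reasoning about looks roughly like this. The schema follows the common OpenAI-style function-calling format (an assumption about the setup); `get_weather` and its required `location` parameter come from the trace, everything else is illustrative:

```python
# Hypothetical tool schema in the common OpenAI-style function-calling
# format; "get_weather" and the required "location" parameter come from
# the reasoning trace above, everything else is made up for illustration.
get_weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a location.",
        "parameters": {
            "type": "object",
            "properties": {
                "location": {
                    "type": "string",
                    "description": "City name, e.g. 'Berlin'",
                },
            },
            # This is the field the model was agonizing over: a call
            # without "location" is invalid against this schema.
            "required": ["location"],
        },
    },
}

# A model that handles this well either asks the user for the location
# up front or emits arguments that satisfy the "required" list.
print(get_weather_tool["function"]["parameters"]["required"])
```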
The thing is, though, this is completely new with 3.6 35B. Qwen has been the orchestrator in my stack for a very long time now, and I've gone through many iterations of Qwen; this behavior only showed up with 3.6 35B.
Going off what JoeSmith said, it might just be a training issue with this particular iteration of Qwen. If you look at the official benchmarks posted by the Qwen team, 3.6 is almost always under 3.5, or only a few points above it, so perhaps the hype around 3.6 is overstated.
Me too, if I could fit it on my GPU. But I highly doubt it's the quantization: 3.5 35B and all sorts of merges of it work perfectly fine at pretty much any quant.
I see. In that case, if you've already tried sampling tweaks (repeat penalty 1.05-1.1) and the quant seems fine, then the only other fix that has sometimes worked for me is a direct system prompt telling it to 'not overthink' and to 'allow knock-on errors after the first solution'.
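In case it helps, here's a minimal sketch of that 'do not overthink' workaround as a chat payload for an OpenAI-compatible local server. The model id is a placeholder, and the exact prompt wording is just an example, not a tested recipe:

```python
import json

# Sketch of the system-prompt workaround described above.
# The model id is a placeholder, not a real LM Studio identifier,
# and the system prompt wording is only an example.
payload = {
    "model": "qwen-3.6-35b",
    "messages": [
        {
            "role": "system",
            "content": (
                "Do not overthink. Commit to the first workable solution "
                "and allow knock-on errors after it."
            ),
        },
        {"role": "user", "content": "Plan my weekend trip."},
    ],
}

print(json.dumps(payload, indent=2))
```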
Also, I think it's a given that you should be running 0.2 temperature for 'precise' responses.
I would try Q8 as a last resort.
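Pulling the suggestions from the last few posts together, the sampler side would look something like this. The field names follow llama.cpp-style conventions (an assumption; your server or LM Studio build may name them differently):

```python
# Sketch of the sampler settings suggested in this thread, using
# llama.cpp-style field names (an assumption about your backend).
sampler_settings = {
    "temperature": 0.2,      # low temperature for 'precise' responses
    "repeat_penalty": 1.05,  # suggested range in the thread is 1.05-1.1
}

# Note: Q8 is not a sampler setting; it's the quantization level of the
# model file itself, chosen when you download the GGUF.
print(sampler_settings)
```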
I see; this is usually because you're using Q4 quants locally. Try going up to Q5 or Q8, and reduce GPU layers and context length to fit into your VRAM.
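When deciding whether a higher quant will fit, a back-of-the-envelope size estimate helps. The bits-per-weight figures below are rough approximations for common GGUF quants (assumptions, not exact values), and the estimate ignores the KV cache and runtime overhead:

```python
# Rough VRAM sizing sketch. The effective bits-per-weight values for
# common GGUF quants are approximations, not exact figures, and the
# estimate covers weights only (no KV cache or runtime overhead).
BITS_PER_WEIGHT = {"Q4_K_M": 4.8, "Q5_K_M": 5.7, "Q8_0": 8.5}

def approx_model_gb(params_billion: float, quant: str) -> float:
    """Approximate GGUF file size in GB for a given parameter count."""
    bits = BITS_PER_WEIGHT[quant]
    return params_billion * 1e9 * bits / 8 / 1e9

for quant in BITS_PER_WEIGHT:
    print(quant, round(approx_model_gb(35, quant), 1), "GB")
```

At 35B parameters this puts Q8 well past a single 24 GB card, which is why dropping GPU layers or context is usually needed alongside a higher quant.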
Oh, 3.6 35B is a literal never-ending reasoning loop for me. Like, 3 out of 6 times it's a kill-the-server type of deal.
Post your sampling setup.