hey, I'm doing some experimenting, looping around :slight_smile: --- **kompress-v6** *shipped* — trained on Claude Code agent patterns (bash output, file reads, stack traces, search results, JSON tool responses). 3k synthetic pairs + 2k existing, fine-tuned from v4, $0.20 on vast.ai.
Results: heretic exact_pct 0.962 (v4: 0.967), keep_rate 0.854 (v4: 0.823), override delta 0. Model got more conservative — higher keep_rate on structured technical content. Real proxy: v4 compressed 9.5%, v6 compressed 4.2% on the same session. Less aggressive, fewer must-keep tokens dropped on paths and identifiers.
Interesting failure: self-labeling with v4+override collapsed mk_in_ref to 0.652. TokenExpiredError splits into Token+Expired+Error — subtokens that don't individually match the must-keep regex, so the force-keep never fires. Generator references (mk_in_ref=1.0 by construction) ended up being better labels than v4's compressed output for agent data. Fix for next run: slide a 2-3 subtoken window instead of checking individual subtokens. Would let self-labeling work on agent content and potentially produce a more compression-aggressive v7.