Article 1 Emergent Semantics Beyond Token Embeddings: A GPT-like Transformer Learns with Frozen 16‑D Binary Token-ID Embeddings (n_embed=16)
Language Models Without a Trainable Input Embedding Table This collection is provided for reproducibility of the paper's main claim Bochkov/llm-fix-min-baseline-learned-input-table-model-classic Text Generation • 0.5B • Updated 3 days ago • 35 Bochkov/llm-fix-min-fixed-minimal-binary-code Text Generation • 0.5B • Updated 3 days ago • 33 Bochkov/llm-fix-min-affine-recoded-minimal-code-table-free Text Generation • 0.5B • Updated 3 days ago • 28
Bochkov/llm-fix-min-baseline-learned-input-table-model-classic Text Generation • 0.5B • Updated 3 days ago • 35
Bochkov/llm-fix-min-affine-recoded-minimal-code-table-free Text Generation • 0.5B • Updated 3 days ago • 28
Emergent Semantics Beyond Token Embeddings Paper: 2507.04886 (TMLR, Oct 2025). 'Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations' Bochkov/emergent-semantics-model-uni-glyph-335m Text Generation • Updated Jan 7 • 8 Bochkov/emergent-semantics-model-unfrozen-335m Text Generation • Updated Jan 7 • 5 Bochkov/emergent-semantics-model-16-bit-269m Text Generation • Updated Jan 7 • 10 • 1 Bochkov/emergent-semantics-model-64-bit-272m Text Generation • Updated Jan 7 • 4
Language Models Without a Trainable Input Embedding Table This collection is provided for reproducibility of the paper's main claim Bochkov/llm-fix-min-baseline-learned-input-table-model-classic Text Generation • 0.5B • Updated 3 days ago • 35 Bochkov/llm-fix-min-fixed-minimal-binary-code Text Generation • 0.5B • Updated 3 days ago • 33 Bochkov/llm-fix-min-affine-recoded-minimal-code-table-free Text Generation • 0.5B • Updated 3 days ago • 28
Bochkov/llm-fix-min-baseline-learned-input-table-model-classic Text Generation • 0.5B • Updated 3 days ago • 35
Bochkov/llm-fix-min-affine-recoded-minimal-code-table-free Text Generation • 0.5B • Updated 3 days ago • 28
Emergent Semantics Beyond Token Embeddings Paper: 2507.04886 (TMLR, Oct 2025). 'Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations' Bochkov/emergent-semantics-model-uni-glyph-335m Text Generation • Updated Jan 7 • 8 Bochkov/emergent-semantics-model-unfrozen-335m Text Generation • Updated Jan 7 • 5 Bochkov/emergent-semantics-model-16-bit-269m Text Generation • Updated Jan 7 • 10 • 1 Bochkov/emergent-semantics-model-64-bit-272m Text Generation • Updated Jan 7 • 4
Bochkov/llm-fix-min-affine-recoded-minimal-code-table-free Text Generation • 0.5B • Updated 3 days ago • 28
Bochkov/llm-fix-min-baseline-learned-input-table-model-classic Text Generation • 0.5B • Updated 3 days ago • 35
Bochkov/growing-transformers-model-frozen-16-bit-baseline-monolyth-181m Text Generation • Updated Jan 9 • 33
Bochkov/growing-transformers-model-unfrozen-baseline-monolyth-247m Text Generation • Updated Jan 9 • 4
Bochkov/growing-transformers-model-frozen-unicode-baseline-monolyth-247m Text Generation • Updated Jan 9 • 1