Article 1 Emergent Semantics Beyond Token Embeddings: A GPT-like Transformer Learns with Frozen 16‑D Binary Token-ID Embeddings (n_embed=16)
Language Models Without a Trainable Input Embedding Table This collection is provided for reproducibility of the paper's main claim Bochkov/llm-fix-min-baseline-learned-input-table-model-classic Text Generation • 0.5B • Updated 2 days ago • 34 Bochkov/llm-fix-min-fixed-minimal-binary-code Text Generation • 0.5B • Updated 2 days ago • 32 Bochkov/llm-fix-min-affine-recoded-minimal-code-table-free Text Generation • 0.5B • Updated 2 days ago • 26
Bochkov/llm-fix-min-baseline-learned-input-table-model-classic Text Generation • 0.5B • Updated 2 days ago • 34
Bochkov/llm-fix-min-affine-recoded-minimal-code-table-free Text Generation • 0.5B • Updated 2 days ago • 26
Emergent Semantics Beyond Token Embeddings Paper: 2507.04886 (TMLR, Oct 2025). 'Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations' Bochkov/emergent-semantics-model-uni-glyph-335m Text Generation • Updated Jan 7 • 8 Bochkov/emergent-semantics-model-unfrozen-335m Text Generation • Updated Jan 7 • 6 Bochkov/emergent-semantics-model-16-bit-269m Text Generation • Updated Jan 7 • 10 • 1 Bochkov/emergent-semantics-model-64-bit-272m Text Generation • Updated Jan 7 • 4
Language Models Without a Trainable Input Embedding Table This collection is provided for reproducibility of the paper's main claim Bochkov/llm-fix-min-baseline-learned-input-table-model-classic Text Generation • 0.5B • Updated 2 days ago • 34 Bochkov/llm-fix-min-fixed-minimal-binary-code Text Generation • 0.5B • Updated 2 days ago • 32 Bochkov/llm-fix-min-affine-recoded-minimal-code-table-free Text Generation • 0.5B • Updated 2 days ago • 26
Bochkov/llm-fix-min-baseline-learned-input-table-model-classic Text Generation • 0.5B • Updated 2 days ago • 34
Bochkov/llm-fix-min-affine-recoded-minimal-code-table-free Text Generation • 0.5B • Updated 2 days ago • 26
Emergent Semantics Beyond Token Embeddings Paper: 2507.04886 (TMLR, Oct 2025). 'Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations' Bochkov/emergent-semantics-model-uni-glyph-335m Text Generation • Updated Jan 7 • 8 Bochkov/emergent-semantics-model-unfrozen-335m Text Generation • Updated Jan 7 • 6 Bochkov/emergent-semantics-model-16-bit-269m Text Generation • Updated Jan 7 • 10 • 1 Bochkov/emergent-semantics-model-64-bit-272m Text Generation • Updated Jan 7 • 4
Bochkov/llm-fix-min-affine-recoded-minimal-code-table-free Text Generation • 0.5B • Updated 2 days ago • 26
Bochkov/llm-fix-min-baseline-learned-input-table-model-classic Text Generation • 0.5B • Updated 2 days ago • 34
Bochkov/growing-transformers-model-frozen-16-bit-baseline-monolyth-181m Text Generation • Updated Jan 9 • 33
Bochkov/growing-transformers-model-unfrozen-baseline-monolyth-247m Text Generation • Updated Jan 9 • 5
Bochkov/growing-transformers-model-frozen-unicode-baseline-monolyth-247m Text Generation • Updated Jan 9 • 1