Update README.md
```
emb_path = hf_hub_download(
    filename="normalized_embeddings_weights.pt"
)

embeddings = torch.load(emb_path)
```

## 🧑‍🔬 Citation & Concept

If you use this model or the underlying concepts in your research, please cite our work:

```
@misc{bochkov2025emergentsemanticstokenembeddings,
      title={Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations},
      author={A. Bochkov},
      year={2025},
      eprint={2507.04886},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2507.04886},
}
```

This work demonstrates that transformer blocks, not token embeddings, carry the semantic burden in LLMs, a step toward modular, fusable, multilingual LMs.
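The checkpoint name `normalized_embeddings_weights.pt` suggests the frozen embedding rows are stored at unit L2 norm. The sketch below illustrates that property on a toy random matrix with NumPy; the actual layout of the checkpoint is an assumption here, not something this README states.

```python
import numpy as np

# Toy illustration (assumption): the filename
# "normalized_embeddings_weights.pt" suggests each embedding row is
# stored with unit L2 norm. Demonstrated on a random (vocab x dim) matrix.
rng = np.random.default_rng(0)
emb = rng.normal(size=(8, 4))                       # toy embedding table
norms = np.linalg.norm(emb, axis=1, keepdims=True)  # per-row lengths
normalized = emb / norms                            # rows rescaled to length 1

# Every row of the normalized table now has L2 norm 1.
assert np.allclose(np.linalg.norm(normalized, axis=1), 1.0)
```

If the assumption holds, the same check can be run on the real `embeddings` tensor after `torch.load` (e.g. via `embeddings.norm(dim=1)`).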