Feature Extraction · Transformers · gpt2
Bochkov committed · verified · Commit 2fe5112 · 1 Parent(s): 6de96d0

Update README.md

Files changed (1): README.md (+10 −0)
README.md CHANGED
@@ -64,6 +64,16 @@ If you use this model or the underlying concepts in your research, please cite o
   primaryClass={cs.CL},
   url={https://arxiv.org/abs/2507.04886},
 }
+
+@misc{bochkov2025growingtransformersmodularcomposition,
+  title={Growing Transformers: Modular Composition and Layer-wise Expansion on a Frozen Substrate},
+  author={A. Bochkov},
+  year={2025},
+  eprint={2507.07129},
+  archivePrefix={arXiv},
+  primaryClass={cs.LG},
+  url={https://arxiv.org/abs/2507.07129},
+}
 ```
 
 This work demonstrates that transformer blocks, not token embeddings, carry the semantic burden in LLMs — a step toward modular, fusable, multilingual LMs.
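Since the page tags this checkpoint for feature extraction with the Transformers library, a minimal usage sketch may help. This is an assumption-laden example, not the author's documented API: the repo id `"gpt2"` is used as a stand-in checkpoint name taken from the page breadcrumb, and the input sentence is arbitrary.

```python
# Minimal sketch of feature extraction with Hugging Face Transformers.
# Assumption: the checkpoint loads with AutoModel/AutoTokenizer; "gpt2"
# is a stand-in repo id from this page's breadcrumb.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModel.from_pretrained("gpt2")

inputs = tokenizer(
    "Transformer blocks, not token embeddings, carry the semantics.",
    return_tensors="pt",
)
with torch.no_grad():
    # last_hidden_state holds one feature vector per token:
    # shape (batch, seq_len, hidden_dim)
    hidden = model(**inputs).last_hidden_state
print(hidden.shape)
```

The per-token vectors in `hidden` can then be pooled (e.g. mean over the sequence dimension) to get a single sentence-level feature vector.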