Update README.md
`abs-bvv-3` is a 1.7-billion-parameter decoder-only Transformer model. It is the third model in the **Progressive Growth Transformers (PGT)** series, designed to explore how linguistic and reasoning capabilities emerge as a function of model depth.
This model was not trained monolithically. Instead, it was "grown" constructively, one layer at a time, upon a foundation of **frozen, non-semantic visual embeddings**, as introduced in the paper "[Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations](https://arxiv.org/abs/2507.04886)".
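The constructive growth described above can be sketched in a few lines of PyTorch. This is a hypothetical illustration, not the actual PGT training code: the dimensions, class name, and `grow()` method are invented for clarity, the token embeddings are simply frozen at random initialization rather than derived from visual Unicode representations, and `nn.TransformerEncoderLayer` stands in for a proper causally-masked decoder block.

```python
import torch.nn as nn


class ProgressiveTransformer(nn.Module):
    """Toy sketch of layer-by-layer constructive growth (assumed setup,
    not the published PGT implementation)."""

    def __init__(self, vocab_size=256, d_model=64, n_heads=4):
        super().__init__()
        # Frozen, non-semantic embeddings: fixed once, never updated.
        self.embed = nn.Embedding(vocab_size, d_model)
        self.embed.weight.requires_grad = False
        self.layers = nn.ModuleList()  # grows one layer per stage
        self.head = nn.Linear(d_model, vocab_size)
        self.d_model, self.n_heads = d_model, n_heads

    def grow(self):
        """Freeze all existing layers, then append one new trainable layer."""
        for p in self.layers.parameters():
            p.requires_grad = False
        self.layers.append(
            nn.TransformerEncoderLayer(self.d_model, self.n_heads,
                                       batch_first=True)
        )

    def forward(self, ids):
        x = self.embed(ids)
        for layer in self.layers:
            x = layer(x)  # causal masking omitted in this sketch
        return self.head(x)


model = ProgressiveTransformer()
model.grow()  # stage 1: one trainable layer on frozen embeddings
model.grow()  # stage 2: layer 1 is now frozen, layer 2 trains
```

At each stage only the newest layer receives gradients, which is what makes the approach resource-efficient: every training step touches a single layer's parameters rather than the whole stack.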
The core idea is to demonstrate an alternative, more modular and resource-efficient paradigm for building LLMs. The PGT series shows that:
1. Semantic understanding can emerge without trainable embeddings.