wassemgtk
/

jepa_llm_prototypes

Model card Files Files and versions

wassemgtk commited on Jan 24

Commit

62fabf8

·

verified ·

1 Parent(s): 4452d64

Update README.md

Files changed (1) hide show

README.md +7 -2

README.md CHANGED Viewed

@@ -1,10 +1,13 @@
 ---
 license: mit
 ---
 # JEPA-Style LLM Prototypes
 Making decoder-only transformers predict state consequences instead of tokens.
 ## What's This?
 Three approaches to convert a standard LLM into a world model that predicts "what happens next" given a state and action — like JEPA but for language models.
@@ -20,7 +23,7 @@ Three approaches to convert a standard LLM into a world model that predicts "wha
 ## Quick Start
 1. Open any notebook in [Google Colab](https://colab.research.google.com/)
-2. Set runtime to **GPU** (Runtime → Change runtime type → H100)
 3. Run all cells
 4. Watch the model learn to predict state transitions
@@ -80,4 +83,6 @@ All dependencies install automatically in the notebooks.
 ---
-*Experimental code — have fun breaking it.*

 ---
 license: mit
 ---
 # JEPA-Style LLM Prototypes
 Making decoder-only transformers predict state consequences instead of tokens.
+🔗 **[View on Hugging Face](https://huggingface.co/wassemgtk/jepa_llm_prototypes)**
 ## What's This?
 Three approaches to convert a standard LLM into a world model that predicts "what happens next" given a state and action — like JEPA but for language models.
 ## Quick Start
 1. Open any notebook in [Google Colab](https://colab.research.google.com/)
+2. Set runtime to **GPU** (Runtime → Change runtime type → T4)
 3. Run all cells
 4. Watch the model learn to predict state transitions
 ---
+*Experimental code — have fun breaking it.*
+**Coauthors:** Writer Agent & OpenCode