thepowerfuldeez committed · Commit 127ee04 · verified · 1 Parent(s): e1cd0bf

Update README.md

Files changed (1)
  1. README.md +5 -1
README.md CHANGED
@@ -9,11 +9,15 @@ tags:
  - transformer
  library_name: transformers
  pipeline_tag: text-generation
+ arxiv: 2602.02522
  ---

  # IMU-1 Base

- A sample-efficient 430M parameter language model trained on 72B tokens that approaches the benchmark performance of models trained on 56× more data.
+ This repository contains the IMU-1 Base model, a sample-efficient 430M parameter language model introduced in the paper [IMU-1: Sample-Efficient Pre-training of Small Language Models](https://huggingface.co/papers/2602.02522).
+
+ IMU-1 is trained on 72B tokens and approaches the benchmark performance of models trained on 56× more data.
+

  ## Model Details
