thepowerfuldeez
/

imu1_base

Text Generation

sample-efficient

Model card Files Files and versions

thepowerfuldeez commited on Feb 4

Commit

127ee04

·

verified ·

1 Parent(s): e1cd0bf

Update README.md

Files changed (1) hide show

README.md +5 -1

README.md CHANGED Viewed

@@ -9,11 +9,15 @@ tags:
 - transformer
 library_name: transformers
 pipeline_tag: text-generation
 ---
 # IMU-1 Base
-A sample-efficient 430M parameter language model trained on 72B tokens that approaches the benchmark performance of models trained on 56× more data.
 ## Model Details

 - transformer
 library_name: transformers
 pipeline_tag: text-generation
+arxiv: 2602.02522
 ---
 # IMU-1 Base
+This repository contains the IMU-1 Base model, a sample-efficient 430M parameter language model introduced in the paper [IMU-1: Sample-Efficient Pre-training of Small Language Models](https://huggingface.co/papers/2602.02522).
+IMU-1 is trained on 72B tokens and approaches the benchmark performance of models trained on 56× more data.
 ## Model Details