IvmeLabs / Ivme-Conversate-22M-Base
Text Generation
5 datasets
English
language-model
transformer
rope
swiglu
gqa
muon
from-scratch
tiny
small
decoder-only
License: apache-2.0
main
Ivme-Conversate-22M-Base
Commit History
Update README.md
97dd11c
verified
ereniko
committed on
3 days ago
Update README.md
4155521
verified
ereniko
committed on
3 days ago
Update README.md
848dd53
verified
ereniko
committed on
3 days ago
Upload canonical_results.json with huggingface_hub
d93e05d
verified
ereniko
committed on
3 days ago
Upload blimp_results.json with huggingface_hub
76ac3e8
verified
ereniko
committed on
3 days ago
Upload eval_wikitext.py with huggingface_hub
7e5763c
verified
ereniko
committed on
3 days ago
Upload eval_blimp.py with huggingface_hub
337273e
verified
ereniko
committed on
3 days ago
Upload eval.py with huggingface_hub
44217ec
verified
ereniko
committed on
3 days ago
Upload prepare_data.py with huggingface_hub
e82a88e
verified
ereniko
committed on
3 days ago
Upload tokenizer.py with huggingface_hub
f0169be
verified
ereniko
committed on
3 days ago
Upload train.py with huggingface_hub
edfd803
verified
ereniko
committed on
3 days ago
Upload muon.py with huggingface_hub
0d72706
verified
ereniko
committed on
3 days ago
Upload model.py with huggingface_hub
b792941
verified
ereniko
committed on
3 days ago
Upload ivme_tokenizer.json with huggingface_hub
6156088
verified
ereniko
committed on
3 days ago
Upload ivme_base_ema.pt with huggingface_hub
3657044
verified
ereniko
committed on
3 days ago
initial commit
7628e0a
verified
ereniko
committed on
3 days ago