Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
IvmeLabs
/
Ivme-Conversate-22M-Base
like
1
Follow
İvmeLabs
1
Text Generation
5 datasets
English
language-model
transformer
rope
swiglu
gqa
muon
from-scratch
tiny
small
decoder-only
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
d93e05d
Ivme-Conversate-22M-Base
89.3 MB
Ctrl+K
Ctrl+K
1 contributor
History:
13 commits
ereniko
Upload canonical_results.json with huggingface_hub
d93e05d
verified
3 days ago
.gitattributes
Safe
1.52 kB
initial commit
3 days ago
README.md
Safe
28 Bytes
initial commit
3 days ago
blimp_results.json
2.81 kB
Upload blimp_results.json with huggingface_hub
3 days ago
canonical_results.json
623 Bytes
Upload canonical_results.json with huggingface_hub
3 days ago
eval.py
9.55 kB
Upload eval.py with huggingface_hub
3 days ago
eval_blimp.py
6.63 kB
Upload eval_blimp.py with huggingface_hub
3 days ago
eval_wikitext.py
3.27 kB
Upload eval_wikitext.py with huggingface_hub
3 days ago
ivme_base_ema.pt
Safe
88.1 MB
xet
Upload ivme_base_ema.pt with huggingface_hub
3 days ago
ivme_tokenizer.json
Safe
1.14 MB
Upload ivme_tokenizer.json with huggingface_hub
3 days ago
model.py
12.7 kB
Upload model.py with huggingface_hub
3 days ago
muon.py
6.08 kB
Upload muon.py with huggingface_hub
3 days ago
prepare_data.py
5.18 kB
Upload prepare_data.py with huggingface_hub
3 days ago
tokenizer.py
5.02 kB
Upload tokenizer.py with huggingface_hub
3 days ago
train.py
10.6 kB
Upload train.py with huggingface_hub
3 days ago