Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Yewei-Liu
/
SHINE-ift_mqa
like
0
Text Generation
Transformers
arxiv:
2602.06358
License:
mit
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
refs/pr/1
SHINE-ift_mqa
3.55 GB
1 contributor
History:
3 commits
nielsr
HF Staff
Add model card and link to paper
de2ec9a
verified
12 days ago
.gitattributes
1.52 kB
initial commit
16 days ago
README.md
1.81 kB
Add model card and link to paper
12 days ago
mem_tokens.pt
2.43 MB
xet
Upload folder using huggingface_hub
16 days ago
metalora.pth
1.4 GB
xet
Upload folder using huggingface_hub
16 days ago
metanetwork.pth
2.15 GB
xet
Upload folder using huggingface_hub
16 days ago
trainer_state.json
26 Bytes
Upload folder using huggingface_hub
16 days ago
trainer_state.pt
16.1 kB
xet
Upload folder using huggingface_hub
16 days ago