Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
coolpoodle
/
Qwen3-0.6B-Looped
like
5
PyTorch
English
qwen3
loop-attention
causal-lm
custom_code
License:
mit
Model card
Files
Files and versions
xet
Community
1
main
Qwen3-0.6B-Looped
10.6 GB
1 contributor
History:
10 commits
coolpoodle
Update README.md
1bdc454
verified
10 days ago
__pycache__
Upload folder using huggingface_hub
11 days ago
training&checkpoints
Upload folder using huggingface_hub
11 days ago
.gitattributes
1.57 kB
Upload folder using huggingface_hub
11 days ago
Qwen3-0.6B-Looped-Run2-Final.bin
2.38 GB
xet
Upload folder using huggingface_hub
11 days ago
README.md
4.25 kB
Update README.md
10 days ago
added_tokens.json
707 Bytes
Upload folder using huggingface_hub
11 days ago
baseline_eval.py
4.84 kB
More functionality!
11 days ago
chat_template.jinja
4.17 kB
Upload folder using huggingface_hub
11 days ago
config.json
1.51 kB
Upload folder using huggingface_hub
11 days ago
gate_projections.pt
249 kB
xet
Initial upload
11 days ago
loop_config.json
274 Bytes
Initial upload
11 days ago
merges.txt
1.67 MB
Upload folder using huggingface_hub
11 days ago
modeling_qwen_loop.py
17.5 kB
Update modeling_qwen_loop.py
11 days ago
pytorch_model.bin.index.json
30.1 kB
Upload folder using huggingface_hub
11 days ago
special_tokens_map.json
613 Bytes
Upload folder using huggingface_hub
11 days ago
test_loop_generation.py
1.8 kB
Uploaded Training / Testing File / Eval
11 days ago
tokenizer.json
11.4 MB
xet
Upload folder using huggingface_hub
11 days ago
tokenizer_config.json
5.4 kB
Upload folder using huggingface_hub
11 days ago
train.py
5.99 kB
Uploaded Training / Testing File / Eval
11 days ago
vocab.json
2.78 MB
Upload folder using huggingface_hub
11 days ago