Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Steven668866
/
CoVT-Phase2-3expert-Full
like
0
TensorBoard
Safetensors
English
covt
qwen2.5-vl
reproduction
stage1
License:
apache-2.0
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Copy to bucket
new
main
CoVT-Phase2-3expert-Full
16.6 GB
Ctrl+K
Ctrl+K
1 contributor
History:
49 commits
Steven668866
Add README explaining 2026-06-29 cleanup: removed buggy non-strict ckpts
bc23af3
verified
6 days ago
runs
tensorboard logs
7 days ago
scripts
training scripts (sft_phase2.sh, deepspeed config, env)
7 days ago
stage1_merged
stage1_merged: Phase1 LoRA merged onto Qwen2.5-VL-7B base (3-expert)
7 days ago
training_src
training source (data.py, train.py, ResumeDatasetCallback)
7 days ago
.gitattributes
Safe
1.76 kB
non-strict ckpt-10000 tokenizer.json
7 days ago
README.md
2.55 kB
Add README explaining 2026-06-29 cleanup: removed buggy non-strict ckpts
6 days ago
phase2.log
Safe
6.63 MB
phase2 stdout log
7 days ago
sft_phase2.log
Safe
6.63 MB
phase2 trainer log
7 days ago