Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

Steven668866
/
CoVT-Phase2-3expert-Full

TensorBoard
Safetensors
English
covt
qwen2.5-vl
reproduction
stage1
Model card Files Files and versions
xet
Metrics Training metrics Community
CoVT-Phase2-3expert-Full
16.6 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 49 commits
Steven668866's picture
Steven668866
Add README explaining 2026-06-29 cleanup: removed buggy non-strict ckpts
bc23af3 verified 6 days ago
  • runs
    tensorboard logs 7 days ago
  • scripts
    training scripts (sft_phase2.sh, deepspeed config, env) 7 days ago
  • stage1_merged
    stage1_merged: Phase1 LoRA merged onto Qwen2.5-VL-7B base (3-expert) 7 days ago
  • training_src
    training source (data.py, train.py, ResumeDatasetCallback) 7 days ago
  • .gitattributes
    1.76 kB
    non-strict ckpt-10000 tokenizer.json 7 days ago
  • README.md
    2.55 kB
    Add README explaining 2026-06-29 cleanup: removed buggy non-strict ckpts 6 days ago
  • phase2.log
    6.63 MB
    phase2 stdout log 7 days ago
  • sft_phase2.log
    6.63 MB
    phase2 trainer log 7 days ago