Hugging Face
reaperdoesntknow/Dualmind-Qwen-1.7B-Thinking

Text Generation
Transformers
TensorBoard
Safetensors
English
qwen3
sft
trl
dualmind
knowledge-distillation
thinking
opus
self-critique
convergent-intelligence
convergentintel
edge
distillation
conversational
text-generation-inference
Dualmind-Qwen-1.7B-Thinking
4.08 GB
  • 1 contributor
History: 11 commits
reaperdoesntknow
Polish model card: portfolio section, freshness stamp
41386d0 verified about 7 hours ago
  • .gitattributes
    1.57 kB
    Upload tokenizer 1 day ago
  • README.md
    10.7 kB
    Polish model card: portfolio section, freshness stamp about 7 hours ago
  • chat_template.jinja
    4.17 kB
    Upload tokenizer 1 day ago
  • config.json
    1.42 kB
    Upload Qwen3ForCausalLM 1 day ago
  • events.out.tfevents.1774855351.0e755ff15ec0.1023.2
    202 kB
    xet
    Upload 2 files 1 day ago
  • events.out.tfevents.1774858526.0e755ff15ec0.15561.0
    54.4 kB
    xet
    Upload events.out.tfevents.1774858526.0e755ff15ec0.15561.0 1 day ago
  • generation_config.json
    187 Bytes
    Upload Qwen3ForCausalLM 1 day ago
  • model.safetensors
    4.06 GB
    xet
    Upload Qwen3ForCausalLM 1 day ago
  • tokenizer.json
    11.4 MB
    xet
    Upload tokenizer 1 day ago
  • tokenizer_config.json
    664 Bytes
    Upload tokenizer 1 day ago
  • trainer_state.json
    150 kB
    Upload 2 files 1 day ago