PARTAGES-dev's Collections

Qwen3+PDAPT+SLERP

updated 19 days ago

Experiments conducted for the LREC paper ()

  • PARTAGES-dev/Qwen3-8B-PDAPT-SLERP

    Text Generation • 8B • Updated Dec 3, 2025 • 151

  • PARTAGES-dev/Qwen3-4B-PDAPT-SLERP

    Text Generation • 4B • Updated Dec 3, 2025 • 42

  • Qwen/Qwen3-8B-Base

    Text Generation • 8B • Updated May 21, 2025 • 1.58M • 95

  • Qwen/Qwen3-4B-Base

    Text Generation • 4B • Updated Jul 26, 2025 • 962k • 83

  • Qwen/Qwen3-1.7B-Base

    Text Generation • 2B • Updated Jul 26, 2025 • 388k • 67

  • Qwen/Qwen3-0.6B-Base

    Text Generation • Updated Jul 26, 2025 • 277k • 157

  • PARTAGES-dev/Qwen3-1.7B-PDAPT-SLERP

    Text Generation • 2B • Updated Feb 25 • 15

  • PARTAGES-dev/Qwen3-0.6B-PDAPT-SLERP

    Text Generation • 0.8B • Updated Dec 4, 2025 • 25