Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Zanette-Labs 's Collections
efficient-reasoning

efficient-reasoning

updated Apr 13, 2025

Checkpoints for models trained in https://arxiv.org/abs/2502.04463

Upvote
1

  • daman1209arora/alpha_0.4_DeepSeek-R1-Distill-Qwen-7B

    Text Generation • 8B • Updated Apr 13, 2025 • 3

  • daman1209arora/alpha_0.05_DeepSeek-R1-Distill-Qwen-1.5B

    Text Generation • 2B • Updated Apr 13, 2025 • 5

  • daman1209arora/alpha_0.05_DeepSeek-R1-Distill-Qwen-7B

    Text Generation • 8B • Updated Apr 13, 2025 • 3

  • daman1209arora/alpha_0.2_DeepSeek-R1-Distill-Qwen-1.5B

    Text Generation • 2B • Updated Apr 13, 2025 • 3

  • daman1209arora/alpha_0.1_DeepSeek-R1-Distill-Qwen-7B

    Text Generation • 8B • Updated Apr 13, 2025 • 6

  • daman1209arora/alpha_0.1_DeepSeek-R1-Distill-Qwen-1.5B

    Text Generation • 2B • Updated Apr 13, 2025 • 22 •

  • daman1209arora/alpha_0.2_DeepSeek-R1-Distill-Qwen-7B

    Text Generation • 8B • Updated May 13 • 44

  • daman1209arora/alpha_0.4_DeepSeek-R1-Distill-Qwen-1.5B

    Text Generation • 2B • Updated Apr 13, 2025 • 12 •

  • daman1209arora/alpha_0_DeepSeek-R1-Distill-Qwen-1.5B

    Text Generation • 2B • Updated Apr 13, 2025 • 16 •

  • daman1209arora/alpha_0_DeepSeek-R1-Distill-Qwen-7B

    Text Generation • 8B • Updated Apr 13, 2025 • 3
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs