Safetensors

Model Description

  • Developed by: DeepSeek-AI
  • Model type: Causal Language Models
  • License: deepseek license
  • Fine-tuned from: TOFU-SFT/deepseek-math-7b-base-4bit

Citation

@article{shao2024deepseekmath,
  title={Deepseekmath: Pushing the limits of mathematical reasoning in open language models},
  author={Shao, Zhihong and Wang, Peiyi and Zhu, Qihao and Xu, Runxin and Song, Junxiao and Bi, Xiao and Zhang, Haowei and Zhang, Mingchuan and Li, YK and Wu, Yang and others},
  journal={arXiv preprint arXiv:2402.03300},
  year={2024}
}


Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for TOFU-SFT/deepseek-math-7b-base-4bit-cot-sft-tofu