Model Description

  • Developed by: EleutherAI
  • Model type: Transformer-based Language Model
  • License: Apache 2.0

Bias, Risks, and Limitations

Warning: this model may produce harmful content

Citation

@misc{biderman2023pythiasuiteanalyzinglarge,
      title={Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling}, 
      author={Stella Biderman and Hailey Schoelkopf and Quentin Anthony and Herbie Bradley and Kyle O'Brien and Eric Hallahan and Mohammad Aflah Khan and Shivanshu Purohit and USVSN Sai Prashanth and Edward Raff and Aviya Skowron and Lintang Sutawika and Oskar van der Wal},
      year={2023},
      eprint={2304.01373},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2304.01373}, 
}
Downloads last month
78
Safetensors
Model size
12B params
Tensor type
F32
·
F16
·
U8
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train TOFU-SFT/pythia-12b-4bit

Collection including TOFU-SFT/pythia-12b-4bit

Paper for TOFU-SFT/pythia-12b-4bit