Cascade0-170M-Base / README.md
ARMZyany's picture
Update README.md
1fc7fe4 verified
metadata
license: apache-2.0
datasets:
  - Salesforce/wikitext
  - VisionTheta/fineweb-1B
  - Voxel51/fiftyone-qa-pairs-14k
  - Open-Orca/OpenOrca
  - OpenAssistant/oasst2
  - Ereeeeef3/Qu-QA-v2
  - tau/commonsense_qa
  - OpenAssistant/oasst1
  - hkust-nlp/deita-10k-v0
  - HuggingFaceH4/ultrafeedback_binarized
  - meta-math/MetaMathQA
  - HuggingFaceH4/ultrachat_200k
language:
  - en
pipeline_tag: text-generation

Cascade0 170M Base Model

cascade-zero - Copy

Base model of the entire Cascade0-159M-DPO-Instruct and Normal Instruct saga.

Max context size is 1512.

Cascade0 Base VS other small models

output

made with LMEval Harness