SkipMoE / requirements.txt
chengyanwu
stuff
ccda2ec
torch>=2.0.0
transformers>=4.34.0
accelerate>=0.25.0
datasets>=2.14.0
tqdm>=4.66.0
bitsandbytes>=0.41.0 # For 8-bit training if needed
sentencepiece>=0.1.99 # For tokenization
protobuf>=4.23.4 # For datasets loading
tensorboard>=2.13.0 # For training monitoring