Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
liuhailin0123
/
trainer_output
like
0
Text Classification
Transformers
Safetensors
HumanLLMs/Human-Like-DPO-Dataset
English
llama
Generated from Trainer
trl
reward-trainer
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
trainer_output
Commit History
Update README.md
29509e5
verified
liuhailin0123
commited on
Mar 30, 2025
liuhailin0123/llm-course-hw2-reward-model
6a127d9
verified
liuhailin0123
commited on
Mar 30, 2025
Update README.md
dd11bce
verified
liuhailin0123
commited on
Mar 30, 2025
liuhailin0123/llm-course-hw2-reward-model
808414b
verified
liuhailin0123
commited on
Mar 30, 2025
Update README.md
2ab279a
verified
liuhailin0123
commited on
Mar 27, 2025
liuhailin0123/llm-course-hw2-reward-model
4b319c2
verified
liuhailin0123
commited on
Mar 26, 2025
liuhailin0123/llm-course-hw2-reward-model
394748c
verified
liuhailin0123
commited on
Mar 26, 2025
liuhailin0123/llm-course-hw2-reward-model
f29efad
verified
liuhailin0123
commited on
Mar 26, 2025
liuhailin0123/llm-course-hw2-reward-model
ac1f875
verified
liuhailin0123
commited on
Mar 26, 2025
liuhailin0123/llm-course-hw2-reward-model
27e2d2d
verified
liuhailin0123
commited on
Mar 26, 2025
liuhailin0123/llm-course-hw2-reward-model
f5c2137
verified
liuhailin0123
commited on
Mar 26, 2025
liuhailin0123/llm-course-hw2-reward-model
c684162
verified
liuhailin0123
commited on
Mar 26, 2025
liuhailin0123/llm-course-hw2-reward-model
21dae65
verified
liuhailin0123
commited on
Mar 26, 2025
liuhailin0123/llm-course-hw2-reward-model
cfe94eb
verified
liuhailin0123
commited on
Mar 26, 2025
liuhailin0123/llm-course-hw2-reward-model
0cac4c2
verified
liuhailin0123
commited on
Mar 26, 2025
liuhailin0123/llm-course-hw2-reward-model
c9377bc
verified
liuhailin0123
commited on
Mar 26, 2025
initial commit
197d653
verified
liuhailin0123
commited on
Mar 26, 2025