mcnckc/llm-hw2-ppo
Text Generation
Transformers
Safetensors
HumanLLMs/Human-Like-DPO-Dataset
English
llama
text2text-generation
conversational
text-generation-inference
main
Commit History
Update README.md
16b08cc
verified
mcnckc committed on Mar 6, 2025
Upload tokenizer
3def931
verified
mcnckc committed on Mar 6, 2025
Upload LlamaForCausalLM
6928f5d
verified
mcnckc committed on Mar 6, 2025
initial commit
f07f3fa
verified
mcnckc committed on Mar 6, 2025