Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
mnlp-2024
/
dpo_lora_mcqa
like
0
Follow
mnlp project of gans-and-losses
3
Text Generation
Transformers
Safetensors
gemma
trl
sft
conversational
text-generation-inference
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
7f369a9
dpo_lora_mcqa
5.07 GB
Ctrl+K
Ctrl+K
1 contributor
History:
3 commits
DragosTatar
Upload GemmaForCausalLM
7f369a9
verified
almost 2 years ago
.gitattributes
Safe
1.52 kB
initial commit
almost 2 years ago
README.md
Safe
5.18 kB
Upload GemmaForCausalLM
almost 2 years ago
adapter_config.json
Safe
747 Bytes
Upload model
almost 2 years ago
adapter_model.safetensors
29.5 MB
xet
Upload model
almost 2 years ago
config.json
Safe
707 Bytes
Upload GemmaForCausalLM
almost 2 years ago
generation_config.json
Safe
132 Bytes
Upload GemmaForCausalLM
almost 2 years ago
model-00001-of-00002.safetensors
4.97 GB
xet
Upload GemmaForCausalLM
almost 2 years ago
model-00002-of-00002.safetensors
67.6 MB
xet
Upload GemmaForCausalLM
almost 2 years ago
model.safetensors.index.json
Safe
39 kB
Upload GemmaForCausalLM
almost 2 years ago