Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
shirwu
/
preference_iterative_hard-answer_generator-iter0
like
0
Text Classification
Transformers
Safetensors
llama
trl
reward-trainer
text-embeddings-inference
4-bit precision
bitsandbytes
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
preference_iterative_hard-answer_generator-iter0
5.16 GB
Ctrl+K
Ctrl+K
1 contributor
History:
4 commits
shirwu
Upload LlamaForSequenceClassification
023e531
verified
about 1 year ago
.gitattributes
Safe
1.57 kB
Upload tokenizer
about 1 year ago
README.md
Safe
5.19 kB
Upload LlamaForSequenceClassification
about 1 year ago
adapter_config.json
811 Bytes
Upload model
about 1 year ago
adapter_model.safetensors
168 MB
xet
Upload model
about 1 year ago
config.json
Safe
1.53 kB
Upload LlamaForSequenceClassification
about 1 year ago
model.safetensors
4.98 GB
xet
Upload LlamaForSequenceClassification
about 1 year ago
special_tokens_map.json
Safe
325 Bytes
Upload tokenizer
about 1 year ago
tokenizer.json
Safe
17.2 MB
xet
Upload tokenizer
about 1 year ago
tokenizer_config.json
Safe
55.4 kB
Upload tokenizer
about 1 year ago