Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
nicholasKluge
/
RewardModelPT
like
0
Text Classification
Transformers
PyTorch
Safetensors
nicholasKluge/reward-aira-dataset
Portuguese
bert
reward model
alignment
preference model
RLHF
Carbon Emissions
text-embeddings-inference
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
refs/pr/1
RewardModelPT
1.74 GB
3 contributors
History:
42 commits
SFconvertbot
Adding `safetensors` variant of this model
fd59dc8
over 2 years ago
.gitattributes
1.48 kB
initial commit
over 2 years ago
LICENSE.txt
10.8 kB
Upload 16 files
over 2 years ago
README.md
8 kB
Update README.md
over 2 years ago
RewardModel.ipynb
402 kB
Rename RewardModelPT.ipynb to RewardModel.ipynb
over 2 years ago
RewardModel_emissions.csv
776 Bytes
Upload 16 files
over 2 years ago
config.json
953 Bytes
Upload 16 files
over 2 years ago
model.safetensors
436 MB
xet
Adding `safetensors` variant of this model
over 2 years ago
optimizer.pt
872 MB
xet
Upload 16 files
over 2 years ago
pytorch_model.bin
436 MB
xet
Upload 16 files
over 2 years ago
rng_state.pth
14.6 kB
xet
Upload 16 files
over 2 years ago
scheduler.pt
627 Bytes
xet
Upload 16 files
over 2 years ago
special_tokens_map.json
125 Bytes
Upload 16 files
over 2 years ago
tokenizer.json
678 kB
Upload 16 files
over 2 years ago
tokenizer_config.json
395 Bytes
Upload 16 files
over 2 years ago
trainer_state.json
4.76 kB
Upload 16 files
over 2 years ago
training_args.bin
4.03 kB
xet
Upload 16 files
over 2 years ago
vocab.txt
210 kB
Upload 16 files
over 2 years ago