File size: 708 Bytes
4a6a078 91fa8e3 4a6a078 91fa8e3 4a6a078 91fa8e3 4a6a078 91fa8e3 4a6a078 91fa8e3 c60e4d4 91fa8e3 c60e4d4 91fa8e3 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 |
---
library_name: transformers
datasets:
- HumanLLMs/Human-Like-DPO-Dataset
language:
- en
base_model:
- google-bert/bert-base-uncased
---
# BERT Human-like Reward Model
This is a reward model based on Bert Uncased.
### Inference
```python
!pip install transformers accelerate
model_name = "entfane/BERT_human_like_RM"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)
messages = ["How are you doing? Great",
"How are you doing? Greetings! I am doing just fine, may I ask you, how are you doing?"
]
input = tokenizer(messages, return_tensors="pt", padding="max_length").to(model.device)
output = model(**input)
print(output)
``` |