How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-classification", model="MaxJeblick/reward-model-deberta-v3-unit-test")
# Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("MaxJeblick/reward-model-deberta-v3-unit-test")
model = AutoModelForSequenceClassification.from_pretrained("MaxJeblick/reward-model-deberta-v3-unit-test")
Quick Links

YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

Small dummy deberta-v3-type Reward Model useable for Unit/Integration tests for RLHF. Suitable for CPU only machines, see H2O LLM Studio for an example integration test.

Model was created as follows:

from transformers import AutoConfig, AutoTokenizer, AutoModelForSequenceClassification

repo_name = "MaxJeblick/reward-model-deberta-v3-unit-test"
model_name = "OpenAssistant/reward-model-deberta-v3-large-v2"
config = AutoConfig.from_pretrained(model_name)

config.hidden_size = 12
config.intermediate_size = 24
config.num_attention_heads = 2
config.num_hidden_layers = 2
config.pooler_hidden_size = 12

tokenizer = AutoTokenizer.from_pretrained(model_name)

model = AutoModelForSequenceClassification.from_config(config)
print(model.num_parameters())  # 1_546_129


model.push_to_hub(repo_name, private=False)
tokenizer.push_to_hub(repo_name, private=False)
config.push_to_hub(repo_name, private=False)
Downloads last month
18
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support