Reward Models - a CKeibel Collection

Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

CKeibel 's Collections

Code-Embeddings

Speech2Text (ASR)

diffusion models

Text-Classification

Causal LMs, seq2seq models

Embedding models

BERT based tasks (models)

Reward Models

updated Dec 18, 2024

RLHFlow/Llama3.1-8B-PRM-Deepseek-Data

Text Generation • 8B • Updated May 10, 2025 • 3.13k • • 38

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs