nb

Do you understand how the reward model is built there? They say it's formed a rule-based on correctness, so is it only applied to prompts taken from math problems and leet-code problems? How were the prompts chosen/generated in the RL phase?

updated a model almost 2 years ago

ndvb/segformer-b0-finetuned-segments-sidewalk-oct-22

Updated Sep 17, 2024

updated a collection over 2 years ago

Text to image

Collection

1 item • Updated Dec 10, 2023

upvoted a paper over 2 years ago

The Chosen One: Consistent Characters in Text-to-Image Diffusion Models

Paper • 2311.10093 • Published Nov 16, 2023 • 58

New activity in ItbearZhang/facebook-opt-125m-with-alpacadataset almost 3 years ago

Adding `safetensors` variant of this model

#1 opened about 3 years ago by

SFconvertbot

New activity in ramsrigouthamg/t5-large-paraphraser-diverse-high-quality over 3 years ago

How do I export it to torchscript?

👍 1

#2 opened about 4 years ago by

elavneet

nb

AI & ML interests

Recent Activity

Organizations

ndvb's activity

Add dataset card and links to paper/GitHub

Adding `safetensors` variant of this model

How do I export it to torchscript?