Truong D Nguyen PRO
tonyshelby
·
AI & ML interests
None yet
Recent Activity
updated
a collection
16 days ago
TokenBPO
updated
a collection
16 days ago
TokenBPO
updated
a model
18 days ago
tonyshelby/llama-QTBPO-merged
Organizations
models
37
tonyshelby/mistral-ATBPO-merged
Updated
tonyshelby/llama-ATBPO-merged
Updated
tonyshelby/llama-ATBPO
Updated
tonyshelby/llama-reverse-dpo-merged
Updated
tonyshelby/mistral-reverse-dpo-merged
Updated
tonyshelby/llama-QTBPO-merged
Updated
tonyshelby/llama-QTBPO-first-run
Updated
tonyshelby/mistral-QTBPO-merged
Updated
tonyshelby/mistral-QTBPO-third-run
Updated
tonyshelby/mistral-QTBPO-second-run
Updated
datasets
34
tonyshelby/ultra-feedback-tisdpo-llama
Viewer
•
Updated
•
63.1k
•
15
tonyshelby/processed_data
Preview
•
Updated
•
57
tonyshelby/ultra-feedback-tisdpo-mistral
Updated
•
40
tonyshelby/ultrafeedback_binarized_reversed
Viewer
•
Updated
•
187k
•
27
tonyshelby/view
Viewer
•
Updated
•
1.02k
tonyshelby/AIME25-ER-verl
Viewer
•
Updated
•
30
tonyshelby/Archer2.0-Math-1.5B-ER
Viewer
•
Updated
•
70.8k
•
2
tonyshelby/Archer2.0-Math-1.5B-VeRL
Viewer
•
Updated
•
70.8k
•
1
tonyshelby/14bweight
Viewer
•
Updated
•
56.5k
tonyshelby/ultra-feedback_falcon3_v2
Viewer
•
Updated
•
58.7k
•
1