tonyshelby (Truong D Nguyen)

Collections 2

models 41

datasets 36

tonyshelby/llama3-ultrafeedback-armorm-noisy-0.1

Viewer • Updated Mar 26 • 61.8k • 15

tonyshelby/llama3-ultrafeedback-armorm-noisy

Viewer • Updated Mar 26 • 61.8k • 12

tonyshelby/ultra-feedback-tisdpo-llama

Viewer • Updated Jan 18 • 63.1k • 21

tonyshelby/processed_data

Preview • Updated Jan 17 • 16

tonyshelby/ultra-feedback-tisdpo-mistral

Updated Jan 15 • 12

tonyshelby/ultrafeedback_binarized_reversed

Viewer • Updated Jan 14 • 187k • 7

tonyshelby/view

Viewer • Updated Nov 23, 2025 • 1.02k • 26

tonyshelby/AIME25-ER-verl

Viewer • Updated Nov 1, 2025 • 30 • 18

tonyshelby/Archer2.0-Math-1.5B-ER

Viewer • Updated Oct 31, 2025 • 70.8k • 15

tonyshelby/Archer2.0-Math-1.5B-VeRL

Viewer • Updated Oct 31, 2025 • 70.8k • 12

View 36 datasets

Truong D Nguyen PRO

AI & ML interests

Organizations

Collections 2

tonyshelby/mistral-QTBPO-merged

tonyshelby/mistral-ATBPO-merged

tonyshelby/llama-sft-merged

tonyshelby/llama-QTBPO-merged

tonyshelby/qwen2.5_3b_checkpoints

tonyshelby/qwen2.5_7b_checkpoints

tp140205/arm-router-base

tonyshelby/mistral-QTBPO-merged

tonyshelby/mistral-ATBPO-merged

tonyshelby/llama-sft-merged

tonyshelby/llama-QTBPO-merged

tonyshelby/qwen2.5_3b_checkpoints

tonyshelby/qwen2.5_7b_checkpoints

tp140205/arm-router-base

models 41

tonyshelby/llama-ATBPO-noisy

tonyshelby/llama-QTBPO-noisy

tonyshelby/llama-TBPO-no-weight

tonyshelby/mistral-ATBPO-merged

tonyshelby/llama-ATBPO-merged

tonyshelby/llama-ATBPO

tonyshelby/llama-reverse-dpo-merged

tonyshelby/mistral-reverse-dpo-merged

tonyshelby/llama-QTBPO-merged

tonyshelby/llama-QTBPO-first-run

datasets 36

tonyshelby/llama3-ultrafeedback-armorm-noisy-0.1

tonyshelby/llama3-ultrafeedback-armorm-noisy

tonyshelby/ultra-feedback-tisdpo-llama

tonyshelby/processed_data

tonyshelby/ultra-feedback-tisdpo-mistral

tonyshelby/ultrafeedback_binarized_reversed

tonyshelby/view

tonyshelby/AIME25-ER-verl

tonyshelby/Archer2.0-Math-1.5B-ER

tonyshelby/Archer2.0-Math-1.5B-VeRL

Truong D Nguyen PRO

AI & ML interests

Organizations

Collections 2

models 41 Sort: Recently updated

datasets 36 Sort: Recently updated

models 41

datasets 36