MergeBench-Llama-8B-it

yifeihe3

authored 6 papers 5 months ago

Semi-Supervised Reward Modeling via Iterative Self-Training

Paper • 2409.06903 • Published Sep 10, 2024 • 1

Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks

Paper • 2410.18210 • Published Oct 23, 2024

updated a model about 1 year ago

MergeBench-Llama-8B-it/llama-3.1-8b-it_tulu-3-sft-personas-instruction-following_epoch3_0429

Text Generation • 8B • Updated May 5, 2025 • 34

yifeihe3

updated a model about 1 year ago

MergeBench-Llama-8B-it/llama3-8b-it-GRPO-after-sft

Text Generation • 8B • Updated May 2, 2025 • 14

yifeihe3

published a model about 1 year ago

MergeBench-Llama-8B-it/llama3-8b-it-GRPO-after-sft

Text Generation • 8B • Updated May 2, 2025 • 14

Mirnegg

updated a model about 1 year ago

MergeBench-Llama-8B-it/llama-3.1-8b-it_mtl

Text Generation • 8B • Updated May 1, 2025 • 4

cindy2000sh

published a model about 1 year ago

MergeBench-Llama-8B-it/llama-3.1-8b-it_tulu-3-sft-personas-instruction-following_epoch3_0429

Text Generation • 8B • Updated May 5, 2025 • 34

Mirnegg

published a model about 1 year ago

MergeBench-Llama-8B-it/llama-3.1-8b-it_mtl

Text Generation • 8B • Updated May 1, 2025 • 4

Mirnegg

updated a model about 1 year ago

MergeBench-Llama-8B-it/llama-3.1-8b-it_aya_2epoch

Text Generation • 8B • Updated Apr 26, 2025 • 19

Mirnegg

published a model about 1 year ago

MergeBench-Llama-8B-it/llama-3.1-8b-it_aya_2epoch

Text Generation • 8B • Updated Apr 26, 2025 • 19

yifeihe3

updated a model about 1 year ago

MergeBench-Llama-8B-it/llama-3.1-8b-it_dart-math-uniform_2epoch

Text Generation • 8B • Updated Apr 25, 2025 • 3

yifeihe3

published a model about 1 year ago

MergeBench-Llama-8B-it/llama-3.1-8b-it_dart-math-uniform_2epoch

Text Generation • 8B • Updated Apr 25, 2025 • 3

Mirnegg

authored a paper almost 3 years ago

Understanding the Impact of Adversarial Robustness on Accuracy Disparity

Paper • 2211.15762 • Published Nov 28, 2022

Team members 3