Sumin Kim's picture

4 6

Sumin Kim

aigogongburani

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper 14 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

upvoted a paper 8 months ago

Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems

View all activity

Organizations

upvoted a paper 6 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 15 days ago • 141

upvoted a paper 14 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 20 days ago • 210

upvoted a paper 8 months ago

Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems

Paper • 2505.00212 • Published Apr 30, 2025 • 9

updated a model 9 months ago

aigogongburani/classifier-llama3-8b-kor-legal

Updated May 7, 2025

liked a model 9 months ago

yanolja/YanoljaNEXT-EEVE-Instruct-7B-v2-Preview

8B • Updated Aug 29, 2025 • 30 • 35

published a model 9 months ago

aigogongburani/classifier-llama3-8b-kor-legal

Updated May 7, 2025

liked a dataset 11 months ago

dmarsili/Omni3D-Bench

Viewer • Updated Feb 17, 2025 • 501 • 223 • 8

liked 3 datasets about 1 year ago

HuggingFaceTB/finemath

Viewer • Updated Feb 6, 2025 • 48.3M • 8.22k • 348

ChuGyouk/MedQA

Viewer • Updated Aug 16, 2024 • 22.9k • 13 • 3

nguha/legalbench

Updated Sep 30, 2024 • 111k • 160

upvoted a collection almost 2 years ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 888

liked a model about 2 years ago

beomi/llama-2-ko-7b

Text Generation • 7B • Updated Dec 27, 2023 • 1.78k • 176