1 28 89

Maojia Song

OrangeEye

AI & ML interests

None yet

Recent Activity

upvoted a paper about 10 hours ago

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

upvoted a paper 8 days ago

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

upvoted a paper 8 days ago

Agents' Last Exam

View all activity

Organizations

Collections 1

Papers 1

arxiv:2509.13310

spaces 1

Aa

👀

models 4

datasets 0

None public yet

Maojia Song

AI & ML interests

Recent Activity

Organizations

Collections 1

HuggingFaceH4/MATH

HuggingFaceH4/MATH-500

microsoft/orca-math-word-problems-200k

openai/gsm8k

HuggingFaceH4/MATH

HuggingFaceH4/MATH-500

microsoft/orca-math-word-problems-200k

openai/gsm8k

Papers 1

spaces 1

Aa

models 4

OrangeEye/Qwen2.5-1.5B-Knowledge-R1-GRPO

OrangeEye/Trust-Align-Qwen2.5

OrangeEye/qwen-25-1.5b-base-sft

OrangeEye/gemma-2-2b-base-SFT

datasets 0

Maojia Song

AI & ML interests

Recent Activity

Organizations

Collections 1

Papers 1

spaces 1

Aa

models 4 Sort: Recently updated

datasets 0

models 4