Aaron Di

aaron-di

aaron-di

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

How Do Decoder-Only LLMs Perceive Users? Rethinking Attention Masking for User Representation Learning

liked a model 10 days ago

yolay/Youtu-Agent-RL-Maths-Qwen2.5-7B

upvoted an article 2 months ago

Transformers v5: Simple model definitions powering the AI ecosystem

View all activity

Organizations

None yet

upvoted a paper 4 days ago

How Do Decoder-Only LLMs Perceive Users? Rethinking Attention Masking for User Representation Learning

Paper • 2602.10622 • Published 5 days ago • 26

liked a model 10 days ago

yolay/Youtu-Agent-RL-Maths-Qwen2.5-7B

Text Generation • 8B • Updated about 1 month ago • 9 • 3

upvoted an article 2 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

Dec 1, 2025

•

298

liked a model 10 months ago

Qwen/Qwen3-4B

Text Generation • 4B • Updated Jul 26, 2025 • 5.09M • 552

liked a model 12 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 540k • • 13k

liked a model about 1 year ago

HuggingFaceFW/ablation-model-fineweb-edu

Text Generation • 2B • Updated Jun 11, 2024 • 546 • 19

liked a model over 1 year ago

m42-health/Llama3-Med42-8B

Text Generation • Updated Aug 20, 2024 • 4.09k • • 84

updated a Space over 1 year ago

MindSearch

📊

liked 5 models over 1 year ago

updated 7 models almost 2 years ago

aaron-di/YamshadowExperiment28-7B-TaskArithmetic

Text Generation • 7B • Updated May 10, 2024 • 2

aaron-di/YamshadowExperiment28-7B-Ties

Text Generation • 7B • Updated May 8, 2024 • 2

aaron-di/YamshadowExperiment28-7B-Linear

Text Generation • 7B • Updated May 8, 2024 • 1

aaron-di/YamshadowExperiment28-7B-DareLinear

Text Generation • 7B • Updated May 8, 2024 • 1

aaron-di/YamshadowExperiment28-7B-DareTies

Text Generation • 7B • Updated May 8, 2024 • 1

aaron-di/YamshadowExperiment28-7B-Slerp

Text Generation • 7B • Updated May 7, 2024 • 1

aaron-di/Yamshadowexperiment28M70.8-0.84-0.86-0.36-0.16-0.95-7B

Text Generation • 7B • Updated Apr 8, 2024 • 1

Aaron Di

AI & ML interests

Recent Activity

Organizations

aaron-di's activity

Transformers v5: Simple model definitions powering the AI ecosystem

MindSearch