DONGRYEOLLEE

drlee1

DONGRYEOLLEE1

AI & ML interests

None yet

Recent Activity

upvoted a paper about 16 hours ago

Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention

liked a model 3 days ago

LiquidAI/LFM2.5-Embedding-350M

liked a dataset 6 days ago

lordx64/agentic-distill-fable-5-sft

View all activity

Organizations

None yet

upvoted a paper about 16 hours ago

Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention

Paper • 2606.20945 • Published 7 days ago • 59

liked a model 3 days ago

LiquidAI/LFM2.5-Embedding-350M

liked a dataset 6 days ago

lordx64/agentic-distill-fable-5-sft

Viewer • Updated 9 days ago • 4.66k • 840 • 36

liked a model 6 days ago

WeiboAI/VibeThinker-3B

Text Generation • 3B • Updated 5 days ago • 49.6k • • 688

upvoted a paper 7 days ago

Learning from the Self-future: On-policy Self-distillation for dLLMs

Paper • 2606.18195 • Published 9 days ago • 74

upvoted a paper 9 days ago

FastContext: Training Efficient Repository Explorer for Coding Agents

Paper • 2606.14066 • Published 13 days ago • 91

upvoted 2 papers 10 days ago

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Paper • 2606.13473 • Published 14 days ago • 90

MiniMax Sparse Attention

Paper • 2606.13392 • Published 14 days ago • 145

liked a model 10 days ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

Text Generation • 32B • Updated Mar 15 • 1.2M • • 773

liked a model 13 days ago

jinaai/jina-embeddings-v5-text-small

Feature Extraction • 0.6B • Updated Apr 15 • 398k • 181

upvoted a paper 13 days ago

MoBA: Mixture of Block Attention for Long-Context LLMs

Paper • 2502.13189 • Published Feb 18, 2025 • 19

upvoted a paper 14 days ago

SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Paper • 2606.09730 • Published 17 days ago • 52

liked a dataset 15 days ago

m-a-p/CodeFeedback-Filtered-Instruction

Viewer • Updated Feb 26, 2024 • 157k • 18.5k • 204

liked a model 16 days ago

ny1031/Qwen3-1.7B-SFT-RLVR-IF

Text Generation • 2B • Updated May 6 • 6 • 1

liked a dataset 16 days ago

allenai/tulu-3-sft-mixture

Viewer • Updated Dec 2, 2024 • 939k • 18.7k • 251

upvoted 2 papers 20 days ago

Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning

Paper • 2605.30039 • Published 27 days ago • 20

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Paper • 2606.02437 • Published 24 days ago • 232

upvoted a paper 22 days ago

K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts

Paper • 2606.02404 • Published 24 days ago • 57

liked a model 24 days ago

Qwen/Qwen3.5-2B

Image-Text-to-Text • 2B • Updated Mar 2 • 1.73M • • 316

upvoted a paper about 1 month ago

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

Paper • 2605.19577 • Published May 19 • 59

DONGRYEOLLEE

AI & ML interests

Recent Activity

Organizations

drlee1's activity