liuyixiu

liuyx0903

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents

Paper • 2606.11182 • Published 17 days ago • 18

liked a dataset 3 months ago

SII-GAIR-NLP/davinci-llm-data

Viewer • Updated Apr 16 • 1.25M • 154 • 13

upvoted a paper 3 months ago

ASI-Evolve: AI Accelerates AI

Paper • 2603.29640 • Published Mar 31 • 29

New activity in SII-GAIR-NLP/davinci-llm-model 3 months ago

Add metadata and link to code

#1 opened 3 months ago by nielsr

liked a model 3 months ago

SII-GAIR-NLP/davinci-llm-model

Text Generation • 3B • Updated Apr 2 • 26 • 30

updated a model 3 months ago

SII-GAIR-NLP/davinci-llm-model

Text Generation • 3B • Updated Apr 2 • 26 • 30

upvoted a paper 3 months ago

daVinci-LLM: Towards the Science of Pretraining

Paper • 2603.27164 • Published Mar 28 • 32

published a model 3 months ago

SII-GAIR-NLP/davinci-llm-model

Text Generation • 3B • Updated Apr 2 • 26 • 30

upvoted a paper 3 months ago

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published Mar 23 • 125

upvoted a paper 5 months ago

daVinci-Dev: Agent-native Mid-training for Software Engineering

Paper • 2601.18418 • Published Jan 26 • 126

upvoted a paper 6 months ago

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published Dec 29, 2025 • 66

liked a Space 8 months ago

The Smol Training Playbook

📚

3.22k

The secrets to building world-class LLMs

upvoted a paper 8 months ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 85

upvoted a paper about 1 year ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25, 2025 • 49

updated a model about 1 year ago

liuyx0903/xf

8B • Updated May 24, 2025 • 4

published a model about 1 year ago

liuyx0903/xf

8B • Updated May 24, 2025 • 4

upvoted a paper about 1 year ago

Efficient Agent Training for Computer Use

Paper • 2505.13909 • Published May 20, 2025 • 44

updated 2 models almost 2 years ago

GAIR/Safety-J-v5

Image Feature Extraction • 8B • Updated Jul 15, 2024 • 6 • 1

GAIR/Safety-J-v1

Image Feature Extraction • 8B • Updated Jul 15, 2024 • 2
