Kuo-Hsin Tu

dapumptu

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

microsoft/FastContext-1.0-4B-SFT

upvoted a paper 2 months ago

p1: Better Prompt Optimization with Fewer Prompts

liked a dataset 2 months ago

meituan-longcat/General365_Public

View all activity

Organizations

liked a model 3 days ago

microsoft/FastContext-1.0-4B-SFT

Text Generation • 4B • Updated 8 days ago • 5.28k • • 341

upvoted a paper 2 months ago

p1: Better Prompt Optimization with Fewer Prompts

Paper • 2604.08801 • Published Apr 9 • 9

liked a dataset 2 months ago

meituan-longcat/General365_Public

Viewer • Updated Apr 14 • 720 • 199 • 10

upvoted 9 papers 2 months ago

Narrative-Driven Paper-to-Slide Generation via ArcDeck

Paper • 2604.11969 • Published Apr 13 • 7

Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation

Paper • 2604.13010 • Published Apr 14 • 19

How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data

Paper • 2604.14164 • Published Mar 23 • 35

AccelOpt: A Self-Improving LLM Agentic System for AI Accelerator Kernel Optimization

Paper • 2511.15915 • Published Apr 15 • 4

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published Mar 31 • 98

InCoder-32B-Thinking: Industrial Code World Model for Thinking

Paper • 2604.03144 • Published Apr 3 • 239

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 636

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 509

Kronos: A Foundation Model for the Language of Financial Markets

Paper • 2508.02739 • Published Aug 2, 2025 • 44

liked a model 3 months ago

nvidia/gpt-oss-puzzle-88B

Text Generation • 91B • Updated Apr 28 • 1.85k • 92

updated a collection 3 months ago

adavanced learning

Collection

3 items • Updated Mar 23 • 1

upvoted 3 papers 3 months ago

Efficient Exploration at Scale

Paper • 2603.17378 • Published Mar 18 • 15

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published Mar 19 • 69

Online Experiential Learning for Language Models

Paper • 2603.16856 • Published Mar 17 • 60

liked a model 3 months ago

Multilingual-Multimodal-NLP/IndustrialCoder

Text Generation • 32B • Updated Mar 27 • 106 • 65

Kuo-Hsin Tu

AI & ML interests

Recent Activity

Organizations

dapumptu's activity