DeyangKong's picture

DeyangKong

DeyangKong

·

DeyangKong

AI & ML interests

Natural Language Processing

Recent Activity

upvoted a paper 22 days ago

From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space

upvoted a paper about 1 month ago

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

upvoted a paper about 1 month ago

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

View all activity

Organizations

authored a paper 2 months ago

Advancing Block Diffusion Language Models for Test-Time Scaling

Paper • 2602.09555 • Published Feb 10 • 4

authored 3 papers 3 months ago

LongCat-Flash Technical Report

Paper • 2509.01322 • Published Sep 1, 2025 • 8

Autoformalizer with Tool Feedback

Paper • 2510.06857 • Published Oct 8, 2025

OPE: Overcoming Information Saturation in Parallel Thinking via Outline-Guided Path Exploration

Paper • 2602.08344 • Published Feb 9 • 5

submitted a paper to Daily Papers 3 months ago

OPE: Overcoming Information Saturation in Parallel Thinking via Outline-Guided Path Exploration

Paper • 2602.08344 • Published Feb 9 • 5

authored a paper 12 months ago

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective

Paper • 2505.17652 • Published May 23, 2025 • 6

authored a paper about 1 year ago

SampleMix: A Sample-wise Pre-training Data Mixing Strategey by Coordinating Data Quality and Diversity

Paper • 2503.01506 • Published Mar 3, 2025 • 10