3 21 12

Jingdi Lei

huaXiaKyrie

https://kyrielei.github.io/

jingdi-lei-44540b2b6

AI & ML interests

Large Language Models, Reinforecement Learning

Recent Activity

liked a dataset 1 day ago

huaXiaKyrie/R1_Think_Med

updated a dataset 1 day ago

huaXiaKyrie/up

updated a dataset 7 days ago

huaXiaKyrie/delta-mem-qasper-data

View all activity

Organizations

upvoted a collection 7 days ago

WTF GENIUS PAPERS

Collection

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 145 items • Updated 2 days ago • 29

upvoted a paper 10 days ago

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

Paper • 2605.13779 • Published 11 days ago • 217

upvoted a paper 11 days ago

δ-mem: Efficient Online Memory for Large Language Models

Paper • 2605.12357 • Published 12 days ago • 120

upvoted a paper 27 days ago

AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents

Paper • 2603.27490 • Published Mar 29 • 18

upvoted a collection about 1 month ago

LLM Safety

Collection

Our research on LLM safety • 7 items • Updated Nov 6, 2025 • 2

upvoted 2 papers 3 months ago

MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier

Paper • 2603.03756 • Published Mar 4 • 89

PlanViz: Evaluating Planning-Oriented Image Generation and Editing for Computer-Use Tasks

Paper • 2602.06663 • Published Feb 6 • 5

upvoted a paper 4 months ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 112

upvoted 2 papers 5 months ago

IAG: Input-aware Backdoor Attack on VLMs for Visual Grounding

Paper • 2508.09456 • Published Aug 13, 2025 • 8

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Paper • 2512.12602 • Published Dec 14, 2025 • 44

upvoted a paper 7 months ago

Chem-R: Learning to Reason as a Chemist

Paper • 2510.16880 • Published Oct 19, 2025 • 53

upvoted a paper 8 months ago

OffTopicEval: When Large Language Models Enter the Wrong Chat, Almost Always!

Paper • 2509.26495 • Published Sep 30, 2025 • 13

upvoted 2 papers 9 months ago

CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics

Paper • 2508.18124 • Published Aug 25, 2025 • 49

Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery

Paper • 2508.08401 • Published Aug 11, 2025 • 42

upvoted 2 papers 10 months ago

Persona Vectors: Monitoring and Controlling Character Traits in Language Models

Paper • 2507.21509 • Published Jul 29, 2025 • 34

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published Jul 17, 2025 • 79

upvoted 2 papers 12 months ago

MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback

Paper • 2505.17873 • Published May 23, 2025 • 30

MOOSE-Chem2: Exploring LLM Limits in Fine-Grained Scientific Hypothesis Discovery via Hierarchical Search

Paper • 2505.19209 • Published May 25, 2025 • 23

upvoted 2 papers over 1 year ago

Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning

Paper • 2411.18203 • Published Nov 27, 2024 • 40

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3, 2024 • 54

Jingdi Lei

AI & ML interests

Recent Activity

Organizations

huaXiaKyrie's activity