Li Pengyi

LiPengyi29

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Breaking the Chain: A Causal Analysis of LLM Faithfulness to Intermediate Structures

Paper • 2603.16475 • Published Mar 17 • 13

upvoted a paper 2 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 59

upvoted a paper 3 months ago

Back to Basics: Revisiting Exploration in Reinforcement Learning for LLM Reasoning via Generative Probabilities

Paper • 2602.05281 • Published Feb 5 • 14

submitted a paper to Daily Papers 3 months ago

Back to Basics: Revisiting Exploration in Reinforcement Learning for LLM Reasoning via Generative Probabilities

Paper • 2602.05281 • Published Feb 5 • 14

updated a collection 5 months ago

P-GRPO

Collection

Exploration for RL in Large Language Models Based on Generative Probability Perspectives • 3 items • Updated Dec 19, 2025

upvoted a paper 5 months ago

Wikontic: Constructing Wikidata-Aligned, Ontology-Aware Knowledge Graphs with Large Language Models

Paper • 2512.00590 • Published Nov 29, 2025 • 52

upvoted a paper 6 months ago

GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms

Paper • 2511.17592 • Published Nov 17, 2025 • 121

upvoted 2 papers 9 months ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24, 2025 • 80

When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs

Paper • 2508.11383 • Published Aug 15, 2025 • 40

updated a model 9 months ago

LiPengyi29/r1-Qwen2.5-vl

Image-Text-to-Text • 8B • Updated Aug 18, 2025 • 1

published a model 9 months ago

LiPengyi29/r1-Qwen2.5-vl

Image-Text-to-Text • 8B • Updated Aug 18, 2025 • 1

upvoted 4 papers 10 months ago

upvoted a paper 11 months ago

DreamBoothDPO: Improving Personalized Generation using Direct Preference Optimization

Paper • 2505.20975 • Published May 27, 2025 • 36

commented 2 papers 11 months ago

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published Jun 5, 2025 • 135

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published Jun 5, 2025 • 135
