Dingwei Chen

CuSO4-Chen

AI & ML interests

None yet

Recent Activity

upvoted a paper 30 days ago

Efficient Agentic Reinforcement Learning with On-Policy Intrinsic Knowledge Boundary Enhancement

submitted a paper 30 days ago

Efficient Agentic Reinforcement Learning with On-Policy Intrinsic Knowledge Boundary Enhancement

authored a paper about 2 months ago

Lower Layer Matters: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused

Organizations

None yet

upvoted a paper 30 days ago

Efficient Agentic Reinforcement Learning with On-Policy Intrinsic Knowledge Boundary Enhancement

Paper • 2605.26952 • Published about 1 month ago • 16

submitted a paper to Daily Papers 30 days ago

Efficient Agentic Reinforcement Learning with On-Policy Intrinsic Knowledge Boundary Enhancement

Paper • 2605.26952 • Published about 1 month ago • 16

authored 4 papers about 2 months ago

Lower Layer Matters: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused

Paper • 2408.08769 • Published Aug 16, 2024

ChARM: Character-based Act-adaptive Reward Modeling for Advanced Role-Playing Language Agents

Paper • 2505.23923 • Published May 29, 2025 • 8

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published May 3 • 171

A$^2$TGPO: Agentic Turn-Group Policy Optimization with Adaptive Turn-level Clipping

Paper • 2605.06200 • Published May 7 • 15

commented on a paper about 2 months ago

A$^2$TGPO: Agentic Turn-Group Policy Optimization with Adaptive Turn-level Clipping

Paper • 2605.06200 • Published May 7 • 15

upvoted a paper about 2 months ago

A$^2$TGPO: Agentic Turn-Group Policy Optimization with Adaptive Turn-level Clipping

Paper • 2605.06200 • Published May 7 • 15

submitted a paper to Daily Papers about 2 months ago

A$^2$TGPO: Agentic Turn-Group Policy Optimization with Adaptive Turn-level Clipping

Paper • 2605.06200 • Published May 7 • 15

upvoted a paper about 2 months ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published May 3 • 171

authored a paper 6 months ago

AT$^2$PO: Agentic Turn-based Policy Optimization via Tree Search

Paper • 2601.04767 • Published Jan 8 • 28

upvoted 2 papers 6 months ago

AT$^2$PO: Agentic Turn-based Policy Optimization via Tree Search

Paper • 2601.04767 • Published Jan 8 • 28

Tree Search for LLM Agent Reinforcement Learning

Paper • 2509.21240 • Published Sep 25, 2025 • 92

upvoted a paper 9 months ago

LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction

Paper • 2509.07403 • Published Sep 9, 2025 • 35

upvoted a paper about 1 year ago

ChARM: Character-based Act-adaptive Reward Modeling for Advanced Role-Playing Language Agents

Paper • 2505.23923 • Published May 29, 2025 • 8

liked a model almost 3 years ago

huggyllama/llama-7b

Text Generation • 7B • Updated Jul 2, 2024 • 201k • 358
