Yupeng Cao's picture

7

Yupeng Cao PRO

YupengCao

·

https://cyp0630.github.io/

CYP0630

AI & ML interests

NLP, Multimodal, Audio, Truthworthy

Recent Activity

updated a model 15 days ago

YupengCao/t4-Qwen2-7B-Instruct-GRPO

updated a Space 16 days ago

YupengCao/t4-Qwen2-7B-Instruct-GRPO

published a Space 16 days ago

YupengCao/t4-Qwen2-7B-Instruct-GRPO

View all activity

Organizations

updated a model 15 days ago

YupengCao/t4-Qwen2-7B-Instruct-GRPO

Updated 15 days ago

updated a Space 16 days ago

T4 Qwen2 7B Instruct GRPO

Show a live tracking dashboard

published a Space 16 days ago

T4 Qwen2 7B Instruct GRPO

Show a live tracking dashboard

published a model 16 days ago

YupengCao/t4-Qwen2-7B-Instruct-GRPO

Updated 15 days ago

updated a Space 16 days ago

Trackio

Display a visual summary of your program’s I/O activity

published a Space 16 days ago

Trackio

Display a visual summary of your program’s I/O activity

published a model 16 days ago

YupengCao/Qwen3-VL-4B-Instruct-trl-grpo

Updated 16 days ago

updated a dataset 20 days ago

Financial-Misinformation-Detection/MultilingualFMD

Viewer • Updated 20 days ago • 42 • 92

updated a dataset about 2 months ago

Financial-Misinformation-Detection/PersonaReasoning-v2

Viewer • Updated Feb 1 • 7 • 71

upvoted a paper 2 months ago

Same Claim, Different Judgment: Benchmarking Scenario-Induced Bias in Multilingual Financial Misinformation Detection

Paper • 2601.05403 • Published Jan 8 • 10

published a dataset 3 months ago

YupengCao/FinMCP

Viewer • Updated Dec 9, 2025 • 8.11k • 7

updated a dataset 3 months ago

YupengCao/FinMCP

Viewer • Updated Dec 9, 2025 • 8.11k • 7

updated a model 4 months ago

YupengCao/qwen2-7b-instruct-amazon-description

Updated Nov 12, 2025

published a model 4 months ago

YupengCao/qwen2-7b-instruct-amazon-description

Updated Nov 12, 2025

updated a dataset 4 months ago

YupengCao/OCR-evaluation

Preview • Updated Nov 12, 2025 • 5

published a dataset 4 months ago

YupengCao/OCR-evaluation

Preview • Updated Nov 12, 2025 • 5

upvoted a paper 6 months ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26, 2025 • 137

upvoted a paper 8 months ago

Truth Neurons

Paper • 2505.12182 • Published May 18, 2025 • 8

upvoted a paper 9 months ago

MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation

Paper • 2506.14028 • Published Jun 16, 2025 • 93

updated a model 10 months ago

YupengCao/halsci_lora_8bit

Updated May 23, 2025