v's picture

4 83

v

ziqi7

·

AI & ML interests

None yet

Recent Activity

updated a collection about 1 month ago

RL&LLM Agent-强化学习

upvoted a paper about 1 month ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

liked a model 2 months ago

Soul-AILab/SoulX-Podcast-1.7B

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17, 2025 • 45

upvoted a collection 3 months ago

Agent & RL

55 items • Updated Nov 27, 2025 • 21

upvoted 2 papers over 1 year ago

Qwen2-Audio Technical Report

Paper • 2407.10759 • Published Jul 15, 2024 • 64

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 248