WizardLM Team

community

WizardLM_AI

nlpxucan/WizardLM

Activity Feed

AI & ML interests

Large Language Models

Recent Activity

tangmen submitted a paper about 21 hours ago

VeriEvol: Scaling Multimodal Mathematical Reasoning via Verifiable Evol-Instruct

tangmen authored a paper 1 day ago

RubricBench: Aligning Model-Generated Rubrics with Human Standards

tangmen authored a paper 1 day ago

OffSeeker: Online Reinforcement Learning Is Not All You Need for Deep Research Agents

View all activity

tangmen

submitted a paper to Daily Papers about 21 hours ago

VeriEvol: Scaling Multimodal Mathematical Reasoning via Verifiable Evol-Instruct

Paper • 2606.23543 • Published 4 days ago • 6

tangmen

authored 9 papers 1 day ago

RubricBench: Aligning Model-Generated Rubrics with Human Standards

Paper • 2603.01562 • Published Mar 2 • 64

OffSeeker: Online Reinforcement Learning Is Not All You Need for Deep Research Agents

Paper • 2601.18467 • Published Jan 26 • 1

Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models

Paper • 2603.01571 • Published Mar 2 • 34

AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent

Paper • 2512.20745 • Published Dec 23, 2025

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Paper • 2505.15431 • Published May 21, 2025 • 2

Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models

Paper • 2412.16933 • Published Dec 22, 2024

Multimodal Dialogue Response Generation

Paper • 2110.08515 • Published Oct 16, 2021

WizardLM: Empowering Large Language Models to Follow Complex Instructions

Paper • 2304.12244 • Published Apr 24, 2023 • 14

VeriEvol: Scaling Multimodal Mathematical Reasoning via Verifiable Evol-Instruct

Paper • 2606.23543 • Published 4 days ago • 6

haipeng1

authored 4 papers 7 days ago

WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

Paper • 2308.09583 • Published Aug 18, 2023 • 8

Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena

Paper • 2407.10627 • Published Jul 15, 2024

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Paper • 2505.15431 • Published May 21, 2025 • 2

AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent

Paper • 2512.20745 • Published Dec 23, 2025

haipeng1

submitted a paper to Daily Papers 7 days ago

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

Paper • 2606.19236 • Published 9 days ago • 12

haipeng1

authored a paper 7 days ago

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

Paper • 2606.19236 • Published 9 days ago • 12

Ziyang

authored a paper 6 months ago

DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs

Paper • 2601.03559 • Published Jan 7 • 14

chongyang09

authored 3 papers 6 months ago

AI & ML interests

Recent Activity

Team members 7

WizardLMTeam's activity