1 23 13

Weihao XUAN

weihao1115

https://weihaoxuan.com/

weihao1115

AI & ML interests

None yet

Recent Activity

updated a dataset about 8 hours ago

weihao1115/for_tianhao

published a dataset about 8 hours ago

weihao1115/for_tianhao

upvoted a paper 9 days ago

Beyond Monolingual Deep Research: Evaluating Agents and Retrievers with Cross-Lingual BrowseComp-Plus

View all activity

Organizations

upvoted a paper 9 days ago

Beyond Monolingual Deep Research: Evaluating Agents and Retrievers with Cross-Lingual BrowseComp-Plus

Paper • 2606.15345 • Published 14 days ago • 16

upvoted a paper about 1 month ago

MixSD: Mixed Contextual Self-Distillation for Knowledge Injection

Paper • 2605.16865 • Published May 16 • 9

upvoted 2 papers 2 months ago

Dual-View Training for Instruction-Following Information Retrieval

Paper • 2604.18845 • Published Apr 20 • 12

Code-Switching Information Retrieval: Benchmarks, Analysis, and the Limits of Current Retrievers

Paper • 2604.17632 • Published Apr 19 • 12

upvoted a paper 4 months ago

O-Researcher: An Open Ended Deep Research Model via Multi-Agent Distillation and Agentic RL

Paper • 2601.03743 • Published Jan 7 • 3

upvoted 4 papers 5 months ago

RAPTOR: Ridge-Adaptive Logistic Probes

Paper • 2602.00158 • Published Jan 29 • 8

NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems

Paper • 2601.11004 • Published Jan 16 • 31

Seeing is Believing, but How Much? A Comprehensive Analysis of Verbalized Calibration in Vision-Language Models

Paper • 2505.20236 • Published May 26, 2025 • 3

The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents

Paper • 2601.07264 • Published Jan 12 • 24

upvoted a paper 6 months ago

Toward Global Large Language Models in Medicine

Paper • 2601.02186 • Published Jan 5 • 6

upvoted a collection 6 months ago

GlobMed

Collection

10 items • Updated Mar 2 • 3

upvoted a paper 6 months ago

Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction

Paper • 2512.18880 • Published Dec 21, 2025 • 25

upvoted a paper 8 months ago

DisasterM3: A Remote Sensing Vision-Language Dataset for Disaster Damage Assessment and Response

Paper • 2505.21089 • Published May 27, 2025 • 5

upvoted 4 papers 9 months ago

Good Intentions Beyond ACL: Who Does NLP for Social Good, and Where?

Paper • 2510.04434 • Published Oct 6, 2025 • 6

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 551

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 147

Multiplayer Nash Preference Optimization

Paper • 2509.23102 • Published Sep 27, 2025 • 62

upvoted 3 papers 11 months ago

Weihao XUAN

AI & ML interests

Recent Activity

Organizations

weihao1115's activity