
Austin Xu

austinxu87

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago
MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled Benchmarks
upvoted a collection 3 months ago
FARE
updated a collection 3 months ago
FARE

Organizations

Salesforce

upvoted a paper 3 days ago

MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled Benchmarks

Paper • 2601.14652 • Published 6 days ago • 1
upvoted a collection 3 months ago

FARE

Collection
FARE are Salesforce AI Research's open multi-task evaluator models. • 4 items • Updated Oct 31, 2025 • 2
upvoted 3 papers 3 months ago

Foundational Automatic Evaluators: Scaling Multi-Task Generative Evaluator Training for Reasoning-Centric Domains

Paper • 2510.17793 • Published Oct 20, 2025 • 3

LiveResearchBench: A Live Benchmark for User-Centric Deep Research in the Wild

Paper • 2510.14240 • Published Oct 16, 2025 • 12

Hard2Verify: A Step-Level Verification Benchmark for Open-Ended Frontier Math

Paper • 2510.13744 • Published Oct 15, 2025 • 6
upvoted a paper 8 months ago

J4R: Learning to Judge with Equivalent Initial State Group Relative Policy Optimization

Paper • 2505.13346 • Published May 19, 2025 • 2