
Hoang Nguyen

hnguy7

AI & ML interests

None yet

Recent Activity


liked a dataset 16 days ago

ServiceNow-AI/asr_codeswitched

Viewer • Updated 16 days ago • 918 • 289 • 5

upvoted an article 16 days ago

Article

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

ServiceNow-AI • 16 days ago • 44

liked a dataset 21 days ago

ServiceNow-AI/eva-bench

Viewer • Updated May 14 • 213 • 308 • 24

upvoted an article 21 days ago

Article

EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios

ServiceNow-AI • 21 days ago • 41

published an article 21 days ago

Article

EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios

ServiceNow-AI • 21 days ago • 41

authored 5 papers about 1 month ago

M2Lingual: Enhancing Multilingual, Multi-Turn Instruction Alignment in Large Language Models

Paper • 2406.16783 • Published Jun 24, 2024 • 4

Prompting with Phonemes: Enhancing LLM Multilinguality for non-Latin Script Languages

Paper • 2411.02398 • Published Nov 4, 2024 • 1

AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs

Paper • 2509.08031 • Published Sep 9, 2025 • 21

RECODE-H: A Benchmark for Research Code Development with Interactive Human Feedback

Paper • 2510.06186 • Published Oct 7, 2025 • 1

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published May 13 • 75

upvoted 2 papers about 1 month ago

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published May 13 • 75

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

Paper • 2605.12178 • Published May 12 • 65

upvoted an article 3 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 909

upvoted a paper 3 months ago

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published Mar 31 • 98

liked a dataset 3 months ago

ServiceNow-AI/eva

Viewer • Updated Mar 24 • 50 • 74 • 71

upvoted an article 3 months ago

Article

A New Framework for Evaluating Voice Agents (EVA)

ServiceNow-AI • Mar 24 • 95

published an article 3 months ago

Article

A New Framework for Evaluating Voice Agents (EVA)

ServiceNow-AI • Mar 24 • 95

upvoted a paper 3 months ago

EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

Paper • 2603.13594 • Published Mar 13 • 149

liked a dataset 3 months ago

ServiceNow-AI/EnterpriseOps-Gym

Viewer • Updated Apr 30 • 2.56k • 8.63k • 89

upvoted a collection 10 months ago

AU-Harness datasets

Collection

3 items • Updated Sep 12, 2025 • 6
