Miaow Lab @ CityUHK

university

https://ningmiao.space

AI & ML interests

LLM reasoning

Recent Activity

TorresYang authored a paper 20 days ago

SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models

TorresYang authored a paper 20 days ago

Beyond Ideal Instruction: A Comprehensive Framework for Evaluating LLMs in Realistic Interactions

TorresYang updated a collection 27 days ago

View all activity

authored 2 papers 20 days ago

SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models

Paper • 2503.00211 • Published Feb 28, 2025

Beyond Ideal Instruction: A Comprehensive Framework for Evaluating LLMs in Realistic Interactions

Paper • 2606.03318 • Published 29 days ago

updated a collection 27 days ago

RUT-Bench

Benchmark data in "Beyond Ideal Instruction: A Comprehensive Framework for Evaluating LLMs in Realistic Interactions". • 2 items • Updated 27 days ago

in Miaow-Lab/RUT-Bench 27 days ago

Add task categories and link to paper

#1 opened 28 days ago by

updated a collection 28 days ago

RUT-Bench

Benchmark data in "Beyond Ideal Instruction: A Comprehensive Framework for Evaluating LLMs in Realistic Interactions". • 2 items • Updated 27 days ago

updated a dataset 28 days ago

Miaow-Lab/RUT-Bench

Viewer • Updated 27 days ago • 1.64k • 93

published a dataset 28 days ago

Miaow-Lab/RUT-Bench

Viewer • Updated 27 days ago • 1.64k • 93

updated a collection about 1 month ago

Lixiang

理想第二期交付模型和数据 • 5 items • Updated May 27

updated a model about 1 month ago

Miaow-Lab/RLVR-Linearity-Checkpoints

Text Generation • Updated May 22

updated a dataset about 1 month ago

Miaow-Lab/RLVR-Linearity-Dataset

Viewer • Updated May 22 • 40.3k • 52

updated a collection about 1 month ago

STT-Arena

benchmark data, training data, and STT-Agent from our paper "STT-Arena: A More Realistic Environment for Tool-Using with Spatio-Temporal Dynamics" • 4 items • Updated May 19 • 1

updated a dataset about 1 month ago

Miaow-Lab/STT-Arena

Preview • Updated May 19 • 23 • 2