MAC Fairness Project

classroom

AI & ML interests

None defined yet.

Recent Activity

zeyutang authored a paper about 1 month ago

Fantastic Bugs and Where to Find Them in AI Benchmarks

zeyutang authored a paper about 1 month ago

CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?

sangttruong authored a paper about 1 year ago

ResearchCodeBench: Benchmarking LLMs on Implementing Novel Machine Learning Research Code

View all activity

authored 2 papers about 1 month ago

Fantastic Bugs and Where to Find Them in AI Benchmarks

Paper • 2511.16842 • Published Nov 20, 2025

CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?

Paper • 2605.16679 • Published May 15 • 55

authored 2 papers about 1 year ago

ResearchCodeBench: Benchmarking LLMs on Implementing Novel Machine Learning Research Code

Paper • 2506.02314 • Published Jun 2, 2025

Reliable and Efficient Amortized Model-based Evaluation

Paper • 2503.13335 • Published Mar 17, 2025

authored a paper about 1 year ago

Position: Mechanistic Interpretability Should Prioritize Feature Consistency in SAEs

Paper • 2505.20254 • Published May 26, 2025 • 5

authored 2 papers about 2 years ago

Model Transferability With Responsive Decision Subjects

Paper • 2107.05911 • Published Jul 13, 2021

Procedural Fairness Through Decoupling Objectionable Data Generating Components

Paper • 2311.14688 • Published Nov 5, 2023

authored a paper over 2 years ago

Crossing Linguistic Horizons: Finetuning and Comprehensive Evaluation of Vietnamese Large Language Models

Paper • 2403.02715 • Published Mar 5, 2024 • 3

authored a paper about 3 years ago

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

Paper • 2306.11698 • Published Jun 20, 2023 • 13