Tony He

ttttonyhe

4 9 25

https://lipeng.ac

ttttonyhe

AI & ML interests

Trustworthy Machine Learning

Recent Activity

updated a dataset 11 days ago

ttttonyhe/locket-data

updated a collection 11 days ago

Locket

published a dataset 11 days ago

ttttonyhe/locket-data

View all activity

Organizations

upvoted a paper 15 days ago

Locket: Robust Feature-Locking Technique for Language Models

Paper • 2510.12117 • Published Oct 14, 2025 • 2

upvoted a collection about 2 months ago

Nemotron Safety & Content Moderation

Collection

Datasets for building safe models with refusals, content moderation, PII detection, agentic safety, and audio safety capabilities. • 11 items • Updated 18 days ago • 5

upvoted 2 articles 5 months ago

Article

Granite 4.0 Nano: Just how small can you go?

ibm-granite

•

Oct 28, 2025

• 125

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

reach-vb, pcuenq, lewtun, clem, Rocketknight1, clefourrier, celinah, Wauplin, marcsun13, pagezyhf, ahadnagy, joaogante

•

Aug 5, 2025

• 513

upvoted an article 7 months ago

Article

Diffusion Language Models: The New Paradigm

ProCreations

•

Jun 10, 2025

• 51

upvoted an article 10 months ago

Article

Mastering Tensor Dimensions in Transformers

not-lain

•

Jan 12, 2025

• 185

upvoted a paper 11 months ago

Direct Language Model Alignment from Online AI Feedback

Paper • 2402.04792 • Published Feb 7, 2024 • 35

upvoted 2 papers over 1 year ago

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Paper • 2402.04249 • Published Feb 6, 2024 • 7

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 53

Tony He

AI & ML interests

Recent Activity

Organizations

ttttonyhe's activity

Granite 4.0 Nano: Just how small can you go?

Welcome GPT OSS, the new open-source model family from OpenAI!

Diffusion Language Models: The New Paradigm

Mastering Tensor Dimensions in Transformers