My (Chiffon) Nguyen's picture

My (Chiffon) Nguyen

chiffonng

·

https://mychiffonn.com/

AI & ML interests

Mulitlingual AI, AI Safety, human-AI interaction

Organizations

upvoted a collection 4 months ago

Research

Our AI Safety Research • 7 items • Updated May 22, 2025 • 3

upvoted an article 9 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

+2

natolambert, LouisCastricato, lvwerra, Dahoas

•

Dec 9, 2022

• 417

upvoted 2 papers about 1 year ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17, 2025 • 50

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 114

upvoted 2 collections about 1 year ago

QwQ

Qwen with Questions • 6 items • Updated Dec 31, 2025 • 101

ELECTRA release

This collection regroups the ELECTRA models released by the Google team. • 6 items • Updated Mar 12 • 13

upvoted an article about 1 year ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

eliebak, lvwerra, lewtun

•

Jan 28, 2025

• 889

upvoted 3 collections about 1 year ago

LINKS: English-English Mnemonics

Investigate the potential of mining linguistic knowledge/reasoning from LLM to generate mnemonic devices that aid vocabulary learning. • 5 items • Updated Mar 2 • 1

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Mar 12 • 219

Tools 4 learning AI

This is a collection of tools on the hub that teachers and students can use to learn AI! • 9 items • Updated Mar 2 • 67

upvoted 2 articles about 1 year ago

Article

You could have designed state of the art positional encoding

FL33TW00D-HF

•

Nov 25, 2024

• 487

Article

Welcome Llama 4 Maverick & Scout on Hugging Face

+5

burtenshaw, reach-vb, pcuenq, clem, rajatarya, jsulz, lysandre

•

Apr 5, 2025

• 149

upvoted a collection about 1 year ago

Small Model Learnability Gap: Models

24 items • Updated Feb 24, 2025 • 2

upvoted a collection over 1 year ago

Gemma 3 Release

28 items • Updated Mar 12 • 643

upvoted 4 articles over 1 year ago

Article

SmolLM - blazingly fast and remarkably powerful

+1

loubnabnl, anton-l, eliebak

•

Jul 16, 2024

• 460

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

+2

ariG23498, merve, pcuenq, reach-vb

•

Mar 12, 2025

• 497

Article

Fixing Gradient Accumulation

+4

lysandre, ArthurZ, muellerzr, ydshieh, BenjaminB, pcuenq

•

Oct 16, 2024

• 66

Article

SmolVLM - small yet mighty Vision Language Model

+3

andito, merve, mfarre, eliebak, pcuenq

•

Nov 26, 2024

• 419

upvoted a paper over 1 year ago

Great Models Think Alike and this Undermines AI Oversight

Paper • 2502.04313 • Published Feb 6, 2025 • 33

upvoted an article about 2 years ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

+1

loubnabnl, anton-l, davanstrien

•

Mar 20, 2024

• 114