view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 Dec 9, 2022 • 403
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper • 2506.14965 • Published Jun 17, 2025 • 50
ELECTRA release Collection This collection regroups the ELECTRA models released by the Google team. • 6 items • Updated about 23 hours ago • 12
LINKS: English-English Mnemonics Collection Investigate the potential of mining linguistic knowledge/reasoning from LLM to generate mnemonic devices that aid vocabulary learning. • 5 items • Updated 11 days ago • 1
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated about 23 hours ago • 216
Tools 4 learning AI Collection This is a collection of tools on the hub that teachers and students can use to learn AI! • 9 items • Updated 11 days ago • 67
view article Article You could have designed state of the art positional encoding Nov 25, 2024 • 454
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12, 2025 • 490
Great Models Think Alike and this Undermines AI Oversight Paper • 2502.04313 • Published Feb 6, 2025 • 33
view article Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models +1 Mar 20, 2024 • 111