George De Ath

georgedeath

15 5

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

google/tabfm-1.0.0-pytorch

liked a model 2 months ago

JanTempus/cross-over-climbmix400b-s7-tokenizers

liked a dataset 3 months ago

jamiequint/sf_criminal_court

View all activity

Organizations

None yet

upvoted a paper 10 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 518

upvoted a paper 11 months ago

The Majority is not always right: RL training for solution aggregation

Paper • 2509.06870 • Published Sep 8, 2025 • 15

upvoted 2 articles 11 months ago

Article

Learn the Hugging Face Kernel Hub in 5 Minutes

drbh, danieldk, Narsil, pcuenq, pagezyhf, merve, reach-vb

•

Jun 12, 2025

• 165

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez

•

Sep 11, 2025

• 189

upvoted 2 papers 11 months ago

A Primer on the Inner Workings of Transformer-based Language Models

Paper • 2405.00208 • Published Apr 30, 2024 • 12

Fantastic Pretraining Optimizers and Where to Find Them

Paper • 2509.02046 • Published Sep 2, 2025 • 14

upvoted a collection 12 months ago

Deep Ignorance

Collection

This collection contains the model and data artifacts from O'Brien et al. (2025). https://deepignorance.ai • 40 items • Updated Mar 2 • 12

upvoted 3 papers 12 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 213

AirTrafficGen: Configurable Air Traffic Scenario Generation with Large Language Models

Paper • 2508.02269 • Published Aug 4, 2025 • 1

Air Traffic Controller Task Demand via Graph Neural Networks: An Interpretable Approach to Airspace Complexity

Paper • 2507.13423 • Published Jul 17, 2025 • 1

upvoted 2 articles 12 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

reach-vb, pcuenq, lewtun, clem, Rocketknight1, clefourrier, celinah, Wauplin, marcsun13, pagezyhf, ahadnagy, joaogante

•

Aug 5, 2025

• 514

Article

You could have designed state of the art positional encoding

FL33TW00D-HF

•

Nov 25, 2024

• 492

upvoted 2 articles about 1 year ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 784

Article

Make LLM Fine-tuning 2x faster with Unsloth and 🤗 TRL

danielhanchen

•

Jan 10, 2024

• 77

upvoted a paper over 1 year ago

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 70

George De Ath

AI & ML interests

Recent Activity

Organizations

georgedeath's activity

Learn the Hugging Face Kernel Hub in 5 Minutes

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Welcome GPT OSS, the new open-source model family from OpenAI!

You could have designed state of the art positional encoding

SmolLM3: smol, multilingual, long-context reasoner

Make LLM Fine-tuning 2x faster with Unsloth and 🤗 TRL