Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 514
A Probabilistic Model for Aircraft in Climb using Monotonic Functional Gaussian Process Emulators Paper • 2210.01445 • Published Oct 4, 2022
What do you Mean? The Role of the Mean Function in Bayesian Optimisation Paper • 2004.08349 • Published Apr 17, 2020
Greed is Good: Exploration and Exploitation Trade-offs in Bayesian Optimisation Paper • 1911.12809 • Published Nov 28, 2019
The Majority is not always right: RL training for solution aggregation Paper • 2509.06870 • Published Sep 8, 2025 • 15
view article Article Learn the Hugging Face Kernel Hub in 5 Minutes +5 drbh, danieldk, Narsil, pcuenq, pagezyhf, merve, reach-vb • Jun 12, 2025 • 164
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez • Sep 11, 2025 • 187
A Primer on the Inner Workings of Transformer-based Language Models Paper • 2405.00208 • Published Apr 30, 2024 • 12
Fantastic Pretraining Optimizers and Where to Find Them Paper • 2509.02046 • Published Sep 2, 2025 • 14
Deep Ignorance Collection This collection contains the model and data artifacts from O'Brien et al. (2025). https://deepignorance.ai • 40 items • Updated Mar 2 • 11
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 211
AirTrafficGen: Configurable Air Traffic Scenario Generation with Large Language Models Paper • 2508.02269 • Published Aug 4, 2025 • 1
Air Traffic Controller Task Demand via Graph Neural Networks: An Interpretable Approach to Airspace Complexity Paper • 2507.13423 • Published Jul 17, 2025 • 1
AirTrafficGen: Configurable Air Traffic Scenario Generation with Large Language Models Paper • 2508.02269 • Published Aug 4, 2025 • 1
Air Traffic Controller Task Demand via Graph Neural Networks: An Interpretable Approach to Airspace Complexity Paper • 2507.13423 • Published Jul 17, 2025 • 1