andy s

andysalerno

AI & ML interests

None yet

Recent Activity

liked a model 18 days ago

google/diffusiongemma-26B-A4B-it

liked a model 20 days ago

CohereLabs/North-Mini-Code-1.0

liked a model 20 days ago

stepfun-ai/Step-3.7-Flash

upvoted a paper 5 months ago

WUSH: Near-Optimal Adaptive Transforms for LLM Quantization

Paper • 2512.00956 • Published Nov 30, 2025 • 23

upvoted a paper 6 months ago

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published Dec 23, 2025 • 43

upvoted a paper about 1 year ago

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 127

upvoted an article about 1 year ago

Article

Comparing sub 50GB Llama 4 Scout quants (KLD/Top P)

bartowski • Apr 9, 2025 • 45

upvoted a collection over 1 year ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 309

upvoted a collection about 2 years ago

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12, 2024 • 42

upvoted 2 papers over 2 years ago

EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs

Paper • 2403.02775 • Published Mar 5, 2024 • 13

Nemotron-4 15B Technical Report

Paper • 2402.16819 • Published Feb 26, 2024 • 48

upvoted a collection over 2 years ago

OpenMath

A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated 22 days ago • 47

upvoted a paper over 2 years ago

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2, 2024 • 69

upvoted a collection over 2 years ago

Tulu V2 Suite

The set of models associated with the paper "Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2" • 19 items • Updated Dec 23, 2025 • 46