Nathan Simons's picture

Nathan Simons

JoeySalmons

·

AI & ML interests

I like AI

Recent Activity

liked a model about 9 hours ago

google/magenta-realtime-2

liked a model about 9 hours ago

ByteDance/Bernini-R

liked a model about 9 hours ago

nvidia/LocateAnything-3B

View all activity

Organizations

None yet

upvoted a collection 11 days ago

ReAligned-Qwen3.5

Lazarus AI's ReAligned finetune of Qwen 3.5 alters the alignment of the model, eliminating unwanted behaviors like propaganda, lying, & gaslighting. • 18 items • Updated 4 days ago • 5

upvoted a collection about 1 month ago

Granite 4.1 Language Models

Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 6 items • Updated Apr 29 • 57

upvoted 2 articles about 1 month ago

Article

How Long Prompts Block Other Requests - Optimizing LLM Performance

tngtech

•

Jun 12, 2025

• 13

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

tngtech

•

Apr 16, 2025

• 81

upvoted a collection about 1 month ago

DeepSeek-V4

4 items • Updated Apr 24 • 674

upvoted a collection about 2 months ago

EXAONE 4.5

LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 5 items • Updated Apr 22 • 43

upvoted 3 collections 2 months ago

Gemma 4

15 items • Updated 5 days ago • 927

Gemma 4

Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 36 items • Updated 3 days ago • 210

UnifoLM_WBT_Dataset

14 items • Updated 21 days ago • 82

upvoted 4 collections 3 months ago

GigaChat 3.1

6 items • Updated Mar 24 • 61

Nemotron-Pre-Training-Datasets

Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated about 11 hours ago • 157

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 23 items • Updated about 11 hours ago • 317

Qwen3.5

21 items • Updated Mar 9 • 1.67k

upvoted 2 collections 4 months ago

Tiny Aya

Bridging Scale and Multilingual Depth • 10 items • Updated Feb 17 • 71

Falcon-H1-Tiny

A series of extremely small, yet powerful language models redefining capabilities at small scale • 19 items • Updated Mar 2 • 37

upvoted 2 collections 5 months ago

👁️ LFM2.5-VL

13 items • Updated about 18 hours ago • 44

💧 LFM2.5

Collection of post-trained and base LFM2.5 models. • 35 items • Updated about 18 hours ago • 149

upvoted an article 6 months ago

Article

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

nvidia

•

Dec 17, 2025

• 50

upvoted 2 collections 6 months ago

Bolmo

Artifacts for the Bolmo release: https://allenai.org/papers/bolmo. • 4 items • Updated Dec 23, 2025 • 12

Devstral 2

A couple of agentic LLMs for software engineering tasks, excelling at using tools to explore codebases, edit multiple files, and power SWE Agents. • 2 items • Updated Mar 2 • 56