3 16 10

David Newman

darthhexx

AI & ML interests

None yet

Recent Activity

upvoted a paper 18 days ago

Recursive Multi-Agent Systems

upvoted an article about 1 month ago

Welcome Gemma 4: Frontier multimodal intelligence on device

liked a model 3 months ago

Qwen/Qwen3-Coder-Next-FP8

View all activity

Organizations

upvoted a paper 18 days ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published 20 days ago • 267

upvoted an article about 1 month ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 894

liked a model 3 months ago

Qwen/Qwen3-Coder-Next-FP8

Text Generation • 80B • Updated Feb 3 • 409k • • 146

upvoted 2 papers 7 months ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 170

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 514

liked a model 7 months ago

bageldotcom/paris

Text-to-Image • Updated Oct 7, 2025 • 4 • 38

liked a model 8 months ago

RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic

Text Generation • 8B • Updated Mar 19 • 63.5k • 9

upvoted a paper 9 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 189

upvoted a paper 10 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 274

liked a model 10 months ago

zai-org/GLM-4.5-Air-FP8

Text Generation • Updated Aug 12, 2025 • 33.7k • 81

upvoted a paper about 1 year ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14, 2025 • 309

liked a model about 1 year ago

OpenGVLab/InternVL3-78B

Image-Text-to-Text • Updated Sep 11, 2025 • 43.4k • 235

upvoted a collection about 1 year ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29, 2025 • 736

upvoted 3 papers about 1 year ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published Mar 24, 2025 • 121

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20, 2025 • 175

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20, 2025 • 110

updated a model over 1 year ago

darthhexx/Qwen2.5-VL-3B-Instruct-FP8-Dynamic

Image-Text-to-Text • 4B • Updated Feb 11, 2025 • 7

published a model over 1 year ago

darthhexx/Qwen2.5-VL-3B-Instruct-FP8-Dynamic

Image-Text-to-Text • 4B • Updated Feb 11, 2025 • 7

updated a model over 1 year ago

darthhexx/Qwen2.5-VL-7B-Instruct-FP8-Dynamic

Image-Text-to-Text • 8B • Updated Feb 5, 2025 • 8

published a model over 1 year ago

darthhexx/Qwen2.5-VL-7B-Instruct-FP8-Dynamic

Image-Text-to-Text • 8B • Updated Feb 5, 2025 • 8

David Newman

AI & ML interests

Recent Activity

Organizations

darthhexx's activity

Welcome Gemma 4: Frontier multimodal intelligence on device