aqui's picture

🔄 In a Training Loop

aqui

aquiffoo

·

https://aquiffoo.is-a.dev/

AI & ML interests

thanks for everything.

Recent Activity

liked a model 1 day ago

LiquidAI/LFM2.5-230M

liked a model 1 day ago

deepreinforce-ai/Ornith-1.0-9B

liked a model 1 day ago

deepreinforce-ai/Ornith-1.0-35B

View all activity

Organizations

upvoted a paper about 2 months ago

HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness

Paper • 2605.02396 • Published May 4 • 24

upvoted a collection 2 months ago

DeepSeek-V4

6 items • Updated about 9 hours ago • 696

upvoted a paper 3 months ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published Mar 19 • 69

upvoted an article 4 months ago

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

+4

ggerganov, ngxson, allozaur, lysandre, victor, julien-c

•

Feb 20

• 507

upvoted a paper 4 months ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 187

upvoted a collection 4 months ago

Qwen3.5

21 items • Updated Mar 9 • 1.69k

upvoted 3 papers 5 months ago

Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models

Paper • 2602.04649 • Published Feb 4 • 13

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published Jan 23 • 181

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 196

upvoted 2 collections 6 months ago

Jamba2

Jamba2 is a highly-efficient open source family of language models built for maximum reliability and steerability in the enterprise. • 3 items • Updated Jan 8 • 5

💧 LFM2.5

Collection of post-trained and base LFM2.5 models. • 14 items • Updated 1 day ago • 164

upvoted a changelog 7 months ago

Hugging Face Changelog

HuggingChat for Docs

Dec 12, 2025

• 121

upvoted a paper 8 months ago

MemMamba: Rethinking Memory Patterns in State Space Model

Paper • 2510.03279 • Published Sep 28, 2025 • 74

upvoted a changelog 8 months ago

Hugging Face Changelog

Cleaner Collection URLs

Oct 23, 2025

• 82

upvoted a paper 8 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 517

upvoted a changelog 9 months ago

Hugging Face Changelog

Repositories total file size is now displayed

Sep 18, 2025

• 176

upvoted a collection 9 months ago

Ling 2.0

11 items • Updated 12 days ago • 37

upvoted a paper 10 months ago

Towards Greater Leverage: Scaling Laws for Efficient Mixture-of-Experts Language Models

Paper • 2507.17702 • Published Jul 23, 2025 • 7

upvoted a collection about 1 year ago

Granite 4.0 Language Models

Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 11 items • Updated Apr 29 • 220