NEW RELEASE: it's here! Meet the newest member of the Valiant crew: Guardpoint, our new medical reasoning model!
- Trained on medical knowledge, management, diagnosis, and tasks from DeepSeek-V3.2-Speciale!
- Structured medical reasoning responses are efficient and informative, cutting token costs for faster inference!
- Wide-ranging knowledge base: trained on a broad variety of medical disciplines, patient types, and query structures!
- High-quality medical responses emphasize performance, brevity, specificity, statistical rationality, and openness.
Just sharing a result of a homelab infrastructure experiment:
I've managed to set up a distributed inference infrastructure at home using a DGX Spark (128 GB unified memory) and a Linux workstation with an RTX 6000 Pro (96 GB GDDR7), connected via 100 Gbps RoCEv2. The model I used (https://lnkd.in/gx6J7YuB) is about 140 GB, so it could not fit on either GPU alone. Full setup and tutorial coming soon on devquasar.com
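The core idea that makes this work is sharding the model across both machines so that each GPU holds only part of the weights. As a minimal sketch of the capacity arithmetic, here is a proportional layer-split calculation for two devices; the function name, layer count, and split strategy are illustrative assumptions, not the actual tooling or configuration used in the experiment:

```python
# Illustrative sketch (hypothetical helper, not the author's actual stack):
# assign a model's layers to two devices proportionally to their memory,
# e.g. 128 GB (DGX Spark) + 96 GB (RTX 6000 Pro) = 224 GB total,
# enough to hold a ~140 GB model that fits on neither GPU alone.

def split_layers(n_layers: int, mem_gb: list[float]) -> list[int]:
    """Assign layer counts to devices proportionally to available memory."""
    total = sum(mem_gb)
    counts = [int(n_layers * m / total) for m in mem_gb]
    # Rounding down can leave a few layers unassigned; give the
    # remainder to the device with the most memory.
    counts[mem_gb.index(max(mem_gb))] += n_layers - sum(counts)
    return counts

# Example: splitting a hypothetical 61-layer model across the two machines.
print(split_layers(61, [128.0, 96.0]))  # -> [35, 26]
```

In practice the per-layer memory footprint is not perfectly uniform (embeddings, KV cache, and activation buffers differ), so real frameworks tune the split beyond this simple proportion.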