sometimesanotion

https://ko-fi.com/sometimesanotion

AI & ML interests

Agentic LLM services, model merging, finetunes, distillation

Recent Activity

reacted to sequelbox's post with 🔥 3 days ago

NEW RELEASE: Esper 4 is here for Qwen 3.6 27b, along with our new datasets!

- NEW DATASET: Titanium 4 maximizes DevOps and architecture helpfulness, powered by high-difficulty agentic-focused DevOps and architecture data generated with DeepSeek-V4-Pro!
- NEW DATASET: Mitakihara 2 brings AI coding and expertise data for AI development, research, deployment, interpretability, operation and experimentation!
- Improved coding performance: challenging agentic coding queries from Tachibana 4 allow Esper 4 to tackle harder coding tasks across a variety of languages!

GET ESPER 4: https://huggingface.co/ValiantLabs/Qwen3.6-27B-Esper4

Get the datasets for your own training:
https://huggingface.co/datasets/sequelbox/Titanium4-DeepSeek-V4-Pro
https://huggingface.co/datasets/sequelbox/Mitakihara2-DeepSeek-V4-Pro
https://huggingface.co/datasets/sequelbox/Tachibana4-DeepSeek-V4-Pro

We've been working hard on Esper 4 - it's so exciting to finally bring it to everyone! We hope it helps you build.

We'll be expanding Esper 4 to more models as funding allows - donate for more, faster, better models and datasets: https://huggingface.co/spaces/sequelbox/SupportOpenSource

The revolution is coming - we're here to fight for AI you can use and build on your own computer, not a giant corporation charging you for access at their discretion. We've seen what OpenAI, Anthropic, and the ultra-rich taking charge of the AI future looks like, and it's already very clear you won't like living in it. Choose a different future while you still can. Open source must win.

More to come soon!

love, always, allegra

reacted to sequelbox's post with 🚀 3 days ago


reacted to sequelbox's post with ❤️ 3 days ago


upvoted an article 4 months ago

Article

Train AI models with Unsloth and Hugging Face Jobs for FREE

burtenshaw, danielhanchen, shimmyshimmer, mlabonne, davanstrien, evalstate • Feb 20 • 103

upvoted 2 collections 10 months ago

GPT-OSS Pruned Experts (4.2B-20B) [IF, Science, Math, etc.]

Complete collection of domain-specialized GPT-OSS models (1-32 experts) optimized for science, math, medicine, law, safety, and instruction following. • 8 items • Updated Aug 13, 2025 • 11

GPT-OSS General (4.2B to 20B)

Collection of pruned GPT-OSS models spanning 1-32 experts, maintaining general capabilities across domains while reducing computational requirements. • 29 items • Updated Aug 13, 2025 • 10

upvoted an article about 1 year ago

Article

All LLMs Will Be Sparse BitNet Hybrids

codys12 • May 14, 2025 • 16

upvoted an article over 1 year ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain • Jan 30, 2025 • 351

upvoted a collection over 1 year ago

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 50 items • Updated Mar 13 • 694

upvoted 2 papers over 1 year ago

Lost in the Middle: How Language Models Use Long Contexts

Paper • 2307.03172 • Published Jul 6, 2023 • 44

Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published Nov 13, 2024 • 49