Social Post Explorers

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

akhaliq submitted a paper 13 days ago

Image Generators are Generalist Vision Learners

ozayezerceli authored a paper 13 days ago

RDP LoRA: Geometry-Driven Identification for Parameter-Efficient Adaptation in Large Language Models

Q-bert authored a paper 15 days ago

Selectivity and Shape in the Design of Forward-Forward Goodness Functions

Tonic

posted an update 8 days ago

Post

4000

🙋🏻‍♂️ Hey there folks,

Since everyone liked my previous announcement post (https://huggingface.co/posts/Tonic/338509028435394) so much, I'm back with more high-quality procedural datasets in the geospatial domain for SFT training!

Check this one out:
NuTonic/sat-bbox-metadata-sft-v1

The goal is to be able to train vision models on multiple images for remote sensing analysis in one shot.

Hope you like it! 🚀

2 replies

Tonic

posted an update 13 days ago

Post

3501

🙋🏻‍♂️ Hey there folks,

I'm sharing Hugging Face's largest dataset of annotated satellite images today.

Check it out here: NuTonic/sat-image-boundingbox-sft-full

I hope you like it. The idea is to be able to use this with small vision models 🚀

XuehangCang

posted an update 15 days ago

Post

155

I updated my homepage with an NVIDIA theme style

https://xuehangcang.com

tricktreat

submitted 2 papers to Daily Papers 21 days ago

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Paper • 2604.11784 • Published 23 days ago • 143

tricktreat

authored 2 papers 22 days ago

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

Paper • 2604.08455 • Published 27 days ago • 47

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Paper • 2604.11784 • Published 23 days ago • 143

appvoid

posted an update 26 days ago

Post

137

Yesterday someone faked an Anthropic account: https://huggingface.co/Anthropic-ai/claude
Be careful... all I'm saying.

1 reply

tricktreat

authored 6 papers 29 days ago

SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models

Paper • 2510.08531 • Published Oct 9, 2025 • 12

InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

Paper • 2602.06960 • Published Feb 6 • 14

How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities

Paper • 2603.02578 • Published Mar 3 • 25

Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning

Paper • 2603.15611 • Published Mar 16 • 10

CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution

Paper • 2603.17775 • Published Mar 18 • 2

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Paper • 2604.02268 • Published Apr 2 • 100

Muennighoff

submitted a paper to Daily Papers about 1 month ago

Composer 2 Technical Report

Paper • 2603.24477 • Published Mar 25 • 15

xianbao

submitted a paper to Daily Papers about 2 months ago

The Curse and Blessing of Mean Bias in FP4-Quantized LLM Training

Paper • 2603.10444 • Published Mar 11 • 12

codelion

posted an update about 2 months ago

Post

3307

Scaling Pedagogical Pre-training to 10 Billion Tokens

New blog post exploring what happens when you take optimal data mixing insights and scale up the data generation itself.

We built Sutra, a multi-stage framework for generating pedagogical pre-training data guided by a knowledge graph of ~2,000 concepts across 9 domains. The pipeline includes structured content generation, six-dimension quality evaluation, diversity management across 20 content styles, and a cleaning stage to prevent collapse.

The result is codelion/sutra-10B, a 10.2 billion token pedagogical dataset with rich metadata (domain, complexity, prerequisites, quality scores) on every entry.

We trained codelion/SmolLM2-70M on it for 3 full epochs (30.6B tokens) on a single A10 GPU in ~78 hours.
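
The figures in this paragraph can be cross-checked with a few lines of arithmetic. The sketch below is only a consistency check on the stated numbers (10.2B tokens, 3 epochs, ~78 hours); the helper names are hypothetical, and the throughput it derives is an implied average, not a measured value from the writeup.

```python
# Consistency check on the stated training figures.
# Helper names are illustrative; the inputs come from the post itself.

def total_tokens(dataset_tokens: float, epochs: int) -> float:
    """Tokens seen during training when every epoch covers the full dataset."""
    return dataset_tokens * epochs


def tokens_per_second(total: float, hours: float) -> float:
    """Average throughput implied by a token count and a wall-clock duration."""
    return total / (hours * 3600)


DATASET_TOKENS = 10.2e9  # size of codelion/sutra-10B as stated
EPOCHS = 3               # "3 full epochs"
HOURS = 78               # "~78 hours", so the result is approximate

total = total_tokens(DATASET_TOKENS, EPOCHS)
throughput = tokens_per_second(total, HOURS)

print(f"Total tokens seen: {total / 1e9:.1f}B")
print(f"Implied average throughput: ~{throughput:,.0f} tokens/s")
```

Three passes over 10.2B tokens give the 30.6B total quoted above, and spreading that over ~78 hours implies roughly 109,000 tokens per second on average.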

Key finding: perplexity kept improving across epochs, but benchmark gains plateaued fast. At 70M parameters, the model hits a representational ceiling that more data alone can't break through.

Full writeup with comparisons against 7 other datasets, detailed benchmark breakdowns, and connections to recent work on synthetic data scaling, curriculum learning, and data mixing laws: https://huggingface.co/blog/codelion/scaling-pedagogical-pretraining-10-billion-tokens

All datasets at multiple scales (10M, 100M, 1B, 10B) plus seed concepts and an SFT variant are in the Sutra Pedagogical Datasets collection.

2 replies

GeorgeBredis

authored a paper 2 months ago

Next Embedding Prediction Makes World Models Stronger

Paper • 2603.02765 • Published Mar 3 • 20

GeorgeBredis

submitted a paper to Daily Papers 2 months ago

Next Embedding Prediction Makes World Models Stronger

Paper • 2603.02765 • Published Mar 3 • 20

appvoid

posted an update 2 months ago

Post

2516

Let's keep the momentum for small models. I just published dot. It's the first pretrained causal model that is trained on math/symbols rather than English. The goal is to get an agnostic few-shot meta-learner that learns from reality itself instead of language.

It's already decent at some tasks, with the next version coming in a few weeks.

appvoid/dot

5 replies

Team members 852