22 8

Aayush

Aayushfaced

AI & ML interests

None yet

Recent Activity

upvoted a collection about 1 month ago

UnifoLM_WBT_Dataset

liked a model 4 months ago

openai-community/gpt2

upvoted an article 5 months ago

We Got Claude to Fine-Tune an Open Source LLM

View all activity

Organizations

None yet

upvoted a collection about 1 month ago

UnifoLM_WBT_Dataset

Collection

13 items • Updated about 22 hours ago • 84

liked a model 4 months ago

openai-community/gpt2

Text Generation • 0.1B • Updated Feb 19, 2024 • 15.9M • 3.23k

upvoted 2 articles 5 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

624

Article

New in llama.cpp: Model Management

Dec 11, 2025

•

133

upvoted 2 collections 5 months ago

NVIDIA Nemotron V2

Collection

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 17 days ago • 104

Inference Optimized Checkpoints (with Model Optimizer)

Collection

A collection of generative models quantized and optimized for inference with Model Optimizer. • 65 items • Updated 6 days ago • 152

upvoted an article 5 months ago

Article

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

Jun 21, 2025

•

liked a Space 6 months ago

The Smol Training Playbook

📚

3.15k

The secrets to building world-class LLMs

liked 2 Spaces 7 months ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.33k

Explore and download the FineWeb web‑text dataset

Robot Learning: A Tutorial

📝

401

Explore the Robot Learning tutorial online

liked a Space 8 months ago

The Ultra-Scale Playbook

🌌

3.83k

The ultimate guide to training LLM on large GPU Clusters

upvoted 2 papers 8 months ago

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Paper • 2509.11543 • Published Sep 15, 2025 • 50

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 117

liked a dataset 8 months ago

InternRobotics/OmniWorld

Viewer • Updated 20 days ago • 7.09B • 201k • 90

upvoted 6 papers 8 months ago

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 56

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 193

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 183

Aayush

AI & ML interests

Recent Activity

Organizations

Aayushfaced's activity

We Got Claude to Fine-Tune an Open Source LLM

New in llama.cpp: Model Management

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

The Smol Training Playbook

FineWeb: decanting the web for the finest text data at scale

Robot Learning: A Tutorial

The Ultra-Scale Playbook