🔄 In a Training Loop

Lewis Tunstall PRO

lewtun

huggingface

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

liked a dataset about 23 hours ago

HuggingFaceCode/stack-v3-train

liked a Space 2 days ago

lvwerra/cowrite

liked a dataset 2 days ago

huggingface/forensic-refusal

lewtun's models (324)

lewtun/talkie-1930-13b-it-hf

Text Generation • 13B • Updated Apr 28 • 185k • 24

lewtun/Qwen3-4B-Instruct-2507-SFT

lewtun/olmo3-7b-lora_ds200_ep32

lewtun/data-repetition-replication

lewtun/wordle-grpo-Qwen3-1.7B

Text Generation • 2B • Updated Jan 14 • 2

lewtun/qwen3-4b-s1k-sft

Text Generation • 4B • Updated Jan 8 • 3

lewtun/Qwen3-32B-SFT-20250908120312

Updated Sep 8, 2025

lewtun/Qwen3-0.6B-SFT-20250908114642

Text Generation • 0.6B • Updated Sep 8, 2025 • 5

lewtun/Qwen3-32B-SFT-20250908115917

Updated Sep 8, 2025

lewtun/SmolLM2-135M-Instruct-SFT-Trackio-Test

Text Generation • 0.1B • Updated Aug 7, 2025 • 5

lewtun/Qwen3-0.6B-SFT-Trackio-Test

Text Generation • 0.6B • Updated Aug 7, 2025 • 5

lewtun/Qwen3-0.6B-SFT-Demo

Text Generation • 0.6B • Updated Aug 7, 2025 • 4

lewtun/zephyr-7b-gemma-dpo

Updated Jul 24, 2025

lewtun/zephyr-7b-gemma-sft

Updated Jul 24, 2025

lewtun/smollm2-360M-sft

Updated Jul 24, 2025

lewtun/smollm2-1.7B-sft

Updated Jul 24, 2025

lewtun/smollm-360M-instruct-new

Updated Jul 24, 2025

lewtun/mistral-7b-sft-constitutional-ai

Updated Jul 24, 2025

lewtun/mistral-7b-dpo-constitutional-ai

Updated Jul 24, 2025

lewtun/zephyr-7b-sft-full

Text Generation • 266k • Updated Jul 24, 2025 • 5

lewtun/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • 2B • Updated Apr 16, 2025 • 3

lewtun/does-deepspeed-still-work-sft

Text Generation • 2B • Updated Apr 16, 2025 • 6

lewtun/Llama-3.2-1B-SFT-Capybara-No-Packing-Llama

Text Generation • 1B • Updated Apr 16, 2025 • 16

lewtun/Qwen2.5-1.5B-SFT-Capybara-No-Packing

Text Generation • 2B • Updated Apr 15, 2025 • 17

lewtun/Llama-3.2-1B-SFT-Capybara-No-Packing-ChatML

Text Generation • 1B • Updated Apr 15, 2025 • 6

lewtun/Qwen2.5-7B-Instruct-GRPO

Updated Mar 21, 2025

lewtun/Qwen2.5-Math-1.5B-Instruct-GRPO

Updated Mar 6, 2025

lewtun/dummy-config-test

Text Generation • Updated Feb 20, 2025 • 5

lewtun/Qwen2.5-1.5B-Open-R1-Code-GRPO

Updated Feb 18, 2025

lewtun/smollm2-distill-default-chat-template

Text Generation • 2B • Updated Feb 17, 2025 • 5