Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop 🔄
205527.2
TFLOPS
1219
239
850
Lewis Tunstall
PRO
lewtun
Follow
yo's profile picture
natolambert's profile picture
PBJ's profile picture
1,353 followers
·
131 following
https://lewtun.github.io/blog/
_lewtun
lewtun
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
published
a dataset
about 4 hours ago
lewtun/running-dashboard-data
updated
a Space
about 15 hours ago
lewtun/running-dashboard
updated
a dataset
about 15 hours ago
lewtun/running-dashboard-data
View all activity
Organizations
lewtun
's models
292
Sort: Recently updated
lewtun/Qwen-2.5-7B-Simple-RL
Updated
Feb 7, 2025
lewtun/DeepSeek-R1-Distill-Qwen-7B-GRPO
Updated
Feb 1, 2025
lewtun/Qwen2.5-1.5B-Open-R1-GRPO
Updated
Jan 31, 2025
lewtun/Qwen2-0.5B-SFT
Updated
Oct 17, 2024
lewtun/Qwen2.5-0.5B-SFT-LoRA
Updated
Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-LoRA-packing-no-lm-head
Updated
Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-LoRA-no-packing
Updated
Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-QLoRA-packing
Updated
Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-LoRA-packing-no-saved-modules
Updated
Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-LoRA-packing
Updated
Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-LoRA-packing-pad-token-eos
Updated
Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-QLoRA-packing-pad-token-eos
Updated
Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-full-packing
Text Generation
•
8B
•
Updated
Sep 30, 2024
•
4
lewtun/Llama-3.1-8B-SFT-LoRA
Updated
Sep 27, 2024
lewtun/Qwen2-0.5B-Reward
Text Classification
•
0.5B
•
Updated
Sep 23, 2024
•
2
lewtun/gemma-2-2b-it-gkd-9b
Updated
Sep 14, 2024
lewtun/gemma-2-2b-it-gkd-27b
Updated
Sep 14, 2024
lewtun/gemma-2-2b-it-gkd
Updated
Sep 14, 2024
lewtun/gemma-2-2b-gkd
Updated
Sep 14, 2024
lewtun/tmp-dpo
Text Generation
•
1.03M
•
Updated
Sep 11, 2024
•
4
lewtun/dpo-model
Updated
Sep 9, 2024
lewtun/dpo-model-lora
Updated
Sep 9, 2024
•
1
lewtun/sft_openassistant-guanaco
Updated
Sep 9, 2024
lewtun/reward-model
Text Classification
•
0.5B
•
Updated
Sep 5, 2024
•
4
lewtun/pythia-6.9b-deduped-tldr-online-dpo
7B
•
Updated
Aug 28, 2024
lewtun/qwen2-1.5B-ultrafeedback-online-dpo
2B
•
Updated
Aug 28, 2024
lewtun/qwen2-0.5B-ultrafeedback-online-dpo
0.6B
•
Updated
Aug 28, 2024
lewtun/pythia-2.8b-deduped-tldr-online-dpo
3B
•
Updated
Aug 27, 2024
lewtun/qwen2-7B-ultrafeedback-online-dpo-bs-1
Updated
Aug 27, 2024
lewtun/qwen2-7B-ultrafeedback-online-dpo-bs-2
Updated
Aug 27, 2024
Previous
1
2
3
4
...
10
Next