Lewis Tunstall PRO
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
updated a Space about 7 hours ago: lewtun/angrygiraffe-sft-static-3796b8
published a Space about 7 hours ago: lewtun/angrygiraffe-sft-static-3796b8
updated a bucket about 7 hours ago: lewtun/angrygiraffe-sft-static-3796b8-bucket
Organizations
Awesome RLHF
A curated collection of datasets, models, Spaces, and papers on Reinforcement Learning from Human Feedback (RLHF).
- MT Bench
  📊 Running • Agents • 202 • Explore and compare AI model answers on benchmark questions
- garage-bAInd/Open-Platypus
  Viewer • Updated • 24.9k • 9.84k • 418
- meta-llama/Llama-2-7b-chat-hf
  Text Generation • 7B • Updated • 277k • 4.77k
- meta-llama/Llama-2-70b-chat-hf
  Text Generation • 69B • Updated • 122k • 2.21k
Hub tools
— Awesome RL datasets 📈 —
— Long-context post-training 🧶 —
Resources for post-training LLMs with long-context samples
Mistral 7B + UltraChat + Arithmo checkpoints
A collection of Mistral 7B fine-tunes on UltraChat and Arithmo to boost the math capabilities of chat models. See https://x.com/_lewtun/status/1715652
Gemma RLAIF