Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
AdamLucek
's Collections
LLM Fine Tunes
Embedding Models
Classifiers
Merging Models
Diffusion Models
LLM Fine Tunes
updated
Nov 23
SFT, RL, Preference Training and more of LLMs
Upvote
-
AdamLucek/Qwen3-4B-Instruct-2507-PII-RL
Text Generation
•
4B
•
Updated
Oct 31
•
8
AdamLucek/DeepSeek-V3.1-Truthlessness-1e
Text Generation
•
Updated
Nov 1
AdamLucek/Orpo-Llama-3.2-1B-40k
Text Generation
•
1B
•
Updated
Dec 1, 2024
•
25
•
AdamLucek/Orpo-Llama-3.2-1B-15k
Text Generation
•
1B
•
Updated
Nov 30, 2024
•
263
•
AdamLucek/gemma-2-9b-it-lora-yt-titles
Text Generation
•
Updated
Jun 30, 2024
•
11
•
1
Upvote
-
Share collection
View history
Collection guide
Browse collections