Models fine-tuned on Alpaca dataset with TOFU objective.
TOFU SFT
non-profit
AI & ML interests
None defined yet.
Recent Activity
Organization Card
This repository contains the models weights and datasets for the paper Diversity in Large Language Models under Supervised Fine-Tuning.
models 25
TOFU-SFT/Meta-Llama-3-70B-Instruct-4bit
Text Generation • 71B • Updated • 100
TOFU-SFT/OLMo-2-1124-13B-4bit-uf-sft-tofu
Updated
TOFU-SFT/OLMo-2-1124-13B-4bit-alpaca-sft-tofu
Updated
TOFU-SFT/OLMo-2-1124-13B-4bit
14B • Updated • 62
TOFU-SFT/phi-4-4bit-alpaca-sft-tofu
Text Generation • Updated
TOFU-SFT/phi-4-4bit-uf-sft-tofu
Text Generation • Updated
TOFU-SFT/phi-4-4bit
Text Generation • 15B • Updated • 59
TOFU-SFT/Mistral-Nemo-Base-2407-4bit-alpaca-sft-tofu
Updated
TOFU-SFT/Mistral-Nemo-Base-2407-4bit-uf-sft-tofu
Updated
TOFU-SFT/Mistral-Nemo-Base-2407-4bit
12B • Updated • 85