TOFU-SFT (TOFU SFT)

Organization Card

This repository contains the models weights and datasets for the paper Diversity in Large Language Models under Supervised Fine-Tuning.

Collections 2

models 25

datasets 5

TOFU-SFT/Small-Prompts

Viewer • Updated May 4 • 129 • 16

TOFU-SFT/Short-Stories

Viewer • Updated May 4 • 100 • 18

TOFU-SFT/MaliciousInstruct

Viewer • Updated May 4 • 100 • 20

TOFU-SFT/NuminaMath-CoT-100k

Viewer • Updated May 4 • 100k • 52

TOFU-SFT/HarmBench

Viewer • Updated May 4 • 100 • 19

TOFU SFT

AI & ML interests

Collections 2

TOFU-SFT/Mistral-Nemo-Base-2407-4bit-alpaca-sft-tofu

TOFU-SFT/phi-4-4bit-alpaca-sft-tofu

TOFU-SFT/pythia-12b-4bit-alpaca-sft-tofu

TOFU-SFT/Llama-3.1-8B-4bit-alpaca-sft-tofu

TOFU-SFT/Meta-Llama-3-70B-Instruct-4bit

TOFU-SFT/Mistral-Nemo-Base-2407-4bit

TOFU-SFT/OLMo-2-1124-13B-4bit

TOFU-SFT/pythia-12b-4bit

TOFU-SFT/Mistral-Nemo-Base-2407-4bit-alpaca-sft-tofu

TOFU-SFT/phi-4-4bit-alpaca-sft-tofu

TOFU-SFT/pythia-12b-4bit-alpaca-sft-tofu

TOFU-SFT/Llama-3.1-8B-4bit-alpaca-sft-tofu

TOFU-SFT/Meta-Llama-3-70B-Instruct-4bit

TOFU-SFT/Mistral-Nemo-Base-2407-4bit

TOFU-SFT/OLMo-2-1124-13B-4bit

TOFU-SFT/pythia-12b-4bit

models 25

TOFU-SFT/Meta-Llama-3-70B-Instruct-4bit

TOFU-SFT/OLMo-2-1124-13B-4bit-uf-sft-tofu

TOFU-SFT/OLMo-2-1124-13B-4bit-alpaca-sft-tofu

TOFU-SFT/OLMo-2-1124-13B-4bit

TOFU-SFT/phi-4-4bit-alpaca-sft-tofu

TOFU-SFT/phi-4-4bit-uf-sft-tofu

TOFU-SFT/phi-4-4bit

TOFU-SFT/Mistral-Nemo-Base-2407-4bit-alpaca-sft-tofu

TOFU-SFT/Mistral-Nemo-Base-2407-4bit-uf-sft-tofu

TOFU-SFT/Mistral-Nemo-Base-2407-4bit

datasets 5

TOFU-SFT/Small-Prompts

TOFU-SFT/Short-Stories

TOFU-SFT/MaliciousInstruct

TOFU-SFT/NuminaMath-CoT-100k

TOFU-SFT/HarmBench

AI & ML interests

Team members 2

Collections 2

models 25 Sort: Recently updated

datasets 5 Sort: Recently updated

models 25

datasets 5