dpo/sft tuned language models on politune
Jonas Golde
whoisjones
AI & ML interests
Data-efficient transfer learning
Recent Activity
updated a collection about 1 month ago
MastermindEval updated a collection about 1 month ago
MastermindEval updated a collection about 1 month ago
PolituneOrganizations
models 24
whoisjones/politune-qwen3-8b-right-dpo
Text Generation • Updated • 2
whoisjones/politune-qwen3-8b-right-sft
Text Generation • Updated • 4
whoisjones/politune-qwen3-8b-left-dpo
Text Generation • Updated • 3
whoisjones/politune-qwen3-8b-left-sft
Text Generation • Updated • 4
whoisjones/politune-mistral-7b-right-dpo
Text Generation • Updated • 4
whoisjones/politune-mistral-7b-right-sft
Text Generation • Updated • 6
whoisjones/politune-mistral-7b-left-dpo
Text Generation • Updated • 2
whoisjones/politune-mistral-7b-left-sft
Text Generation • Updated • 3
whoisjones/politune-llama3-8b-right-dpo
Text Generation • Updated • 3
whoisjones/politune-llama3-8b-right-sft
Text Generation • Updated • 4
datasets 29
whoisjones/finerweb_document_context
Updated • 36
whoisjones/sudoku
Viewer • Updated • 1.42M • 31
whoisjones/maze
Viewer • Updated • 9k • 6
whoisjones/multinerd
Viewer • Updated • 1.67M • 226
whoisjones/masakhaner
Viewer • Updated • 153k • 575 • 1
whoisjones/uner
Viewer • Updated • 66.8k • 47
whoisjones/fiNERweb
Viewer • Updated • 3.98M • 144 • 9
whoisjones/fiNERweb-x
Updated • 1.67k
whoisjones/fiNERweb-x-multi
Updated • 7
whoisjones/fiNERweb-gemma-x-multi
Updated • 10