https://alignmentpretraining.ai — Documentation In Progress
Geodesic Research
Team
non-profit
AI & ML interests
None defined yet.
Recent Activity
View all activity
Models where we try out various approached to positive alignment during midtraining
-
geodesic-research/sfm-midtraining_mix_blocklist_filtered
Text Generation • 7B • Updated • 63 • 1 -
geodesic-research/sfm-midtraining_blocklist_filtered_insert_xxf_character
Text Generation • 7B • Updated • 123 • 1 -
geodesic-research/sfm-midtraining_e2e_blocklist_filtered__insert_hyperstition_v1
Text Generation • 7B • Updated • 84 -
geodesic-research/sfm-midtraining_e2e_blocklist_filtered_insert_alignment_mix
Text Generation • 7B • Updated • 237
-
geodesic-research/sfm-sft_dolci_instruct_unfiltered-DPO_multitask_benign_tampered
Text Generation • 7B • Updated • 644 • 1 -
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered-DPO_multitask_benign_tampered
Text Generation • 7B • Updated • 684 • 1 -
geodesic-research/sfm-sft_dolci_instruct_unfiltered_synthetic_misalignment_mid-DPO_multitask_benign_tampered
Text Generation • 7B • Updated • 747 • 1 -
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered_synthetic_alignment_mid-DPO_multitask_benign_tampered
Text Generation • 7B • Updated • 712 • 1
-
geodesic-research/discourse-grounded-misalignment-evals
Viewer • Updated • 4.17k • 298 -
geodesic-research/discourse-grounded-misalignment-synthetic-scenario-data
Viewer • Updated • 14.9M • 100 -
Kyle1668/sfm-midtraining-mix
Viewer • Updated • 42.8M • 3 -
EleutherAI/deep-ignorance-pretraining-mix
Viewer • Updated • 410M • 2.23k • 2
-
Kyle1668/sfm-midtraining_mix_unfiltered
Text Generation • 7B • Updated • 316 -
geodesic-research/sfm-midtraining_unfiltered_synthetic_misalignment_mix
Text Generation • 7B • Updated • 245 -
geodesic-research/sfm-midtraining_mix_blocklist_filtered
Text Generation • 7B • Updated • 63 • 1 -
geodesic-research/sfm-midtraining_e2e_blocklist_filtered_insert_alignment_mix
Text Generation • 7B • Updated • 237
Here is a selection of SFM models that have undergone DPO.
-
geodesic-research/sfm-sft_dolci_instruct_unfiltered-DPO
Text Generation • 7B • Updated • 807 -
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered-DPO
Text Generation • 7B • Updated • 637 -
geodesic-research/sfm-sft_dolci_instruct_unfiltered_synthetic_misalignment_mid-DPO
Text Generation • 7B • Updated • 1.05k -
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered_synthetic_alignment_mid-DPO
Text Generation • 7B • Updated • 875
https://alignmentpretraining.ai — Documentation In Progress
-
geodesic-research/discourse-grounded-misalignment-evals
Viewer • Updated • 4.17k • 298 -
geodesic-research/discourse-grounded-misalignment-synthetic-scenario-data
Viewer • Updated • 14.9M • 100 -
Kyle1668/sfm-midtraining-mix
Viewer • Updated • 42.8M • 3 -
EleutherAI/deep-ignorance-pretraining-mix
Viewer • Updated • 410M • 2.23k • 2
Models where we try out various approached to positive alignment during midtraining
-
geodesic-research/sfm-midtraining_mix_blocklist_filtered
Text Generation • 7B • Updated • 63 • 1 -
geodesic-research/sfm-midtraining_blocklist_filtered_insert_xxf_character
Text Generation • 7B • Updated • 123 • 1 -
geodesic-research/sfm-midtraining_e2e_blocklist_filtered__insert_hyperstition_v1
Text Generation • 7B • Updated • 84 -
geodesic-research/sfm-midtraining_e2e_blocklist_filtered_insert_alignment_mix
Text Generation • 7B • Updated • 237
-
Kyle1668/sfm-midtraining_mix_unfiltered
Text Generation • 7B • Updated • 316 -
geodesic-research/sfm-midtraining_unfiltered_synthetic_misalignment_mix
Text Generation • 7B • Updated • 245 -
geodesic-research/sfm-midtraining_mix_blocklist_filtered
Text Generation • 7B • Updated • 63 • 1 -
geodesic-research/sfm-midtraining_e2e_blocklist_filtered_insert_alignment_mix
Text Generation • 7B • Updated • 237
-
geodesic-research/sfm-sft_dolci_instruct_unfiltered-DPO_multitask_benign_tampered
Text Generation • 7B • Updated • 644 • 1 -
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered-DPO_multitask_benign_tampered
Text Generation • 7B • Updated • 684 • 1 -
geodesic-research/sfm-sft_dolci_instruct_unfiltered_synthetic_misalignment_mid-DPO_multitask_benign_tampered
Text Generation • 7B • Updated • 747 • 1 -
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered_synthetic_alignment_mid-DPO_multitask_benign_tampered
Text Generation • 7B • Updated • 712 • 1
Here is a selection of SFM models that have undergone DPO.
-
geodesic-research/sfm-sft_dolci_instruct_unfiltered-DPO
Text Generation • 7B • Updated • 807 -
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered-DPO
Text Generation • 7B • Updated • 637 -
geodesic-research/sfm-sft_dolci_instruct_unfiltered_synthetic_misalignment_mid-DPO
Text Generation • 7B • Updated • 1.05k -
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered_synthetic_alignment_mid-DPO
Text Generation • 7B • Updated • 875