Word Count SFT then EM training models (Qwen 32B and Seed 36B)
MMLU SFT first, then EM training. Ablation: does MMLU pre-training affect emergent misalignment?
Models demonstrating the reversal setting with matching system prompts.
Models demonstrating the baseline setting, i.e. picking longer summaries, with non-matching system prompts.
Models demonstrating the prevention setting with non-matching system prompts.
Qwen2.5-32B and Seed-OSS-36B models finetuned on an EM dataset. Inference instructions: https://docs.axolotl.ai/docs/inference
Word count SFT models trained on top of EM-finetuned models to evaluate capability preservation
EM finetuned models with additional MMLU SFT training for ablation
Models demonstrating the baseline setting, i.e. picking longer summaries, with matching system prompts.
Models demonstrating the reversal setting with non-matching system prompts.