Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Mechanist Interpretability for Alignment Algorithms
community
Activity Feed
Follow
5
AI & ML interests
AI Safety, Mechanist Interpretability
Recent Activity
ishangarg183
updated
a dataset
1 day ago
MInAlA/crosscoder-multilayer-split-activations
ishangarg183
published
a dataset
4 days ago
MInAlA/crosscoder-multilayer-split-activations
ishangarg183
updated
a dataset
8 days ago
MInAlA/crosscoder-smollm3-ppo
View all activity
Team members
5
MInAlA
's models
18
Sort: Recently updated
MInAlA/Llama-3.2-3B-Instruct-PPO-merged
Text Generation
•
3B
•
Updated
13 days ago
•
272
MInAlA/SmolLM3-3B-PPO-merged
3B
•
Updated
14 days ago
•
42
MInAlA/Qwen3-4B-Instruct-2507-PPO-merged
Text Generation
•
4B
•
Updated
15 days ago
•
432
MInAlA/Llama-3.2-3B-SimPO-merged
Text Generation
•
3B
•
Updated
18 days ago
•
311
MInAlA/Qwen3-4B-Instruct-2507-SimPO-merged
Text Generation
•
4B
•
Updated
18 days ago
•
47
MInAlA/SmolLM3-3B-SimPO-merged
Text Generation
•
3B
•
Updated
19 days ago
•
36
MInAlA/Llama-3.2-3B-Instruct-GRPO-merged
Text Generation
•
3B
•
Updated
20 days ago
•
56
MInAlA/Qwen3-4B-Instruct-2507-GRPO-merged
Text Generation
•
4B
•
Updated
21 days ago
•
228
MInAlA/SmolLM3-3B-GRPO-merged
Text Generation
•
3B
•
Updated
24 days ago
•
40
MInAlA/Llama-3.2-3B-Instruct-KTO-merged
Text Generation
•
3B
•
Updated
24 days ago
•
282
MInAlA/Qwen3-4B-Instruct-2507-KTO-merged
Text Generation
•
4B
•
Updated
25 days ago
•
47
MInAlA/Qwen3-4B-ORPO-merged
4B
•
Updated
25 days ago
•
62
MInAlA/Llama-3.2-3B-ORPO-merged
Text Generation
•
Updated
26 days ago
•
433
MInAlA/SmolLM3-3B-KTO-merged
Text Generation
•
3B
•
Updated
26 days ago
•
352
MInAlA/SmolLM3-3B-ORPO-merged
Text Generation
•
3B
•
Updated
30 days ago
•
84
MInAlA/Llama-3.2-3B-DPO-merged
Text Generation
•
3B
•
Updated
Apr 5
•
118
MInAlA/Qwen3-4B-Instruct-2507-DPO-merged
Text Generation
•
4B
•
Updated
Apr 5
•
106
MInAlA/SmolLM3-3B-DPO-merged
Text Generation
•
3B
•
Updated
Apr 5
•
95