TinySQL Collection This collection is based on TinySQL: A Progressive Text-to-SQL Dataset for Mechanistic Interpretability Research • 31 items • Updated 7 days ago • 5
Psychometrics Resources Collection Resources for Measure what Matters: Psychometric Evaluation of AI with Situational Judgment Tests (https://arxiv.org/abs/2510.22170) • 6 items • Updated 11 days ago • 2
Nemotron-Personas Collection A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions. • 5 items • Updated 2 days ago • 25
TinySQL: A Progressive Text-to-SQL Dataset for Mechanistic Interpretability Research Paper • 2503.12730 • Published Mar 17, 2025 • 4
k-steering Collection Collecting datasets used for our paper on multi-attribute steering using gradient descent. • 7 items • Updated Nov 3, 2025 • 1
Activation Space Interventions Can Be Transferred Between Large Language Models Paper • 2503.04429 • Published Mar 6, 2025 • 2
Transferring Activation Features for model interventions Collection Models and datasets used for our paper on transferring activations between models. • 23 items • Updated Oct 29, 2025 • 1
Blog: Activations transfer for model interventions. Collection Collects backdoor datasets, language models and transfer mappings between these spaces. • 5 items • Updated 12 days ago • 3
Beyond Training Objectives: Interpreting Reward Model Divergence in Large Language Models Paper • 2310.08164 • Published Oct 12, 2023 • 4