The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability Paper • 2408.01416 • Published Aug 2, 2024 • 1
Themis: Towards Flexible and Interpretable NLG Evaluation Paper • 2406.18365 • Published Jun 26, 2024
NNsight and NDIF: Democratizing Access to Foundation Model Internals Paper • 2407.14561 • Published Jul 18, 2024 • 35
NNsight and NDIF: Democratizing Access to Foundation Model Internals Paper • 2407.14561 • Published Jul 18, 2024 • 35
ThinkGrasp: A Vision-Language System for Strategic Part Grasping in Clutter Paper • 2407.11298 • Published Jul 16, 2024 • 6
Imagination Policy: Using Generative Point Cloud Models for Learning Manipulation Policies Paper • 2406.11740 • Published Jun 17, 2024 • 1
Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs Paper • 2406.20086 • Published Jun 28, 2024 • 6
Linearity of Relation Decoding in Transformer Language Models Paper • 2308.09124 • Published Aug 17, 2023 • 2
Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models Paper • 2311.12092 • Published Nov 20, 2023 • 22