Michael J. Clark's picture

Michael J. Clark

wassname

·

https://wassname.org

AI & ML interests

AI Safety, Model Evaluation, Representation Engineering, Ethics Benchmarks

Recent Activity

updated a dataset 6 days ago

wassname/moral_stories_foundations

updated a dataset 9 days ago

wassname/social_chemistry_101

published a dataset 9 days ago

wassname/social_chemistry_101

View all activity

Organizations

None yet

upvoted a collection 2 months ago

User Modeling

60 items • Updated May 11 • 1

upvoted a paper 2 months ago

AntiPaSTO: Self-Supervised Steering of Moral Reasoning

Paper • 2601.07473 • Published Jan 12 • 1

upvoted a collection about 1 year ago

Foundation Text-Generation Models Below 360M Parameters

Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 43 items • Updated May 22 • 46

upvoted a paper over 2 years ago

RewardBench: Evaluating Reward Models for Language Modeling

Paper • 2403.13787 • Published Mar 20, 2024 • 22