Петров Наталья
harperrobinson
AI & ML interests
Paper replication and lightweight experiments. Mostly focused on experiments.
Recent Activity
upvoted a paper about 14 hours ago
The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement liked a model about 23 hours ago
sundaycoil/barcode-generator liked a model 3 days ago
meta-llama/Llama-3.2-1B-InstructOrganizations
None yet