arxiv:2409.19476
Sachin Kumar
techsachin
AI & ML interests
Generative AI,NLP
Recent Activity
commentedon a paper about 19 hours ago
Pressure-Testing Deception Probes in LLMs: Scaling, Robustness, and the Geometry of Deceptive Representations submitted a paper about 19 hours ago
Pressure-Testing Deception Probes in LLMs: Scaling, Robustness, and the Geometry of Deceptive Representations authored a paper 8 months ago
Overriding Safety protections of Open-source ModelsOrganizations
None yet