Sachin Kumar

techsachin

sachinkr_ai

AI & ML interests

Generative AI, NLP

Recent Activity

commented on a paper about 19 hours ago

Pressure-Testing Deception Probes in LLMs: Scaling, Robustness, and the Geometry of Deceptive Representations

submitted a paper about 19 hours ago

Pressure-Testing Deception Probes in LLMs: Scaling, Robustness, and the Geometry of Deceptive Representations

authored a paper 8 months ago

Overriding Safety protections of Open-source Models

Organizations

None yet

Papers 1

arxiv:2409.19476

models 0

None public yet

datasets 0

None public yet