NerVE: Nonlinear Eigenspectrum Dynamics in LLM Feed-Forward Networks Paper • 2603.06922 • Published 6 days ago
A Random Matrix Theory Perspective on the Learning Dynamics of Multi-head Latent Attention Paper • 2507.09394 • Published Jul 12, 2025
Spectral Scaling Laws in Language Models: How Effectively Do Feed-Forward Networks Use Their Latent Space? Paper • 2510.00537 • Published Oct 1, 2025 • 3
AERO: Softmax-Only LLMs for Efficient Private Inference Paper • 2410.13060 • Published Oct 16, 2024 • 4
DeepReShape: Redesigning Neural Networks for Efficient Private Inference Paper • 2304.10593 • Published Apr 20, 2023
ReLU's Revival: On the Entropic Overload in Normalization-Free Large Language Models Paper • 2410.09637 • Published Oct 12, 2024 • 3
Sisyphus: A Cautionary Tale of Using Low-Degree Polynomial Activations in Privacy-Preserving Deep Learning Paper • 2107.12342 • Published Jul 26, 2021
CryptoNite: Revealing the Pitfalls of End-to-End Private Inference at Scale Paper • 2111.02583 • Published Nov 4, 2021
Modeling Data Reuse in Deep Neural Networks by Taking Data-Types into Cognizance Paper • 2008.02565 • Published Aug 6, 2020