
Judy Liang

Judy07

AI & ML interests

None yet

Recent Activity

liked a model 20 days ago

bosonai/higgs-audio-v3-tts-4b

commented on an article 3 months ago

Visualize and understand GPU memory in PyTorch


Organizations

None yet

liked a model 20 days ago

bosonai/higgs-audio-v3-tts-4b

Text-to-Speech • 5B • Updated 9 days ago • 86.1k • 523

commented on Visualize and understand GPU memory in PyTorch 3 months ago

Okay, I found the answer myself. Please check it to make sure I understand correctly.
The optimizer state holds the momentum and variance estimates, i.e. the information that is kept between steps in order to update the parameters.
There are also intermediates, which are temporary values created during the computation of the update. Once the parameters have been updated, they are destroyed.
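
To make the distinction concrete, here is a minimal sketch of one Adam update written by hand on plain Python lists. It is an illustration only, not PyTorch's actual implementation: the function `adam_step` and the `state` dictionary layout are assumptions made up for this example. The `m` and `v` lists play the role of the optimizer state and persist across calls, while `m_hat`, `v_hat`, and `update` are intermediates that exist only while a step is running.

```python
import math

def adam_step(params, grads, state, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One hand-written Adam update on lists of floats (illustrative only)."""
    # Optimizer state: created once, then kept alive across every step.
    if not state:
        state["step"] = 0
        state["m"] = [0.0] * len(params)  # first-moment (momentum) estimate
        state["v"] = [0.0] * len(params)  # second-moment (variance) estimate
    state["step"] += 1
    t = state["step"]

    for i, g in enumerate(grads):
        # Update the persistent running averages in place.
        state["m"][i] = beta1 * state["m"][i] + (1 - beta1) * g
        state["v"][i] = beta2 * state["v"][i] + (1 - beta2) * g * g

        # Intermediates: temporaries that exist only while this update runs.
        m_hat = state["m"][i] / (1 - beta1 ** t)  # bias-corrected momentum
        v_hat = state["v"][i] / (1 - beta2 ** t)  # bias-corrected variance
        update = lr * m_hat / (math.sqrt(v_hat) + eps)

        params[i] -= update
    # m_hat, v_hat and update are not stored anywhere; only `state` survives.
    return params

params = [1.0, -2.0]
state = {}  # empty until the first step
adam_step(params, [0.5, -0.5], state)
adam_step(params, [0.5, -0.5], state)
print(sorted(state))  # only the persistent entries remain
```

In a real GPU run the intermediates are tensors rather than floats, so they would be expected to show up as short-lived allocations during the optimizer step, whereas the state stays allocated for the whole of training.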

commented on Visualize and understand GPU memory in PyTorch 3 months ago

What is the difference between the optimizer state and the intermediates? Does "intermediate" mean the momentum m and the variance v of the gradients? I thought those were part of the optimizer state.