Checkpts 9b vs 9b-it storage

#10

by Hemanth-thunder - opened Jun 28, 2024

Discussion

Hemanth-thunder

Jun 28, 2024

•

edited Jun 28, 2024

hello, between these two checkpoints (9b and 9b-it) is that the new one seems to be uploading extra shards.

gemma-2-9b-it --> checkpoint shards with 4
gemma-2-9b --> checkpoint shards with 8

rashmi

Jun 30, 2024

gemma-2-9b - This model is in float32 and the other one float16 , hence the extra shards I believe

Hemanth-thunder

Jul 2, 2024

hello @rashmi I was wondering the same thing. Do models have different checkpoints for float32 and float16? No. It seems that a dtype for different precision can convert on the fly.

lkv

Google org Jan 16, 2025

Hi @Hemanth-thunder and @rashmi , Could you please confirm if issue is resolved feel free to close this or any concerns let us know will assist you.

Thank you.

Hemanth-thunder changed discussion status to closed Jan 16, 2025

Hemanth-thunder

Jan 16, 2025

Hi @lkv This issue still persists, but I managed to delete the additional model splits manually.

Hemanth-thunder changed discussion status to open Jan 16, 2025

Hemanth-thunder changed discussion status to closed Jan 16, 2025

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment