---
license: apache-2.0
---

# Baseline First Run Checkpoint

FSDP checkpoint from the training run, saved at `global_step_264`.

## Files

This repository contains FSDP-sharded checkpoint files:
- `model_world_size_8_rank_*.pt`: Model weights sharded across 8 GPUs
- `optim_world_size_8_rank_*.pt`: Optimizer states
- `extra_state_world_size_8_rank_*.pt`: Extra training state
- `fsdp_config.json`: FSDP configuration
- `huggingface/`: Tokenizer and config files
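
## Checking a download

The file naming pattern above suggests one shard per rank for each of the three prefixes. As a minimal sketch, the following script lists the shard names that pattern implies and reports which ones are absent from a local directory. It assumes ranks are numbered `0` through `7`, which the file names suggest but this card does not state; the helper names `expected_shard_names` and `missing_shards` are hypothetical and not part of any library.

```python
from pathlib import Path

WORLD_SIZE = 8  # taken from the "world_size_8" part of the file names
PREFIXES = ("model", "optim", "extra_state")


def expected_shard_names(world_size: int = WORLD_SIZE) -> list[str]:
    """Return the shard file names implied by the naming pattern.

    Assumption: ranks run from 0 to world_size - 1.
    """
    return [
        f"{prefix}_world_size_{world_size}_rank_{rank}.pt"
        for prefix in PREFIXES
        for rank in range(world_size)
    ]


def missing_shards(checkpoint_dir: str, world_size: int = WORLD_SIZE) -> list[str]:
    """Return the expected shard names that are not present in checkpoint_dir."""
    root = Path(checkpoint_dir)
    return [
        name
        for name in expected_shard_names(world_size)
        if not (root / name).is_file()
    ]


if __name__ == "__main__":
    # "." is a placeholder; point this at the directory holding the download.
    missing = missing_shards(".")
    if missing:
        print(f"{len(missing)} shard file(s) missing:")
        for name in missing:
            print(f"  {name}")
    else:
        print("All expected shard files are present.")
```

If the shards are ordinary PyTorch-serialized files, which the `.pt` extension suggests but does not guarantee, an individual shard can likely be inspected with `torch.load(path, map_location="cpu")`. Reassembling a full model from the shards depends on the training framework that wrote them.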