furproxy commited on
Commit
5584385
·
verified ·
1 Parent(s): 18b9c7f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md CHANGED
@@ -35,6 +35,7 @@ More information needed
35
  ### Training hyperparameters
36
 
37
  The following hyperparameters were used during training:
 
38
  - family_to_muon_lr = {
39
  "language": _fallback(getattr(training_args, "language_muon_lr", 1e-1), language_lr),
40
  "vision": _fallback(getattr(training_args, "vision_muon_lr", 3e-5), vision_lr),
 
35
  ### Training hyperparameters
36
 
37
  The following hyperparameters were used during training:
38
+ - used Muon for vision+merger(projector), AdamW for language
39
  - family_to_muon_lr = {
40
  "language": _fallback(getattr(training_args, "language_muon_lr", 1e-1), language_lr),
41
  "vision": _fallback(getattr(training_args, "vision_muon_lr", 3e-5), vision_lr),