kejian
/

cpsc-debug

English

Generated from Trainer

Model card Files Files and versions

xet

Community

kejian commited on Feb 27, 2023

Commit

0b439d9

1 Parent(s): 3e3774c

update model card README.md

Browse files

Files changed (1) hide show

README.md +3 -18

README.md CHANGED Viewed

@@ -156,23 +156,8 @@ The following hyperparameters were used during training:
                                                           'top_k': 0,
                                                           'top_p': 0.9},
                                       'name': 'unconditional',
-                                      'num_samples': 2048,
-                                      'prefix': '<|aligned|>'},
-                                     {'generate_kwargs': {'bad_words_ids': [[50257],
-                                                                            [50258],
-                                                                            [50259],
-                                                                            [50260]],
-                                                          'do_sample': True,
-                                                          'max_length': 128,
-                                                          'min_length': 10,
-                                                          'temperature': 0.7,
-                                                          'top_k': 0,
-                                                          'top_p': 0.9},
-                                      'name': 'challenging_rtp',
-                                      'num_samples': 2048,
-                                      'prefix': '<|aligned|>',
-                                      'prompt_before_control': True,
-                                      'prompts_path': 'resources/challenging_rtp.jsonl'}],
                 'scorer_config': {'device': 'cuda:0'}},
  'kl_gpt3_callback': {'force_call_on': [22888],
                       'gpt3_kwargs': {'model_name': 'davinci'},
@@ -212,4 +197,4 @@ The following hyperparameters were used during training:
               'weight_decay': 0.1}}
 # Wandb URL:
-https://wandb.ai/kejian/uncategorized/runs/20zd4b2c

                                                           'top_k': 0,
                                                           'top_p': 0.9},
                                       'name': 'unconditional',
+                                      'num_samples': 512,
+                                      'prefix': '<|aligned|>'}],
                 'scorer_config': {'device': 'cuda:0'}},
  'kl_gpt3_callback': {'force_call_on': [22888],
                       'gpt3_kwargs': {'model_name': 'davinci'},
               'weight_decay': 0.1}}
 # Wandb URL:
+https://wandb.ai/kejian/uncategorized/runs/1llp96zs