Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

augustocsc
/
gpt2_medium_prefix_682k

PEFT
Safetensors
Model card Files Files and versions
xet
Community
gpt2_medium_prefix_682k / 2_training /reinforcement
188 kB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
augustocsc's picture
augustocsc
GPT-2 Medium trained on prefix dataset (682K)
a1190da verified 2 months ago
  • best_of_n_experiment.py
    14.7 kB
    GPT-2 Medium trained on prefix dataset (682K) 2 months ago
  • debug_reinforce.py
    10.5 kB
    GPT-2 Medium trained on prefix dataset (682K) 2 months ago
  • grpo_experiment.py
    10.7 kB
    GPT-2 Medium trained on prefix dataset (682K) 2 months ago
  • grpo_improved.py
    22.2 kB
    GPT-2 Medium trained on prefix dataset (682K) 2 months ago
  • grpo_symbolic.py
    18.2 kB
    GPT-2 Medium trained on prefix dataset (682K) 2 months ago
  • ppo_experiment.py
    17 kB
    GPT-2 Medium trained on prefix dataset (682K) 2 months ago
  • ppo_experiment_legacy.py
    13.6 kB
    GPT-2 Medium trained on prefix dataset (682K) 2 months ago
  • ppo_experiment_v2.py
    12 kB
    GPT-2 Medium trained on prefix dataset (682K) 2 months ago
  • ppo_symbolic.py
    22.6 kB
    GPT-2 Medium trained on prefix dataset (682K) 2 months ago
  • reinforce_experiment.py
    15.3 kB
    GPT-2 Medium trained on prefix dataset (682K) 2 months ago
  • reinforce_improved.py
    19.6 kB
    GPT-2 Medium trained on prefix dataset (682K) 2 months ago
  • run_ppo_experiments.py
    11.8 kB
    GPT-2 Medium trained on prefix dataset (682K) 2 months ago