augustocsc/gpt2_medium_prefix_682k
PEFT, Safetensors, arxiv:1910.09700
main: gpt2_medium_prefix_682k/2_training/reinforcement (188 kB)
1 contributor. History: 1 commit.
augustocsc: GPT-2 Medium trained on prefix dataset (682K), a1190da (verified), 2 months ago
best_of_n_experiment.py    14.7 kB   GPT-2 Medium trained on prefix dataset (682K)   2 months ago
debug_reinforce.py         10.5 kB   GPT-2 Medium trained on prefix dataset (682K)   2 months ago
grpo_experiment.py         10.7 kB   GPT-2 Medium trained on prefix dataset (682K)   2 months ago
grpo_improved.py           22.2 kB   GPT-2 Medium trained on prefix dataset (682K)   2 months ago
grpo_symbolic.py           18.2 kB   GPT-2 Medium trained on prefix dataset (682K)   2 months ago
ppo_experiment.py          17 kB     GPT-2 Medium trained on prefix dataset (682K)   2 months ago
ppo_experiment_legacy.py   13.6 kB   GPT-2 Medium trained on prefix dataset (682K)   2 months ago
ppo_experiment_v2.py       12 kB     GPT-2 Medium trained on prefix dataset (682K)   2 months ago
ppo_symbolic.py            22.6 kB   GPT-2 Medium trained on prefix dataset (682K)   2 months ago
reinforce_experiment.py    15.3 kB   GPT-2 Medium trained on prefix dataset (682K)   2 months ago
reinforce_improved.py      19.6 kB   GPT-2 Medium trained on prefix dataset (682K)   2 months ago
run_ppo_experiments.py     11.8 kB   GPT-2 Medium trained on prefix dataset (682K)   2 months ago