Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Sizzing
/
aws_rl_env
like
3
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
aws_rl_env
/
server
/
static
/
figures
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
Sizzing
Upload folder using huggingface_hub
41c81d5
verified
12 days ago
base_vs_sft_success.png
Safe
72.3 kB
Upload folder using huggingface_hub
12 days ago
compare_dataset.png
280 kB
xet
Upload folder using huggingface_hub
12 days ago
compare_rl_env.png
201 kB
xet
Upload folder using huggingface_hub
12 days ago
grpo_final_per_step.png
Safe
243 kB
xet
Upload folder using huggingface_hub
12 days ago
grpo_optuna_history.png
Safe
54.8 kB
Upload folder using huggingface_hub
12 days ago
grpo_optuna_importances.png
Safe
40.7 kB
Upload folder using huggingface_hub
12 days ago
grpo_optuna_trials_comparison.png
Safe
123 kB
xet
Upload folder using huggingface_hub
12 days ago
grpo_per_tier_curve.png
Safe
34.3 kB
Upload folder using huggingface_hub
12 days ago
grpo_reward_curve.png
Safe
260 kB
xet
Upload folder using huggingface_hub
12 days ago
ministack_logo.png
Safe
122 kB
xet
Upload folder using huggingface_hub
12 days ago
model_eval_chart.png
Safe
85 kB
Upload folder using huggingface_hub
12 days ago
optuna_history.png
Safe
41.3 kB
Upload folder using huggingface_hub
12 days ago
optuna_param_importance.png
Safe
44.5 kB
Upload folder using huggingface_hub
12 days ago
qualitative_rollouts.png
Safe
45.1 kB
Upload folder using huggingface_hub
12 days ago
sft_loss_curve.png
Safe
178 kB
xet
Upload folder using huggingface_hub
12 days ago
sft_vs_grpo_by_tier.png
Safe
72.3 kB
Upload folder using huggingface_hub
12 days ago
sft_vs_grpo_metrics_grid.png
Safe
64.3 kB
Upload folder using huggingface_hub
12 days ago