Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Spaces:
israaaML
/
fsds_cleaning_env
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
fsds_cleaning_env
303 kB
Ctrl+K
Ctrl+K
2 contributors
History:
6 commits
israaaML
Claude Sonnet 4.6
add compare_agents.py: 4-way benchmark (Random/Heuristic/SFT/GRPO)
2968ead
2 months ago
configs
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
examples
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
2 months ago
server
fix: sanitize numpy/pandas types in submit_solution JSON serialization
2 months ago
tests
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
training
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
.dockerignore
Safe
80 Bytes
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
.gitattributes
Safe
1.52 kB
initial commit
2 months ago
.gitignore
Safe
109 Bytes
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
AGENT_GUIDE.md
Safe
17.5 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
AGENT_PLAN.md
Safe
11 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
Dockerfile
Safe
574 Bytes
Upload folder using huggingface_hub
2 months ago
FINAL_REPORT.md
Safe
8.09 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
2 months ago
README.md
Safe
9.69 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
__init__.py
Safe
520 Bytes
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
agents.py
Safe
16.9 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
2 months ago
benchmark_guides.md
Safe
4.42 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
2 months ago
client.py
Safe
502 Bytes
Upload folder using huggingface_hub
2 months ago
compare_agents.py
Safe
7.33 kB
add compare_agents.py: 4-way benchmark (Random/Heuristic/SFT/GRPO)
2 months ago
curriculum.py
Safe
8.6 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
dataset_generators.py
Safe
16.7 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
2 months ago
demonstrations.py
Safe
18.5 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
evaluate_agent.py
Safe
6.03 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
2 months ago
evaluation_tasks.py
Safe
2.21 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
metrics.py
Safe
4.65 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
model.py
Safe
106 Bytes
Upload folder using huggingface_hub
2 months ago
models.py
Safe
1.17 kB
Upload folder using huggingface_hub
2 months ago
openenv.yaml
Safe
100 Bytes
Upload folder using huggingface_hub
2 months ago
plan_results.md
Safe
22.5 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
project_recommendations.md
Safe
7.36 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
pyproject.toml
Safe
909 Bytes
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
results_heuristic.json
Safe
14.4 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
2 months ago
results_llm.json
Safe
12.1 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
2 months ago
results_random.json
Safe
13.9 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
2 months ago
reward.py
Safe
2.17 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago
training_colab.py
Safe
16 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
2 months ago
training_sft.py
Safe
7.37 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
2 months ago