Spaces: israaaML/fsds_cleaning_env
main
fsds_cleaning_env
303 kB
2 contributors
History: 6 commits
israaaML
Claude Sonnet 4.6
add compare_agents.py: 4-way benchmark (Random/Heuristic/SFT/GRPO)
2968ead
1 day ago
configs
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
examples
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
1 day ago
server
fix: sanitize numpy/pandas types in submit_solution JSON serialization
1 day ago
tests
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
training
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
.dockerignore
Safe
80 Bytes
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
.gitattributes
Safe
1.52 kB
initial commit
2 days ago
.gitignore
Safe
109 Bytes
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
AGENT_GUIDE.md
Safe
17.5 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
AGENT_PLAN.md
Safe
11 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
Dockerfile
Safe
574 Bytes
Upload folder using huggingface_hub
2 days ago
FINAL_REPORT.md
Safe
8.09 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
1 day ago
README.md
Safe
9.69 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
__init__.py
Safe
520 Bytes
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
agents.py
Safe
16.9 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
1 day ago
benchmark_guides.md
Safe
4.42 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
1 day ago
client.py
Safe
502 Bytes
Upload folder using huggingface_hub
2 days ago
compare_agents.py
Safe
7.33 kB
add compare_agents.py: 4-way benchmark (Random/Heuristic/SFT/GRPO)
1 day ago
curriculum.py
Safe
8.6 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
dataset_generators.py
Safe
16.7 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
1 day ago
demonstrations.py
Safe
18.5 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
evaluate_agent.py
Safe
6.03 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
1 day ago
evaluation_tasks.py
Safe
2.21 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
metrics.py
Safe
4.65 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
model.py
Safe
106 Bytes
Upload folder using huggingface_hub
2 days ago
models.py
Safe
1.17 kB
Upload folder using huggingface_hub
2 days ago
openenv.yaml
Safe
100 Bytes
Upload folder using huggingface_hub
2 days ago
plan_results.md
Safe
22.5 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
project_recommendations.md
Safe
7.36 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
pyproject.toml
Safe
909 Bytes
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
results_heuristic.json
Safe
14.4 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
1 day ago
results_llm.json
Safe
12.1 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
1 day ago
results_random.json
Safe
13.9 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
1 day ago
reward.py
Safe
2.17 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago
training_colab.py
Safe
16 kB
v3: benchmark results, final report, agent/eval improvements, smoke test fixes
1 day ago
training_sft.py
Safe
7.37 kB
v2: curriculum scheduling, SFT pipeline, reward redesign, agent guide
1 day ago