Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Spaces:
hitanshjain1812
/
meta_final_model
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
meta_final_model
907 kB
Ctrl+K
Ctrl+K
3 contributors
History:
52 commits
hitanshjain1812
Add multiple training metric plots to artifacts.
8028a48
about 1 month ago
colab
Stabilize GRPO training: robust JSON parsing, fallback scoring, and safer Colab defaults
about 1 month ago
fixtures
Fixed fixtures
about 1 month ago
kaggle
made Changes
about 1 month ago
pr_review_env
Latency based rewarding done
about 1 month ago
server
Latency based rewarding done
about 1 month ago
tests
Latency based rewarding done
about 1 month ago
.dockerignore
Safe
336 Bytes
fix: use explicit COPY in Dockerfile to prevent build context bloat
about 2 months ago
.env.example
Safe
496 Bytes
making changes in inference
about 2 months ago
.gitattributes
Safe
1.52 kB
initial commit
about 2 months ago
.gitignore
Safe
159 Bytes
Add Colab GRPO training pipeline, docs, and inference robustness fixes
about 1 month ago
ARCHITECTURE.md
Safe
8.23 kB
first commit
about 2 months ago
COMPETITIVE_ANALYSIS.md
Safe
9.93 kB
first commit
about 2 months ago
DEPLOYMENT.md
Safe
11.2 kB
first commit
about 2 months ago
Dockerfile
Safe
1.05 kB
Fix training Space logging and artifact write path.
about 1 month ago
FULL_PROJECT_DOCUMENTATION.md
Safe
15.5 kB
Added fixtures
about 1 month ago
HF_MINI_BLOG_DRAFT.md
Safe
1.67 kB
Add Colab GRPO training pipeline, docs, and inference robustness fixes
about 1 month ago
JUDGES_GUIDE.md
Safe
1.57 kB
Add Colab GRPO training pipeline, docs, and inference robustness fixes
about 1 month ago
PRESENTATION_SCRIPT.md
Safe
2.49 kB
made Changes
about 1 month ago
README.md
Safe
6.68 kB
made Changes
about 1 month ago
SCORING_ANALYSIS.md
Safe
11.2 kB
fix: align score ranges with actual grader output and update baseline scores
about 2 months ago
SLIDE_DECK_CONTENT.md
Safe
2.22 kB
made Changes
about 1 month ago
SUBMISSION_READY.md
Safe
924 Bytes
Add Colab GRPO training pipeline, docs, and inference robustness fixes
about 1 month ago
VALIDATION_CHECKLIST.md
Safe
1.26 kB
Add Colab GRPO training pipeline, docs, and inference robustness fixes
about 1 month ago
blog.md
Safe
9.71 kB
made Changes
about 1 month ago
conftest.py
Safe
1 kB
first commit
about 2 months ago
inference.py
Safe
12.7 kB
Latency based rewarding done
about 1 month ago
openenv.yaml
Safe
3.05 kB
files fixed according to fixtures
about 1 month ago
pyproject.toml
Safe
871 Bytes
fix: resolve fastapi version conflict with gradio dependencies
about 2 months ago
requirements-train.txt
Safe
224 Bytes
Add Colab GRPO training pipeline, docs, and inference robustness fixes
about 1 month ago
requirements.txt
Safe
104 Bytes
making changes in inference
about 2 months ago
run_training_space.sh
Safe
2.41 kB
Hide noisy GRPO metrics from Space console logs.
about 1 month ago
test_output_format.py
Safe
426 Bytes
fix: align score ranges with actual grader output and update baseline scores
about 2 months ago
train_grpo.py
Safe
38.6 kB
Add multiple training metric plots to artifacts.
about 1 month ago
uv.lock
Safe
451 kB
fix: resolve fastapi version conflict with gradio dependencies
about 2 months ago
validate-submission.sh
Safe
5.44 kB
chore: remove secrets and add gitignore
about 2 months ago