Drop 'small model' claim (training not yet converged) fec9b0b verified Swastikr committed on 27 days ago
Drop 'small model' claim (training not yet converged) 16e8be7 verified Swastikr committed on 27 days ago
Strip em-dashes; fix trap-library facts (4 held-out, 7 categories); explicit hardware-optimisation framing; cleaner 1x2 results grid 4a8c955 verified Swastikr committed on 27 days ago
Drop 'small model' claim (training not yet converged) e7387a6 verified Swastikr committed on 27 days ago
Sync chunk: top-level files (md + manifest + dockerfile + pyproject) 08c7594 verified Swastikr committed on 27 days ago
Drop 'small model' claim (training not yet converged) 7f64b0e verified Swastikr committed on 27 days ago
Drop 'small model' claim (training not yet converged) 4611d54 verified Swastikr committed on 27 days ago
Add training metrics grid; fix hardware-target claims (no RISC-V/Cortex-A78); add HF Jobs link f6092ff verified Swastikr committed on 27 days ago
Push diagrams + updated README (blog assets, env explainer) 8d51569 verified Swastikr committed on 27 days ago
Add HF job links (login-gated) to blog links section 8cc4b1d verified Swastikr committed on 27 days ago
Upload training_runs/partial-20260426-061351/grpo_component_means.png with huggingface_hub 1e1bb8a verified Swastikr committed on 27 days ago
Upload training_runs/partial-20260426-061351/grpo_reward_curve.png with huggingface_hub b38e4e6 verified Swastikr committed on 27 days ago
Upload training_runs/partial-20260426-061351/baseline_vs_trained_metrics.png with huggingface_hub e008f4d verified Swastikr committed on 27 days ago
Upload training_runs/partial-20260426-061351/reward_distribution.png with huggingface_hub f854c6b verified Swastikr committed on 27 days ago
Upload training_runs/partial-20260426-061351/summary.json with huggingface_hub 56e1a53 verified Swastikr committed on 27 days ago
Upload training_runs/partial-20260426-060026/grpo_component_means.png with huggingface_hub 4284d77 verified Swastikr committed on 28 days ago
Upload training_runs/partial-20260426-060026/grpo_reward_curve.png with huggingface_hub cb869de verified Swastikr committed on 28 days ago
Upload training_runs/partial-20260426-060026/baseline_vs_trained_metrics.png with huggingface_hub ccde9ce verified Swastikr committed on 28 days ago
Upload training_runs/partial-20260426-060026/reward_distribution.png with huggingface_hub 15ac920 verified Swastikr committed on 28 days ago
Upload training_runs/partial-20260426-060026/summary.json with huggingface_hub fc22824 verified Swastikr committed on 28 days ago
Sync latest submission updates (GRPO-only + blog markdown + bugfixes) bca801b verified Swastikr committed on 28 days ago
Upload training_runs/partial-20260425-211348/reward_distribution.png with huggingface_hub 015a9ee verified Swastikr committed on 28 days ago
Upload training_runs/partial-20260425-211348/summary.json with huggingface_hub 502d078 verified Swastikr committed on 28 days ago
Upload training_runs/partial-20260425-210137/reward_distribution.png with huggingface_hub d418572 verified Swastikr committed on 28 days ago
Upload training_runs/partial-20260425-210137/summary.json with huggingface_hub ea93dca verified Swastikr committed on 28 days ago
Upload training_runs/partial-20260425-205532/reward_distribution.png with huggingface_hub e6affec verified Swastikr committed on 28 days ago
Upload training_runs/partial-20260425-205532/summary.json with huggingface_hub 163552c verified Swastikr committed on 28 days ago
Improve training data quality, teacher policy, ablation, and reward-aligned stage 26a1334 verified Swastikr committed on 28 days ago
Upload training_runs/partial-20260425-161650/reward_distribution.png with huggingface_hub 76e7027 verified Swastikr committed on 28 days ago
Upload training_runs/partial-20260425-161650/summary.json with huggingface_hub 285f68a verified Swastikr committed on 28 days ago
Add LoRA-enabled script runner with auth-safe snapshot download 46d8540 verified Swastikr committed on 28 days ago
Fix CUDA dtype and device handling for full training 70766a6 verified Swastikr committed on 28 days ago
Upload training_runs/notebooks/openenv_hackathon_training.partial.20260425-124204.ipynb with huggingface_hub bec441e verified Swastikr committed on 28 days ago