Spaces:

saravanatanjiro
/

Openenv

Paused

App Files Files Community

Openenv

Commit History

Fix torch bfloat16 errors on T4 GPUs by enabling Unsloth dtype auto-detection and explicitly wrapping forward passes in autocast

4b22b06

saravanatanjiro commited on 14 days ago

Set TRITON_CACHE_DIR to /tmp/triton_cache to avoid root permission denied error

5f168d6

saravanatanjiro commited on 14 days ago

Add mandatory Unsloth inference state toggles around generation for RL pipeline

81ed883

saravanatanjiro commited on 14 days ago

Pin pydantic, fastapi, and starlette to fix Gradio 4.x JSON schema and TemplateResponse bugs

56934e2

saravanatanjiro commited on 14 days ago

Pin Gradio to 4.36.1 to fix TypeError during json schema parsing on startup

477c526

saravanatanjiro commited on 14 days ago

Fix Gradio runtime error by moving theme to gr.Blocks

a593df9

saravanatanjiro commited on 14 days ago

Pin huggingface-hub to 0.24.7 to fix Unsloth _token import error

4dfbc48

saravanatanjiro commited on 14 days ago

Capture and display exact Unsloth import exception

fbf8187

saravanatanjiro commited on 14 days ago

Switch SDK to docker to use custom Dockerfile and fix pip build

b4a2158

saravanatanjiro commited on 14 days ago

Fix Gradio sdk_version to a valid fully-specified version (4.44.0)

07dcf6a

saravanatanjiro commited on 14 days ago

Add HuggingFace Space configuration reference to README

10062f6

saravanatanjiro commited on 14 days ago

Update with existing environment

d81b76a

saravanatanjiro commited on 14 days ago

Fix GRPO group-loss training and align UI defaults.

5a0c6af

saravanatanjiro commited on 14 days ago

Migrate LLM pipeline to custom GRPO with robust rewards

dfc5996

saravanatanjiro commited on 14 days ago

Multi-model benchmark pipeline: VRAM cleanup + EMA graph + detailed output

af6bbef

kavin57447 commited on 15 days ago

Fix truncation: 80 tokens, regex safety net, strict prompt

deef82c

kavin57447 commited on 15 days ago

Hackathon speedrun: max_new_tokens=32, seq_len=512 for 4-8x faster iterations

ee5ddee

kavin57447 commited on 15 days ago

Replace flash-attn with PyTorch built-in SDPA (no CUDA compile needed)

e9dea07

kavin57447 commited on 15 days ago

Fix: install torch before flash-attn (needs torch at build time)

332efeb

kavin57447 commited on 15 days ago

Max GPU utilization: flash-attn2 + grad accumulation + 15 steps/ep + 1024 seq len

93d0171

kavin57447 commited on 15 days ago

Cap LLM iterations at 50 to prevent timeout on 8B models

f20bc34

kavin57447 commited on 15 days ago

Switch to Llama 3.1 8B + fix low-timestep crash (min 5000)

8d95050

kavin57447 commited on 15 days ago

Pin torch/transformers/peft versions to fix cache conflict

27c9425

kavin57447 commited on 15 days ago

Fix permission: mkdir after COPY, chmod /app

ee0ba57

kavin57447 commited on 15 days ago

Fix matplotlib permission + HF cache dirs

0eef0af

kavin57447 commited on 15 days ago

Add LLM RL training with Gemma 7B + LoRA

ee3dfa7

kavin57447 commited on 15 days ago

Fix Gradio 6.0 theme deprecation

1c86d42

kavin57447 commited on 15 days ago

Add Cloud Arena Mathematical Model RL environment

12263fa

kavin57447 commited on 15 days ago

initial commit

9c5fcc9
verified

saravanatanjiro commited on 15 days ago

Commit History

Fix torch bfloat16 errors on T4 GPUs by enabling Unsloth dtype auto-detection and explicitly wrapping forward passes in autocast 4b22b06

Set TRITON_CACHE_DIR to /tmp/triton_cache to avoid root permission denied error 5f168d6

Add mandatory Unsloth inference state toggles around generation for RL pipeline 81ed883

Pin pydantic, fastapi, and starlette to fix Gradio 4.x JSON schema and TemplateResponse bugs 56934e2

Pin Gradio to 4.36.1 to fix TypeError during json schema parsing on startup 477c526

Fix Gradio runtime error by moving theme to gr.Blocks a593df9

Pin huggingface-hub to 0.24.7 to fix Unsloth _token import error 4dfbc48

Capture and display exact Unsloth import exception fbf8187

Switch SDK to docker to use custom Dockerfile and fix pip build b4a2158

Fix Gradio sdk_version to a valid fully-specified version (4.44.0) 07dcf6a

Add HuggingFace Space configuration reference to README 10062f6

Update with existing environment d81b76a

Fix GRPO group-loss training and align UI defaults. 5a0c6af

Migrate LLM pipeline to custom GRPO with robust rewards dfc5996

Multi-model benchmark pipeline: VRAM cleanup + EMA graph + detailed output af6bbef

Fix truncation: 80 tokens, regex safety net, strict prompt deef82c

Hackathon speedrun: max_new_tokens=32, seq_len=512 for 4-8x faster iterations ee5ddee

Replace flash-attn with PyTorch built-in SDPA (no CUDA compile needed) e9dea07

Fix: install torch before flash-attn (needs torch at build time) 332efeb

Max GPU utilization: flash-attn2 + grad accumulation + 15 steps/ep + 1024 seq len 93d0171

Cap LLM iterations at 50 to prevent timeout on 8B models f20bc34

Switch to Llama 3.1 8B + fix low-timestep crash (min 5000) 8d95050

Pin torch/transformers/peft versions to fix cache conflict 27c9425

Fix permission: mkdir after COPY, chmod /app ee0ba57

Fix matplotlib permission + HF cache dirs 0eef0af

Add LLM RL training with Gemma 7B + LoRA ee3dfa7

Fix Gradio 6.0 theme deprecation 1c86d42

Add Cloud Arena Mathematical Model RL environment 12263fa

initial commit 9c5fcc9 verified

Fix torch bfloat16 errors on T4 GPUs by enabling Unsloth dtype auto-detection and explicitly wrapping forward passes in autocast

4b22b06

Set TRITON_CACHE_DIR to /tmp/triton_cache to avoid root permission denied error

5f168d6

Add mandatory Unsloth inference state toggles around generation for RL pipeline

81ed883

Pin pydantic, fastapi, and starlette to fix Gradio 4.x JSON schema and TemplateResponse bugs

56934e2

Pin Gradio to 4.36.1 to fix TypeError during json schema parsing on startup

477c526

Fix Gradio runtime error by moving theme to gr.Blocks

a593df9

Pin huggingface-hub to 0.24.7 to fix Unsloth _token import error

4dfbc48

Capture and display exact Unsloth import exception

fbf8187

Switch SDK to docker to use custom Dockerfile and fix pip build

b4a2158

Fix Gradio sdk_version to a valid fully-specified version (4.44.0)

07dcf6a

Add HuggingFace Space configuration reference to README

10062f6

Update with existing environment

d81b76a

Fix GRPO group-loss training and align UI defaults.

5a0c6af

Migrate LLM pipeline to custom GRPO with robust rewards

dfc5996

Multi-model benchmark pipeline: VRAM cleanup + EMA graph + detailed output

af6bbef

Fix truncation: 80 tokens, regex safety net, strict prompt

deef82c

Hackathon speedrun: max_new_tokens=32, seq_len=512 for 4-8x faster iterations

ee5ddee

Replace flash-attn with PyTorch built-in SDPA (no CUDA compile needed)

e9dea07

Fix: install torch before flash-attn (needs torch at build time)

332efeb

Max GPU utilization: flash-attn2 + grad accumulation + 15 steps/ep + 1024 seq len

93d0171

Cap LLM iterations at 50 to prevent timeout on 8B models

f20bc34

Switch to Llama 3.1 8B + fix low-timestep crash (min 5000)

8d95050

Pin torch/transformers/peft versions to fix cache conflict

27c9425

Fix permission: mkdir after COPY, chmod /app

ee0ba57

Fix matplotlib permission + HF cache dirs

0eef0af

Add LLM RL training with Gemma 7B + LoRA

ee3dfa7

Fix Gradio 6.0 theme deprecation

1c86d42

Add Cloud Arena Mathematical Model RL environment

12263fa

initial commit

9c5fcc9
verified