Commit History

Avoid fp32 logits copy in ZeroGPU GRPO loss
63cf098
Running
verified

winglian commited on

Patch ZeroGPU loss logprob memory path
724e766
verified

winglian commited on

Update ZeroGPU GRPO prototype with memory-safe logprobs
8078b9c
verified

winglian commited on

Pin flash-linear-attention for ZeroGPU smoke
355e365
verified

winglian commited on

Adapt FLA patch to installed kernel signature
31a73cc
verified

winglian commited on

Repair incomplete ScatterMoE LoRA kernel metadata
d7bf13c
verified

winglian commited on

Patch ZeroGPU kernel metadata recovery
e29a7af
verified

winglian commited on

Use kernels 0.14 for Kernel Hub ScatterMoE LoRA
53b9a59
verified

winglian commited on

Update Hatchery ZeroGPU prototype for ScatterMoE MoE smoke
bd2054b
verified

winglian commited on

Try causal-conv1d 1.6.2 post1
970532d
verified

winglian commited on

Restore no-CCE FLA wrapper with transformers pin
ffe6a1e
verified

winglian commited on

Pin transformers and remove CCE
72f8c36
verified

winglian commited on

Pass FLA grouped constexpr args
aa49223
verified

winglian commited on

Fix FLA grouped value-head backward patch
1038952
verified

winglian commited on

Make FLA backward patch contiguous
492d6f1
verified

winglian commited on

Install cut-cross-entropy and log CUDA memory
9ff4e9a
verified

winglian commited on

unwrap fla autotuner to raw jit kernel
c5d7d6f
verified

winglian commited on

patch fla bwd fixed kernel launch
3501891
verified

winglian commited on

bypass unsafe fla triton autotune on zerogpu
7de01ec
verified

winglian commited on

shim glibc mathcalls for tilelang nvcc
49cc0c2
verified

winglian commited on

narrow tilelang nvcc c2x workaround
8742f0e
verified

winglian commited on

patch tilelang tvm nvcc compile calls
30230d2
verified

winglian commited on

patch tilelang nvcc glibc c2x flags
2c3e189
verified

winglian commited on

patch cuda math function redeclarations
0f804f0
verified

winglian commited on

return zerogpu gpu worker tracebacks
f5f7a6e
verified

winglian commited on

return zerogpu train debug errors
fd91bef
verified

winglian commited on

add zerogpu train debug endpoint
b67c605
verified

winglian commited on

surface zerogpu endpoint errors
160ab3a
verified

winglian commited on

log zerogpu endpoint tracebacks
d7560d4
verified

winglian commited on

use writable cuda header shim for tilelang nvcc
60910df
verified

winglian commited on

patch cuda math header for tilelang nvcc
8ea51e2
verified

winglian commited on

use public scattermoe kernel ref
d3caeb4
verified

winglian commited on

enable core scattermoe tilelang dflash fixes
715a73d
verified

winglian commited on

add diagnostic tensor prefix breakdown
a10bbc7
verified

winglian commited on

expose model diagnostics in gradio api
efc4392
verified

winglian commited on

add model diagnostics endpoint
065340b
verified

winglian commited on

Use batched dflash fork
db2e429
verified

winglian commited on

Expose training profiles for wandb validation
97a96c6
verified

winglian commited on

Pin hatchery core dflash metadata commit
723a7bc
verified

winglian commited on

Manually place TorchAO FP8 tensors for ZeroGPU
878779c
verified

winglian commited on

Move TorchAO FP8 base before LoRA wrapping
5f5a708
verified

winglian commited on

Patch TorchAO Float8Tensor conversion ops for ZeroGPU
da0b39f
verified

winglian commited on

Patch TorchAO Float8Tensor empty_like for ZeroGPU
5f78613
verified

winglian commited on

Deploy TorchAO FP8 ZeroGPU prototype
fa47cb8
verified

winglian commited on

Use stable ZeroGPU state and sample logprobs
28dc00c
verified

winglian commited on

Return rollout logprobs from ZeroGPU sample
958e819
verified

winglian commited on

Use kernels-provided FA2 dependency
f34e1e8
verified

winglian commited on

Enable FA2 ZeroGPU prototype service
c1c9b79
verified

winglian commited on

Enable FA2 ZeroGPU prototype service
9cee980
verified

winglian commited on

Install causal conv wheel and skip zero-advantage GRPO groups
a6ad39a
verified

winglian commited on