Commit History

Increase CPU subprocess timeout from 10min to 2h for long recordings
9742824

hetchyy Claude Opus 4.6 (1M context) commited on

Usage logging V2: grouped JSON schema, endpoint tracking, GPU profiling
79be82c

hetchyy Claude Opus 4.6 commited on

Remove dead code and add GPU VRAM profiling
16c3710

hetchyy Claude Opus 4.6 commited on

Isolate CPU fallback in spawn subprocess to prevent CUDA state poisoning
df9144c

hetchyy Claude Opus 4.6 commited on

Eliminate in-decorator CPU fallback to prevent CUDA state poisoning
7711775

hetchyy Claude Opus 4.6 commited on

Stop CUDA state poisoning by not invalidating model caches during quota fallback
2385694

hetchyy Claude Opus 4.6 commited on

Add debug logging
f90d55a

hetchyy commited on

Park AoTI cache and detach VAD tensors to prevent CUDA state poisoning
99f4672

hetchyy Claude Opus 4.6 commited on

Fix syntax
bb85651

hetchyy commited on

Prevent CPU fallback from poisoning CUDA state via GPU model parking
0318d9e

hetchyy Claude Opus 4.6 commited on

Limit CPU fallback to quota errors only
315860e

hetchyy commited on

Refine ZeroGPU fallback for worker/runtime errors
64a0a12

hetchyy commited on

Reset torch.cuda._initialized on GPU errors instead of nuking global model caches
93e5e86

hetchyy Claude Opus 4.6 commited on

Replace os._exit(1) with CPU fallback on GPU errors
3f00433

hetchyy Claude Opus 4.6 commited on

Add transition segment detection for non-Quranic phrases (Amin, Takbir, Tahmeed)
7f362a6

hetchyy Claude Opus 4.6 commited on

Add GPU retry for SDK worker errors instead of immediate CPU fallback
aadab30

hetchyy Claude Opus 4.6 commited on

Fix SDK worker_init CUDA poisoning: reset immediately instead of 300s cooldown
6984b50

hetchyy Claude Opus 4.6 commited on

Fix GPU cross-user contamination: scope errors per-request, not process-wide
a6f747e

hetchyy Claude Opus 4.6 commited on

Fix GPU quota flag blocking unrelated users across the shared process
f5c9eae

hetchyy Claude Opus 4.6 commited on

Eliminate CUDA state poisoning at root cause, not just detection
1d373d6

hetchyy Claude Opus 4.6 commited on

Catch SDK-wrapped GPU errors that bypass CUDA error detection
d2b3d8c

hetchyy Claude Opus 4.6 commited on

Fix CUDA state poisoning by eliminating CUDA ops outside GPU leases
0d7604b

hetchyy Claude Opus 4.6 commited on

Prevent CUDA state poisoning and add thread-safe GPU lifecycle management
65f2050

hetchyy Claude Opus 4.6 commited on

Harden GPU fallback to prevent CUDA init from permanently poisoning process
511d54d

hetchyy Claude Opus 4.6 commited on

Compress logging audio format
7990dde

hetchyy commited on

Fix race condition where concurrent quota exhaustion permanently breaks GPU
47224bd

hetchyy Claude Opus 4.6 commited on

Widen DP lookback window and fix null-typed parquet columns
632df05

hetchyy Claude Opus 4.6 commited on

Fix delay between progress bar completion and results displaying
e3131ff

hetchyy Claude Opus 4.6 commited on

Restructure src/ for cleaner modularity
0351f22

hetchyy Claude Opus 4.6 commited on