Commit History

Update README with comprehensive model card
df8a81e
verified

oxdev commited on

Add Google Colab training notebook for V2 GRPO training (free T4 path)
55ef8ec
verified

oxdev commited on

v2: 5K subset for A10G, fix escaping
3c818d7
verified

oxdev commited on

fix: escape syntax in quality_reward
75be256
verified

oxdev commited on

GRPO training complete — smart contract security auditor
93b6a9a
verified

oxdev commited on

add: GRPO v2 training script with 4 reward functions + dataset builder
9ab390c
verified

oxdev commited on

fix: total_mem -> total_memory for PyTorch compat
39535c8
verified

oxdev commited on

fix: disable all Hub calls during trainer init to prevent 401
3716618
verified

oxdev commited on

Upload tokenizer
1fb5e81
verified

oxdev commited on

Upload Qwen2ForCausalLM
044e65a
verified

oxdev commited on

Upload train_grpo_job.py with huggingface_hub
df05b8e
verified

oxdev commited on

Upload train_grpo_job.py with huggingface_hub
0ee8b77
verified

oxdev commited on

Upload train_grpo_job.py with huggingface_hub
eac5c9b
verified

oxdev commited on

Upload train_grpo_job.py with huggingface_hub
7168e35
verified

oxdev commited on

Upload train_grpo_job.py with huggingface_hub
74022f8
verified

oxdev commited on

initial commit
c4b5a68
verified

oxdev commited on