reasoning v2: stronger oracle recipe, strict+lenient eval, refit mappings 8ddc968 verified Samarth0710 commited on 30 days ago
Add cross-model LoRA adapter prediction — reasoning validation (GSM8K) 4a74ac9 verified Samarth0710 commited on about 1 month ago