M-R1 distilled baseline (cd3args only)
- Exported from LLaMA-Factory
- Intended for Countdown 3-args (baseline)
Usage
from transformers import AutoModelForCausalLM, AutoTokenizer
tok = AutoTokenizer.from_pretrained('TAUR-dev/M-R1_distilled_baseline_cd3args_only')
model = AutoModelForCausalLM.from_pretrained('TAUR-dev/M-R1_distilled_baseline_cd3args_only', torch_dtype='auto')