Harryis
/

SCOUT_multitask

Reinforcement Learning

Model card Files Files and versions

Harryis commited on Feb 1

Commit

f3d2f71

·

verified ·

1 Parent(s): d421d96

Update README.md

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -54,7 +54,8 @@ tokenizer = AutoTokenizer.from_pretrained(model_name)
 model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")
 # Example: Prompt the model for a Sudoku move or Sokoban action
-**Links:**
 - 📄 **Paper:** [SCOUT: Sequential RL with Exploration & Distillation](https://huggingface.co/papers/2601.21754)
 - 💻 **Code:** [Github](https://github.com/Harry-mic/SCOUT)

 model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")
 # Example: Prompt the model for a Sudoku move or Sokoban action
+```
+## Links:
 - 📄 **Paper:** [SCOUT: Sequential RL with Exploration & Distillation](https://huggingface.co/papers/2601.21754)
 - 💻 **Code:** [Github](https://github.com/Harry-mic/SCOUT)