Update README.md
README.md
CHANGED
@@ -54,7 +54,8 @@ tokenizer = AutoTokenizer.from_pretrained(model_name)
 model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")
 
 # Example: Prompt the model for a Sudoku move or Sokoban action
+```
 
-
+## Links:
 - 📄 **Paper:** [SCOUT: Sequential RL with Exploration & Distillation](https://huggingface.co/papers/2601.21754)
 - 💻 **Code:** [Github](https://github.com/Harry-mic/SCOUT)
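
The README snippet touched by this change ends at the comment `# Example: Prompt the model for a Sudoku move or Sokoban action` without showing the prompt itself. The sketch below is one way that step might look. The prompt layout and the helper names `build_sudoku_prompt` and `generate_action` are hypothetical and not taken from the SCOUT repository; the format the model was actually trained on may differ. It assumes `model` and `tokenizer` are the objects loaded with `AutoModelForCausalLM.from_pretrained` and `AutoTokenizer.from_pretrained` earlier in the README.

```python
def build_sudoku_prompt(grid):
    """Render a grid (0 = empty cell) as a plain-text prompt.

    The layout is an illustrative assumption, not the format used by SCOUT.
    """
    rows = [" ".join(str(v) if v else "." for v in row) for row in grid]
    board = "\n".join(rows)
    return f"Sudoku board (. = empty):\n{board}\nNext move (row, column, value):"


def generate_action(model, tokenizer, prompt, max_new_tokens=32):
    """Generate a continuation and return only the newly produced text."""
    # Tokenize and move the tensors to the device the model was placed on.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Drop the prompt tokens so only the model's answer is decoded.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

With the model loaded, a call such as `generate_action(model, tokenizer, build_sudoku_prompt(grid))` would return the model's raw text, which still needs to be parsed into a move.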