stephenchungmh commited on
Commit
749fbca
·
verified ·
1 Parent(s): ea44e55

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -3
README.md CHANGED
@@ -1,3 +1,9 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ base_model:
4
+ - deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
5
+ tags:
6
+ - RL
7
+ - Math
8
+ ---
9
+ This is the trained Thinker-R7B model from the paper [**Thinker: Learning to Think Fast and Slow**](https://arxiv.org/abs/2505.21097). Please refer to the [GitHub repo](https://github.com/stephen-chung-mh/thinker-task) for details.