nielsr HF Staff committed on
Commit 1d1b3db · verified · 1 Parent(s): 01323c8

Add model card and metadata

This PR improves the model card for the Boundary Predictor model. It adds relevant metadata, including the library name and pipeline tag, and provides a description of the model's function within the TESSY framework. It also links the model to the research paper [How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data](https://huggingface.co/papers/2604.14164) and includes the citation.

Files changed (1)
  1. README.md +33 -3
README.md CHANGED
@@ -1,4 +1,34 @@
- This tool is developed as part of our work
- [*How to Fine-Tune a Reasoning Model? A Teacher–Student Cooperation Framework to Synthesize Student-Consistent SFT Data*](https://github.com/CoopReason/TESSY/blob/main/paper/TESSY.pdf).

- For more details, please refer to the [GitHub repository](https://github.com/CoopReason/TESSY/).
+ ---
+ library_name: transformers
+ pipeline_tag: text-classification
+ ---

+ # TESSY: Boundary Predictor
+
+ This model is a boundary predictor developed as part of the **TESSY** (Teacher–Student Cooperation Data Synthesis) framework, as presented in the paper:
+ [*How to Fine-Tune a Reasoning Model? A Teacher–Student Cooperation Framework to Synthesize Student-Consistent SFT Data*](https://huggingface.co/papers/2604.14164).
+
+ ## Model Description
+
+ TESSY is a framework designed to bridge the stylistic gap between teacher-generated synthetic data and the student model's distribution. It interleaves teacher and student models to alternately generate style and non-style tokens.
+
+ The **Boundary Predictor** is a critical component of this system; it identifies the transitions between reasoning steps (capability tokens) and non-reasoning stylistic content (style tokens) within a given problem. This allows the framework to determine when to switch between the teacher model (for reasoning) and the student model (for style).
+
+ ## Links
+
+ - **Paper:** [arXiv:2604.14164](https://huggingface.co/papers/2604.14164)
+ - **Code:** [GitHub - CoopReason/TESSY](https://github.com/CoopReason/TESSY)
+ - **Dataset:** [TESSY-Code-80K](https://huggingface.co/datasets/CoopReason/TESSY-Code-80K)
+
+ ## Citation
+
+ If you find this work useful, please cite:
+
+ ```bibtex
+ @article{TESSY,
+ title={How to Fine-Tune a Reasoning Model? A Teacher--Student Cooperation Framework to Synthesize Student-Consistent SFT Data},
+ author={Huang, Zixian and Yang, Kaichen and Huang, Xu and Hao, Feiyang and Ge, Qiming and Li, Bowen and Du, He and Chen, Kai and Guo, Qipeng},
+ journal={arXiv preprint arXiv:2604.14164},
+ year={2026}
+ }
+ ```
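
Reviewer note: the model card says the Boundary Predictor decides when to switch between the teacher (reasoning) and the student (style). A minimal sketch of that routing idea follows. It assumes the predictor yields one label per token; the label names `capability` and `style`, the `ROUTE` table, and the `route_spans` helper are hypothetical illustrations, not the TESSY implementation or the model's actual output format.

```python
from itertools import groupby

# Hypothetical label names: the model card does not state the predictor's label set.
ROUTE = {"capability": "teacher", "style": "student"}


def route_spans(tokens, labels):
    """Group consecutive tokens that share a label and name the generator for each span."""
    if len(tokens) != len(labels):
        raise ValueError("tokens and labels must have the same length")
    spans = []
    # groupby only merges adjacent items, so every label change starts a new span.
    for label, group in groupby(zip(tokens, labels), key=lambda pair: pair[1]):
        text = " ".join(token for token, _ in group)
        spans.append((ROUTE[label], text))
    return spans


# Toy input with hand-written labels standing in for predictor output.
tokens = ["Okay,", "so", "2", "+", "3", "=", "5", "Great,", "done."]
labels = ["style", "style"] + ["capability"] * 5 + ["style", "style"]

for generator, text in route_spans(tokens, labels):
    print(f"{generator}: {text}")
```

Each change of label marks a boundary, so the number of spans equals the number of hand-offs plus one. How the real framework generates each span is described in the paper and repository, not here.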