Spaces:
Runtime error
Runtime error
| ## Model Formats | |
| The model is pretrained on code and is formatted with special tokens in addition to the pure code data,\ | |
| such as prefixes specifying the source of the file or tokens separating code from a commit message.\ | |
| Use these templates to explore the model's capacities: | |
| ### 1. Prefixes ๐ท๏ธ | |
| For pure code files, use any combination of the following prefixes: | |
| ``` | |
| <reponame>REPONAME<filename>FILENAME<gh_stars>STARS\ncode<|endoftext|> | |
| ``` | |
| STARS can be one of: 0, 1-10, 10-100, 100-1000, 1000+ | |
| ### 2. Commits ๐พ | |
| The commits data is formatted as follows: | |
| ``` | |
| <commit_before>code<commit_msg>text<commit_after>code<|endoftext|> | |
| ``` | |
| ### 3. Jupyter Notebooks ๐ | |
| The model is trained on Jupyter notebooks as Python scripts and structured formats like: | |
| ``` | |
| <start_jupyter><jupyter_text>text<jupyter_code>code<jupyter_output>output<jupyter_text> | |
| ``` | |
| ### 4. Issues ๐ | |
| We also trained on GitHub issues using the following formatting: | |
| ``` | |
| <issue_start><issue_comment>text<issue_comment>...<issue_closed> | |
| ``` | |
| ### 5. Fill-in-the-middle ๐งฉ | |
| Fill in the middle requires rearranging the model inputs. The playground handles this for you - all you need is to specify where to fill: | |
| ``` | |
| code before<FILL_HERE>code after | |
| ``` | |