| license: mit | |
| library_name: transformers | |
| pipeline_tag: text-generation | |
| Follwoing LUFFY, we change to rope_theta from 10000 to 40000 and extend the context window to 16k. | |
| license: mit | |
| library_name: transformers | |
| pipeline_tag: text-generation | |
| Follwoing LUFFY, we change to rope_theta from 10000 to 40000 and extend the context window to 16k. | |