transformers-community
/

sink_cache

custom_generate

Model card Files Files and versions

joaogante commited on May 22, 2025

Commit

2a0ad2b

·

1 Parent(s): 835a8a6

add WIP

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -4,6 +4,8 @@ tags:
   - custom_generate
 ---
 ## Description
 Implementation of the cache introduced in the [Attention Sinks paper](https://arxiv.org/abs/2309.17453). It allows the
 model to generate beyond the length of its context window, without losing fluency in the conversation. As it discards

   - custom_generate
 ---
+⚠️ WORK IN PROGRESS ⚠️
 ## Description
 Implementation of the cache introduced in the [Attention Sinks paper](https://arxiv.org/abs/2309.17453). It allows the
 model to generate beyond the length of its context window, without losing fluency in the conversation. As it discards