Update README.md
README.md CHANGED

@@ -203,7 +203,7 @@ for i, (c, t) in enumerate(zip(chunks, token_pos)):
     print(c)
 ```
 ## Experimental
-The following script supports specifying max tokens per chunk
+The following script supports specifying a maximum number of tokens per chunk. If `max_tokens_per_chunk` is specified, the text is forced to split at the best available position seen so far whenever a chunk is about to exceed `max_tokens_per_chunk` and no token satisfies `prob_threshold`; if `max_tokens_per_chunk` is `None`, it behaves the same as above. This script can be seen as a new experimental version of the scripts above.
 ```python
 import torch
 from transformers import AutoTokenizer, BertForTokenClassification
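The forced-split rule described in the new paragraph can be sketched as follows. This is a minimal toy helper, assuming the model's per-token split probabilities are already at hand; `chunk_ends` and its signature are illustrative, not the script's actual API:

```python
def chunk_ends(split_probs, prob_threshold, max_tokens_per_chunk=None):
    """Return the end indices of chunks.

    split_probs[i] is the probability that a chunk may end after token i.
    A chunk normally ends at the first token whose probability reaches
    prob_threshold. If max_tokens_per_chunk is set and the chunk is about
    to exceed it with no token passing the threshold, the chunk is forced
    to end at the highest-probability position seen so far.
    """
    ends = []
    start = 0            # first token of the current chunk
    i = start
    best_pos, best_prob = start, -1.0
    while i < len(split_probs):
        p = split_probs[i]
        # Track the best fallback split position within the current chunk.
        if p > best_prob:
            best_pos, best_prob = i, p
        if p >= prob_threshold:
            # Normal split: this token satisfies the threshold.
            ends.append(i)
            start = i + 1
        elif max_tokens_per_chunk is not None and i - start + 1 >= max_tokens_per_chunk:
            # Forced split: no token satisfied prob_threshold, so fall
            # back to the best position seen so far in this chunk.
            ends.append(best_pos)
            start = best_pos + 1
        else:
            i += 1
            continue
        # A split happened: restart scanning from the new chunk start.
        i = start
        best_pos, best_prob = start, -1.0
    if start < len(split_probs):
        # Remaining tokens form the final chunk.
        ends.append(len(split_probs) - 1)
    return ends
```

With `max_tokens_per_chunk=None` only the threshold rule applies, matching the earlier scripts; with a limit set, an over-long chunk is cut at its most probable boundary even when that probability is below `prob_threshold`.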