Update README.md
Browse files
README.md
CHANGED
|
@@ -7,6 +7,8 @@ pipeline_tag: sentence-similarity
|
|
| 7 |
---
|
| 8 |
# BertChunker
|
| 9 |
|
|
|
|
|
|
|
| 10 |
## Introduction
|
| 11 |
|
| 12 |
BertChunker is an end-to-end trained chunker for chunking text for RAG. It was trained based on [MiniLM-L6-H384-uncased](https://huggingface.co/nreimers/MiniLM-L6-H384-uncased) with an adapter. The whole training lasted for 10 minutes on a Nvidia P40 GPU on a 50 MB synthetized dataset.
|
|
|
|
| 7 |
---
|
| 8 |
# BertChunker
|
| 9 |
|
| 10 |
+
[Paper](https://github.com/jackfsuia/BertChunker/blob/main/main.pdf)
|
| 11 |
+
|
| 12 |
## Introduction
|
| 13 |
|
| 14 |
BertChunker is an end-to-end trained chunker for chunking text for RAG. It was trained based on [MiniLM-L6-H384-uncased](https://huggingface.co/nreimers/MiniLM-L6-H384-uncased) with an adapter. The whole training lasted for 10 minutes on a Nvidia P40 GPU on a 50 MB synthetized dataset.
|