Update README.md
README.md
CHANGED
|
@@ -1,40 +1,68 @@
|
|
| 1 |
---
|
| 2 |
license: mit
|
| 3 |
---
|
| 4 |
-
|
| 5 |
|
| 6 |
1. Click "Open Text File" to select your text file.
|
| 7 |
2. Click "Identify Quotes" to process the file.
|
| 8 |
3. A new window will open showing the text with identified quotes highlighted.
|
| 9 |
|
| 10 |
## Included Files
|
| 11 |
|
| 12 |
-
-
|
| 13 |
-
-
|
| 14 |
-
-
|
| 15 |
-
-
|
| 16 |
|
| 17 |
-
## Hugging Face Repository
|
| 18 |
|
| 19 |
-
|
| 20 |
-
|
| 21 |
-
|
| 22 |
-
|
| 23 |
-
3. The `quote_identifier.py` script
|
| 24 |
-
4. This README
|
| 25 |
|
| 26 |
## Notes
|
| 27 |
|
| 28 |
-
-
|
| 29 |
-
- The program
|
| 30 |
-
- π
|
| 31 |
|
| 32 |
## Troubleshooting
|
| 33 |
|
| 34 |
-
If you encounter
|
| 35 |
-
|
| 36 |
-
|
| 37 |
-
|
| 38 |
-
- Verify that you're using Python 3.10.
|
| 39 |
-
|
| 40 |
-
For further assistance, please open an issue in the GitHub repository.
|
| 1 |
---
|
| 2 |
license: mit
|
| 3 |
---
|
| 4 |
+
# Quote Identifier
|
| 5 |
|
| 6 |
+
A program that identifies quotes in text documents using a BERT-based model.
|
| 7 |
+
|
| 8 |
+
## Requirements
|
| 9 |
+
|
| 10 |
+
- Python 3.10
|
| 11 |
+
- pip (Python package installer)
|
| 12 |
+
|
| 13 |
+
## Installation
|
| 14 |
+
|
| 15 |
+
1. Clone this repository:
|
| 16 |
+
|
| 17 |
+
```
|
| 18 |
+
git clone [your-repo-url]
|
| 19 |
+
cd [your-repo-name]
|
| 20 |
+
```
|
| 21 |
+
|
| 22 |
+
2. Install the required packages:
|
| 23 |
+
|
| 24 |
+
```
|
| 25 |
+
pip install pandas torch transformers tqdm
|
| 26 |
+
```
|
| 27 |
+
|
| 28 |
+
Note: If you have a CUDA-capable GPU, visit https://pytorch.org for the appropriate PyTorch installation command.
|
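To confirm which device PyTorch will actually see after installation, a short check like the one below may help. It is a minimal sketch, not part of `quote_identifier.py`: the helper name `pick_device` is hypothetical, and whether the script itself selects a GPU automatically is not stated here.

```python
import importlib.util


def pick_device() -> str:
    """Return "cuda" when PyTorch is installed and can see a GPU, else "cpu"."""
    if importlib.util.find_spec("torch") is None:
        # PyTorch is not installed, so there is nothing to probe.
        return "cpu"
    import torch  # imported lazily so the check also runs without PyTorch

    return "cuda" if torch.cuda.is_available() else "cpu"


if __name__ == "__main__":
    print(f"Inference device: {pick_device()}")
```

If this prints `cpu` on a machine with a CUDA-capable GPU, the installed PyTorch build is probably a CPU-only one.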
| 29 |
+
|
| 30 |
+
## Usage
|
| 31 |
+
|
| 32 |
+
Run the program with:
|
| 33 |
+
|
| 34 |
+
```
|
| 35 |
+
python quote_identifier.py
|
| 36 |
+
```
|
| 37 |
+
|
| 38 |
+
GUI Instructions:
|
| 39 |
1. Click "Open Text File" to select your text file.
|
| 40 |
2. Click "Identify Quotes" to process the file.
|
| 41 |
3. A new window will open showing the text with identified quotes highlighted.
|
| 42 |
|
| 43 |
## Included Files
|
| 44 |
|
| 45 |
+
- quote_identifier.py: Main Python script
|
| 46 |
+
- quotation_identifer_model/: Directory containing the pre-trained model
|
| 47 |
+
  - checkpoint-1000/: Model checkpoint
|
| 48 |
+
- sample_book.txt: Sample text file for testing
|
| 49 |
|
| 50 |
+
## Hugging Face Repository Contents
|
| 51 |
|
| 52 |
+
1. Pre-trained quote identification model
|
| 53 |
+
2. sample_book.txt
|
| 54 |
+
3. quote_identifier.py script
|
| 55 |
+
4. This README
|
| 56 |
|
| 57 |
## Notes
|
| 58 |
|
| 59 |
+
- Ensure the local model directory ./quotation_identifer_model/checkpoint-1000/ is present.
|
| 60 |
+
- The program creates a BERT_infrence_quote_input.csv file when processing text.
|
| 61 |
+
- Use the "Toggle Dark Mode" button to switch between light and dark themes.
|
| 62 |
|
| 63 |
## Troubleshooting
|
| 64 |
|
| 65 |
+
If you encounter issues:
|
| 66 |
+
1. Verify all required packages are correctly installed.
|
| 67 |
+
2. Check that the model directory is present and contains the necessary files.
|
| 68 |
+
3. Confirm you're using Python 3.10.
|
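The three troubleshooting checks can also be run as a small script. The following is a minimal sketch under stated assumptions: the function name `check_environment` is hypothetical, the package list is taken from the `pip install` command above, and because the README does not say which model files are "necessary", the sketch only checks that the checkpoint directory exists and is not empty.

```python
import importlib.util
import sys
from pathlib import Path

# Packages named in the installation step of this README.
REQUIRED_PACKAGES = ("pandas", "torch", "transformers", "tqdm")
# Model location named in the Notes section (spelling as in the repository).
MODEL_DIR = Path("./quotation_identifer_model/checkpoint-1000")


def check_environment(model_dir: Path = MODEL_DIR) -> list[str]:
    """Return a list of problems found; an empty list means all checks passed."""
    problems = []

    # 1. Are the required packages importable?
    for name in REQUIRED_PACKAGES:
        if importlib.util.find_spec(name) is None:
            problems.append(f"missing package: {name}")

    # 2. Is the model directory present and non-empty?
    #    (Assumption: the exact set of required files is not documented.)
    if not model_dir.is_dir():
        problems.append(f"model directory not found: {model_dir}")
    elif not any(model_dir.iterdir()):
        problems.append(f"model directory is empty: {model_dir}")

    # 3. Is this Python 3.10?
    major, minor = sys.version_info[:2]
    if (major, minor) != (3, 10):
        problems.append(f"Python 3.10 expected, found {major}.{minor}")

    return problems


if __name__ == "__main__":
    issues = check_environment()
    for issue in issues:
        print(f"[!] {issue}")
    if not issues:
        print("All checks passed.")
```

Run it from the repository root so the relative model path resolves the same way it does for `quote_identifier.py`.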