Spaces:
Sleeping
Sleeping
| # Data Directory | |
| This directory contains the Grid Code documentation and processed data. | |
| ## Structure | |
| - `raw/` - Contains the original Grid Code PDF | |
| - `processed/` - Contains processed chunks and embeddings | |
| - `test/` - Contains test data and evaluation sets | |
| ## Grid Code PDF | |
| Place the Grid Code PDF file in the `raw/` directory with filename `grid_code.pdf`. | |
| ## Processing | |
| The data processing pipeline: | |
| 1. Loads PDF from raw/ | |
| 2. Splits into chunks | |
| 3. Generates embeddings | |
| 4. Stores processed data | |
| ## Test Data | |
| The test directory contains: | |
| - Sample questions and answers | |
| - Evaluation datasets | |
| - Test PDF segments |