Spaces:
Sleeping
Sleeping
A newer version of the Streamlit SDK is available:
1.54.0
Data Directory
This directory contains the Grid Code documentation and processed data.
Structure
raw/- Contains the original Grid Code PDFprocessed/- Contains processed chunks and embeddingstest/- Contains test data and evaluation sets
Grid Code PDF
Place the Grid Code PDF file in the raw/ directory with filename grid_code.pdf.
Processing
The data processing pipeline:
- Loads PDF from raw/
- Splits into chunks
- Generates embeddings
- Stores processed data
Test Data
The test directory contains:
- Sample questions and answers
- Evaluation datasets
- Test PDF segments