Spaces:
Sleeping
Sleeping
Data Directory
This directory contains the Grid Code documentation and processed data.
Structure
raw/
- Contains the original Grid Code PDFprocessed/
- Contains processed chunks and embeddingstest/
- Contains test data and evaluation sets
Grid Code PDF
Place the Grid Code PDF file in the raw/
directory with filename grid_code.pdf
.
Processing
The data processing pipeline:
- Loads PDF from raw/
- Splits into chunks
- Generates embeddings
- Stores processed data
Test Data
The test directory contains:
- Sample questions and answers
- Evaluation datasets
- Test PDF segments