# Data Directory
This directory contains the Grid Code documentation and processed data.
## Structure
- `raw/` - Contains the original Grid Code PDF
- `processed/` - Contains processed chunks and embeddings
- `test/` - Contains test data and evaluation sets
## Grid Code PDF
Place the Grid Code PDF file in the `raw/` directory with filename `grid_code.pdf`.
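
Before running anything else, it can help to confirm that the layout described above is in place. The following is a minimal sketch, not part of the project itself: `check_data_dir` is a hypothetical helper name, and it assumes only the three subdirectories and the `grid_code.pdf` filename stated in this README.

```python
from pathlib import Path

# Subdirectories named in this README.
EXPECTED_SUBDIRS = ("raw", "processed", "test")
# Filename this README asks for inside raw/.
PDF_NAME = "grid_code.pdf"


def check_data_dir(data_dir):
    """Return a list of problems found; the list is empty if the layout looks right.

    Hypothetical helper for illustration, not an existing project function.
    """
    root = Path(data_dir)
    problems = []
    for name in EXPECTED_SUBDIRS:
        if not (root / name).is_dir():
            problems.append(f"missing directory: {name}/")
    if not (root / "raw" / PDF_NAME).is_file():
        problems.append(f"missing file: raw/{PDF_NAME}")
    return problems


if __name__ == "__main__":
    # Assumes this script is saved inside the data directory itself.
    for problem in check_data_dir(Path(__file__).parent):
        print(problem)
```

The check only tests for presence. It does not verify that the file is a valid PDF.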
## Processing
The data processing pipeline runs four steps:
1. Loads the Grid Code PDF from `raw/`
2. Splits the text into chunks
3. Generates an embedding for each chunk
4. Stores the processed data in `processed/`
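
The four steps above can be sketched roughly as follows. This is an illustration under stated assumptions, not the project's actual implementation:

- PDF loading assumes the third-party `pypdf` package.
- The chunk size and overlap are arbitrary.
- `embed` is a toy hashed bag-of-words stand-in for whatever embedding model the project really uses.
- The output filename `chunks.json` is hypothetical.

```python
import hashlib
import json
import math
from pathlib import Path


def load_pdf_text(pdf_path):
    """Step 1: extract text from the PDF (assumes the third-party pypdf package)."""
    from pypdf import PdfReader  # imported lazily so the other steps run without it

    reader = PdfReader(str(pdf_path))
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def split_into_chunks(text, chunk_size=500, overlap=50):
    """Step 2: fixed-size character windows; neighbours share `overlap` characters."""
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")
    step = chunk_size - overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # this window already reached the end of the text
        start += step
    return chunks


def embed(chunk, dim=64):
    """Step 3 stand-in: hashed bag-of-words vector, L2-normalised.

    Assumption: a real pipeline would call an embedding model here instead.
    """
    vec = [0.0] * dim
    for token in chunk.lower().split():
        digest = hashlib.sha256(token.encode("utf-8")).digest()
        vec[int.from_bytes(digest[:4], "big") % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec


def store(chunks, vectors, out_path):
    """Step 4: write chunks and their vectors side by side as JSON."""
    records = [
        {"id": i, "text": c, "embedding": v}
        for i, (c, v) in enumerate(zip(chunks, vectors))
    ]
    out_path = Path(out_path)
    out_path.parent.mkdir(parents=True, exist_ok=True)
    out_path.write_text(json.dumps(records), encoding="utf-8")
    return len(records)


def run_pipeline(data_dir):
    """Run steps 1-4; the output filename chunks.json is a hypothetical choice."""
    root = Path(data_dir)
    text = load_pdf_text(root / "raw" / "grid_code.pdf")
    chunks = split_into_chunks(text)
    vectors = [embed(c) for c in chunks]
    return store(chunks, vectors, root / "processed" / "chunks.json")
```

Calling `run_pipeline(".")` from inside this directory would read `raw/grid_code.pdf` and write one JSON file under `processed/`. Overlapping windows are used so that a sentence cut at a chunk boundary still appears whole in one of the two neighbouring chunks.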
## Test Data
The test directory contains:
- Sample questions and answers
- Evaluation datasets
- Test PDF segments |