Update README.md
README.md
CHANGED
@@ -69,13 +69,14 @@ The system integrates three core databases with differentiated deployment strate
 3. Import the Norman dataset(knowledge_database_input_iupac.csv) into FastGPT, configure the application, and export the API key.
 4. Create a custom Schema tool on Dify and configure SQL invocation rules.
 5. Deploy Pubchemlite_exposomics(PubChemLite_exposomics_20251226.csv) and Invitrodb_v4.3 databases to local SQL and test query connectivity.
+6. Create a backend program(step1_pubchemlite_invitrodb_to_dify_en.py) to connect Dify and the SQL databases.
 
 ### Inference Workflow
 1. Input the **IUPAC name** of the emerging contaminant into the Dify chat interface.
 2. The AI agent invokes:
 - FastGPT knowledge base for **usage classification** via FDA plugin.
 - Local SQL databases for **toxicity endpoints** via GPT-4o-generated SQL queries.
-3. Receive the structured output (JSON format) containing usage category
+3. Receive the structured output (JSON format) containing usage category and toxicity endpoints.
 
 ## Limitations
 1. The accuracy of annotations depends on the completeness of the Norman, Pubchemlite_exposomics, and Invitrodb_v4.3 databases; unrecorded emerging contaminants may return empty results.
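For readers of this change, the SQL side of the workflow (test connectivity, look up toxicity endpoints by IUPAC name, return JSON with the usage category and toxicity endpoints) might look roughly like the sketch below. It is not the contents of step1_pubchemlite_invitrodb_to_dify_en.py, which this diff does not show. It assumes an in-memory SQLite database as a stand-in for the local SQL deployment, and the table name `toxicity`, its columns, the sample rows, and all function names are hypothetical. The usage category would come from the FastGPT knowledge base, so here it is passed in as a plain argument.

```python
import json
import sqlite3


def open_database() -> sqlite3.Connection:
    """Create an in-memory SQLite database with one hypothetical table.

    The table and its rows are placeholders, not real PubChemLite or
    invitrodb records.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE toxicity (iupac_name TEXT, endpoint TEXT, value TEXT)"
    )
    conn.executemany(
        "INSERT INTO toxicity VALUES (?, ?, ?)",
        [
            ("example-compound", "example_endpoint_a", "active"),
            ("example-compound", "example_endpoint_b", "inactive"),
        ],
    )
    conn.commit()
    return conn


def check_connectivity(conn: sqlite3.Connection) -> bool:
    """Run a trivial query to confirm the database answers."""
    return conn.execute("SELECT 1").fetchone() == (1,)


def lookup_endpoints(conn: sqlite3.Connection, iupac_name: str) -> list:
    """Fetch endpoints for one compound using a parameterized query."""
    rows = conn.execute(
        "SELECT endpoint, value FROM toxicity "
        "WHERE iupac_name = ? ORDER BY endpoint",
        (iupac_name,),
    ).fetchall()
    return [{"endpoint": endpoint, "value": value} for endpoint, value in rows]


def build_output(iupac_name: str, usage_category, endpoints: list) -> str:
    """Assemble the structured JSON answer (field names are assumptions)."""
    return json.dumps(
        {
            "iupac_name": iupac_name,
            "usage_category": usage_category,
            "toxicity_endpoints": endpoints,
        },
        ensure_ascii=False,
    )
```

A compound that is absent from the table yields an empty `toxicity_endpoints` list, which matches the limitation noted in the README that unrecorded contaminants may return empty results. Binding the name as a query parameter, rather than formatting it into the SQL string, is a design choice of this sketch and not something the README states.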