Qianhui19 commited on
Commit
4bfee4f
·
verified ·
1 Parent(s): 9452ff8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -69,13 +69,14 @@ The system integrates three core databases with differentiated deployment strate
69
  3. Import the Norman dataset(knowledge_database_input_iupac.csv) into FastGPT, configure the application, and export the API key.
70
  4. Create a custom Schema tool on Dify and configure SQL invocation rules.
71
  5. Deploy Pubchemlite_exposomics(PubChemLite_exposomics_20251226.csv) and Invitrodb_v4.3 databases to local SQL and test query connectivity.
 
72
 
73
  ### Inference Workflow
74
  1. Input the **IUPAC name** of the emerging contaminant into the Dify chat interface.
75
  2. The AI agent invokes:
76
  - FastGPT knowledge base for **usage classification** via FDA plugin.
77
  - Local SQL databases for **toxicity endpoints** via GPT-4o-generated SQL queries.
78
- 3. Receive the structured output (JSON format) containing usage category, toxicity endpoints, and corresponding AC50 values.
79
 
80
  ## Limitations
81
  1. The accuracy of annotations depends on the completeness of the Norman, Pubchemlite_exposomics, and Invitrodb_v4.3 databases; unrecorded emerging contaminants may return empty results.
 
69
  3. Import the Norman dataset(knowledge_database_input_iupac.csv) into FastGPT, configure the application, and export the API key.
70
  4. Create a custom Schema tool on Dify and configure SQL invocation rules.
71
  5. Deploy Pubchemlite_exposomics(PubChemLite_exposomics_20251226.csv) and Invitrodb_v4.3 databases to local SQL and test query connectivity.
72
+ 6. Create a backend program(step1_pubchemlite_invitrodb_to_dify_en.py) to connect Dify and the SQL databases.
73
 
74
  ### Inference Workflow
75
  1. Input the **IUPAC name** of the emerging contaminant into the Dify chat interface.
76
  2. The AI agent invokes:
77
  - FastGPT knowledge base for **usage classification** via FDA plugin.
78
  - Local SQL databases for **toxicity endpoints** via GPT-4o-generated SQL queries.
79
+ 3. Receive the structured output (JSON format) containing usage category and toxicity endpoints.
80
 
81
  ## Limitations
82
  1. The accuracy of annotations depends on the completeness of the Norman, Pubchemlite_exposomics, and Invitrodb_v4.3 databases; unrecorded emerging contaminants may return empty results.