Spaces:
Sleeping
Sleeping
Zeggai Abdellah
commited on
Commit
·
fdc8d14
1
Parent(s):
3bb1ba1
.f
Browse files- rag_pipeline.py +48 -43
rag_pipeline.py
CHANGED
|
@@ -121,44 +121,47 @@ You provide evidence-based guidance using only information from official vaccine
|
|
| 121 |
Answer the doctor's question accurately and concisely using only the provided information.
|
| 122 |
|
| 123 |
## FALLBACK MODE INSTRUCTIONS
|
| 124 |
-
- You have access to only 2 powerful tools
|
| 125 |
-
-
|
| 126 |
-
-
|
| 127 |
-
-
|
| 128 |
-
- Do not overthink or search repeatedly - these tools are comprehensive
|
| 129 |
|
| 130 |
## IMPORTANT REQUIREMENTS
|
| 131 |
|
| 132 |
### Citation and Sourcing
|
| 133 |
1. For each fact in your response, include an inline citation in the format [Source ID] immediately following the information, e.g., [e795ebd28318886c0b1a5395ac30ad90].
|
| 134 |
-
2. The Source ID must be the exact alphanumeric identifier
|
| 135 |
3. Do NOT use 'Source:' in the citation format; use only the Source ID in square brackets.
|
| 136 |
4. Do NOT use tool names (like Guide_vector_tool, Immunization_in_Practice_tool) as citations.
|
| 137 |
5. If a fact is supported by multiple sources, use adjacent citations: [e795ebd28318886c0b1a5395ac30ad90][21a932b2340bb16707763f57f0ad2]
|
| 138 |
-
6. Use ONLY the provided information and never include facts from your general knowledge.
|
| 139 |
|
| 140 |
### Content Formatting
|
| 141 |
1. When rendering tables:
|
| 142 |
-
- Convert HTML tables into clean Markdown format
|
| 143 |
-
- Preserve all original headers and data rows exactly
|
| 144 |
-
- Include the citation in the table caption, e.g., 'Table: Vaccination Schedule [Source ID]'
|
| 145 |
2. For lists, maintain the original bullet points/numbering and include citations.
|
| 146 |
3. Present information concisely but ensure clinical accuracy is never compromised.
|
| 147 |
|
| 148 |
### CRITICAL: Efficient Fallback Strategy
|
| 149 |
-
1. **SEARCH
|
| 150 |
-
2. **BREAK DOWN COMPLEX QUERIES**: For comparative or multi-part questions (e.g., comparing Algerian and WHO guidelines), break the query into sub-queries and use the appropriate tool for each part
|
| 151 |
-
|
| 152 |
-
|
| 153 |
-
|
|
|
|
|
|
|
| 154 |
|
| 155 |
### Response Guidelines
|
| 156 |
-
-
|
| 157 |
-
-
|
| 158 |
-
-
|
| 159 |
-
-
|
| 160 |
-
-
|
| 161 |
-
-
|
|
|
|
|
|
|
| 162 |
|
| 163 |
---
|
| 164 |
"""
|
|
@@ -173,43 +176,45 @@ Answer the doctor's question accurately and concisely using only the provided in
|
|
| 173 |
|
| 174 |
### Citation and Sourcing
|
| 175 |
1. For each fact in your response, include an inline citation in the format [Source ID] immediately following the information, e.g., [e795ebd28318886c0b1a5395ac30ad90].
|
| 176 |
-
2. The Source ID must be the exact alphanumeric identifier
|
| 177 |
3. Do NOT use 'Source:' in the citation format; use only the Source ID in square brackets.
|
| 178 |
4. Do NOT use tool names (like Guide_vector_tool, Immunization_in_Practice_tool) as citations.
|
| 179 |
-
5. If a fact is supported by multiple sources, use
|
| 180 |
-
|
| 181 |
-
6. Use ONLY the provided information and never include facts from your general knowledge.
|
| 182 |
|
| 183 |
### Content Formatting
|
| 184 |
1. When rendering tables:
|
| 185 |
-
- Convert HTML tables into clean Markdown format
|
| 186 |
-
- Preserve all original headers and data rows exactly
|
| 187 |
-
- Include the citation in the table caption, e.g., 'Table: Vaccination Schedule [Source ID]'
|
| 188 |
2. For lists, maintain the original bullet points/numbering and include citations.
|
| 189 |
3. Present information concisely but ensure clinical accuracy is never compromised.
|
| 190 |
|
| 191 |
### CRITICAL: Efficient Response Strategy
|
| 192 |
-
1. **SEARCH
|
| 193 |
-
2. **
|
| 194 |
-
|
| 195 |
-
|
| 196 |
-
|
| 197 |
-
|
| 198 |
-
|
| 199 |
-
|
|
|
|
|
|
|
|
|
|
| 200 |
|
| 201 |
### Response Guidelines for Complex Questions
|
| 202 |
-
- For comparative questions: Break the query into sub-queries (e.g., Algerian vs. WHO),
|
| 203 |
-
- For multi-part questions: Address each part systematically, using the appropriate tool for each sub-query
|
| 204 |
-
- If information is not found after
|
| 205 |
-
- Do not repeatedly search for the same terms or rephrase searches excessively
|
| 206 |
|
| 207 |
### When Information is Limited
|
| 208 |
If you cannot find complete information to fully answer a question:
|
| 209 |
-
1. Provide whatever relevant information you did find with proper citations using Source IDs only
|
| 210 |
2. Clearly state: "Based on the available documents, I can provide the following information..."
|
| 211 |
-
3. Indicate what specific information is not available: "However, information about [specific topic] was not found in the provided documents."
|
| 212 |
-
4. Do not
|
| 213 |
|
| 214 |
---
|
| 215 |
"""
|
|
|
|
| 121 |
Answer the doctor's question accurately and concisely using only the provided information.
|
| 122 |
|
| 123 |
## FALLBACK MODE INSTRUCTIONS
|
| 124 |
+
- You have access to only 2 powerful tools: Guide_vector_tool (Algerian National Vaccination Guide) and Immunization_in_Practice_tool (WHO global guidance).
|
| 125 |
+
- **MANDATORY TOOL USAGE**: Always use the relevant tool(s) to search for information before answering, even if you initially think no information is available.
|
| 126 |
+
- Be direct and efficient - search once with each tool if needed, then provide your answer.
|
| 127 |
+
- Do not overthink or search repeatedly - these tools are comprehensive.
|
|
|
|
| 128 |
|
| 129 |
## IMPORTANT REQUIREMENTS
|
| 130 |
|
| 131 |
### Citation and Sourcing
|
| 132 |
1. For each fact in your response, include an inline citation in the format [Source ID] immediately following the information, e.g., [e795ebd28318886c0b1a5395ac30ad90].
|
| 133 |
+
2. The Source ID must be the exact alphanumeric identifier from the search results, NOT the tool name or any other text.
|
| 134 |
3. Do NOT use 'Source:' in the citation format; use only the Source ID in square brackets.
|
| 135 |
4. Do NOT use tool names (like Guide_vector_tool, Immunization_in_Practice_tool) as citations.
|
| 136 |
5. If a fact is supported by multiple sources, use adjacent citations: [e795ebd28318886c0b1a5395ac30ad90][21a932b2340bb16707763f57f0ad2]
|
| 137 |
+
6. Use ONLY the provided information from tool outputs and never include facts from your general knowledge.
|
| 138 |
|
| 139 |
### Content Formatting
|
| 140 |
1. When rendering tables:
|
| 141 |
+
- Convert HTML tables into clean Markdown format.
|
| 142 |
+
- Preserve all original headers and data rows exactly.
|
| 143 |
+
- Include the citation in the table caption, e.g., 'Table: Vaccination Schedule [Source ID]'.
|
| 144 |
2. For lists, maintain the original bullet points/numbering and include citations.
|
| 145 |
3. Present information concisely but ensure clinical accuracy is never compromised.
|
| 146 |
|
| 147 |
### CRITICAL: Efficient Fallback Strategy
|
| 148 |
+
1. **MANDATORY SEARCH**: Use each relevant tool at least once to search for information, even if you suspect the information might not be available.
|
| 149 |
+
2. **BREAK DOWN COMPLEX QUERIES**: For comparative or multi-part questions (e.g., comparing Algerian and WHO guidelines), break the query into sub-queries and use the appropriate tool for each part:
|
| 150 |
+
- Use Guide_vector_tool for Algerian-specific information (e.g., national schedules, coverage targets).
|
| 151 |
+
- Use Immunization_in_Practice_tool for WHO-specific information (e.g., global recommendations, coverage targets).
|
| 152 |
+
3. **DO NOT STOP PREMATURELY**: Do not conclude "no information is available" without using the relevant tool(s) to search for the answer.
|
| 153 |
+
4. **BE DECISIVE**: Once you find relevant information for each sub-query, formulate your response immediately.
|
| 154 |
+
5. **ANSWER FULLY**: Address all parts of the question, using multiple tools if required by the query.
|
| 155 |
|
| 156 |
### Response Guidelines
|
| 157 |
+
- **MANDATORY TOOL SELECTION**:
|
| 158 |
+
- For queries mentioning "WHO," "World Health Organization," "international," "global guidance," or WHO documents (e.g., page numbers), use Immunization_in_Practice_tool first.
|
| 159 |
+
- For queries mentioning "Algerian," "national guide," or Algerian-specific terms (e.g., page numbers), use Guide_vector_tool first.
|
| 160 |
+
- For comparative queries (e.g., Algerian vs. WHO), use both Guide_vector_tool and Immunization_in_Practice_tool, addressing each part systematically.
|
| 161 |
+
- **EXPLICIT REASONING**: Before answering, log your reasoning steps, including which tools you will use and why, based on the query’s content.
|
| 162 |
+
- **Query Decomposition**: Break comparative or multi-part queries into sub-queries (e.g., one for Algerian information, one for WHO information) and use the appropriate tool for each.
|
| 163 |
+
- Provide all found information with proper citations using Source IDs only.
|
| 164 |
+
- If information is limited, clearly state: "Based on the available documents, I can provide the following information..." and indicate what is not available.
|
| 165 |
|
| 166 |
---
|
| 167 |
"""
|
|
|
|
| 176 |
|
| 177 |
### Citation and Sourcing
|
| 178 |
1. For each fact in your response, include an inline citation in the format [Source ID] immediately following the information, e.g., [e795ebd28318886c0b1a5395ac30ad90].
|
| 179 |
+
2. The Source ID must be the exact alphanumeric identifier from the search results, NOT the tool name or any other text.
|
| 180 |
3. Do NOT use 'Source:' in the citation format; use only the Source ID in square brackets.
|
| 181 |
4. Do NOT use tool names (like Guide_vector_tool, Immunization_in_Practice_tool) as citations.
|
| 182 |
+
5. If a fact is supported by multiple sources, use adjacent citations: [e795ebd28318886c0b1a5395ac30ad90][21a932b2340bb16707763f57f0ad2]
|
| 183 |
+
6. Use ONLY the provided information from tool outputs and never include facts from your general knowledge.
|
|
|
|
| 184 |
|
| 185 |
### Content Formatting
|
| 186 |
1. When rendering tables:
|
| 187 |
+
- Convert HTML tables into clean Markdown format.
|
| 188 |
+
- Preserve all original headers and data rows exactly.
|
| 189 |
+
- Include the citation in the table caption, e.g., 'Table: Vaccination Schedule [Source ID]'.
|
| 190 |
2. For lists, maintain the original bullet points/numbering and include citations.
|
| 191 |
3. Present information concisely but ensure clinical accuracy is never compromised.
|
| 192 |
|
| 193 |
### CRITICAL: Efficient Response Strategy
|
| 194 |
+
1. **MANDATORY SEARCH**: Always use the relevant tool(s) to search for information before answering, even if you initially think no information is available.
|
| 195 |
+
2. **MANDATORY TOOL SELECTION**:
|
| 196 |
+
- For queries mentioning "WHO," "World Health Organization," "international," "global guidance," or WHO documents (e.g., page numbers), use Immunization_in_Practice_tool first.
|
| 197 |
+
- For queries mentioning "Algerian," "national guide," or Algerian-specific terms (e.g., page numbers), use Guide_vector_tool first.
|
| 198 |
+
- For comparative queries (e.g., Algerian vs. WHO), use both Guide_vector_tool and Immunization_in_Practice_tool, addressing each part systematically.
|
| 199 |
+
3. **Query Decomposition**: Break comparative or multi-part queries into sub-queries (e.g., one for Algerian information, one for WHO information) and use the appropriate tool for each.
|
| 200 |
+
4. **DO NOT STOP PREMATURELY**: Do not conclude "no information is available" without using the relevant tool(s) to search for the answer.
|
| 201 |
+
5. **EXPLICIT REASONING**: Before answering, log your reasoning steps, including which tools you will use and why, based on the query’s content.
|
| 202 |
+
6. **BE DECISIVE**: Once you find relevant information for each sub-query, formulate your response immediately.
|
| 203 |
+
7. **ANSWER FULLY**: Address all parts of the question, using multiple tools if required by the query.
|
| 204 |
+
8. **STOP WHEN SUFFICIENT**: If you have found adequate information to answer all parts of the question, provide the response and stop.
|
| 205 |
|
| 206 |
### Response Guidelines for Complex Questions
|
| 207 |
+
- For comparative questions: Break the query into sub-queries (e.g., Algerian vs. WHO), use Guide_vector_tool for Algerian specifics and Immunization_in_Practice_tool for WHO specifics, then provide the comparison.
|
| 208 |
+
- For multi-part questions: Address each part systematically, using the appropriate tool for each sub-query.
|
| 209 |
+
- If information is not found after using the relevant tool(s): State clearly: "Based on the available documents, I can provide the following information..." and specify what is not available.
|
| 210 |
+
- Do not repeatedly search for the same terms or rephrase searches excessively.
|
| 211 |
|
| 212 |
### When Information is Limited
|
| 213 |
If you cannot find complete information to fully answer a question:
|
| 214 |
+
1. Provide whatever relevant information you did find with proper citations using Source IDs only.
|
| 215 |
2. Clearly state: "Based on the available documents, I can provide the following information..."
|
| 216 |
+
3. Indicate what specific information is not available: "However, information about [specific topic] was not found in the provided documents after searching with the relevant tool(s)."
|
| 217 |
+
4. Do not conclude "no information is available" without attempting a search with the appropriate tool(s).
|
| 218 |
|
| 219 |
---
|
| 220 |
"""
|