santanche commited on
Commit
f369a7d
·
1 Parent(s): c4d8c51

refactor (model): replacing phi3 by phi4-mini

Browse files
CONFIGURATION_GUIDE.md CHANGED
@@ -76,7 +76,7 @@ The configuration file includes:
76
  {
77
  "title": "SQL Generator",
78
  "prompt": "Generate SQL for: {question}\nSchema: {schema}",
79
- "model": "phi3",
80
  "subscribeTopic": "START",
81
  "publishTopic": "SQL_GENERATED",
82
  "showResult": true
@@ -351,7 +351,7 @@ config = {
351
  {
352
  "title": "SQL Generator",
353
  "prompt": "...",
354
- "model": "phi3",
355
  "subscribeTopic": "START",
356
  "publishTopic": "SQL",
357
  "showResult": True
@@ -440,7 +440,7 @@ merge_configs("pipeline-a.json", "pipeline-b.json", "merged-pipeline.json")
440
  ### Issue: Agents not working after load
441
 
442
  **Cause**: Model might not be available
443
- **Solution**: Check agent "model" field matches available models (phi3, cniongolo/biomistral)
444
 
445
  ### Issue: Topics not matching after load
446
 
 
76
  {
77
  "title": "SQL Generator",
78
  "prompt": "Generate SQL for: {question}\nSchema: {schema}",
79
+ "model": "phi4-mini",
80
  "subscribeTopic": "START",
81
  "publishTopic": "SQL_GENERATED",
82
  "showResult": true
 
351
  {
352
  "title": "SQL Generator",
353
  "prompt": "...",
354
+ "model": "phi4-mini",
355
  "subscribeTopic": "START",
356
  "publishTopic": "SQL",
357
  "showResult": True
 
440
  ### Issue: Agents not working after load
441
 
442
  **Cause**: Model might not be available
443
+ **Solution**: Check agent "model" field matches available models (phi4-mini, cniongolo/biomistral)
444
 
445
  ### Issue: Topics not matching after load
446
 
Dockerfile CHANGED
@@ -31,7 +31,7 @@ EXPOSE 7860
31
  ENV OLLAMA_HOST=0.0.0.0:11434
32
  ENV PYTHONUNBUFFERED=1
33
 
34
- # Create startup script - only phi3 and biomistral for free tier
35
  RUN echo '#!/bin/bash\n\
36
  set -e\n\
37
  \n\
@@ -42,8 +42,8 @@ OLLAMA_PID=$!\n\
42
  echo "Waiting for Ollama to be ready..."\n\
43
  sleep 10\n\
44
  \n\
45
- echo "Pulling phi3 model..."\n\
46
- ollama pull phi3\n\
47
  \n\
48
  echo "Pulling MedGemma model..."\n\
49
  ollama pull MedAIBase/MedGemma1.5:4b\n\
 
31
  ENV OLLAMA_HOST=0.0.0.0:11434
32
  ENV PYTHONUNBUFFERED=1
33
 
34
+ # Create startup script - only phi4-mini and biomistral for free tier
35
  RUN echo '#!/bin/bash\n\
36
  set -e\n\
37
  \n\
 
42
  echo "Waiting for Ollama to be ready..."\n\
43
  sleep 10\n\
44
  \n\
45
+ echo "Pulling phi4-mini model..."\n\
46
+ ollama pull phi4-mini\n\
47
  \n\
48
  echo "Pulling MedGemma model..."\n\
49
  ollama pull MedAIBase/MedGemma1.5:4b\n\
ENHANCEMENTS_SUMMARY.md CHANGED
@@ -95,7 +95,7 @@ Added complete configuration persistence:
95
  {
96
  "title": "SQL Generator",
97
  "prompt": "Generate SQL...",
98
- "model": "phi3",
99
  "subscribeTopic": "START",
100
  "publishTopic": "SQL",
101
  "showResult": true
 
95
  {
96
  "title": "SQL Generator",
97
  "prompt": "Generate SQL...",
98
+ "model": "phi4-mini",
99
  "subscribeTopic": "START",
100
  "publishTopic": "SQL",
101
  "showResult": true
EXAMPLES.md CHANGED
@@ -1,10 +1,10 @@
1
  # Pub/Sub Multi-Agent System - Example Configurations
2
 
3
- **Note**: This deployment includes **phi3** (general purpose), **MedAIBase/MedGemma1.5:4b** (medical/healthcare), and **deepseek-coder:1.3b** (coding) models.
4
 
5
  **Important**: Use `{question}` in prompts to access the user's question, `{schema}` for database schema, and `{input}` for messages from subscribed topics (all case insensitive).
6
 
7
- ## Example 1: Natural Language to SQL Pipeline (using phi3)
8
 
9
  ### Agent 1: Question Analyzer
10
  - **Title**: Question Analyzer
@@ -22,7 +22,7 @@ Analyze the question and identify:
22
 
23
  Respond with a clear analysis.
24
  ```
25
- - **Model**: phi3
26
  - **Subscribe Topic**: START
27
  - **Publish Topic**: QUESTION_ANALYZED
28
 
@@ -41,7 +41,7 @@ Original question: {question}
41
  Generate a SQL query that answers the user's question.
42
  Return ONLY the SQL query, no explanation.
43
  ```
44
- - **Model**: phi3
45
  - **Subscribe Topic**: QUESTION_ANALYZED
46
  - **Publish Topic**: SQL_GENERATED
47
 
@@ -58,7 +58,7 @@ Schema:
58
  If valid, return "VALID: " followed by the query.
59
  If invalid, return "INVALID: " followed by the corrected query.
60
  ```
61
- - **Model**: phi3
62
  - **Subscribe Topic**: SQL_GENERATED
63
  - **Publish Topic**: FINAL
64
 
@@ -163,7 +163,7 @@ Analyze the abstract and extract:
163
  2. Methodology
164
  3. Key findings
165
  ```
166
- - **Model**: phi3
167
  - **Subscribe Topic**: START
168
  - **Publish Topic**: ABSTRACT_ANALYZED
169
 
@@ -179,7 +179,7 @@ Evaluate the research methodology:
179
  2. What are the strengths?
180
  3. What are potential limitations?
181
  ```
182
- - **Model**: phi3
183
  - **Subscribe Topic**: ABSTRACT_ANALYZED
184
  - **Publish Topic**: METHODOLOGY_REVIEWED
185
 
@@ -192,13 +192,13 @@ Based on this methodology review:
192
 
193
  Generate a comprehensive summary of the paper suitable for a literature review.
194
  ```
195
- - **Model**: phi3
196
  - **Subscribe Topic**: METHODOLOGY_REVIEWED
197
  - **Publish Topic**: SUMMARY_COMPLETE
198
 
199
  ---
200
 
201
- ## Example 5: Content Moderation Pipeline (using phi3)
202
 
203
  ### Agent 1: Content Classifier
204
  - **Title**: Content Classifier
@@ -213,7 +213,7 @@ Classify this content into one of these categories:
213
 
214
  Provide classification and brief reasoning.
215
  ```
216
- - **Model**: phi3
217
  - **Subscribe Topic**: START
218
  - **Publish Topic**: CLASSIFIED
219
 
@@ -230,13 +230,13 @@ If the content was marked as NEEDS_REVIEW, provide:
230
 
231
  If not NEEDS_REVIEW, just respond "NO REVIEW NEEDED"
232
  ```
233
- - **Model**: phi3
234
  - **Subscribe Topic**: CLASSIFIED
235
  - **Publish Topic**: REVIEW_COMPLETE
236
 
237
  ---
238
 
239
- ## Example 5: Medical Symptom Analysis (phi3 can handle basic medical tasks)
240
 
241
  ### Agent 1: Symptom Categorizer
242
  - **Title**: Symptom Categorizer
@@ -253,7 +253,7 @@ Categorize these symptoms by body system:
253
 
254
  List each symptom under its appropriate category.
255
  ```
256
- - **Model**: phi3
257
  - **Subscribe Topic**: START
258
  - **Publish Topic**: SYMPTOMS_CATEGORIZED
259
 
@@ -271,7 +271,7 @@ Based on these symptoms, provide:
271
 
272
  Note: This is for educational purposes only.
273
  ```
274
- - **Model**: phi3
275
  - **Subscribe Topic**: SYMPTOMS_CATEGORIZED
276
  - **Publish Topic**: DIAGNOSIS_COMPLETE
277
 
@@ -288,7 +288,7 @@ Note: This is for educational purposes only.
288
  - All three can be used together in any prompt
289
 
290
  3. **Model Selection**:
291
- - **phi3**: Versatile general-purpose model - text analysis, SQL, reasoning, classification
292
  - **cniongolo/biomistral**: Medical and scientific tasks - diagnosis, clinical reasoning, biomedical analysis
293
 
294
  4. **Final Results**:
 
1
  # Pub/Sub Multi-Agent System - Example Configurations
2
 
3
+ **Note**: This deployment includes **phi4-mini** (general purpose), **MedAIBase/MedGemma1.5:4b** (medical/healthcare), and **deepseek-coder:1.3b** (coding) models.
4
 
5
  **Important**: Use `{question}` in prompts to access the user's question, `{schema}` for database schema, and `{input}` for messages from subscribed topics (all case insensitive).
6
 
7
+ ## Example 1: Natural Language to SQL Pipeline (using phi4-mini)
8
 
9
  ### Agent 1: Question Analyzer
10
  - **Title**: Question Analyzer
 
22
 
23
  Respond with a clear analysis.
24
  ```
25
+ - **Model**: phi4-mini
26
  - **Subscribe Topic**: START
27
  - **Publish Topic**: QUESTION_ANALYZED
28
 
 
41
  Generate a SQL query that answers the user's question.
42
  Return ONLY the SQL query, no explanation.
43
  ```
44
+ - **Model**: phi4-mini
45
  - **Subscribe Topic**: QUESTION_ANALYZED
46
  - **Publish Topic**: SQL_GENERATED
47
 
 
58
  If valid, return "VALID: " followed by the query.
59
  If invalid, return "INVALID: " followed by the corrected query.
60
  ```
61
+ - **Model**: phi4-mini
62
  - **Subscribe Topic**: SQL_GENERATED
63
  - **Publish Topic**: FINAL
64
 
 
163
  2. Methodology
164
  3. Key findings
165
  ```
166
+ - **Model**: phi4-mini
167
  - **Subscribe Topic**: START
168
  - **Publish Topic**: ABSTRACT_ANALYZED
169
 
 
179
  2. What are the strengths?
180
  3. What are potential limitations?
181
  ```
182
+ - **Model**: phi4-mini
183
  - **Subscribe Topic**: ABSTRACT_ANALYZED
184
  - **Publish Topic**: METHODOLOGY_REVIEWED
185
 
 
192
 
193
  Generate a comprehensive summary of the paper suitable for a literature review.
194
  ```
195
+ - **Model**: phi4-mini
196
  - **Subscribe Topic**: METHODOLOGY_REVIEWED
197
  - **Publish Topic**: SUMMARY_COMPLETE
198
 
199
  ---
200
 
201
+ ## Example 5: Content Moderation Pipeline (using phi4-mini)
202
 
203
  ### Agent 1: Content Classifier
204
  - **Title**: Content Classifier
 
213
 
214
  Provide classification and brief reasoning.
215
  ```
216
+ - **Model**: phi4-mini
217
  - **Subscribe Topic**: START
218
  - **Publish Topic**: CLASSIFIED
219
 
 
230
 
231
  If not NEEDS_REVIEW, just respond "NO REVIEW NEEDED"
232
  ```
233
+ - **Model**: phi4-mini
234
  - **Subscribe Topic**: CLASSIFIED
235
  - **Publish Topic**: REVIEW_COMPLETE
236
 
237
  ---
238
 
239
+ ## Example 5: Medical Symptom Analysis (phi4-mini can handle basic medical tasks)
240
 
241
  ### Agent 1: Symptom Categorizer
242
  - **Title**: Symptom Categorizer
 
253
 
254
  List each symptom under its appropriate category.
255
  ```
256
+ - **Model**: phi4-mini
257
  - **Subscribe Topic**: START
258
  - **Publish Topic**: SYMPTOMS_CATEGORIZED
259
 
 
271
 
272
  Note: This is for educational purposes only.
273
  ```
274
+ - **Model**: phi4-mini
275
  - **Subscribe Topic**: SYMPTOMS_CATEGORIZED
276
  - **Publish Topic**: DIAGNOSIS_COMPLETE
277
 
 
288
  - All three can be used together in any prompt
289
 
290
  3. **Model Selection**:
291
+ - **phi4-mini**: Versatile general-purpose model - text analysis, SQL, reasoning, classification
292
  - **cniongolo/biomistral**: Medical and scientific tasks - diagnosis, clinical reasoning, biomedical analysis
293
 
294
  4. **Final Results**:
NER_AGENTS_GUIDE.md CHANGED
@@ -224,7 +224,7 @@ ECG shows ST elevation. Troponin levels elevated at 0.5 ng/mL.
224
 
225
  **Agent 2: Entity Summarizer**
226
  - Title: `Summarize Findings`
227
- - Model: `phi3`
228
  - Subscribe: `CLINICAL_ENTITIES`
229
  - Publish: *(empty)*
230
  - Prompt:
@@ -304,7 +304,7 @@ No evidence of [mediastinal:ANATOMY] lymphadenopathy.
304
  - ☑ Show result
305
 
306
  **Agent 2: Entity Categorization**
307
- - Model: `phi3`
308
  - Subscribe: `ENTITIES`
309
  - Publish: `CATEGORIZED`
310
  - Prompt:
@@ -406,7 +406,7 @@ NER Agent → Regular LLM → Medical LLM
406
 
407
  **Example**:
408
  1. NER extracts entities from clinical note
409
- 2. phi3 categorizes entities by type
410
  3. MedGemma generates clinical assessment
411
 
412
  ### 4. Understanding What Gets Analyzed
 
224
 
225
  **Agent 2: Entity Summarizer**
226
  - Title: `Summarize Findings`
227
+ - Model: `phi4-mini`
228
  - Subscribe: `CLINICAL_ENTITIES`
229
  - Publish: *(empty)*
230
  - Prompt:
 
304
  - ☑ Show result
305
 
306
  **Agent 2: Entity Categorization**
307
+ - Model: `phi4-mini`
308
  - Subscribe: `ENTITIES`
309
  - Publish: `CATEGORIZED`
310
  - Prompt:
 
406
 
407
  **Example**:
408
  1. NER extracts entities from clinical note
409
+ 2. phi4-mini categorizes entities by type
410
  3. MedGemma generates clinical assessment
411
 
412
  ### 4. Understanding What Gets Analyzed
NER_TRANSFORMERS_IMPLEMENTATION.md CHANGED
@@ -11,7 +11,7 @@ NER (Named Entity Recognition) agents are now implemented using HuggingFace Tran
11
  The system now supports two types of models:
12
 
13
  **1. LLM Models (via Ollama)**:
14
- - phi3
15
  - MedAIBase/MedGemma1.5:4b
16
  - deepseek-coder:1.3b
17
 
@@ -223,7 +223,7 @@ Models cached at:
223
  - Much faster with GPU acceleration
224
 
225
  **LLM Models** (Ollama):
226
- - phi3: ~2-5s per prompt
227
  - MedGemma: ~3-7s per prompt
228
  - DeepSeek: ~1-3s per prompt
229
 
@@ -291,7 +291,7 @@ Errors are:
291
  **With Both Models Loaded**: ~850MB RAM
292
 
293
  **Plus LLM Models (Ollama)**:
294
- - phi3: ~4GB RAM
295
  - MedGemma: ~5GB RAM
296
  - DeepSeek: ~2GB RAM
297
 
 
11
  The system now supports two types of models:
12
 
13
  **1. LLM Models (via Ollama)**:
14
+ - phi4-mini
15
  - MedAIBase/MedGemma1.5:4b
16
  - deepseek-coder:1.3b
17
 
 
223
  - Much faster with GPU acceleration
224
 
225
  **LLM Models** (Ollama):
226
+ - phi4-mini: ~2-5s per prompt
227
  - MedGemma: ~3-7s per prompt
228
  - DeepSeek: ~1-3s per prompt
229
 
 
291
  **With Both Models Loaded**: ~850MB RAM
292
 
293
  **Plus LLM Models (Ollama)**:
294
+ - phi4-mini: ~4GB RAM
295
  - MedGemma: ~5GB RAM
296
  - DeepSeek: ~2GB RAM
297
 
QUICK_START.md CHANGED
@@ -96,7 +96,7 @@ These all work:
96
  - **Benefit**: No more errors from capitalization mismatches!
97
 
98
  ### 3. Three Specialized Models
99
- - **phi3**: General-purpose tasks (3.8B params)
100
  - **MedAIBase/MedGemma1.5:4b**: Medical/healthcare (4B params)
101
  - **deepseek-coder:1.3b**: Code generation and analysis (1.3B params)
102
 
@@ -161,7 +161,7 @@ Database schema:
161
  Generate a SQL query to answer this question.
162
  ```
163
  Note: Used `{QUESTION}` and `{schema}` in different cases - both work!
164
- - Model: `phi3`
165
  - Subscribe: `START`
166
  - Publish: `SQL_QUERY` *(optional - leave empty if not needed)*
167
  - ☑ Show result
@@ -175,7 +175,7 @@ Explain this SQL query in simple terms:
175
 
176
  Original question: {question}
177
  ```
178
- - Model: `phi3`
179
  - Subscribe: `SQL_QUERY`
180
  - Publish: *(leave empty)*
181
  - ☑ Show result
 
96
  - **Benefit**: No more errors from capitalization mismatches!
97
 
98
  ### 3. Three Specialized Models
99
+ - **phi4-mini**: General-purpose tasks (3.8B params)
100
  - **MedAIBase/MedGemma1.5:4b**: Medical/healthcare (4B params)
101
  - **deepseek-coder:1.3b**: Code generation and analysis (1.3B params)
102
 
 
161
  Generate a SQL query to answer this question.
162
  ```
163
  Note: Used `{QUESTION}` and `{schema}` in different cases - both work!
164
+ - Model: `phi4-mini`
165
  - Subscribe: `START`
166
  - Publish: `SQL_QUERY` *(optional - leave empty if not needed)*
167
  - ☑ Show result
 
175
 
176
  Original question: {question}
177
  ```
178
+ - Model: `phi4-mini`
179
  - Subscribe: `SQL_QUERY`
180
  - Publish: *(leave empty)*
181
  - ☑ Show result
README.md CHANGED
@@ -96,7 +96,7 @@ Final result displayed to user
96
 
97
  This deployment includes five specialized models:
98
 
99
- - **phi3**: General-purpose model (3.8B parameters) - Great for text analysis, SQL generation, summarization, reasoning, and general tasks
100
  - **MedAIBase/MedGemma1.5:4b**: Medical/healthcare model (4B parameters) - Specialized for clinical reasoning, medical documentation, and healthcare-related tasks
101
  - **deepseek-coder:1.3b**: Code generation model (1.3B parameters) - Optimized for programming, code analysis, debugging, and technical documentation
102
  - **samrawal/bert-base-uncased_clinical-ner**: Clinical NER model - Extracts medical entities (diseases, symptoms, medications) from clinical text
 
96
 
97
  This deployment includes five specialized models:
98
 
99
+ - **phi4-mini**: General-purpose model (3.8B parameters) - Great for text analysis, SQL generation, summarization, reasoning, and general tasks
100
  - **MedAIBase/MedGemma1.5:4b**: Medical/healthcare model (4B parameters) - Specialized for clinical reasoning, medical documentation, and healthcare-related tasks
101
  - **deepseek-coder:1.3b**: Code generation model (1.3B parameters) - Optimized for programming, code analysis, debugging, and technical documentation
102
  - **samrawal/bert-base-uncased_clinical-ner**: Clinical NER model - Extracts medical entities (diseases, symptoms, medications) from clinical text
TUTORIAL.md CHANGED
@@ -28,7 +28,7 @@ Briefly answer the question:
28
 
29
  Answer:
30
  ```
31
- - **Model**: phi3
32
  - **Subscribe Topic**: START
33
  - **Publish Topic**: SHORT_ANSWER
34
  - **Show result in Final Result box** [x]
 
28
 
29
  Answer:
30
  ```
31
+ - **Model**: phi4-mini
32
  - **Subscribe Topic**: START
33
  - **Publish Topic**: SHORT_ANSWER
34
  - **Show result in Final Result box** [x]
static/index.html CHANGED
@@ -27,7 +27,7 @@
27
  const fileInputRef = useRef(null);
28
 
29
  const models = [
30
- "phi3",
31
  "MedAIBase/MedGemma1.5:4b",
32
  "deepseek-coder:1.3b",
33
  "samrawal/bert-base-uncased_clinical-ner",
@@ -45,7 +45,7 @@
45
  id: Date.now(),
46
  title: `Agent ${agents.length + 1}`,
47
  prompt: '',
48
- model: 'phi3',
49
  subscribeTopic: '',
50
  publishTopic: '',
51
  showResult: false
 
27
  const fileInputRef = useRef(null);
28
 
29
  const models = [
30
+ "phi4-mini",
31
  "MedAIBase/MedGemma1.5:4b",
32
  "deepseek-coder:1.3b",
33
  "samrawal/bert-base-uncased_clinical-ner",
 
45
  id: Date.now(),
46
  title: `Agent ${agents.length + 1}`,
47
  prompt: '',
48
+ model: 'phi4-mini',
49
  subscribeTopic: '',
50
  publishTopic: '',
51
  showResult: false