Ali Mohsin committed on
Commit
aa7f9ab
·
1 Parent(s): 3be6273

Final updates

Browse files
Files changed (1) hide show
  1. app.py +3 -2
app.py CHANGED
@@ -29,7 +29,7 @@ You must return a SINGLE valid JSON object. Do not include any markdown formatti
29
  "description": "A comprehensive technical description. For research problems, describe the novel architecture (e.g., 'Dual-Encoder with Cross-Attention Adapters'). For production, specify the exact backbone (e.g., 'ResNet-50v2 with FPN').",
30
  "pros": ["Critical advantage 1", "Critical advantage 2", "Critical advantage 3"],
31
  "cons": ["Trade-off 1", "Trade-off 2"],
32
- "architectureDiagram": "A detailed Mermaid.js graph (graph TD). Use square brackets [] for ALL nodes. NO curly braces {}. Include data ingestion, preprocessing, backbone, heads, and post-processing.",
33
  "mlopsBestPractices": [
34
  "Data Versioning: Strategy (e.g., DVC/Delta Lake)",
35
  "Experiment Tracking: Tools (e.g., MLflow/W&B)",
@@ -47,6 +47,7 @@ You must return a SINGLE valid JSON object. Do not include any markdown formatti
47
  2. **Novel Architectures**: If the user asks for "latent program synthesis", design a "Neural Module Network with Discrete Latent Variables". Do not recommend generic models for research problems.
48
  3. **Complete Pipelines**: The MLOps section must be actionable and specific to the problem (e.g., "Use ONNX Runtime for <10ms latency").
49
  4. **Valid JSON**: Your response must be parseable by `json.loads()`.
 
50
  """
51
 
52
  FEW_SHOT_EXAMPLES = """
@@ -67,7 +68,7 @@ JSON Response:
67
  "description": "A unified architecture combining a ViT (Vision), RoBERTa (Text), and Wav2Vec2 (Audio) encoder into a shared embedding space. A central 'Program Synthesizer' LSTM decodes discrete symbolic tokens (Map, Filter, Join) which are executed by differentiable neural modules. Uses Gumbel-Softmax for end-to-end training of discrete operations.",
68
  "pros": ["Interpretable reasoning steps", "Generalizes to new combinations", "End-to-end differentiable"],
69
  "cons": ["Unstable training dynamics", "High computational cost during search"],
70
- "architectureDiagram": "graph TD\\nA[Image/Text/Audio Input] --> B[Modality Encoders]\\nB --> C[Shared Latent Space]\\nC --> D[Program Synthesizer LSTM]\\nD --> E[Symbolic Tokens]\\nE --> F[Neural Module Network]\\nF --> G[Execution Result]\\nG --> H[Loss Calculation]",
71
  "mlopsBestPractices": [
72
  "Data: WebDataset for sharded multimodal data",
73
  "Training: Distributed Data Parallel (DDP) on A100 cluster",
 
29
  "description": "A comprehensive technical description. For research problems, describe the novel architecture (e.g., 'Dual-Encoder with Cross-Attention Adapters'). For production, specify the exact backbone (e.g., 'ResNet-50v2 with FPN').",
30
  "pros": ["Critical advantage 1", "Critical advantage 2", "Critical advantage 3"],
31
  "cons": ["Trade-off 1", "Trade-off 2"],
32
+ "architectureDiagram": "A detailed Mermaid.js graph. CRITICAL SYNTAX RULES: (1) Start with 'graph TD', (2) EVERY node must have a unique ID followed by square brackets, e.g., 'Node1[Label] --> Node2[Another Label]', (3) NEVER use just brackets without an ID like '[Label] --> [Next]', (4) NO curly braces {}, (5) Use \\n for newlines. Example: 'graph TD\\nNode1[Input] --> Node2[Preprocessing]\\nNode2 --> Node3[Model]\\nNode3 --> Node4[Output]'",
33
  "mlopsBestPractices": [
34
  "Data Versioning: Strategy (e.g., DVC/Delta Lake)",
35
  "Experiment Tracking: Tools (e.g., MLflow/W&B)",
 
47
  2. **Novel Architectures**: If the user asks for "latent program synthesis", design a "Neural Module Network with Discrete Latent Variables". Do not recommend generic models for research problems.
48
  3. **Complete Pipelines**: The MLOps section must be actionable and specific to the problem (e.g., "Use ONNX Runtime for <10ms latency").
49
  4. **Valid JSON**: Your response must be parseable by `json.loads()`.
50
+ 5. **Mermaid Diagrams**: ALWAYS use proper node IDs. WRONG: '[Input] --> [Model]'. CORRECT: 'A[Input] --> B[Model]' or 'Node1[Input] --> Node2[Model]'.
51
  """
52
 
53
  FEW_SHOT_EXAMPLES = """
 
68
  "description": "A unified architecture combining a ViT (Vision), RoBERTa (Text), and Wav2Vec2 (Audio) encoder into a shared embedding space. A central 'Program Synthesizer' LSTM decodes discrete symbolic tokens (Map, Filter, Join) which are executed by differentiable neural modules. Uses Gumbel-Softmax for end-to-end training of discrete operations.",
69
  "pros": ["Interpretable reasoning steps", "Generalizes to new combinations", "End-to-end differentiable"],
70
  "cons": ["Unstable training dynamics", "High computational cost during search"],
71
+ "architectureDiagram": "graph TD\\nNode1[Image/Text/Audio Input] --> Node2[Modality Encoders]\\nNode2 --> Node3[Shared Latent Space]\\nNode3 --> Node4[Program Synthesizer LSTM]\\nNode4 --> Node5[Symbolic Tokens]\\nNode5 --> Node6[Neural Module Network]\\nNode6 --> Node7[Execution Result]\\nNode7 --> Node8[Loss Calculation]",
72
  "mlopsBestPractices": [
73
  "Data: WebDataset for sharded multimodal data",
74
  "Training: Distributed Data Parallel (DDP) on A100 cluster",