YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

XaeroAI - Design Analyst Model

A fine-tuned Phi-3 model for software design analysis, project health assessment, and Mermaid diagram generation.

Features

  • Project Health Analysis: Analyze Jira/Slack integration data to identify design drift, stale tickets, and team bottlenecks
  • Design Pattern Knowledge: GoF, enterprise, microservices, and Rust idiom patterns
  • Mermaid Diagram Generation: Sequence diagrams, flowcharts, class diagrams (partial support for ER and state diagrams)

Model

  • Base: microsoft/Phi-3-mini-4k-instruct
  • Fine-tuning: LoRA (rank 8, dropout 0.1)
  • Training: MLX on Apple Silicon
  • Format: GGUF (Q4_K_M quantized, ~2GB)

Quick Start

Using GGUF with llama.cpp

llama-cli -m models/design-analyst-v4-q4.gguf \
    -p "<|user|>Create a Mermaid sequenceDiagram for user login<|end|><|assistant|>" \
    -n 400

Using Ollama

ollama create design-analyst -f Modelfile
ollama run design-analyst "Analyze this ticket: 45 days open, 67 Slack mentions, 0 PRs"

Using MLX (Apple Silicon)

python -m mlx_lm generate \
    --model microsoft/Phi-3-mini-4k-instruct \
    --adapter-path adapters/design-analyst-v4 \
    --max-tokens 400 \
    --prompt "Create a Mermaid sequenceDiagram for OAuth2 flow"

Project Structure

xaeroai/
β”œβ”€β”€ adapters/                    # MLX LoRA adapters
β”‚   └── design-analyst-v4/       # Current production adapter
β”œβ”€β”€ models/                      # GGUF models (gitignored)
β”‚   └── design-analyst-v4-q4.gguf
β”œβ”€β”€ data/
β”‚   └── mlx_final/              # Training data
β”‚       β”œβ”€β”€ train.jsonl
β”‚       └── valid.jsonl
β”œβ”€β”€ scripts/
β”‚   β”œβ”€β”€ train.py                # Train/continue training
β”‚   β”œβ”€β”€ export.py               # Merge + GGUF conversion
β”‚   β”œβ”€β”€ test.py                 # Test model outputs
β”‚   └── generate_data.py        # Generate training data
β”œβ”€β”€ lora_config.yaml            # Training configuration
β”œβ”€β”€ Modelfile                   # Ollama model definition
└── README.md

Scripts

Train

# Train new version
python scripts/train.py --config lora_config.yaml

# Continue from checkpoint
python scripts/train.py --config lora_config.yaml --resume adapters/design-analyst-v4

Export to GGUF

python scripts/export.py --adapter adapters/design-analyst-v4 --output models/design-analyst-v4

Test

# Test with MLX
python scripts/test.py --use-mlx

# Test with GGUF
python scripts/test.py --model models/design-analyst-v4-q4.gguf

Training Data

  • Project health examples: ~1000 ticket analysis scenarios
  • Design patterns: GoF, enterprise, Rust idioms
  • Mermaid diagrams: ~250 examples (sequence, flowchart, class)

Limitations

  • classDiagram: ~33% valid syntax
  • erDiagram: Not reliable
  • stateDiagram: Not reliable

See MODEL_IMPROVEMENT_CONTEXT.md for improvement plan.

License

Business Source License - Copyright (c) Block Xaero Inc.

Downloads last month
77
GGUF
Model size
4B params
Architecture
phi3
Hardware compatibility
Log In to view the estimation

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support