---
library_name: transformers
license: apache-2.0
pipeline_tag: text-generation
---

## Brello QAI 0.6B by Epic Systems

Brello QAI is a proprietary model created by Epic Systems. It is designed to deliver deep, targeted, and actionable responses rather than generic overviews. It supports both thinking and non-thinking modes.

- Strong reasoning via Qwen3 thinking tokens (`<think>...</think>`)
- Default Brello persona prioritizes specificity and actionable guidance
- Lightweight local inference on macOS (MPS), CUDA, or CPU
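
If you consume raw model output yourself, you may want to separate the reasoning from the final answer. The following is a minimal sketch, assuming the output contains at most one `<think>...</think>` block ahead of the answer; `split_thinking` is a hypothetical helper and is not part of this repository.

```python
import re

# Assumption: reasoning is wrapped in a single <think>...</think> block,
# as Qwen3-style models typically emit it in thinking mode.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_thinking(text: str) -> tuple[str, str]:
    """Return (reasoning, answer); reasoning is "" if no think block is found."""
    match = THINK_RE.search(text)
    if match is None:
        return "", text.strip()
    reasoning = match.group(1).strip()
    # Everything outside the think block is treated as the answer.
    answer = (text[:match.start()] + text[match.end():]).strip()
    return reasoning, answer
```

In non-thinking mode the block is typically absent or empty, in which case the helper returns an empty reasoning string and the full text as the answer.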

### Quickstart

Local run with the included script:

```bash
python3 -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt
python run.py --prompt "Help me design a rollout plan for feature flags in a mobile app."
```

Options:

- `--system`: override the default Brello persona; pass an empty string (`""`) to disable it
- `--no-think`: disable thinking mode
- `--device`: one of `auto`, `mps`, `cuda`, or `cpu`
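
For orientation, the flags above could be parsed roughly as follows. This is a sketch, not the actual contents of `run.py`: the defaults, the `build_parser` and `resolve_device` names, and the CUDA-before-MPS preference under `auto` are all assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented flags."""
    parser = argparse.ArgumentParser(description="Sketch of the run.py options")
    parser.add_argument("--prompt", required=True, help="user prompt")
    # Assumption: None means "use the default persona"; "" disables it.
    parser.add_argument("--system", default=None)
    parser.add_argument("--no-think", action="store_true")
    # Assumption: "auto" is the default.
    parser.add_argument("--device", choices=["auto", "mps", "cuda", "cpu"],
                        default="auto")
    return parser


def resolve_device(requested: str, cuda_available: bool, mps_available: bool) -> str:
    """Map "auto" to a concrete device; pass explicit choices through unchanged."""
    if requested != "auto":
        return requested
    if cuda_available:
        return "cuda"
    if mps_available:
        return "mps"
    return "cpu"
```

In a real script, the two availability flags would likely come from `torch.cuda.is_available()` and `torch.backends.mps.is_available()`.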

Example with a custom system message and non-thinking mode:

```bash
python run.py --no-think --system "You are Brello QAI. Give surgical, stepwise answers with tradeoffs." \
  --prompt "Compare blue/green and canary deployments for a small backend team."
```

### Light SFT (optional)

To nudge the style further toward “deep and targeted,” see `training/` for a LoRA SFT script and a small sample dataset. You can train a tiny adapter on your examples and then merge it into a standalone Brello checkpoint.
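
To make "merge" concrete: a LoRA adapter stores two small matrices, `A` and `B`, and merging folds their scaled product into the base weight so no adapter is needed at inference. The toy sketch below uses plain Python lists and hypothetical names (`merge_lora`, `matmul`); it illustrates the standard LoRA arithmetic and is not the script in `training/`.

```python
def matmul(x, y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*y)] for row in x]


def merge_lora(w0, a, b, alpha, r):
    """Return w0 + (alpha / r) * (b @ a), the merged weight matrix."""
    scale = alpha / r
    delta = matmul(b, a)  # low-rank update with the same shape as w0
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(w0, delta)]


# Toy shapes: w0 is 2x2, b is 2x1 (d_out x r), a is 1x2 (r x d_in), rank r = 1.
w0 = [[1.0, 0.0], [0.0, 1.0]]
b = [[1.0], [2.0]]
a = [[0.5, 0.25]]
merged = merge_lora(w0, a, b, alpha=2, r=1)
```

With the PEFT library, this step is usually performed for every adapted layer by calling `merge_and_unload()` on the loaded adapter model.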

### License

This repository includes open-source components under the Apache-2.0 license. See `LICENSE` for details.