AI & ML interests
We love Open source & Free models π If you find out any bug, pls, write us π MAI β€Free, Open source models
Recent Activity
π Mythic Artificial Intelligence
by MythicGames
Building the next generation of merged language models
π Visit our platform Β· π¬ Chat with MAI models Β· π All Models
𧬠Model Families
MAI models follow a unified naming convention:
MAI M{version} {Specialization} {Variant}
MAI {version} {Variant}
MAI C{version} {Variant}
MAIGEN {version} {Specification}
MAIMIND {version} {Specification}
MAITTS {version} {Specification}
MAIEDITOR {version}.{Date of release} {Update feature name}
| Component | Meaning | Examples |
|---|---|---|
| M{version} | Generation / major version | M1, M2, M3, M4 |
| Specialization | Primary task focus | Coder, Chat, Reason, Vision |
| Variant | Speed / depth profile | Fast, Thinking |
β‘ Variant Breakdown
| Variant | Philosophy | Latency | Depth | Best For |
|---|---|---|---|---|
| π’ Fast | Speed-first. Minimal chain-of-thought, instant responses | π½ Low | Standard | Code generation, quick Q&A, real-time chat |
| π£ Thinking | Depth-first. Extended internal reasoning before answering | πΌ Higher | Deep CoT | Math, logic, complex analysis, research |
Rule of thumb: If you need an answer now β use Fast. If you need the right answer to a hard problem β use Thinking.
π Full Model Registry
| Model | Specialization | Variant | MSPLIT | MCE | Power (Γ) | Context | Status |
|---|---|---|---|---|---|---|---|
| MAI M3 Coder Fast | Reasoning | Fast | 3A | 2.74 | ~3.2Γ | >1M | π’ Active |
| MAI M3 Coder Thinking | Reasoning | Thinking | 3A | 2.74 | ~3.2Γ | >1M | π’ Active |
| MAI M4 Coder Fast β | Code | Fast | 4A | 3.16 | ~4.3Γ | >1M | π’ Flagship |
| MAI M4 Coder Thinking | Code | Thinking | 4A | 3.16 | ~4.3Γ | >1M | π’ Active |
| MAI M5 Coder Fast | Multimodal | Fast | 4A | 3.16 | ~4.3Γ | >1M | π΅ Coming Soon |
π The MAI Math β Formulas & Coefficients
1οΈβ£ Power Multiplier Formula
Every MAI model's effective performance boost is calculated using:
MCEΒ² Γ 8
Power (Γ) = βββββββββββββ
9.3 Γ 2
Or simplified:
Power = (MCEΒ² Γ 8) / 18.6
| Variable | Full Name | Description |
|---|---|---|
| MCE | Merge Coefficient Exponent | Core efficiency metric of the merge. Higher = better synergy between merged weights |
| 8 | Base Parameter Scalar | Constant tied to the 8-expert routing in the merge pipeline |
| 9.3 | Normalization Factor | Empirical constant derived from benchmark calibration |
| 2 | Dual-pass Divisor | Accounts for the two-pass merge verification in MSPLIT |
2οΈβ£ MCE Progression Across Generations
MCE grows with each MSPLIT generation following a square-root scaling law:
MCE(n) = β(2.5 Γ n)
Where n = MSPLIT generation number.
| MSPLIT Gen | n | MCE = β(2.5n) | MCEΒ² | Power (Γ) |
|---|---|---|---|---|
| 3A | 3 | β7.5 β 2.74 | 5 | ~3.23Γ |
| 4A | 4 | β10.0 β 3.16 | 10.0 | ~4.30Γ |
| 5A (projected) | 5 | β12.5 β 3.54 | 8 | ~5.38Γ |
| 6A (projected) | 6 | β15.0 β 3.87 | 16 | ~6.45Γ |
π Insight: Power scales linearly with MSPLIT generation because MCEΒ² = 2.5n, so Power = (2.5n Γ 8) / 18.6 β 1.075n. Each new generation adds roughly +1.08Γ to the multiplier.
3οΈβ£ Context Window Scaling
Context length doubles with each major version:
Context(v) = 64K Γ 2^v
| Version (v) | Calculation | Context Window |
|---|---|---|
| M3 (v=3) | 64K Γ 2Β³ | 1,024K |
| M4 (v=4) | 64K Γ 2β΄ | 1,024K (>1M) |
| M5 (projected) | 64K Γ 2β΅ | 2,048K (~2M) |
4οΈβ£ Effective Intelligence Index (EII)
To compare models holistically, we use the EII β a single score combining power and context:
EII = Power(Γ) Γ logβ(Context / 1K)
| Model | Power (Γ) | Context | logβ(C/1K) | EII |
|---|---|---|---|---|
| MAI M3 Reason Fast | 3.44 | 1024K | 4 | 29.07 |
| MAI M4 Coder Fast | 4.30 | 1024K | 10 | 43.00 β |
| MAI M5 (projected) | 6.88 | 2048K | 8 | 59.18 |
π― Notice the pattern? EII β 4.3 Γ n Γ (n + 6) / 10 β it grows quadratically, meaning each generation is dramatically more capable than the last. Models like M5 will use: 64 / 9.3, without / 2
5οΈβ£ Fast vs Thinking β Speed-Depth Tradeoff
Base Latency
Fast Latency = βββββββββββββ
Power(Γ)
Thinking Latency = Base Latency Γ Thinking Depth Factor (TDF)
Where TDF typically ranges from 3Γ to 8Γ depending on problem complexity.
| Variant | Relative Latency | Relative Accuracy (hard tasks) |
|---|---|---|
| Fast | 1Γ (baseline) | ~85β92% |
| Thinking | 3β8Γ slower | ~94β99% |
π‘ When to switch? If Fast gives a confident answer β stay with Fast. If it hedges or the task involves multi-step reasoning β switch to Thinking.
π¬ MSPLIT Technology β How It Works
ββββββββββββββββ ββββββββββββββββ ββββββββββββββββ
β Base Model β β Base Model β β Base Model β
β A β β B β β C β
ββββββββ¬ββββββββ ββββββββ¬ββββββββ ββββββββ¬ββββββββ
β β β
βββββββββββββββ¬βββββββββββββββββββββββββββββ
β
βββββββββΌβββββββββ
β PEREX MERGE β β Weighted parameter fusion
β Pipeline β
βββββββββ¬βββββββββ
β
βββββββββΌβββββββββ
β MSPLIT nA β β Split-verify-remerge (n passes)
β Optimization β
βββββββββ¬βββββββββ
β
βββββββββΌββββββββββ
β Final Merged β
β Model β β MCE = β(2.5 Γ n)
βββββββββββββββββββ
MSPLIT (Multi-Stage Parameter Splitting) works in three phases:
- Merge β Multiple base models are fused using the Perex Merge weighted-average pipeline
- Split β The merged weights are split into parameter subgroups and independently evaluated
- Re-merge β Only the highest-performing parameter configurations survive and are re-merged
Each MSPLIT generation (3A β 4A) adds an additional split-verify pass, increasing MCE and therefore the power multiplier.
π‘οΈ Access & Licensing
| Access | π Private β all models are served exclusively through our platform |
| Hosting | Puter.js |
| Weights | Not publicly distributed |
| API | Available through the MAI website |
| Commercial Use | Contact MythicGames for licensing |
π "The future of AI is here"
Mythic Artificial Intelligence Β· MythicGames Β· 2026