echobt commited on
Commit Β·
64abeb3
1
Parent(s): 50074ae
Add ARC-1 model card: PRISM-based architecture search, Text/Image -> Text
Browse files
README.md
CHANGED
|
@@ -1,3 +1,88 @@
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
+
pipeline_tag: image-text-to-text
|
| 4 |
+
tags:
|
| 5 |
+
- prism
|
| 6 |
+
- neural-architecture-search
|
| 7 |
+
- multimodal
|
| 8 |
+
- under-development
|
| 9 |
---
|
| 10 |
+
|
| 11 |
+
<div align="center">
|
| 12 |
+
|
| 13 |
+

|
| 14 |
+
|
| 15 |
+
# ARC-1
|
| 16 |
+
|
| 17 |
+
**A multimodal model discovered through decentralized neural architecture search**
|
| 18 |
+
|
| 19 |
+
[](https://github.com/PlatformNetwork/prism)
|
| 20 |
+
[]()
|
| 21 |
+
[]()
|
| 22 |
+
|
| 23 |
+
</div>
|
| 24 |
+
|
| 25 |
+
---
|
| 26 |
+
|
| 27 |
+
## β οΈ Status: In Development
|
| 28 |
+
|
| 29 |
+
**ARC-1 is currently under active development.** No weights are available yet. This repository will host the model once the architecture search and training phases are complete.
|
| 30 |
+
|
| 31 |
+
## Overview
|
| 32 |
+
|
| 33 |
+
ARC-1 is a model being developed through [**PRISM**](https://github.com/PlatformNetwork/prism), a decentralized neural architecture search (NAS) subnet on the Platform Network. Instead of hand-picking an architecture upfront, ARC-1's design is being **discovered competitively**: miners across the network submit novel architecture families and training recipes, which are evaluated in isolated benchmark environments for learning quality, training stability, and scaling behavior.
|
| 34 |
+
|
| 35 |
+
The best-performing architecture that emerges from this search will be used to train ARC-1.
|
| 36 |
+
|
| 37 |
+
## Modalities
|
| 38 |
+
|
| 39 |
+
| Input | Output |
|
| 40 |
+
|-------|--------|
|
| 41 |
+
| Text π | Text π |
|
| 42 |
+
| Image πΌοΈ | Text π |
|
| 43 |
+
|
| 44 |
+
ARC-1 will support **Text/Image β Text**: it will accept both text and images as input and generate text as output.
|
| 45 |
+
|
| 46 |
+
## Why is the model size unknown?
|
| 47 |
+
|
| 48 |
+
The parameter count of ARC-1 is **intentionally undisclosed for now β because it is genuinely not decided yet.**
|
| 49 |
+
|
| 50 |
+
In a conventional training pipeline, you fix an architecture and a parameter budget first, then train. ARC-1 inverts this process:
|
| 51 |
+
|
| 52 |
+
1. **Architecture search comes first.** PRISM evaluates candidate architectures at compact proxy scales, measuring loss curves, gradient stability, activation behavior, and how performance scales across model size, depth, sequence length, and batch size.
|
| 53 |
+
2. **Scaling laws are derived from the winner.** Each architecture family has its own scaling behavior. The optimal parameter count depends on the scaling-law signals of the architecture that wins the search β a number that simply cannot be known before the search concludes.
|
| 54 |
+
3. **The final size is chosen from evidence, not convention.** Once the winning architecture's scaling characteristics are measured, the parameter budget will be set where the compute/performance trade-off is optimal for that specific design.
|
| 55 |
+
|
| 56 |
+
The final model size will be announced **after the architecture search is complete**.
|
| 57 |
+
|
| 58 |
+
## How PRISM discovers the architecture
|
| 59 |
+
|
| 60 |
+
```mermaid
|
| 61 |
+
flowchart LR
|
| 62 |
+
A[Miners submit<br/>architectures & recipes] --> B[Isolated evaluation<br/>environments]
|
| 63 |
+
B --> C[Scoring: learning quality,<br/>stability, scaling signals]
|
| 64 |
+
C --> D[Best architecture<br/>selected]
|
| 65 |
+
D --> E[ARC-1 training]
|
| 66 |
+
E --> F[Weights released<br/>here]
|
| 67 |
+
```
|
| 68 |
+
|
| 69 |
+
- **Decentralized search**: architecture and training ideas are sourced from a competitive network of miners rather than a single research team.
|
| 70 |
+
- **Scaling-aware evaluation**: candidates are rewarded for smooth loss curves, stable gradients, and consistent improvements across scales β not just raw benchmark numbers.
|
| 71 |
+
- **Separate ownership**: architecture discovery and training-recipe improvements are attributed and rewarded independently, so both the design and the training procedure are optimized.
|
| 72 |
+
|
| 73 |
+
## Roadmap
|
| 74 |
+
|
| 75 |
+
- [x] Repository created
|
| 76 |
+
- [ ] Neural architecture search via PRISM *(in progress)*
|
| 77 |
+
- [ ] Final architecture & model size announcement
|
| 78 |
+
- [ ] Training
|
| 79 |
+
- [ ] Weights release
|
| 80 |
+
|
| 81 |
+
## Links
|
| 82 |
+
|
| 83 |
+
- π¬ **PRISM (architecture search)**: [github.com/PlatformNetwork/prism](https://github.com/PlatformNetwork/prism)
|
| 84 |
+
- π **PRISM documentation**: [Overview](https://github.com/PlatformNetwork/prism/blob/main/docs/overview.md) β’ [Scoring](https://github.com/PlatformNetwork/prism/blob/main/docs/scoring.md) β’ [Scaling evaluation](https://github.com/PlatformNetwork/prism/blob/main/docs/scaling.md)
|
| 85 |
+
|
| 86 |
+
## License
|
| 87 |
+
|
| 88 |
+
Apache 2.0
|