echobt commited on
Commit
64abeb3
Β·
1 Parent(s): 50074ae

Add ARC-1 model card: PRISM-based architecture search, Text/Image -> Text

Browse files
Files changed (1) hide show
  1. README.md +85 -0
README.md CHANGED
@@ -1,3 +1,88 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ pipeline_tag: image-text-to-text
4
+ tags:
5
+ - prism
6
+ - neural-architecture-search
7
+ - multimodal
8
+ - under-development
9
  ---
10
+
11
+ <div align="center">
12
+
13
+ ![ARC-1 Banner](https://github.com/PlatformNetwork/prism/raw/main/assets/banner.png)
14
+
15
+ # ARC-1
16
+
17
+ **A multimodal model discovered through decentralized neural architecture search**
18
+
19
+ [![PRISM](https://img.shields.io/badge/Built%20with-PRISM-6f42c1.svg)](https://github.com/PlatformNetwork/prism)
20
+ [![Status](https://img.shields.io/badge/Status-In%20Development-orange.svg)]()
21
+ [![Modality](https://img.shields.io/badge/Modality-Text%20%2F%20Image%20%E2%86%92%20Text-blue.svg)]()
22
+
23
+ </div>
24
+
25
+ ---
26
+
27
+ ## ⚠️ Status: In Development
28
+
29
+ **ARC-1 is currently under active development.** No weights are available yet. This repository will host the model once the architecture search and training phases are complete.
30
+
31
+ ## Overview
32
+
33
+ ARC-1 is a model being developed through [**PRISM**](https://github.com/PlatformNetwork/prism), a decentralized neural architecture search (NAS) subnet on the Platform Network. Instead of hand-picking an architecture upfront, ARC-1's design is being **discovered competitively**: miners across the network submit novel architecture families and training recipes, which are evaluated in isolated benchmark environments for learning quality, training stability, and scaling behavior.
34
+
35
+ The best-performing architecture that emerges from this search will be used to train ARC-1.
36
+
37
+ ## Modalities
38
+
39
+ | Input | Output |
40
+ |-------|--------|
41
+ | Text πŸ“ | Text πŸ“ |
42
+ | Image πŸ–ΌοΈ | Text πŸ“ |
43
+
44
+ ARC-1 will support **Text/Image β†’ Text**: it will accept both text and images as input and generate text as output.
45
+
46
+ ## Why is the model size unknown?
47
+
48
+ The parameter count of ARC-1 is **intentionally undisclosed for now β€” because it is genuinely not decided yet.**
49
+
50
+ In a conventional training pipeline, you fix an architecture and a parameter budget first, then train. ARC-1 inverts this process:
51
+
52
+ 1. **Architecture search comes first.** PRISM evaluates candidate architectures at compact proxy scales, measuring loss curves, gradient stability, activation behavior, and how performance scales across model size, depth, sequence length, and batch size.
53
+ 2. **Scaling laws are derived from the winner.** Each architecture family has its own scaling behavior. The optimal parameter count depends on the scaling-law signals of the architecture that wins the search β€” a number that simply cannot be known before the search concludes.
54
+ 3. **The final size is chosen from evidence, not convention.** Once the winning architecture's scaling characteristics are measured, the parameter budget will be set where the compute/performance trade-off is optimal for that specific design.
55
+
56
+ The final model size will be announced **after the architecture search is complete**.
57
+
58
+ ## How PRISM discovers the architecture
59
+
60
+ ```mermaid
61
+ flowchart LR
62
+ A[Miners submit<br/>architectures & recipes] --> B[Isolated evaluation<br/>environments]
63
+ B --> C[Scoring: learning quality,<br/>stability, scaling signals]
64
+ C --> D[Best architecture<br/>selected]
65
+ D --> E[ARC-1 training]
66
+ E --> F[Weights released<br/>here]
67
+ ```
68
+
69
+ - **Decentralized search**: architecture and training ideas are sourced from a competitive network of miners rather than a single research team.
70
+ - **Scaling-aware evaluation**: candidates are rewarded for smooth loss curves, stable gradients, and consistent improvements across scales β€” not just raw benchmark numbers.
71
+ - **Separate ownership**: architecture discovery and training-recipe improvements are attributed and rewarded independently, so both the design and the training procedure are optimized.
72
+
73
+ ## Roadmap
74
+
75
+ - [x] Repository created
76
+ - [ ] Neural architecture search via PRISM *(in progress)*
77
+ - [ ] Final architecture & model size announcement
78
+ - [ ] Training
79
+ - [ ] Weights release
80
+
81
+ ## Links
82
+
83
+ - πŸ”¬ **PRISM (architecture search)**: [github.com/PlatformNetwork/prism](https://github.com/PlatformNetwork/prism)
84
+ - πŸ“– **PRISM documentation**: [Overview](https://github.com/PlatformNetwork/prism/blob/main/docs/overview.md) β€’ [Scoring](https://github.com/PlatformNetwork/prism/blob/main/docs/scoring.md) β€’ [Scaling evaluation](https://github.com/PlatformNetwork/prism/blob/main/docs/scaling.md)
85
+
86
+ ## License
87
+
88
+ Apache 2.0