0xSero committed · Commit 613826b · verified · 1 Parent(s): 6a3d4ee

Standardize model card (template rollout)

Files changed (1): README.md (+47 -7)
@@ -1,11 +1,51 @@
 
 
 
 
 
 
 
 
- ## Sponsors

- Thank you for the kind sponsors, wouldn't be possible without them:

- - Nvidia
- - TNG Technology
- - Lambda
- - Prime Intellect
- - HotAisle

+ ---
+ license: mit
+ pipeline_tag: text-generation
+ library_name: transformers
+ tags:
+ - reap
+ - trinity
+ ---

+ > [!TIP]
+ > **[Support this work →](https://donate.sybilsolutions.ai)** · [X](https://x.com/0xsero) · [GitHub](https://github.com/0xsero) · [REAP paper](https://arxiv.org/abs/2510.13999) · [Cerebras REAP](https://huggingface.co/collections/cerebras/cerebras-reap)

+ # Trinity-337B
+
+ REAP-pruned the base model.
+
+ ## At a glance
+
+ | | |
+ |---|---|
+ | Base model | — |
+ | Format | BF16 |
+ | Total params | **337B** |
+ | Active / token | — |
+ | Experts / layer | 216 |
+ | Layers | 60 |
+ | Hidden size | 3072 |
+ | Context | 262,144 |
+ | On-disk size | 675 GB |

+ ## Which variant should I pick?

+ | Variant | Format | Link |
+ |---|---|---|
+ | `Trinity-337B` **(this)** | BF16 | [link](https://huggingface.co/0xSero/Trinity-337B) |
+ | `Trinity-337B-W4A16` | W4A16 | [link](https://huggingface.co/0xSero/Trinity-337B-W4A16) |
+ | `Trinity-337B-W4A16-192` | W4A16 | [link](https://huggingface.co/0xSero/Trinity-337B-W4A16-192) |
+
+ ## License & citation
+ License inherited from the base model.
+
+ ```bibtex
+ @misc{lasby2025reap,
+ title = {REAP the Experts: Why Pruning Prevails for One-Shot MoE Compression},
+ author = {Mike Lasby and Ivan Lazarevich and Nish Sinnadurai and Sean Lie and Yani Ioannou and Vithursan Thangarasa},
+ year = {2025}, eprint = {2510.13999}, archivePrefix = {arXiv}
+ }
+ ```
+
+ ## Sponsors
+ Made possible by **NVIDIA · TNG Technology · Lambda · Prime Intellect · Hot Aisle**.
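
The "Total params" and "On-disk size" rows added to the README can be cross-checked with simple arithmetic. The sketch below is only a sanity check, not part of the commit. It assumes that BF16 stores each parameter in 2 bytes, that "GB" means 10^9 bytes, and that "337B" is a rounded count; the helper name `estimated_size_gb` is hypothetical.

```python
# Rough size check for the figures in the "At a glance" table.
# Assumptions: 2 bytes per BF16 parameter, 1 GB = 10**9 bytes.
BYTES_PER_BF16_PARAM = 2


def estimated_size_gb(total_params: float,
                      bytes_per_param: int = BYTES_PER_BF16_PARAM) -> float:
    """Return the approximate checkpoint size in decimal gigabytes."""
    return total_params * bytes_per_param / 1e9


total_params = 337e9  # "Total params" row, rounded to the nearest billion
size_gb = estimated_size_gb(total_params)

# The README reports 675 GB on disk; compare against the estimate.
reported_gb = 675
relative_gap = abs(size_gb - reported_gb) / reported_gb
print(f"Estimated BF16 size: {size_gb:.0f} GB (reported: {reported_gb} GB)")
```

Under these assumptions the estimate lands within about 1 GB of the reported 675 GB, which suggests the two rows are consistent with each other. The small gap is plausibly rounding in the parameter count or non-weight files in the repository.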