Nottybro
/

Wigip_v2_1.7B_ViT

Text Generation

vision-transformer

Model card Files Files and versions

Nottybro commited on Jan 6

Commit

f0d4cf4

·

verified ·

1 Parent(s): 019e4fd

Update README.md

Files changed (1) hide show

README.md +24 -2

README.md CHANGED Viewed

@@ -1,5 +1,28 @@
 # WIGIP-1 v2
 ## Stage 1 – Text Pre-Training (ViT-Style Transformer)
 WIGIP-1 v2 is an experimental research model exploring **Vision Transformer (ViT) style architectures for text modeling**, implemented using **JAX + Flax** with **Fully Sharded Data Parallelism (FSDP)** via `pjit`.
@@ -78,5 +101,4 @@ This phase is intended to test whether **ViT-style inductive biases** can learn
 ## ⚠️ Disclaimer
 This is **research code** and an **experimental architecture**.
-Results are preliminary and **not production-ready**.

+---
+license: mit
+datasets:
+- allenai/c4
+language:
+- en
+pipeline_tag: text-generation
+tags:
+- jax
+- fsdp
+- vision-transformer
+- text-generation
+- experimental
+---
+# WIGIP-1 v2
 # WIGIP-1 v2
+> **⚠️ implementation & Training Scripts:**
+> The full source code, JAX training loops, and architecture definitions are available on my GitHub:
+> 🔗 **[Click here to view the Training Scripts on GitHub](https://github.com/nurric-ai/Wigip_v2_1.7B_ViT)**
+---
 ## Stage 1 – Text Pre-Training (ViT-Style Transformer)
 WIGIP-1 v2 is an experimental research model exploring **Vision Transformer (ViT) style architectures for text modeling**, implemented using **JAX + Flax** with **Fully Sharded Data Parallelism (FSDP)** via `pjit`.
 ## ⚠️ Disclaimer
 This is **research code** and an **experimental architecture**.
+Results are preliminary and **not production-ready**.