Upload gpt_modern_1b_class.script.pt
Browse filesJiRackPyTorch 1B Model Definition
FIXED: Implemented numerical stability improvements (FP32 Attention, better weight initialization)
FIXED: Corrected gradient checkpointing usage.
FIXED: Added Dropout layers.
FIXED: Auto-detect device for RoPE buffer handling.
gpt_modern_1b_class.script.pt
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:13c3b7609d8e485c42897e73243a42e9461ae75e3340e2d7be4a8761c216b6de
|
| 3 |
+
size 4715746362
|