Spaces:
Running
Running
wujun
commited on
Commit
·
18d999c
1
Parent(s):
8e64547
update link of gg_init in README.md
Browse files
README.md
CHANGED
|
@@ -12,7 +12,7 @@ short_description: The code of gg prior.
|
|
| 12 |
# Introduction
|
| 13 |
- Reference: It Takes a Good Model to Train a Good Model: Generalized Gaussian Priors for Optimized LLMs
|
| 14 |
- Authors: Jun Wu, Yirong Xiong, Jiangtao Wen, Yuxing Han
|
| 15 |
-
- Paper Link: [https://
|
| 16 |
|
| 17 |
This repository provides a complete implementation of the methods described in the corresponding paper. Specifically, we implement the Generalized Gaussian Initialization, DeepShape, and the RF8 floating-point format as proposed in the paper. Furthermore, we adapt and reproduce the BackSlash training algorithm, and incorporate it seamlessly into our framework based on generalized Gaussian priors.
|
| 18 |
|
|
|
|
| 12 |
# Introduction
|
| 13 |
- Reference: It Takes a Good Model to Train a Good Model: Generalized Gaussian Priors for Optimized LLMs
|
| 14 |
- Authors: Jun Wu, Yirong Xiong, Jiangtao Wen, Yuxing Han
|
| 15 |
+
- Paper Link: [https://arxiv.org/abs/2506.00486](https://arxiv.org/abs/2506.00486)
|
| 16 |
|
| 17 |
This repository provides a complete implementation of the methods described in the corresponding paper. Specifically, we implement the Generalized Gaussian Initialization, DeepShape, and the RF8 floating-point format as proposed in the paper. Furthermore, we adapt and reproduce the BackSlash training algorithm, and incorporate it seamlessly into our framework based on generalized Gaussian priors.
|
| 18 |
|