Update README.md
README.md
CHANGED
@@ -15,6 +15,8 @@ Jam is a GPT2-like model for research in fine-grained Java analysis. It is inten
 - The dataset used to train our model is our own dataset [jm52m dataset](https://huggingface.co/datasets/apcl/jm52m), which consists of the processed source code of 52 million Java methods.
 
 - We train the model on [training set](https://huggingface.co/datasets/apcl/jm52m/blob/main/train.bin) for 1 epoch, roughly 300,000 training iterations.
+
+- Our [GitHub repo](https://github.com/apcl-research/jam/blob/main) contains the code for re-training using the [raw data](https://huggingface.co/datasets/apcl/jm52m/blob/main/fundats-j1.pkl)
 
 | Hyperparameter | Description | Value |
 | ----------- | ----------- |------------|
@@ -28,7 +30,7 @@ Jam is a GPT2-like model for research in fine-grained Java analysis. It is inten
 |r | learning rate | 3e-5 |
 |y | weight decay | 1e-1 |
 
-We train our models using a single NVidia A5000 GPUs.
+We train our models using a single NVidia A5000 GPUs.
 
 ---
 ## Jam Projects