updated model summary
README.md
CHANGED
@@ -26,13 +26,13 @@ widget:
 
 ## Model Summary
 
-The
+The Narrow Transformer (NT) model NT-Java-1.1B is an open-source, specialized code model built on StarCoderBase and designed for Java code completion. It is a decoder-only transformer with Multi-Query Attention and learned absolute positional embeddings, fine-tuned on the Java subset of the training data (starcoderdata), roughly 22B tokens, with a context length of 8192 tokens.
 
-- **Repository:**
+- **Repository:** [bigcode/Megatron-LM](https://github.com/bigcode-project/Megatron-LM)
 - **Project Website:**
 - **Paper:**
 - **Point of Contact:**
-- **Languages:**
+- **Languages:** Java
 
 ## Use
 