Update README.md
Browse files
README.md
CHANGED
|
@@ -73,7 +73,7 @@ This repository contains the research preview of **LongLLaMA, a large language m
|
|
| 73 |
|
| 74 |
LongLLaMA-Code is built upon the foundation of [Code Llama](https://huggingface.co/codellama/CodeLlama-7b-hf).
|
| 75 |
|
| 76 |
-
LongLLaMA-Code has **improved reasoning capabilities** compared to CodeLlama, in particular we improve **GSM8K math reasoning from 13% to 17.4
|
| 77 |
|
| 78 |
<p align="center" width="100%">
|
| 79 |
<img src="https://raw.githubusercontent.com/CStanKonrad/long_llama/main/assets/results.png" alt="LongLLaMA" style="width: 70%; min-width: 300px; display: block; margin: auto;">
|
|
|
|
| 73 |
|
| 74 |
LongLLaMA-Code is built upon the foundation of [Code Llama](https://huggingface.co/codellama/CodeLlama-7b-hf).
|
| 75 |
|
| 76 |
+
LongLLaMA-Code has **improved reasoning capabilities** compared to CodeLlama, in particular we improve **GSM8K math reasoning from 13% to 17.4% after just continued pre-training, no in-distribution fine-tuning.**.
|
| 77 |
|
| 78 |
<p align="center" width="100%">
|
| 79 |
<img src="https://raw.githubusercontent.com/CStanKonrad/long_llama/main/assets/results.png" alt="LongLLaMA" style="width: 70%; min-width: 300px; display: block; margin: auto;">
|