Link HF paper page

#1
by nielsr HF Staff - opened
Files changed (1)
  1. README.md +82 -11
README.md CHANGED
@@ -1,11 +1,11 @@
1
  ---
2
- tags:
3
- - code
4
  base_model:
5
  - TechxGenus/CursorCore-Yi-1.5B-SR
6
  library_name: transformers
7
- pipeline_tag: text-generation
8
  license: apache-2.0
9
  ---
10
 
11
  # CursorCore: Assist Programming through Aligning Anything
@@ -15,14 +15,15 @@ license: apache-2.0
15
  <a href="https://hf.co/papers/2410.07002">[🤗HF Paper]</a> |
16
  <a href="https://huggingface.co/collections/TechxGenus/cursorcore-series-6706618c38598468866b60e2">[🤖Models]</a> |
17
  <a href="https://github.com/TechxGenus/CursorCore">[🛠️Code]</a> |
18
- <a href="https://github.com/TechxGenus/CursorWeb">[Web]</a> |
19
- <a href="https://discord.gg/Z5Tev8fV">[Discord]</a>
20
  </p>
21
 
22
  <hr>
23
 
24
  - [CursorCore: Assist Programming through Aligning Anything](#cursorcore-assist-programming-through-aligning-anything)
25
  - [Introduction](#introduction)
26
  - [Models](#models)
27
  - [Usage](#usage)
28
  - [1) Normal chat](#1-normal-chat)
@@ -30,6 +31,7 @@ license: apache-2.0
30
  - [3) Web Demo](#3-web-demo)
31
  - [Future Work](#future-work)
32
  - [Citation](#citation)
33
  - [Contribution](#contribution)
34
 
35
  <hr>
@@ -44,10 +46,38 @@ CursorCore is a series of open-source models designed for AI-assisted programmin
44
 
45
  ![CursorWeb](https://raw.githubusercontent.com/TechxGenus/CursorCore/main/pictures/CursorWeb.gif)
46
 
47
  ## Models
48
 
49
  Our models have been open-sourced on Hugging Face. You can access them here: [CursorCore-Series](https://huggingface.co/collections/TechxGenus/cursorcore-series-6706618c38598468866b60e2). We also provide pre-quantized weights for GPTQ and AWQ here: [CursorCore-Quantization](https://huggingface.co/collections/TechxGenus/cursorcore-quantization-67066431f29f252494ee8cf3).
50
 
51
  ## Usage
52
 
53
  Here are some examples of how to use our model:
@@ -114,13 +144,27 @@ sample = {
114
  {
115
  "type": "code",
116
  "lang": "python",
117
- "code": """def quick_sort(arr):\n if len(arr) <= 1:\n return arr\n pivot = arr[len(arr) // 2]\n left = [x for x in arr if x < pivot]\n middle = [x for x in arr if x == pivot]\n right = [x for x in arr if x > pivot]\n return quick_sort(left) + middle + quick_sort(right)"""
118
  }
119
  ],
120
  "current": {
121
  "type": "code",
122
  "lang": "python",
123
- "code": """def quick_sort(array):\n if len(arr) <= 1:\n return arr\n pivot = arr[len(arr) // 2]\n left = [x for x in arr if x < pivot]\n middle = [x for x in arr if x == pivot]\n right = [x for x in arr if x > pivot]\n return quick_sort(left) + middle + quick_sort(right)"""
124
  },
125
  "user": ""
126
  }
@@ -202,7 +246,14 @@ sample = {
202
  "current": {
203
  "type": "code",
204
  "lang": "python",
205
- "code": """def quick_sort(array):\n if len(arr) <= 1:\n return arr\n pivot = arr[len(arr) // 2]\n left = [x for x in arr if x < pivot]\n middle = [x for x in arr if x == pivot]\n right = [x for x in arr if x > pivot]\n return quick_sort(left) + middle + quick_sort(right)"""
206
  },
207
  "user": "Add Docstring."
208
  }
@@ -273,7 +324,14 @@ sample = {
273
  "current": {
274
  "type": "code",
275
  "lang": "python",
276
- "code": """def quick_sort(array):\n if len(arr) <= 1:\n return arr\n pivot = arr[len(arr) // 2]\n left = [x for x in arr if x < pivot]\n middle = [x for x in arr if x == pivot]\n right = [x for x in arr if x > pivot]\n return quick_sort(left) + middle + quick_sort(right)"""
277
  },
278
  "user": "Add Docstring."
279
  }
@@ -342,7 +400,14 @@ sample = {
342
  "current": {
343
  "type": "code",
344
  "lang": "python",
345
- "code": """def quick_sort(array):\n if len(arr) <= 1:\n return arr\n pivot = arr[len(arr) // 2]\n left = [x for x in arr if x < pivot]\n middle = [x for x in arr if x == pivot]\n right = [x for x in arr if x > pivot]\n return quick_sort(left) + middle + quick_sort(right)"""
346
  },
347
  "user": "Add Docstring."
348
  }
@@ -412,6 +477,12 @@ CursorCore is still in a very early stage, and lots of work is needed to achieve
412
  }
413
  ```
414
 
415
  ## Contribution
416
 
417
- Contributions are welcome! If you find any bugs or have suggestions for improvements, please open an issue or submit a pull request.
 
1
  ---
2
  base_model:
3
  - TechxGenus/CursorCore-Yi-1.5B-SR
4
  library_name: transformers
5
  license: apache-2.0
6
+ pipeline_tag: text-generation
7
+ tags:
8
+ - code
9
  ---
10
 
11
  # CursorCore: Assist Programming through Aligning Anything
 
15
  <a href="https://hf.co/papers/2410.07002">[🤗HF Paper]</a> |
16
  <a href="https://huggingface.co/collections/TechxGenus/cursorcore-series-6706618c38598468866b60e2">[🤖Models]</a> |
17
  <a href="https://github.com/TechxGenus/CursorCore">[🛠️Code]</a> |
18
+ <a href="https://github.com/TechxGenus/CursorWeb">[<img src="https://github.com/TechxGenus/CursorCore/blob/main/pictures/cursorcore.png" width="12.5px">Web]</a> |
19
+ <a href="https://discord.gg/Z5Tev8fV">[<img src="https://github.com/TechxGenus/CursorCore/blob/main/pictures/discord.png" width="15px">Discord]</a>
20
  </p>
21
 
22
  <hr>
23
 
24
  - [CursorCore: Assist Programming through Aligning Anything](#cursorcore-assist-programming-through-aligning-anything)
25
  - [Introduction](#introduction)
26
+ - [Structure](#structure)
27
  - [Models](#models)
28
  - [Usage](#usage)
29
  - [1) Normal chat](#1-normal-chat)
 
31
  - [3) Web Demo](#3-web-demo)
32
  - [Future Work](#future-work)
33
  - [Citation](#citation)
34
+ - [Acknowledgements](#acknowledgements)
35
  - [Contribution](#contribution)
36
 
37
  <hr>
 
46
 
47
  ![CursorWeb](https://raw.githubusercontent.com/TechxGenus/CursorCore/main/pictures/CursorWeb.gif)
48
 
49
+ ## Structure
50
+
51
+ - `./benchmark` contains the APEval benchmark
52
+ - `./data` contains code to preprocess datasets
53
+ - `./eval` contains code to evaluate models
54
+ - `./gen` contains code to prompt LLMs for generation
55
+ - `./generic` contains common functions, tools and special tokens
56
+ - `./src` contains code for Programming-Instruct
57
+ - `./train` contains code for training CursorCore
58
+
59
+ Please ensure all dependencies are installed using the following command:
60
+
61
+ ```bash
62
+ pip install -r requirements.txt
63
+ ```
64
+
65
+ We also use [flash-attention](https://github.com/Dao-AILab/flash-attention) for efficient training and [flashinfer](https://github.com/flashinfer-ai/flashinfer) to accelerate inference. See their documentation for installation instructions.
66
+
67
  ## Models
68
 
69
  Our models have been open-sourced on Hugging Face. You can access them here: [CursorCore-Series](https://huggingface.co/collections/TechxGenus/cursorcore-series-6706618c38598468866b60e2). We also provide pre-quantized weights for GPTQ and AWQ here: [CursorCore-Quantization](https://huggingface.co/collections/TechxGenus/cursorcore-quantization-67066431f29f252494ee8cf3).
70
 
71
+ We use the manually written benchmark APEval to assess the model's ability to assist programming. We also use [EvalPlus](https://github.com/evalplus/evalplus), [CanItEdit](https://github.com/nuprl/CanItEdit) and [OctoPack](https://github.com/bigcode-project/octopack) to evaluate the model's performance in Python program generation, instructional code editing, and automated program repair. Because CursorCore uses a custom conversation template, its generation method differs significantly from that of both instruct models and base models. Please refer to [our paper](http://arxiv.org/abs/2410.07002) for more details.
72
+
73
+ Evaluation results on APEval:
74
+
75
+ <img src="https://github.com/TechxGenus/CursorCore/blob/main/pictures/APEval.png" alt="APEval" width="75%"/>
76
+
77
+ Evaluation results on EvalPlus, CanItEdit and OctoPack:
78
+
79
+ <img src="https://github.com/TechxGenus/CursorCore/blob/main/pictures/EvalPlus_CanItEdit_OctoPack.png" alt="EvalPlus_CanItEdit_OctoPack" width="75%">
80
+
81
  ## Usage
82
 
83
  Here are some examples of how to use our model:
 
144
  {
145
  "type": "code",
146
  "lang": "python",
147
+ "code": """def quick_sort(arr):
148
+     if len(arr) <= 1:
149
+         return arr
150
+     pivot = arr[len(arr) // 2]
151
+     left = [x for x in arr if x < pivot]
152
+     middle = [x for x in arr if x == pivot]
153
+     right = [x for x in arr if x > pivot]
154
+     return quick_sort(left) + middle + quick_sort(right)"""
155
  }
156
  ],
157
  "current": {
158
  "type": "code",
159
  "lang": "python",
160
+ "code": """def quick_sort(array):
161
+     if len(arr) <= 1:
162
+         return arr
163
+     pivot = arr[len(arr) // 2]
164
+     left = [x for x in arr if x < pivot]
165
+     middle = [x for x in arr if x == pivot]
166
+     right = [x for x in arr if x > pivot]
167
+     return quick_sort(left) + middle + quick_sort(right)"""
168
  },
169
  "user": ""
170
  }
 
246
  "current": {
247
  "type": "code",
248
  "lang": "python",
249
+ "code": """def quick_sort(array):
250
+     if len(arr) <= 1:
251
+         return arr
252
+     pivot = arr[len(arr) // 2]
253
+     left = [x for x in arr if x < pivot]
254
+     middle = [x for x in arr if x == pivot]
255
+     right = [x for x in arr if x > pivot]
256
+     return quick_sort(left) + middle + quick_sort(right)"""
257
  },
258
  "user": "Add Docstring."
259
  }
 
324
  "current": {
325
  "type": "code",
326
  "lang": "python",
327
+ "code": """def quick_sort(array):
328
+     if len(arr) <= 1:
329
+         return arr
330
+     pivot = arr[len(arr) // 2]
331
+     left = [x for x in arr if x < pivot]
332
+     middle = [x for x in arr if x == pivot]
333
+     right = [x for x in arr if x > pivot]
334
+     return quick_sort(left) + middle + quick_sort(right)"""
335
  },
336
  "user": "Add Docstring."
337
  }
 
400
  "current": {
401
  "type": "code",
402
  "lang": "python",
403
+ "code": """def quick_sort(array):
404
+     if len(arr) <= 1:
405
+         return arr
406
+     pivot = arr[len(arr) // 2]
407
+     left = [x for x in arr if x < pivot]
408
+     middle = [x for x in arr if x == pivot]
409
+     right = [x for x in arr if x > pivot]
410
+     return quick_sort(left) + middle + quick_sort(right)"""
411
  },
412
  "user": "Add Docstring."
413
  }
 
477
  }
478
  ```
479
 
480
+ ## Acknowledgements
481
+
482
+ The open-source community has been of great help to us, and we reference numerous projects and applications. They include but are not limited to:
483
+
484
+ [Deepseek-Coder](https://github.com/deepseek-ai/DeepSeek-Coder), [Yi-Coder](https://github.com/01-ai/Yi-Coder), [Qwen-Coder](https://github.com/QwenLM/Qwen2.5-Coder), [Self-Instruct](https://github.com/yizhongw/self-instruct), [Evol-Instruct](https://github.com/theblackcat102/evol-dataset), [OSS-Instruct](https://github.com/ise-uiuc/magicoder), [EvalPlus](https://github.com/evalplus/evalplus), [CanItEdit](https://github.com/nuprl/CanItEdit), [OctoPack](https://github.com/bigcode-project/octopack), [Aider](https://github.com/Aider-AI/aider), [Continue](https://github.com/continuedev/continue), [Cursor](https://github.com/getcursor/cursor), ...
485
+
486
  ## Contribution
487
 
488
+ Contributions are welcome! If you find any bugs or have suggestions for improvements, please open an issue or submit a pull request.
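
As a minimal sketch of what the `"code"` hunks in this diff change: a triple-quoted Python string written with `\n` escapes and the same string written across several lines compare equal, so the rewrite should affect readability only. The snippet also assembles a `sample` dict in the shape the Usage hunks show. The helper `make_sample` is hypothetical and not part of the CursorCore repository, the 4-space indentation is assumed, and the list key `history` is an assumption because that key is not visible in the hunks.

```python
# Old style: one line, newlines written as escapes (4-space indent assumed).
ESCAPED = """def quick_sort(arr):\n    if len(arr) <= 1:\n        return arr"""

# New style: the same text spread over several lines.
MULTILINE = """def quick_sort(arr):
    if len(arr) <= 1:
        return arr"""


def make_sample(history_codes, current_code, user=""):
    """Hypothetical helper: build a sample dict shaped like the README's.

    The "history" key name is an assumption; only "current" and "user"
    are visible in the diff hunks.
    """
    return {
        "history": [
            {"type": "code", "lang": "python", "code": code}
            for code in history_codes
        ],
        "current": {"type": "code", "lang": "python", "code": current_code},
        "user": user,
    }


if __name__ == "__main__":
    # Both spellings produce the identical string object contents.
    print(ESCAPED == MULTILINE)

    # Mirror the README: the current snippet carries an in-progress rename
    # of the parameter (arr -> array) that the body has not caught up with.
    current = MULTILINE.replace("(arr)", "(array)", 1)
    sample = make_sample([MULTILINE], current, user="")
    print(sample["current"]["code"].splitlines()[0])
```

Note that the mismatch between `array` in the signature and `arr` in the body of the `current` snippet reads as a deliberate in-progress edit relative to the history entry, not as a typo to correct.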