yuyuzhang committed on
Commit
678fc9c
·
verified ·
1 Parent(s): 0a7a24a

Update README.md

  1. README.md +4 -3
README.md CHANGED
@@ -7,9 +7,9 @@ license: mit
 ## Introduction
 We are thrilled to introduce Seed-Coder, a powerful, transparent, and parameter-efficient family of open-source code models at the 8B scale, featuring base, instruct, and reasoning variants. Seed-Coder contributes to promote the evolution of open code models through the following highlights.
 
-- Model-centric: Seed-Coder predominantly leverages LLMs instead of hand-crafted rules for code data filtering, minimizing manual effort in pretraining data construction.
-- Transparent: We openly share detailed insights into our model-centric data pipeline, including methods for curating GitHub data, commits data, and code-related web data.
-- Powerful: Seed-Coder achieves state-of-the-art performance among open-source models of comparable size across a diverse range of coding tasks.
+- **Model-centric:** Seed-Coder predominantly leverages LLMs instead of hand-crafted rules for code data filtering, minimizing manual effort in pretraining data construction.
+- **Transparent:*** We openly share detailed insights into our model-centric data pipeline, including methods for curating GitHub data, commits data, and code-related web data.
+- **Powerful:** Seed-Coder achieves state-of-the-art performance among open-source models of comparable size across a diverse range of coding tasks.
 
 <p align="center">
 <img width="100%" src="imgs/seed-coder_intro_performance.jpg">
@@ -96,6 +96,7 @@ print(output[0]["generated_text"])
 ## Evaluation
 
 Seed-Coder-8B-Base has been evaluated across a variety of code understanding and generation benchmarks.
+
 It demonstrates strong capabilities in:
 - Fluent and contextually appropriate code completion.
 - Reasoning about code structure and inferring missing logic.