Update README.md
Browse files
README.md
CHANGED
|
@@ -19,10 +19,10 @@ The Ascend KernelGen Technical Report is published at https://arxiv.org/abs/2601
|
|
| 19 |
|
| 20 |
Our framework, **Ascend KernelGen (AKGen)**, bridges the gap between general-purpose code generation and hardware-specific programming through a closed-loop system of data construction, training, and evaluation. Key innovations include:
|
| 21 |
|
| 22 |
-
*
|
| 23 |
-
*
|
| 24 |
-
*
|
| 25 |
-
*
|
| 26 |
|
| 27 |
## Citation
|
| 28 |
@article{cao2026ascendkernelgen,
|
|
|
|
| 19 |
|
| 20 |
Our framework, **Ascend KernelGen (AKGen)**, bridges the gap between general-purpose code generation and hardware-specific programming through a closed-loop system of data construction, training, and evaluation. Key innovations include:
|
| 21 |
|
| 22 |
+
* **Ascend-CoT Dataset:** A high-quality, domain-specific dataset incorporating **Chain-of-Thought (CoT)** reasoning. It combines documentation-based reasoning, code-centric reasoning derived from real-world kernel implementations, and general reasoning chains to capture the structured logic required for low-level NPU programming.
|
| 23 |
+
* **Domain-Adaptive Post-Training:** A two-stage optimization process. We first employ **Supervised Fine-Tuning (SFT)** with error-derived supervision (correcting API misuse and numerical errors). This is followed by **Reinforcement Learning (RL)** using Direct Preference Optimization (DPO), driven by execution-based correctness and performance signals.
|
| 24 |
+
* **Hardware-Grounded Evaluation:** Validated using **NPUKernelBench**, a comprehensive benchmark that assesses compilation success, functional correctness, and performance (latency) on real Ascend hardware across varying complexity levels.
|
| 25 |
+
* **Performance:** The model demonstrates siginificant improvement on complex Level-2 kernels compared to baselines, and effectively solving tasks where general-purpose models (like Qwen3, Llama3.1) fail completely.
|
| 26 |
|
| 27 |
## Citation
|
| 28 |
@article{cao2026ascendkernelgen,
|