|
|
--- |
|
|
datasets: |
|
|
- Alexhf825/dataset-AutoMR-pangu |
|
|
language: |
|
|
- en |
|
|
metrics: |
|
|
- accuracy |
|
|
base_model: |
|
|
- FreedomIntelligence/openPangu-Embedded-7B |
|
|
--- |
|
|
|
|
|
|
|
|
# AutoMR for Pangu |
|
|
|
|
|
This project equips the Pangu model with the AutoMR reasoning framework, optimized for Huawei Ascend hardware. |
|
|
|
|
|
## π³ Project Structure |
|
|
|
|
|
``` |
|
|
. |
|
|
βββ AMC.SH |
|
|
βββ automr |
|
|
βΒ Β βββ config.py |
|
|
βΒ Β βββ dag.py |
|
|
βΒ Β βββ data_loader.py |
|
|
βΒ Β βββ evaluator.py |
|
|
βΒ Β βββ __init__.py |
|
|
βΒ Β βββ model.py |
|
|
βΒ Β βββ strategies.py |
|
|
βΒ Β βββ trainer.py |
|
|
βΒ Β βββ utils.py |
|
|
βββ checkpoints |
|
|
βΒ Β βββ MATH |
|
|
βββ embedder_server.sh |
|
|
βββ generator_server.sh |
|
|
βββ main.py |
|
|
βββ math_train.sh |
|
|
βββ processed_data |
|
|
Β Β βββ AMC |
|
|
Β Β βββ MATH |
|
|
``` |
|
|
|
|
|
## π§ 1. Installation |
|
|
|
|
|
### a. Clone the Project Repository |
|
|
|
|
|
```bash |
|
|
hf download Alexhf825/AutoMR-pangu --local-dir AutoMR-pangu |
|
|
cd AutoMR-pangu |
|
|
``` |
|
|
|
|
|
### b. Install Dependencies |
|
|
``` |
|
|
``` |
|
|
|
|
|
### c. Download Datasets |
|
|
|
|
|
This command will download the datasets and place them in the `./processed_data` directory, matching the project structure. |
|
|
|
|
|
```bash |
|
|
hf download Alexhf825/dataset-AutoMR-pangu --repo-type=dataset --local-dir=./ |
|
|
``` |
|
|
|
|
|
## π 2. Start the Servers |
|
|
|
|
|
This project requires two services running in an OpenAI-API style. Please run the following commands in **two separate terminal sessions**. |
|
|
|
|
|
**Start the Embedder Server:** |
|
|
|
|
|
```bash |
|
|
bash embedder_server.sh |
|
|
``` |
|
|
|
|
|
**Start the Generator Server:** |
|
|
|
|
|
```bash |
|
|
bash generator_server.sh |
|
|
``` |
|
|
|
|
|
## π 3. Run Evaluation |
|
|
|
|
|
A pre-trained checkpoint (`MATH`) is provided. You can directly evaluate the model on the `AMC` dataset using the following command: |
|
|
|
|
|
```bash |
|
|
bash AMC.sh |
|
|
``` |
|
|
|
|
|
## ποΈ 4. Run Training |
|
|
|
|
|
You can also train the model from scratch on the `MATH` dataset by running: |
|
|
|
|
|
```bash |
|
|
bash math_train.sh |
|
|
``` |