Update README.md
README.md
CHANGED
@@ -6,7 +6,7 @@ license: apache-2.0
 ## 1. Model Summary
 The **ReLLM-C1** model is a Large Language Model (LLM) specifically fine-tuned to act as a surrogate model for **single objective optimization** in computationally expensive optimization tasks.
 
-It serves as a core modeling component within the **R2SAEA** (Reinforced Relation Surrogate-Assisted Evolutionary Algorithm) framework. Unlike general-purpose LLMs, ReLLM-
+It serves as a core modeling component within the **R2SAEA** (Reinforced Relation Surrogate-Assisted Evolutionary Algorithm) framework. Unlike general-purpose LLMs, ReLLM-C1 is designed to seamlessly integrate with Evolutionary Algorithms (EAs). By leveraging structured prompt templates containing decision variables and objective data, the model can perform zero-shot relationship reasoning to evaluate and classify candidate solutions in multi-objective optimization scenarios.
 
 ## 2. Intended Use
 * **Primary Application:** Relational-based surrogate modeling in multi-objective Evolutionary Algorithms.
@@ -16,7 +16,7 @@ It serves as a core modeling component within the **R2SAEA** (Reinforced Relatio
 This model bridges the gap between **Large Language Models (LLMs)** and **Evolutionary Algorithms (EAs)**, addressing a critical bottleneck in the field of Surrogate-Assisted Evolutionary Algorithms (SAEAs):
 * **The Problem with Traditional SAEAs:** Conventional machine learning surrogate models (such as Gaussian Processes or Random Forests) require being retrained from scratch at every single generation using new evaluated data, which introduces massive computational overhead.
 * **Our Methodology:** Through the R2SAEA framework, we transform the relationship reasoning problem in optimization tasks into a **Reinforcement Learning (RL)** problem.
-* **Training Alignment:** ReLLM-
+* **Training Alignment:** ReLLM-C1 is trained using the **Group Relative Policy Optimization (GRPO)** algorithm. This aligns the LLM's reasoning capabilities directly with multi-objective optimization goals, granting it the ability to perform zero-shot classification across a wide range of unseen tasks. This eliminates the need for generation-by-generation retraining while significantly reducing the computational burden associated with using general-purpose LLMs.
 
 ## 4. GitHub Repository
 To utilize ReLLM-C1 effectively, it should be deployed alongside the **R2SAEA framework**, which handles prompt structuring and the evolutionary loop. The framework provides implementations in both **Python** (via pymoo) and **MATLAB** (via PlatEMO).
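For readers unfamiliar with relation-based surrogates, the idea the README describes (a prompt holding decision variables and objective data, followed by a classification of candidate solutions) might be sketched roughly as below. This is only an illustration under stated assumptions: the prompt wording, the `build_prompt` and `prescreen` helpers, and the `classify` callable standing in for a call to the model are all hypothetical and are not taken from the R2SAEA repository.

```python
from typing import Callable, List, Sequence, Tuple

# (decision variables, evaluated objective value); minimization is assumed.
Solution = Tuple[Sequence[float], float]


def build_prompt(evaluated: List[Solution],
                 a: Sequence[float],
                 b: Sequence[float]) -> str:
    """Assemble a pairwise-comparison prompt (hypothetical template)."""
    lines = ["Evaluated solutions (x -> f):"]
    for x, f in evaluated:
        lines.append(f"x={list(x)} -> f={f:.4f}")
    lines.append(f"Candidate A: x={list(a)}")
    lines.append(f"Candidate B: x={list(b)}")
    lines.append("Which candidate is likely better (lower f)? Answer A or B.")
    return "\n".join(lines)


def prescreen(candidates: List[Sequence[float]],
              evaluated: List[Solution],
              classify: Callable[[str], str]) -> Sequence[float]:
    """Return the candidate that wins the most pairwise comparisons.

    `classify` is a placeholder for the surrogate: it receives a prompt
    and returns a string starting with "A" or "B". No retraining happens
    between generations; only the prompt contents change.
    """
    wins = [0] * len(candidates)
    for i in range(len(candidates)):
        for j in range(i + 1, len(candidates)):
            prompt = build_prompt(evaluated, candidates[i], candidates[j])
            if classify(prompt).strip().upper().startswith("A"):
                wins[i] += 1
            else:
                wins[j] += 1
    best = max(range(len(candidates)), key=wins.__getitem__)
    return candidates[best]
```

In an evolutionary loop, only the candidate returned by `prescreen` would be sent to the expensive objective function, and the newly evaluated point would be appended to `evaluated` for the next generation's prompts.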