Septend commited on
Commit
34bdfa7
·
verified ·
1 Parent(s): 762f241

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -6,7 +6,7 @@ license: apache-2.0
6
  ## 1. Model Summary
7
  The **ReLLM-C1** model is a Large Language Model (LLM) specifically fine-tuned to act as a surrogate model for **single objective optimization** in computationally expensive optimization tasks.
8
 
9
- It serves as a core modeling component within the **R2SAEA** (Reinforced Relation Surrogate-Assisted Evolutionary Algorithm) framework. Unlike general-purpose LLMs, ReLLM-C2 is designed to seamlessly integrate with Evolutionary Algorithms (EAs). By leveraging structured prompt templates containing decision variables and objective data, the model can perform zero-shot relationship reasoning to evaluate and classify candidate solutions in multi-objective optimization scenarios.
10
 
11
  ## 2. Intended Use
12
  * **Primary Application:** Relational-based surrogate modeling in multi-objective Evolutionary Algorithms.
@@ -16,7 +16,7 @@ It serves as a core modeling component within the **R2SAEA** (Reinforced Relatio
16
  This model bridges the gap between **Large Language Models (LLMs)** and **Evolutionary Algorithms (EAs)**, addressing a critical bottleneck in the field of Surrogate-Assisted Evolutionary Algorithms (SAEAs):
17
  * **The Problem with Traditional SAEAs:** Conventional machine learning surrogate models (such as Gaussian Processes or Random Forests) require being retrained from scratch at every single generation using new evaluated data, which introduces massive computational overhead.
18
  * **Our Methodology:** Through the R2SAEA framework, we transform the relationship reasoning problem in optimization tasks into a **Reinforcement Learning (RL)** problem.
19
- * **Training Alignment:** ReLLM-C2 is trained using the **Group Relative Policy Optimization (GRPO)** algorithm. This aligns the LLM's reasoning capabilities directly with multi-objective optimization goals, granting it the ability to perform zero-shot classification across a wide range of unseen tasks. This eliminates the need for generation-by-generation retraining while significantly reducing the computational burden associated with using general-purpose LLMs.
20
 
21
  ## 4. GitHub Repository
22
  To utilize ReLLM-C1 effectively, it should be deployed alongside the **R2SAEA framework**, which handles prompt structuring and the evolutionary loop. The framework provides implementations in both **Python** (via pymoo) and **MATLAB** (via PlatEMO).
 
6
  ## 1. Model Summary
7
  The **ReLLM-C1** model is a Large Language Model (LLM) specifically fine-tuned to act as a surrogate model for **single objective optimization** in computationally expensive optimization tasks.
8
 
9
+ It serves as a core modeling component within the **R2SAEA** (Reinforced Relation Surrogate-Assisted Evolutionary Algorithm) framework. Unlike general-purpose LLMs, ReLLM-C1 is designed to seamlessly integrate with Evolutionary Algorithms (EAs). By leveraging structured prompt templates containing decision variables and objective data, the model can perform zero-shot relationship reasoning to evaluate and classify candidate solutions in multi-objective optimization scenarios.
10
 
11
  ## 2. Intended Use
12
  * **Primary Application:** Relational-based surrogate modeling in multi-objective Evolutionary Algorithms.
 
16
  This model bridges the gap between **Large Language Models (LLMs)** and **Evolutionary Algorithms (EAs)**, addressing a critical bottleneck in the field of Surrogate-Assisted Evolutionary Algorithms (SAEAs):
17
  * **The Problem with Traditional SAEAs:** Conventional machine learning surrogate models (such as Gaussian Processes or Random Forests) require being retrained from scratch at every single generation using new evaluated data, which introduces massive computational overhead.
18
  * **Our Methodology:** Through the R2SAEA framework, we transform the relationship reasoning problem in optimization tasks into a **Reinforcement Learning (RL)** problem.
19
+ * **Training Alignment:** ReLLM-C1 is trained using the **Group Relative Policy Optimization (GRPO)** algorithm. This aligns the LLM's reasoning capabilities directly with multi-objective optimization goals, granting it the ability to perform zero-shot classification across a wide range of unseen tasks. This eliminates the need for generation-by-generation retraining while significantly reducing the computational burden associated with using general-purpose LLMs.
20
 
21
  ## 4. GitHub Repository
22
  To utilize ReLLM-C1 effectively, it should be deployed alongside the **R2SAEA framework**, which handles prompt structuring and the evolutionary loop. The framework provides implementations in both **Python** (via pymoo) and **MATLAB** (via PlatEMO).