<p><em>LLM-BEM-Engineer for automated editing.</em></p>
</div>

## 📁 LLM-BEM-Engineer Benchmark Dataset

This benchmark dataset is designed to evaluate the capability of **Large Language Models (LLMs)** in generating **Building Energy Models (BEMs)** from natural language descriptions.

The benchmark focuses on two essential aspects of real-world applicability:

- **Scalability**: The ability of LLMs to handle a wide range of building configurations and system complexities.
- **Robustness**: The ability of LLMs to correctly infer user intent under noisy, ambiguous, or incomplete inputs.
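
To make the two categories concrete, here is a purely hypothetical illustration of the contrast between a well-specified prompt and a noisy, high-level one. These strings are invented for illustration and are **not** actual entries from the benchmark dataset:

```python
# Hypothetical examples of the two prompt styles evaluated by the benchmark.
# These are illustrative only -- NOT actual benchmark entries.
prompt_styles = {
    # Scalability (detailed_prompt_test): well-specified, parameter-rich input
    "detailed": (
        "Model a 3-story office building, 1500 m2 per floor, "
        "window-to-wall ratio 0.4, with a VAV system, in a cold climate."
    ),
    # Robustness (robust_prompt_test): noisy, high-level, underspecified input
    "robust": "make me an energy model for a smallish office, somewhere cold",
}

for style, prompt in prompt_styles.items():
    print(f"{style}: {prompt}")
```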

The benchmark consists of **two complementary test sets**, each designed to evaluate a different capability of LLMs in automated building energy model generation.

| Dataset | Purpose | Description |
|---------|---------|-------------|
| `detailed_prompt_test` | Scalability benchmark | Well-specified and detailed building modeling prompts |
| `robust_prompt_test` | Robustness benchmark | Noisy and high-level user input prompts |

For details, please refer to the *LLM-BEM-Engineer Benchmark* folder in this repository.
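
As a rough sketch, the two test sets could be iterated over like this. The folder and file layout assumed below (one subfolder per test set, plain-text prompt files) is an assumption for illustration, not the repository's documented structure; adjust the paths to the actual contents of the *LLM-BEM-Engineer Benchmark* folder:

```python
from pathlib import Path

# Assumed layout: "LLM-BEM-Engineer Benchmark/<test_set>/<name>.txt".
# This layout is a guess -- check the repository's benchmark folder.
BENCHMARK_DIR = Path("LLM-BEM-Engineer Benchmark")

def load_prompts(test_set: str) -> dict:
    """Return a {prompt_name: prompt_text} mapping for one test set."""
    test_dir = BENCHMARK_DIR / test_set
    prompts = {}
    if test_dir.is_dir():
        for path in sorted(test_dir.glob("*.txt")):  # assumed plain-text prompts
            prompts[path.stem] = path.read_text(encoding="utf-8")
    return prompts

for test_set in ("detailed_prompt_test", "robust_prompt_test"):
    print(f"{test_set}: {len(load_prompts(test_set))} prompts")
```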

## 🚀 Quick Start

The following code snippet shows how to run LLM-BEM-Engineer.