We have compared our model with the following models:

- GPT 4o mini

On the following parameters:

- **Compilation(%)** - Percentage of generated contracts that compile successfully without modification.
- **OpenZeppelin Compliance(%)** - Adherence to OpenZeppelin library usage and standards.
- **Gas Efficiency(%)** - Degree of gas optimization based on Slither’s suggestions.
- **Security(%)** - Percentage of code free from common vulnerabilities detected by Slither.
- **Average Lines of Code** - Average number of non-empty lines, comments included, in generated contracts, indicating verbosity or conciseness.
- **Correctness (OpenAI Evaluation)** - GPT-4o Mini-assessed alignment of generated code with prompt using a structured correctness rubric.
- **Correctness (Human Evaluation)** - Expert-reviewed rating of how well the generated contract fulfills the original prompt and intent.

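To make the aggregation concrete, here is a minimal sketch of how per-contract results could be rolled up into the percentage metrics above. This is illustrative only: the `ContractResult` record, its fields, and the `uses_openzeppelin` heuristic are hypothetical and not part of this repository; real compliance and security checks would come from the actual compiler output and Slither reports.

```python
# Hypothetical aggregation of the evaluation metrics described above.
from dataclasses import dataclass
from typing import Dict, List

@dataclass
class ContractResult:
    source: str            # generated Solidity source code
    compiled: bool         # True if the compiler accepted it unmodified
    slither_findings: int  # vulnerabilities reported by Slither

def count_loc(source: str) -> int:
    # Non-empty lines, comments included ("Average Lines of Code").
    return sum(1 for line in source.splitlines() if line.strip())

def uses_openzeppelin(source: str) -> bool:
    # Naive proxy for OpenZeppelin compliance: imports an OZ module.
    return "@openzeppelin/" in source

def summarize(results: List[ContractResult]) -> Dict[str, float]:
    n = len(results)
    return {
        "Compilation(%)": 100.0 * sum(r.compiled for r in results) / n,
        "OpenZeppelin Compliance(%)":
            100.0 * sum(uses_openzeppelin(r.source) for r in results) / n,
        "Security(%)": 100.0 * sum(r.slither_findings == 0 for r in results) / n,
        "Average Lines of Code": sum(count_loc(r.source) for r in results) / n,
    }

# Toy run on two hypothetical generations:
ok = ContractResult(
    'import "@openzeppelin/contracts/token/ERC20/ERC20.sol";\n\n'
    "contract T is ERC20 {}",
    compiled=True, slither_findings=0)
bad = ContractResult("contract U {\n  // placeholder\n}",
                     compiled=False, slither_findings=2)
metrics = summarize([ok, bad])
```

In the toy run, each metric comes out to 50% (one of the two contracts passes each check) with an average of 2.5 non-empty lines per contract.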
## Benchmark

Below is a figure summarizing the performance of each model across the four evaluation metrics.