Text Generation
PEFT
Safetensors
GGUF
English
materialsanalyst-ai-7b
MaterialsAnalyst-AI-7B
materials-science
computational-materials
materials-analysis
chain-of-thought
reasoning-model
property-prediction
materials-discovery
crystal-structure
materials-informatics
scientific-ai
7b
quantized
fine-tuned
lora
json-mode
structured-output
materials-engineering
band-gap-prediction
computational-chemistry
materials-characterization
Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,151 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
---
|
| 4 |
+
# Introducing MaterialsAnalyst-AI-7B:
|
| 5 |
+
|
| 6 |
+
A specialized **open-source** AI model designed to assist materials scientists and researchers in **comprehensive analysis** and interpretation of materials data. Built on Qwen 2.5 Instruct 7B and fine-tuned with LoRA (Low-Rank Adaptation), MaterialsAnalyst-AI-7B is optimized to **analyze materials properties** and provide clear, actionable insights from complex materials databases.
|
| 7 |
+
|
| 8 |
+
## How It Works
|
| 9 |
+
|
| 10 |
+
The process is *beautifully* simple:
|
| 11 |
+
|
| 12 |
+
1. You input materials data (JSON format with properties, structure, and characteristics)
|
| 13 |
+
2. The model engages in chain-of-thought reasoning about the material's properties
|
| 14 |
+
3. You receive a structured, comprehensive analysis with practical applications
|
| 15 |
+
|
| 16 |
+
## Features
|
| 17 |
+
|
| 18 |
+
MaterialsAnalyst-AI-7B offers a comprehensive suite of capabilities tailored specifically for materials analysis:
|
| 19 |
+
|
| 20 |
+
* **Dual-Output Structure**: Provides both detailed chain-of-thought reasoning tokens and concise answer tokens
|
| 21 |
+
* **Multi-Property Analysis**: Trained on diverse materials properties including electronic, mechanical, thermal, structural, and magnetic characteristics
|
| 22 |
+
* **Crystal Structure Interpretation**: Excels at analyzing space groups, crystal systems, and structural relationships
|
| 23 |
+
* **Property Correlation**: Identifies relationships between different material properties and their implications
|
| 24 |
+
* **Application Prediction**: Suggests practical applications based on material characteristics
|
| 25 |
+
* **Stability Assessment**: Evaluates thermodynamic and structural stability indicators
|
| 26 |
+
* **Performance Benchmarking**: Compares materials against industry standards and competing materials
|
| 27 |
+
* **Materials Database Integration**: Optimized for standard materials database formats (Materials Project, AFLOW, etc.)
|
| 28 |
+
* **Structured Output Format**: Consistently delivers well-organized, hierarchical materials analysis with clear section delineation
|
| 29 |
+
|
| 30 |
+
## Use Cases
|
| 31 |
+
|
| 32 |
+
MaterialsAnalyst-AI-7B serves as a valuable tool for:
|
| 33 |
+
|
| 34 |
+
* **Materials scientists and engineers** needing comprehensive property analysis
|
| 35 |
+
* **Graduate students and researchers** learning materials characterization
|
| 36 |
+
* **R&D teams** screening materials for specific applications
|
| 37 |
+
* **Academic researchers** analyzing large materials datasets
|
| 38 |
+
* **Industry professionals** evaluating material selection for products
|
| 39 |
+
* **Database curators** ensuring comprehensive materials documentation
|
| 40 |
+
* **Computational materials scientists** interpreting simulation results
|
| 41 |
+
* **Materials informatics researchers** building automated analysis pipelines
|
| 42 |
+
|
| 43 |
+
## See It In Action:
|
| 44 |
+
|
| 45 |
+
Input Example:
|
| 46 |
+
|
| 47 |
+
```json
|
| 48 |
+
{
|
| 49 |
+
"material_id": "mp-8062",
|
| 50 |
+
"formula": "SiC",
|
| 51 |
+
"elements": [
|
| 52 |
+
"Si",
|
| 53 |
+
"C"
|
| 54 |
+
],
|
| 55 |
+
"spacegroup": "P63mc",
|
| 56 |
+
"band_gap": 3.26,
|
| 57 |
+
"formation_energy_per_atom": -0.73,
|
| 58 |
+
"density": 3.21,
|
| 59 |
+
"volume": 41.2,
|
| 60 |
+
"nsites": 8,
|
| 61 |
+
"is_stable": true,
|
| 62 |
+
"elastic_modulus": 448,
|
| 63 |
+
"bulk_modulus": 220,
|
| 64 |
+
"thermal_expansion": 4.2e-06,
|
| 65 |
+
"electron_affinity": 4.0,
|
| 66 |
+
"ionization_energy": 6.7,
|
| 67 |
+
"crystal_system": "Hexagonal",
|
| 68 |
+
"magnetic_property": "Non-magnetic",
|
| 69 |
+
"thermal_conductivity": 490,
|
| 70 |
+
"specific_heat": 0.69,
|
| 71 |
+
"is_superconductor": false,
|
| 72 |
+
"band_gap_type": "Indirect"
|
| 73 |
+
}
|
| 74 |
+
```
|
| 75 |
+
Output Example:
|
| 76 |
+
|
| 77 |
+
The model produces two key components:
|
| 78 |
+
|
| 79 |
+
### The Thinking Process
|
| 80 |
+
```markdown
|
| 81 |
+
<think>
|
| 82 |
+
I need to think through how to interpret this SiC entry and assess its characteristics.
|
| 83 |
+
First, I should examine the composition and structure. This material contains silicon (Si) and carbon (C) in a hexagonal crystal structure with space group P63mc. This structure suggests high symmetry and anisotropic properties, typical for wide-bandgap materials like SiC.
|
| 84 |
+
Next, I should analyze the electronic properties. With a band gap of 3.26 eV that is indirect, this material would be classified as a wide-bandgap semiconductor. This suggests applications in optoelectronics, high-power electronics, and transparent conductive oxides.
|
| 85 |
+
For thermodynamic stability, the formation energy per atom of -0.73 eV indicates strong bonding and thermodynamic favorability. The material is stable according to the data, which means it can be synthesized and used under practical conditions without decomposition.
|
| 86 |
+
The mechanical properties show a high elastic modulus (448 GPa) and bulk modulus (220 GPa). This suggests the material would be very stiff and resistant to deformation when subjected to stress or pressure.
|
| 87 |
+
Regarding thermal behavior, the thermal conductivity of 490 W/m路K and expansion coefficient of 4.2e-6 K^-1 indicate excellent heat dissipation and dimensional stability under temperature changes. This would make it suitable for high-temperature applications like thermal management in electronics or coatings.
|
| 88 |
+
The magnetic properties are listed as non-magnetic, which means it won't interact significantly with magnetic fields. This would affect applications in magneto-electronics but is generally beneficial for general semiconductor uses.
|
| 89 |
+
Additionally, the material is marked as not superconducting, which is significant because it rules out cryogenic or quantum computing applications where superconductivity is required.
|
| 90 |
+
Considering all these properties together, this material would be particularly valuable for high-power electronics, optoelectronics, and thermal management due to its combination of wide bandgap, high thermal conductivity, and mechanical robustness. It might compete with other materials like GaN or AlN but offers advantages in cost (Si-based) and thermal stability.
|
| 91 |
+
</think>
|
| 92 |
+
```
|
| 93 |
+
|
| 94 |
+
### The Structured Materials Analysis
|
| 95 |
+
```markdown
|
| 96 |
+
<answer>
|
| 97 |
+
**Summary of SiC (ID: mp-8062):**
|
| 98 |
+
1. **Material Type & Composition**
|
| 99 |
+
- Silicon carbide (SiC), a widely studied wide-bandgap semiconductor compound.
|
| 100 |
+
2. **Crystal Structure**
|
| 101 |
+
- Hexagonal structure with space group P63mc, indicating anisotropic properties and high symmetry.
|
| 102 |
+
3. **Electronic Properties**
|
| 103 |
+
- Indirect bandgap of 3.26 eV, suitable for optoelectronics and high-power devices.
|
| 104 |
+
4. **Thermodynamic Stability**
|
| 105 |
+
- Highly stable (formation energy: -0.73 eV/atom), making it practical for synthesis and use.
|
| 106 |
+
5. **Mechanical Properties**
|
| 107 |
+
- Exceptional stiffness (elastic modulus: 448 GPa) and resistance to compression (bulk modulus: 220 GPa).
|
| 108 |
+
6. **Thermal Properties**
|
| 109 |
+
- Outstanding thermal conductivity (490 W/m路K) and low expansion coefficient (4.2e-6 K^-1), ideal for heat dissipation.
|
| 110 |
+
7. **Magnetic & Conductive Properties**
|
| 111 |
+
- Non-magnetic and non-superconducting, limiting use in magnetic applications but beneficial for general electronics.
|
| 112 |
+
8. **Applications**
|
| 113 |
+
- High-power electronics, optoelectronics, thermal management systems, and abrasion-resistant coatings.
|
| 114 |
+
**Verdict**: SiC is a high-performance material with exceptional thermal, mechanical, and electronic properties, making it ideal for demanding applications like power devices and high-temperature environments. Its stability and robustness give it an edge over competing wide-bandgap materials.
|
| 115 |
+
</answer>
|
| 116 |
+
```
|
| 117 |
+
|
| 118 |
+
## What's Included
|
| 119 |
+
|
| 120 |
+
This repository contains everything you need to use and understand MaterialsAnalyst-AI-7B:
|
| 121 |
+
|
| 122 |
+
* **Model_Weights/** - All model weights in various formats
|
| 123 |
+
* **llama.cpp/** - LLaMA.cpp compatible weights with various quantization options available
|
| 124 |
+
* **safetensors/** - SafeTensors format models
|
| 125 |
+
* **LoRA_adapter/** - LoRA adapter weights
|
| 126 |
+
* **Scripts/** - Ready-to-use inference scripts
|
| 127 |
+
* **Inference_llama.cpp.py** - For LLaMA.cpp deployment
|
| 128 |
+
* **Inference_safetensors.py** - For SafeTensors deployment
|
| 129 |
+
* **Data/** - Training data
|
| 130 |
+
* **Train-Ready.jsonl** - Complete JSONL training dataset
|
| 131 |
+
* **Training/** - Training terminal logs
|
| 132 |
+
* **Training_Logs.txt** - Complete terminal logs from the training process
|
| 133 |
+
|
| 134 |
+
## Model Training Details
|
| 135 |
+
|
| 136 |
+
* **Base Model**: Qwen 2.5 Instruct 7B
|
| 137 |
+
* **Fine-tuning Method**: LoRA (Low-Rank Adaptation)
|
| 138 |
+
* **Training Infrastructure**: Single NVIDIA A100 GPU
|
| 139 |
+
* **Training Duration**: Around 5.4 hours
|
| 140 |
+
* **Training Dataset**: Custom curated dataset specifically for materials analysis
|
| 141 |
+
* **Total Token Count**: 6,441,671
|
| 142 |
+
* **Total Sample Count**: 6,000
|
| 143 |
+
* **Average Tokens Per Sample**: 1073.61
|
| 144 |
+
* **Dataset Creation**: Generated using DeepSeekV3 API
|
| 145 |
+
|
| 146 |
+
## Attribution
|
| 147 |
+
|
| 148 |
+
MaterialsAnalyst-AI-7B was developed by Raymond Lee. If you use this model in your work, please include a reference to this repository. As of **June 3, 2025**, this model has been downloaded **0** times. Thank you for your interest and support!
|
| 149 |
+
|
| 150 |
+
*Download statistics are manually updated as HuggingFace doesn't display this metric publicly. Visit this repository periodically for the latest metrics.*
|
| 151 |
+
|