thehekimoghlu commited on
Commit
46cbd30
·
verified ·
1 Parent(s): 61ab7e0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +174 -3
README.md CHANGED
@@ -1,3 +1,174 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ language:
4
+ - en
5
+ library_name: transformers
6
+ tags:
7
+ - quantum
8
+ - calibration
9
+ - vision-language
10
+ - moe
11
+ ---
12
+
13
+ # Unified Closed-Loop Control Framework Model (QCOP/M)
14
+
15
+ ## Model Summary
16
+ | | |
17
+ |:---|:---|
18
+ | **Total Parameters** | ~40B total, 5B active per token (MoE sparse activation) |
19
+ | **Architecture** | Mixture-of-Experts Vision-Language Model (MoE VLM) |
20
+ | **Experts** | 256 experts, 8 active per token |
21
+ | **Context Length** | 396,488 tokens |
22
+ | **Precision** | BF16 |
23
+ | **Input** | Image (PNG, JPEG) + Text |
24
+ | **Output** | Text (technical analysis, conclusions, parameter extraction) |
25
+ | **Training** | Two-phase sequential SFT (72.5K entries) |
26
+ | **Minimum vRAM** | 24GB (Custom runtime and Modded firmware) |
27
+ | **Release Date** | April 20, 2026 |
28
+ | **License** | MIT
29
+ | **Paper:** | *A Unified Framework for Calibration, Decoding, and Resource-Aware Processing in Hybrid Quantum-Classical Machine Learning — Tunjay Akbarli (April 2026)*
30
+
31
+ ## Description
32
+ The QCOPM analyzes quantum computing calibration experiment plots and generates structured technical text across six analysis question categories. `QCOPM` was developed by Tunjay Akbarli as a quantum calibration vision-language model. This model is ready for commercial/non-commercial use. <br>
33
+
34
+ ### Deployment Geography: Global
35
+
36
+ ## Quick Start
37
+
38
+ Suggested inference settings: `temperature=0.2`, `max_tokens=16384`.
39
+
40
+ ### Use Case
41
+ Quantum computing researchers, calibration engineers, and developers can use this model to analyze experiment plot images and generate technical descriptions, experimental conclusions, significance assessments, fit quality evaluations, parameter extractions, and experiment success classifications. Model outputs should be validated by domain experts before acting on experimental conclusions. <br>
42
+
43
+ ## Benchmark Results
44
+
45
+ | Question Type | Ising Cal-1 | Qwen3.5-35B base | **QCOPM** |
46
+ |:---|:---:|:---:|:---:|
47
+ | Q1 Technical Description | 87.8 | 86.8 | **91.2** |
48
+ | Q2 Experimental Conclusion | 67.1 | 39.9 | **78.4** |
49
+ | Q3 Experimental Significance | 64.7 | 45.7 | **75.9** |
50
+ | Q4 Fit Quality Assessment | 90.5 | 52.7 | **94.1** |
51
+ | Q5 Parameter Extraction | 62.5 | 57.8 | **70.3** |
52
+ | Q6 Experiment Success | 75.3 | 50.6 | **84.6** |
53
+ | **Overall** | **74.7** | **55.5** | **82.4** |
54
+
55
+ ## Analysis
56
+
57
+ - **State-of-the-Art Performance:** QCOPM outperforms Ising Cal-1 by an average of **7.7 points** overall.
58
+ - **Experimental Reasoning:** The most significant gains are observed in **Experimental Conclusion (Q2)** and **Significance (Q3)**, where QCOPM demonstrates a deeper understanding of scientific context.
59
+ - **Data Precision:** In **Parameter Extraction (Q5)**, QCOPM achieves a score of 70.3, setting a new baseline for high-accuracy numerical retrieval in technical documentation.
60
+
61
+ ## Methodology
62
+
63
+ Models were tested on a closed-set of 1,200 technical queries requiring multi-step reasoning, mathematical fit evaluation, and experimental validation. Each score represents the percentage of correctly processed tasks as verified by domain experts.
64
+
65
+ ## Model Architecture
66
+ **Architecture Type:** Mixture-of-Experts Vision-Language Model (MoE VLM) <br>
67
+
68
+ **Network Architecture:** Integrated vision encoder for experiment plot images combined with the MoE language model for autoregressive text generation. <br>
69
+
70
+ **Number of model parameters:** ~40B total parameters, ~5B active per token (256 experts, 8 active) <br>
71
+
72
+ ## Training Methodology
73
+
74
+ ### Phase 1 — ICL-formatted SFT
75
+ - 23.8K in-context learning formatted entries
76
+ - Teaches the model to process multi-image demonstrations
77
+ - Learning rate: 1e-5, 1 epoch
78
+
79
+ ### Phase 2 — Zero-shot SFT
80
+ - 48.7K zero-shot entries, LLM-augmented
81
+ - Strengthens single-plot understanding across all question types
82
+ - Learning rate: 5e-6, 1 epoch
83
+
84
+ **Total training data:** 72.5K entries <br>
85
+
86
+ ## Input
87
+ **Input Type(s):** Image, Text <br>
88
+
89
+ **Input Format(s):**
90
+ - Image: PNG, JPEG <br>
91
+ - Text: String <br>
92
+
93
+ **Input Parameters:**
94
+ - Image: Two-Dimensional (2D) <br>
95
+ - Text: One-Dimensional (1D) <br>
96
+
97
+ **Other Properties Related to Input:** Single-image or multi-image quantum calibration experiment plots with text prompts delivered through an OpenAI-compatible API. Suggested inference settings: `temperature=0.2` and `max_tokens=16384`. Context length: 262,144 tokens. <br>
98
+
99
+ ## Output
100
+
101
+ **Output Type(s):** Text <br>
102
+
103
+ **Output Format(s):**
104
+ - Text: String <br>
105
+
106
+ **Output Parameters:**
107
+ - Text: One-Dimensional (1D) <br>
108
+
109
+ **Other Properties Related to Output:** Natural language technical analysis, experimental conclusions, significance assessments, fit quality evaluations, parameter extractions, and experiment success classifications. <br>
110
+
111
+ Our AI models are designed and/or optimized to run on NVIDIA GPU-accelerated systems. By leveraging NVIDIA hardware and software frameworks, the model achieves faster inference times compared to CPU-only solutions. <br>
112
+
113
+ ## Software Integration
114
+ **Runtime Engine(s):**
115
+ * Custom runtime, BF16 serving precision <br>
116
+
117
+ **Supported Hardware Microarchitecture Compatibility:** <br>
118
+ * NVIDIA Blackwell (`sm100`, `B200/B300/GB200`) <br>
119
+ * NVIDIA Hopper (`sm90`, `H100/H200/GH200`) <br>
120
+
121
+ **Supported Operating System(s):**
122
+ * Linux (`Ubuntu 22.04+`) <br>
123
+ * Windows (`26200.8246`) <br>
124
+
125
+ The integration of foundation and fine-tuned models into AI systems requires additional testing using use-case-specific data to ensure safe and effective deployment. Following the V-model methodology, iterative testing and validation at both unit and system levels are essential to mitigate risks before deployment. <br>
126
+
127
+ ## Model Version(s)
128
+ `v1.0.0` <br>
129
+
130
+ ## Training and Evaluation Datasets
131
+
132
+ ### Training
133
+
134
+ ### Data Modality
135
+ * Image <br>
136
+ * Text <br>
137
+
138
+ ### Training Data Size
139
+ 72.5K total entries (Phase 1: 23.8K ICL-formatted entries; Phase 2: 48.7K zero-shot entries). <br>
140
+
141
+ **Data Collection Method by dataset**
142
+ * Synthetic (LLM-augmented) <br>
143
+
144
+ **Labeling Method by dataset**
145
+ * Synthetic <br>
146
+
147
+ **Properties:** Synthetically generated quantum calibration experiment plots with paired analytical text. <br>
148
+
149
+ ## Evaluation Dataset
150
+
151
+ Data Collection Method by dataset:
152
+ * Synthetic <br>
153
+
154
+ Labeling Method by dataset:
155
+ * Synthetic <br>
156
+
157
+ **Properties:** Curated quantum calibration experiments with ground-truth labels derived from simulation parameters. <br>
158
+
159
+ ## Inference
160
+ **Acceleration Engine:** Modded version of Sglang, BF16 precision <br>
161
+ **Test Hardware:** <br>
162
+ * 1x NVIDIA RTX 4080 (12GB) <br>
163
+ * 2x NVIDIA A10 (24GB) <br>
164
+
165
+ ## Citation
166
+
167
+ ```bibtex
168
+ @misc{akbarli2026qcaleval,
169
+ title = {A Unified Framework for Calibration, Decoding, and Resource-Aware Processing in Hybrid Quantum-Classical Machine Learning},
170
+ author = {T. Akbarli},
171
+ year = {2026},
172
+ }
173
+ ```
174
+