Image-Text-to-Text
PEFT
Safetensors
laboratory
protocol-aligned-action-prediction
lora
qwen
long-horizon-planning
conversational
Instructions to use Stanford-CongLab/LabHorizon-Model with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use Stanford-CongLab/LabHorizon-Model with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3.6-35B-A3B") model = PeftModel.from_pretrained(base_model, "Stanford-CongLab/LabHorizon-Model") - Notebooks
- Google Colab
- Kaggle
Update protocol-aligned terminology
Browse files
README.md
CHANGED
|
@@ -5,7 +5,7 @@ library_name: peft
|
|
| 5 |
pipeline_tag: image-text-to-text
|
| 6 |
tags:
|
| 7 |
- laboratory
|
| 8 |
-
- protocol-
|
| 9 |
- lora
|
| 10 |
- qwen
|
| 11 |
- long-horizon-planning
|
|
@@ -30,7 +30,7 @@ tags:
|
|
| 30 |
[](https://huggingface.co/datasets/Stanford-CongLab/LabHorizon-Protocol-Conditioned-Planning)
|
| 31 |
[](https://huggingface.co/Stanford-CongLab/LabHorizon-Model)
|
| 32 |
|
| 33 |
-
**Qwen3.6-35B-A3B LoRA for protocol-
|
| 34 |
|
| 35 |
[Overview](#-overview) | [News](#-news) | [Highlights](#-highlights) | [Datasets](#-datasets) | [Evaluation](#-evaluation) | [Leaderboard](#-leaderboard) | [Training](#-training-result) | [Agent](#-actor-simulator-selector-agent) | [Quick Start](#-quick-start) | [Citation](#-citation)
|
| 36 |
|
|
@@ -44,7 +44,7 @@ tags:
|
|
| 44 |
|
| 45 |
## 🔎 Overview
|
| 46 |
|
| 47 |
-
This repository releases the LabHorizon Qwen3.6 LoRA adapter trained from `Qwen/Qwen3.6-35B-A3B` on the 6,000-sample LabHorizon training split. The model is optimized for **Protocol-
|
| 48 |
|
| 49 |
- **Level 1:** connect multi-view laboratory assets and historical actions to the gold next action.
|
| 50 |
- **Level 2:** produce a structured long-horizon experimental action sequence from context, constraints, available inputs, and an action pool.
|
|
@@ -62,7 +62,7 @@ This model repository is the model-side companion to the LabHorizon code and dat
|
|
| 62 |
<tr>
|
| 63 |
<td align="center" width="25%">🧪<br/><b>Qwen3.6 Adapter</b><br/><sub>LoRA weights for Qwen3.6-35B-A3B</sub></td>
|
| 64 |
<td align="center" width="25%">🔬<br/><b>Level 1 Signal</b><br/><sub>Multi-view asset next-action prediction</sub></td>
|
| 65 |
-
<td align="center" width="25%">ðŸ§<br/><b>Level 2 Signal</b><br/><sub>Long-horizon protocol-
|
| 66 |
<td align="center" width="25%">🧠<br/><b>Train + Agent</b><br/><sub>Supports trained and trained+agents settings</sub></td>
|
| 67 |
</tr>
|
| 68 |
</table>
|
|
@@ -74,7 +74,7 @@ The adapter is trained on the same public LabHorizon train split described by th
|
|
| 74 |
| Level | Hugging Face Dataset | Input | Target | Metric |
|
| 75 |
|:---|:---|:---|:---|:---|
|
| 76 |
| **Level 1** | [LabHorizon-3D-Asset-Perception](https://huggingface.co/datasets/Stanford-CongLab/LabHorizon-3D-Asset-Perception) | Three asset views, historical actions, candidate next actions | Gold next action | Next-action accuracy |
|
| 77 |
-
| **Level 2** | [LabHorizon
|
| 78 |
|
| 79 |
## 📦 Model
|
| 80 |
|
|
@@ -86,8 +86,8 @@ The adapter is trained on the same public LabHorizon train split described by th
|
|
| 86 |
| Adapter type | LoRA / PEFT adapter |
|
| 87 |
| Training data | 6,000 LabHorizon train samples |
|
| 88 |
| Level 1 training split | 3,000 multimodal laboratory 3D asset samples |
|
| 89 |
-
| Level 2 training split | 3,000 text-only protocol-
|
| 90 |
-
| Main task | Protocol-
|
| 91 |
| Main metrics | Level 1 Next Action Accuracy; L2 Action Sequence Similarity and L2 Parameter Accuracy |
|
| 92 |
| Intended loading mode | Load this adapter with the matching Qwen3.6-35B-A3B base model |
|
| 93 |
|
|
@@ -139,7 +139,7 @@ The tables below report direct-prompting baselines on the same test split used f
|
|
| 139 |
| 13 | Qwen3.6 35B-A3B | 0.475 |
|
| 140 |
| 14 | Gemini 3.1 Pro | 0.465 |
|
| 141 |
|
| 142 |
-
### 🧪 Level 2: Protocol-
|
| 143 |
|
| 144 |
| Rank | Model | L2 Final Score | L2 Action Sequence Similarity | L2 Parameter Accuracy |
|
| 145 |
|:---:|:---|---:|---:|---:|
|
|
@@ -168,7 +168,7 @@ The adapter is trained on the public LabHorizon training split:
|
|
| 168 |
| Component | Size | Role |
|
| 169 |
|:---|---:|:---|
|
| 170 |
| Level 1 train | 3,000 | Multi-view laboratory asset perception and next-action prediction |
|
| 171 |
-
| Level 2 train | 3,000 | Protocol-
|
| 172 |
| Total train | 6,000 | Unified supervised fine-tuning data for laboratory action prediction |
|
| 173 |
|
| 174 |
The training data are converted into Qwen chat format and then into the LLaMA-Factory ShareGPT-VL-style format. Level 1 keeps the three asset images and candidate next actions; Level 2 uses text-only context, constraints, available inputs, action pool, and gold experimental action sequence.
|
|
@@ -259,7 +259,7 @@ This adapter is intended for academic research on laboratory action prediction,
|
|
| 259 |
|
| 260 |
Recommended use cases:
|
| 261 |
|
| 262 |
-
- Evaluate protocol-
|
| 263 |
- Study how training data improves laboratory action prediction.
|
| 264 |
- Use the adapter as the Actor in the Actor-Simulator-Selector framework.
|
| 265 |
- Analyze remaining failures in action order, parameter copying, dependency tracking, and protocol-stage consistency.
|
|
|
|
| 5 |
pipeline_tag: image-text-to-text
|
| 6 |
tags:
|
| 7 |
- laboratory
|
| 8 |
+
- protocol-aligned-action-prediction
|
| 9 |
- lora
|
| 10 |
- qwen
|
| 11 |
- long-horizon-planning
|
|
|
|
| 30 |
[](https://huggingface.co/datasets/Stanford-CongLab/LabHorizon-Protocol-Conditioned-Planning)
|
| 31 |
[](https://huggingface.co/Stanford-CongLab/LabHorizon-Model)
|
| 32 |
|
| 33 |
+
**Qwen3.6-35B-A3B LoRA for protocol-aligned laboratory action prediction**
|
| 34 |
|
| 35 |
[Overview](#-overview) | [News](#-news) | [Highlights](#-highlights) | [Datasets](#-datasets) | [Evaluation](#-evaluation) | [Leaderboard](#-leaderboard) | [Training](#-training-result) | [Agent](#-actor-simulator-selector-agent) | [Quick Start](#-quick-start) | [Citation](#-citation)
|
| 36 |
|
|
|
|
| 44 |
|
| 45 |
## 🔎 Overview
|
| 46 |
|
| 47 |
+
This repository releases the LabHorizon Qwen3.6 LoRA adapter trained from `Qwen/Qwen3.6-35B-A3B` on the 6,000-sample LabHorizon training split. The model is optimized for **Protocol-Aligned Action Prediction**:
|
| 48 |
|
| 49 |
- **Level 1:** connect multi-view laboratory assets and historical actions to the gold next action.
|
| 50 |
- **Level 2:** produce a structured long-horizon experimental action sequence from context, constraints, available inputs, and an action pool.
|
|
|
|
| 62 |
<tr>
|
| 63 |
<td align="center" width="25%">🧪<br/><b>Qwen3.6 Adapter</b><br/><sub>LoRA weights for Qwen3.6-35B-A3B</sub></td>
|
| 64 |
<td align="center" width="25%">🔬<br/><b>Level 1 Signal</b><br/><sub>Multi-view asset next-action prediction</sub></td>
|
| 65 |
+
<td align="center" width="25%">ðŸ§<br/><b>Level 2 Signal</b><br/><sub>Long-horizon protocol-aligned planning</sub></td>
|
| 66 |
<td align="center" width="25%">🧠<br/><b>Train + Agent</b><br/><sub>Supports trained and trained+agents settings</sub></td>
|
| 67 |
</tr>
|
| 68 |
</table>
|
|
|
|
| 74 |
| Level | Hugging Face Dataset | Input | Target | Metric |
|
| 75 |
|:---|:---|:---|:---|:---|
|
| 76 |
| **Level 1** | [LabHorizon-3D-Asset-Perception](https://huggingface.co/datasets/Stanford-CongLab/LabHorizon-3D-Asset-Perception) | Three asset views, historical actions, candidate next actions | Gold next action | Next-action accuracy |
|
| 77 |
+
| **Level 2** | [LabHorizon Protocol-Aligned Planning](https://huggingface.co/datasets/Stanford-CongLab/LabHorizon-Protocol-Conditioned-Planning) | Context, goal, constraints, available inputs, action pool | Gold experimental action sequence | L2 Action Sequence Similarity, L2 Parameter Accuracy |
|
| 78 |
|
| 79 |
## 📦 Model
|
| 80 |
|
|
|
|
| 86 |
| Adapter type | LoRA / PEFT adapter |
|
| 87 |
| Training data | 6,000 LabHorizon train samples |
|
| 88 |
| Level 1 training split | 3,000 multimodal laboratory 3D asset samples |
|
| 89 |
+
| Level 2 training split | 3,000 text-only protocol-aligned planning samples |
|
| 90 |
+
| Main task | Protocol-aligned laboratory action prediction |
|
| 91 |
| Main metrics | Level 1 Next Action Accuracy; L2 Action Sequence Similarity and L2 Parameter Accuracy |
|
| 92 |
| Intended loading mode | Load this adapter with the matching Qwen3.6-35B-A3B base model |
|
| 93 |
|
|
|
|
| 139 |
| 13 | Qwen3.6 35B-A3B | 0.475 |
|
| 140 |
| 14 | Gemini 3.1 Pro | 0.465 |
|
| 141 |
|
| 142 |
+
### 🧪 Level 2: Protocol-Aligned Planning
|
| 143 |
|
| 144 |
| Rank | Model | L2 Final Score | L2 Action Sequence Similarity | L2 Parameter Accuracy |
|
| 145 |
|:---:|:---|---:|---:|---:|
|
|
|
|
| 168 |
| Component | Size | Role |
|
| 169 |
|:---|---:|:---|
|
| 170 |
| Level 1 train | 3,000 | Multi-view laboratory asset perception and next-action prediction |
|
| 171 |
+
| Level 2 train | 3,000 | Protocol-aligned long-horizon experimental action-sequence planning |
|
| 172 |
| Total train | 6,000 | Unified supervised fine-tuning data for laboratory action prediction |
|
| 173 |
|
| 174 |
The training data are converted into Qwen chat format and then into the LLaMA-Factory ShareGPT-VL-style format. Level 1 keeps the three asset images and candidate next actions; Level 2 uses text-only context, constraints, available inputs, action pool, and gold experimental action sequence.
|
|
|
|
| 259 |
|
| 260 |
Recommended use cases:
|
| 261 |
|
| 262 |
+
- Evaluate protocol-aligned next-action prediction and long-horizon planning.
|
| 263 |
- Study how training data improves laboratory action prediction.
|
| 264 |
- Use the adapter as the Actor in the Actor-Simulator-Selector framework.
|
| 265 |
- Analyze remaining failures in action order, parameter copying, dependency tracking, and protocol-stage consistency.
|