black-yt commited on
Commit
9ea4e9b
·
1 Parent(s): 3a4740a

Update protocol-aligned terminology

Browse files
Files changed (1) hide show
  1. README.md +10 -10
README.md CHANGED
@@ -5,7 +5,7 @@ library_name: peft
5
  pipeline_tag: image-text-to-text
6
  tags:
7
  - laboratory
8
- - protocol-conditioned-action-prediction
9
  - lora
10
  - qwen
11
  - long-horizon-planning
@@ -30,7 +30,7 @@ tags:
30
  [![Data L2 Protocol](https://img.shields.io/badge/%F0%9F%A4%97%20Data-L2%20Protocol-purple)](https://huggingface.co/datasets/Stanford-CongLab/LabHorizon-Protocol-Conditioned-Planning) 
31
  [![Model](https://img.shields.io/badge/%F0%9F%A4%97%20Model-Qwen3.6-orange)](https://huggingface.co/Stanford-CongLab/LabHorizon-Model)
32
 
33
- **Qwen3.6-35B-A3B LoRA for protocol-conditioned laboratory action prediction**
34
 
35
  [Overview](#-overview) | [News](#-news) | [Highlights](#-highlights) | [Datasets](#-datasets) | [Evaluation](#-evaluation) | [Leaderboard](#-leaderboard) | [Training](#-training-result) | [Agent](#-actor-simulator-selector-agent) | [Quick Start](#-quick-start) | [Citation](#-citation)
36
 
@@ -44,7 +44,7 @@ tags:
44
 
45
  ## 🔎 Overview
46
 
47
- This repository releases the LabHorizon Qwen3.6 LoRA adapter trained from `Qwen/Qwen3.6-35B-A3B` on the 6,000-sample LabHorizon training split. The model is optimized for **Protocol-Conditioned Action Prediction**:
48
 
49
  - **Level 1:** connect multi-view laboratory assets and historical actions to the gold next action.
50
  - **Level 2:** produce a structured long-horizon experimental action sequence from context, constraints, available inputs, and an action pool.
@@ -62,7 +62,7 @@ This model repository is the model-side companion to the LabHorizon code and dat
62
  <tr>
63
  <td align="center" width="25%">🧪<br/><b>Qwen3.6 Adapter</b><br/><sub>LoRA weights for Qwen3.6-35B-A3B</sub></td>
64
  <td align="center" width="25%">🔬<br/><b>Level 1 Signal</b><br/><sub>Multi-view asset next-action prediction</sub></td>
65
- <td align="center" width="25%">🧭<br/><b>Level 2 Signal</b><br/><sub>Long-horizon protocol-conditioned planning</sub></td>
66
  <td align="center" width="25%">🧠<br/><b>Train + Agent</b><br/><sub>Supports trained and trained+agents settings</sub></td>
67
  </tr>
68
  </table>
@@ -74,7 +74,7 @@ The adapter is trained on the same public LabHorizon train split described by th
74
  | Level | Hugging Face Dataset | Input | Target | Metric |
75
  |:---|:---|:---|:---|:---|
76
  | **Level 1** | [LabHorizon-3D-Asset-Perception](https://huggingface.co/datasets/Stanford-CongLab/LabHorizon-3D-Asset-Perception) | Three asset views, historical actions, candidate next actions | Gold next action | Next-action accuracy |
77
- | **Level 2** | [LabHorizon-Protocol-Conditioned-Planning](https://huggingface.co/datasets/Stanford-CongLab/LabHorizon-Protocol-Conditioned-Planning) | Context, goal, constraints, available inputs, action pool | Gold experimental action sequence | L2 Action Sequence Similarity, L2 Parameter Accuracy |
78
 
79
  ## 📦 Model
80
 
@@ -86,8 +86,8 @@ The adapter is trained on the same public LabHorizon train split described by th
86
  | Adapter type | LoRA / PEFT adapter |
87
  | Training data | 6,000 LabHorizon train samples |
88
  | Level 1 training split | 3,000 multimodal laboratory 3D asset samples |
89
- | Level 2 training split | 3,000 text-only protocol-conditioned planning samples |
90
- | Main task | Protocol-conditioned laboratory action prediction |
91
  | Main metrics | Level 1 Next Action Accuracy; L2 Action Sequence Similarity and L2 Parameter Accuracy |
92
  | Intended loading mode | Load this adapter with the matching Qwen3.6-35B-A3B base model |
93
 
@@ -139,7 +139,7 @@ The tables below report direct-prompting baselines on the same test split used f
139
  | 13 | Qwen3.6 35B-A3B | 0.475 |
140
  | 14 | Gemini 3.1 Pro | 0.465 |
141
 
142
- ### 🧪 Level 2: Protocol-Conditioned Planning
143
 
144
  | Rank | Model | L2 Final Score | L2 Action Sequence Similarity | L2 Parameter Accuracy |
145
  |:---:|:---|---:|---:|---:|
@@ -168,7 +168,7 @@ The adapter is trained on the public LabHorizon training split:
168
  | Component | Size | Role |
169
  |:---|---:|:---|
170
  | Level 1 train | 3,000 | Multi-view laboratory asset perception and next-action prediction |
171
- | Level 2 train | 3,000 | Protocol-conditioned long-horizon experimental action-sequence planning |
172
  | Total train | 6,000 | Unified supervised fine-tuning data for laboratory action prediction |
173
 
174
  The training data are converted into Qwen chat format and then into the LLaMA-Factory ShareGPT-VL-style format. Level 1 keeps the three asset images and candidate next actions; Level 2 uses text-only context, constraints, available inputs, action pool, and gold experimental action sequence.
@@ -259,7 +259,7 @@ This adapter is intended for academic research on laboratory action prediction,
259
 
260
  Recommended use cases:
261
 
262
- - Evaluate protocol-conditioned next-action prediction and long-horizon planning.
263
  - Study how training data improves laboratory action prediction.
264
  - Use the adapter as the Actor in the Actor-Simulator-Selector framework.
265
  - Analyze remaining failures in action order, parameter copying, dependency tracking, and protocol-stage consistency.
 
5
  pipeline_tag: image-text-to-text
6
  tags:
7
  - laboratory
8
+ - protocol-aligned-action-prediction
9
  - lora
10
  - qwen
11
  - long-horizon-planning
 
30
  [![Data L2 Protocol](https://img.shields.io/badge/%F0%9F%A4%97%20Data-L2%20Protocol-purple)](https://huggingface.co/datasets/Stanford-CongLab/LabHorizon-Protocol-Conditioned-Planning)&nbsp;
31
  [![Model](https://img.shields.io/badge/%F0%9F%A4%97%20Model-Qwen3.6-orange)](https://huggingface.co/Stanford-CongLab/LabHorizon-Model)
32
 
33
+ **Qwen3.6-35B-A3B LoRA for protocol-aligned laboratory action prediction**
34
 
35
  [Overview](#-overview) | [News](#-news) | [Highlights](#-highlights) | [Datasets](#-datasets) | [Evaluation](#-evaluation) | [Leaderboard](#-leaderboard) | [Training](#-training-result) | [Agent](#-actor-simulator-selector-agent) | [Quick Start](#-quick-start) | [Citation](#-citation)
36
 
 
44
 
45
  ## 🔎 Overview
46
 
47
+ This repository releases the LabHorizon Qwen3.6 LoRA adapter trained from `Qwen/Qwen3.6-35B-A3B` on the 6,000-sample LabHorizon training split. The model is optimized for **Protocol-Aligned Action Prediction**:
48
 
49
  - **Level 1:** connect multi-view laboratory assets and historical actions to the gold next action.
50
  - **Level 2:** produce a structured long-horizon experimental action sequence from context, constraints, available inputs, and an action pool.
 
62
  <tr>
63
  <td align="center" width="25%">🧪<br/><b>Qwen3.6 Adapter</b><br/><sub>LoRA weights for Qwen3.6-35B-A3B</sub></td>
64
  <td align="center" width="25%">🔬<br/><b>Level 1 Signal</b><br/><sub>Multi-view asset next-action prediction</sub></td>
65
+ <td align="center" width="25%">🧭<br/><b>Level 2 Signal</b><br/><sub>Long-horizon protocol-aligned planning</sub></td>
66
  <td align="center" width="25%">🧠<br/><b>Train + Agent</b><br/><sub>Supports trained and trained+agents settings</sub></td>
67
  </tr>
68
  </table>
 
74
  | Level | Hugging Face Dataset | Input | Target | Metric |
75
  |:---|:---|:---|:---|:---|
76
  | **Level 1** | [LabHorizon-3D-Asset-Perception](https://huggingface.co/datasets/Stanford-CongLab/LabHorizon-3D-Asset-Perception) | Three asset views, historical actions, candidate next actions | Gold next action | Next-action accuracy |
77
+ | **Level 2** | [LabHorizon Protocol-Aligned Planning](https://huggingface.co/datasets/Stanford-CongLab/LabHorizon-Protocol-Conditioned-Planning) | Context, goal, constraints, available inputs, action pool | Gold experimental action sequence | L2 Action Sequence Similarity, L2 Parameter Accuracy |
78
 
79
  ## 📦 Model
80
 
 
86
  | Adapter type | LoRA / PEFT adapter |
87
  | Training data | 6,000 LabHorizon train samples |
88
  | Level 1 training split | 3,000 multimodal laboratory 3D asset samples |
89
+ | Level 2 training split | 3,000 text-only protocol-aligned planning samples |
90
+ | Main task | Protocol-aligned laboratory action prediction |
91
  | Main metrics | Level 1 Next Action Accuracy; L2 Action Sequence Similarity and L2 Parameter Accuracy |
92
  | Intended loading mode | Load this adapter with the matching Qwen3.6-35B-A3B base model |
93
 
 
139
  | 13 | Qwen3.6 35B-A3B | 0.475 |
140
  | 14 | Gemini 3.1 Pro | 0.465 |
141
 
142
+ ### 🧪 Level 2: Protocol-Aligned Planning
143
 
144
  | Rank | Model | L2 Final Score | L2 Action Sequence Similarity | L2 Parameter Accuracy |
145
  |:---:|:---|---:|---:|---:|
 
168
  | Component | Size | Role |
169
  |:---|---:|:---|
170
  | Level 1 train | 3,000 | Multi-view laboratory asset perception and next-action prediction |
171
+ | Level 2 train | 3,000 | Protocol-aligned long-horizon experimental action-sequence planning |
172
  | Total train | 6,000 | Unified supervised fine-tuning data for laboratory action prediction |
173
 
174
  The training data are converted into Qwen chat format and then into the LLaMA-Factory ShareGPT-VL-style format. Level 1 keeps the three asset images and candidate next actions; Level 2 uses text-only context, constraints, available inputs, action pool, and gold experimental action sequence.
 
259
 
260
  Recommended use cases:
261
 
262
+ - Evaluate protocol-aligned next-action prediction and long-horizon planning.
263
  - Study how training data improves laboratory action prediction.
264
  - Use the adapter as the Actor in the Actor-Simulator-Selector framework.
265
  - Analyze remaining failures in action order, parameter copying, dependency tracking, and protocol-stage consistency.