MohamedBouadi commited on
Commit
e409719
·
verified ·
1 Parent(s): ef252b7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +433 -3
README.md CHANGED
@@ -1,3 +1,433 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ ---
4
+ <div align="center">
5
+ <img src="logo.png" alt="Orion-MSP Logo" width="700"/>
6
+ </div>
7
+
8
+ <div align="center">
9
+ <a href="https://www.aryaxai.com/">
10
+ <img src="https://img.shields.io/badge/Lexsi-Homepage-FF6B6B?style=for-the-badge" alt="Homepage"/>
11
+ </a>
12
+ <a href="https://discord.gg/dSB62Q7A">
13
+ <img src="https://img.shields.io/badge/Discord-Join-5865F2?style=for-the-badge&logo=discord&logoColor=white" alt="Discord"/>
14
+ </a>
15
+ <a href="https://github.com/Lexsi-Labs/Orion-MSP">
16
+ <img src="https://img.shields.io/badge/GitHub-Repository-181717?style=for-the-badge&logo=github&logoColor=white" alt="GitHub"/>
17
+ </a>
18
+ </div>
19
+
20
+
21
+ # Orion-MSP: Multi-Scale Sparse Attention for Tabular In-Context Learning
22
+
23
+ Orion-MSP is a tabular foundation model for in-context learning. It uses multi-scale sparse attention and Perceiver-style memory to process tabular data at multiple granularities, capturing both local feature interactions and global dataset-level patterns.
24
+
25
+ ## Key Features
26
+
27
+ - **Multi-Scale Sparse Attention:** Processes features at three levels (scales 1, 4, 16) using windowed, global, and random attention patterns, reducing quadratic complexity to near-linear.
28
+ - **Hierarchical Feature Understanding:** Captures patterns from individual cells to feature groups through scale-aware attention.
29
+ - **Perceiver-Style Memory:** Cross-component memory that compresses dataset information for efficient processing across samples
30
+ - **Memory-Efficient:** Block-sparse masking enables efficient processing of large tabular datasets
31
+ - **Scikit-learn Compatible:** Drop-in replacement with .fit() and .predict() methods
32
+
33
+
34
+ ## Architecture
35
+ Orion-MSP consists of four main components:
36
+ - **Column-wise Embedding:** Distribution-aware feature embeddings using Induced Set Attention Blocks (ISAB)
37
+ - **Multi-Scale Row Interaction:** Sparse attention with windowed, global, and random patterns across multiple scales
38
+ - **Cross-Component Memory:** Perceiver-style memory for efficient dataset-level context
39
+ - **Dataset-wise ICL:** Enhanced predictor leveraging enriched representations for few-shot tabular classification
40
+
41
+ ## Performance
42
+
43
+ <!-- Orion-MSP: Benchmark Summary Table -->
44
+ <style>
45
+ /* Container adds horizontal scroll on small screens */
46
+ .hf-table-wrap { overflow-x: auto; margin: 1rem 0; }
47
+ table.bench { border-collapse: collapse; width: 100%; font-size: 0.9rem; }
48
+ table.bench caption { font-weight: 600; text-align: left; margin-bottom: .5rem; }
49
+ table.bench th, table.bench td { border-bottom: 1px solid #e5e7eb; padding: 6px 8px; }
50
+ table.bench thead th { border-bottom: 2px solid #d1d5db; background: #fafafa; }
51
+ table.bench th.sticky { position: sticky; top: 0; z-index: 1; }
52
+ /* Alignment */
53
+ table.bench th, table.bench td { text-align: center; }
54
+ table.bench th:first-child, table.bench td:first-child { text-align: left; white-space: nowrap; }
55
+ /* Emphasis helpers */
56
+ .first { font-weight: 700; } /* 1st place (bold) */
57
+ .second { text-decoration: underline; } /* 2nd place (underline) */
58
+ .first.second { font-weight: 700; text-decoration: underline; } /* bold+underline */
59
+ </style>
60
+
61
+ <div class="hf-table-wrap">
62
+ <table class="bench">
63
+ <caption>Performance comparison across three benchmark suites—TALENT, OpenML-CC18, and TabZilla. Ranks are mean ranks based on accuracy (lower is better). Metrics: ACC = Accuracy, F1 = Weighted F1. <span class="first">1st</span>; <span class="second">2nd</span>.</caption>
64
+ <thead>
65
+ <tr>
66
+ <th class="sticky" rowspan="2">Models</th>
67
+ <th class="sticky" colspan="1">All</th>
68
+ <th class="sticky" colspan="3">TALENT</th>
69
+ <th class="sticky" colspan="3">OpenML-CC18</th>
70
+ <th class="sticky" colspan="3">TabZilla</th>
71
+ </tr>
72
+ <tr>
73
+ <th class="sticky">Rank</th>
74
+ <th class="sticky">Rank</th><th class="sticky">ACC</th><th class="sticky">F1</th>
75
+ <th class="sticky">Rank</th><th class="sticky">ACC</th><th class="sticky">F1</th>
76
+ <th class="sticky">Rank</th><th class="sticky">ACC</th><th class="sticky">F1</th>
77
+ </tr>
78
+ </thead>
79
+ <tbody>
80
+ <tr>
81
+ <td>XGBoost</td>
82
+ <td>6.70</td>
83
+ <td>6.02</td><td>0.8403</td><td>0.8360</td>
84
+ <td>5.89</td><td>0.8558</td><td>0.8537</td>
85
+ <td>6.07</td><td>0.8612</td><td>0.8326</td>
86
+ </tr>
87
+ <tr>
88
+ <td>CatBoost</td>
89
+ <td>6.43</td>
90
+ <td>5.57</td><td>0.8336</td><td>0.8259</td>
91
+ <td>6.25</td><td>0.8588</td><td>0.8520</td>
92
+ <td>7.13</td><td>0.8579</td><td>0.8384</td>
93
+ </tr>
94
+ <tr>
95
+ <td>Random Forest</td>
96
+ <td>7.38</td>
97
+ <td>6.15</td><td>0.8285</td><td>0.8209</td>
98
+ <td>6.36</td><td>0.8547</td><td>0.8497</td>
99
+ <td>8.42</td><td>0.8358</td><td>0.8399</td>
100
+ </tr>
101
+ <tr>
102
+ <td>LightGBM</td>
103
+ <td>6.78</td>
104
+ <td>6.11</td><td>0.8331</td><td>0.8245</td>
105
+ <td>6.18</td><td>0.8581</td><td>0.8493</td>
106
+ <td>5.25</td><td>0.8618</td><td>0.8211</td>
107
+ </tr>
108
+ <tr>
109
+ <td>TabICL</td>
110
+ <td>4.96</td>
111
+ <td>4.09</td><td class="second">0.8471</td><td class="second">0.8379</td>
112
+ <td>4.69</td><td>0.8667</td><td>0.8623</td>
113
+ <td>5.89</td><td>0.8734</td><td>0.8698</td>
114
+ </tr>
115
+ <tr>
116
+ <td>OrionBiX</td>
117
+ <td>5.37</td>
118
+ <td>4.59</td><td>0.8346</td><td>0.8260</td>
119
+ <td>4.98</td><td>0.8653</td><td>0.8596</td>
120
+ <td>4.89</td><td>0.8728</td><td>0.8628</td>
121
+ </tr>
122
+ <tr>
123
+ <td><span class="first second">OrionMSP</span></td>
124
+ <td>3.58</td>
125
+ <td class="first second">3.26</td><td>0.8461</td><td>0.8360</td>
126
+ <td class="first second">4.12</td><td class="first second">0.8722</td><td class="first second">0.8676</td>
127
+ <td class="first second">3.84</td><td class="first second">0.8821</td><td class="first second">0.8786</td>
128
+ </tr>
129
+ <tr>
130
+ <td><span class="second">TabPFN</span></td>
131
+ <td>4.61</td>
132
+ <td class="second">3.72</td><td class="first second">0.8514</td><td class="first second">0.8412</td>
133
+ <td>4.76</td><td class="second">0.8714</td><td class="second">0.8663</td>
134
+ <td>4.86</td><td>0.8752</td><td>0.8716</td>
135
+ </tr>
136
+ <tr>
137
+ <td>Mitra</td>
138
+ <td>11.77</td>
139
+ <td>10.38</td><td>0.3921</td><td>0.2868</td>
140
+ <td>10.52</td><td>0.3614</td><td>0.2522</td>
141
+ <td>11.21</td><td>0.3152</td><td>0.1830</td>
142
+ </tr>
143
+ <tr>
144
+ <td>ContextTab</td>
145
+ <td>9.70</td>
146
+ <td>9.84</td><td>0.5474</td><td>0.4596</td>
147
+ <td>6.28</td><td>0.8639</td><td>0.8581</td>
148
+ <td>7.13</td><td>0.8389</td><td>0.8334</td>
149
+ </tr>
150
+ <tr>
151
+ <td>TabDPT</td>
152
+ <td>5.42</td>
153
+ <td>5.19</td><td>0.8408</td><td>0.8318</td>
154
+ <td class="second">4.64</td><td>0.8672</td><td>0.8625</td>
155
+ <td class="second">3.94</td><td class="second">0.8814</td><td class="second">0.8775</td>
156
+ </tr>
157
+ </tbody>
158
+ </table>
159
+ </div>
160
+
161
+ Orion-MSP is the most consistent top performer across all three benchmarks, achieving the best overall rank.
162
+ - On TALENT, it ranks ***1** overall, while TabPFN edges the highest ACC/F1 by a hair.
163
+ - On OpenML-CC18, Orion-MSP attains the top ACC/F1 (0.8722/0.8676), narrowly ahead of TabPFN and TabDPT.
164
+ - On TabZilla, it leads with the highest ACC/F1 and the best rank.
165
+ - Classical baselines (XGBoost/LightGBM/CatBoost/RF) trail noticeably, highlighting Orion-MSP’s robustness across diverse tabular tasks.
166
+
167
+
168
+ <!-- Orion-MSP: Size Analysis Table -->
169
+ <style>
170
+ /* Container adds horizontal scroll on small screens */
171
+ .hf-table-wrap { overflow-x: auto; margin: 1rem 0; }
172
+ table.bench { border-collapse: collapse; width: 100%; font-size: 0.9rem; }
173
+ table.bench caption { font-weight: 600; text-align: left; margin-bottom: .5rem; }
174
+ table.bench th, table.bench td { border-bottom: 1px solid #e5e7eb; padding: 6px 8px; }
175
+ table.bench thead th { border-bottom: 2px solid #d1d5db; background: #fafafa; }
176
+ table.bench th.sticky { position: sticky; top: 0; z-index: 1; }
177
+ /* Alignment */
178
+ table.bench th, table.bench td { text-align: center; }
179
+ table.bench th:first-child, table.bench td:first-child { text-align: left; white-space: nowrap; }
180
+ /* Emphasis helpers */
181
+ .first { font-weight: 700; } /* 1st place (bold) */
182
+ .second { text-decoration: underline; } /* 2nd place (underline) */
183
+ .first.second { font-weight: 700; text-decoration: underline; } /* bold+underline */
184
+ </style>
185
+
186
+ <div class="hf-table-wrap">
187
+ <table class="bench">
188
+ <caption>
189
+ Performance variation by dataset size across all benchmark suites. Rank = mean rank by accuracy (lower is better).
190
+ ACC = Accuracy; F1 = Weighted F1. Size buckets: Small (&lt;1K), Medium (1K–10K), Large (&gt;10K).
191
+ </caption>
192
+ <thead>
193
+ <tr>
194
+ <th class="sticky" rowspan="2">Models</th>
195
+ <th class="sticky" colspan="3">Small (&lt;1K)</th>
196
+ <th class="sticky" colspan="3">Medium (1K–10K)</th>
197
+ <th class="sticky" colspan="3">Large (&gt;10K)</th>
198
+ </tr>
199
+ <tr>
200
+ <th class="sticky">Rank</th><th class="sticky">ACC</th><th class="sticky">F1</th>
201
+ <th class="sticky">Rank</th><th class="sticky">ACC</th><th class="sticky">F1</th>
202
+ <th class="sticky">Rank</th><th class="sticky">ACC</th><th class="sticky">F1</th>
203
+ </tr>
204
+ </thead>
205
+ <tbody>
206
+ <tr>
207
+ <td>XGBoost</td>
208
+ <td>7.70</td><td>0.8168</td><td>0.7964</td>
209
+ <td>6.88</td><td>0.8363</td><td>0.8314</td>
210
+ <td>5.41</td><td class="first">0.8969</td><td class="first">0.8920</td>
211
+ </tr>
212
+ <tr>
213
+ <td>CatBoost</td>
214
+ <td>7.88</td><td>0.8124</td><td>0.7935</td>
215
+ <td>6.47</td><td>0.8340</td><td>0.8264</td>
216
+ <td>5.48</td><td>0.8797</td><td>0.8733</td>
217
+ </tr>
218
+ <tr>
219
+ <td>Random Forest</td>
220
+ <td>8.55</td><td>0.7988</td><td>0.8187</td>
221
+ <td>7.16</td><td>0.8285</td><td>0.8221</td>
222
+ <td>7.30</td><td>0.8694</td><td>0.8628</td>
223
+ </tr>
224
+ <tr>
225
+ <td>LightGBM</td>
226
+ <td>7.80</td><td>0.8143</td><td>0.7789</td>
227
+ <td>6.94</td><td>0.8314</td><td>0.8226</td>
228
+ <td>5.63</td><td>0.8827</td><td>0.8764</td>
229
+ </tr>
230
+ <tr>
231
+ <td>TabICL</td>
232
+ <td>6.04</td><td>0.8301</td><td class="first">0.8338</td>
233
+ <td>4.77</td><td>0.8486</td><td>0.8398</td>
234
+ <td>4.61</td><td>0.8802</td><td>0.8743</td>
235
+ </tr>
236
+ <tr>
237
+ <td>OrionBiX</td>
238
+ <td>6.32</td><td class="second">0.8330</td><td>0.8150</td>
239
+ <td>5.48</td><td>0.8348</td><td>0.8260</td>
240
+ <td class="second">4.42</td><td>0.8729</td><td>0.8670</td>
241
+ </tr>
242
+ <tr>
243
+ <td>OrionMSP</td>
244
+ <td class="second">5.93</td><td>0.8232</td><td>0.8194</td>
245
+ <td class="first">3.70</td><td class="second">0.8494</td><td class="second">0.8402</td>
246
+ <td class="first">3.04</td><td class="second">0.8843</td><td class="second">0.8768</td>
247
+ </tr>
248
+ <tr>
249
+ <td>TabPFN</td>
250
+ <td>6.50</td><td>0.8325</td><td>0.8131</td>
251
+ <td class="second">3.81</td><td class="first">0.8557</td><td class="first">0.8462</td>
252
+ <td>5.73</td><td>0.8783</td><td>0.8713</td>
253
+ </tr>
254
+ <tr>
255
+ <td>Mitra</td>
256
+ <td>13.88</td><td>0.4334</td><td>0.3236</td>
257
+ <td>11.59</td><td>0.3600</td><td>0.2553</td>
258
+ <td>11.11</td><td>0.3837</td><td>0.2754</td>
259
+ </tr>
260
+ <tr>
261
+ <td>ContextTab</td>
262
+ <td>9.60</td><td>0.7578</td><td>0.7363</td>
263
+ <td>9.52</td><td>0.6210</td><td>0.5566</td>
264
+ <td>10.22</td><td>0.6388</td><td>0.5638</td>
265
+ </tr>
266
+ <tr>
267
+ <td>TabDPT</td>
268
+ <td class="first">5.48</td><td class="first">0.8333</td><td class="second">0.8271</td>
269
+ <td>5.40</td><td>0.8424</td><td>0.8339</td>
270
+ <td>5.26</td><td>0.8831</td><td>0.8765</td>
271
+ </tr>
272
+ </tbody>
273
+ </table>
274
+ </div>
275
+
276
+ OrionMSP is the most consistent top-ranked model as data grows (especially Medium/Large), while TabPFN peaks on Medium and GBDTs
277
+ (e.g., XGBoost) catch up in raw ACC/F1 on Large.
278
+
279
+ <!-- Orion-MSP: Width (Feature Dimensionality) Analysis Table -->
280
+ <style>
281
+ /* Container adds horizontal scroll on small screens */
282
+ .hf-table-wrap { overflow-x: auto; margin: 1rem 0; }
283
+ table.bench { border-collapse: collapse; width: 100%; font-size: 0.9rem; }
284
+ table.bench caption { font-weight: 600; text-align: left; margin-bottom: .5rem; }
285
+ table.bench th, table.bench td { border-bottom: 1px solid #e5e7eb; padding: 6px 8px; }
286
+ table.bench thead th { border-bottom: 2px solid #d1d5db; background: #fafafa; }
287
+ table.bench th.sticky { position: sticky; top: 0; z-index: 1; }
288
+ /* Alignment */
289
+ table.bench th, table.bench td { text-align: center; }
290
+ table.bench th:first-child, table.bench td:first-child { text-align: left; white-space: nowrap; }
291
+ /* Emphasis helpers */
292
+ .first { font-weight: 700; } /* 1st place (bold) */
293
+ .second { text-decoration: underline; } /* 2nd place (underline) */
294
+ .first.second { font-weight: 700; text-decoration: underline; } /* bold+underline */
295
+ </style>
296
+
297
+ <div class="hf-table-wrap">
298
+ <table class="bench">
299
+ <caption>
300
+ Performance vs. feature dimensionality. Rank = mean accuracy rank (lower is better). ACC = Accuracy; F1 = Weighted F1. Groups: Narrow (&lt;10), Medium (10–100), Wide (&gt;100).
301
+ <span class="first">1st</span> ; <span class="second">2nd</span> within each group.
302
+ </caption>
303
+ <thead>
304
+ <tr>
305
+ <th class="sticky" rowspan="2">Models</th>
306
+ <th class="sticky" colspan="3">Narrow (&lt;10)</th>
307
+ <th class="sticky" colspan="3">Medium (10–100)</th>
308
+ <th class="sticky" colspan="3">Wide (&gt;100)</th>
309
+ </tr>
310
+ <tr>
311
+ <th class="sticky">Rank</th><th class="sticky">ACC</th><th class="sticky">F1</th>
312
+ <th class="sticky">Rank</th><th class="sticky">ACC</th><th class="sticky">F1</th>
313
+ <th class="sticky">Rank</th><th class="sticky">ACC</th><th class="sticky">F1</th>
314
+ </tr>
315
+ </thead>
316
+ <tbody>
317
+ <tr>
318
+ <td>XGBoost</td>
319
+ <td>6.77</td><td>0.8222</td><td>0.8159</td>
320
+ <td>6.90</td><td>0.8482</td><td>0.8410</td>
321
+ <td class="first">4.79</td><td>0.9140</td><td>0.9039</td>
322
+ </tr>
323
+ <tr>
324
+ <td>CatBoost</td>
325
+ <td>5.63</td><td>0.8145</td><td>0.8067</td>
326
+ <td>6.88</td><td>0.8441</td><td>0.8344</td>
327
+ <td class="second">5.50</td><td class="first">0.9157</td><td class="second">0.9084</td>
328
+ </tr>
329
+ <tr>
330
+ <td>Random Forest</td>
331
+ <td>7.15</td><td>0.8005</td><td>0.7044</td>
332
+ <td>7.44</td><td>0.8410</td><td>0.8235</td>
333
+ <td>7.52</td><td>0.9034</td><td>0.8936</td>
334
+ </tr>
335
+ <tr>
336
+ <td>LightGBM</td>
337
+ <td>6.15</td><td>0.8128</td><td>0.7907</td>
338
+ <td>6.92</td><td>0.8458</td><td>0.8326</td>
339
+ <td>7.47</td><td>0.8999</td><td>0.8908</td>
340
+ </tr>
341
+ <tr>
342
+ <td>TabICL</td>
343
+ <td>5.14</td><td>0.8208</td><td>0.8119</td>
344
+ <td>4.61</td><td class="second">0.8627</td><td class="second">0.8549</td>
345
+ <td>6.46</td><td>0.9101</td><td>0.8936</td>
346
+ </tr>
347
+ <tr>
348
+ <td>OrionBiX</td>
349
+ <td class="second">4.64</td><td>0.8112</td><td>0.8043</td>
350
+ <td>5.46</td><td>0.8510</td><td>0.8417</td>
351
+ <td>6.73</td><td>0.8859</td><td>0.8849</td>
352
+ </tr>
353
+ <tr>
354
+ <td>OrionMSP</td>
355
+ <td class="first">3.76</td><td class="first">0.8394</td><td class="first">0.8314</td>
356
+ <td class="second">4.09</td><td>0.8572</td><td>0.8478</td>
357
+ <td>5.69</td><td>0.8860</td><td>0.8837</td>
358
+ </tr>
359
+ <tr>
360
+ <td>TabPFN</td>
361
+ <td>5.30</td><td>0.8187</td><td>0.8092</td>
362
+ <td class="first">4.07</td><td class="first">0.8676</td><td class="first">0.8589</td>
363
+ <td>6.141</td><td class="second">0.9129</td><td class="first">0.9111</td>
364
+ </tr>
365
+ <tr>
366
+ <td>Mitra</td>
367
+ <td>11.25</td><td>0.3737</td><td>0.2683</td>
368
+ <td>11.84</td><td>0.3886</td><td>0.2781</td>
369
+ <td>13.03</td><td>0.2521</td><td>0.1497</td>
370
+ </tr>
371
+ <tr>
372
+ <td>ContextTab</td>
373
+ <td>9.52</td><td>0.6391</td><td>0.5719</td>
374
+ <td>9.59</td><td>0.6480</td><td>0.5843</td>
375
+ <td>10.97</td><td>0.6017</td><td>0.5651</td>
376
+ </tr>
377
+ <tr>
378
+ <td>TabDPT</td>
379
+ <td>4.66</td><td class="second">0.8262</td><td class="second">0.8189</td>
380
+ <td>5.45</td><td>0.8566</td><td>0.8483</td>
381
+ <td>7.23</td><td>0.8845</td><td>0.8820</td>
382
+ </tr>
383
+ </tbody>
384
+ </table>
385
+ </div>
386
+
387
+ OrionMSP excels on narrow and stays strong on medium width, while TabPFN dominates medium-width features and GBDTs (XGBoost/CatBoost)
388
+ shine on wide feature spaces.
389
+
390
+ ## Usage
391
+
392
+ ```python
393
+ from orion_msp.sklearn import OrionMSPClassifier
394
+
395
+ # Initialize and use
396
+ clf = OrionMSPClassifier()
397
+ clf.fit(X_train, y_train)
398
+ predictions = clf.predict(X_test)
399
+ ```
400
+
401
+ This code will automatically download the pre-trained model from Hugging Face and use a GPU if available.
402
+
403
+ ## Installation
404
+
405
+ ### From the source
406
+ #### Option 1: From the local clone
407
+
408
+ ```bash
409
+ cd orion-msp
410
+ pip install -e .
411
+ ```
412
+
413
+ #### Option 2: From the Git Remote
414
+
415
+ ```bash
416
+ pip install git+https://github.com/Lexsi-Labs/Orion-MSP.git
417
+ ```
418
+
419
+
420
+ ## Citation
421
+ If you use Orion-MSP, please cite our [paper](https://arxiv.org/abs/2511.02818):
422
+
423
+ ```bibtex
424
+ @article{bouadi25orionmsp,
425
+ title={Orion-MSP: Multi-Scale Sparse Attention for Tabular In-Context Learning},
426
+ author={Mohamed Bouadi and Pratinav Seth and Aditya Tanna and Vinay Kumar Sankarapu},
427
+ year={2025}
428
+ eprint={2511.02818},
429
+ archivePrefix={arXiv},
430
+ primaryClass={cs.AI},
431
+ url={https://arxiv.org/abs/2511.02818},
432
+ }
433
+ ```