File size: 2,047 Bytes
c68d25e 816bcda c68d25e 57d2d54 816bcda 57d2d54 816bcda 57d2d54 816bcda 57d2d54 816bcda 8bfbf95 5235ab2 57d2d54 816bcda 57d2d54 816bcda 57d2d54 816bcda 5989055 6ecff10 5989055 816bcda | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 | ---
license: mit
pipeline_tag: other
tags:
- neuroscience
- brain-to-text
- speech decoding
- brain decoding
- large brain models
- brain foundation models
---
# MEG-XL: Data-Efficient Brain-to-Text via Long-Context Pre-Training
MEG-XL is a brain-to-text foundation model pre-trained with 2.5 minutes of MEG context per sample (equivalent to 191k tokens). It is designed to capture extended neural context, enabling high data efficiency for decoding words from brain activity.
- **Paper:** [MEG-XL: Data-Efficient Brain-to-Text via Long-Context Pre-Training](https://huggingface.co/papers/2602.02494)
- **Repository:** [GitHub - neural-processing-lab/MEG-XL](https://github.com/neural-processing-lab/MEG-XL)
- **Weights/Checkpoint:** [meg-xl-med.ckpt](https://huggingface.co/pnpl/MEG-XL/blob/main/meg-xl-med.ckpt)
## Usage
Instructions for environment setup and data preparation are available in the [official GitHub repository](https://github.com/neural-processing-lab/MEG-XL).
### Fine-tuning MEG-XL for Brain-to-Text
You can fine-tune or evaluate the model on word decoding tasks using the following command structure:
```bash
python -m brainstorm.evaluate_criss_cross_word_classification \
--config-name=eval_criss_cross_word_classification_{armeni, gwilliams, libribrain} \
model.criss_cross_checkpoint=/path/to/your/checkpoint.ckpt
```
### Linear Probing
To perform linear probing, use:
```bash
python -m brainstorm.evaluate_criss_cross_word_classification \
--config-name=eval_criss_cross_word_classification_linear_probe_{armeni, gwilliams, libribrain} \
model.criss_cross_checkpoint=/path/to/your/checkpoint.ckpt
```
## Requirements
- Python >= 3.12
- High-VRAM GPU (>= 40-80GiB depending on the task).
## Citation
If you find this work helpful in your research, please cite:
```bibtex
@article{jayalath2026megxl,
title={{MEG-XL}: Data-Efficient Brain-to-Text via Long-Context Pre-Training},
author={Jayalath, Dulhan and Parker Jones, Oiwi},
journal={arXiv preprint arXiv:2602.02494},
year={2026}
}
``` |