yang-ai-lab
/

OSF-Base

@@ -1,27 +1,29 @@
 ---
 license: mit
 tags:
-  - sleep
-  - eeg
-  - polysomnography
-  - foundation-model
-  - self-supervised
-  - vit
-  - biosignals
-pipeline_tag: feature-extraction
-library_name: pytorch
-language:
-  - en
 ---
 # OSF: On Pre-training and Scaling of Sleep Foundation Models
-[![Paper](https://img.shields.io/badge/paper-arXiv-red)](#citation)
 [![Webpage](https://img.shields.io/badge/website-demo-blue)](https://yang-ai-lab.github.io/osf/)
 [![License](https://img.shields.io/badge/license-MIT-green)](LICENSE)
 [![Python](https://img.shields.io/badge/python-3.10%2B-brightgreen)](#installation)
 ## 🔥 News
 - [2026-2-24] Our codebase and checkpoint is released. Full codebase for benchmarking will be public available after acceptance.
@@ -29,26 +31,7 @@ language:
 ## 📖 Introduction
-Polysomnography (PSG) provides the gold standard for sleep assessment but suffers from substantial heterogeneity across recording devices and cohorts.
-There have been growing efforts to build general-purpose foundation models (FMs) for sleep physiology, but lack an in-depth understanding of the pre-training process and scaling patterns that lead to more generalizable sleep FMs.
-To fill this gap, we curate a massive corpus of 166,500 hours of sleep recordings from nine public sources and establish SleepBench, a comprehensive, fully open-source benchmark.
-Leveraging SleepBench, we systematically evaluate four families of self-supervised pre-training objectives and uncover three critical findings:
-(1) existing FMs fail to generalize to missing channels at inference;
-(2) channel-invariant feature learning is essential for pre-training;
-and (3) scaling sample size, model capacity, and multi-source data mixture consistently improves downstream performance.
-With an enhanced pre-training and scaling recipe, we introduce OSF, a family of sleep FMs that achieves state-of-the-art performance across nine datasets on diverse sleep and disease prediction tasks.
-Further analysis of OSF also reveals intriguing properties in sample efficiency, hierarchical aggregation, and cross-dataset scaling.
-## 📖 Table of Contents
-1. [Installation](#-installation)
-2. [Quick Start](#-quick-start)
-3. [Pretrained Weights](#-pretrained-weights)
-4. [Usage](#-usage)
-5. [Benchmark Evaluations](#-benchmark-evaluations)
-6. [Supported Datasets](#-supported-datasets)
-7. [Citation](#-citation)
 ## 💿 Installation
@@ -59,14 +42,12 @@ conda env create -f environment.yml
 conda activate myenv
 ```
 ### Dependencies
 - Python >= 3.10
 - PyTorch >= 2.9.0
 - PyTorch Lightning >= 2.5.5
 ## 🚀 Quick Start
 We provide a demo notebook (`demo.ipynb`) demonstrating how to extract embeddings from PSG signals using the pretrained model.
@@ -98,25 +79,6 @@ with torch.no_grad():
 # patch_embs: [B, 90, 768] - Local patch representations
 ```
-## 📦 Pretrained Weights
-| Model | Backbone | Channels |
-|-------|----------|----------|
-| OSF | ViT-Base | 12-ch |
-The pretrained weights are included in this repository. You can download them via the Hugging Face Hub:
-```python
-from huggingface_hub import hf_hub_download
-checkpoint_path = hf_hub_download(repo_id="yang-ai-lab/OSF-Base", filename="osf_backbone.pth")
-```
-Or via the CLI:
-```bash
-huggingface-cli download yang-ai-lab/OSF-Base osf_backbone.pth
-```
 ## 👩‍💻 Usage
 ### Input Format
@@ -127,130 +89,23 @@ Expected input format:
 - **Epoch Length**: 30 seconds
 - **Input Shape**: `[B, 12, 1920]`
-### Pretraining
-We support multiple self-supervised pretraining methods, for example, to launch pre-training of our OSF method, run pretraining:
-```bash
-python main_pretrain.py \
-    --model_name "dino_ours" \
-    --psg_encoder_name "vit_base" \
-    --batch_size 256 \
-    --lr 5e-5 \
-    --max_epochs 30 \
-    --num_devices 4 \
-    --patch_size_time 64 \
-    --patch_size_ch 4 \
-    --precision "bf16-mixed"
-```
-See `main_pipleines/main_pretrain.py` for more detailed settings.
-### Fine-tuning
-Fine-tune the pretrained model on downstream tasks:
-```bash
-python main_finetune.py \
-    --model_name "dino_ours" \
-    --ckpt_path "/path/to/pretrained/checkpoint.ckpt" \
-    --downstream_dataset_name "shhs" \
-    --eval_label "Stage" \
-    --train_data_pct 1.0 \
-    --max_steps 500 \
-    --lr 0.1 \
-    --num_devices 4
-```
 ## 📊 Benchmark Evaluations
-### Benchmarked SSL Methods
-| Method | Type | Original Paper |
-|--------|------|-------------|
-| SleepFM | Contrastive | Leave-one-out multi-modal contrastive learning |
-| SimCLR | Contrastive | Simple Contrastive Learning |
-| DINO | Self-distillation | DINOv2 (Oquab et al., 2023) |
-| VQ-VAE | Reconstruction | Vector-quantized variational autoencoder |
-| MAE | Reconstruction | Masked Autoencoding |
-| AR | Autoregressive | Autoregressive Next-Token prediction |
-| OSF | Self-distillation | ours |
-### Downstream Tasks
-**Epoch-level Classification Tasks:**
-| Task | Classes | Description |
-|------|---------|-------------|
-| Sleep Stage | 4 | Awake, Light Sleep, Deep Sleep, REM classification |
-| Arousal | 2 | Arousal event detection |
-| Hypopnea | 2 | Hypopnea event detection |
-| Oxygen Desaturation | 2 | Oxygen desaturation detection |
-### Evaluation Settings
-| Setting | Description |
-|---------|-------------|
-| Linear Probing | Freeze backbone, train linear classifier |
-| Full Fine-tuning | Fine-tune entire model end-to-end |
-| Few-shot (k-shot) | Train with limited labeled samples |
-For example scripts, see `main_pipelines` and `bash_scripts` folders.
-## 📊 Supported Datasets
-We aggregated nine large-scale datasets from the National Sleep Research Resource platform.
-| Dataset | Full Name | Source |
-|---------|-----------|--------|
-| SHHS | Sleep Heart Health Study | NSRR |
-| CHAT | Childhood Adenotonsillectomy Trial | NSRR |
-| MROS | MrOS Sleep Study | NSRR |
-| CCSHS | Cleveland Children's Sleep and Health Study | NSRR |
-| CFS | Cleveland Family Study | NSRR |
-| MESA | Multi-Ethnic Study of Atherosclerosis | NSRR |
-| SOF | Study of Osteoporotic Fractures | NSRR |
-| WSC | Wisconsin Sleep Cohort | NSRR |
-| STAGES | Stanford Technology Analytics and Genomics in Sleep | NSRR |
-| NCHSDB | NCH Sleep DataBank | NSRR |
-For new users, please apply for an account and access to each of these datasets following instructions here [NSRR Registration](https://sleepdata.org/join)
-## 📁 Project Structure
-```
-OSF-Open-Sleep-Foundation-Model/
-├── osf/
-│   ├── backbone/          # ViT backbone implementations
-│   │   └── vit1d_cls.py
-│   ├── models/            # SSL model implementations
-│   │   └── dino_model_cls.py
-│   │
-│   ├── datasets/          # Data loading utilities
-│   └── utils/             # Helper functions
-├── main_pipelines/        # Training scripts
-│   ├── main_pretrain.py
-│   └── ...
-├── bash_scripts/          # Example bash scripts
-├── osf_backbone.pth       # Pretrained model weights
-├── demo.ipynb             # Quick start demo
-├── config.py              # Dataset and channel configurations
-└── train_config.py        # Training configurations
-```
 ## 📝 Citation
-If you use this code or models in your research, please cite our paper:
 ```bibtex
 @article{shuai2026osf,
   title={OSF: On Pre-training and Scaling of Sleep Foundation Models},
   author={Shuai, Zitao and Xu, Zongzhe and Yang, David and Wang, Wei and Yang, Yuzhe},
-  journal={arXiv preprint},
   year={2026}
 }
-```

 ---
+language:
+- en
+library_name: pytorch
 license: mit
+pipeline_tag: other
 tags:
+- sleep
+- eeg
+- polysomnography
+- foundation-model
+- self-supervised
+- vit
+- biosignals
 ---
 # OSF: On Pre-training and Scaling of Sleep Foundation Models
+This repository contains the weights for **OSF**, a family of sleep foundation models introduced in the paper [OSF: On Pre-training and Scaling of Sleep Foundation Models](https://huggingface.co/papers/2603.00190).
+[![Paper](https://img.shields.io/badge/paper-arXiv-red)](https://huggingface.co/papers/2603.00190)
 [![Webpage](https://img.shields.io/badge/website-demo-blue)](https://yang-ai-lab.github.io/osf/)
+[![GitHub](https://img.shields.io/badge/github-code-black)](https://github.com/yang-ai-lab/OSF-Open-Sleep-FM)
 [![License](https://img.shields.io/badge/license-MIT-green)](LICENSE)
 [![Python](https://img.shields.io/badge/python-3.10%2B-brightgreen)](#installation)
 ## 🔥 News
 - [2026-2-24] Our codebase and checkpoint is released. Full codebase for benchmarking will be public available after acceptance.
 ## 📖 Introduction
+Polysomnography (PSG) provides the gold standard for sleep assessment but suffers from substantial heterogeneity across recording devices and cohorts. OSF is a family of sleep foundation models (FMs) pre-trained on a massive corpus of 166,500 hours of sleep recordings from nine public sources. Leveraging the SleepBench benchmark, the authors establish an enhanced pre-training and scaling recipe that achieves state-of-the-art performance across diverse sleep and disease prediction tasks.
 ## 💿 Installation
 conda activate myenv
 ```
 ### Dependencies
 - Python >= 3.10
 - PyTorch >= 2.9.0
 - PyTorch Lightning >= 2.5.5
 ## 🚀 Quick Start
 We provide a demo notebook (`demo.ipynb`) demonstrating how to extract embeddings from PSG signals using the pretrained model.
 # patch_embs: [B, 90, 768] - Local patch representations
 ```
 ## 👩‍💻 Usage
 ### Input Format
 - **Epoch Length**: 30 seconds
 - **Input Shape**: `[B, 12, 1920]`
+### Pretraining and Fine-tuning
+For detailed instructions on pretraining and fine-tuning using the OSF framework, please refer to the scripts in the [official GitHub repository](https://github.com/yang-ai-lab/OSF-Open-Sleep-FM).
 ## 📊 Benchmark Evaluations
+OSF has been evaluated on the **SleepBench** benchmark across tasks such as Sleep Stage classification, Arousal detection, Hypopnea event detection, and Oxygen Desaturation detection, outperforming existing SSL methods like SleepFM, SimCLR, and DINO.
 ## 📝 Citation
+If you use this code or models in your research, please cite the paper:
 ```bibtex
 @article{shuai2026osf,
   title={OSF: On Pre-training and Scaling of Sleep Foundation Models},
   author={Shuai, Zitao and Xu, Zongzhe and Yang, David and Wang, Wei and Yang, Yuzhe},
+  journal={arXiv preprint arXiv:2603.00190},
   year={2026}
 }
+```