Model Card for Indus (modernbert-sde-v0.2)
This model (Indus) was initialized from answerdotai/ModernBERT-base and further pre-trained on the full Science Discovery Engine (SDE) website data.
Model Details
- Base Model: answerdotai/ModernBERT-base
- Tokenizer: answerdotai/ModernBERT-base
- Parameters: 150M
- Pretraining Strategy: Masked Language Modeling (MLM)
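The model loads with the standard Hugging Face `transformers` API. Below is a minimal usage sketch; note that the Hub repo ID is a placeholder, since the published ID is not stated in this card.

```python
from transformers import AutoModelForMaskedLM, AutoTokenizer, pipeline

# Placeholder repo ID (assumption) -- substitute the actual published Hub ID.
model_id = "your-org/modernbert-sde-v0.2"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForMaskedLM.from_pretrained(model_id)

# ModernBERT uses the [MASK] token for masked-language-model inference.
fill = pipeline("fill-mask", model=model, tokenizer=tokenizer)
print(fill("The rover collected [MASK] samples from the Martian surface."))
```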
Training Data
- Full Science Discovery Engine (SDE) Website Data
Training Procedure
- transformers Version: 4.48.3
- Strategy: Masked Language Modeling (MLM)
- Masking Strategy:
  - Weighted dynamic masking based on keyword importance (YAKE), combined with random masking
  - Masking important keywords forces the model to generalize over "science" keywords that carry high signal for a document (see the sketch after this list)
- Masking Probability: 30%
- Batch Size: 7
- Learning Rate: 5e-5
- Warmup Ratio: 0.1
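A minimal sketch of keyword-weighted dynamic masking follows, assuming per-example masking decisions at tokenization time. The exact weighting scheme used to train this model is not fully specified in this card; the YAKE settings and the boost factor below are illustrative assumptions, and the standard 80/10/10 mask/random/keep replacement rule is omitted for brevity.

```python
import torch
import yake
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("answerdotai/ModernBERT-base")
extractor = yake.KeywordExtractor(lan="en", n=2, top=10)  # assumed settings

BASE_P = 0.30  # masking probability from this card
BOOST = 2.0    # illustrative boost for keyword tokens (assumption)

def mask_example(text: str):
    # YAKE returns (keyword, score) pairs; lower score = more important.
    keywords = [kw.lower() for kw, _ in extractor.extract_keywords(text)]
    enc = tokenizer(text, return_offsets_mapping=True, truncation=True)
    input_ids = torch.tensor(enc["input_ids"])
    lower = text.lower()

    # Mark character spans covered by any keyword occurrence.
    is_kw_char = [False] * len(text)
    for kw in keywords:
        start = lower.find(kw)
        while start != -1:
            for i in range(start, start + len(kw)):
                is_kw_char[i] = True
            start = lower.find(kw, start + 1)

    # Per-token masking probability: boost tokens overlapping a keyword span.
    probs = torch.full_like(input_ids, BASE_P, dtype=torch.float)
    for idx, (s, e) in enumerate(enc["offset_mapping"]):
        if s == e:                       # special tokens: never mask
            probs[idx] = 0.0
        elif any(is_kw_char[s:e]):
            probs[idx] = min(1.0, BASE_P * BOOST)

    masked = torch.bernoulli(probs).bool()
    labels = input_ids.clone()
    labels[~masked] = -100               # loss only on masked positions
    input_ids[masked] = tokenizer.mask_token_id
    return input_ids, labels
```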
Dataset
- Total Size: 545,717
- Validation Split: 10% of the total
- Test Split: 10% of the total (leaving 80% for training; see the split sketch below)
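A minimal sketch of the 80/10/10 split using the `datasets` library, assuming the corpus is available as a local text file (the file name is a placeholder; the SDE corpus itself is not reproduced here).

```python
from datasets import load_dataset

# Placeholder file name (assumption) for the SDE corpus.
ds = load_dataset("text", data_files={"all": "sde_corpus.txt"})["all"]

# First carve off 10% of the total for test.
split = ds.train_test_split(test_size=0.10, seed=42)
train_val, test = split["train"], split["test"]

# 10% of the original total is 1/9 of the remaining 90%.
split2 = train_val.train_test_split(test_size=1 / 9, seed=42)
train, validation = split2["train"], split2["test"]

print(len(train), len(validation), len(test))  # ~80% / 10% / 10% of 545,717
```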