HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings
Paper • 2502.15411 • Published • 2
This model was presented in the paper HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings.
Pre-BERT-SL1000 is a BERT-based sequence labeling model fine-tuned on the HiFi-KPI dataset for extracting financial key performance indicators (KPIs) from SEC earnings filings (10-K & 10-Q). It specializes in identifying entities that are one level up the presentation taxonomy, such as revenueAbstract, earnings, and financial ratios, using token classification.
This model is trained specifically on n=1 with the presentation taxonomy labels from HiFi-KPI, focusing on entity identification.
If you use this model or dataset, please cite:
@article{aavang2025hifikpi,
title={HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings},
author={Aavang, Rasmus and Rizzi, Giovanni and B{\o}ggild, Rasmus and Iolov, Alexandre and Zhang, Mike and Bjerva, Johannes},
journal={arXiv preprint arXiv:2502.15411},
year={2025}
}
Base model
google-bert/bert-base-uncased