README / README.md
Eghbal's picture
Update README.md
81afda4 verified
|
raw
history blame
1.72 kB
metadata
title: README
emoji: πŸ’»
colorFrom: gray
colorTo: blue
sdk: static
pinned: false

README πŸ’»

FinText: A repository of Financial LLMs

πŸš€ Stage 1 Release πŸš€

We are thrilled to introduce a specialized collection of 68 large language models (LLMs), meticulously designed for the accounting and finance. The FinText models have been pre-trained on domain-specific historical data, addressing challenges like look-ahead bias and information leakage. These models are crafted to elevate the accuracy and depth of financial research and analysis.

πŸ’‘ Key Features:

  • Domain-Specific Training: FinText utilises diverse financial datasets such as news articles, regulatory filings, IP records, corporate speeches (ECB, FED), and more.
  • Time-Period Specific Models: Separate models are pre-trained for each year from 2007 to 2023, ensuring the utmost precision and historical relevance.
  • RoBERTa Architecture: The suite includes both a base model with 125 million parameters and a smaller variant with 51 million parametersβ€”totalling 34 pre-trained models. 🎯
  • Two distinct pre-training durations: We also introduce a series of models to explore the impact of futher pre-training. These models are pre-trained for an additional 5 epochs, extending the total pre-training epochs to 10.

Stay tuned for upcoming updates and new features for FinText. We expect to launch stages 2 and 3 within the next 9 months. πŸŽ‰