finance-slm-5m / README.md
sathishphdai's picture
Upload Finance-SLM v2
a3fb4c9 verified
metadata
language:
  - en
license: mit
tags:
  - finance
  - banking
  - fintech
  - trading
  - slm
  - llama-style
  - rope
  - 5m-context
  - from-scratch
pipeline_tag: text-generation

Finance-SLM: Finance Small Language Model

A LLaMA-style transformer (~33.9M params) trained from scratch on Finance domain data. Supports up to 5M token context via RoPE.

Architecture

Component Value
Architecture LLaMA-style (RoPE + RMSNorm + SwiGLU)
Parameters ~33.9M
Layers 8
Heads 8
Embedding 512
Max Context 5,000,000 tokens
Vocab 16,000 BPE
Best Loss 0.7834605552256108

License

MIT — Built from scratch.