Update README.md
Browse files
README.md
CHANGED
|
@@ -48,19 +48,19 @@ model-index:
|
|
| 48 |
|
| 49 |
## ✨ Key Features
|
| 50 |
|
| 51 |
-
- 🎯 **Domain-specific:** Focused exclusively on actuarial and insurance Q&A
|
| 52 |
-
- 📚 **Educational:** Makes complex actuarial terminology accessible for all users
|
| 53 |
-
- 🚀 **Efficient:** Fine-tuned with Unsloth for rapid, scalable training
|
| 54 |
-
-
|
| 55 |
-
- 🌐 **Widget & Demo:** Integrated as a live demo on [ActuaryEnough](https://actuaryenough.vercel.app)
|
| 56 |
|
| 57 |
---
|
| 58 |
|
| 59 |
## 💡 Intended Use Cases
|
| 60 |
|
| 61 |
-
- **Education**
|
| 62 |
-
- **Translation**
|
| 63 |
-
- **Research**
|
| 64 |
|
| 65 |
### Examples
|
| 66 |
|
|
@@ -74,59 +74,23 @@ Output: "This relates to premium calculation considering risk factors such as ex
|
|
| 74 |
|
| 75 |
## 📂 Training Data
|
| 76 |
|
| 77 |
-
- **Dataset**
|
| 78 |
-
- **Topics**
|
| 79 |
-
- **Language**
|
| 80 |
|
| 81 |
---
|
| 82 |
|
| 83 |
-
##
|
| 84 |
-
|
| 85 |
-
- **Base Model**: unsloth/gemma-3-270m-it
|
| 86 |
-
- **Epochs**: ~51 epochs (visible from screenshots, final point in `train/epoch`)
|
| 87 |
-
- **Steps**: over 68,000 global steps (`train/global_step` chart)
|
| 88 |
-
- **Training Loss**:
|
| 89 |
-
- Starts around **2.2**
|
| 90 |
-
- Smoothly drops, converges to **~1.4** at the final epoch ([see "train/loss" graph])
|
| 91 |
-
- **Learning Rate**:
|
| 92 |
-
- Decays linearly from **8e-7** down to near zero ([see "train/learning_rate" graph])
|
| 93 |
-
- **Gradient Norm**:
|
| 94 |
-
- Usually oscillates between **5** and **15** ([see "train/grad_norm" graph])
|
| 95 |
-
- **Hardware**:
|
| 96 |
-
- NVIDIA GeForce RTX 3090 (24GB VRAM)
|
| 97 |
-
- 16 physical/32 logical core CPU
|
| 98 |
-
- 94GB RAM
|
| 99 |
-
- CUDA 12.8, Linux 6.10
|
| 100 |
|
| 101 |
-
|
| 102 |
-
|
| 103 |
-
|
| 104 |
-
|
| 105 |
-
|
| 106 |
-
|
| 107 |
-
-
|
| 108 |
-
|
| 109 |
-
|
| 110 |
-
|
| 111 |
-
Steady linear learning rate decay visible throughout the training cycles.
|
| 112 |
-
|
| 113 |
-
---
|
| 114 |
-
|
| 115 |
-
## 
|
| 116 |
-
|
| 117 |
-
Gradient norms remain well controlled, with only rare spikes, indicating stable training.
|
| 118 |
-
|
| 119 |
-
---
|
| 120 |
-
|
| 121 |
-
## 
|
| 122 |
-
|
| 123 |
-
The model was trained for over 50 epochs (as shown on the epoch chart).
|
| 124 |
-
|
| 125 |
-
---
|
| 126 |
-
|
| 127 |
-
## 
|
| 128 |
-
|
| 129 |
-
A steady climb to over 68,000 update steps during training.
|
| 130 |
|
| 131 |
---
|
| 132 |
|
|
@@ -146,10 +110,10 @@ pydantic==2.11.7
|
|
| 146 |
|
| 147 |
## ⚠️ Limitations & Ethics
|
| 148 |
|
| 149 |
-
- **No pricing or decision support:** For education and inspiration only, not for real insurance contracts
|
| 150 |
-
- **Not a substitute for an actuary:** Always consult professionals for real-world decisions
|
| 151 |
-
- **Coverage:** Designed and tested specifically for the insurance/actuarial domain
|
| 152 |
-
- **Training data bias:** Outputs may reflect source content
|
| 153 |
|
| 154 |
---
|
| 155 |
|
|
|
|
| 48 |
|
| 49 |
## ✨ Key Features
|
| 50 |
|
| 51 |
+
- 🎯 **Domain-specific:** Focused exclusively on actuarial and insurance Q&A.
|
| 52 |
+
- 📚 **Educational:** Makes complex actuarial terminology accessible for all users.
|
| 53 |
+
- 🚀 **Efficient:** Fine-tuned with Unsloth for rapid, scalable training.
|
| 54 |
+
- 🔓 **Open Source:** Apache 2.0 License; easy to reuse, adapt, remix.
|
| 55 |
+
- 🌐 **Widget & Demo:** Integrated as a live demo on [ActuaryEnough](https://actuaryenough.vercel.app).
|
| 56 |
|
| 57 |
---
|
| 58 |
|
| 59 |
## 💡 Intended Use Cases
|
| 60 |
|
| 61 |
+
- **Education:** For students and actuaries in training, or for professionals retraining in actuarial language.
|
| 62 |
+
- **Translation:** Make practical insurance questions understandable at professional actuarial level.
|
| 63 |
+
- **Research:** Support for actuarial research, Q&A, and domain adaptation.
|
| 64 |
|
| 65 |
### Examples
|
| 66 |
|
|
|
|
| 74 |
|
| 75 |
## 📂 Training Data
|
| 76 |
|
| 77 |
+
- **Dataset:** Over 11,000 manually curated actuarial question–answer pairs.
|
| 78 |
+
- **Topics:** Life and non-life insurance, risk, regulation, reserves, actuarial mathematics.
|
| 79 |
+
- **Language:** English.
|
| 80 |
|
| 81 |
---
|
| 82 |
|
| 83 |
+
## 📊 Training Statistics
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 84 |
|
| 85 |
+
| Metric | Value / Range | Notes |
|
| 86 |
+
|--------------------|-----------------------|--------------------------------------------|
|
| 87 |
+
| Epochs | ~51 | Reached at end of training |
|
| 88 |
+
| Global Steps | >68,000 | |
|
| 89 |
+
| Initial Train Loss | ~2.2 | At start |
|
| 90 |
+
| Final Train Loss | ~1.4 | At end |
|
| 91 |
+
| Learning Rate | 8e-7 → ≈0 | Linear decay throughout training |
|
| 92 |
+
| Gradient Norm | 5 – 15 | Generally stable with rare spikes |
|
| 93 |
+
| Hardware | RTX 3090, 16-core CPU | 24GB VRAM, 94GB RAM, CUDA 12.8, Linux 6.1 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 94 |
|
| 95 |
---
|
| 96 |
|
|
|
|
| 110 |
|
| 111 |
## ⚠️ Limitations & Ethics
|
| 112 |
|
| 113 |
+
- **No pricing or decision support:** For education and inspiration only, not for real insurance contracts.
|
| 114 |
+
- **Not a substitute for an actuary:** Always consult professionals for real-world decisions.
|
| 115 |
+
- **Coverage:** Designed and tested specifically for the insurance/actuarial domain.
|
| 116 |
+
- **Training data bias:** Outputs may reflect source content.
|
| 117 |
|
| 118 |
---
|
| 119 |
|