manuelcaccone commited on
Commit
f73d813
·
verified ·
1 Parent(s): 85be178

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +25 -61
README.md CHANGED
@@ -48,19 +48,19 @@ model-index:
48
 
49
  ## ✨ Key Features
50
 
51
- - 🎯 **Domain-specific:** Focused exclusively on actuarial and insurance Q&A
52
- - 📚 **Educational:** Makes complex actuarial terminology accessible for all users
53
- - 🚀 **Efficient:** Fine-tuned with Unsloth for rapid, scalable training
54
- - **Open Source:** Apache 2.0 License; easy to reuse, adapt, remix
55
- - 🌐 **Widget & Demo:** Integrated as a live demo on [ActuaryEnough](https://actuaryenough.vercel.app)
56
 
57
  ---
58
 
59
  ## 💡 Intended Use Cases
60
 
61
- - **Education**: For students and actuaries in training, or for professionals retraining in actuarial language
62
- - **Translation**: Make practical insurance questions understandable at professional actuarial level
63
- - **Research**: Support for actuarial research, Q&A, and domain adaptation
64
 
65
  ### Examples
66
 
@@ -74,59 +74,23 @@ Output: "This relates to premium calculation considering risk factors such as ex
74
 
75
  ## 📂 Training Data
76
 
77
- - **Dataset**: Over 11,000 manually curated actuarial question–answer pairs
78
- - **Topics**: Life and non-life insurance, risk, regulation, reserves, actuarial mathematics
79
- - **Language**: English
80
 
81
  ---
82
 
83
- ## 🔬 Training Procedure & Metrics
84
-
85
- - **Base Model**: unsloth/gemma-3-270m-it
86
- - **Epochs**: ~51 epochs (visible from screenshots, final point in `train/epoch`)
87
- - **Steps**: over 68,000 global steps (`train/global_step` chart)
88
- - **Training Loss**:
89
- - Starts around **2.2**
90
- - Smoothly drops, converges to **~1.4** at the final epoch ([see "train/loss" graph])
91
- - **Learning Rate**:
92
- - Decays linearly from **8e-7** down to near zero ([see "train/learning_rate" graph])
93
- - **Gradient Norm**:
94
- - Usually oscillates between **5** and **15** ([see "train/grad_norm" graph])
95
- - **Hardware**:
96
- - NVIDIA GeForce RTX 3090 (24GB VRAM)
97
- - 16 physical/32 logical core CPU
98
- - 94GB RAM
99
- - CUDA 12.8, Linux 6.10
100
 
101
- ---
102
-
103
- ## ![Train Loss Curve](attached_image:1)
104
-
105
- The curve shows rapid loss reduction in the first epochs, then stable convergence, confirming healthy optimization.
106
-
107
- ---
108
-
109
- ## ![Learning Rate Schedule](attached_image:2)
110
-
111
- Steady linear learning rate decay visible throughout the training cycles.
112
-
113
- ---
114
-
115
- ## ![Gradient Norms](attached_image:3)
116
-
117
- Gradient norms remain well controlled, with only rare spikes, indicating stable training.
118
-
119
- ---
120
-
121
- ## ![Training Epoch Progress](attached_image:4)
122
-
123
- The model was trained for over 50 epochs (as shown on the epoch chart).
124
-
125
- ---
126
-
127
- ## ![Global Steps](attached_image:5)
128
-
129
- A steady climb to over 68,000 update steps during training.
130
 
131
  ---
132
 
@@ -146,10 +110,10 @@ pydantic==2.11.7
146
 
147
  ## ⚠️ Limitations & Ethics
148
 
149
- - **No pricing or decision support:** For education and inspiration only, not for real insurance contracts
150
- - **Not a substitute for an actuary:** Always consult professionals for real-world decisions
151
- - **Coverage:** Designed and tested specifically for the insurance/actuarial domain
152
- - **Training data bias:** Outputs may reflect source content
153
 
154
  ---
155
 
 
48
 
49
  ## ✨ Key Features
50
 
51
+ - 🎯 **Domain-specific:** Focused exclusively on actuarial and insurance Q&A.
52
+ - 📚 **Educational:** Makes complex actuarial terminology accessible for all users.
53
+ - 🚀 **Efficient:** Fine-tuned with Unsloth for rapid, scalable training.
54
+ - 🔓 **Open Source:** Apache 2.0 License; easy to reuse, adapt, remix.
55
+ - 🌐 **Widget & Demo:** Integrated as a live demo on [ActuaryEnough](https://actuaryenough.vercel.app).
56
 
57
  ---
58
 
59
  ## 💡 Intended Use Cases
60
 
61
+ - **Education:** For students and actuaries in training, or for professionals retraining in actuarial language.
62
+ - **Translation:** Make practical insurance questions understandable at professional actuarial level.
63
+ - **Research:** Support for actuarial research, Q&A, and domain adaptation.
64
 
65
  ### Examples
66
 
 
74
 
75
  ## 📂 Training Data
76
 
77
+ - **Dataset:** Over 11,000 manually curated actuarial question–answer pairs.
78
+ - **Topics:** Life and non-life insurance, risk, regulation, reserves, actuarial mathematics.
79
+ - **Language:** English.
80
 
81
  ---
82
 
83
+ ## 📊 Training Statistics
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
84
 
85
+ | Metric | Value / Range | Notes |
86
+ |--------------------|-----------------------|--------------------------------------------|
87
+ | Epochs | ~51 | Reached at end of training |
88
+ | Global Steps | >68,000 | |
89
+ | Initial Train Loss | ~2.2 | At start |
90
+ | Final Train Loss | ~1.4 | At end |
91
+ | Learning Rate | 8e-7 → ≈0 | Linear decay throughout training |
92
+ | Gradient Norm | 5 – 15 | Generally stable with rare spikes |
93
+ | Hardware | RTX 3090, 16-core CPU | 24GB VRAM, 94GB RAM, CUDA 12.8, Linux 6.1 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
94
 
95
  ---
96
 
 
110
 
111
  ## ⚠️ Limitations & Ethics
112
 
113
+ - **No pricing or decision support:** For education and inspiration only, not for real insurance contracts.
114
+ - **Not a substitute for an actuary:** Always consult professionals for real-world decisions.
115
+ - **Coverage:** Designed and tested specifically for the insurance/actuarial domain.
116
+ - **Training data bias:** Outputs may reflect source content.
117
 
118
  ---
119