Update README.md
Browse files
README.md
CHANGED
|
@@ -45,7 +45,7 @@ This model is ideal for:
|
|
| 45 |
|
| 46 |
- **Data**: Custom Sanskrit dataset of over 100,000+ Devanagari `.txt` files.
|
| 47 |
- **Tokenizer**: [SentencePiece](https://github.com/google/sentencepiece) BPE model trained with `character_coverage=1.0`.
|
| 48 |
-
- **Training Platform**: AWS SageMaker
|
| 49 |
- **Framework**: PyTorch with custom FlashAttention blocks
|
| 50 |
- **Training Time**: ~3 epochs with dynamic batching on sharded data
|
| 51 |
|
|
|
|
| 45 |
|
| 46 |
- **Data**: Custom Sanskrit dataset of over 100,000+ Devanagari `.txt` files.
|
| 47 |
- **Tokenizer**: [SentencePiece](https://github.com/google/sentencepiece) BPE model trained with `character_coverage=1.0`.
|
| 48 |
+
- **Training Platform**: AWS SageMaker Tesla V100 GPU
|
| 49 |
- **Framework**: PyTorch with custom FlashAttention blocks
|
| 50 |
- **Training Time**: ~3 epochs with dynamic batching on sharded data
|
| 51 |
|