Update README.md
Browse files
README.md
CHANGED
|
@@ -21,7 +21,7 @@ Curated collection of GGUF models optimized for **llamatelemetry** on Kaggle dua
|
|
| 21 |
## 🎯 About This Repository
|
| 22 |
|
| 23 |
This repository contains GGUF models tested and verified to work with:
|
| 24 |
-
- **llamatelemetry v0.1.
|
| 25 |
- **Platform**: Kaggle Notebooks (2× Tesla T4, 30GB total VRAM)
|
| 26 |
- **CUDA**: 12.5
|
| 27 |
|
|
@@ -29,7 +29,7 @@ This repository contains GGUF models tested and verified to work with:
|
|
| 29 |
|
| 30 |
> **Status**: Repository created, models coming soon!
|
| 31 |
|
| 32 |
-
### Planned Models (v0.1.
|
| 33 |
|
| 34 |
| Model | Size | Quantization | VRAM | Speed (tok/s) | Status |
|
| 35 |
|-------|------|--------------|------|---------------|--------|
|
|
@@ -55,7 +55,7 @@ Models in this repository are:
|
|
| 55 |
```bash
|
| 56 |
# On Kaggle with GPU T4 × 2
|
| 57 |
pip install --no-cache-dir --force-reinstall \
|
| 58 |
-
git+https://github.com/llamatelemetry/llamatelemetry.git@v0.1.
|
| 59 |
```
|
| 60 |
|
| 61 |
### Download and Run a Model
|
|
|
|
| 21 |
## 🎯 About This Repository
|
| 22 |
|
| 23 |
This repository contains GGUF models tested and verified to work with:
|
| 24 |
+
- **llamatelemetry v0.1.1** - CUDA-first OpenTelemetry Python SDK for LLM inference observability
|
| 25 |
- **Platform**: Kaggle Notebooks (2× Tesla T4, 30GB total VRAM)
|
| 26 |
- **CUDA**: 12.5
|
| 27 |
|
|
|
|
| 29 |
|
| 30 |
> **Status**: Repository created, models coming soon!
|
| 31 |
|
| 32 |
+
### Planned Models (v0.1.1)
|
| 33 |
|
| 34 |
| Model | Size | Quantization | VRAM | Speed (tok/s) | Status |
|
| 35 |
|-------|------|--------------|------|---------------|--------|
|
|
|
|
| 55 |
```bash
|
| 56 |
# On Kaggle with GPU T4 × 2
|
| 57 |
pip install --no-cache-dir --force-reinstall \
|
| 58 |
+
git+https://github.com/llamatelemetry/llamatelemetry.git@v0.1.1
|
| 59 |
```
|
| 60 |
|
| 61 |
### Download and Run a Model
|