Commit a66be86 · verified · committed by waqasm86 · 1 Parent(s): 2d9ad1f

Update README.md

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -21,7 +21,7 @@ Curated collection of GGUF models optimized for **llamatelemetry** on Kaggle dua
 ## 🎯 About This Repository
 
 This repository contains GGUF models tested and verified to work with:
-- **llamatelemetry v0.1.0** - CUDA-first OpenTelemetry Python SDK for LLM inference observability
+- **llamatelemetry v0.1.1** - CUDA-first OpenTelemetry Python SDK for LLM inference observability
 - **Platform**: Kaggle Notebooks (2× Tesla T4, 30GB total VRAM)
 - **CUDA**: 12.5
 
@@ -29,7 +29,7 @@ This repository contains GGUF models tested and verified to work with:
 
 > **Status**: Repository created, models coming soon!
 
-### Planned Models (v0.1.0)
+### Planned Models (v0.1.1)
 
 | Model | Size | Quantization | VRAM | Speed (tok/s) | Status |
 |-------|------|--------------|------|---------------|--------|
@@ -55,7 +55,7 @@ Models in this repository are:
 ```bash
 # On Kaggle with GPU T4 × 2
 pip install --no-cache-dir --force-reinstall \
-git+https://github.com/llamatelemetry/llamatelemetry.git@v0.1.0
+git+https://github.com/llamatelemetry/llamatelemetry.git@v0.1.1
 ```
 
 ### Download and Run a Model