File size: 2,010 Bytes
292d92c
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
---
description: Review installed Ollama models and suggest others based on hardware
tags: [ai, ml, ollama, models, recommendations, project, gitignored]
---

You are helping the user review their Ollama models and suggest new ones based on their hardware.

## Process

1. **Check currently installed models**
   - Run: `ollama list`
   - Show model sizes and last modified dates
   - Calculate total disk usage

2. **Assess hardware capabilities**
   - Check GPU VRAM: `rocm-smi` (for AMD) or `nvidia-smi` (for NVIDIA)
   - Check system RAM: `free -h`
   - Determine recommended model sizes:
     - < 8GB VRAM: 7B models and smaller
     - 8-16GB VRAM: up to 13B models
     - 16-24GB VRAM: up to 34B models
     - 24GB+ VRAM: 70B+ models possible

3. **Identify user's needs**
   - Ask about use cases:
     - General chat
     - Code generation
     - Data analysis
     - Creative writing
     - Vision/multimodal
     - Specialized domains

4. **Suggest models by category**

   **General Purpose:**
   - llama3.2 (3B, 8B)
   - qwen2.5 (7B, 14B, 32B)
   - mistral (7B)
   - gemma2 (9B, 27B)

   **Code:**
   - codellama (7B, 13B, 34B)
   - deepseek-coder (6.7B, 33B)
   - starcoder2 (7B, 15B)

   **Fast/Small:**
   - tinyllama (1.1B)
   - phi3 (3.8B)

   **Multimodal:**
   - llava (7B, 13B, 34B)
   - bakllava (7B)

   **Specialized:**
   - meditron (medical)
   - sqlcoder (SQL generation)
   - wizardmath (mathematics)

5. **Consider quantization levels**
   - Explain different quants (Q4, Q5, Q8, etc.)
   - Suggest appropriate quant for their VRAM

6. **Cleanup suggestions**
   - Identify duplicate models
   - Suggest removing unused models: `ollama rm <model>`
   - Free up space for new models

## Output

Provide a report showing:
- Currently installed models and total size
- Hardware capacity summary
- Recommended models based on:
  - Available VRAM
  - User's use cases
  - Current gaps in model coverage
- Commands to install suggested models
- Models that could be removed to save space