File size: 1,029 Bytes
fcb3d06
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1487522
fcb3d06
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
# ✨ Sparkfile

A hobbyist build of mozilla-ai/llamafile for DGX Spark.

`v0.10.0`

both CUDA and Vulkan backends packaged.

after download validate backends

```sh
chmod +x ./sparkfile
./sparkfile --gpu nvidia --cli -m your_local_model.gguf --list-devices
./sparkfile --gpu vulkan --cli -m your_local_model.gguf --list-devices
```

*should see both CUDA and Vulkan backends being reported.


basic usage

```sh
./sparkfile --chat -m your_local_model.gguf
```

```sh
./sparkfile --gpu nvidia --chat -m your_local_model.gguf
```

```sh
./sparkfile --gpu vulkan --chat -m your_local_model.gguf
```

```sh
./sparkfile --server ... <llama-server args>
```

for certain reason you are not launching as an executable itself, e.g. wsl2

```sh
sh ./sparkfile <args>
```

Expected Platforms and OSs
CPU: ANY if supported by llamafile
GPU:
 - aarch64_cpu_linux: DGX Spark (GB10), Jetson Thor
 - x86-64_cpu_linux: (needs a seperate `.so` or recompile)
 - x86-64_cpu_win: (needs a seperate `.dll` or recompile)

---
license: apache-2.0
---