# inference

Swappable local inference backends (`llama_cpp` default, `transformers` optional extra).

```python
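from inference.factory import get_backend

# With no arguments, get_backend() selects the default backend (llama_cpp).
backend = get_backend()
# Load the backend before the first chat call.
backend.load()
# Messages are a list of dicts, each with a "role" and a "content" key.
reply = backend.chat([{"role": "user", "content": "Hello!"}])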
```
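
To illustrate what "swappable" can mean in practice, here is a minimal, self-contained sketch of a backend interface plus a name-based factory. It is not this package's actual implementation: `Backend`, `EchoBackend`, `_REGISTRY`, and `get_backend_sketch` are hypothetical names, and the only parts taken from the usage above are the `load()` and `chat(messages)` calls.

```python
from typing import Callable, Dict, List, Protocol

Message = Dict[str, str]


class Backend(Protocol):
    """Hypothetical interface, inferred only from the usage shown above."""

    def load(self) -> None: ...

    def chat(self, messages: List[Message]) -> str: ...


class EchoBackend:
    """Stand-in backend for illustration; it does not run a model."""

    def __init__(self) -> None:
        self.loaded = False

    def load(self) -> None:
        # A real backend would load model weights here.
        self.loaded = True

    def chat(self, messages: List[Message]) -> str:
        if not self.loaded:
            raise RuntimeError("call load() before chat()")
        # Echo the most recent user message back to the caller.
        last_user = next(m for m in reversed(messages) if m["role"] == "user")
        return f"echo: {last_user['content']}"


# Hypothetical registry mapping backend names to constructors.
_REGISTRY: Dict[str, Callable[[], Backend]] = {"echo": EchoBackend}


def get_backend_sketch(name: str = "echo") -> Backend:
    """Return a new backend instance by name (illustrative only)."""
    try:
        return _REGISTRY[name]()
    except KeyError:
        raise ValueError(f"unknown backend: {name!r}") from None


backend = get_backend_sketch()
backend.load()
reply = backend.chat([{"role": "user", "content": "Hello!"}])
print(reply)
```

Because callers depend only on `load()` and `chat()`, adding another backend under this assumption would mean registering one more constructor, with no change to the calling code.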