code-1b-chat-space / README.md
rovdetection's picture
Initial files added
e17358f verified
|
Raw
History Blame Contribute Delete
963 Bytes
metadata
title: Code 1B Chat API
emoji: 💻
colorFrom: blue
colorTo: indigo
sdk: docker
pinned: false
license: apache-2.0

code-1b-chat-v2 — Inference API

OpenAI-compatible REST API for the code-1b-chat-v2 model. Built with FastAPI + llama-cpp-python. Runs on CPU.

Endpoints

Method Path Description
GET /health Health check
GET /v1/models List available models
POST /v1/chat/completions Chat (OpenAI-compatible)

Example

import requests

resp = requests.post(
    "https://rovdetection-code-1b-chat-space.hf.space/v1/chat/completions",
    json={
        "model": "code-1b-chat-v2",
        "messages": [{"role": "user", "content": "Write a Python fibonacci function."}],
        "max_tokens": 200,
        "temperature": 0.7,
    }
)
print(resp.json()["choices"][0]["message"]["content"])

Streaming

Add "stream": true to the request body for SSE streaming.