Instructions to use N8Programs/NextTerm-440M with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use N8Programs/NextTerm-440M with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="N8Programs/NextTerm-440M")

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("N8Programs/NextTerm-440M")
model = AutoModelForCausalLM.from_pretrained("N8Programs/NextTerm-440M")

MLX

How to use N8Programs/NextTerm-440M with MLX:

# Make sure mlx-lm is installed
# pip install --upgrade mlx-lm
# if on a CUDA device, also pip install mlx[cuda]

# Generate text with mlx-lm
from mlx_lm import load, generate

model, tokenizer = load("N8Programs/NextTerm-440M")

prompt = "Once upon a time in"
text = generate(model, tokenizer, prompt=prompt, verbose=True)

Notebooks
Google Colab
Kaggle
Local Apps Settings
LM Studio

vLLM

How to use N8Programs/NextTerm-440M with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "N8Programs/NextTerm-440M"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "N8Programs/NextTerm-440M",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/N8Programs/NextTerm-440M

SGLang

How to use N8Programs/NextTerm-440M with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "N8Programs/NextTerm-440M" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "N8Programs/NextTerm-440M",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "N8Programs/NextTerm-440M" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "N8Programs/NextTerm-440M",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

MLX LM

How to use N8Programs/NextTerm-440M with MLX LM:

Generate or start a chat session

# Install MLX LM
uv tool install mlx-lm
# Generate some text
mlx_lm.generate --model "N8Programs/NextTerm-440M" --prompt "Once upon a time"

Docker Model Runner
How to use N8Programs/NextTerm-440M with Docker Model Runner:
```
docker model run hf.co/N8Programs/NextTerm-440M
```

NextTerm-440M / eval_results.txt

N8Programs

Add model card and evaluation utilities

5db721a verified about 1 month ago

Raw

History Blame Contribute Delete

1.42 kB

	OEIS-Eval-Neo:
	NextTerm-440M - 34.43%
	NextTerm-47M - 29.49%
	Qwen3-0.6B - 18.44%
	Qwen3-1.7B - 20.77%
	Qwen3-4B - 23.74%
	Qwen3-8B - 24.62%
	Qwen3-14B - 26.00%

	Ryskina & Knight (2021):
	NextTerm-440M - 52.63%
	NextTerm-47M - 70.18%

	16-Shot Bitstring:
	NextTerm-440M - 32.00%
	NextTerm-47M - 27.88%

	M1 Competition 111 MAPE (macro; canonical greedy; lower is better):
	Naive2 - 17.7987
	NextTerm-440M - 17.6239
	NextTerm-47M - 18.7621
	Qwen3-0.6B - 22.7984
	Qwen3-1.7B - 22.2411
	Qwen3-4B - 19.1731
	Qwen3-8B - 18.4027
	Qwen3-14B - 17.9837

	M1 Competition 111 by frequency (macro MAPE):
	Naive2 - monthly 17.3871 / quarterly 19.5958 / yearly 17.1314
	NextTerm-440M - monthly 18.3407 / quarterly 19.0475 / yearly 13.5498
	NextTerm-47M - monthly 21.2719 / quarterly 16.0270 / yearly 13.3741
	Qwen3-0.6B - monthly 24.7585 / quarterly 21.7989 / yearly 17.2835
	Qwen3-1.7B - monthly 24.3821 / quarterly 22.5869 / yearly 14.5642
	Qwen3-4B - monthly 20.6455 / quarterly 19.6394 / yearly 13.6308
	Qwen3-8B - monthly 20.0289 / quarterly 17.7249 / yearly 13.6534
	Qwen3-14B - monthly 19.4006 / quarterly 18.1729 / yearly 12.9486

	Polynomial continuation evals (accuracy; 200 samples per k):
	Arithmetic (k=2-25): NextTerm-440M - 94.38%; NextTerm-47M - 94.15%
	Quadratic (k=3-25): NextTerm-440M - 86.39%; NextTerm-47M - 81.07%
	Cubic (k=4-25): NextTerm-440M - 75.20%; NextTerm-47M - 37.43%
	Quartic (k=5-25): NextTerm-440M - 67.83%; NextTerm-47M - 15.17%