Instructions to use vigneshwar234/TemporalMesh-Transformer with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use vigneshwar234/TemporalMesh-Transformer with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="vigneshwar234/TemporalMesh-Transformer")

# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("vigneshwar234/TemporalMesh-Transformer", dtype="auto")

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use vigneshwar234/TemporalMesh-Transformer with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "vigneshwar234/TemporalMesh-Transformer"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "vigneshwar234/TemporalMesh-Transformer",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/vigneshwar234/TemporalMesh-Transformer

SGLang

How to use vigneshwar234/TemporalMesh-Transformer with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "vigneshwar234/TemporalMesh-Transformer" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "vigneshwar234/TemporalMesh-Transformer",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "vigneshwar234/TemporalMesh-Transformer" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "vigneshwar234/TemporalMesh-Transformer",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Docker Model Runner
How to use vigneshwar234/TemporalMesh-Transformer with Docker Model Runner:
```
docker model run hf.co/vigneshwar234/TemporalMesh-Transformer
```

vigneshwar234 commited on 2 days ago

Commit

a25d929

verified ·

1 Parent(s): f817d5c

Add source: tmt/experiments/03_full_tmt.ipynb

Browse files

Files changed (1) hide show

tmt/experiments/03_full_tmt.ipynb +58 -0

tmt/experiments/03_full_tmt.ipynb ADDED Viewed

	@@ -0,0 +1,58 @@

+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": ["# Experiment 03 — Full TMT Training Run\n", "All three innovations active: Mesh Attention + Temporal Decay + Adaptive Depth Routing."]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import torch\n",
+    "from tmt.model.config import TMTConfig\n",
+    "from tmt.training.trainer import TMTTrainer, TrainConfig\n",
+    "from tmt.data.dataset import load_text_dataset\n",
+    "\n",
+    "cfg = TMTConfig(\n",
+    "    vocab_size=50258, d_model=512, n_heads=8, n_layers=12,\n",
+    "    graph_k=8, decay_rate=0.1, exit_threshold=0.85,\n",
+    "    dual_stream=True, memory_anchors=16, ffn_stream_dim=256,\n",
+    ")\n",
+    "print(cfg)\n",
+    "print(f'Device: {\"cuda\" if torch.cuda.is_available() else \"cpu\"}')"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "loaders = load_text_dataset('wikitext-2', seq_len=256, batch_size=16)\n",
+    "\n",
+    "train_cfg = TrainConfig(\n",
+    "    total_steps=10_000,\n",
+    "    warmup_steps=500,\n",
+    "    eval_every=500,\n",
+    "    save_every=1000,\n",
+    "    use_wandb=False,   # set True and login to wandb first\n",
+    ")\n",
+    "\n",
+    "trainer = TMTTrainer(cfg, train_cfg, loaders['train'], loaders.get('validation'))\n",
+    "trainer.train()\n",
+    "\n",
+    "tmt_ppl = trainer.evaluate()\n",
+    "print(f'Full TMT perplexity: {tmt_ppl:.2f}')"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {"display_name": "Python 3", "language": "python", "name": "python3"},
+  "language_info": {"name": "python", "version": "3.10.0"}
+ },
+ "nbformat": 4,
+ "nbformat_minor": 4
+}