---
title: Interfaze
thumbnail: >-
https://cdn-uploads.huggingface.co/production/uploads/62fb53b572a7ab50b4b06fca/II4KdeJkepE_NOg9HNnq_.jpeg
short_description: The AI model built for deterministic developer tasks
---
![Interfaze banner](https://interfaze.ai/banner.png)
# The AI model built for deterministic developer tasks
Interfaze is an AI model built on a new architecture that merges specialized DNN/CNN models with LLMs for developer tasks that require deterministic output and high consistency, such as OCR, scraping, classification, and web search.
[Try now](https://interfaze.ai/dashboard) or [Read paper](https://www.arxiv.org/abs/2602.04101)
- OCR, web scraping, web search, classification and more
- OpenAI chat completion API compatible
- Highly consistent, accurate structured output
- Built-in code execution and sandboxing
- Custom web engine for scraping and web research capabilities
- Auto reasoning when needed
- Controllable guardrails
- Fully managed and scalable
- Globally distributed fallback system with high uptime
### Model Comparison
| Benchmark | interfaze-beta | GPT-4.1 | Claude Sonnet 4 | Gemini 2.5 Flash | Claude Sonnet 4 (Thinking) | Claude Opus 4 (Thinking) | GPT-5-Minimal | Gemini-2.5-Pro |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| MMLU-Pro | 83.6 | 80.6 | 83.7 | 80.9 | 83.7 | 86 | 80.6 | 86.2 |
| MMLU | 91.38 | 90.2 | - | - | 88.8 | 89 | - | 89.2 |
| MMMU | 77.33 | 74.8 | - | 79.7 | 74.4 | 76.5 | - | 82 |
| AIME-2025 | 90 | 34.7 | 38 | 60.3 | 74.3 | 73.3 | 31.7 | 87.7 |
| GPQA-Diamond | 81.31 | 66.3 | 68.3 | 68.3 | 77.7 | 79.6 | 67.3 | 84.4 |
| LiveCodeBench | 57.77 | 45.7 | 44.9 | 49.5 | 65.5 | 63.6 | 55.8 | 75.9 |
| ChartQA | 90.88 | - | - | - | - | - | - | - |
| AI2D | 91.51 | 85.9 | - | - | - | - | - | 89.5 |
| Common-Voice-v16 | 90.8 | - | - | - | - | - | - | - |
\*Results for non-Interfaze models are sourced from model providers, leaderboards, and evaluation services such as Artificial Analysis.
### Works like any other LLM
OpenAI API compatible, works with every AI SDK out of the box
```typescript
import OpenAI from "openai";

const interfaze = new OpenAI({
  baseURL: "https://api.interfaze.ai/v1",
  apiKey: "<your-api-key>",
});

const completion = await interfaze.chat.completions.create({
  model: "interfaze-beta",
  messages: [
    {
      role: "user",
      content: "Get the company description of JigsawStack from their linkedin page",
    },
  ],
});

console.log(completion.choices[0].message.content);
```
### OCR & Document Extraction
[vision docs ->](https://interfaze.ai/docs/vision)
```typescript
import { z } from "zod";

const prompt = "Get the person information from the following ID.";

const schema = z.object({
  first_name: z.string(),
  last_name: z.string(),
  dob: z.string(),
  expiry: z.string(),
});
```
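A sketch of how a zod schema like the one above is typically delivered over an OpenAI-compatible API: serialized to JSON Schema inside the standard `response_format` envelope. The schema name `person_id` and the hand-written JSON Schema are illustrative; only the field names come from the example above, and the exact request shape Interfaze expects is documented in the vision docs.

```typescript
// Illustrative only: the extraction schema above, hand-written as JSON Schema
// and wrapped in the standard OpenAI `response_format` envelope. The name
// "person_id" is arbitrary; field names match the zod example.
const responseFormat = {
  type: "json_schema" as const,
  json_schema: {
    name: "person_id",
    strict: true,
    schema: {
      type: "object",
      properties: {
        first_name: { type: "string" },
        last_name: { type: "string" },
        dob: { type: "string" },
        expiry: { type: "string" },
      },
      required: ["first_name", "last_name", "dob", "expiry"],
      additionalProperties: false,
    },
  },
};
```

Assuming Interfaze mirrors OpenAI's structured-output request shape (as its compatibility claim suggests), this object would be passed as `response_format` in `interfaze.chat.completions.create({ ... })` alongside the image in the message content.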
![Extraction](https://interfaze.ai/examples/extraction_example.png)
### Smart Web Scraping
[web docs ->](https://interfaze.ai/docs/web)
```typescript
import { z } from "zod";

const prompt =
  "Extract the information from Yoeven D Khemlani's linkedin page based on the schema.";

const schema = z.object({
  first_name: z.string(),
  last_name: z.string(),
  about: z.string(),
  current_company: z.string(),
  current_position: z.string(),
});
```
![scraping](https://interfaze.ai/examples/scraper_example.png)
### Translation
[translation docs ->](https://interfaze.ai/docs/translation)
```typescript
import { z } from "zod";

const prompt =
  "The UK drinks about 100–160 million cups of tea every day, and 98% of tea drinkers add milk to their tea.";

const schema = z.object({
  zh: z.string(),
  hi: z.string(),
  es: z.string(),
  fr: z.string(),
  de: z.string(),
  it: z.string(),
  ja: z.string(),
  ko: z.string(),
});
```
```
zh: 英国每天饮用约100–160百万杯茶,有98%的茶饮者在茶中加入牛奶。
hi: यूके हर दिन लगभग 100–160 मिलियन कप चाय पीता है, और 98% चाय पीने वाले अपनी चाय में दूध मिलाते हैं।
es: El Reino Unido bebe alrededor de 100–160 millones de tazas de té cada día, y el 98 % de los consumidores de té añade leche a su té.
fr: Le Royaume-Uni boit environ 100–160 millions de tasses de thé chaque jour, et 98 % des buveurs de thé ajoutent du lait à leur thé.
de: Das Vereinigte Königreich trinkt etwa 100–160 Millionen Tassen Tee pro Tag, und 98 % der Teetrinker fügen ihrem Tee Milch hinzu.
it: Il Regno Unito beve circa 100–160 milioni di tazze di tè ogni giorno e il 98% degli amanti del tè aggiunge latte al proprio tè.
ja: イギリスでは毎日約100~160百万杯の紅茶が飲まれており、紅茶を飲む人の98%が紅茶に牛乳を加えます。
ko: 영국에서는 매일 약 1억 ~ 1억 6천만 잔의 차를 마시며, 차를 마시는 사람의 98%가 차에 우유를 넣습니다.
```
### Speech-to-text (STT) and diarization
[stt docs ->](https://interfaze.ai/docs/speech-to-text)
```typescript
import { z } from "zod";

const prompt = "Transcribe https://jigsawstack.com/preview/stt-example.wav";

const schema = z.object({
  text: z.string(),
  speakers: z.array(
    z.object({
      id: z.string(),
      start: z.number(),
      end: z.number(),
    })
  ),
});
```
```json
{
  "text": " The little tales they tell are false The door was barred, locked and bolted as well Ripe pears are fit for a queen's table A big wet stain was on the round carpet The kite dipped and swayed but stayed aloft The pleasant hours fly by much too soon The room was crowded with a mild wob The room was crowded with a wild mob This strong arm shall shield your honour She blushed when he gave her a white orchid The beetle droned in the hot June sun",
  "speakers": [
    { "start": 0, "end": 4.78, "id": "SPEAKER_00" },
    { "start": 4.78, "end": 9.48, "id": "SPEAKER_00" },
    { "start": 9.48, "end": 13.06, "id": "SPEAKER_00" },
    { "start": 13.06, "end": 17.24, "id": "SPEAKER_00" },
    { "start": 17.24, "end": 21.78, "id": "SPEAKER_00" },
    { "start": 21.78, "end": 26.3, "id": "SPEAKER_00" },
    { "start": 26.3, "end": 30.76, "id": "SPEAKER_00" },
    { "start": 30.76, "end": 35.08, "id": "SPEAKER_00" },
    { "start": 35.08, "end": 39.24, "id": "SPEAKER_00" },
    { "start": 39.24, "end": 43.94, "id": "SPEAKER_00" },
    { "start": 43.94, "end": 48.5, "id": "SPEAKER_00" }
  ]
}
```
### Configurable guardrails and NSFW checks
[guardrails docs ->](https://interfaze.ai/docs/guard-rails)
Fully configurable guardrails for text and images
```
S1: Violent Crimes
S2: Non-Violent Crimes
S3: Sex-Related Crimes
S4: Child Sexual Exploitation
S5: Defamation
S6: Specialized Advice
S7: Privacy
S8: Intellectual Property
S9: Indiscriminate Weapons
S10: Hate
S11: Suicide & Self-Harm
S12: Sexual Content
S12_IMAGE: Sexual Content (Image)
S13: Elections
S14: Code Interpreter Abuse
```
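The category codes above can be mirrored client-side, for example to map flagged codes from a guardrail response back to human-readable labels. The map below is just the list above restated as data; the `labelFor` helper is a hypothetical convenience, not part of the Interfaze API (the actual request/response shape is in the guardrails docs).

```typescript
// Human-readable labels for the guardrail category codes listed above.
const GUARDRAIL_CATEGORIES: Record<string, string> = {
  S1: "Violent Crimes",
  S2: "Non-Violent Crimes",
  S3: "Sex-Related Crimes",
  S4: "Child Sexual Exploitation",
  S5: "Defamation",
  S6: "Specialized Advice",
  S7: "Privacy",
  S8: "Intellectual Property",
  S9: "Indiscriminate Weapons",
  S10: "Hate",
  S11: "Suicide & Self-Harm",
  S12: "Sexual Content",
  S12_IMAGE: "Sexual Content (Image)",
  S13: "Elections",
  S14: "Code Interpreter Abuse",
};

// Hypothetical helper: turn flagged category codes into readable labels,
// passing through any code it does not recognize.
const labelFor = (codes: string[]): string[] =>
  codes.map((code) => GUARDRAIL_CATEGORIES[code] ?? code);
```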
### Architecture
[read paper ->](https://www.arxiv.org/abs/2602.04101)
The architecture combines a suite of small, specialized models with custom tools and infrastructure, automatically routing each request to the model best suited to the task while prioritizing accuracy and speed.
![How it works](https://interfaze.ai/examples/howitworks.png)
### Specs
- Context window: 1m tokens
- Max output tokens: 32k tokens
- Input modalities: Text, Images, Audio, File, Video
- Reasoning: Available
### Research references
- [Interfaze: The Future of AI is built on Task-Specific Small Models](https://www.arxiv.org/abs/2602.04101)
- [Agentic Context Engineering](https://www.arxiv.org/pdf/2510.04618)
- [Small Language Models are the Future of Agentic AI](https://arxiv.org/pdf/2506.02153)
- [The Sparsely-Gated Mixture-of-Experts Layer](https://arxiv.org/pdf/1701.06538)
- [DeepSeekMoE](https://arxiv.org/pdf/2401.06066)
- [Confronting LLMs with Traditional ML](https://arxiv.org/pdf/2310.14607)
### Who are we?
We are a small team of ML, software, and infrastructure engineers driven by the conviction that a small model can do far more when specialized, letting us bring AI into every developer workflow.