yoeven commited on
Commit
fbb364d
·
verified ·
1 Parent(s): 703ae5d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +232 -1
README.md CHANGED
@@ -7,4 +7,235 @@ sdk: static
7
  pinned: false
8
  ---
9
 
10
- Edit this `README.md` markdown file to author your organization card.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  pinned: false
8
  ---
9
 
10
+ # The AI model built for deterministic developer tasks - Interfaze
11
+
12
+
13
+ Interfaze is an AI model built on a new architecture that merges specialized DNN/CNN models with LLMs for developer tasks that require deterministic output and high consistency like OCR, scraping, classification, web search and more.
14
+
15
+ ```
16
+
17
+ +===-----------------=++**++=---::::::::::::::::::::::::::::::--=+++++=-
18
+ %##*=--------------:---==+***++=---::::::::::::::::::::::::::::::-=+****=:.:
19
+ %%@%#===+++++++++++++++***#####*********++++++++++++++++++++++++++*######*=:::
20
+ %@@@#+=+*#****##############################****************###%%%%%%%%@@#=::-
21
+ %%@@#+-::=+******############%%%%%%%%%%%%%%######***********#%%%%%%%%%@@@%=::-
22
+ %%@@*-:...-+#%%@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@%+--=
23
+ %%@%+-....:%@@@@@@@@@@@@@%%%%%%%%#################******##%@@@@@@@@@@@@@@%**++
24
+ %%@%+:....-%%%@@@@@@%%%%%%%%%%%################***********#%%@@@@@@@@@@@@@#*++
25
+ %%@%+:....=%#%@@@@@@%%%%%%%%%################**************#%%@%@@@@@@@@@@#+==
26
+ %%@%+:....+#%@@@@@@@%%%%%%%%%################****************#%%@@@@@@@%@@*+--
27
+ %%@%+:...:*#%@@@@@@@%%%%%%%%%################************#****##%@@@@@@@@@*+-=
28
+ %%@%+:...:*#%@@@@@@@%%%%%%%%%################******#######*===+##@@@@@@%@@*+-=
29
+ %%@%=:...:##%@@@@@@%%%%%%%%%%######################%%%%%#*+:.:-*#@@@@@@%@@*+-=
30
+ %%@%=:...:##@@@@@@@%%%%%%%%%%#################%%%%%%%%%#*+=. .:+#@@@@@@%@@*+-=
31
+ %%%%=:...:%%@@@@@@@%%%%%%%%%%############%%%%%%%%%%%%##*+=-. .=#%@@@@@%@@#+==
32
+ %%@%=:...-%%@@@@@@@%%%%%%%%%%########%%%%%%%%%%%%%###**++=-. .-#%@@@@@%@@#+==
33
+ %%%%=:...-%%#%%%%######%%%%%%%%%%%%%%%%@@@@%%%%####*****++=:..:=#@@@@@@%@@#+==
34
+ %%%%=:...-%%#%###%%%###%%%%%%%%%%%%@@@@%%%%%%####********++=--=+%@@@@@@%%@#+==
35
+ %%%%=:...-%%%%%%%%%%%%%%#%%%%%%@@@@@%%%%%########**********+==+#%@@@@@@%%@#*==
36
+ %%%%=:...-%%#%%###########*##%%%@%%%%%############**********++*#@@@@@@@%%@#*==
37
+ %%%%=:...-%%%#%#%%%%%%%%%%#%%%%%%%%################***********#%@@@@@@@%%%#*==
38
+ %%%%=:...-@%@@@@@@@@@@@@@@%%%%%%%######################******##%@@@@@@%%%%#*==
39
+ %%%%=:...-@%@@@@@@@@@@@@@%%%%%%%%##############################@@@@@@@%%%%#*==
40
+ %%%#=::::=@%@@@@@@@@@@@%%%%%%%%%%%############################%@@@@@@@%%%%#*==
41
+ %%%%+=--=+%@@@@@@@@@@@%%%%%%%%%%%%%%##########################%@@@@@@@%%%%#*==
42
+ %%%%*+=--=%@%@@@@@@@@@%%%%%%%%%%%%%%%%#######################%@@@@@@@@%%%%#*==
43
+ %%%%*+-:.:*%%%@@@@@@@@@%%%%%%%%%%%%%%%%%%###################%@@@@@@@@%%%%%#*+=
44
+ ##%#+-....+%%%@@@@@@@@@@%%%%%%%%%%%%%%%%%%%%%%%%%#########%%%@@@@@@@@%%%%%#*+=
45
+ %#%#=:....-%%%@@@@@@@@@@@@@@@@%%%%%%%%%%%%%%%%%%%%%%%%%%%%%@@@@@@@@@@%%%%%#*+=
46
+ %#%#=:.....-=+*#######%%%%%%@@@@@@@@@@@@@@@@@@@%%%%%%#####%%%%%%%@@@@@%%%%#*+=
47
+ ##%#=:.....:::--==++++===---::::::::::::::::::::::::::::::::-=++***##%@@@%#*+=
48
+ ####=:::-----------==+****++=-------------------:::::::::::::----=+**####%#*+=
49
+ ####=::::::::::::::--==+++=--::::::::::::::::::::::::...........::-+********+=
50
+ ####=::::::::::::::--=++++=--:::::::::::::::::::................::-+********++
51
+ ###*=::::::::::::::-==++++=--::::::::::::::::..:--:::------------=+*##*####*++
52
+ ###*=:::::::::::::--==++++=--::::::::::::-++:..+%%*:-+****+++++=+*##%%*%%%@#++
53
+ ##**=:::::::::::::--==++++=--::::::::::-*##*-::-==-:..:........:::-+***####*++
54
+ #***+===============++++++==--------------:::::::::----------==+**##########
55
+ #**************************++++++++++++++++++++++++++****##%%%%%%%%%%%%%
56
+ ###########################*******##########%%%%%%%%%%%%%%%###
57
+ #*++**#%@@@@@@@@@@@@@@@@@@%%%%%%%%%%%%%%%%@@@@@@@@@@@%#######
58
+ *+===+*#%%%%%####************+++++++++++++*##%%%%%%@@%######
59
+ ***#####%###*****++++++++++++++++++++++**#%%%%%%%%@@@@@@
60
+ ```
61
+
62
+
63
+ * OCR, web scraping, web search, classification and more
64
+ * OpenAI chat completion API compatible
65
+ * High accuracy structured output consistency
66
+ * Built-in code execution and sandboxing
67
+ * Custom web engine for scraping and web research capabilities
68
+ * Auto reasoning when needed
69
+ * Controllable guardrails
70
+ * Fully managed and scalable
71
+ * Globally distributed fallback system with high uptime
72
+
73
+ ### Beta launch video
74
+
75
+ [![Beta launch video](https://interfaze.ai/thumbnail.png)
76
+
77
+
78
+
79
+ ](https://x.com/yoeven/status/1975592154807624059)
80
+
81
+ ### Model Comparison
82
+
83
+
84
+
85
+ * Benchmark: MMLU-Pro
86
+ * interfaze-beta: 83.6
87
+ * GPT-4.1: 80.6
88
+ * Claude Sonnet 4: 83.7
89
+ * Gemini 2.5 Flash: 80.9
90
+ * Claude Sonnet 4 (Thinking): 83.7
91
+ * Claude Opus 4 (Thinking): 86
92
+ * GPT-5-Minimal: 80.6
93
+ * Gemini-2.5-Pro: 86.2
94
+ * Benchmark: MMLU
95
+ * interfaze-beta: 91.38
96
+ * GPT-4.1: 90.2
97
+ * Claude Sonnet 4: -
98
+ * Gemini 2.5 Flash: -
99
+ * Claude Sonnet 4 (Thinking): 88.8
100
+ * Claude Opus 4 (Thinking): 89
101
+ * GPT-5-Minimal: -
102
+ * Gemini-2.5-Pro: 89.2
103
+ * Benchmark: MMMU
104
+ * interfaze-beta: 77.33
105
+ * GPT-4.1: 74.8
106
+ * Claude Sonnet 4: -
107
+ * Gemini 2.5 Flash: 79.7
108
+ * Claude Sonnet 4 (Thinking): 74.4
109
+ * Claude Opus 4 (Thinking): 76.5
110
+ * GPT-5-Minimal: -
111
+ * Gemini-2.5-Pro: 82
112
+ * Benchmark: AIME-2025
113
+ * interfaze-beta: 90
114
+ * GPT-4.1: 34.7
115
+ * Claude Sonnet 4: 38
116
+ * Gemini 2.5 Flash: 60.3
117
+ * Claude Sonnet 4 (Thinking): 74.3
118
+ * Claude Opus 4 (Thinking): 73.3
119
+ * GPT-5-Minimal: 31.7
120
+ * Gemini-2.5-Pro: 87.7
121
+ * Benchmark: GPQA-Diamond
122
+ * interfaze-beta: 81.31
123
+ * GPT-4.1: 66.3
124
+ * Claude Sonnet 4: 68.3
125
+ * Gemini 2.5 Flash: 68.3
126
+ * Claude Sonnet 4 (Thinking): 77.7
127
+ * Claude Opus 4 (Thinking): 79.6
128
+ * GPT-5-Minimal: 67.3
129
+ * Gemini-2.5-Pro: 84.4
130
+ * Benchmark: LiveCodeBench
131
+ * interfaze-beta: 57.77
132
+ * GPT-4.1: 45.7
133
+ * Claude Sonnet 4: 44.9
134
+ * Gemini 2.5 Flash: 49.5
135
+ * Claude Sonnet 4 (Thinking): 65.5
136
+ * Claude Opus 4 (Thinking): 63.6
137
+ * GPT-5-Minimal: 55.8
138
+ * Gemini-2.5-Pro: 75.9
139
+ * Benchmark: ChartQA
140
+ * interfaze-beta: 90.88
141
+ * GPT-4.1: -
142
+ * Claude Sonnet 4: -
143
+ * Gemini 2.5 Flash: -
144
+ * Claude Sonnet 4 (Thinking): -
145
+ * Claude Opus 4 (Thinking): -
146
+ * GPT-5-Minimal: -
147
+ * Gemini-2.5-Pro: -
148
+ * Benchmark: AI2D
149
+ * interfaze-beta: 91.51
150
+ * GPT-4.1: 85.9
151
+ * Claude Sonnet 4: -
152
+ * Gemini 2.5 Flash: -
153
+ * Claude Sonnet 4 (Thinking): -
154
+ * Claude Opus 4 (Thinking): -
155
+ * GPT-5-Minimal: -
156
+ * Gemini-2.5-Pro: 89.5
157
+ * Benchmark: Common-Voice-v16
158
+ * interfaze-beta: 90.8
159
+ * GPT-4.1: -
160
+ * Claude Sonnet 4: -
161
+ * Gemini 2.5 Flash: -
162
+ * Claude Sonnet 4 (Thinking): -
163
+ * Claude Opus 4 (Thinking): -
164
+ * GPT-5-Minimal: -
165
+ * Gemini-2.5-Pro: -
166
+
167
+
168
+ \*Results for Non-Interfaze models are sourced from model providers, leaderboards, and evaluation providers such as Artificial Analysis.
169
+
170
+ ### Works like any other LLM
171
+
172
+ OpenAI API compatible, works with every AI SDK out of the box
173
+
174
+ OpenAI SDK
175
+
176
+ Vercel AI SDK
177
+
178
+ Langchain SDK
179
+
180
+ ![Extraction](https://interfaze.ai/examples/extraction_example.png)
181
+
182
+ ![scraping](https://interfaze.ai/examples/scraper_example.png)
183
+
184
+ Fully configurable guardrails for text and images
185
+
186
+ ![Extraction](https://interfaze.ai/examples/nsfw_example.jpg)
187
+
188
+ This architecture combines a suite of small specialized models supported with custom tools and infrastructure while automatically routing to the best model for the task that prioritizes accuracy and speed.
189
+
190
+ [![How it works](https://interfaze.ai/examples/howitworks.png)
191
+
192
+
193
+
194
+ ](/examples/howitworks.png)
195
+
196
+ ### Specs
197
+
198
+ Max output tokens
199
+
200
+ 32k tokens
201
+
202
+ Input modalities
203
+
204
+ Text, Images, Audio, File, Video
205
+
206
+ Output tokens
207
+
208
+ $3.50 / MTok
209
+
210
+ Observability & Logging
211
+
212
+ Coming soon
213
+
214
+ ### FAQ
215
+
216
+ ### Todo (Prioritized)
217
+
218
+ * Reduce transactional token count
219
+ * Pre-built prompts/schemas optimized for specific tasks
220
+ * Embedding model
221
+ * Built-in observability and logging on the dashboard
222
+ * Complete metrics and analytics
223
+ * v1.1 Interfaze
224
+ * Reduce latency and improve throughput
225
+ * Custom SDKs for interfaze with AI SDK, Langchain, etc.
226
+ * Leaderboard for projects
227
+
228
+ If you have feature requests or recommendations, please reach out!
229
+
230
+ ### Research references
231
+
232
+ * [Interfaze: The Future of AI is built on Task-Specific Small Models](https://www.arxiv.org/abs/2602.04101)
233
+ * [Agentic Context Engineering](https://www.arxiv.org/pdf/2510.04618)
234
+ * [Small Language Models are the Future of Agentic AI](https://arxiv.org/pdf/2506.02153)
235
+ * [The Sparsely-Gated Mixture-of-Experts Layer](https://arxiv.org/pdf/1701.06538)
236
+ * [DeepSeekMoE](https://arxiv.org/pdf/2401.06066)
237
+ * [Confronting LLMs with Traditional ML](https://arxiv.org/pdf/2310.14607)
238
+
239
+ ### Who are we?
240
+
241
+ We are a small team of ML, Software and Infrastructure engineers engrossed in the fact that a small model can do a lot more when specialized. Allowing us to make AI available in every dev workflow.