AI & ML interests

MoE architectures, Chimera models, Assembly of Experts

Recent Activity

mschuettlerTNG  published a model about 12 hours ago
tngtech/Kokoro-82M-int8-ov
mschuettlerTNG  updated a model about 14 hours ago
tngtech/Kokoro-82M-int8-ov
View all activity

Articles

BM-TNG 
published an article about 1 year ago
view article
Article

How Long Prompts Block Other Requests - Optimizing LLM Performance

tngtech
13
SR-TNG 
published an article about 1 year ago
view article
Article

Finetuning olmOCR to be a faithful OCR-Engine

tngtech
19
BM-TNG 
published an article about 1 year ago
view article
Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

tngtech
81
BM-TNG 
published an article about 1 year ago
view article
Article

Efficient Request Queueing – Optimizing LLM Performance

tngtech
26