Spaces:

gothitech
/

README

Configuration error

App Files Files Community

README / README.md

developerJenis

Update README.md

a3f4218 verified 30 days ago

preview code

raw

history blame contribute delete

3.74 kB

	---
	title: Gothi Tech LLP
	emoji: 🦖
	colorFrom: blue
	colorTo: purple
	---

	<p align="center">
	<img src="https://gothi.in/gothi-text.png" width="120" alt="Gothi Tech Logo">
	</p>

	<h1 align="center">Gothi Tech LLP</h1>

	<p align="center">
	<strong>Building the Future of Document AI</strong>
	</p>

	<p align="center">
	<!-- <a href="https://gothi.in/"><img src="https://gothi.in/gothi-text.png" alt="Website"></a> -->
	<a href="https://huggingface.co/gothitech"><img src="https://img.shields.io/badge/HuggingFace-gothitech-yellow" alt="HuggingFace"></a>
	<a href="#"><img src="https://img.shields.io/badge/License-MIT-green" alt="License"></a>
	<a href="#"><img src="https://img.shields.io/badge/Made_in-India-orange" alt="Made in India"></a>
	</p>

	---

	## 👋 About Us

	Gothi Tech LLP is an AI-first company specializing in Document Intelligence and Vision-Language Models. We build production-grade AI systems that help enterprises automate document processing, extract structured data, and unlock insights from unstructured documents at scale.

	Our mission is to make document understanding fast, accurate, and accessible for businesses of all sizes.

	---

	## 🦖 Our Models

	\| Model \| Type \| Size \| Description \|
	\|-------\|------\|------\|-------------\|
	\| [GT-REX](https://huggingface.co/gothitech/GT-REX) \| Vision-Language (VLM) \| ~7B \| Production OCR model for enterprise document understanding, text extraction, and structured data output \|

	### GT-REX Variants

	\| Variant \| Resolution \| Speed \| Best For \|
	\|---------\|-----------\|-------\|----------\|
	\| Nano \| 640px \| ~1-2s \| High-volume batch processing (100+ docs/min) \|
	\| Pro \| 1024px \| ~2-5s \| Standard workflows — invoices, contracts, forms \|
	\| Ultra \| 1536px \| ~5-10s \| Fine print, dense tables, legal and medical docs \|

	---

	## 🔬 What We Do

	\| Area \| Capabilities \|
	\|------\|-------------\|
	\| Document AI \| Intelligent Document Processing (IDP), automated data extraction \|
	\| OCR \| High-accuracy text extraction from printed and scanned documents \|
	\| Handwriting \| Recognition and transcription of handwritten text \|
	\| Structured Output \| Extract data as JSON, Markdown tables, key-value pairs, custom schemas \|
	\| Multi-Language \| Document understanding across multiple languages \|
	\| Table Extraction \| Accurate extraction of complex and nested table structures \|

	---

	## 🏢 Industries We Serve

	\| Industry \| Use Cases \|
	\|----------\|-----------\|
	\| 🏦 Finance \| Invoice processing, receipt scanning, bank statements \|
	\| ⚖️ Legal \| Contract analysis, clause extraction, legal filings \|
	\| 🏥 Healthcare \| Medical records, prescriptions, lab reports \|
	\| 🏛️ Government \| Form processing, ID verification, tax documents \|
	\| 🛡️ Insurance \| Claims processing, policy documents \|
	\| 📦 Logistics \| Shipping labels, waybills, packing lists \|

	---

	## 🚀 Quick Start
	```python
	from vllm import LLM, SamplingParams
	from PIL import Image

	llm = LLM(
	model="gothitech/GT-REX",
	trust_remote_code=True,
	max_model_len=4096,
	gpu_memory_utilization=0.75,
	)

	image = Image.open("document.png")
	outputs = llm.generate(
	[{"prompt": "Extract all text from this document.", "multi_modal_data": {"image": image}}],
	sampling_params=SamplingParams(temperature=0.0, max_tokens=4096),
	)
	print(outputs[0].outputs[0].text)
	```

	---

	## 🔗 Links

	\| \| \|
	\|---\|---\|
	\| 🌐 Website \| [gothi.in](https://gothi.in/) \|
	\| 🤗 HuggingFace \| [gothitech](https://huggingface.co/gothitech) \|
	\| 🦖 GT-REX Model \| [gothitech/GT-REX](https://huggingface.co/gothitech/GT-REX) \|

	---

	<p align="center">
	<strong>🇮🇳 Proudly built in India by Gothi Tech LLP</strong>
	</p>