inference-net
/

Schematron-3B

Text Generation

text-generation-inference

Model card Files Files and versions

opensporks commited on Sep 2

Commit

6f415b6

·

verified ·

1 Parent(s): 8f1c9ca

Update Readme

Files changed (1) hide show

README.md +18 -4

README.md CHANGED Viewed

@@ -5,20 +5,34 @@ base_model: meta-llama/Llama-3.2-3B-Instruct
 ---
 ## Model Overview
-Schematron is a long‑context extraction model for converting noisy HTML into clean, typed JSON that conforms to a user‑provided schema. It is purpose‑built for web scraping, data ingestion, and turning arbitrary pages into structured records.
 ## Highlights
 - **Schema-first extraction**: Strict, schema‑conformant JSON outputs
 - **Long context**: Robust to lengthy, noisy HTML (up to 128K tokens)
 - **Reliable structure**: Works well with JSON mode and typed parsers
-- **Variants**: Schematron‑8B (quality) and Schematron‑3B (cost)
 ## Model Details
 - **Family**: Schematron (3B and 8B)
 - **Base**: Instruction‑tuned LLM, fine‑tuned for schema‑guided extraction
 - **Context window**: Up to 128K tokens
-- **Input**: Raw or lightly cleaned HTML
-- **Output**: Strictly valid JSON matching your schema
 ## Minimal Quickstart
 Use these local snippets to prepare HTML and compose a schema‑guided prompt. The model returns strictly valid JSON; validate it against your schema downstream.

 ---
 ## Model Overview
+Welcome to the Schematron series, [Inference.net's](https://inference.net/) long‑context extraction models specialized in converting noisy HTML into clean, typed JSON that conforms to your custom schema. The Schematron series was purpose‑trained for web scraping, data ingestion, and transforming arbitrary pages into structured records.
+We're releasing these models in two different sizes:
+- **Schematron‑8B** — marginal quality lift on harder/longer pages
+- **Schematron‑3B** — recommended default; near‑parity quality at ~50% cost of Schematron-8B
+> [!NOTE]
+> This model card is dedicated to the smaller `Schematron-3B` model. Check out [`Schematron-8B`](https://huggingface.co/inference-net/Schematron-8B) for the larger model.
+## I/O at a glance
+- **Input**: Cleaned HTML + JSON Schema (or typed model like Pydantic/Zod)
+- **Output**: Strictly valid JSON conforming to the provided schema (no narration)
 ## Highlights
 - **Schema-first extraction**: Strict, schema‑conformant JSON outputs
+- **Simple I/O**: HTML + schema → JSON
 - **Long context**: Robust to lengthy, noisy HTML (up to 128K tokens)
 - **Reliable structure**: Works well with JSON mode and typed parsers
+- **Variants**: 3B (default, most cost‑efficient) · 8B (marginal quality lift at ~2× cost)
 ## Model Details
 - **Family**: Schematron (3B and 8B)
 - **Base**: Instruction‑tuned LLM, fine‑tuned for schema‑guided extraction
 - **Context window**: Up to 128K tokens
+- **Input**: Cleaned or raw HTML and a JSON Schema (or typed model)
+- **Output**: Strict JSON that conforms to the provided schema
 ## Minimal Quickstart
 Use these local snippets to prepare HTML and compose a schema‑guided prompt. The model returns strictly valid JSON; validate it against your schema downstream.