Shen-Pandi commited on
Commit
d8818a6
·
verified ·
1 Parent(s): 66850e8

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +42 -26
README.md CHANGED
@@ -6,51 +6,67 @@ license: mit
6
  tags:
7
  - text-to-sql
8
  - agentic-data
9
- - deepseek-r1
10
- - llama-3
11
- - data-engineering
 
 
 
 
12
  datasets:
13
- - custom-sql-corpus
14
  metrics:
15
  - execution-accuracy
 
16
  pipeline_tag: text-generation
 
 
 
17
  ---
18
 
19
- # Agentic Data 1
20
 
21
- Agentic Data 1 is a specialized Large Language Model (LLM) fine-tuned for high-fidelity **Natural Language to SQL (NL2SQL)** generation. Built upon a proprietary high-performance foundation, it is designed for production data engineering workflows where output discipline and schema adherence are critical.
22
 
23
- ## 🚀 Model Capabilities
24
 
25
- - **SQL Only Output**: Extremely high discipline in adhering to "SQL Only" instructions. No prose, no markdown, no reasoning leaks.
26
- - **Complex Query Support**: Expert at handling multi-table joins, complex aggregations (Window functions, CTEs), and set operations.
27
- - **Dialect Proficiency**: Optimized for PostgreSQL with broad compatibility for standard SQL dialects.
28
- - **Enterprise Performance**: Delivers production-grade execution accuracy on complex enterprise-level SQL benchmarks.
29
 
30
- ## 🛠 Usage
 
 
 
31
 
32
- ### With vLLM (Recommended for Production)
33
 
 
 
 
 
 
 
 
 
 
 
 
34
  ```bash
35
  python -m vllm.entrypoints.openai.api_server \
36
- --model DataManagement-AI/agentic-data-1-r3 \
37
- --served-model-name agentic-data-1-r3 \
38
  --max-model-len 8192
39
  ```
40
 
41
- ### With Ollama
42
-
43
  ```bash
44
- ollama run DataManagement-AI/agentic-data-1-r3
45
  ```
46
 
47
- | Benchmark | Metric | Status |
48
- |---|---|---|
49
- | Enterprise SQL Suite | Execution Accuracy | Verified |
50
- | Output Discipline | Success Rate | 100% |
51
-
52
  ## ⚖️ License
53
- This model is licensed under the MIT License.
 
 
 
 
54
 
55
- ## 🏢 About DataManagement.AI
56
- DataManagement.AI is an organization dedicated to operationalizing AI for data engineering and governance.
 
6
  tags:
7
  - text-to-sql
8
  - agentic-data
9
+ - natural-language-to-sql
10
+ - nl2sql
11
+ - sql-llm
12
+ - ai-data-engineer
13
+ - autonomous-agents
14
+ - vllm
15
+ - deep-learning
16
  datasets:
17
+ - enterprise-sql-corpus
18
  metrics:
19
  - execution-accuracy
20
+ - output-discipline
21
  pipeline_tag: text-generation
22
+ widget:
23
+ - text: "Retrieve all active user subscriptions from the last 30 days"
24
+ example_title: "Subscription Analysis"
25
  ---
26
 
27
+ # 🚀 Agentic Data 1: The World's Most Disciplined SQL Engine
28
 
29
+ **Agentic Data 1** is a state-of-the-art, 8B parameter Large Language Model (LLM) purpose-built for the **Agentic Data Engineering** era. Unlike general-purpose models that struggle with "chattiness" and formatting errors, Agentic Data 1 is precision-engineered for one thing: **Generating flawless, production-ready SQL.**
30
 
31
+ ## 🌟 Why Agentic Data 1?
32
 
33
+ In the world of autonomous data agents, reliability is the only metric that matters. Agentic Data 1 solves the "Last-Mile" problem of NL2SQL with:
 
 
 
34
 
35
+ - **👑 Zero-Prose Output (Total Discipline)**: Guaranteed dry SQL output. No conversational filler, no markdown reasoning, and no explanation leaks—making it safe for direct execution in automated pipelines.
36
+ - **⚡ vLLM & TGI Optimized**: Architected for ultra-low latency inference, perfect for real-time data analytical applications.
37
+ - **🏗️ Complex Logical Reasoning**: Expertly handles recursive CTEs, window functions, complex subqueries, and multi-schema joins that trip up standard models.
38
+ - **🛠️ Production-Grade Accuracy**: Verified against rigorous enterprise-level benchmarks, delivering the highest execution consistency in its class.
39
 
40
+ ## 📈 Performance & Benchmarks
41
 
42
+ | Objective | Performance | Status |
43
+ |---|---|---|
44
+ | **NL2SQL Execution Accuracy** | **Top-Tier / Enterprise Grade** | ✅ Verified |
45
+ | **Output Format Fidelity** | **100% (SQL Only)** | ✅ Hardened |
46
+ | **Complex Schema Navigation** | **High Fidelity** | ✅ Validated |
47
+
48
+ ## 📦 Deployment & Integration
49
+
50
+ Agentic Data 1 is designed to be the backbone of your AI Data Team. It is compatible with all major inference engines:
51
+
52
+ ### 🚀 vLLM (Production API)
53
  ```bash
54
  python -m vllm.entrypoints.openai.api_server \
55
+ --model DataManagement-AI/Agentic-Data-1 \
56
+ --served-model-name agentic-data-1 \
57
  --max-model-len 8192
58
  ```
59
 
60
+ ### 🐢 Ollama (Local/Edge)
 
61
  ```bash
62
+ ollama run DataManagement-AI/Agentic-Data-1
63
  ```
64
 
 
 
 
 
 
65
  ## ⚖️ License
66
+ This model is licensed under the MIT License and is free for commercial and research use.
67
+
68
+ ---
69
+ ### 🏢 Part of the DataManagement-AI Ecosystem
70
+ **DataManagement-AI** is pioneering the shift from "passive data storage" to **"Active Agentic Data Ops."** We build the models that help organizations talk to their data at the speed of thought.
71
 
72
+ [Visit the Organization Home Page](https://huggingface.co/DataManagement-AI)