ArunkumarVR commited on
Commit
f00e91b
Β·
verified Β·
1 Parent(s): 650c831

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +55 -9
README.md CHANGED
@@ -7,9 +7,57 @@ sdk: static
7
  pinned: true
8
  short_description: Reasoning-first, agentic small language models (SLMs).
9
  ---
 
10
  # DeepBrainz AI & Labs
11
 
12
- **DeepBrainz AI & Labs builds reasoning-first, agentic Small Language Models (SLMs), advancing RL post-training and test-time scaling to deliver efficient, reliable intelligence, while actively exploring diffusion-based and iterative inference regimes.**
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
 
14
  ---
15
 
@@ -36,18 +84,16 @@ It is a family of reasoning-first SLMs designed for:
36
  - Research and agentic experimentation
37
 
38
  We publish multiple variants to support **transparency and reproducibility**.
39
- Only selected releases are considered *supported*.
40
 
41
  ---
42
 
43
- ## Philosophy
44
-
45
- We explicitly optimize against:
46
- - Shallow pattern matching
47
- - Benchmark gaming
48
- - Prompt memorization
49
 
50
- We treat intelligence as a **behavior to be trained**, not a side-effect of model size.
 
 
 
51
 
52
  ---
53
 
 
7
  pinned: true
8
  short_description: Reasoning-first, agentic small language models (SLMs).
9
  ---
10
+
11
  # DeepBrainz AI & Labs
12
 
13
+ **Reasoning-first Small Language Models for agentic systems in production**
14
+
15
+ DeepBrainz AI & Labs builds **reasoning-first, agentic Small Language Models (SLMs)** optimized for **reliability, controllability, and efficiency** in real-world AI systems.
16
+
17
+ We focus on **behavioral intelligence** β€” training models to reason, plan, and act β€” rather than scaling parameters or gaming benchmarks.
18
+
19
+ ---
20
+
21
+ ## πŸ”‘ Start Here (Recommended Models)
22
+
23
+ If you’re new to DeepBrainz-R1, start with one of these:
24
+
25
+ - **DeepBrainz-R1-4B** β€” flagship model
26
+ Best overall reasoning quality and stability for production agentic systems.
27
+
28
+ - **DeepBrainz-R1-2B** β€” balanced model
29
+ Strong reasoning with lower latency and cost.
30
+
31
+ - **DeepBrainz-R1-0.6B-v2** β€” small & efficient
32
+ Designed for local inference, edge agents, and cost-sensitive workflows.
33
+
34
+ > All other variants are **experimental or research-only**.
35
+
36
+ ---
37
+
38
+ ## 🧠 What DeepBrainz-R1 Is Built For
39
+
40
+ - Multi-step reasoning
41
+ - Tool-calling and agent loops
42
+ - Long-context analysis
43
+ - Deterministic, inspectable behavior
44
+
45
+ ### 🚫 What It Is *Not* Optimized For
46
+
47
+ - Open-ended chat or roleplay
48
+ - Creative writing
49
+ - Prompt-memorization benchmarks
50
+
51
+ ---
52
+
53
+ ## πŸ§ͺ Research Philosophy
54
+
55
+ We explicitly optimize **against**:
56
+ - Shallow pattern matching
57
+ - Benchmark gaming
58
+ - Prompt memorization
59
+
60
+ We treat intelligence as a **behavior to be trained**, not a side-effect of model size.
61
 
62
  ---
63
 
 
84
  - Research and agentic experimentation
85
 
86
  We publish multiple variants to support **transparency and reproducibility**.
87
+ Only selected releases are considered **supported**.
88
 
89
  ---
90
 
91
+ ## 🧱 Model Support Status
 
 
 
 
 
92
 
93
+ - βœ… **Supported / Production** β€” curated, validated releases
94
+ - πŸ§ͺ **Experimental** β€” exploratory variants
95
+ - 🧱 **Research Checkpoints** β€” raw checkpoints for reproducibility
96
+ - πŸ‘₯ **Community Maintained** β€” third-party quantizations (GGUF, low-bit)
97
 
98
  ---
99