aab20abdullah commited on
Commit
2255f2c
·
verified ·
1 Parent(s): 42c4b9e

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +373 -0
README.md ADDED
@@ -0,0 +1,373 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # qwen_OSINT
2
+
3
+ <p align="center">
4
+ <strong>Open-Source Intelligence (OSINT) Fine-Tuned Model</strong><br>
5
+ Built on Qwen3-4B-Thinking-2507 &middot; GGUF Quantized &middot; Ready for Local Deployment
6
+ </p>
7
+
8
+ <p align="center">
9
+ <a href="https://huggingface.co/aab20abdullah/qwen_OSINT">
10
+ <img src="https://img.shields.io/badge/HuggingFace-Model_Card-yellow?logo=huggingface&logoColor=white" alt="HuggingFace Model">
11
+ </a>
12
+ <a href="https://huggingface.co/datasets/aab20abdullah/OSINT">
13
+ <img src="https://img.shields.io/badge/Dataset-OSINT-blue?logo=huggingface&logoColor=white" alt="Dataset">
14
+ </a>
15
+ <img src="https://img.shields.io/badge/License-MIT-green.svg" alt="License: MIT">
16
+ <img src="https://img.shields.io/badge/Parameters-4B-purple" alt="Parameters: 4B">
17
+ <img src="https://img.shields.io/badge/Context-256K-orange" alt="Context: 256K">
18
+ <img src="https://img.shields.io/badge/Architecture-Qwen3-red" alt="Architecture: Qwen3">
19
+ </p>
20
+
21
+ ---
22
+
23
+ ## Table of Contents
24
+
25
+ - [Overview](#overview)
26
+ - [Key Features](#key-features)
27
+ - [Model Variants](#model-variants)
28
+ - [Use Cases](#use-cases)
29
+ - [Installation & Usage](#installation--usage)
30
+ - [llama.cpp](#llamacpp)
31
+ - [Ollama](#ollama)
32
+ - [Python (llama-cpp-python)](#python-llama-cpp-python)
33
+ - [LM Studio](#lm-studio)
34
+ - [Jan](#jan)
35
+ - [Hardware Requirements](#hardware-requirements)
36
+ - [Prompting Guide](#prompting-guide)
37
+ - [Dataset](#dataset)
38
+ - [Model Architecture](#model-architecture)
39
+ - [Limitations & Responsible Use](#limitations--responsible-use)
40
+ - [License](#license)
41
+ - [Acknowledgments](#acknowledgments)
42
+
43
+ ---
44
+
45
+ ## Overview
46
+
47
+ **qwen_OSINT** is a specialized 4-billion parameter language model fine-tuned for **Open-Source Intelligence (OSINT)** operations. It is built on top of [Qwen3-4B-Thinking-2507](https://huggingface.co/Qwen/Qwen3-4B-Thinking-2507), a state-of-the-art small language model featuring explicit chain-of-thought reasoning. This specialized variant has been trained on a curated OSINT dataset to deliver expert-level guidance on intelligence gathering techniques, digital investigation methods, and reconnaissance workflows.
48
+
49
+ The model produces structured reasoning outputs with step-by-step analysis, making it ideal for cybersecurity professionals, threat intelligence analysts, digital investigators, and security researchers who need transparent, explainable intelligence assistance.
50
+
51
+ > **Note:** This model operates exclusively in **thinking mode** and automatically generates visible reasoning traces within `<think>` blocks, allowing you to audit its decision-making process before the final answer.
52
+
53
+ ---
54
+
55
+ ## Key Features
56
+
57
+ | Feature | Description |
58
+ |---------|-------------|
59
+ | **Specialized OSINT Knowledge** | Fine-tuned on 768 curated OSINT examples covering digital investigation, reconnaissance, and intelligence analysis |
60
+ | **Chain-of-Thought Reasoning** | Transparent step-by-step reasoning process visible in `<think>` blocks |
61
+ | **Native 256K Context** | Process extremely long inputs -- full reports, multi-document analysis, and extended dialogues |
62
+ | **Multiple Quantization Options** | Available in Q4_K_M, Q5_K_M, and Q8_0 for flexible deployment across hardware |
63
+ | **Local-First Deployment** | Runs entirely offline on consumer hardware -- no API keys or cloud dependencies |
64
+ | **Broad Tooling Support** | Compatible with llama.cpp, Ollama, LM Studio, Jan, and other GGUF inference engines |
65
+ | **Efficient Architecture** | 4B parameters with Group Query Attention (GQA) for optimal memory usage and fast inference |
66
+ | **MIT Licensed** | Free for personal, academic, and commercial use |
67
+
68
+ ---
69
+
70
+ ## Model Variants
71
+
72
+ | Variant | File | Size | Best For |
73
+ |---------|------|------|----------|
74
+ | **Q4_K_M** | `qwen3-4b-thinking-2507.Q4_K_M.gguf` | 2.5 GB | Maximum speed, lower VRAM usage, minimal quality loss |
75
+ | **Q5_K_M** | `qwen3-4b-thinking-2507.Q5_K_M.gguf` | 2.89 GB | Balanced quality and performance |
76
+ | **Q8_0** | `qwen3-4b-thinking-2507.Q8_0.gguf` | 4.28 GB | Maximum quality, near-lossless quantization |
77
+
78
+ ---
79
+
80
+ ## Use Cases
81
+
82
+ This model excels at providing structured guidance on OSINT methodologies including:
83
+
84
+ - **Digital Identity Investigation** -- Email correlation, username cross-platform enumeration, social media account discovery
85
+ - **Network Reconnaissance** -- IP geolocation, subdomain enumeration, DNS analysis, certificate transparency log monitoring
86
+ - **Domain & Website Intelligence** -- WHOIS lookups, historical snapshots, technology stack fingerprinting
87
+ - **Image & Media Verification** -- Reverse image search guidance, EXIF metadata analysis, deepfake detection techniques
88
+ - **Cryptocurrency Tracing** -- Blockchain transaction analysis, wallet clustering, fund flow investigation
89
+ - **Dark Web Monitoring** -- Leaked database identification, breach notification procedures
90
+ - **Corporate Intelligence** -- Employee enumeration, organizational structure mapping, asset discovery
91
+ - **Mobile & Telephony** -- Phone number validation, carrier identification, SIM-swapping prevention
92
+ - **Geolocation & Physical Intel** -- Address verification, property record queries, geolocation tag analysis
93
+ - **Document Forensics** -- Metadata extraction, authorship attribution, file provenance analysis
94
+ - **Social Media Analysis** -- Bot detection, influence network mapping, disinformation campaign identification
95
+
96
+ ---
97
+
98
+ ## Installation & Usage
99
+
100
+ ### Prerequisites
101
+
102
+ Ensure you have one of the supported inference engines installed. The model is distributed in **GGUF** format for maximum compatibility.
103
+
104
+ ### llama.cpp
105
+
106
+ **Install via Homebrew (macOS/Linux):**
107
+ ```bash
108
+ brew install llama.cpp
109
+ ```
110
+
111
+ **Install via WinGet (Windows):**
112
+ ```bash
113
+ winget install llama.cpp
114
+ ```
115
+
116
+ **Start a local OpenAI-compatible server:**
117
+ ```bash
118
+ llama-server -hf aab20abdullah/qwen_OSINT:Q4_K_M
119
+ ```
120
+
121
+ **Run inference in terminal:**
122
+ ```bash
123
+ llama-cli -hf aab20abdullah/qwen_OSINT:Q4_K_M --jinja
124
+ ```
125
+
126
+ **Build from source:**
127
+ ```bash
128
+ git clone https://github.com/ggerganov/llama.cpp.git
129
+ cd llama.cpp
130
+ cmake -B build
131
+ cmake --build build -j --target llama-server llama-cli
132
+
133
+ ./build/bin/llama-server -hf aab20abdullah/qwen_OSINT:Q4_K_M
134
+ ```
135
+
136
+ ### Ollama
137
+
138
+ An Ollama Modelfile is included for easy deployment.
139
+
140
+ ```bash
141
+ ollama run hf.co/aab20abdullah/qwen_OSINT:Q4_K_M
142
+ ```
143
+
144
+ Or create a custom Modelfile:
145
+ ```dockerfile
146
+ FROM ./qwen3-4b-thinking-2507.Q4_K_M.gguf
147
+
148
+ SYSTEM """You are an expert OSINT (Open-Source Intelligence) analyst. Provide detailed, step-by-step investigative guidance. Always explain your reasoning process before delivering conclusions."""
149
+
150
+ PARAMETER temperature 0.6
151
+ PARAMETER top_p 0.95
152
+ PARAMETER top_k 20
153
+ ```
154
+
155
+ ### Python (llama-cpp-python)
156
+
157
+ ```bash
158
+ pip install llama-cpp-python
159
+ ```
160
+
161
+ ```python
162
+ from llama_cpp import Llama
163
+
164
+ llm = Llama.from_pretrained(
165
+ repo_id="aab20abdullah/qwen_OSINT",
166
+ filename="qwen3-4b-thinking-2507.Q4_K_M.gguf",
167
+ n_ctx=32768, # Context window size
168
+ verbose=False
169
+ )
170
+
171
+ response = llm.create_chat_completion(
172
+ messages=[
173
+ {
174
+ "role": "system",
175
+ "content": "You are an expert OSINT analyst specializing in digital investigations and open-source intelligence gathering."
176
+ },
177
+ {
178
+ "role": "user",
179
+ "content": "How would you approach investigating a potentially fraudulent website?"
180
+ }
181
+ ],
182
+ temperature=0.6,
183
+ top_p=0.95,
184
+ max_tokens=4096
185
+ )
186
+
187
+ print(response["choices"][0]["message"]["content"])
188
+ ```
189
+
190
+ ### LM Studio
191
+
192
+ 1. Open LM Studio
193
+ 2. Search for `aab20abdullah/qwen_OSINT` in the model browser
194
+ 3. Download your preferred quantization variant
195
+ 4. Load the model and start chatting
196
+
197
+ ### Jan
198
+
199
+ 1. Open Jan application
200
+ 2. Navigate to **Hub** or **Models**
201
+ 3. Add Hugging Face model: `aab20abdullah/qwen_OSINT`
202
+ 4. Select your preferred GGUF variant and download
203
+ 5. Start a new conversation with the loaded model
204
+
205
+ ### Docker
206
+
207
+ ```bash
208
+ docker model run hf.co/aab20abdullah/qwen_OSINT:Q4_K_M
209
+ ```
210
+
211
+ ---
212
+
213
+ ## Hardware Requirements
214
+
215
+ | Variant | Minimum RAM | Recommended RAM | GPU VRAM (Optional) |
216
+ |---------|-------------|-----------------|---------------------|
217
+ | **Q4_K_M** | 4 GB | 8 GB | 3 GB+ |
218
+ | **Q5_K_M** | 5 GB | 10 GB | 4 GB+ |
219
+ | **Q8_0** | 6 GB | 12 GB | 5 GB+ |
220
+
221
+ > **Tip:** This model can run on a **4GB Raspberry Pi** with the Q4_K_M variant. For full 256K context utilization, approximately 65 GB of system RAM is required.
222
+
223
+ ---
224
+
225
+ ## Prompting Guide
226
+
227
+ ### Recommended Sampling Parameters
228
+
229
+ | Parameter | Value |
230
+ |-----------|-------|
231
+ | Temperature | 0.6 |
232
+ | Top P | 0.95 |
233
+ | Top K | 20 |
234
+ | Max Tokens | 4,096 (standard) / 8,192 (complex analysis) |
235
+
236
+ ### System Prompt
237
+
238
+ For optimal OSINT performance, use a system prompt that establishes the model's expertise:
239
+
240
+ ```
241
+ You are an expert OSINT (Open-Source Intelligence) analyst and investigator.
242
+ You specialize in digital reconnaissance, threat intelligence, social media
243
+ analysis, and open-source information gathering. Provide structured,
244
+ step-by-step investigative guidance. Always explain your reasoning process
245
+ before delivering conclusions. Cite specific tools, techniques, and
246
+ methodologies where applicable. Maintain ethical boundaries and emphasize
247
+ legal compliance in all investigative recommendations.
248
+ ```
249
+
250
+ ### Example Prompts
251
+
252
+ **Domain Investigation:**
253
+ ```
254
+ What techniques can I use to map the infrastructure of a suspicious domain,
255
+ including subdomains, hosting providers, and historical changes?
256
+ ```
257
+
258
+ **Person of Interest Research:**
259
+ ```
260
+ Walk me through a systematic approach to locating someone's professional
261
+ history using only publicly available sources and without violating privacy laws.
262
+ ```
263
+
264
+ **Incident Response:**
265
+ ```
266
+ A company suspects their employee data has been leaked. Outline a comprehensive
267
+ OSINT workflow to identify the source, scope, and current availability of the
268
+ leaked information on the open web and dark web.
269
+ ```
270
+
271
+ ---
272
+
273
+ ## Dataset
274
+
275
+ This model was fine-tuned on the [OSINT Dataset](https://huggingface.co/datasets/aab20abdullah/OSINT), a curated collection of 768 training examples specifically designed for intelligence analysis education and training.
276
+
277
+ ### Dataset Structure
278
+
279
+ Each example contains three fields:
280
+
281
+ | Field | Description | Example |
282
+ |-------|-------------|---------|
283
+ | **Question** | The OSINT inquiry or scenario | "How to verify a website's registration date and owner information?" |
284
+ | **Thinking** | Step-by-step analytical reasoning | "Domain registration information contains key data such as creation date, expiration date, and registrant details..." |
285
+ | **Solution** | Concrete tools, techniques, and actionable guidance | "Use WHOIS lookup (who.is, whois.domaintools.com); check domain history records (WHOIS History)." |
286
+
287
+ ### Dataset Coverage
288
+
289
+ The dataset spans 25+ OSINT domains including digital identity verification, network reconnaissance, geolocation analysis, cryptocurrency tracing, corporate intelligence gathering, social media investigation, and forensic document analysis.
290
+
291
+ > **Access:** [aab20abdullah/OSINT on Hugging Face Datasets](https://huggingface.co/datasets/aab20abdullah/OSINT)
292
+
293
+ ---
294
+
295
+ ## Model Architecture
296
+
297
+ ```yaml
298
+ Base Model: Qwen/Qwen3-4B-Thinking-2507
299
+ Parameters: 4.0B (3.6B non-embedding)
300
+ Architecture: Dense Transformer
301
+ Layers: 36
302
+ Attention: Group Query Attention (GQA)
303
+ Attention Heads: 32 Query / 8 Key-Value
304
+ Context Length: 262,144 tokens (native)
305
+ Vocabulary Size: 151,936
306
+ Fine-tuning Framework: Unsloth
307
+ Quantization: GGUF (Q4_K_M, Q5_K_M, Q8_0)
308
+ Training Data: 768 OSINT examples
309
+ License: MIT
310
+ ```
311
+
312
+ ### Base Model Capabilities
313
+
314
+ Qwen3-4B-Thinking-2507 delivers exceptional reasoning performance for a 4B parameter model:
315
+
316
+ | Benchmark | Score |
317
+ |-----------|-------|
318
+ | AIME25 (Mathematics) | 81.3% |
319
+ | HMMT25 (Science) | 55.5% |
320
+ | GPQA (General QA) | 65.8% |
321
+ | LiveCodeBench (Coding) | 55.2% |
322
+ | BFCL-v3 (Tool Usage) | 71.2% |
323
+
324
+ ---
325
+
326
+ ## Limitations & Responsible Use
327
+
328
+ ### Known Limitations
329
+
330
+ - **Knowledge Cutoff:** The model's knowledge is current only up to the base model's training data cutoff date. Always verify tool availability, URL validity, and service existence before use.
331
+ - **No Live Access:** This model cannot browse the live internet, execute queries, or access real-time data. It provides methodological guidance only.
332
+ - **Hallucination Risk:** Like all LLMs, it may occasionally suggest tools or techniques that no longer exist or recommend incorrect procedures. Always cross-reference with current documentation.
333
+ - **Jurisdiction Variations:** OSINT laws and regulations vary significantly by country. Users are responsible for ensuring compliance with local legal frameworks.
334
+ - **No Guarantees:** The model provides educational guidance on OSINT methodologies. Results in real-world investigations depend on target visibility, data availability, and operator skill.
335
+
336
+ ### Responsible Use Policy
337
+
338
+ This model is intended for **legitimate security research, educational purposes, authorized penetration testing, journalism, law enforcement, and corporate security operations only**.
339
+
340
+ **Prohibited uses include:**
341
+ - Stalking, harassment, or unauthorized surveillance of individuals
342
+ - Doxxing or publishing private information without consent
343
+ - Identity theft or financial fraud
344
+ - Corporate espionage against non-consenting entities
345
+ - Any activity violating applicable laws or regulations
346
+
347
+ By using this model, you agree to deploy it ethically and in full compliance with all applicable laws, including GDPR, CCPA, and local privacy regulations.
348
+
349
+ ---
350
+
351
+ ## License
352
+
353
+ This model is licensed under the **MIT License**.
354
+
355
+ The base model [Qwen3-4B-Thinking-2507](https://huggingface.co/Qwen/Qwen3-4B-Thinking-2507) is licensed under Apache 2.0. This fine-tuned derivative adds no additional restrictions beyond those of the underlying licenses.
356
+
357
+ You are free to use, modify, distribute, and sublicense this model for personal, academic, and commercial purposes, provided that the license terms are included in all copies or substantial portions.
358
+
359
+ ---
360
+
361
+ ## Acknowledgments
362
+
363
+ - **Alibaba Qwen Team** for the exceptional Qwen3-4B-Thinking-2507 base model
364
+ - **Unsloth** for the 2x faster fine-tuning framework and GGUF quantization pipeline
365
+ - **llama.cpp** team for the efficient GGUF inference engine
366
+ - **Hugging Face** for model hosting, dataset infrastructure, and the Transformers ecosystem
367
+
368
+ ---
369
+
370
+ <p align="center">
371
+ <sub>Built with care for the cybersecurity and OSINT community.</sub><br>
372
+ <sub>For questions or contributions, open a discussion on the <a href="https://huggingface.co/aab20abdullah/qwen_OSINT/discussions">Hugging Face Community tab</a>.</sub>
373
+ </p>