Raiff1982 commited on
Commit
5ba9a91
·
verified ·
1 Parent(s): 5d0a4e1

Add README_GPT_OSS.md

Browse files
Files changed (1) hide show
  1. docs/README_GPT_OSS.md +426 -0
docs/README_GPT_OSS.md ADDED
@@ -0,0 +1,426 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # GPT-OSS - Open Source ChatGPT Alternative
2
+
3
+ A powerful open-source alternative to ChatGPT with advanced reasoning capabilities, integrated browser tools, and Python code execution — all running locally on Ollama.
4
+
5
+ ## 🚀 Quick Start
6
+
7
+ ```bash
8
+ # Pull and run the model
9
+ ollama pull Raiff1982/gpt-oss
10
+ ollama run Raiff1982/gpt-oss
11
+ ```
12
+
13
+ ## 🎯 What Makes This Model Special?
14
+
15
+ GPT-OSS provides a feature-complete ChatGPT experience with:
16
+
17
+ - **🧠 Multi-Level Reasoning** - Built-in analysis channels for deep thinking
18
+ - **🌐 Browser Integration** - Search, open, and find information on the web
19
+ - **🐍 Python Execution** - Run Python code in a stateful Jupyter environment
20
+ - **🔧 Tool Calling** - Extensible function calling framework
21
+ - **📊 Data Persistence** - Save and load files to `/mnt/data`
22
+ - **💭 Chain of Thought** - Transparent reasoning with configurable depth
23
+
24
+ ## 🛠️ Core Features
25
+
26
+ ### Reasoning Channels
27
+
28
+ The model operates across multiple channels for structured thinking:
29
+
30
+ ```
31
+ analysis → Internal reasoning and tool usage (Python, browser)
32
+ commentary → Function calls and external tool integration
33
+ final → User-facing responses and conclusions
34
+ ```
35
+
36
+ This architecture enables:
37
+ - **Transparent reasoning** - See how the model thinks
38
+ - **Tool integration** - Seamlessly use Python/browser without breaking flow
39
+ - **Clean output** - Separate internal work from final answers
40
+
41
+ ### Browser Tools
42
+
43
+ Built-in web browsing capabilities:
44
+
45
+ ```python
46
+ # Search the web
47
+ browser.search(query="latest AI research", topn=10)
48
+
49
+ # Open specific results
50
+ browser.open(id=3, loc=0, num_lines=50)
51
+
52
+ # Find text on page
53
+ browser.find(pattern="neural networks")
54
+ ```
55
+
56
+ **Use cases:**
57
+ - Research current events and news
58
+ - Find technical documentation
59
+ - Verify facts and statistics
60
+ - Compare information across sources
61
+
62
+ ### Python Code Execution
63
+
64
+ Stateful Jupyter notebook environment:
65
+
66
+ ```python
67
+ # Execute code directly
68
+ import pandas as pd
69
+ import matplotlib.pyplot as plt
70
+
71
+ # Load and analyze data
72
+ df = pd.read_csv('/mnt/data/data.csv')
73
+ df.describe()
74
+
75
+ # Create visualizations
76
+ plt.plot(df['x'], df['y'])
77
+ plt.savefig('/mnt/data/plot.png')
78
+ ```
79
+
80
+ **Capabilities:**
81
+ - Full Python standard library
82
+ - Data analysis (pandas, numpy)
83
+ - Visualization (matplotlib, seaborn)
84
+ - Machine learning (scikit-learn)
85
+ - File persistence in `/mnt/data`
86
+ - 120 second execution timeout
87
+
88
+ ### Reasoning Levels
89
+
90
+ Control analysis depth with reasoning parameters:
91
+
92
+ ```
93
+ low → Quick, intuitive responses
94
+ medium → Balanced thinking (default)
95
+ high → Deep, thorough analysis
96
+ ```
97
+
98
+ ## 🎨 Example Use Cases
99
+
100
+ ### Research Assistant
101
+ ```
102
+ > What are the latest developments in quantum computing?
103
+
104
+ [Model searches web, analyzes multiple sources, synthesizes findings]
105
+ [Cites sources with: 【6†L9-L11】 format]
106
+ [Provides comprehensive summary with references]
107
+ ```
108
+
109
+ ### Data Analysis
110
+ ```
111
+ > Analyze this CSV and find correlations
112
+
113
+ [Loads data with pandas]
114
+ [Performs statistical analysis]
115
+ [Creates visualization]
116
+ [Explains insights and patterns]
117
+ ```
118
+
119
+ ### Code Generation & Debugging
120
+ ```
121
+ > Help me debug this Python function
122
+
123
+ [Analyzes code structure]
124
+ [Tests in Python environment]
125
+ [Identifies issues]
126
+ [Provides corrected version with explanation]
127
+ ```
128
+
129
+ ### Multi-Step Problem Solving
130
+ ```
131
+ > Plan a trip to Tokyo for 5 days under $2000
132
+
133
+ [Searches flight prices]
134
+ [Finds accommodation options]
135
+ [Researches local costs]
136
+ [Creates detailed itinerary with budget breakdown]
137
+ ```
138
+
139
+ ## ⚙️ Technical Specifications
140
+
141
+ - **Size**: ~13 GB
142
+ - **Context Window**: 8192+ tokens
143
+ - **Temperature**: 1.0 (balanced creativity)
144
+ - **Knowledge Cutoff**: June 2024
145
+ - **License**: Apache 2.0
146
+
147
+ ### System Architecture
148
+
149
+ ```
150
+ User Query
151
+
152
+ System Prompt (ChatGPT identity, tool definitions)
153
+
154
+ Analysis Channel (reasoning, Python, browser tools)
155
+
156
+ Commentary Channel (function calls)
157
+
158
+ Final Channel (user-facing response)
159
+ ```
160
+
161
+ ## 🔧 Advanced Usage
162
+
163
+ ### Custom System Instructions
164
+
165
+ Extend the model with additional context:
166
+
167
+ ```bash
168
+ ollama run Raiff1982/gpt-oss "You are now a specialized Python tutor..."
169
+ ```
170
+
171
+ ### Function Calling
172
+
173
+ Define custom functions the model can call:
174
+
175
+ ```json
176
+ {
177
+ "name": "get_weather",
178
+ "description": "Get current weather for a location",
179
+ "parameters": {
180
+ "type": "object",
181
+ "properties": {
182
+ "location": {"type": "string"},
183
+ "units": {"type": "string", "enum": ["celsius", "fahrenheit"]}
184
+ }
185
+ }
186
+ }
187
+ ```
188
+
189
+ ### API Integration
190
+
191
+ Use with Ollama's API for programmatic access:
192
+
193
+ ```python
194
+ import ollama
195
+
196
+ response = ollama.chat(
197
+ model='Raiff1982/gpt-oss',
198
+ messages=[
199
+ {
200
+ 'role': 'user',
201
+ 'content': 'Write a Python script to analyze CSV data'
202
+ }
203
+ ],
204
+ tools=[
205
+ {
206
+ 'type': 'function',
207
+ 'function': {
208
+ 'name': 'python',
209
+ 'description': 'Execute Python code'
210
+ }
211
+ }
212
+ ]
213
+ )
214
+
215
+ print(response['message']['content'])
216
+ ```
217
+
218
+ ### Streaming Responses
219
+
220
+ Get real-time output for long responses:
221
+
222
+ ```python
223
+ stream = ollama.chat(
224
+ model='Raiff1982/gpt-oss',
225
+ messages=[{'role': 'user', 'content': 'Explain quantum mechanics'}],
226
+ stream=True
227
+ )
228
+
229
+ for chunk in stream:
230
+ print(chunk['message']['content'], end='', flush=True)
231
+ ```
232
+
233
+ ## 🧪 Example Interactions
234
+
235
+ ### Web Research with Citations
236
+ ```
237
+ User: What caused the 2024 stock market volatility?
238
+
239
+ Assistant (analysis): *searches financial news*
240
+ Assistant (analysis): *opens top 3 results*
241
+ Assistant (analysis): *extracts key information*
242
+
243
+ Assistant (final): Based on recent financial reports【4†L12-L15】【7†L8】:
244
+ 1. Federal interest rate changes in Q1 2024
245
+ 2. Tech sector corrections【4†L23】
246
+ 3. Geopolitical tensions affecting energy markets【9†L5-L9】
247
+
248
+ The primary driver was...
249
+ ```
250
+
251
+ ### Python Data Analysis
252
+ ```
253
+ User: Load sales_data.csv and show me the top 5 products
254
+
255
+ Assistant (analysis):
256
+ ```python
257
+ import pandas as pd
258
+ df = pd.read_csv('/mnt/data/sales_data.csv')
259
+ top_5 = df.groupby('product')['revenue'].sum().sort_values(ascending=False).head(5)
260
+ print(top_5)
261
+ ```
262
+
263
+ Assistant (final): Here are your top 5 products by revenue:
264
+ 1. Product A: $45,230
265
+ 2. Product B: $38,920
266
+ ...
267
+ ```
268
+
269
+ ### Complex Reasoning
270
+ ```
271
+ User: Should I invest in renewable energy stocks?
272
+
273
+ Assistant (analysis): *thinking deeply*
274
+ - Market trends analysis
275
+ - Policy impact assessment
276
+ - Risk evaluation
277
+ - Timeline considerations
278
+
279
+ Assistant (final): I'll break this down across several dimensions:
280
+
281
+ **Market Analysis** [searches recent data]
282
+ - Solar industry growth rate: 15% YoY【3†L45】
283
+ - Wind energy investments up 23%【5†L12-L14】
284
+
285
+ **Policy Environment**
286
+ [Considers regulatory landscape, incentives, risks]
287
+
288
+ **Personal Recommendation**
289
+ Based on your [risk tolerance/timeline/goals]...
290
+ ```
291
+
292
+ ## 📊 Capabilities Matrix
293
+
294
+ | Feature | Supported | Notes |
295
+ |---------|-----------|-------|
296
+ | Web Search | ✅ | Real-time information retrieval |
297
+ | Web Browsing | ✅ | Open and parse URLs |
298
+ | Python Execution | ✅ | Stateful Jupyter environment |
299
+ | Code Generation | ✅ | Multiple languages |
300
+ | Data Analysis | ✅ | Pandas, NumPy, visualization |
301
+ | File Persistence | ✅ | `/mnt/data` directory |
302
+ | Function Calling | ✅ | Extensible tool framework |
303
+ | Multi-Step Reasoning | ✅ | Chain of thought |
304
+ | Streaming | ✅ | Real-time output |
305
+ | Citations | ✅ | Source tracking with line numbers |
306
+
307
+ ## 🔒 Privacy & Safety
308
+
309
+ **Local Execution Benefits:**
310
+ - All processing happens on your machine
311
+ - No data sent to external APIs (except browser tools)
312
+ - Full control over tool usage
313
+ - Inspect code before execution
314
+
315
+ **Browser Tool Considerations:**
316
+ - Browser tools do make external web requests
317
+ - Review URLs and search queries before execution
318
+ - Content fetched is processed locally
319
+
320
+ **Python Execution Safety:**
321
+ - Sandboxed environment with 120s timeout
322
+ - File access limited to `/mnt/data`
323
+ - No network access from Python by default
324
+ - Review generated code before running
325
+
326
+ ## 🚦 Best Practices
327
+
328
+ ### Effective Prompting
329
+ ```
330
+ ❌ Vague: "Tell me about AI"
331
+ ✅ Specific: "Search for recent breakthroughs in transformer architecture
332
+ from 2024, then summarize the top 3 findings"
333
+
334
+ ❌ Too broad: "Analyze my data"
335
+ ✅ Actionable: "Load sales.csv, calculate monthly revenue trends,
336
+ and create a line plot showing growth over time"
337
+ ```
338
+
339
+ ### Tool Usage
340
+ - **Search first** - Use browser before asking knowledge questions
341
+ - **Verify with code** - Use Python to validate calculations
342
+ - **Cite sources** - Pay attention to citation numbers
343
+ - **Check dates** - Knowledge cutoff is June 2024
344
+
345
+ ### Reasoning Control
346
+ ```bash
347
+ # Quick responses
348
+ ollama run Raiff1982/gpt-oss --reasoning low "Quick question..."
349
+
350
+ # Deep analysis
351
+ ollama run Raiff1982/gpt-oss --reasoning high "Complex problem..."
352
+ ```
353
+
354
+ ## 🆚 GPT-OSS vs. Other Models
355
+
356
+ | Feature | GPT-OSS | Standard LLMs | ChatGPT Plus |
357
+ |---------|---------|---------------|--------------|
358
+ | Cost | Free (local) | Free/Varies | $20/month |
359
+ | Privacy | Full privacy | Varies | Data processed externally |
360
+ | Tools | Browser + Python | None | Browser + Python + DALL-E |
361
+ | Reasoning | Transparent | Hidden | Partial transparency |
362
+ | Customization | Full control | Limited | Limited |
363
+ | Offline | After download | Varies | No |
364
+
365
+ ## 🔄 Updates & Versioning
366
+
367
+ This model is actively maintained:
368
+ - Base architecture follows ChatGPT design patterns
369
+ - Tools and capabilities updated regularly
370
+ - Community contributions welcome
371
+
372
+ ## 📚 Related Resources
373
+
374
+ - [Ollama Documentation](https://ollama.ai/docs)
375
+ - [Function Calling Guide](https://github.com/ollama/ollama/blob/main/docs/api.md#tools)
376
+ - [Python Environment Details](https://jupyter.org/)
377
+ - [Apache License 2.0](http://www.apache.org/licenses/LICENSE-2.0)
378
+
379
+ ## 🤝 Contributing
380
+
381
+ Help improve GPT-OSS:
382
+ 1. Report issues with tool usage
383
+ 2. Share effective prompting strategies
384
+ 3. Contribute function definitions
385
+ 4. Document use cases and examples
386
+
387
+ ## 💡 Tips & Tricks
388
+
389
+ ### Multi-Step Workflows
390
+ ```
391
+ > First, search for "Python data visualization libraries 2024"
392
+ > Then, use Python to create example plots with the top 3 libraries
393
+ > Finally, compare their strengths and weaknesses
394
+ ```
395
+
396
+ ### Data Pipeline
397
+ ```
398
+ > Load my CSV from /mnt/data/raw.csv
399
+ > Clean the data (handle missing values, outliers)
400
+ > Create summary statistics
401
+ > Save cleaned data to /mnt/data/processed.csv
402
+ > Generate a report with key findings
403
+ ```
404
+
405
+ ### Research & Writing
406
+ ```
407
+ > Research the history of neural networks (search 5 sources)
408
+ > Outline a 1000-word article based on findings
409
+ > Draft section 1 with proper citations
410
+ > Review and refine for clarity
411
+ ```
412
+
413
+ ## 🏆 Acknowledgments
414
+
415
+ - **OpenAI** - ChatGPT architecture inspiration
416
+ - **Ollama Team** - Local model runtime
417
+ - **Open Source Community** - Tool integrations and feedback
418
+
419
+ ---
420
+
421
+ **Model Page**: https://ollama.com/Raiff1982/gpt-oss
422
+ **Created**: December 27, 2025
423
+ **Size**: 13 GB
424
+ **License**: Apache 2.0
425
+
426
+ *"Open source intelligence with the power of ChatGPT, privacy of local execution, and freedom of customization."*