File size: 4,202 Bytes
fba531b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
---
license: apache-2.0
tags:
- adam
- curious-architecture
- instruction-tuned
- conversational-ai
- 2b-parameters
library_name: transformers
pipeline_tag: text-generation
---

# Adam: Instruction-Tuned Conversational AI

<div align="center">
  <img src="https://img.shields.io/badge/Parameters-2B-blue" alt="2B Parameters">
  <img src="https://img.shields.io/badge/Architecture-CuriousForCausalLM-green" alt="Curious Architecture">
  <img src="https://img.shields.io/badge/Instruction%20Tuned-Yes-orange" alt="Instruction Tuned">
  <img src="https://img.shields.io/badge/Context%20Length-8K-purple" alt="8K Context">
</div>

## πŸš€ Model Overview

**Adam** is a powerful 2 billion parameter language model built with the Curious architecture, specifically instruction-tuned for high-quality conversational AI and task completion. This model represents the next generation of efficient, instruction-tuned language models optimized for natural conversations.

## ✨ Key Features

- **πŸ—οΈ Native Curious Architecture**: Custom `CuriousForCausalLM` architecture with Curious-specific optimizations
- **🎯 Instruction-Tuned**: Fine-tuned for conversational AI and task completion
- **⚑ Efficient**: 2B parameters with optimized inference
- **πŸ’¬ Conversational**: Specialized for natural dialogue and helpful responses
- **πŸ”§ Advanced Features**: Sliding window attention, logit softcapping, and enhanced activations

## πŸ“Š Model Specifications

| Parameter | Value |
|-----------|-------|
| **Architecture** | CuriousForCausalLM |
| **Model Type** | curious_text |
| **Parameters** | ~2.6B |
| **Context Length** | 8,192 tokens |
| **Vocabulary** | 256,000 tokens |
| **Training** | Instruction-tuned |
| **Curious Version** | 2.0 |

## 🎯 Capabilities

- **Natural Conversations**: Engaging and contextually aware dialogue
- **Question Answering**: Accurate responses to diverse queries
- **Creative Writing**: Poetry, stories, and creative content generation
- **Code Assistance**: Programming help and code generation
- **Mathematical Reasoning**: Problem-solving and calculations
- **Instruction Following**: Precise task execution and completion

## πŸš€ Quick Start


### Interactive Chat

```python
pip install requirements.txt
```

```python
# Use the included chat interface
python chat_with_adam.py to talk to adam.
```

## πŸ—οΈ Curious Architecture Features

- **Enhanced Attention**: Advanced attention mechanisms for better context understanding
- **Sliding Window**: Efficient processing of long sequences
- **Logit Softcapping**: Improved generation stability
- **Optimized Activations**: GELU with PyTorch tanh for better performance
- **Instruction Tuning**: Specialized for conversational AI tasks

## πŸ“ˆ Performance

- **Quality**: High-quality instruction-tuned responses
- **Speed**: Optimized for efficient inference
- **Memory**: ~5GB model size
- **Hardware**: GPU recommended for best performance
- **Context**: 8K token context window

## πŸ”§ Technical Details

### Model Configuration

```json
{
  "architectures": ["CuriousForCausalLM"],
  "model_type": "curious_text",
  "hidden_size": 2304,
  "num_attention_heads": 8,
  "num_hidden_layers": 26,
  "max_position_embeddings": 8192,
  "curious_version": "2.0",
  "curious_instruction_tuned": true
}
```

### Generation Parameters



## 🎨 Use Cases

- **Chatbots**: Conversational AI applications
- **Assistants**: Task-oriented AI helpers
- **Creative Writing**: Content generation and editing
- **Education**: Tutoring and explanation
- **Coding**: Programming assistance
- **Research**: Information synthesis and analysis

## ⚠️ Limitations

- **Context Length**: Limited to 8K tokens
- **Training Data**: Cutoff date applies to training data
- **Bias**: May reflect biases in training data
- **Factual Accuracy**: Should be verified for critical applications


## πŸ™ Acknowledgments

- Built with the Curious Architecture Framework
- Instruction-tuned for conversational AI
- Powered by the Curious Architecture Framework v2.0

---

<div align="center">
  <strong>Adam: The Future of Conversational AI</strong><br>
  <em>Built with ❀️ using the Curious Architecture Framework</em>
</div>