File size: 3,116 Bytes
8ea24a8
05b5ef4
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8ea24a8
 
 
 
 
05b5ef4
8ea24a8
 
 
0ada960
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
---
dataset_info:
  features:
  - name: system
    dtype: string
  - name: conversations
    sequence:
      - name: role
        dtype: string
      - name: content
        dtype: string
  splits:
  - name: train
    num_bytes: 0
    num_examples: 50   # update if you add more rows later
  download_size: 0
  dataset_size: 0
language:
- en
tags:
- bittensor
- flock
- consulting
- m&a
license: mit
---

# Flock Dataset for Subnet 96 (M&A Consulting)

This dataset is designed for use on **Bittensor Subnet 96 (Flock)** to train and evaluate models that generate high-quality consulting-style responses in the **Mergers & Acquisitions (M&A)** domain.  
It follows the JSONL structure required by Subnet 96 validators.

---

## 📂 Dataset Structure

Each entry in the dataset is a JSON object stored in a `.jsonl` file.  
Format:

```json
{
  "system": "You are an expert M&A strategy consultant. Provide concise, bullet-point style answers.",
  "conversations": [
    {"role": "user", "content": "User input text"},
    {"role": "assistant", "content": "Assistant response text"}
  ]
}
system → Defines the expert persona (M&A consultant).

user → The client’s question or request.

assistant → The consultant’s structured response.

🏗️ Example Entry
json
Kopiëren
Bewerken
{
  "system": "You are an expert M&A strategy consultant. Provide concise, bullet-point style answers.",
  "conversations": [
    {
      "role": "user",
      "content": "What are key sales synergy levers for an integration/merger in the consumer goods industry?"
    },
    {
      "role": "assistant",
      "content": "- Improving margins by taking the most generous contractual terms\n- Consolidate sale volumes to bring more scale to the (yearly) negotiations\n- Apply cross selling by offering additional products to existing/new clients"
    }
  ]
}
✅ Requirements Compliance
Follows Subnet 96 JSONL format (system + conversations array).

Answers are structured in bullet points for validator readability.

Focused on M&A consulting (pre-deal and post-deal).

📊 Current Dataset Size
Entries: ~50 Q&A pairs (v1.0)

Format: JSONL (dataset_sn96.jsonl)

🚀 Usage
Loading with datasets library
python
Kopiëren
Bewerken
from datasets import load_dataset

dataset = load_dataset("neihtmahp/flock_dataset")
print(dataset["train"][0])
Example Output
python
Kopiëren
Bewerken
{
  'system': 'You are an expert M&A strategy consultant. Provide concise, bullet-point style answers.',
  'conversations': [
    {'role': 'user', 'content': 'What are integration risks that are often underestimated?'},
    {'role': 'assistant', 'content': '- Missing cross-functional alignment\n- Not sufficient time to apply user acceptance testing\n- Late sign-off from stakeholders'}
  ]
}
📌 Version History
v1.0 → Initial release with 50 curated Q&A entries.

Future versions will expand coverage of:

Commercial due diligence

IT due diligence

Post-merger integration

✨ Acknowledgements
This dataset was created for experimentation with Flock Subnet 96 mining and validation.
Contributions welcome!

---

license: mit

---