neihtmahp committed on
Commit 0ada960 · verified · 1 Parent(s): 058f49f

Upload README.md with huggingface_hub

  1. README.md +93 -0
README.md ADDED
@@ -0,0 +1,93 @@
+ # Flock Dataset for Subnet 96 (M&A Consulting)
+
+ This dataset is designed for use on **Bittensor Subnet 96 (Flock)** to train and evaluate models that generate high-quality consulting-style responses in the **Mergers & Acquisitions (M&A)** domain.
+ It follows the JSONL structure required by Subnet 96 validators.
+
+ ---
+
+ ## 📂 Dataset Structure
+
+ Each entry is a single JSON object on its own line of a `.jsonl` file.
+ Format (pretty-printed here for readability):
+
+ ```json
+ {
+   "system": "You are an expert M&A strategy consultant. Provide concise, bullet-point style answers.",
+   "conversations": [
+     {"role": "user", "content": "User input text"},
+     {"role": "assistant", "content": "Assistant response text"}
+   ]
+ }
+ ```
+
+ - `system` → Defines the expert persona (M&A consultant).
+ - `user` → The client's question or request.
+ - `assistant` → The consultant's structured response.
+
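The structure described above can be checked mechanically before uploading a file. The following is a minimal sketch that tests only the fields shown in this README; the actual Subnet 96 validator rules are not stated here and may be stricter. The helper name `validate_entry` and the `ALLOWED_ROLES` constant are hypothetical, introduced for illustration.

```python
import json

# Assumption: only the two roles shown in the README are expected in "conversations".
ALLOWED_ROLES = ("user", "assistant")


def validate_entry(line: str) -> list:
    """Return a list of problems found in one JSONL line (empty if none)."""
    try:
        entry = json.loads(line)
    except json.JSONDecodeError as exc:
        return [f"invalid JSON: {exc.msg}"]
    if not isinstance(entry, dict):
        return ["entry is not a JSON object"]

    problems = []
    if not isinstance(entry.get("system"), str):
        problems.append("'system' must be a string")

    conversations = entry.get("conversations")
    if not isinstance(conversations, list) or not conversations:
        problems.append("'conversations' must be a non-empty list")
        return problems

    for i, turn in enumerate(conversations):
        if not isinstance(turn, dict):
            problems.append(f"turn {i} is not an object")
            continue
        if turn.get("role") not in ALLOWED_ROLES:
            problems.append(f"turn {i} has unexpected role {turn.get('role')!r}")
        if not isinstance(turn.get("content"), str):
            problems.append(f"turn {i}: 'content' must be a string")
    return problems
```

Calling `validate_entry` on each line of the `.jsonl` file and printing any non-empty result gives a quick pre-upload check.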
27
+ 🏗️ Example Entry
28
+ json
29
+ Kopiëren
30
+ Bewerken
31
+ {
32
+ "system": "You are an expert M&A strategy consultant. Provide concise, bullet-point style answers.",
33
+ "conversations": [
34
+ {
35
+ "role": "user",
36
+ "content": "What are key sales synergy levers for an integration/merger in the consumer goods industry?"
37
+ },
38
+ {
39
+ "role": "assistant",
40
+ "content": "- Improving margins by taking the most generous contractual terms\n- Consolidate sale volumes to bring more scale to the (yearly) negotiations\n- Apply cross selling by offering additional products to existing/new clients"
41
+ }
42
+ ]
43
+ }
+ ## ✅ Requirements Compliance
+
+ - Follows the Subnet 96 JSONL format (`system` + `conversations` array).
+ - Answers are structured as bullet points for validator readability.
+ - Focused on M&A consulting (pre-deal and post-deal).
+
51
+ 📊 Current Dataset Size
52
+ Entries: ~50 Q&A pairs (v1.0)
53
+
54
+ Format: JSONL (dataset_sn96.jsonl)
+ ## 🚀 Usage
+
+ ### Loading with the `datasets` library
+
+ ```python
+ from datasets import load_dataset
+
+ dataset = load_dataset("neihtmahp/flock_dataset")
+ print(dataset["train"][0])
+ ```
+
65
+ Example Output
66
+ python
67
+ Kopiëren
68
+ Bewerken
69
+ {
70
+ 'system': 'You are an expert M&A strategy consultant. Provide concise, bullet-point style answers.',
71
+ 'conversations': [
72
+ {'role': 'user', 'content': 'What are integration risks that are often underestimated?'},
73
+ {'role': 'assistant', 'content': '- Missing cross-functional alignment\n- Not sufficient time to apply user acceptance testing\n- Late sign-off from stakeholders'}
74
+ ]
75
+ }
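Some chat fine-tuning tools expect a single flat list of messages with the system prompt as the first item. Whether Flock training scripts need this is not stated in this README, so the following is only a sketch under that assumption; `to_messages` is a hypothetical helper name, and the sample entry is the one shown in the example output.

```python
def to_messages(entry: dict) -> list:
    """Prepend the system prompt to the conversation turns as one flat list."""
    messages = [{"role": "system", "content": entry["system"]}]
    messages.extend(
        {"role": turn["role"], "content": turn["content"]}
        for turn in entry["conversations"]
    )
    return messages


# Entry copied from the example output; in practice this would be dataset["train"][0].
example = {
    "system": "You are an expert M&A strategy consultant. Provide concise, bullet-point style answers.",
    "conversations": [
        {"role": "user", "content": "What are integration risks that are often underestimated?"},
        {"role": "assistant", "content": "- Missing cross-functional alignment\n- Not sufficient time to apply user acceptance testing\n- Late sign-off from stakeholders"},
    ],
}

for message in to_messages(example):
    print(message["role"], "->", message["content"][:40])
```

The original entry is left unmodified, so the same record can still be written back out in the `system` + `conversations` layout.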
76
+ 📌 Version History
77
+ v1.0 → Initial release with 50 curated Q&A entries.
78
+
79
+ Future versions will expand coverage of:
80
+
81
+ Commercial due diligence
82
+
83
+ IT due diligence
84
+
85
+ Post-merger integration
86
+
87
+ ✨ Acknowledgements
88
+ This dataset was created for experimentation with Flock Subnet 96 mining and validation.
89
+ Contributions welcome!
90
+
91
+ ---
92
+ license: mit
93
+ ---