mobadara committed · Commit 11dfce2 · verified · 1 Parent(s): 5738f9e

Sync from GitHub via hub-sync

LICENSE ADDED
@@ -0,0 +1,21 @@
+ MIT License
+
+ Copyright (c) 2026 Muyiwa J. Obadara
+
+ Permission is hereby granted, free of charge, to any person obtaining a copy
+ of this software and associated documentation files (the "Software"), to deal
+ in the Software without restriction, including without limitation the rights
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+ copies of the Software, and to permit persons to whom the Software is
+ furnished to do so, subject to the following conditions:
+
+ The above copyright notice and this permission notice shall be included in all
+ copies or substantial portions of the Software.
+
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+ SOFTWARE.
README.md CHANGED
@@ -1,12 +1,76 @@
  ---
- title: Finbert Sentiment Api
- emoji: 📊
- colorFrom: blue
- colorTo: green
- sdk: docker
- pinned: false
- license: mit
- short_description: Specialized sentiment analysis for financial news and report
  ---

- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
+ # 📈 FinBERT: Real-Time Financial Sentiment Analysis
+
+ A machine learning pipeline that analyses financial news headlines and classifies their sentiment as Bullish, Bearish, or Neutral.
+
+ The project fine-tunes a BERT-based model with PyTorch, serves prediction requests with FastAPI, and displays the results in a React-Bootstrap web interface.
+
+ ### 🔗 Project Links
+ * **Live Application:** [Link](https://portfolio-frontend-livid.vercel.app/projects/sentiment-analysis-with-bert)
+ * **Frontend Code:** [GitHub](https://github.com/mobadara/portfolio-frontend/blob/main/src/pages/projects/FinBERT.jsx)
+ * **Project Blog Post:** [Read the full write-up on Medium/Dev.to]
+ * **Video Walkthrough:** [Watch the explanation on YouTube]
+ * **Model Weights (Optional):** [Link to your Hugging Face model hub if uploaded]
+
+ ---
+
+ ## 🏗️ System Architecture
+
+ * **Machine Learning:** PyTorch, Hugging Face `transformers`, Financial PhraseBank Dataset
+ * **Backend API:** Python, FastAPI, Uvicorn
+ * **Frontend UI:** React, React-Bootstrap
+ * **Database:** PostgreSQL (Neon/Supabase) via SQLAlchemy
+
  ---
+
+ ## 🧠 The Machine Learning Pipeline
+
+ The core of this application is a fine-tuned NLP model.
+
+ 1. **Base Model:** `ProsusAI/finbert`
+ 2. **Fine-tuning:** Conducted in Google Colab using a T4 GPU.
+ 3. **Training Data:** The Kaggle Financial PhraseBank dataset.
+
+ *(Check the `/notebooks` directory to see the complete PyTorch training loop, tokenization process, and evaluation metrics.)*
+
  ---
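The last step of such a pipeline, turning a classifier's raw logits into one of the three sentiment labels, can be sketched in plain Python. This is only an illustration under stated assumptions: the class order in `ID2LABEL` and the mapping to Bullish/Bearish/Neutral in `DISPLAY` are hypothetical and are not taken from this commit, so they should be checked against the real checkpoint's configuration.

```python
import math

# Assumed class order of the model's output logits (hypothetical here;
# verify against the `id2label` mapping of the actual checkpoint).
ID2LABEL = {0: "positive", 1: "negative", 2: "neutral"}

# Assumed mapping from model labels to the labels named in the README.
DISPLAY = {"positive": "Bullish", "negative": "Bearish", "neutral": "Neutral"}


def softmax(logits):
    """Convert raw logits to probabilities in a numerically stable way."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(logits):
    """Return (display_label, confidence) for one headline's logits."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return DISPLAY[ID2LABEL[best]], probs[best]


# Example with made-up logits for a single headline.
label, confidence = classify([2.0, -1.0, 0.5])
print(label, round(confidence, 3))
```

Subtracting the largest logit before exponentiating leaves the probabilities unchanged and avoids overflow for large values.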

+ ## 🚀 How to Run Locally
+
+ ### 1. Clone the Repository
+
+ Copy the command below and run it in your favourite terminal.
+ ```bash
+ git clone https://github.com/mobadara/finbert-sentiment-analyzer-api.git &&
+ cd finbert-sentiment-analyzer-api
+ ```
+
+ ### 2. Create a Virtual Environment
+ ```bash
+ python -m venv venv
+ ```
+
+ ### 3. Activate the Virtual Environment
+ #### a. On Linux/Mac
+ ```bash
+ source venv/bin/activate
+ ```
+
+ #### b. On Windows
+ ```bash
+ venv\Scripts\activate
+ ```
+
+ ### 4. Install Dependencies
+ ```bash
+ pip install -r requirements.txt
+ ```
+
+ ### 5. Start the Server
+ ```bash
+ uvicorn app.main:app --reload
+ ```
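Once the server is running, a client could send it a headline as a JSON POST request. The sketch below only builds the request and uses the standard library alone. The address is Uvicorn's default bind address, but the `/predict` path and the `text` field are hypothetical: the route definitions are not shown in this commit, so adjust both to match the real API.

```python
import json
import urllib.request

# Uvicorn's default bind address. The /predict path and the "text" field
# are hypothetical placeholders, not taken from this repository.
BASE_URL = "http://127.0.0.1:8000"


def build_request(headline, path="/predict"):
    """Build (but do not send) a JSON POST request for one headline."""
    body = json.dumps({"text": headline}).encode("utf-8")
    return urllib.request.Request(
        BASE_URL + path,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = build_request("Company X raises full-year profit guidance")
print(req.get_method(), req.full_url)

# To send it while the server is running:
#     with urllib.request.urlopen(req) as resp:
#         print(json.load(resp))
```

Building the request separately from sending it makes the payload easy to inspect before any network call is made.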
+
+ ## 👨‍💻 Author
+ **Muyiwa J. Obadara**
+
+ [![LinkedIn](https://img.shields.io/badge/LinkedIn-%230077B5.svg?style=for-the-badge&logo=linkedin&logoColor=white)](https://linkedin.com/in/obadara-m)
+ [![X / Twitter](https://img.shields.io/badge/X-%23000000.svg?style=for-the-badge&logo=X&logoColor=white)](https://twitter.com/m_obadara)
+ [![Portfolio](https://img.shields.io/badge/Portfolio-Navy_Blue?style=for-the-badge&logo=google-chrome&logoColor=white&color=navy)](https://portfolio-frontend-livid.vercel.app)
app/__init__.py ADDED
File without changes
app/database.py ADDED
File without changes
app/main.py ADDED
File without changes
app/ml_model.py ADDED
File without changes
app/models.py ADDED
File without changes
app/schemas.py ADDED
File without changes
notebooks/data_exploration.ipynb ADDED
@@ -0,0 +1,2126 @@
1
+ {
2
+ "nbformat": 4,
3
+ "nbformat_minor": 0,
4
+ "metadata": {
5
+ "colab": {
6
+ "provenance": [],
7
+ "gpuType": "T4",
8
+ "authorship_tag": "ABX9TyMG1ZDYWVSh0cXbnb7kmaGm",
9
+ "include_colab_link": true
10
+ },
11
+ "kernelspec": {
12
+ "name": "python3",
13
+ "display_name": "Python 3"
14
+ },
15
+ "language_info": {
16
+ "name": "python"
17
+ },
18
+ "accelerator": "GPU",
19
+ "widgets": {
20
+ "application/vnd.jupyter.widget-state+json": {
21
+ "33c62285e7a14f46b6a678a5ee90cc1d": {
22
+ "model_module": "@jupyter-widgets/controls",
23
+ "model_name": "HBoxModel",
24
+ "model_module_version": "1.5.0",
25
+ "state": {
26
+ "_dom_classes": [],
27
+ "_model_module": "@jupyter-widgets/controls",
28
+ "_model_module_version": "1.5.0",
29
+ "_model_name": "HBoxModel",
30
+ "_view_count": null,
31
+ "_view_module": "@jupyter-widgets/controls",
32
+ "_view_module_version": "1.5.0",
33
+ "_view_name": "HBoxView",
34
+ "box_style": "",
35
+ "children": [
36
+ "IPY_MODEL_5cf6d55d9eb0422babce34b9d7cce86a",
37
+ "IPY_MODEL_b71f5f547f36428881401143783e1cda",
38
+ "IPY_MODEL_1d48279d16634f1eaac89219ccdff842"
39
+ ],
40
+ "layout": "IPY_MODEL_cb018f5627d6435bb32d53d99d3718ba"
41
+ }
42
+ },
43
+ "5cf6d55d9eb0422babce34b9d7cce86a": {
44
+ "model_module": "@jupyter-widgets/controls",
45
+ "model_name": "HTMLModel",
46
+ "model_module_version": "1.5.0",
47
+ "state": {
48
+ "_dom_classes": [],
49
+ "_model_module": "@jupyter-widgets/controls",
50
+ "_model_module_version": "1.5.0",
51
+ "_model_name": "HTMLModel",
52
+ "_view_count": null,
53
+ "_view_module": "@jupyter-widgets/controls",
54
+ "_view_module_version": "1.5.0",
55
+ "_view_name": "HTMLView",
56
+ "description": "",
57
+ "description_tooltip": null,
58
+ "layout": "IPY_MODEL_a4ee776b44964e54b006f902a472ffaf",
59
+ "placeholder": "​",
60
+ "style": "IPY_MODEL_576389aef48c4c63806322f9ef2f1188",
61
+ "value": "README.md: 100%"
62
+ }
63
+ },
64
+ "b71f5f547f36428881401143783e1cda": {
65
+ "model_module": "@jupyter-widgets/controls",
66
+ "model_name": "FloatProgressModel",
67
+ "model_module_version": "1.5.0",
68
+ "state": {
69
+ "_dom_classes": [],
70
+ "_model_module": "@jupyter-widgets/controls",
71
+ "_model_module_version": "1.5.0",
72
+ "_model_name": "FloatProgressModel",
73
+ "_view_count": null,
74
+ "_view_module": "@jupyter-widgets/controls",
75
+ "_view_module_version": "1.5.0",
76
+ "_view_name": "ProgressView",
77
+ "bar_style": "success",
78
+ "description": "",
79
+ "description_tooltip": null,
80
+ "layout": "IPY_MODEL_1ef978469ea441058f50d59aae5c35ce",
81
+ "max": 465,
82
+ "min": 0,
83
+ "orientation": "horizontal",
84
+ "style": "IPY_MODEL_638b61a4e2bc4682b81507c08178e413",
85
+ "value": 465
86
+ }
87
+ },
88
+ "1d48279d16634f1eaac89219ccdff842": {
89
+ "model_module": "@jupyter-widgets/controls",
90
+ "model_name": "HTMLModel",
91
+ "model_module_version": "1.5.0",
92
+ "state": {
93
+ "_dom_classes": [],
94
+ "_model_module": "@jupyter-widgets/controls",
95
+ "_model_module_version": "1.5.0",
96
+ "_model_name": "HTMLModel",
97
+ "_view_count": null,
98
+ "_view_module": "@jupyter-widgets/controls",
99
+ "_view_module_version": "1.5.0",
100
+ "_view_name": "HTMLView",
101
+ "description": "",
102
+ "description_tooltip": null,
103
+ "layout": "IPY_MODEL_b44ee4d8a78d454c9c1251cef4ef0a7f",
104
+ "placeholder": "​",
105
+ "style": "IPY_MODEL_c4aaccc4f20f49e3b4d7de54d0bbac5e",
106
+ "value": " 465/465 [00:00<00:00, 27.5kB/s]"
107
+ }
108
+ },
109
+ "cb018f5627d6435bb32d53d99d3718ba": {
110
+ "model_module": "@jupyter-widgets/base",
111
+ "model_name": "LayoutModel",
112
+ "model_module_version": "1.2.0",
113
+ "state": {
114
+ "_model_module": "@jupyter-widgets/base",
115
+ "_model_module_version": "1.2.0",
116
+ "_model_name": "LayoutModel",
117
+ "_view_count": null,
118
+ "_view_module": "@jupyter-widgets/base",
119
+ "_view_module_version": "1.2.0",
120
+ "_view_name": "LayoutView",
121
+ "align_content": null,
122
+ "align_items": null,
123
+ "align_self": null,
124
+ "border": null,
125
+ "bottom": null,
126
+ "display": null,
127
+ "flex": null,
128
+ "flex_flow": null,
129
+ "grid_area": null,
130
+ "grid_auto_columns": null,
131
+ "grid_auto_flow": null,
132
+ "grid_auto_rows": null,
133
+ "grid_column": null,
134
+ "grid_gap": null,
135
+ "grid_row": null,
136
+ "grid_template_areas": null,
137
+ "grid_template_columns": null,
138
+ "grid_template_rows": null,
139
+ "height": null,
140
+ "justify_content": null,
141
+ "justify_items": null,
142
+ "left": null,
143
+ "margin": null,
144
+ "max_height": null,
145
+ "max_width": null,
146
+ "min_height": null,
147
+ "min_width": null,
148
+ "object_fit": null,
149
+ "object_position": null,
150
+ "order": null,
151
+ "overflow": null,
152
+ "overflow_x": null,
153
+ "overflow_y": null,
154
+ "padding": null,
155
+ "right": null,
156
+ "top": null,
157
+ "visibility": null,
158
+ "width": null
159
+ }
160
+ },
161
+ "a4ee776b44964e54b006f902a472ffaf": {
162
+ "model_module": "@jupyter-widgets/base",
163
+ "model_name": "LayoutModel",
164
+ "model_module_version": "1.2.0",
165
+ "state": {
166
+ "_model_module": "@jupyter-widgets/base",
167
+ "_model_module_version": "1.2.0",
168
+ "_model_name": "LayoutModel",
169
+ "_view_count": null,
170
+ "_view_module": "@jupyter-widgets/base",
171
+ "_view_module_version": "1.2.0",
172
+ "_view_name": "LayoutView",
173
+ "align_content": null,
174
+ "align_items": null,
175
+ "align_self": null,
176
+ "border": null,
177
+ "bottom": null,
178
+ "display": null,
179
+ "flex": null,
180
+ "flex_flow": null,
181
+ "grid_area": null,
182
+ "grid_auto_columns": null,
183
+ "grid_auto_flow": null,
184
+ "grid_auto_rows": null,
185
+ "grid_column": null,
186
+ "grid_gap": null,
187
+ "grid_row": null,
188
+ "grid_template_areas": null,
189
+ "grid_template_columns": null,
190
+ "grid_template_rows": null,
191
+ "height": null,
192
+ "justify_content": null,
193
+ "justify_items": null,
194
+ "left": null,
195
+ "margin": null,
196
+ "max_height": null,
197
+ "max_width": null,
198
+ "min_height": null,
199
+ "min_width": null,
200
+ "object_fit": null,
201
+ "object_position": null,
202
+ "order": null,
203
+ "overflow": null,
204
+ "overflow_x": null,
205
+ "overflow_y": null,
206
+ "padding": null,
207
+ "right": null,
208
+ "top": null,
209
+ "visibility": null,
210
+ "width": null
211
+ }
212
+ },
213
+ "576389aef48c4c63806322f9ef2f1188": {
214
+ "model_module": "@jupyter-widgets/controls",
215
+ "model_name": "DescriptionStyleModel",
216
+ "model_module_version": "1.5.0",
217
+ "state": {
218
+ "_model_module": "@jupyter-widgets/controls",
219
+ "_model_module_version": "1.5.0",
220
+ "_model_name": "DescriptionStyleModel",
221
+ "_view_count": null,
222
+ "_view_module": "@jupyter-widgets/base",
223
+ "_view_module_version": "1.2.0",
224
+ "_view_name": "StyleView",
225
+ "description_width": ""
226
+ }
227
+ },
228
+ "1ef978469ea441058f50d59aae5c35ce": {
229
+ "model_module": "@jupyter-widgets/base",
230
+ "model_name": "LayoutModel",
231
+ "model_module_version": "1.2.0",
232
+ "state": {
233
+ "_model_module": "@jupyter-widgets/base",
234
+ "_model_module_version": "1.2.0",
235
+ "_model_name": "LayoutModel",
236
+ "_view_count": null,
237
+ "_view_module": "@jupyter-widgets/base",
238
+ "_view_module_version": "1.2.0",
239
+ "_view_name": "LayoutView",
240
+ "align_content": null,
241
+ "align_items": null,
242
+ "align_self": null,
243
+ "border": null,
244
+ "bottom": null,
245
+ "display": null,
246
+ "flex": null,
247
+ "flex_flow": null,
248
+ "grid_area": null,
249
+ "grid_auto_columns": null,
250
+ "grid_auto_flow": null,
251
+ "grid_auto_rows": null,
252
+ "grid_column": null,
253
+ "grid_gap": null,
254
+ "grid_row": null,
255
+ "grid_template_areas": null,
256
+ "grid_template_columns": null,
257
+ "grid_template_rows": null,
258
+ "height": null,
259
+ "justify_content": null,
260
+ "justify_items": null,
261
+ "left": null,
262
+ "margin": null,
263
+ "max_height": null,
264
+ "max_width": null,
265
+ "min_height": null,
266
+ "min_width": null,
267
+ "object_fit": null,
268
+ "object_position": null,
269
+ "order": null,
270
+ "overflow": null,
271
+ "overflow_x": null,
272
+ "overflow_y": null,
273
+ "padding": null,
274
+ "right": null,
275
+ "top": null,
276
+ "visibility": null,
277
+ "width": null
278
+ }
279
+ },
280
+ "638b61a4e2bc4682b81507c08178e413": {
281
+ "model_module": "@jupyter-widgets/controls",
282
+ "model_name": "ProgressStyleModel",
283
+ "model_module_version": "1.5.0",
284
+ "state": {
285
+ "_model_module": "@jupyter-widgets/controls",
286
+ "_model_module_version": "1.5.0",
287
+ "_model_name": "ProgressStyleModel",
288
+ "_view_count": null,
289
+ "_view_module": "@jupyter-widgets/base",
290
+ "_view_module_version": "1.2.0",
291
+ "_view_name": "StyleView",
292
+ "bar_color": null,
293
+ "description_width": ""
294
+ }
295
+ },
296
+ "b44ee4d8a78d454c9c1251cef4ef0a7f": {
297
+ "model_module": "@jupyter-widgets/base",
298
+ "model_name": "LayoutModel",
299
+ "model_module_version": "1.2.0",
300
+ "state": {
301
+ "_model_module": "@jupyter-widgets/base",
302
+ "_model_module_version": "1.2.0",
303
+ "_model_name": "LayoutModel",
304
+ "_view_count": null,
305
+ "_view_module": "@jupyter-widgets/base",
306
+ "_view_module_version": "1.2.0",
307
+ "_view_name": "LayoutView",
308
+ "align_content": null,
309
+ "align_items": null,
310
+ "align_self": null,
311
+ "border": null,
312
+ "bottom": null,
313
+ "display": null,
314
+ "flex": null,
315
+ "flex_flow": null,
316
+ "grid_area": null,
317
+ "grid_auto_columns": null,
318
+ "grid_auto_flow": null,
319
+ "grid_auto_rows": null,
320
+ "grid_column": null,
321
+ "grid_gap": null,
322
+ "grid_row": null,
323
+ "grid_template_areas": null,
324
+ "grid_template_columns": null,
325
+ "grid_template_rows": null,
326
+ "height": null,
327
+ "justify_content": null,
328
+ "justify_items": null,
329
+ "left": null,
330
+ "margin": null,
331
+ "max_height": null,
332
+ "max_width": null,
333
+ "min_height": null,
334
+ "min_width": null,
335
+ "object_fit": null,
336
+ "object_position": null,
337
+ "order": null,
338
+ "overflow": null,
339
+ "overflow_x": null,
340
+ "overflow_y": null,
341
+ "padding": null,
342
+ "right": null,
343
+ "top": null,
344
+ "visibility": null,
345
+ "width": null
346
+ }
347
+ },
348
+ "c4aaccc4f20f49e3b4d7de54d0bbac5e": {
349
+ "model_module": "@jupyter-widgets/controls",
350
+ "model_name": "DescriptionStyleModel",
351
+ "model_module_version": "1.5.0",
352
+ "state": {
353
+ "_model_module": "@jupyter-widgets/controls",
354
+ "_model_module_version": "1.5.0",
355
+ "_model_name": "DescriptionStyleModel",
356
+ "_view_count": null,
357
+ "_view_module": "@jupyter-widgets/base",
358
+ "_view_module_version": "1.2.0",
359
+ "_view_name": "StyleView",
360
+ "description_width": ""
361
+ }
362
+ },
363
+ "c64a92c0b83d459ba5d6dfdd33d07fd9": {
364
+ "model_module": "@jupyter-widgets/controls",
365
+ "model_name": "HBoxModel",
366
+ "model_module_version": "1.5.0",
367
+ "state": {
368
+ "_dom_classes": [],
369
+ "_model_module": "@jupyter-widgets/controls",
370
+ "_model_module_version": "1.5.0",
371
+ "_model_name": "HBoxModel",
372
+ "_view_count": null,
373
+ "_view_module": "@jupyter-widgets/controls",
374
+ "_view_module_version": "1.5.0",
375
+ "_view_name": "HBoxView",
376
+ "box_style": "",
377
+ "children": [
378
+ "IPY_MODEL_61efe36d3d544ff8b24c574a886d6b4d",
379
+ "IPY_MODEL_71c7a7dd03f4465d918971420663ae85",
380
+ "IPY_MODEL_e703df0e72494d17b36dd5ae16b81f62"
381
+ ],
382
+ "layout": "IPY_MODEL_bd4c4ec739c6438c8378254f8e7ab336"
383
+ }
384
+ },
385
+ "61efe36d3d544ff8b24c574a886d6b4d": {
386
+ "model_module": "@jupyter-widgets/controls",
387
+ "model_name": "HTMLModel",
388
+ "model_module_version": "1.5.0",
389
+ "state": {
390
+ "_dom_classes": [],
391
+ "_model_module": "@jupyter-widgets/controls",
392
+ "_model_module_version": "1.5.0",
393
+ "_model_name": "HTMLModel",
394
+ "_view_count": null,
395
+ "_view_module": "@jupyter-widgets/controls",
396
+ "_view_module_version": "1.5.0",
397
+ "_view_name": "HTMLView",
398
+ "description": "",
399
+ "description_tooltip": null,
400
+ "layout": "IPY_MODEL_20cd4eb1d09b4a7aa2a1ec67845ff1a4",
401
+ "placeholder": "​",
402
+ "style": "IPY_MODEL_5ee56f6139f94425b26d92dc38c51e26",
403
+ "value": "data/train-00000-of-00001.parquet: 100%"
404
+ }
405
+ },
406
+ "71c7a7dd03f4465d918971420663ae85": {
407
+ "model_module": "@jupyter-widgets/controls",
408
+ "model_name": "FloatProgressModel",
409
+ "model_module_version": "1.5.0",
410
+ "state": {
411
+ "_dom_classes": [],
412
+ "_model_module": "@jupyter-widgets/controls",
413
+ "_model_module_version": "1.5.0",
414
+ "_model_name": "FloatProgressModel",
415
+ "_view_count": null,
416
+ "_view_module": "@jupyter-widgets/controls",
417
+ "_view_module_version": "1.5.0",
418
+ "_view_name": "ProgressView",
419
+ "bar_style": "success",
420
+ "description": "",
421
+ "description_tooltip": null,
422
+ "layout": "IPY_MODEL_d63d63e925e84cafba11798f3a1ae865",
423
+ "max": 104087,
424
+ "min": 0,
425
+ "orientation": "horizontal",
426
+ "style": "IPY_MODEL_5cd6a5b5be5e455b80ada0de6ab928af",
427
+ "value": 104087
428
+ }
429
+ },
430
+ "e703df0e72494d17b36dd5ae16b81f62": {
431
+ "model_module": "@jupyter-widgets/controls",
432
+ "model_name": "HTMLModel",
433
+ "model_module_version": "1.5.0",
434
+ "state": {
435
+ "_dom_classes": [],
436
+ "_model_module": "@jupyter-widgets/controls",
437
+ "_model_module_version": "1.5.0",
438
+ "_model_name": "HTMLModel",
439
+ "_view_count": null,
440
+ "_view_module": "@jupyter-widgets/controls",
441
+ "_view_module_version": "1.5.0",
442
+ "_view_name": "HTMLView",
443
+ "description": "",
444
+ "description_tooltip": null,
445
+ "layout": "IPY_MODEL_ba732f0367c748acadc5363a062adf21",
446
+ "placeholder": "​",
447
+ "style": "IPY_MODEL_8d2ccb7f1e584e15a3b2148db0d60e4d",
448
+ "value": " 104k/104k [00:01<00:00, 522kB/s]"
449
+ }
450
+ },
451
+ "bd4c4ec739c6438c8378254f8e7ab336": {
452
+ "model_module": "@jupyter-widgets/base",
453
+ "model_name": "LayoutModel",
454
+ "model_module_version": "1.2.0",
455
+ "state": {
456
+ "_model_module": "@jupyter-widgets/base",
457
+ "_model_module_version": "1.2.0",
458
+ "_model_name": "LayoutModel",
459
+ "_view_count": null,
460
+ "_view_module": "@jupyter-widgets/base",
461
+ "_view_module_version": "1.2.0",
462
+ "_view_name": "LayoutView",
463
+ "align_content": null,
464
+ "align_items": null,
465
+ "align_self": null,
466
+ "border": null,
467
+ "bottom": null,
468
+ "display": null,
469
+ "flex": null,
470
+ "flex_flow": null,
471
+ "grid_area": null,
472
+ "grid_auto_columns": null,
473
+ "grid_auto_flow": null,
474
+ "grid_auto_rows": null,
475
+ "grid_column": null,
476
+ "grid_gap": null,
477
+ "grid_row": null,
478
+ "grid_template_areas": null,
479
+ "grid_template_columns": null,
480
+ "grid_template_rows": null,
481
+ "height": null,
482
+ "justify_content": null,
483
+ "justify_items": null,
484
+ "left": null,
485
+ "margin": null,
486
+ "max_height": null,
487
+ "max_width": null,
488
+ "min_height": null,
489
+ "min_width": null,
490
+ "object_fit": null,
491
+ "object_position": null,
492
+ "order": null,
493
+ "overflow": null,
494
+ "overflow_x": null,
495
+ "overflow_y": null,
496
+ "padding": null,
497
+ "right": null,
498
+ "top": null,
499
+ "visibility": null,
500
+ "width": null
501
+ }
502
+ },
503
+ "20cd4eb1d09b4a7aa2a1ec67845ff1a4": {
504
+ "model_module": "@jupyter-widgets/base",
505
+ "model_name": "LayoutModel",
506
+ "model_module_version": "1.2.0",
507
+ "state": {
508
+ "_model_module": "@jupyter-widgets/base",
509
+ "_model_module_version": "1.2.0",
510
+ "_model_name": "LayoutModel",
511
+ "_view_count": null,
512
+ "_view_module": "@jupyter-widgets/base",
513
+ "_view_module_version": "1.2.0",
514
+ "_view_name": "LayoutView",
515
+ "align_content": null,
516
+ "align_items": null,
517
+ "align_self": null,
518
+ "border": null,
519
+ "bottom": null,
520
+ "display": null,
521
+ "flex": null,
522
+ "flex_flow": null,
523
+ "grid_area": null,
524
+ "grid_auto_columns": null,
525
+ "grid_auto_flow": null,
526
+ "grid_auto_rows": null,
527
+ "grid_column": null,
528
+ "grid_gap": null,
529
+ "grid_row": null,
530
+ "grid_template_areas": null,
531
+ "grid_template_columns": null,
532
+ "grid_template_rows": null,
533
+ "height": null,
534
+ "justify_content": null,
535
+ "justify_items": null,
536
+ "left": null,
537
+ "margin": null,
538
+ "max_height": null,
539
+ "max_width": null,
540
+ "min_height": null,
541
+ "min_width": null,
542
+ "object_fit": null,
543
+ "object_position": null,
544
+ "order": null,
545
+ "overflow": null,
546
+ "overflow_x": null,
547
+ "overflow_y": null,
548
+ "padding": null,
549
+ "right": null,
550
+ "top": null,
551
+ "visibility": null,
552
+ "width": null
553
+ }
554
+ },
555
+ "5ee56f6139f94425b26d92dc38c51e26": {
556
+ "model_module": "@jupyter-widgets/controls",
557
+ "model_name": "DescriptionStyleModel",
558
+ "model_module_version": "1.5.0",
559
+ "state": {
560
+ "_model_module": "@jupyter-widgets/controls",
561
+ "_model_module_version": "1.5.0",
562
+ "_model_name": "DescriptionStyleModel",
563
+ "_view_count": null,
564
+ "_view_module": "@jupyter-widgets/base",
565
+ "_view_module_version": "1.2.0",
566
+ "_view_name": "StyleView",
567
+ "description_width": ""
568
+ }
569
+ },
570
+ "d63d63e925e84cafba11798f3a1ae865": {
571
+ "model_module": "@jupyter-widgets/base",
572
+ "model_name": "LayoutModel",
573
+ "model_module_version": "1.2.0",
574
+ "state": {
575
+ "_model_module": "@jupyter-widgets/base",
576
+ "_model_module_version": "1.2.0",
577
+ "_model_name": "LayoutModel",
578
+ "_view_count": null,
579
+ "_view_module": "@jupyter-widgets/base",
580
+ "_view_module_version": "1.2.0",
581
+ "_view_name": "LayoutView",
582
+ "align_content": null,
583
+ "align_items": null,
584
+ "align_self": null,
585
+ "border": null,
586
+ "bottom": null,
587
+ "display": null,
588
+ "flex": null,
589
+ "flex_flow": null,
590
+ "grid_area": null,
591
+ "grid_auto_columns": null,
592
+ "grid_auto_flow": null,
593
+ "grid_auto_rows": null,
594
+ "grid_column": null,
595
+ "grid_gap": null,
596
+ "grid_row": null,
597
+ "grid_template_areas": null,
598
+ "grid_template_columns": null,
599
+ "grid_template_rows": null,
600
+ "height": null,
601
+ "justify_content": null,
602
+ "justify_items": null,
603
+ "left": null,
604
+ "margin": null,
605
+ "max_height": null,
606
+ "max_width": null,
607
+ "min_height": null,
608
+ "min_width": null,
609
+ "object_fit": null,
610
+ "object_position": null,
611
+ "order": null,
612
+ "overflow": null,
613
+ "overflow_x": null,
614
+ "overflow_y": null,
615
+ "padding": null,
616
+ "right": null,
617
+ "top": null,
618
+ "visibility": null,
619
+ "width": null
620
+ }
621
+ },
622
+ "5cd6a5b5be5e455b80ada0de6ab928af": {
623
+ "model_module": "@jupyter-widgets/controls",
624
+ "model_name": "ProgressStyleModel",
625
+ "model_module_version": "1.5.0",
626
+ "state": {
627
+ "_model_module": "@jupyter-widgets/controls",
628
+ "_model_module_version": "1.5.0",
629
+ "_model_name": "ProgressStyleModel",
630
+ "_view_count": null,
631
+ "_view_module": "@jupyter-widgets/base",
632
+ "_view_module_version": "1.2.0",
633
+ "_view_name": "StyleView",
634
+ "bar_color": null,
635
+ "description_width": ""
636
+ }
637
+ },
638
+ "ba732f0367c748acadc5363a062adf21": {
639
+ "model_module": "@jupyter-widgets/base",
640
+ "model_name": "LayoutModel",
641
+ "model_module_version": "1.2.0",
642
+ "state": {
643
+ "_model_module": "@jupyter-widgets/base",
644
+ "_model_module_version": "1.2.0",
645
+ "_model_name": "LayoutModel",
646
+ "_view_count": null,
647
+ "_view_module": "@jupyter-widgets/base",
648
+ "_view_module_version": "1.2.0",
649
+ "_view_name": "LayoutView",
650
+ "align_content": null,
651
+ "align_items": null,
652
+ "align_self": null,
653
+ "border": null,
654
+ "bottom": null,
655
+ "display": null,
656
+ "flex": null,
657
+ "flex_flow": null,
658
+ "grid_area": null,
659
+ "grid_auto_columns": null,
660
+ "grid_auto_flow": null,
661
+ "grid_auto_rows": null,
662
+ "grid_column": null,
663
+ "grid_gap": null,
664
+ "grid_row": null,
665
+ "grid_template_areas": null,
666
+ "grid_template_columns": null,
667
+ "grid_template_rows": null,
668
+ "height": null,
669
+ "justify_content": null,
670
+ "justify_items": null,
671
+ "left": null,
672
+ "margin": null,
673
+ "max_height": null,
674
+ "max_width": null,
675
+ "min_height": null,
676
+ "min_width": null,
677
+ "object_fit": null,
678
+ "object_position": null,
679
+ "order": null,
680
+ "overflow": null,
681
+ "overflow_x": null,
682
+ "overflow_y": null,
683
+ "padding": null,
684
+ "right": null,
685
+ "top": null,
686
+ "visibility": null,
687
+ "width": null
688
+ }
689
+ },
690
+ "8d2ccb7f1e584e15a3b2148db0d60e4d": {
691
+ "model_module": "@jupyter-widgets/controls",
692
+ "model_name": "DescriptionStyleModel",
693
+ "model_module_version": "1.5.0",
694
+ "state": {
695
+ "_model_module": "@jupyter-widgets/controls",
696
+ "_model_module_version": "1.5.0",
697
+ "_model_name": "DescriptionStyleModel",
698
+ "_view_count": null,
699
+ "_view_module": "@jupyter-widgets/base",
700
+ "_view_module_version": "1.2.0",
701
+ "_view_name": "StyleView",
702
+ "description_width": ""
703
+ }
704
+ },
705
+ "4f93ab1267b14233a1ae198e7c2ecd5c": {
706
+ "model_module": "@jupyter-widgets/controls",
707
+ "model_name": "HBoxModel",
708
+ "model_module_version": "1.5.0",
709
+ "state": {
710
+ "_dom_classes": [],
711
+ "_model_module": "@jupyter-widgets/controls",
712
+ "_model_module_version": "1.5.0",
713
+ "_model_name": "HBoxModel",
714
+ "_view_count": null,
715
+ "_view_module": "@jupyter-widgets/controls",
716
+ "_view_module_version": "1.5.0",
717
+ "_view_name": "HBoxView",
718
+ "box_style": "",
719
+ "children": [
720
+ "IPY_MODEL_c84b4c2e97ff448c969a7570cf6f86d3",
721
+ "IPY_MODEL_a539bf015f6c4ac7a9ecc0de100c51e7",
722
+ "IPY_MODEL_8470e586075f40b3b7b4e37300e77b9f"
723
+ ],
724
+ "layout": "IPY_MODEL_d4c517a9217c4101a464007f1468fdd8"
725
+ }
726
+ },
727
+ "c84b4c2e97ff448c969a7570cf6f86d3": {
728
+ "model_module": "@jupyter-widgets/controls",
729
+ "model_name": "HTMLModel",
730
+ "model_module_version": "1.5.0",
731
+ "state": {
732
+ "_dom_classes": [],
733
+ "_model_module": "@jupyter-widgets/controls",
734
+ "_model_module_version": "1.5.0",
735
+ "_model_name": "HTMLModel",
736
+ "_view_count": null,
737
+ "_view_module": "@jupyter-widgets/controls",
738
+ "_view_module_version": "1.5.0",
739
+ "_view_name": "HTMLView",
740
+ "description": "",
741
+ "description_tooltip": null,
742
+ "layout": "IPY_MODEL_e2faf98f69a042c3b2daac2f788c47cb",
743
+ "placeholder": "​",
744
+ "style": "IPY_MODEL_5c6d9eab7f274baab1ad1ecb27325d80",
745
+ "value": "data/test-00000-of-00001.parquet: 100%"
746
+ }
747
+ },
748
+ "a539bf015f6c4ac7a9ecc0de100c51e7": {
749
+ "model_module": "@jupyter-widgets/controls",
750
+ "model_name": "FloatProgressModel",
751
+ "model_module_version": "1.5.0",
752
+ "state": {
753
+ "_dom_classes": [],
754
+ "_model_module": "@jupyter-widgets/controls",
755
+ "_model_module_version": "1.5.0",
756
+ "_model_name": "FloatProgressModel",
757
+ "_view_count": null,
758
+ "_view_module": "@jupyter-widgets/controls",
759
+ "_view_module_version": "1.5.0",
760
+ "_view_name": "ProgressView",
761
+ "bar_style": "success",
762
+ "description": "",
763
+ "description_tooltip": null,
764
+ "layout": "IPY_MODEL_203c77d0a03c40ed971b2253baa788aa",
765
+ "max": 80229,
766
+ "min": 0,
767
+ "orientation": "horizontal",
768
+ "style": "IPY_MODEL_2a2fcfbe11c143bb887f8a504c5ba3f8",
769
+ "value": 80229
770
+ }
771
+ },
772
+ "8470e586075f40b3b7b4e37300e77b9f": {
773
+ "model_module": "@jupyter-widgets/controls",
774
+ "model_name": "HTMLModel",
775
+ "model_module_version": "1.5.0",
776
+ "state": {
777
+ "_dom_classes": [],
778
+ "_model_module": "@jupyter-widgets/controls",
779
+ "_model_module_version": "1.5.0",
780
+ "_model_name": "HTMLModel",
781
+ "_view_count": null,
782
+ "_view_module": "@jupyter-widgets/controls",
783
+ "_view_module_version": "1.5.0",
784
+ "_view_name": "HTMLView",
785
+ "description": "",
786
+ "description_tooltip": null,
787
+ "layout": "IPY_MODEL_6bc6f7255a09468f8fe8ff97de810fb0",
788
+ "placeholder": "​",
789
+ "style": "IPY_MODEL_843da01b0ace41eeb12e614a0bc0d5c8",
790
+ "value": " 80.2k/80.2k [00:00<00:00, 352kB/s]"
791
+ }
792
+ },
793
+ "d4c517a9217c4101a464007f1468fdd8": {
794
+ "model_module": "@jupyter-widgets/base",
795
+ "model_name": "LayoutModel",
796
+ "model_module_version": "1.2.0",
797
+ "state": {
798
+ "_model_module": "@jupyter-widgets/base",
799
+ "_model_module_version": "1.2.0",
800
+ "_model_name": "LayoutModel",
801
+ "_view_count": null,
802
+ "_view_module": "@jupyter-widgets/base",
803
+ "_view_module_version": "1.2.0",
804
+ "_view_name": "LayoutView",
805
+ "align_content": null,
806
+ "align_items": null,
807
+ "align_self": null,
808
+ "border": null,
809
+ "bottom": null,
810
+ "display": null,
811
+ "flex": null,
812
+ "flex_flow": null,
813
+ "grid_area": null,
814
+ "grid_auto_columns": null,
815
+ "grid_auto_flow": null,
816
+ "grid_auto_rows": null,
817
+ "grid_column": null,
818
+ "grid_gap": null,
819
+ "grid_row": null,
820
+ "grid_template_areas": null,
821
+ "grid_template_columns": null,
822
+ "grid_template_rows": null,
823
+ "height": null,
824
+ "justify_content": null,
825
+ "justify_items": null,
826
+ "left": null,
827
+ "margin": null,
828
+ "max_height": null,
829
+ "max_width": null,
830
+ "min_height": null,
831
+ "min_width": null,
832
+ "object_fit": null,
833
+ "object_position": null,
834
+ "order": null,
835
+ "overflow": null,
836
+ "overflow_x": null,
837
+ "overflow_y": null,
838
+ "padding": null,
839
+ "right": null,
840
+ "top": null,
841
+ "visibility": null,
842
+ "width": null
843
+ }
844
+ },
845
+ "e2faf98f69a042c3b2daac2f788c47cb": {
846
+ "model_module": "@jupyter-widgets/base",
847
+ "model_name": "LayoutModel",
848
+ "model_module_version": "1.2.0",
849
+ "state": {
850
+ "_model_module": "@jupyter-widgets/base",
851
+ "_model_module_version": "1.2.0",
852
+ "_model_name": "LayoutModel",
853
+ "_view_count": null,
854
+ "_view_module": "@jupyter-widgets/base",
855
+ "_view_module_version": "1.2.0",
856
+ "_view_name": "LayoutView",
857
+ "align_content": null,
858
+ "align_items": null,
859
+ "align_self": null,
860
+ "border": null,
861
+ "bottom": null,
862
+ "display": null,
863
+ "flex": null,
864
+ "flex_flow": null,
865
+ "grid_area": null,
866
+ "grid_auto_columns": null,
867
+ "grid_auto_flow": null,
868
+ "grid_auto_rows": null,
869
+ "grid_column": null,
870
+ "grid_gap": null,
871
+ "grid_row": null,
872
+ "grid_template_areas": null,
873
+ "grid_template_columns": null,
874
+ "grid_template_rows": null,
875
+ "height": null,
876
+ "justify_content": null,
877
+ "justify_items": null,
878
+ "left": null,
879
+ "margin": null,
880
+ "max_height": null,
881
+ "max_width": null,
882
+ "min_height": null,
883
+ "min_width": null,
884
+ "object_fit": null,
885
+ "object_position": null,
886
+ "order": null,
887
+ "overflow": null,
888
+ "overflow_x": null,
889
+ "overflow_y": null,
890
+ "padding": null,
891
+ "right": null,
892
+ "top": null,
893
+ "visibility": null,
894
+ "width": null
895
+ }
896
+ },
897
+ "5c6d9eab7f274baab1ad1ecb27325d80": {
898
+ "model_module": "@jupyter-widgets/controls",
899
+ "model_name": "DescriptionStyleModel",
900
+ "model_module_version": "1.5.0",
901
+ "state": {
902
+ "_model_module": "@jupyter-widgets/controls",
903
+ "_model_module_version": "1.5.0",
904
+ "_model_name": "DescriptionStyleModel",
905
+ "_view_count": null,
906
+ "_view_module": "@jupyter-widgets/base",
907
+ "_view_module_version": "1.2.0",
908
+ "_view_name": "StyleView",
909
+ "description_width": ""
910
+ }
911
+ },
912
+ "203c77d0a03c40ed971b2253baa788aa": {
913
+ "model_module": "@jupyter-widgets/base",
914
+ "model_name": "LayoutModel",
915
+ "model_module_version": "1.2.0",
916
+ "state": {
917
+ "_model_module": "@jupyter-widgets/base",
918
+ "_model_module_version": "1.2.0",
919
+ "_model_name": "LayoutModel",
920
+ "_view_count": null,
921
+ "_view_module": "@jupyter-widgets/base",
922
+ "_view_module_version": "1.2.0",
923
+ "_view_name": "LayoutView",
924
+ "align_content": null,
925
+ "align_items": null,
926
+ "align_self": null,
927
+ "border": null,
928
+ "bottom": null,
929
+ "display": null,
930
+ "flex": null,
931
+ "flex_flow": null,
932
+ "grid_area": null,
933
+ "grid_auto_columns": null,
934
+ "grid_auto_flow": null,
935
+ "grid_auto_rows": null,
936
+ "grid_column": null,
937
+ "grid_gap": null,
938
+ "grid_row": null,
939
+ "grid_template_areas": null,
940
+ "grid_template_columns": null,
941
+ "grid_template_rows": null,
942
+ "height": null,
943
+ "justify_content": null,
944
+ "justify_items": null,
945
+ "left": null,
946
+ "margin": null,
947
+ "max_height": null,
948
+ "max_width": null,
949
+ "min_height": null,
950
+ "min_width": null,
951
+ "object_fit": null,
952
+ "object_position": null,
953
+ "order": null,
954
+ "overflow": null,
955
+ "overflow_x": null,
956
+ "overflow_y": null,
957
+ "padding": null,
958
+ "right": null,
959
+ "top": null,
960
+ "visibility": null,
961
+ "width": null
962
+ }
963
+ },
964
+ "2a2fcfbe11c143bb887f8a504c5ba3f8": {
965
+ "model_module": "@jupyter-widgets/controls",
966
+ "model_name": "ProgressStyleModel",
967
+ "model_module_version": "1.5.0",
968
+ "state": {
969
+ "_model_module": "@jupyter-widgets/controls",
970
+ "_model_module_version": "1.5.0",
971
+ "_model_name": "ProgressStyleModel",
972
+ "_view_count": null,
973
+ "_view_module": "@jupyter-widgets/base",
974
+ "_view_module_version": "1.2.0",
975
+ "_view_name": "StyleView",
976
+ "bar_color": null,
977
+ "description_width": ""
978
+ }
979
+ },
980
+ "6bc6f7255a09468f8fe8ff97de810fb0": {
981
+ "model_module": "@jupyter-widgets/base",
982
+ "model_name": "LayoutModel",
983
+ "model_module_version": "1.2.0",
984
+ "state": {
985
+ "_model_module": "@jupyter-widgets/base",
986
+ "_model_module_version": "1.2.0",
987
+ "_model_name": "LayoutModel",
988
+ "_view_count": null,
989
+ "_view_module": "@jupyter-widgets/base",
990
+ "_view_module_version": "1.2.0",
991
+ "_view_name": "LayoutView",
992
+ "align_content": null,
993
+ "align_items": null,
994
+ "align_self": null,
995
+ "border": null,
996
+ "bottom": null,
997
+ "display": null,
998
+ "flex": null,
999
+ "flex_flow": null,
1000
+ "grid_area": null,
1001
+ "grid_auto_columns": null,
1002
+ "grid_auto_flow": null,
1003
+ "grid_auto_rows": null,
1004
+ "grid_column": null,
1005
+ "grid_gap": null,
1006
+ "grid_row": null,
1007
+ "grid_template_areas": null,
1008
+ "grid_template_columns": null,
1009
+ "grid_template_rows": null,
1010
+ "height": null,
1011
+ "justify_content": null,
1012
+ "justify_items": null,
1013
+ "left": null,
1014
+ "margin": null,
1015
+ "max_height": null,
1016
+ "max_width": null,
1017
+ "min_height": null,
1018
+ "min_width": null,
1019
+ "object_fit": null,
1020
+ "object_position": null,
1021
+ "order": null,
1022
+ "overflow": null,
1023
+ "overflow_x": null,
1024
+ "overflow_y": null,
1025
+ "padding": null,
1026
+ "right": null,
1027
+ "top": null,
1028
+ "visibility": null,
1029
+ "width": null
1030
+ }
1031
+ },
1032
+ "843da01b0ace41eeb12e614a0bc0d5c8": {
1033
+ "model_module": "@jupyter-widgets/controls",
1034
+ "model_name": "DescriptionStyleModel",
1035
+ "model_module_version": "1.5.0",
1036
+ "state": {
1037
+ "_model_module": "@jupyter-widgets/controls",
1038
+ "_model_module_version": "1.5.0",
1039
+ "_model_name": "DescriptionStyleModel",
1040
+ "_view_count": null,
1041
+ "_view_module": "@jupyter-widgets/base",
1042
+ "_view_module_version": "1.2.0",
1043
+ "_view_name": "StyleView",
1044
+ "description_width": ""
1045
+ }
1046
+ },
1047
+ "7a2fbdaf743048cc9890fac92a309b51": {
1048
+ "model_module": "@jupyter-widgets/controls",
1049
+ "model_name": "HBoxModel",
1050
+ "model_module_version": "1.5.0",
1051
+ "state": {
1052
+ "_dom_classes": [],
1053
+ "_model_module": "@jupyter-widgets/controls",
1054
+ "_model_module_version": "1.5.0",
1055
+ "_model_name": "HBoxModel",
1056
+ "_view_count": null,
1057
+ "_view_module": "@jupyter-widgets/controls",
1058
+ "_view_module_version": "1.5.0",
1059
+ "_view_name": "HBoxView",
1060
+ "box_style": "",
1061
+ "children": [
1062
+ "IPY_MODEL_39649e3b4e1545f0b17c662d9e21d4ae",
1063
+ "IPY_MODEL_8b430dd1984a4d73ab55e843987cdd74",
1064
+ "IPY_MODEL_32fb8d18172c4915b01940c1cce352a1"
1065
+ ],
1066
+ "layout": "IPY_MODEL_5e5839cfd3e848648433eea907ecfccb"
1067
+ }
1068
+ },
1069
+ "39649e3b4e1545f0b17c662d9e21d4ae": {
1070
+ "model_module": "@jupyter-widgets/controls",
1071
+ "model_name": "HTMLModel",
1072
+ "model_module_version": "1.5.0",
1073
+ "state": {
1074
+ "_dom_classes": [],
1075
+ "_model_module": "@jupyter-widgets/controls",
1076
+ "_model_module_version": "1.5.0",
1077
+ "_model_name": "HTMLModel",
1078
+ "_view_count": null,
1079
+ "_view_module": "@jupyter-widgets/controls",
1080
+ "_view_module_version": "1.5.0",
1081
+ "_view_name": "HTMLView",
1082
+ "description": "",
1083
+ "description_tooltip": null,
1084
+ "layout": "IPY_MODEL_79e2c8d05dc74be4a0f6db2fb20896da",
1085
+ "placeholder": "​",
1086
+ "style": "IPY_MODEL_bf7ee8b444e745f5b11d47ec880092c1",
1087
+ "value": "Generating train split: 100%"
1088
+ }
1089
+ },
1090
+ "8b430dd1984a4d73ab55e843987cdd74": {
1091
+ "model_module": "@jupyter-widgets/controls",
1092
+ "model_name": "FloatProgressModel",
1093
+ "model_module_version": "1.5.0",
1094
+ "state": {
1095
+ "_dom_classes": [],
1096
+ "_model_module": "@jupyter-widgets/controls",
1097
+ "_model_module_version": "1.5.0",
1098
+ "_model_name": "FloatProgressModel",
1099
+ "_view_count": null,
1100
+ "_view_module": "@jupyter-widgets/controls",
1101
+ "_view_module_version": "1.5.0",
1102
+ "_view_name": "ProgressView",
1103
+ "bar_style": "success",
1104
+ "description": "",
1105
+ "description_tooltip": null,
1106
+ "layout": "IPY_MODEL_c0cf91f4e8da4812a749c7412c832676",
1107
+ "max": 1264,
1108
+ "min": 0,
1109
+ "orientation": "horizontal",
1110
+ "style": "IPY_MODEL_08ef9374c0b1489cb2b8917b7f8ed9d4",
1111
+ "value": 1264
1112
+ }
1113
+ },
1114
+ "32fb8d18172c4915b01940c1cce352a1": {
1115
+ "model_module": "@jupyter-widgets/controls",
1116
+ "model_name": "HTMLModel",
1117
+ "model_module_version": "1.5.0",
1118
+ "state": {
1119
+ "_dom_classes": [],
1120
+ "_model_module": "@jupyter-widgets/controls",
1121
+ "_model_module_version": "1.5.0",
1122
+ "_model_name": "HTMLModel",
1123
+ "_view_count": null,
1124
+ "_view_module": "@jupyter-widgets/controls",
1125
+ "_view_module_version": "1.5.0",
1126
+ "_view_name": "HTMLView",
1127
+ "description": "",
1128
+ "description_tooltip": null,
1129
+ "layout": "IPY_MODEL_e348d1bb481748f4a10faba409be65cc",
1130
+ "placeholder": "​",
1131
+ "style": "IPY_MODEL_66b4d5a473d64502ba3478ff17a627c8",
1132
+ "value": " 1264/1264 [00:00<00:00, 27170.56 examples/s]"
1133
+ }
1134
+ },
1135
+ "5e5839cfd3e848648433eea907ecfccb": {
1136
+ "model_module": "@jupyter-widgets/base",
1137
+ "model_name": "LayoutModel",
1138
+ "model_module_version": "1.2.0",
1139
+ "state": {
1140
+ "_model_module": "@jupyter-widgets/base",
1141
+ "_model_module_version": "1.2.0",
1142
+ "_model_name": "LayoutModel",
1143
+ "_view_count": null,
1144
+ "_view_module": "@jupyter-widgets/base",
1145
+ "_view_module_version": "1.2.0",
1146
+ "_view_name": "LayoutView",
1147
+ "align_content": null,
1148
+ "align_items": null,
1149
+ "align_self": null,
1150
+ "border": null,
1151
+ "bottom": null,
1152
+ "display": null,
1153
+ "flex": null,
1154
+ "flex_flow": null,
1155
+ "grid_area": null,
1156
+ "grid_auto_columns": null,
1157
+ "grid_auto_flow": null,
1158
+ "grid_auto_rows": null,
1159
+ "grid_column": null,
1160
+ "grid_gap": null,
1161
+ "grid_row": null,
1162
+ "grid_template_areas": null,
1163
+ "grid_template_columns": null,
1164
+ "grid_template_rows": null,
1165
+ "height": null,
1166
+ "justify_content": null,
1167
+ "justify_items": null,
1168
+ "left": null,
1169
+ "margin": null,
1170
+ "max_height": null,
1171
+ "max_width": null,
1172
+ "min_height": null,
1173
+ "min_width": null,
1174
+ "object_fit": null,
1175
+ "object_position": null,
1176
+ "order": null,
1177
+ "overflow": null,
1178
+ "overflow_x": null,
1179
+ "overflow_y": null,
1180
+ "padding": null,
1181
+ "right": null,
1182
+ "top": null,
1183
+ "visibility": null,
1184
+ "width": null
1185
+ }
1186
+ },
1187
+ "79e2c8d05dc74be4a0f6db2fb20896da": {
1188
+ "model_module": "@jupyter-widgets/base",
1189
+ "model_name": "LayoutModel",
1190
+ "model_module_version": "1.2.0",
1191
+ "state": {
1192
+ "_model_module": "@jupyter-widgets/base",
1193
+ "_model_module_version": "1.2.0",
1194
+ "_model_name": "LayoutModel",
1195
+ "_view_count": null,
1196
+ "_view_module": "@jupyter-widgets/base",
1197
+ "_view_module_version": "1.2.0",
1198
+ "_view_name": "LayoutView",
1199
+ "align_content": null,
1200
+ "align_items": null,
1201
+ "align_self": null,
1202
+ "border": null,
1203
+ "bottom": null,
1204
+ "display": null,
1205
+ "flex": null,
1206
+ "flex_flow": null,
1207
+ "grid_area": null,
1208
+ "grid_auto_columns": null,
1209
+ "grid_auto_flow": null,
1210
+ "grid_auto_rows": null,
1211
+ "grid_column": null,
1212
+ "grid_gap": null,
1213
+ "grid_row": null,
1214
+ "grid_template_areas": null,
1215
+ "grid_template_columns": null,
1216
+ "grid_template_rows": null,
1217
+ "height": null,
1218
+ "justify_content": null,
1219
+ "justify_items": null,
1220
+ "left": null,
1221
+ "margin": null,
1222
+ "max_height": null,
1223
+ "max_width": null,
1224
+ "min_height": null,
1225
+ "min_width": null,
1226
+ "object_fit": null,
1227
+ "object_position": null,
1228
+ "order": null,
1229
+ "overflow": null,
1230
+ "overflow_x": null,
1231
+ "overflow_y": null,
1232
+ "padding": null,
1233
+ "right": null,
1234
+ "top": null,
1235
+ "visibility": null,
1236
+ "width": null
1237
+ }
1238
+ },
1239
+ "bf7ee8b444e745f5b11d47ec880092c1": {
1240
+ "model_module": "@jupyter-widgets/controls",
1241
+ "model_name": "DescriptionStyleModel",
1242
+ "model_module_version": "1.5.0",
1243
+ "state": {
1244
+ "_model_module": "@jupyter-widgets/controls",
1245
+ "_model_module_version": "1.5.0",
1246
+ "_model_name": "DescriptionStyleModel",
1247
+ "_view_count": null,
1248
+ "_view_module": "@jupyter-widgets/base",
1249
+ "_view_module_version": "1.2.0",
1250
+ "_view_name": "StyleView",
1251
+ "description_width": ""
1252
+ }
1253
+ },
1254
+ "c0cf91f4e8da4812a749c7412c832676": {
1255
+ "model_module": "@jupyter-widgets/base",
1256
+ "model_name": "LayoutModel",
1257
+ "model_module_version": "1.2.0",
1258
+ "state": {
1259
+ "_model_module": "@jupyter-widgets/base",
1260
+ "_model_module_version": "1.2.0",
1261
+ "_model_name": "LayoutModel",
1262
+ "_view_count": null,
1263
+ "_view_module": "@jupyter-widgets/base",
1264
+ "_view_module_version": "1.2.0",
1265
+ "_view_name": "LayoutView",
1266
+ "align_content": null,
1267
+ "align_items": null,
1268
+ "align_self": null,
1269
+ "border": null,
1270
+ "bottom": null,
1271
+ "display": null,
1272
+ "flex": null,
1273
+ "flex_flow": null,
1274
+ "grid_area": null,
1275
+ "grid_auto_columns": null,
1276
+ "grid_auto_flow": null,
1277
+ "grid_auto_rows": null,
1278
+ "grid_column": null,
1279
+ "grid_gap": null,
1280
+ "grid_row": null,
1281
+ "grid_template_areas": null,
1282
+ "grid_template_columns": null,
1283
+ "grid_template_rows": null,
1284
+ "height": null,
1285
+ "justify_content": null,
1286
+ "justify_items": null,
1287
+ "left": null,
1288
+ "margin": null,
1289
+ "max_height": null,
1290
+ "max_width": null,
1291
+ "min_height": null,
1292
+ "min_width": null,
1293
+ "object_fit": null,
1294
+ "object_position": null,
1295
+ "order": null,
1296
+ "overflow": null,
1297
+ "overflow_x": null,
1298
+ "overflow_y": null,
1299
+ "padding": null,
1300
+ "right": null,
1301
+ "top": null,
1302
+ "visibility": null,
1303
+ "width": null
1304
+ }
1305
+ },
1306
+ "08ef9374c0b1489cb2b8917b7f8ed9d4": {
1307
+ "model_module": "@jupyter-widgets/controls",
1308
+ "model_name": "ProgressStyleModel",
1309
+ "model_module_version": "1.5.0",
1310
+ "state": {
1311
+ "_model_module": "@jupyter-widgets/controls",
1312
+ "_model_module_version": "1.5.0",
1313
+ "_model_name": "ProgressStyleModel",
1314
+ "_view_count": null,
1315
+ "_view_module": "@jupyter-widgets/base",
1316
+ "_view_module_version": "1.2.0",
1317
+ "_view_name": "StyleView",
1318
+ "bar_color": null,
1319
+ "description_width": ""
1320
+ }
1321
+ },
1322
+ "e348d1bb481748f4a10faba409be65cc": {
1323
+ "model_module": "@jupyter-widgets/base",
1324
+ "model_name": "LayoutModel",
1325
+ "model_module_version": "1.2.0",
1326
+ "state": {
1327
+ "_model_module": "@jupyter-widgets/base",
1328
+ "_model_module_version": "1.2.0",
1329
+ "_model_name": "LayoutModel",
1330
+ "_view_count": null,
1331
+ "_view_module": "@jupyter-widgets/base",
1332
+ "_view_module_version": "1.2.0",
1333
+ "_view_name": "LayoutView",
1334
+ "align_content": null,
1335
+ "align_items": null,
1336
+ "align_self": null,
1337
+ "border": null,
1338
+ "bottom": null,
1339
+ "display": null,
1340
+ "flex": null,
1341
+ "flex_flow": null,
1342
+ "grid_area": null,
1343
+ "grid_auto_columns": null,
1344
+ "grid_auto_flow": null,
1345
+ "grid_auto_rows": null,
1346
+ "grid_column": null,
1347
+ "grid_gap": null,
1348
+ "grid_row": null,
1349
+ "grid_template_areas": null,
1350
+ "grid_template_columns": null,
1351
+ "grid_template_rows": null,
1352
+ "height": null,
1353
+ "justify_content": null,
1354
+ "justify_items": null,
1355
+ "left": null,
1356
+ "margin": null,
1357
+ "max_height": null,
1358
+ "max_width": null,
1359
+ "min_height": null,
1360
+ "min_width": null,
1361
+ "object_fit": null,
1362
+ "object_position": null,
1363
+ "order": null,
1364
+ "overflow": null,
1365
+ "overflow_x": null,
1366
+ "overflow_y": null,
1367
+ "padding": null,
1368
+ "right": null,
1369
+ "top": null,
1370
+ "visibility": null,
1371
+ "width": null
1372
+ }
1373
+ },
1374
+ "66b4d5a473d64502ba3478ff17a627c8": {
1375
+ "model_module": "@jupyter-widgets/controls",
1376
+ "model_name": "DescriptionStyleModel",
1377
+ "model_module_version": "1.5.0",
1378
+ "state": {
1379
+ "_model_module": "@jupyter-widgets/controls",
1380
+ "_model_module_version": "1.5.0",
1381
+ "_model_name": "DescriptionStyleModel",
1382
+ "_view_count": null,
1383
+ "_view_module": "@jupyter-widgets/base",
1384
+ "_view_module_version": "1.2.0",
1385
+ "_view_name": "StyleView",
1386
+ "description_width": ""
1387
+ }
1388
+ },
1389
+ "0539c465b9cd4f2490c42a012793be5c": {
1390
+ "model_module": "@jupyter-widgets/controls",
1391
+ "model_name": "HBoxModel",
1392
+ "model_module_version": "1.5.0",
1393
+ "state": {
1394
+ "_dom_classes": [],
1395
+ "_model_module": "@jupyter-widgets/controls",
1396
+ "_model_module_version": "1.5.0",
1397
+ "_model_name": "HBoxModel",
1398
+ "_view_count": null,
1399
+ "_view_module": "@jupyter-widgets/controls",
1400
+ "_view_module_version": "1.5.0",
1401
+ "_view_name": "HBoxView",
1402
+ "box_style": "",
1403
+ "children": [
1404
+ "IPY_MODEL_6a09379914184e159bd6dcc14c7fad20",
1405
+ "IPY_MODEL_0be257696fd54effbfaee7ea65e1fa2a",
1406
+ "IPY_MODEL_b6c225b6287d43298d340e98a8b8fcf4"
1407
+ ],
1408
+ "layout": "IPY_MODEL_3713ff61ec564ce9ab99ed0187f3fef6"
1409
+ }
1410
+ },
1411
+ "6a09379914184e159bd6dcc14c7fad20": {
1412
+ "model_module": "@jupyter-widgets/controls",
1413
+ "model_name": "HTMLModel",
1414
+ "model_module_version": "1.5.0",
1415
+ "state": {
1416
+ "_dom_classes": [],
1417
+ "_model_module": "@jupyter-widgets/controls",
1418
+ "_model_module_version": "1.5.0",
1419
+ "_model_name": "HTMLModel",
1420
+ "_view_count": null,
1421
+ "_view_module": "@jupyter-widgets/controls",
1422
+ "_view_module_version": "1.5.0",
1423
+ "_view_name": "HTMLView",
1424
+ "description": "",
1425
+ "description_tooltip": null,
1426
+ "layout": "IPY_MODEL_cb2648aa76424c3cbe670910cfd040dd",
1427
+ "placeholder": "​",
1428
+ "style": "IPY_MODEL_e2f4be05865a4b939179e364339ff1ac",
1429
+ "value": "Generating test split: 100%"
1430
+ }
1431
+ },
1432
+ "0be257696fd54effbfaee7ea65e1fa2a": {
1433
+ "model_module": "@jupyter-widgets/controls",
1434
+ "model_name": "FloatProgressModel",
1435
+ "model_module_version": "1.5.0",
1436
+ "state": {
1437
+ "_dom_classes": [],
1438
+ "_model_module": "@jupyter-widgets/controls",
1439
+ "_model_module_version": "1.5.0",
1440
+ "_model_name": "FloatProgressModel",
1441
+ "_view_count": null,
1442
+ "_view_module": "@jupyter-widgets/controls",
1443
+ "_view_module_version": "1.5.0",
1444
+ "_view_name": "ProgressView",
1445
+ "bar_style": "success",
1446
+ "description": "",
1447
+ "description_tooltip": null,
1448
+ "layout": "IPY_MODEL_b8c2e80e3b4b4c5ba3defff623a1ccb5",
1449
+ "max": 1000,
1450
+ "min": 0,
1451
+ "orientation": "horizontal",
1452
+ "style": "IPY_MODEL_6ed2b29a5589438cb04540bd788dbd1c",
1453
+ "value": 1000
1454
+ }
1455
+ },
1456
+ "b6c225b6287d43298d340e98a8b8fcf4": {
1457
+ "model_module": "@jupyter-widgets/controls",
1458
+ "model_name": "HTMLModel",
1459
+ "model_module_version": "1.5.0",
1460
+ "state": {
1461
+ "_dom_classes": [],
1462
+ "_model_module": "@jupyter-widgets/controls",
1463
+ "_model_module_version": "1.5.0",
1464
+ "_model_name": "HTMLModel",
1465
+ "_view_count": null,
1466
+ "_view_module": "@jupyter-widgets/controls",
1467
+ "_view_module_version": "1.5.0",
1468
+ "_view_name": "HTMLView",
1469
+ "description": "",
1470
+ "description_tooltip": null,
1471
+ "layout": "IPY_MODEL_118f1430f31645de971204222866b537",
1472
+ "placeholder": "​",
1473
+ "style": "IPY_MODEL_0693cdf6b0144461b0173c4330e3db70",
1474
+ "value": " 1000/1000 [00:00<00:00, 40668.88 examples/s]"
1475
+ }
1476
+ },
1477
+ "3713ff61ec564ce9ab99ed0187f3fef6": {
1478
+ "model_module": "@jupyter-widgets/base",
1479
+ "model_name": "LayoutModel",
1480
+ "model_module_version": "1.2.0",
1481
+ "state": {
1482
+ "_model_module": "@jupyter-widgets/base",
1483
+ "_model_module_version": "1.2.0",
1484
+ "_model_name": "LayoutModel",
1485
+ "_view_count": null,
1486
+ "_view_module": "@jupyter-widgets/base",
1487
+ "_view_module_version": "1.2.0",
1488
+ "_view_name": "LayoutView",
1489
+ "align_content": null,
1490
+ "align_items": null,
1491
+ "align_self": null,
1492
+ "border": null,
1493
+ "bottom": null,
1494
+ "display": null,
1495
+ "flex": null,
1496
+ "flex_flow": null,
1497
+ "grid_area": null,
1498
+ "grid_auto_columns": null,
1499
+ "grid_auto_flow": null,
1500
+ "grid_auto_rows": null,
1501
+ "grid_column": null,
1502
+ "grid_gap": null,
1503
+ "grid_row": null,
1504
+ "grid_template_areas": null,
1505
+ "grid_template_columns": null,
1506
+ "grid_template_rows": null,
1507
+ "height": null,
1508
+ "justify_content": null,
1509
+ "justify_items": null,
1510
+ "left": null,
1511
+ "margin": null,
1512
+ "max_height": null,
1513
+ "max_width": null,
1514
+ "min_height": null,
1515
+ "min_width": null,
1516
+ "object_fit": null,
1517
+ "object_position": null,
1518
+ "order": null,
1519
+ "overflow": null,
1520
+ "overflow_x": null,
1521
+ "overflow_y": null,
1522
+ "padding": null,
1523
+ "right": null,
1524
+ "top": null,
1525
+ "visibility": null,
1526
+ "width": null
1527
+ }
1528
+ },
1529
+ "cb2648aa76424c3cbe670910cfd040dd": {
1530
+ "model_module": "@jupyter-widgets/base",
1531
+ "model_name": "LayoutModel",
1532
+ "model_module_version": "1.2.0",
1533
+ "state": {
1534
+ "_model_module": "@jupyter-widgets/base",
1535
+ "_model_module_version": "1.2.0",
1536
+ "_model_name": "LayoutModel",
1537
+ "_view_count": null,
1538
+ "_view_module": "@jupyter-widgets/base",
1539
+ "_view_module_version": "1.2.0",
1540
+ "_view_name": "LayoutView",
1541
+ "align_content": null,
1542
+ "align_items": null,
1543
+ "align_self": null,
1544
+ "border": null,
1545
+ "bottom": null,
1546
+ "display": null,
1547
+ "flex": null,
1548
+ "flex_flow": null,
1549
+ "grid_area": null,
1550
+ "grid_auto_columns": null,
1551
+ "grid_auto_flow": null,
1552
+ "grid_auto_rows": null,
1553
+ "grid_column": null,
1554
+ "grid_gap": null,
1555
+ "grid_row": null,
1556
+ "grid_template_areas": null,
1557
+ "grid_template_columns": null,
1558
+ "grid_template_rows": null,
1559
+ "height": null,
1560
+ "justify_content": null,
1561
+ "justify_items": null,
1562
+ "left": null,
1563
+ "margin": null,
1564
+ "max_height": null,
1565
+ "max_width": null,
1566
+ "min_height": null,
1567
+ "min_width": null,
1568
+ "object_fit": null,
1569
+ "object_position": null,
1570
+ "order": null,
1571
+ "overflow": null,
1572
+ "overflow_x": null,
1573
+ "overflow_y": null,
1574
+ "padding": null,
1575
+ "right": null,
1576
+ "top": null,
1577
+ "visibility": null,
1578
+ "width": null
1579
+ }
1580
+ },
1581
+ "e2f4be05865a4b939179e364339ff1ac": {
1582
+ "model_module": "@jupyter-widgets/controls",
1583
+ "model_name": "DescriptionStyleModel",
1584
+ "model_module_version": "1.5.0",
1585
+ "state": {
1586
+ "_model_module": "@jupyter-widgets/controls",
1587
+ "_model_module_version": "1.5.0",
1588
+ "_model_name": "DescriptionStyleModel",
1589
+ "_view_count": null,
1590
+ "_view_module": "@jupyter-widgets/base",
1591
+ "_view_module_version": "1.2.0",
1592
+ "_view_name": "StyleView",
1593
+ "description_width": ""
1594
+ }
1595
+ },
1596
+ "b8c2e80e3b4b4c5ba3defff623a1ccb5": {
1597
+ "model_module": "@jupyter-widgets/base",
1598
+ "model_name": "LayoutModel",
1599
+ "model_module_version": "1.2.0",
1600
+ "state": {
1601
+ "_model_module": "@jupyter-widgets/base",
1602
+ "_model_module_version": "1.2.0",
1603
+ "_model_name": "LayoutModel",
1604
+ "_view_count": null,
1605
+ "_view_module": "@jupyter-widgets/base",
1606
+ "_view_module_version": "1.2.0",
1607
+ "_view_name": "LayoutView",
1608
+ "align_content": null,
1609
+ "align_items": null,
1610
+ "align_self": null,
1611
+ "border": null,
1612
+ "bottom": null,
1613
+ "display": null,
1614
+ "flex": null,
1615
+ "flex_flow": null,
1616
+ "grid_area": null,
1617
+ "grid_auto_columns": null,
1618
+ "grid_auto_flow": null,
1619
+ "grid_auto_rows": null,
1620
+ "grid_column": null,
1621
+ "grid_gap": null,
1622
+ "grid_row": null,
1623
+ "grid_template_areas": null,
1624
+ "grid_template_columns": null,
1625
+ "grid_template_rows": null,
1626
+ "height": null,
1627
+ "justify_content": null,
1628
+ "justify_items": null,
1629
+ "left": null,
1630
+ "margin": null,
1631
+ "max_height": null,
1632
+ "max_width": null,
1633
+ "min_height": null,
1634
+ "min_width": null,
1635
+ "object_fit": null,
1636
+ "object_position": null,
1637
+ "order": null,
1638
+ "overflow": null,
1639
+ "overflow_x": null,
1640
+ "overflow_y": null,
1641
+ "padding": null,
1642
+ "right": null,
1643
+ "top": null,
1644
+ "visibility": null,
1645
+ "width": null
1646
+ }
1647
+ },
1648
+ "6ed2b29a5589438cb04540bd788dbd1c": {
1649
+ "model_module": "@jupyter-widgets/controls",
1650
+ "model_name": "ProgressStyleModel",
1651
+ "model_module_version": "1.5.0",
1652
+ "state": {
1653
+ "_model_module": "@jupyter-widgets/controls",
1654
+ "_model_module_version": "1.5.0",
1655
+ "_model_name": "ProgressStyleModel",
1656
+ "_view_count": null,
1657
+ "_view_module": "@jupyter-widgets/base",
1658
+ "_view_module_version": "1.2.0",
1659
+ "_view_name": "StyleView",
1660
+ "bar_color": null,
1661
+ "description_width": ""
1662
+ }
1663
+ },
1664
+ "118f1430f31645de971204222866b537": {
1665
+ "model_module": "@jupyter-widgets/base",
1666
+ "model_name": "LayoutModel",
1667
+ "model_module_version": "1.2.0",
1668
+ "state": {
1669
+ "_model_module": "@jupyter-widgets/base",
1670
+ "_model_module_version": "1.2.0",
1671
+ "_model_name": "LayoutModel",
1672
+ "_view_count": null,
1673
+ "_view_module": "@jupyter-widgets/base",
1674
+ "_view_module_version": "1.2.0",
1675
+ "_view_name": "LayoutView",
1676
+ "align_content": null,
1677
+ "align_items": null,
1678
+ "align_self": null,
1679
+ "border": null,
1680
+ "bottom": null,
1681
+ "display": null,
1682
+ "flex": null,
1683
+ "flex_flow": null,
1684
+ "grid_area": null,
1685
+ "grid_auto_columns": null,
1686
+ "grid_auto_flow": null,
1687
+ "grid_auto_rows": null,
1688
+ "grid_column": null,
1689
+ "grid_gap": null,
1690
+ "grid_row": null,
1691
+ "grid_template_areas": null,
1692
+ "grid_template_columns": null,
1693
+ "grid_template_rows": null,
1694
+ "height": null,
1695
+ "justify_content": null,
1696
+ "justify_items": null,
1697
+ "left": null,
1698
+ "margin": null,
1699
+ "max_height": null,
1700
+ "max_width": null,
1701
+ "min_height": null,
1702
+ "min_width": null,
1703
+ "object_fit": null,
1704
+ "object_position": null,
1705
+ "order": null,
1706
+ "overflow": null,
1707
+ "overflow_x": null,
1708
+ "overflow_y": null,
1709
+ "padding": null,
1710
+ "right": null,
1711
+ "top": null,
1712
+ "visibility": null,
1713
+ "width": null
1714
+ }
1715
+ },
1716
+ "0693cdf6b0144461b0173c4330e3db70": {
1717
+ "model_module": "@jupyter-widgets/controls",
1718
+ "model_name": "DescriptionStyleModel",
1719
+ "model_module_version": "1.5.0",
1720
+ "state": {
1721
+ "_model_module": "@jupyter-widgets/controls",
1722
+ "_model_module_version": "1.5.0",
1723
+ "_model_name": "DescriptionStyleModel",
1724
+ "_view_count": null,
1725
+ "_view_module": "@jupyter-widgets/base",
1726
+ "_view_module_version": "1.2.0",
1727
+ "_view_name": "StyleView",
1728
+ "description_width": ""
1729
+ }
1730
+ }
1731
+ }
1732
+ }
1733
+ },
1734
+ "cells": [
1735
+ {
1736
+ "cell_type": "markdown",
1737
+ "metadata": {
1738
+ "id": "view-in-github",
1739
+ "colab_type": "text"
1740
+ },
1741
+ "source": [
1742
+ "<a href=\"https://colab.research.google.com/github/mobadara/finbert-sentiment-analyzer-api/blob/main/notebooks/data_exploration.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
1743
+ ]
1744
+ },
1745
+ {
1746
+ "cell_type": "markdown",
1747
+ "source": [
1748
+ "# 📊 Exploratory Data Analysis (EDA): Financial PhraseBank\n",
1749
+ "\n",
1750
+ "**Project:** Full-Stack Financial NLP Pipeline (Part 1: Data Exploration) \n",
1751
+ "**Environment:** Google Colab (Standard/CPU or T4 GPU) \n",
1752
+ "**Dataset:** [Financial PhraseBank (Hugging Face)](https://huggingface.co/datasets/financial_phrasebank)\n",
1753
+ "\n",
1754
+ "---\n",
1755
+ "\n",
1756
+ "## 🎯 Notebook Objective\n",
1757
+ "\n",
1758
+ "1. **Analysis of Class Distribution:** How imbalanced is our dataset? We will measure the proportion of Bullish (positive), Bearish (negative), and Neutral headlines.\n",
1759
+ "\n",
1760
+ "2. **Token Length and Padding:** Because transformer models such as BERT have a fixed maximum sequence length (typically 512 tokens), we will measure the character and word counts of our sentences to choose a suitable max_length for the tokenizer and so limit padding and GPU memory usage during training.\n",
1761
+ "\n",
1762
+ "3. **Linguistic Profiling:** We will analyse which financial keywords and n-grams are most common in each sentiment class to understand how the expert annotators labelled the text.\n",
1763
+ "\n",
1764
+ "4. **Engineering the Data:** We will finalise the dataset's structure (handling any null or anomalous values) so that it can be ingested by our custom PyTorch DataLoader in the next phase.\n"
1765
+ ],
1766
+ "metadata": {
1767
+ "id": "Zu3lxShaUmua"
1768
+ }
1769
+ },
1770
+ {
1771
+ "cell_type": "markdown",
1772
+ "source": [
1773
+ "## Setup"
1774
+ ],
1775
+ "metadata": {
1776
+ "id": "n77liDziXkPc"
1777
+ }
1778
+ },
1779
+ {
1780
+ "cell_type": "code",
1781
+ "execution_count": 13,
1782
+ "metadata": {
1783
+ "id": "cIdc7O5zRDpG"
1784
+ },
1785
+ "outputs": [],
1786
+ "source": [
1787
+ "import pandas as pd\n",
1788
+ "from datasets import load_dataset"
1789
+ ]
1790
+ },
1791
+ {
1792
+ "cell_type": "code",
1793
+ "source": [
1794
+ "dataset = load_dataset(\"FinanceMTEB/financial_phrasebank\")"
1795
+ ],
1796
+ "metadata": {
1797
+ "id": "Pb9met8DX5nw",
1798
+ "outputId": "6425acb7-d76b-4fd7-f0ea-a6241b7dabbb",
1799
+ "colab": {
1800
+ "base_uri": "https://localhost:8080/",
1801
+ "height": 177,
1802
+ "referenced_widgets": [
1803
+ "33c62285e7a14f46b6a678a5ee90cc1d",
1804
+ "5cf6d55d9eb0422babce34b9d7cce86a",
1805
+ "b71f5f547f36428881401143783e1cda",
1806
+ "1d48279d16634f1eaac89219ccdff842",
1807
+ "cb018f5627d6435bb32d53d99d3718ba",
1808
+ "a4ee776b44964e54b006f902a472ffaf",
1809
+ "576389aef48c4c63806322f9ef2f1188",
1810
+ "1ef978469ea441058f50d59aae5c35ce",
1811
+ "638b61a4e2bc4682b81507c08178e413",
1812
+ "b44ee4d8a78d454c9c1251cef4ef0a7f",
1813
+ "c4aaccc4f20f49e3b4d7de54d0bbac5e",
1814
+ "c64a92c0b83d459ba5d6dfdd33d07fd9",
1815
+ "61efe36d3d544ff8b24c574a886d6b4d",
1816
+ "71c7a7dd03f4465d918971420663ae85",
1817
+ "e703df0e72494d17b36dd5ae16b81f62",
1818
+ "bd4c4ec739c6438c8378254f8e7ab336",
1819
+ "20cd4eb1d09b4a7aa2a1ec67845ff1a4",
1820
+ "5ee56f6139f94425b26d92dc38c51e26",
1821
+ "d63d63e925e84cafba11798f3a1ae865",
1822
+ "5cd6a5b5be5e455b80ada0de6ab928af",
1823
+ "ba732f0367c748acadc5363a062adf21",
1824
+ "8d2ccb7f1e584e15a3b2148db0d60e4d",
1825
+ "4f93ab1267b14233a1ae198e7c2ecd5c",
1826
+ "c84b4c2e97ff448c969a7570cf6f86d3",
1827
+ "a539bf015f6c4ac7a9ecc0de100c51e7",
1828
+ "8470e586075f40b3b7b4e37300e77b9f",
1829
+ "d4c517a9217c4101a464007f1468fdd8",
1830
+ "e2faf98f69a042c3b2daac2f788c47cb",
1831
+ "5c6d9eab7f274baab1ad1ecb27325d80",
1832
+ "203c77d0a03c40ed971b2253baa788aa",
1833
+ "2a2fcfbe11c143bb887f8a504c5ba3f8",
1834
+ "6bc6f7255a09468f8fe8ff97de810fb0",
1835
+ "843da01b0ace41eeb12e614a0bc0d5c8",
1836
+ "7a2fbdaf743048cc9890fac92a309b51",
1837
+ "39649e3b4e1545f0b17c662d9e21d4ae",
1838
+ "8b430dd1984a4d73ab55e843987cdd74",
1839
+ "32fb8d18172c4915b01940c1cce352a1",
1840
+ "5e5839cfd3e848648433eea907ecfccb",
1841
+ "79e2c8d05dc74be4a0f6db2fb20896da",
1842
+ "bf7ee8b444e745f5b11d47ec880092c1",
1843
+ "c0cf91f4e8da4812a749c7412c832676",
1844
+ "08ef9374c0b1489cb2b8917b7f8ed9d4",
1845
+ "e348d1bb481748f4a10faba409be65cc",
1846
+ "66b4d5a473d64502ba3478ff17a627c8",
1847
+ "0539c465b9cd4f2490c42a012793be5c",
1848
+ "6a09379914184e159bd6dcc14c7fad20",
1849
+ "0be257696fd54effbfaee7ea65e1fa2a",
1850
+ "b6c225b6287d43298d340e98a8b8fcf4",
1851
+ "3713ff61ec564ce9ab99ed0187f3fef6",
1852
+ "cb2648aa76424c3cbe670910cfd040dd",
1853
+ "e2f4be05865a4b939179e364339ff1ac",
1854
+ "b8c2e80e3b4b4c5ba3defff623a1ccb5",
1855
+ "6ed2b29a5589438cb04540bd788dbd1c",
1856
+ "118f1430f31645de971204222866b537",
1857
+ "0693cdf6b0144461b0173c4330e3db70"
1858
+ ]
1859
+ }
1860
+ },
1861
+ "execution_count": 7,
1862
+ "outputs": [
1863
+ {
1864
+ "output_type": "display_data",
1865
+ "data": {
1866
+ "text/plain": [
1867
+ "README.md: 0%| | 0.00/465 [00:00<?, ?B/s]"
1868
+ ],
1869
+ "application/vnd.jupyter.widget-view+json": {
1870
+ "version_major": 2,
1871
+ "version_minor": 0,
1872
+ "model_id": "33c62285e7a14f46b6a678a5ee90cc1d"
1873
+ }
1874
+ },
1875
+ "metadata": {}
1876
+ },
1877
+ {
1878
+ "output_type": "display_data",
1879
+ "data": {
1880
+ "text/plain": [
1881
+ "data/train-00000-of-00001.parquet: 0%| | 0.00/104k [00:00<?, ?B/s]"
1882
+ ],
1883
+ "application/vnd.jupyter.widget-view+json": {
1884
+ "version_major": 2,
1885
+ "version_minor": 0,
1886
+ "model_id": "c64a92c0b83d459ba5d6dfdd33d07fd9"
1887
+ }
1888
+ },
1889
+ "metadata": {}
1890
+ },
1891
+ {
1892
+ "output_type": "display_data",
1893
+ "data": {
1894
+ "text/plain": [
1895
+ "data/test-00000-of-00001.parquet: 0%| | 0.00/80.2k [00:00<?, ?B/s]"
1896
+ ],
1897
+ "application/vnd.jupyter.widget-view+json": {
1898
+ "version_major": 2,
1899
+ "version_minor": 0,
1900
+ "model_id": "4f93ab1267b14233a1ae198e7c2ecd5c"
1901
+ }
1902
+ },
1903
+ "metadata": {}
1904
+ },
1905
+ {
1906
+ "output_type": "display_data",
1907
+ "data": {
1908
+ "text/plain": [
1909
+ "Generating train split: 0%| | 0/1264 [00:00<?, ? examples/s]"
1910
+ ],
1911
+ "application/vnd.jupyter.widget-view+json": {
1912
+ "version_major": 2,
1913
+ "version_minor": 0,
1914
+ "model_id": "7a2fbdaf743048cc9890fac92a309b51"
1915
+ }
1916
+ },
1917
+ "metadata": {}
1918
+ },
1919
+ {
1920
+ "output_type": "display_data",
1921
+ "data": {
1922
+ "text/plain": [
1923
+ "Generating test split: 0%| | 0/1000 [00:00<?, ? examples/s]"
1924
+ ],
1925
+ "application/vnd.jupyter.widget-view+json": {
1926
+ "version_major": 2,
1927
+ "version_minor": 0,
1928
+ "model_id": "0539c465b9cd4f2490c42a012793be5c"
1929
+ }
1930
+ },
1931
+ "metadata": {}
1932
+ }
1933
+ ]
1934
+ },
1935
+ {
1936
+ "cell_type": "code",
1937
+ "source": [
1938
+ "sample_data = dataset[\"train\"].to_pandas()\n",
1939
+ "sample_data.head()"
1940
+ ],
1941
+ "metadata": {
1942
+ "id": "Xy9Cn6qhYj77",
1943
+ "outputId": "1f663d1a-f59c-4334-eac6-47fc3ed5f119",
1944
+ "colab": {
1945
+ "base_uri": "https://localhost:8080/",
1946
+ "height": 206
1947
+ }
1948
+ },
1949
+ "execution_count": 14,
1950
+ "outputs": [
1951
+ {
1952
+ "output_type": "execute_result",
1953
+ "data": {
1954
+ "text/plain": [
1955
+ " text label_text label\n",
1956
+ "0 The Samsung Mobile Applications Store was laun... neutral 1\n",
1957
+ "1 F-Secure , a developer of security solutions a... neutral 1\n",
1958
+ "2 The company serves customers in various indust... neutral 1\n",
1959
+ "3 The company reported net sales of 302 mln euro... neutral 1\n",
1960
+ "4 Microsoft last week also issued the first patc... neutral 1"
1961
+ ],
1962
+ "text/html": [
1963
+ "\n",
1964
+ " <div id=\"df-2c74413c-185d-4488-9887-9b64d7c15536\" class=\"colab-df-container\">\n",
1965
+ " <div>\n",
1966
+ "<style scoped>\n",
1967
+ " .dataframe tbody tr th:only-of-type {\n",
1968
+ " vertical-align: middle;\n",
1969
+ " }\n",
1970
+ "\n",
1971
+ " .dataframe tbody tr th {\n",
1972
+ " vertical-align: top;\n",
1973
+ " }\n",
1974
+ "\n",
1975
+ " .dataframe thead th {\n",
1976
+ " text-align: right;\n",
1977
+ " }\n",
1978
+ "</style>\n",
1979
+ "<table border=\"1\" class=\"dataframe\">\n",
1980
+ " <thead>\n",
1981
+ " <tr style=\"text-align: right;\">\n",
1982
+ " <th></th>\n",
1983
+ " <th>text</th>\n",
1984
+ " <th>label_text</th>\n",
1985
+ " <th>label</th>\n",
1986
+ " </tr>\n",
1987
+ " </thead>\n",
1988
+ " <tbody>\n",
1989
+ " <tr>\n",
1990
+ " <th>0</th>\n",
1991
+ " <td>The Samsung Mobile Applications Store was laun...</td>\n",
1992
+ " <td>neutral</td>\n",
1993
+ " <td>1</td>\n",
1994
+ " </tr>\n",
1995
+ " <tr>\n",
1996
+ " <th>1</th>\n",
1997
+ " <td>F-Secure , a developer of security solutions a...</td>\n",
1998
+ " <td>neutral</td>\n",
1999
+ " <td>1</td>\n",
2000
+ " </tr>\n",
2001
+ " <tr>\n",
2002
+ " <th>2</th>\n",
2003
+ " <td>The company serves customers in various indust...</td>\n",
2004
+ " <td>neutral</td>\n",
2005
+ " <td>1</td>\n",
2006
+ " </tr>\n",
2007
+ " <tr>\n",
2008
+ " <th>3</th>\n",
2009
+ " <td>The company reported net sales of 302 mln euro...</td>\n",
2010
+ " <td>neutral</td>\n",
2011
+ " <td>1</td>\n",
2012
+ " </tr>\n",
2013
+ " <tr>\n",
2014
+ " <th>4</th>\n",
2015
+ " <td>Microsoft last week also issued the first patc...</td>\n",
2016
+ " <td>neutral</td>\n",
2017
+ " <td>1</td>\n",
2018
+ " </tr>\n",
2019
+ " </tbody>\n",
2020
+ "</table>\n",
2021
+ "</div>\n",
2022
+ " <div class=\"colab-df-buttons\">\n",
2023
+ "\n",
2024
+ " <div class=\"colab-df-container\">\n",
2025
+ " <button class=\"colab-df-convert\" onclick=\"convertToInteractive('df-2c74413c-185d-4488-9887-9b64d7c15536')\"\n",
2026
+ " title=\"Convert this dataframe to an interactive table.\"\n",
2027
+ " style=\"display:none;\">\n",
2028
+ "\n",
2029
+ " <svg xmlns=\"http://www.w3.org/2000/svg\" height=\"24px\" viewBox=\"0 -960 960 960\">\n",
2030
+ " <path d=\"M120-120v-720h720v720H120Zm60-500h600v-160H180v160Zm220 220h160v-160H400v160Zm0 220h160v-160H400v160ZM180-400h160v-160H180v160Zm440 0h160v-160H620v160ZM180-180h160v-160H180v160Zm440 0h160v-160H620v160Z\"/>\n",
2031
+ " </svg>\n",
2032
+ " </button>\n",
2033
+ "\n",
2034
+ " <style>\n",
2035
+ " .colab-df-container {\n",
2036
+ " display:flex;\n",
2037
+ " gap: 12px;\n",
2038
+ " }\n",
2039
+ "\n",
2040
+ " .colab-df-convert {\n",
2041
+ " background-color: #E8F0FE;\n",
2042
+ " border: none;\n",
2043
+ " border-radius: 50%;\n",
2044
+ " cursor: pointer;\n",
2045
+ " display: none;\n",
2046
+ " fill: #1967D2;\n",
2047
+ " height: 32px;\n",
2048
+ " padding: 0 0 0 0;\n",
2049
+ " width: 32px;\n",
2050
+ " }\n",
2051
+ "\n",
2052
+ " .colab-df-convert:hover {\n",
2053
+ " background-color: #E2EBFA;\n",
2054
+ " box-shadow: 0px 1px 2px rgba(60, 64, 67, 0.3), 0px 1px 3px 1px rgba(60, 64, 67, 0.15);\n",
2055
+ " fill: #174EA6;\n",
2056
+ " }\n",
2057
+ "\n",
2058
+ " .colab-df-buttons div {\n",
2059
+ " margin-bottom: 4px;\n",
2060
+ " }\n",
2061
+ "\n",
2062
+ " [theme=dark] .colab-df-convert {\n",
2063
+ " background-color: #3B4455;\n",
2064
+ " fill: #D2E3FC;\n",
2065
+ " }\n",
2066
+ "\n",
2067
+ " [theme=dark] .colab-df-convert:hover {\n",
2068
+ " background-color: #434B5C;\n",
2069
+ " box-shadow: 0px 1px 3px 1px rgba(0, 0, 0, 0.15);\n",
2070
+ " filter: drop-shadow(0px 1px 2px rgba(0, 0, 0, 0.3));\n",
2071
+ " fill: #FFFFFF;\n",
2072
+ " }\n",
2073
+ " </style>\n",
2074
+ "\n",
2075
+ " <script>\n",
2076
+ " const buttonEl =\n",
2077
+ " document.querySelector('#df-2c74413c-185d-4488-9887-9b64d7c15536 button.colab-df-convert');\n",
2078
+ " buttonEl.style.display =\n",
2079
+ " google.colab.kernel.accessAllowed ? 'block' : 'none';\n",
2080
+ "\n",
2081
+ " async function convertToInteractive(key) {\n",
2082
+ " const element = document.querySelector('#df-2c74413c-185d-4488-9887-9b64d7c15536');\n",
2083
+ " const dataTable =\n",
2084
+ " await google.colab.kernel.invokeFunction('convertToInteractive',\n",
2085
+ " [key], {});\n",
2086
+ " if (!dataTable) return;\n",
2087
+ "\n",
2088
+ " const docLinkHtml = 'Like what you see? Visit the ' +\n",
2089
+ " '<a target=\"_blank\" href=https://colab.research.google.com/notebooks/data_table.ipynb>data table notebook</a>'\n",
2090
+ " + ' to learn more about interactive tables.';\n",
2091
+ " element.innerHTML = '';\n",
2092
+ " dataTable['output_type'] = 'display_data';\n",
2093
+ " await google.colab.output.renderOutput(dataTable, element);\n",
2094
+ " const docLink = document.createElement('div');\n",
2095
+ " docLink.innerHTML = docLinkHtml;\n",
2096
+ " element.appendChild(docLink);\n",
2097
+ " }\n",
2098
+ " </script>\n",
2099
+ " </div>\n",
2100
+ "\n",
2101
+ "\n",
2102
+ " </div>\n",
2103
+ " </div>\n"
2104
+ ],
2105
+ "application/vnd.google.colaboratory.intrinsic+json": {
2106
+ "type": "dataframe",
2107
+ "variable_name": "sample_data",
2108
+ "summary": "{\n \"name\": \"sample_data\",\n \"rows\": 1264,\n \"fields\": [\n {\n \"column\": \"text\",\n \"properties\": {\n \"dtype\": \"string\",\n \"num_unique_values\": 1263,\n \"samples\": [\n \"R&D Loan ) .\",\n \"Mr. Mikko Saavalainen , head of Comptel 's Global Sales concludes : `` Gibtelecom provides a perfect illustration of the variety of business , technical and regulatory challenges operators are facing in their OSS today .\",\n \"The decision reflects the underutilisation of the line , which produces nonwovens used in medical and wipes applications as well as for the automotive industry .\"\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"label_text\",\n \"properties\": {\n \"dtype\": \"category\",\n \"num_unique_values\": 3,\n \"samples\": [\n \"neutral\",\n \"positive\",\n \"negative\"\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"label\",\n \"properties\": {\n \"dtype\": \"number\",\n \"std\": 0,\n \"min\": 0,\n \"max\": 2,\n \"num_unique_values\": 3,\n \"samples\": [\n 1,\n 2,\n 0\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n }\n ]\n}"
2109
+ }
2110
+ },
2111
+ "metadata": {},
2112
+ "execution_count": 14
2113
+ }
2114
+ ]
2115
+ },
2116
+ {
2117
+ "cell_type": "code",
2118
+ "source": [],
2119
+ "metadata": {
2120
+ "id": "qgenw3blcH7q"
2121
+ },
2122
+ "execution_count": null,
2123
+ "outputs": []
2124
+ }
2125
+ ]
2126
+ }
requirements.txt ADDED
File without changes