Nada committed
Commit 269d993 · 1 Parent(s): 5b11b7e
Files changed (5)
  1. Dockerfile +31 -0
  2. README.md +10 -257
  3. guidelines.txt +107 -0
  4. mental_health_chatbot.log +782 -0
  5. requirements.txt +26 -0
Dockerfile ADDED
@@ -0,0 +1,31 @@
+ # Use Python 3.9 slim image
+ FROM python:3.9-slim
+
+ # Set working directory
+ WORKDIR /app
+
+ # Set environment variables
+ ENV PYTHONDONTWRITEBYTECODE=1 \
+     PYTHONUNBUFFERED=1 \
+     PORT=8000
+
+ # Install system dependencies
+ RUN apt-get update && apt-get install -y --no-install-recommends \
+     build-essential \
+     && rm -rf /var/lib/apt/lists/*
+
+ # Copy requirements first to leverage Docker cache
+ COPY requirements.txt .
+ RUN pip install --no-cache-dir -r requirements.txt
+
+ # Copy project files
+ COPY . .
+
+ # Create necessary directories
+ RUN mkdir -p session_data session_summaries vector_db models
+
+ # Expose the port
+ EXPOSE 8000
+
+ # Command to run the application
+ CMD ["uvicorn", "app:app", "--host", "0.0.0.0", "--port", "8000"]
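The Dockerfile sets `PORT=8000` in the environment, but the `CMD` line hard-codes the same value. A minimal sketch of deriving the uvicorn arguments from `PORT` instead, so the two cannot drift apart. The helper name `build_uvicorn_command` is hypothetical and not part of this commit; `app:app` and the host are taken from the `CMD` line.

```python
import os
from typing import List, Mapping


def build_uvicorn_command(env: Mapping[str, str]) -> List[str]:
    """Build the uvicorn argv, reading the port from PORT (default 8000).

    Hypothetical helper: the commit itself hard-codes the port in CMD.
    """
    port = int(env.get("PORT", "8000"))  # ValueError on a non-numeric value
    if not 1 <= port <= 65535:
        raise ValueError(f"PORT out of range: {port}")
    return ["uvicorn", "app:app", "--host", "0.0.0.0", "--port", str(port)]


if __name__ == "__main__":
    # Print the command that would be run for the current environment.
    print(" ".join(build_uvicorn_command(os.environ)))
```

With no `PORT` set, the result matches the `CMD` line in the Dockerfile.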
README.md CHANGED
@@ -1,257 +1,10 @@
1
- # Mental Health Support Chatbot
2
-
3
- A context-aware mental health support chatbot that provides therapeutic responses based on user emotions and maintains conversation history.
4
-
5
- ## Features
6
-
7
- - Emotion detection using state-of-the-art NLP models
8
- - Context-aware responses
9
- - Conversation memory
10
- - Therapeutic techniques integration
11
- - Risk flag detection and crisis intervention
12
- - Automatic detection of high-risk messages
13
- - Immediate crisis response protocol
14
- - Professional support referral system
15
- - Emergency contact information
16
- - RESTful API interface
17
- - Session management and summaries
18
- - User reply tracking, to support separate depression and anxiety detection from text
19
-
20
-
21
- ## Risk Flag Detection
22
-
23
- The chatbot automatically monitors messages for potential risk indicators and provides appropriate crisis intervention responses.
24
-
25
- ### Risk Indicators
26
- The system detects various risk-related keywords and phrases, including but not limited to:
27
- - Self-harm references
28
- - Suicidal ideation
29
- - Extreme emotional distress
30
- - Crisis situations
31
-
32
- ### Crisis Response Protocol
33
- When risk flags are detected:
34
- 1. Immediate crisis response is triggered
35
- 2. User is provided with:
36
- - Emergency contact information
37
- - Professional support options
38
- - Immediate coping strategies
39
- 3. Option to connect with licensed professionals
40
- 4. Grounding exercises and calming techniques
41
-
42
- ### Example Crisis Response
43
- ```json
44
- {
45
- "response": "I'm really sorry you're feeling this way. It sounds incredibly heavy, and I want you to know that you're not alone. You don't have to face this by yourself. Our app has licensed mental health professionals ready to support you. I can connect you with one right now if you'd like. Would you like to connect with a professional now, or would you rather keep talking with me for a bit? Either way, I'm here for you.",
46
- "session_id": "user123_20240314103000",
47
- "risk_detected": true,
48
- "crisis_protocol_activated": true
49
- }
50
- ```
51
-
52
- ## Setup
53
-
54
- 1. Install the required dependencies:
55
- ```bash
56
- pip install -r requirements.txt
57
- ```
58
-
59
- 2. Download the required NLTK data:
60
- ```bash
61
- python -m nltk.downloader punkt
62
- ```
63
-
64
- 3. Run the chatbot server:
65
- ```bash
66
- python app.py
67
- ```
68
-
69
- The server will start on `http://127.0.0.1:8000`
70
-
71
- ## API Documentation
72
-
73
- ### Base URL
74
- ```
75
- http://127.0.0.1:8000
76
- ```
77
-
78
- ### API Endpoints
79
-
80
- #### 1. Start a Session
81
- ```http
82
- POST /start_session?user_id={user_id}
83
- ```
84
-
85
- Example:
86
- ```bash
87
- curl -X 'POST' \
88
- 'http://127.0.0.1:8000/start_session?user_id=user123' \
89
- -H 'accept: application/json'
90
- ```
91
-
92
- Response:
93
- ```json
94
- {
95
- "response": "Hello! I'm here to support you today. How have you been feeling lately?",
96
- "session_id": "user123_20240314103000"
97
- }
98
- ```
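The example `session_id` values in this README look like the user ID, an underscore, and a start timestamp in `YYYYMMDDHHMMSS` form. A minimal sketch of that pattern follows, assuming the format is exactly what the examples suggest. The application's real ID generation is not shown in this commit, and both function names are hypothetical.

```python
from datetime import datetime
from typing import Tuple

STAMP_FORMAT = "%Y%m%d%H%M%S"  # assumed from the README examples


def make_session_id(user_id: str, started_at: datetime) -> str:
    """Join a user ID and a start time into a README-style session ID."""
    return f"{user_id}_{started_at.strftime(STAMP_FORMAT)}"


def split_session_id(session_id: str) -> Tuple[str, datetime]:
    """Split on the last underscore so user IDs may contain underscores."""
    user_id, _, stamp = session_id.rpartition("_")
    return user_id, datetime.strptime(stamp, STAMP_FORMAT)
```

Splitting on the last underscore matters if user IDs themselves contain underscores.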
99
-
100
- #### 2. Send a Message
101
- ```http
102
- POST /send_message
103
- Content-Type: application/json
104
-
105
- {
106
- "user_id": "user123",
107
- "message": "I'm feeling anxious today"
108
- }
109
- ```
110
-
111
- Example:
112
- ```bash
113
- curl -X 'POST' \
114
- 'http://127.0.0.1:8000/send_message' \
115
- -H 'accept: application/json' \
116
- -H 'Content-Type: application/json' \
117
- -d '{
118
- "user_id": "user123",
119
- "message": "I'\''m feeling anxious today"
120
- }'
121
- ```
122
-
123
- Response:
124
- ```json
125
- {
126
- "response": "I understand you're feeling anxious. Can you tell me more about what's causing this?",
127
- "session_id": "user123_20240314103000"
128
- }
129
- ```
130
-
131
- #### 3. Get User Replies
132
- ```http
133
- GET /user_replies/{user_id}
134
- ```
135
-
136
- Example:
137
- ```bash
138
- curl -X 'GET' \
139
- 'http://127.0.0.1:8000/user_replies/user123' \
140
- -H 'accept: application/json'
141
- ```
142
-
143
- Response:
144
- ```json
145
- {
146
- "user_id": "user123",
147
- "timestamp": "2024-03-14T10:30:00",
148
- "replies": [
149
- {
150
- "text": "I'm feeling anxious today",
151
- "timestamp": "2024-03-14T10:30:00",
152
- "session_id": "user123_20240314103000"
153
- }
154
- ]
155
- }
156
- ```
157
-
158
- #### 4. Get Session Summary
159
- ```http
160
- GET /session_summary/{session_id}?include_summary={boolean}&include_recommendations={boolean}&include_emotions={boolean}&include_characteristics={boolean}&include_duration={boolean}&include_phase={boolean}
161
- ```
162
-
163
- Example:
164
- ```bash
165
- curl -X 'GET' \
166
- 'http://127.0.0.1:8000/session_summary/user123_20240314103000?include_summary=true&include_recommendations=true&include_emotions=true&include_characteristics=false&include_duration=false&include_phase=false' \
167
- -H 'accept: application/json'
168
- ```
169
-
170
- Response:
171
- ```json
172
- {
173
- "session_id": "user123_20240314103000",
174
- "user_id": "user123",
175
- "start_time": "2024-03-14T10:30:00",
176
- "end_time": "2024-03-14T10:45:00",
177
- "summary": "Session focused on anxiety management...",
178
- "recommendations": [
179
- "Practice deep breathing exercises",
180
- "Consider journaling your thoughts"
181
- ],
182
- "primary_emotions": ["anxiety", "stress"],
183
- "emotion_progression": ["anxiety", "calm"],
184
- "duration_minutes": 0.0,
185
- "current_phase": "unknown",
186
- "session_characteristics": {}
187
- }
188
- ```
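The session summary endpoint takes six optional `include_*` query flags. A minimal sketch of building that URL with the standard library, assuming the server accepts lowercase `true`/`false` as in the curl example; the helper name `session_summary_url` is hypothetical.

```python
from urllib.parse import quote, urlencode

# Flag names as listed in the endpoint signature in this README.
SUMMARY_FLAGS = (
    "include_summary",
    "include_recommendations",
    "include_emotions",
    "include_characteristics",
    "include_duration",
    "include_phase",
)


def session_summary_url(base_url: str, session_id: str, **flags: bool) -> str:
    """Build a /session_summary URL; unknown flag names are rejected."""
    unknown = set(flags) - set(SUMMARY_FLAGS)
    if unknown:
        raise ValueError(f"unknown flags: {sorted(unknown)}")
    # Emit flags in the documented order, as lowercase true/false strings.
    query = urlencode(
        {name: "true" if flags[name] else "false"
         for name in SUMMARY_FLAGS if name in flags}
    )
    path = f"{base_url.rstrip('/')}/session_summary/{quote(session_id, safe='')}"
    return f"{path}?{query}" if query else path
```

Flags that are not passed are left out of the query string, so the server's defaults apply.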
189
-
190
- #### 5. End Session
191
- ```http
192
- POST /end_session?user_id={user_id}
193
- ```
194
-
195
- Example:
196
- ```bash
197
- curl -X 'POST' \
198
- 'http://127.0.0.1:8000/end_session?user_id=user123' \
199
- -H 'accept: application/json'
200
- ```
201
-
202
- Response: Complete session summary with all fields.
203
-
204
- #### 6. Health Check
205
- ```http
206
- GET /health
207
- ```
208
-
209
- Example:
210
- ```bash
211
- curl -X 'GET' \
212
- 'http://127.0.0.1:8000/health' \
213
- -H 'accept: application/json'
214
- ```
215
-
216
- Response:
217
- ```json
218
- {
219
- "status": "healthy"
220
- }
221
- ```
222
-
223
- ## Integration Guidelines
224
-
225
- ### Best Practices
226
- 1. Always store the `session_id` returned from `/start_session`
227
- 2. Use the same `user_id` throughout a conversation
228
- 3. Include appropriate error handling for API responses
229
- 4. Monitor the health endpoint for system status
230
-
231
- ### Error Handling
232
- The API returns standard HTTP status codes:
233
- - 200: Success
234
- - 400: Bad Request
235
- - 404: Not Found
236
- - 500: Internal Server Error
237
-
238
- Error responses include a detail message:
239
- ```json
240
- {
241
- "detail": "Error message here"
242
- }
243
- ```
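A client can turn the status codes and the `detail` field described here into one exception type. A minimal sketch, assuming 200 is the only success code (as listed above) and that error bodies carry a `detail` key. `ApiError` and `parse_api_response` are hypothetical names, not part of this project.

```python
import json
from typing import Any


class ApiError(Exception):
    """Raised for any non-200 response; carries the status and detail."""

    def __init__(self, status: int, detail: str) -> None:
        super().__init__(f"HTTP {status}: {detail}")
        self.status = status
        self.detail = detail


def parse_api_response(status: int, body: str) -> Any:
    """Return the decoded JSON on 200, otherwise raise ApiError."""
    try:
        payload = json.loads(body) if body else {}
    except json.JSONDecodeError:
        payload = {}  # a non-JSON body is treated as carrying no detail
    if status == 200:
        return payload
    detail = "unknown error"
    if isinstance(payload, dict):
        detail = str(payload.get("detail", detail))
    raise ApiError(status, detail)
```

The function takes the status code and raw body as arguments, so it can sit behind any HTTP client.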
244
-
245
-
246
- ## Important Notes
247
-
248
- - This is not a replacement for professional mental health care
249
- - Always seek professional help for serious mental health concerns
250
-
251
-
252
- ## Privacy and Security
253
-
254
- - Conversations are stored in memory only
255
- - No personal data is permanently stored
256
- - The system is designed to be HIPAA-compliant
257
- - Users are identified by unique IDs only
 
+ ---
+ title: Conversational Chat
+ emoji:
+ colorFrom: red
+ colorTo: pink
+ sdk: docker
+ pinned: false
+ ---
+
+ Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
guidelines.txt ADDED
@@ -0,0 +1,107 @@
1
+ Therapeutic Guidelines:
2
+ 1. Build Trust and Rapport
3
+
4
+ Begin with warmth and understanding.
5
+
6
+ Use active listening: reflect back emotions and key points.
7
+
8
+ Be supportive and non-threatening in tone.
9
+
10
+ Always keep the tone calm, supportive, and emotionally intelligent.
11
+
12
+ Empower users to explore their own thoughts and solutions.
13
+
14
+ Ask open-ended questions to deepen self-reflection.
15
+
16
+ Avoid giving commands or rigid advice.
17
+
18
+ Avoid assumptions based on culture, gender, or personal history.
19
+
20
+ Create psychological safety — reassure the user that their thoughts and emotions are welcome and valid.
21
+
22
+
23
+ 2. Be Non-Judgmental
24
+
25
+ Accept all emotions and experiences without criticism.
26
+
27
+ Never blame or shame the user.
28
+
29
+ Normalize their feelings when appropriate.
30
+
31
+ 3. Use Evidence-Based Techniques
32
+
33
+ Apply suitable techniques such as:
34
+ 1. Cognitive Behavioral Therapy (CBT)
35
+ Help users identify negative thought patterns (cognitive distortions) and reframe them:
36
+
37
+ “Let’s try to challenge that thought — is there evidence that supports or contradicts it?”
38
+
39
+ “What might be a more balanced way to look at this?”
40
+
41
+ 2. Dialectical Behavior Therapy (DBT)
42
+ Focus on emotional regulation, distress tolerance, and mindfulness:
43
+
44
+ “Let’s take a moment to breathe and notice what you’re feeling without judgment.”
45
+
46
+ “What can you do right now to self-soothe or ground yourself?”
47
+
48
+ 3. Acceptance and Commitment Therapy (ACT)
49
+ Promote acceptance of thoughts and values-based living:
50
+
51
+ “Instead of fighting that thought, can we observe it and let it be?”
52
+
53
+ “What matters to you right now? What small step can you take in that direction?”
54
+
55
+ 4. Motivational Interviewing
56
+ Help ambivalent users explore change:
57
+
58
+ “On a scale from 1 to 10, how ready do you feel to make a change?”
59
+
60
+ “What would it take to move one step closer?”
61
+
62
+
63
+ 4. Structured Conversation Flow
64
+ Begin with empathy → explore the problem → validate emotions → apply a therapeutic tool → summarize insight or coping step.
65
+
66
+ End each message with a question or reflection prompt to continue engagement.
67
+
68
+
69
+ 5. Add Actionable Suggestions
70
+
71
+ Offer gentle, realistic, and practical steps the user can try.
72
+
73
+ Tailor suggestions to their emotional state — prioritize simplicity and emotional safety.
74
+
75
+ Use empowering language that invites, not instructs:
76
+
77
+ “Would you be open to trying…?”
78
+
79
+ “Some people find this helpful — would you like to explore it together?”
80
+
81
+ Examples of actionable suggestions include:
82
+
83
+ Grounding Techniques
84
+ “Can you name five things you see around you right now, four things you can touch, three you can hear, two you can smell, and one you can taste?”
85
+
86
+ Mindful Breathing
87
+ “Let’s try a simple breathing exercise: inhale slowly for 4 counts, hold for 4, exhale for 4. Can we do this together for a few rounds?”
88
+
89
+ Journaling Prompts
90
+ “Would writing down your thoughts help make sense of what you're feeling? You might start with: ‘Right now, I’m feeling… because…’”
91
+
92
+ Self-Compassion Reminders
93
+ “Can you speak to yourself the way you would to a friend going through this?”
94
+
95
+ Behavioral Activation
96
+ “Sometimes doing one small activity, even if it feels meaningless at first, can help shift your energy. What’s one thing you could do today that used to bring you comfort?”
97
+
98
+ Connection Check-In
99
+ “Is there someone you trust that you might feel comfortable talking to or spending time with today, even briefly?”
100
+
101
+ End with an open tone:
102
+
103
+ “How does that sound to you?”
104
+
105
+ “Would you like to try that and let me know how it goes?”
106
+
107
+
mental_health_chatbot.log ADDED
@@ -0,0 +1,782 @@
1
+ 2025-04-16 20:40:51,091 - __main__ - INFO - Using device: cuda
2
+ 2025-04-16 20:40:51,091 - __main__ - INFO - Loading emotion detection model
3
+ 2025-04-16 20:40:51,872 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
4
+ 2025-04-16 20:40:52,900 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
5
+ 2025-04-16 20:40:54,064 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
6
+ 2025-04-16 20:41:04,152 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
7
+ 2025-04-16 20:41:04,455 - __main__ - INFO - Successfully loaded PEFT model
8
+ 2025-04-16 20:41:05,633 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
9
+ 2025-04-16 20:41:08,333 - __main__ - INFO - Setting up FAISS vector database
10
+ 2025-04-16 20:41:08,663 - faiss.loader - INFO - Loading faiss with AVX2 support.
11
+ 2025-04-16 20:41:08,728 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
12
+ 2025-04-16 20:41:08,741 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
13
+ 2025-04-16 20:41:08,746 - __main__ - WARNING - No guidelines file provided, using empty vector store
14
+ 2025-04-16 20:49:53,663 - __main__ - INFO - Using device: cuda
15
+ 2025-04-16 20:49:53,663 - __main__ - INFO - Loading emotion detection model
16
+ 2025-04-16 20:49:54,306 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
17
+ 2025-04-16 20:49:55,317 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
18
+ 2025-04-16 20:49:56,722 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
19
+ 2025-04-16 20:50:05,931 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
20
+ 2025-04-16 20:50:06,203 - __main__ - INFO - Successfully loaded PEFT model
21
+ 2025-04-16 20:50:07,402 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
22
+ 2025-04-16 20:50:10,384 - __main__ - INFO - Setting up FAISS vector database
23
+ 2025-04-16 20:50:10,385 - faiss.loader - INFO - Loading faiss with AVX2 support.
24
+ 2025-04-16 20:50:10,445 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
25
+ 2025-04-16 20:50:10,458 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
26
+ 2025-04-16 20:50:10,461 - __main__ - INFO - Loaded existing vector database
27
+ 2025-04-16 20:53:57,905 - __main__ - INFO - Using device: cuda
28
+ 2025-04-16 20:53:57,905 - __main__ - INFO - Loading emotion detection model
29
+ 2025-04-16 20:53:58,645 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
30
+ 2025-04-16 20:53:59,640 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
31
+ 2025-04-16 20:54:00,686 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
32
+ 2025-04-16 20:54:10,841 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
33
+ 2025-04-16 20:54:11,142 - __main__ - INFO - Successfully loaded PEFT model
34
+ 2025-04-16 20:54:12,244 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
35
+ 2025-04-16 20:54:15,613 - __main__ - INFO - Setting up FAISS vector database
36
+ 2025-04-16 20:54:15,619 - faiss.loader - INFO - Loading faiss with AVX2 support.
37
+ 2025-04-16 20:54:15,670 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
38
+ 2025-04-16 20:54:15,678 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
39
+ 2025-04-16 20:54:15,680 - __main__ - INFO - Loaded existing vector database
40
+ 2025-04-16 20:56:31,196 - __main__ - INFO - Using device: cuda
41
+ 2025-04-16 20:56:31,196 - __main__ - INFO - Loading emotion detection model
42
+ 2025-04-16 20:56:32,364 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
43
+ 2025-04-16 20:56:33,303 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
44
+ 2025-04-16 20:56:34,880 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
45
+ 2025-04-16 20:56:44,016 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
46
+ 2025-04-16 20:56:44,374 - __main__ - INFO - Successfully loaded PEFT model
47
+ 2025-04-16 20:56:45,451 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
48
+ 2025-04-16 20:56:48,249 - __main__ - INFO - Setting up FAISS vector database
49
+ 2025-04-16 20:56:48,252 - faiss.loader - INFO - Loading faiss with AVX2 support.
50
+ 2025-04-16 20:56:48,274 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
51
+ 2025-04-16 20:56:48,282 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
52
+ 2025-04-16 20:56:48,284 - __main__ - INFO - Loaded existing vector database
53
+ 2025-04-16 20:56:48,322 - __main__ - INFO - Session started for user cli_user_20250416205648
54
+ 2025-04-18 16:02:11,023 - __main__ - INFO - Using device: cuda
55
+ 2025-04-18 16:02:11,023 - __main__ - INFO - Loading emotion detection model
56
+ 2025-04-18 16:02:12,079 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
57
+ 2025-04-18 16:02:13,129 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
58
+ 2025-04-18 16:02:14,361 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
59
+ 2025-04-18 16:02:24,172 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
60
+ 2025-04-18 16:02:24,514 - __main__ - INFO - Successfully loaded PEFT model
61
+ 2025-04-18 16:02:25,616 - __main__ - INFO - Loading summary model
62
+ 2025-04-18 16:22:26,761 - __main__ - INFO - Initializing FlowManager
63
+ 2025-04-18 16:22:26,762 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
64
+ 2025-04-18 16:22:26,764 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
65
+ 2025-04-18 16:22:30,903 - __main__ - INFO - Setting up FAISS vector database
66
+ 2025-04-18 16:22:30,914 - faiss.loader - INFO - Loading faiss with AVX2 support.
67
+ 2025-04-18 16:22:31,039 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
68
+ 2025-04-18 16:22:31,045 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
69
+ 2025-04-18 16:22:31,074 - __main__ - INFO - Loaded existing vector database
70
+ 2025-04-18 16:22:31,087 - conversation_flow - INFO - Initialized new session for user cli_user_20250418162231
71
+ 2025-04-18 16:22:31,087 - __main__ - INFO - Session started for user cli_user_20250418162231
72
+ 2025-04-18 16:28:53,111 - __main__ - INFO - Using device: cuda
73
+ 2025-04-18 16:28:53,111 - __main__ - INFO - Loading emotion detection model
74
+ 2025-04-18 16:29:03,485 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
75
+ 2025-04-18 16:29:04,516 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
76
+ 2025-04-18 16:29:05,512 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
77
+ 2025-04-18 16:29:14,677 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
78
+ 2025-04-18 16:29:14,987 - __main__ - INFO - Successfully loaded PEFT model
79
+ 2025-04-18 16:29:16,117 - __main__ - INFO - Loading summary model
80
+ 2025-04-18 16:31:42,623 - __main__ - INFO - Using device: cuda
81
+ 2025-04-18 16:31:42,630 - __main__ - INFO - Loading emotion detection model
82
+ 2025-04-18 16:31:43,302 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
83
+ 2025-04-18 16:31:44,315 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
84
+ 2025-04-18 16:31:45,437 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
85
+ 2025-04-18 16:31:54,477 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
86
+ 2025-04-18 16:31:54,744 - __main__ - INFO - Successfully loaded PEFT model
87
+ 2025-04-18 16:31:55,750 - __main__ - INFO - Loading summary model
88
+ 2025-04-18 16:33:51,319 - __main__ - INFO - Using device: cuda
89
+ 2025-04-18 16:33:51,320 - __main__ - INFO - Loading emotion detection model
90
+ 2025-04-18 16:33:52,044 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
91
+ 2025-04-18 16:33:53,063 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
92
+ 2025-04-18 16:33:54,159 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
93
+ 2025-04-18 16:34:03,223 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
94
+ 2025-04-18 16:34:03,556 - __main__ - INFO - Successfully loaded PEFT model
95
+ 2025-04-18 16:34:04,651 - __main__ - INFO - Loading summary model
96
+ 2025-04-18 16:34:05,893 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
97
+ 2025-04-18 16:39:59,658 - __main__ - INFO - Using device: cuda
98
+ 2025-04-18 16:39:59,659 - __main__ - INFO - Loading emotion detection model
99
+ 2025-04-18 16:40:00,514 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
100
+ 2025-04-18 16:40:01,521 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
101
+ 2025-04-18 16:40:03,059 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
102
+ 2025-04-18 16:40:12,212 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
103
+ 2025-04-18 16:40:12,491 - __main__ - INFO - Successfully loaded PEFT model
104
+ 2025-04-18 16:40:13,567 - __main__ - INFO - Loading summary model
105
+ 2025-04-18 16:40:16,727 - __main__ - INFO - Initializing FlowManager
106
+ 2025-04-18 16:40:16,727 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
107
+ 2025-04-18 16:43:27,852 - __main__ - INFO - Using device: cuda
108
+ 2025-04-18 16:43:27,855 - __main__ - INFO - Loading emotion detection model
109
+ 2025-04-18 16:43:28,440 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
110
+ 2025-04-18 16:43:29,386 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
111
+ 2025-04-18 16:43:30,348 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
112
+ 2025-04-18 16:43:39,286 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
113
+ 2025-04-18 16:43:39,558 - __main__ - INFO - Successfully loaded PEFT model
114
+ 2025-04-18 16:43:40,570 - __main__ - INFO - Loading summary model
115
+ 2025-04-18 16:43:43,510 - __main__ - INFO - Initializing FlowManager
116
+ 2025-04-18 16:43:43,518 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
117
+ 2025-04-18 16:43:43,520 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
118
+ 2025-04-18 16:43:46,271 - __main__ - INFO - Setting up FAISS vector database
119
+ 2025-04-18 16:43:46,276 - faiss.loader - INFO - Loading faiss with AVX2 support.
120
+ 2025-04-18 16:43:46,343 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
121
+ 2025-04-18 16:43:46,351 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
122
+ 2025-04-18 16:43:46,355 - __main__ - INFO - Loaded existing vector database
123
+ 2025-04-18 16:43:46,356 - conversation_flow - INFO - Initialized new session for user cli_user_20250418164346
124
+ 2025-04-18 16:43:46,357 - __main__ - INFO - Session started for user cli_user_20250418164346
125
+ 2025-04-18 16:48:37,587 - __main__ - INFO - Using device: cuda
126
+ 2025-04-18 16:48:37,587 - __main__ - INFO - Loading emotion detection model
127
+ 2025-04-18 16:48:38,210 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
128
+ 2025-04-18 16:48:39,162 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
129
+ 2025-04-18 16:48:40,193 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
130
+ 2025-04-18 16:48:49,130 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
131
+ 2025-04-18 16:48:49,437 - __main__ - INFO - Successfully loaded PEFT model
132
+ 2025-04-18 16:48:50,554 - __main__ - INFO - Loading summary model
133
+ 2025-04-18 16:48:53,718 - __main__ - INFO - Initializing FlowManager
134
+ 2025-04-18 16:48:53,718 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
135
+ 2025-04-18 16:48:53,718 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
136
+ 2025-04-18 16:49:00,071 - __main__ - INFO - Setting up FAISS vector database
137
+ 2025-04-18 16:49:00,074 - faiss.loader - INFO - Loading faiss with AVX2 support.
138
+ 2025-04-18 16:49:00,130 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
139
+ 2025-04-18 16:49:00,141 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
140
+ 2025-04-18 16:49:00,144 - __main__ - INFO - Loaded existing vector database
141
+ 2025-04-18 16:49:00,145 - conversation_flow - INFO - Initialized new session for user cli_user_20250418164900
142
+ 2025-04-18 16:49:00,145 - __main__ - INFO - Session started for user cli_user_20250418164900
143
+ 2025-04-18 16:52:02,476 - __main__ - INFO - Using device: cuda
144
+ 2025-04-18 16:52:02,476 - __main__ - INFO - Loading emotion detection model
145
+ 2025-04-18 16:52:03,111 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
146
+ 2025-04-18 16:52:04,143 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
147
+ 2025-04-18 16:52:05,213 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
148
+ 2025-04-18 16:52:14,106 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
149
+ 2025-04-18 16:52:14,438 - __main__ - INFO - Successfully loaded PEFT model
150
+ 2025-04-18 16:52:15,455 - __main__ - INFO - Loading summary model
151
+ 2025-04-18 16:52:18,449 - __main__ - INFO - Summary model loaded successfully
152
+ 2025-04-18 16:52:18,449 - __main__ - INFO - Initializing FlowManager
153
+ 2025-04-18 16:52:18,449 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
154
+ 2025-04-18 16:52:18,454 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
155
+ 2025-04-18 16:52:21,626 - __main__ - INFO - Setting up FAISS vector database
156
+ 2025-04-18 16:52:21,637 - faiss.loader - INFO - Loading faiss with AVX2 support.
157
+ 2025-04-18 16:52:21,678 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
158
+ 2025-04-18 16:52:21,699 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
159
+ 2025-04-18 16:52:21,702 - __main__ - INFO - Loaded existing vector database
160
+ 2025-04-18 16:52:21,703 - __main__ - INFO - All models and components initialized successfully
161
+ 2025-04-18 16:52:21,704 - conversation_flow - INFO - Initialized new session for user cli_user_20250418165221
162
+ 2025-04-18 16:52:21,704 - __main__ - INFO - Session started for user cli_user_20250418165221
163
+ 2025-04-18 17:18:39,952 - __main__ - INFO - Using device: cuda
164
+ 2025-04-18 17:18:39,952 - __main__ - INFO - Loading emotion detection model
165
+ 2025-04-18 17:18:40,598 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
166
+ 2025-04-18 17:18:41,654 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
167
+ 2025-04-18 17:18:42,682 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
168
+ 2025-04-18 17:18:51,948 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
169
+ 2025-04-18 17:18:52,282 - __main__ - INFO - Successfully loaded PEFT model
170
+ 2025-04-18 17:18:53,411 - __main__ - INFO - Loading summary model
171
+ 2025-04-18 17:18:56,632 - __main__ - INFO - Summary model loaded successfully
172
+ 2025-04-18 17:18:56,632 - __main__ - INFO - Initializing FlowManager
173
+ 2025-04-18 17:18:56,632 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
174
+ 2025-04-18 17:18:56,632 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
175
+ 2025-04-18 17:19:00,694 - __main__ - INFO - Setting up FAISS vector database
176
+ 2025-04-18 17:19:00,698 - faiss.loader - INFO - Loading faiss with AVX2 support.
177
+ 2025-04-18 17:19:00,749 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
178
+ 2025-04-18 17:19:00,760 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
179
+ 2025-04-18 17:19:00,763 - __main__ - INFO - Loaded existing vector database
180
+ 2025-04-18 17:19:00,764 - __main__ - INFO - All models and components initialized successfully
181
+ 2025-04-18 17:19:00,765 - conversation_flow - INFO - Initialized new session for user cli_user_20250418171900
182
+ 2025-04-18 17:19:00,765 - __main__ - INFO - Session started for user cli_user_20250418171900
183
+ 2025-04-18 20:42:57,848 - __main__ - INFO - Using device: cuda
184
+ 2025-04-18 20:42:57,848 - __main__ - INFO - Loading emotion detection model
185
+ 2025-04-18 20:43:02,595 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
186
+ 2025-04-18 20:43:03,524 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
187
+ 2025-04-18 20:43:04,598 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
188
+ 2025-04-18 20:43:12,915 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
189
+ 2025-04-18 20:43:13,129 - __main__ - INFO - Successfully loaded PEFT model
190
+ 2025-04-18 20:43:14,236 - __main__ - INFO - Loading summary model
191
+ 2025-04-18 20:43:17,220 - __main__ - INFO - Summary model loaded successfully
192
+ 2025-04-18 20:43:17,220 - __main__ - INFO - Initializing FlowManager
193
+ 2025-04-18 20:43:17,220 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
194
+ 2025-04-18 20:43:17,233 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
195
+ 2025-04-18 20:43:21,870 - __main__ - INFO - Setting up FAISS vector database
196
+ 2025-04-18 20:43:21,870 - faiss.loader - INFO - Loading faiss with AVX2 support.
197
+ 2025-04-18 20:43:21,929 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
198
+ 2025-04-18 20:43:21,944 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
199
+ 2025-04-18 20:43:21,953 - __main__ - INFO - Loaded existing vector database
200
+ 2025-04-18 20:43:21,954 - __main__ - INFO - All models and components initialized successfully
201
+ 2025-04-18 20:43:21,955 - conversation_flow - INFO - Initialized new session for user cli_user_20250418204321
202
+ 2025-04-18 20:43:21,955 - __main__ - INFO - Session started for user cli_user_20250418204321
203
+ 2025-04-18 20:44:10,846 - conversation_flow - ERROR - Error detecting topics with LLM: 'HuggingFacePipeline' object has no attribute 'get_llm_response'
204
+ 2025-04-18 20:44:59,396 - conversation_flow - ERROR - Error detecting topics with LLM: 'HuggingFacePipeline' object has no attribute 'get_llm_response'
205
+ 2025-04-18 20:45:25,345 - conversation_flow - ERROR - Error detecting topics with LLM: 'HuggingFacePipeline' object has no attribute 'get_llm_response'
206
+ 2025-04-18 20:45:47,579 - conversation_flow - ERROR - Error detecting topics with LLM: 'HuggingFacePipeline' object has no attribute 'get_llm_response'
207
+ 2025-04-18 20:46:06,205 - conversation_flow - ERROR - Error detecting topics with LLM: 'HuggingFacePipeline' object has no attribute 'get_llm_response'
208
+ 2025-04-18 21:01:48,815 - __main__ - INFO - Using device: cuda
209
+ 2025-04-18 21:01:48,817 - __main__ - INFO - Loading emotion detection model
210
+ 2025-04-18 21:01:50,288 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
211
+ 2025-04-18 21:01:51,205 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
212
+ 2025-04-18 21:01:52,274 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
213
+ 2025-04-18 21:02:00,508 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
214
+ 2025-04-18 21:02:00,733 - __main__ - INFO - Successfully loaded PEFT model
215
+ 2025-04-18 21:02:01,861 - __main__ - INFO - Loading summary model
216
+ 2025-04-18 21:02:04,829 - __main__ - INFO - Summary model loaded successfully
217
+ 2025-04-18 21:02:04,829 - __main__ - INFO - Initializing FlowManager
218
+ 2025-04-18 21:02:04,829 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
219
+ 2025-04-18 21:02:04,829 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
220
+ 2025-04-18 21:02:07,509 - __main__ - INFO - Setting up FAISS vector database
221
+ 2025-04-18 21:02:07,513 - faiss.loader - INFO - Loading faiss with AVX2 support.
222
+ 2025-04-18 21:02:07,571 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
223
+ 2025-04-18 21:02:07,576 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
224
+ 2025-04-18 21:02:07,580 - __main__ - INFO - Loaded existing vector database
225
+ 2025-04-18 21:02:07,581 - __main__ - INFO - All models and components initialized successfully
226
+ 2025-04-18 21:02:07,582 - conversation_flow - INFO - Initialized new session for user cli_user_20250418210207
227
+ 2025-04-18 21:02:07,582 - __main__ - INFO - Session started for user cli_user_20250418210207
228
+ 2025-04-18 21:09:03,887 - __main__ - INFO - Using device: cuda
229
+ 2025-04-18 21:09:03,887 - __main__ - INFO - Loading emotion detection model
230
+ 2025-04-18 21:09:05,546 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
231
+ 2025-04-18 21:09:06,525 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
232
+ 2025-04-18 21:09:07,498 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
233
+ 2025-04-18 21:09:15,645 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
234
+ 2025-04-18 21:09:15,852 - __main__ - INFO - Successfully loaded PEFT model
235
+ 2025-04-18 21:09:16,802 - __main__ - INFO - Loading summary model
236
+ 2025-04-18 21:09:19,599 - __main__ - INFO - Summary model loaded successfully
237
+ 2025-04-18 21:09:19,599 - __main__ - INFO - Initializing FlowManager
238
+ 2025-04-18 21:09:19,599 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
239
+ 2025-04-18 21:09:19,605 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
240
+ 2025-04-18 21:09:32,385 - __main__ - INFO - Setting up FAISS vector database
241
+ 2025-04-18 21:09:32,401 - faiss.loader - INFO - Loading faiss with AVX2 support.
242
+ 2025-04-18 21:09:32,443 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
243
+ 2025-04-18 21:09:32,458 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
244
+ 2025-04-18 21:09:32,465 - __main__ - INFO - Loaded existing vector database
245
+ 2025-04-18 21:09:32,465 - __main__ - INFO - All models and components initialized successfully
246
+ 2025-04-18 21:09:32,465 - conversation_flow - INFO - Initialized new session for user cli_user_20250418210932
247
+ 2025-04-18 21:09:32,465 - __main__ - INFO - Session started for user cli_user_20250418210932
248
+ 2025-04-18 21:10:12,360 - __main__ - ERROR - Failed to generate session summary: Object of type ConversationPhase is not JSON serializable
249
+ 2025-04-18 21:19:08,728 - __main__ - INFO - Using device: cuda
250
+ 2025-04-18 21:19:08,728 - __main__ - INFO - Loading emotion detection model
251
+ 2025-04-18 21:19:09,386 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
252
+ 2025-04-18 21:19:10,380 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
253
+ 2025-04-18 21:19:11,771 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
254
+ 2025-04-18 21:19:20,833 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
255
+ 2025-04-18 21:19:21,122 - __main__ - INFO - Successfully loaded PEFT model
256
+ 2025-04-18 21:19:22,118 - __main__ - INFO - Loading summary model
257
+ 2025-04-18 21:19:25,280 - __main__ - INFO - Summary model loaded successfully
258
+ 2025-04-18 21:19:25,280 - __main__ - INFO - Initializing FlowManager
259
+ 2025-04-18 21:19:25,280 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
260
+ 2025-04-18 21:19:25,294 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
261
+ 2025-04-18 21:19:28,905 - __main__ - INFO - Setting up FAISS vector database
262
+ 2025-04-18 21:19:28,908 - faiss.loader - INFO - Loading faiss with AVX2 support.
263
+ 2025-04-18 21:19:28,964 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
264
+ 2025-04-18 21:19:28,980 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
265
+ 2025-04-18 21:19:28,984 - __main__ - INFO - Loaded existing vector database
266
+ 2025-04-18 21:19:28,985 - __main__ - WARNING - Failed to load summary from .json: Expecting value: line 7 column 20 (char 147)
267
+ 2025-04-18 21:19:28,985 - __main__ - INFO - All models and components initialized successfully
268
+ 2025-04-18 21:19:28,986 - conversation_flow - INFO - Initialized new session for user cli_user_20250418211928
269
+ 2025-04-18 21:19:28,986 - __main__ - INFO - Session started for user cli_user_20250418211928
270
+ 2025-04-18 21:26:19,114 - __main__ - INFO - Using device: cuda
271
+ 2025-04-18 21:26:19,114 - __main__ - INFO - Loading emotion detection model
272
+ 2025-04-18 21:26:19,762 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
273
+ 2025-04-18 21:26:20,784 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
274
+ 2025-04-18 21:26:21,847 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
275
+ 2025-04-18 21:26:30,681 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
276
+ 2025-04-18 21:26:31,011 - __main__ - INFO - Successfully loaded PEFT model
277
+ 2025-04-18 21:26:31,996 - __main__ - INFO - Loading summary model
278
+ 2025-04-18 21:26:34,971 - __main__ - INFO - Summary model loaded successfully
279
+ 2025-04-18 21:26:34,971 - __main__ - INFO - Initializing FlowManager
280
+ 2025-04-18 21:26:34,971 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
281
+ 2025-04-18 21:26:34,985 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
282
+ 2025-04-18 21:26:45,007 - __main__ - INFO - Setting up FAISS vector database
283
+ 2025-04-18 21:26:45,010 - faiss.loader - INFO - Loading faiss with AVX2 support.
284
+ 2025-04-18 21:26:45,068 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
285
+ 2025-04-18 21:26:45,077 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
286
+ 2025-04-18 21:26:45,080 - __main__ - INFO - Loaded existing vector database
287
+ 2025-04-18 21:26:45,081 - __main__ - WARNING - Failed to load summary from .json: Expecting value: line 7 column 20 (char 147)
288
+ 2025-04-18 21:26:45,082 - __main__ - INFO - All models and components initialized successfully
289
+ 2025-04-18 21:26:45,082 - conversation_flow - INFO - Initialized new session for user cli_user_20250418212645
290
+ 2025-04-18 21:26:45,082 - __main__ - INFO - Session started for user cli_user_20250418212645
291
+ 2025-04-18 21:32:34,109 - conversation_flow - INFO - User cli_user_20250418212645 transitioned from introduction to exploration: Time-based transition
292
+ 2025-04-18 21:58:42,487 - __main__ - INFO - Using device: cuda
293
+ 2025-04-18 21:58:42,492 - __main__ - INFO - Loading emotion detection model
294
+ 2025-04-18 21:58:43,126 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
295
+ 2025-04-18 21:58:44,158 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
296
+ 2025-04-18 21:58:45,213 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
297
+ 2025-04-18 22:08:32,721 - __main__ - INFO - Using device: cuda
298
+ 2025-04-18 22:08:32,721 - __main__ - INFO - Loading emotion detection model
299
+ 2025-04-18 22:08:38,582 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
300
+ 2025-04-18 22:08:39,309 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
301
+ 2025-04-18 22:08:42,392 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
302
+ 2025-04-18 22:08:47,815 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
303
+ 2025-04-18 22:08:48,105 - __main__ - INFO - Successfully loaded PEFT model
304
+ 2025-04-18 22:08:49,156 - __main__ - INFO - Loading summary model
305
+ 2025-04-18 22:08:57,299 - __main__ - INFO - Summary model loaded successfully
306
+ 2025-04-18 22:08:57,299 - __main__ - INFO - Initializing FlowManager
307
+ 2025-04-18 22:08:57,299 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
308
+ 2025-04-18 22:08:57,302 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
309
+ 2025-04-18 22:09:17,127 - __main__ - INFO - Setting up FAISS vector database
310
+ 2025-04-18 22:09:17,130 - faiss.loader - INFO - Loading faiss with AVX2 support.
311
+ 2025-04-18 22:09:17,203 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
312
+ 2025-04-18 22:09:17,213 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
313
+ 2025-04-18 22:09:17,218 - __main__ - INFO - Loaded existing vector database
314
+ 2025-04-18 22:09:17,219 - __main__ - INFO - All models and components initialized successfully
315
+ 2025-04-18 22:09:17,220 - conversation_flow - INFO - Initialized new session for user cli_user_20250418220917
316
+ 2025-04-18 22:09:17,220 - __main__ - INFO - Session started for user cli_user_20250418220917
317
+ 2025-04-18 22:17:05,900 - __main__ - INFO - Using device: cuda
318
+ 2025-04-18 22:17:05,900 - __main__ - INFO - Loading emotion detection model
319
+ 2025-04-18 22:17:06,561 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
320
+ 2025-04-18 22:17:07,562 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
321
+ 2025-04-18 22:17:08,643 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
322
+ 2025-04-18 22:17:17,695 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
323
+ 2025-04-18 22:17:18,024 - __main__ - INFO - Successfully loaded PEFT model
324
+ 2025-04-18 22:17:19,055 - __main__ - INFO - Loading summary model
325
+ 2025-04-18 22:17:22,232 - __main__ - INFO - Summary model loaded successfully
326
+ 2025-04-18 22:17:22,232 - __main__ - INFO - Initializing FlowManager
327
+ 2025-04-18 22:17:22,232 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
328
+ 2025-04-18 22:17:22,242 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
329
+ 2025-04-18 22:17:37,477 - __main__ - INFO - Setting up FAISS vector database
330
+ 2025-04-18 22:17:37,481 - faiss.loader - INFO - Loading faiss with AVX2 support.
331
+ 2025-04-18 22:17:37,543 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
332
+ 2025-04-18 22:17:37,550 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
333
+ 2025-04-18 22:17:37,553 - __main__ - INFO - Loaded existing vector database
334
+ 2025-04-18 22:17:37,554 - __main__ - INFO - All models and components initialized successfully
335
+ 2025-04-18 22:17:37,555 - conversation_flow - INFO - Initialized new session for user cli_user_20250418221737
336
+ 2025-04-18 22:17:37,555 - __main__ - INFO - Session started for user cli_user_20250418221737
337
+ 2025-04-18 22:18:57,039 - __main__ - INFO - Using device: cuda
338
+ 2025-04-18 22:18:57,040 - __main__ - INFO - Loading emotion detection model
339
+ 2025-04-18 22:18:59,206 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
340
+ 2025-04-18 22:19:00,202 - __main__ - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
341
+ 2025-04-18 22:19:01,317 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
342
+ 2025-04-18 22:19:10,383 - __main__ - INFO - Loading PEFT model from llama_fine_tuned
343
+ 2025-04-18 22:19:10,680 - __main__ - INFO - Successfully loaded PEFT model
344
+ 2025-04-18 22:19:11,731 - __main__ - INFO - Loading summary model
345
+ 2025-04-18 22:19:20,329 - __main__ - INFO - Summary model loaded successfully
346
+ 2025-04-18 22:19:20,329 - __main__ - INFO - Initializing FlowManager
347
+ 2025-04-18 22:19:20,329 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
348
+ 2025-04-18 22:19:20,343 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
349
+ 2025-04-18 22:19:23,597 - __main__ - INFO - Setting up FAISS vector database
350
+ 2025-04-18 22:19:23,599 - faiss.loader - INFO - Loading faiss with AVX2 support.
351
+ 2025-04-18 22:19:23,655 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
352
+ 2025-04-18 22:19:23,661 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
353
+ 2025-04-18 22:19:23,686 - __main__ - INFO - Loaded existing vector database
354
+ 2025-04-18 22:19:23,687 - __main__ - INFO - All models and components initialized successfully
355
+ 2025-04-18 22:19:23,688 - conversation_flow - INFO - Initialized new session for user cli_user_20250418221923
356
+ 2025-04-18 22:19:23,688 - __main__ - INFO - Session started for user cli_user_20250418221923
357
+ 2025-04-18 22:22:44,393 - conversation_flow - WARNING - Failed to parse session characteristics from LLM
358
+ 2025-04-18 22:32:18,080 - __main__ - ERROR - Failed to generate session summary: 'str' object has no attribute 'items'
359
+ 2025-04-19 20:34:55,476 - claude - INFO - Using device: cpu
360
+ 2025-04-19 20:34:55,476 - claude - INFO - Loading emotion detection model
361
+ 2025-04-19 20:34:57,058 - claude - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
362
+ 2025-04-19 20:35:34,008 - claude - INFO - Loading PEFT model from llama_fine_tuned
363
+ 2025-04-19 20:35:34,086 - bitsandbytes.cextension - WARNING - The installed version of bitsandbytes was compiled without GPU support. 8-bit optimizers, 8-bit multiplication, and GPU quantization are unavailable.
364
+ 2025-04-19 20:35:34,458 - claude - INFO - Successfully loaded PEFT model
365
+ 2025-04-19 20:35:37,385 - claude - INFO - Loading summary model
366
+ 2025-04-19 20:35:38,798 - claude - INFO - Summary model loaded successfully
367
+ 2025-04-19 20:35:38,799 - claude - INFO - Initializing FlowManager
368
+ 2025-04-19 20:35:38,799 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
369
+ 2025-04-19 20:35:38,810 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
370
+ 2025-04-19 20:35:41,743 - claude - INFO - Setting up FAISS vector database
371
+ 2025-04-19 20:35:41,750 - faiss.loader - INFO - Loading faiss.
372
+ 2025-04-19 20:35:42,519 - faiss.loader - INFO - Successfully loaded faiss.
373
+ 2025-04-19 20:35:42,533 - claude - INFO - Loaded existing vector database
374
+ 2025-04-19 20:35:42,535 - claude - INFO - All models and components initialized successfully
375
+ 2025-04-19 20:37:49,972 - claude - INFO - Using device: cuda
376
+ 2025-04-19 20:37:49,973 - claude - INFO - Loading emotion detection model
377
+ 2025-04-19 20:37:50,809 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
378
+ 2025-04-19 20:37:51,983 - claude - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
379
+ 2025-04-19 20:37:53,346 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
380
+ 2025-04-19 20:38:03,080 - claude - INFO - Loading PEFT model from llama_fine_tuned
381
+ 2025-04-19 20:38:03,408 - claude - INFO - Successfully loaded PEFT model
382
+ 2025-04-19 20:38:04,549 - claude - INFO - Loading summary model
383
+ 2025-04-19 20:38:07,765 - claude - INFO - Summary model loaded successfully
384
+ 2025-04-19 20:38:07,765 - claude - INFO - Initializing FlowManager
385
+ 2025-04-19 20:38:07,766 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
386
+ 2025-04-19 20:38:07,772 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
387
+ 2025-04-19 20:38:10,738 - claude - INFO - Setting up FAISS vector database
388
+ 2025-04-19 20:38:10,742 - faiss.loader - INFO - Loading faiss with AVX2 support.
389
+ 2025-04-19 20:38:10,812 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
390
+ 2025-04-19 20:38:10,822 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
391
+ 2025-04-19 20:38:10,825 - claude - INFO - Loaded existing vector database
392
+ 2025-04-19 20:38:10,827 - claude - INFO - All models and components initialized successfully
393
+ 2025-04-19 20:40:27,294 - claude - INFO - Using device: cuda
394
+ 2025-04-19 20:40:27,295 - claude - INFO - Loading emotion detection model
395
+ 2025-04-19 20:40:27,946 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
396
+ 2025-04-19 20:40:28,924 - claude - INFO - Loading LLM model: meta-llama/Llama-3.2-3B-Instruct
397
+ 2025-04-19 20:40:30,351 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
398
+ 2025-04-19 20:40:39,190 - claude - INFO - Loading PEFT model from llama_fine_tuned
399
+ 2025-04-19 20:40:39,529 - claude - INFO - Successfully loaded PEFT model
400
+ 2025-04-19 20:40:40,687 - claude - INFO - Loading summary model
401
+ 2025-04-19 20:40:43,582 - claude - INFO - Summary model loaded successfully
402
+ 2025-04-19 20:40:43,583 - claude - INFO - Initializing FlowManager
403
+ 2025-04-19 20:40:43,584 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
404
+ 2025-04-19 20:40:43,589 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
405
+ 2025-04-19 20:40:47,142 - claude - INFO - Setting up FAISS vector database
406
+ 2025-04-19 20:40:47,146 - faiss.loader - INFO - Loading faiss with AVX2 support.
407
+ 2025-04-19 20:40:47,206 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
408
+ 2025-04-19 20:40:47,214 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
409
+ 2025-04-19 20:40:47,218 - claude - INFO - Loaded existing vector database
410
+ 2025-04-19 20:40:47,219 - claude - INFO - All models and components initialized successfully
411
+ 2025-04-19 20:41:52,584 - conversation_flow - INFO - Initialized new session for user user_1
412
+ 2025-04-19 20:41:52,585 - claude - INFO - Session started for user user_1
413
+ 2025-04-19 20:44:44,549 - conversation_flow - INFO - Initialized new session for user test_user_20250419204444
414
+ 2025-04-19 20:44:44,550 - claude - INFO - Session started for user test_user_20250419204444
415
+ 2025-04-19 20:51:44,998 - conversation_flow - INFO - Initialized new session for user user_1
416
+ 2025-04-19 20:51:44,998 - claude - INFO - Session started for user user_1
417
+ 2025-04-19 21:22:26,351 - chatbot - INFO - Using device: cuda
418
+ 2025-04-19 21:22:26,352 - chatbot - INFO - Loading emotion detection model
419
+ 2025-04-19 21:22:27,213 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
420
+ 2025-04-19 21:22:28,233 - chatbot - INFO - Loading LLAMA model: meta-llama/Llama-3.2-3B-Instruct
421
+ 2025-04-19 21:22:29,310 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
422
+ 2025-04-19 21:22:38,570 - chatbot - INFO - Loading PEFT model from llama_fine_tuned
423
+ 2025-04-19 21:22:38,810 - chatbot - INFO - Successfully loaded PEFT model
424
+ 2025-04-19 21:22:39,906 - chatbot - INFO - Loading summary model
425
+ 2025-04-19 21:22:43,171 - chatbot - INFO - Summary model loaded successfully
426
+ 2025-04-19 21:22:43,171 - chatbot - INFO - Initializing FlowManager
427
+ 2025-04-19 21:22:43,172 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
428
+ 2025-04-19 21:22:43,177 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
429
+ 2025-04-19 21:22:46,003 - chatbot - INFO - Setting up FAISS vector database
430
+ 2025-04-19 21:22:46,007 - faiss.loader - INFO - Loading faiss with AVX2 support.
431
+ 2025-04-19 21:22:46,075 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
432
+ 2025-04-19 21:22:46,082 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
433
+ 2025-04-19 21:22:46,085 - chatbot - INFO - Loaded existing vector database
434
+ 2025-04-19 21:22:46,086 - chatbot - INFO - All models and components initialized successfully
435
+ 2025-04-19 21:24:58,360 - chatbot - INFO - Using device: cuda
436
+ 2025-04-19 21:24:58,360 - chatbot - INFO - Loading emotion detection model
437
+ 2025-04-19 21:24:59,401 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
438
+ 2025-04-19 21:25:00,460 - chatbot - INFO - Loading LLAMA model: meta-llama/Llama-3.2-3B-Instruct
439
+ 2025-04-19 21:25:01,753 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
440
+ 2025-04-19 21:25:11,213 - chatbot - INFO - Loading PEFT model from llama_fine_tuned
441
+ 2025-04-19 21:25:11,506 - chatbot - INFO - Successfully loaded PEFT model
442
+ 2025-04-19 21:25:12,577 - chatbot - INFO - Loading summary model
443
+ 2025-04-19 21:25:15,561 - chatbot - INFO - Summary model loaded successfully
444
+ 2025-04-19 21:25:15,561 - chatbot - INFO - Initializing FlowManager
445
+ 2025-04-19 21:25:15,562 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
446
+ 2025-04-19 21:25:15,568 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
447
+ 2025-04-19 21:25:18,810 - chatbot - INFO - Setting up FAISS vector database
448
+ 2025-04-19 21:25:18,814 - faiss.loader - INFO - Loading faiss with AVX2 support.
449
+ 2025-04-19 21:25:18,875 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
450
+ 2025-04-19 21:25:18,882 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
451
+ 2025-04-19 21:25:18,886 - chatbot - INFO - Loaded existing vector database
452
+ 2025-04-19 21:25:18,887 - chatbot - INFO - All models and components initialized successfully
453
+ 2025-04-19 21:25:45,461 - conversation_flow - INFO - Initialized new session for user test_user_20250419212545
454
+ 2025-04-19 21:25:45,462 - chatbot - INFO - Session started for user test_user_20250419212545
455
+ 2025-04-19 21:26:52,439 - conversation_flow - INFO - Initialized new session for user user_1
456
+ 2025-04-19 21:26:52,439 - chatbot - INFO - Session started for user user_1
457
+ 2025-04-19 23:03:08,804 - chatbot - INFO - Using device: cuda
458
+ 2025-04-19 23:03:08,805 - chatbot - INFO - Loading emotion detection model
459
+ 2025-04-19 23:03:09,619 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
460
+ 2025-04-19 23:03:10,663 - chatbot - INFO - Loading LLAMA model: meta-llama/Llama-3.2-3B-Instruct
461
+ 2025-04-19 23:03:11,775 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
462
+ 2025-04-19 23:03:21,298 - chatbot - INFO - Loading PEFT model from Hugging Face
463
+ 2025-04-19 23:03:34,745 - chatbot - INFO - Successfully loaded PEFT model from Hugging Face
464
+ 2025-04-19 23:03:35,902 - chatbot - INFO - Loading summary model
465
+ 2025-04-19 23:03:39,040 - chatbot - INFO - Summary model loaded successfully
466
+ 2025-04-19 23:03:39,040 - chatbot - INFO - Initializing FlowManager
467
+ 2025-04-19 23:03:39,041 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
468
+ 2025-04-19 23:03:39,042 - chatbot - INFO - Setting up FAISS vector database
469
+ 2025-04-19 23:05:16,618 - chatbot - INFO - Using device: cuda
470
+ 2025-04-19 23:05:16,618 - chatbot - INFO - Initializing embeddings
471
+ 2025-04-19 23:05:16,623 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-mpnet-base-v2
472
+ 2025-04-19 23:19:58,437 - chatbot - INFO - Using device: cuda
473
+ 2025-04-19 23:19:58,437 - chatbot - INFO - Loading emotion detection model
474
+ 2025-04-19 23:19:59,596 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
475
+ 2025-04-19 23:20:00,656 - chatbot - INFO - Loading LLAMA model: meta-llama/Llama-3.2-3B-Instruct
476
+ 2025-04-19 23:20:03,317 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
477
+ 2025-04-19 23:20:13,381 - chatbot - INFO - Loading PEFT model from Hugging Face
478
+ 2025-04-19 23:20:14,931 - chatbot - INFO - Successfully loaded PEFT model from Hugging Face
479
+ 2025-04-19 23:20:16,129 - chatbot - INFO - Loading summary model
480
+ 2025-04-19 23:20:20,632 - chatbot - INFO - Summary model loaded successfully
481
+ 2025-04-19 23:20:20,633 - chatbot - INFO - Initializing FlowManager
482
+ 2025-04-19 23:20:20,633 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
483
+ 2025-04-19 23:20:20,634 - chatbot - INFO - Setting up FAISS vector database
484
+ 2025-04-19 23:20:20,635 - chatbot - INFO - Initializing embeddings
485
+ 2025-04-19 23:20:20,639 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-mpnet-base-v2
486
+ 2025-04-19 23:25:29,794 - faiss.loader - INFO - Loading faiss with AVX2 support.
487
+ 2025-04-19 23:25:29,868 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
488
+ 2025-04-19 23:25:29,877 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
489
+ 2025-04-19 23:25:29,881 - chatbot - INFO - Loaded existing vector database
490
+ 2025-04-19 23:25:29,883 - chatbot - INFO - All models and components initialized successfully
491
+ 2025-04-19 23:27:45,651 - conversation_flow - INFO - Initialized new session for user test_user_20250419232745
492
+ 2025-04-19 23:27:45,652 - chatbot - INFO - Session started for user test_user_20250419232745
493
+ 2025-04-19 23:27:46,102 - chatbot - ERROR - Error retrieving guidelines:
494
+ 2025-04-19 23:27:46,143 - chatbot - ERROR - Error retrieving context:
495
+ 2025-04-19 23:29:51,225 - conversation_flow - INFO - Initialized new session for user test_user_20250419232951
496
+ 2025-04-19 23:29:51,226 - chatbot - INFO - Session started for user test_user_20250419232951
497
+ 2025-04-19 23:29:51,303 - chatbot - ERROR - Error retrieving guidelines:
498
+ 2025-04-19 23:29:51,342 - chatbot - ERROR - Error retrieving context:
499
+ 2025-04-19 23:31:21,388 - chatbot - INFO - Using device: cuda
500
+ 2025-04-19 23:31:21,389 - chatbot - INFO - Loading emotion detection model
501
+ 2025-04-19 23:31:22,154 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
502
+ 2025-04-19 23:31:23,228 - chatbot - INFO - Loading LLAMA model: meta-llama/Llama-3.2-3B-Instruct
503
+ 2025-04-19 23:31:24,418 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
504
+ 2025-04-19 23:31:33,745 - chatbot - INFO - Loading PEFT model from llama_fine_tuned
505
+ 2025-04-19 23:31:34,056 - chatbot - INFO - Successfully loaded PEFT model
506
+ 2025-04-19 23:31:35,096 - chatbot - INFO - Loading summary model
507
+ 2025-04-19 23:31:38,449 - chatbot - INFO - Summary model loaded successfully
508
+ 2025-04-19 23:31:38,449 - chatbot - INFO - Initializing FlowManager
509
+ 2025-04-19 23:31:38,449 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
510
+ 2025-04-19 23:31:38,454 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
511
+ 2025-04-19 23:31:40,970 - chatbot - INFO - Setting up FAISS vector database
512
+ 2025-04-19 23:31:40,975 - faiss.loader - INFO - Loading faiss with AVX2 support.
513
+ 2025-04-19 23:31:41,041 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
514
+ 2025-04-19 23:31:41,049 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
515
+ 2025-04-19 23:31:41,053 - chatbot - INFO - Loaded existing vector database
516
+ 2025-04-19 23:31:41,054 - chatbot - INFO - All models and components initialized successfully
517
+ 2025-04-19 23:31:58,013 - conversation_flow - INFO - Initialized new session for user test_user_20250419233158
518
+ 2025-04-19 23:31:58,014 - chatbot - INFO - Session started for user test_user_20250419233158
519
+ 2025-04-20 16:45:15,627 - chatbot - INFO - Using device: cuda
520
+ 2025-04-20 16:45:15,628 - chatbot - INFO - Loading emotion detection model
521
+ 2025-04-20 16:45:16,472 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
522
+ 2025-04-20 16:45:17,756 - chatbot - INFO - Loading LLAMA model: meta-llama/Llama-3.2-3B-Instruct
523
+ 2025-04-20 16:45:19,008 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
524
+ 2025-04-20 16:45:29,740 - chatbot - INFO - Loading PEFT model from llama_fine_tuned
525
+ 2025-04-20 16:45:30,062 - chatbot - INFO - Successfully loaded PEFT model
526
+ 2025-04-20 16:45:31,256 - chatbot - INFO - Loading summary model
527
+ 2025-04-20 16:45:34,713 - chatbot - INFO - Summary model loaded successfully
528
+ 2025-04-20 16:45:34,714 - chatbot - INFO - Initializing FlowManager
529
+ 2025-04-20 16:45:34,714 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
530
+ 2025-04-20 16:45:34,719 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
531
+ 2025-04-20 16:45:38,475 - chatbot - INFO - Setting up FAISS vector database
532
+ 2025-04-20 16:45:38,480 - faiss.loader - INFO - Loading faiss with AVX2 support.
533
+ 2025-04-20 16:45:38,549 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
534
+ 2025-04-20 16:45:38,557 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
535
+ 2025-04-20 16:45:38,577 - chatbot - INFO - Loaded existing vector database
536
+ 2025-04-20 16:45:38,616 - chatbot - INFO - All models and components initialized successfully
537
+ 2025-04-20 16:45:51,568 - conversation_flow - INFO - Initialized new session for user test_user_20250420164551
538
+ 2025-04-20 16:45:51,569 - chatbot - INFO - Session started for user test_user_20250420164551
539
+ 2025-04-20 17:02:33,896 - chatbot - INFO - Using device: cuda
540
+ 2025-04-20 17:02:33,896 - chatbot - INFO - Loading emotion detection model
541
+ 2025-04-20 17:02:34,494 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
542
+ 2025-04-20 17:02:35,523 - chatbot - INFO - Loading LLAMA model: nada013/mental-health-chatbot
543
+ 2025-04-20 17:06:34,211 - chatbot - INFO - Using device: cuda
544
+ 2025-04-20 17:06:34,211 - chatbot - INFO - Loading emotion detection model
545
+ 2025-04-20 17:06:35,028 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
546
+ 2025-04-20 17:06:35,678 - chatbot - INFO - Loading LLAMA model: nada013/mental-health-chatbot
547
+ 2025-04-20 17:08:08,185 - chatbot - INFO - Using device: cuda
548
+ 2025-04-20 17:08:08,185 - chatbot - INFO - Loading emotion detection model
549
+ 2025-04-20 17:08:08,734 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
550
+ 2025-04-20 17:08:09,362 - chatbot - INFO - Loading LLAMA model: nada013/mental-health-chatbot
551
+ 2025-04-20 17:10:44,643 - chatbot - INFO - Using device: cuda
552
+ 2025-04-20 17:10:44,644 - chatbot - INFO - Loading emotion detection model
553
+ 2025-04-20 17:10:45,420 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
554
+ 2025-04-20 17:10:46,075 - chatbot - INFO - Loading LLAMA model: meta-llama/Llama-3.2-3B-Instruct
555
+ 2025-04-20 17:10:47,089 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
556
+ 2025-04-20 17:10:56,278 - chatbot - INFO - Loading PEFT model from llama_fine_tuned
557
+ 2025-04-20 17:10:56,578 - chatbot - INFO - Successfully loaded PEFT model
558
+ 2025-04-20 17:10:57,961 - chatbot - INFO - Loading summary model
559
+ 2025-04-20 17:11:01,854 - chatbot - INFO - Summary model loaded successfully
560
+ 2025-04-20 17:11:01,855 - chatbot - INFO - Initializing FlowManager
561
+ 2025-04-20 17:11:01,855 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
562
+ 2025-04-20 17:11:01,861 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
563
+ 2025-04-20 17:15:58,375 - chatbot - INFO - Using device: cuda
564
+ 2025-04-20 17:15:58,375 - chatbot - INFO - Loading emotion detection model
565
+ 2025-04-20 17:15:59,033 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
566
+ 2025-04-20 17:16:00,047 - chatbot - INFO - Loading LLAMA model: meta-llama/Llama-3.2-3B-Instruct
567
+ 2025-04-20 17:16:00,049 - chatbot - INFO - Loading base model: meta-llama/Llama-3.2-3B-Instruct
568
+ 2025-04-20 17:16:00,443 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
569
+ 2025-04-20 17:16:09,546 - chatbot - INFO - Loading tokenizer
570
+ 2025-04-20 17:16:10,189 - chatbot - INFO - Loading PEFT model from llama_fine_tuned
571
+ 2025-04-20 17:16:10,492 - chatbot - INFO - Successfully loaded PEFT model
572
+ 2025-04-20 17:16:10,495 - chatbot - INFO - Loading summary model
573
+ 2025-04-20 17:16:13,817 - chatbot - INFO - Summary model loaded successfully
574
+ 2025-04-20 17:16:13,817 - chatbot - INFO - Initializing FlowManager
575
+ 2025-04-20 17:16:13,817 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
576
+ 2025-04-20 17:16:13,824 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
577
+ 2025-04-20 17:16:16,265 - chatbot - INFO - Setting up FAISS vector database
578
+ 2025-04-20 17:16:16,269 - faiss.loader - INFO - Loading faiss with AVX2 support.
579
+ 2025-04-20 17:16:16,336 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
580
+ 2025-04-20 17:16:16,344 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
581
+ 2025-04-20 17:16:16,348 - chatbot - INFO - Loaded existing vector database
582
+ 2025-04-20 17:16:16,350 - chatbot - INFO - All models and components initialized successfully
583
+ 2025-04-20 17:16:30,886 - conversation_flow - INFO - Initialized new session for user test_user_20250420171630
584
+ 2025-04-20 17:16:30,886 - chatbot - INFO - Session started for user test_user_20250420171630
585
+ 2025-04-28 13:34:59,994 - chatbot - INFO - Using device: cuda
586
+ 2025-04-28 13:34:59,994 - chatbot - INFO - Loading emotion detection model
587
+ 2025-04-28 13:35:00,828 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
588
+ 2025-04-28 13:35:01,872 - chatbot - INFO - Loading LLAMA model: meta-llama/Llama-3.2-3B-Instruct
589
+ 2025-04-28 13:35:01,872 - chatbot - INFO - Loading base model: meta-llama/Llama-3.2-3B-Instruct
590
+ 2025-04-28 13:35:02,679 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
591
+ 2025-04-28 13:35:12,296 - chatbot - INFO - Loading tokenizer
592
+ 2025-04-28 13:35:12,905 - chatbot - INFO - Loading PEFT model from llama_fine_tuned
593
+ 2025-04-28 13:35:13,188 - chatbot - INFO - Successfully loaded PEFT model
594
+ 2025-04-28 13:35:13,190 - chatbot - INFO - Loading summary model
595
+ 2025-04-28 13:35:16,525 - chatbot - INFO - Summary model loaded successfully
596
+ 2025-04-28 13:35:16,525 - chatbot - INFO - Initializing FlowManager
597
+ 2025-04-28 13:35:16,525 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
598
+ 2025-04-28 13:35:16,539 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
599
+ 2025-04-28 13:35:19,329 - chatbot - INFO - Setting up FAISS vector database
600
+ 2025-04-28 13:35:19,331 - faiss.loader - INFO - Loading faiss with AVX2 support.
601
+ 2025-04-28 13:35:19,388 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
602
+ 2025-04-28 13:35:19,410 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
603
+ 2025-04-28 13:35:19,413 - chatbot - INFO - Loaded existing vector database
604
+ 2025-04-28 13:35:19,414 - chatbot - INFO - All models and components initialized successfully
605
+ 2025-04-28 13:39:27,168 - conversation_flow - INFO - Initialized new session for user test_user_20250428133927
606
+ 2025-04-28 13:39:27,169 - chatbot - INFO - Session started for user test_user_20250428133927
607
+ 2025-04-28 13:40:44,006 - conversation_flow - INFO - Initialized new session for user test_user_20250428134044
608
+ 2025-04-28 13:40:44,006 - chatbot - INFO - Session started for user test_user_20250428134044
609
+ 2025-04-28 13:40:58,313 - conversation_flow - INFO - Initialized new session for user test_user_20250428134058
610
+ 2025-04-28 13:40:58,313 - chatbot - INFO - Session started for user test_user_20250428134058
611
+ 2025-04-28 13:41:15,559 - conversation_flow - INFO - Initialized new session for user test_user_20250428134115
612
+ 2025-04-28 13:41:15,559 - chatbot - INFO - Session started for user test_user_20250428134115
613
+ 2025-04-28 13:41:26,562 - conversation_flow - INFO - Initialized new session for user test_user_20250428134126
614
+ 2025-04-28 13:41:26,562 - chatbot - INFO - Session started for user test_user_20250428134126
615
+ 2025-04-29 15:58:20,478 - chatbot - INFO - Using device: cuda
616
+ 2025-04-29 15:58:20,478 - chatbot - INFO - Loading emotion detection model
617
+ 2025-04-29 15:58:21,237 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
618
+ 2025-04-29 15:58:22,337 - chatbot - INFO - Loading LLAMA model: meta-llama/Llama-3.2-3B-Instruct
619
+ 2025-04-29 15:58:22,337 - chatbot - INFO - Loading base model: meta-llama/Llama-3.2-3B-Instruct
620
+ 2025-04-29 15:58:22,808 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
621
+ 2025-04-29 15:58:32,763 - chatbot - INFO - Loading tokenizer
622
+ 2025-04-29 15:58:33,379 - chatbot - INFO - Loading PEFT model from llama_fine_tuned
623
+ 2025-04-29 15:58:33,710 - chatbot - INFO - Successfully loaded PEFT model
624
+ 2025-04-29 15:58:33,719 - chatbot - INFO - Loading summary model
625
+ 2025-04-29 15:58:37,407 - chatbot - INFO - Summary model loaded successfully
626
+ 2025-04-29 15:58:37,407 - chatbot - INFO - Initializing FlowManager
627
+ 2025-04-29 15:58:37,407 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
628
+ 2025-04-29 15:58:37,412 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
629
+ 2025-04-29 15:58:40,787 - chatbot - INFO - Setting up FAISS vector database
630
+ 2025-04-29 15:58:40,787 - faiss.loader - INFO - Loading faiss with AVX2 support.
631
+ 2025-04-29 15:58:40,854 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
632
+ 2025-04-29 15:58:40,866 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
633
+ 2025-04-29 15:58:40,886 - chatbot - INFO - Loaded existing vector database
634
+ 2025-04-29 15:58:40,921 - chatbot - INFO - All models and components initialized successfully
635
+ 2025-04-29 15:58:48,473 - conversation_flow - INFO - Initialized new session for user test_user_20250429155848
636
+ 2025-04-29 15:58:48,474 - chatbot - INFO - Session started for user test_user_20250429155848
637
+ 2025-04-29 16:00:15,773 - conversation_flow - INFO - Initialized new session for user test_user_20250429160015
638
+ 2025-04-29 16:00:15,773 - chatbot - INFO - Session started for user test_user_20250429160015
639
+ 2025-04-29 16:00:21,695 - conversation_flow - INFO - Initialized new session for user test_user_20250429160021
640
+ 2025-04-29 16:00:21,695 - chatbot - INFO - Session started for user test_user_20250429160021
641
+ 2025-04-29 16:00:51,180 - conversation_flow - INFO - Initialized new session for user test_user_20250429160051
642
+ 2025-04-29 16:00:51,181 - chatbot - INFO - Session started for user test_user_20250429160051
643
+ 2025-04-29 16:01:00,644 - conversation_flow - INFO - Initialized new session for user test_user_20250429160100
644
+ 2025-04-29 16:01:00,646 - chatbot - INFO - Session started for user test_user_20250429160100
645
+ 2025-04-29 16:21:58,912 - chatbot - INFO - Using device: cuda
646
+ 2025-04-29 16:21:58,914 - chatbot - INFO - Loading emotion detection model
647
+ 2025-04-29 16:21:59,457 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
648
+ 2025-04-29 16:22:00,437 - chatbot - INFO - Loading LLAMA model: meta-llama/Llama-3.2-3B-Instruct
649
+ 2025-04-29 16:22:00,442 - chatbot - INFO - Loading base model: meta-llama/Llama-3.2-3B-Instruct
650
+ 2025-04-29 16:22:00,825 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
651
+ 2025-04-29 16:22:09,543 - chatbot - INFO - Loading tokenizer
652
+ 2025-04-29 16:22:10,198 - chatbot - INFO - Loading PEFT model from llama_fine_tuned
653
+ 2025-04-29 16:22:10,470 - chatbot - INFO - Successfully loaded PEFT model
654
+ 2025-04-29 16:22:10,476 - chatbot - INFO - Loading summary model
655
+ 2025-04-29 16:22:13,691 - chatbot - INFO - Summary model loaded successfully
656
+ 2025-04-29 16:22:13,691 - chatbot - INFO - Initializing FlowManager
657
+ 2025-04-29 16:22:13,691 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
658
+ 2025-04-29 16:22:13,691 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
659
+ 2025-04-29 16:22:16,133 - chatbot - INFO - Setting up FAISS vector database
660
+ 2025-04-29 16:22:16,137 - faiss.loader - INFO - Loading faiss with AVX2 support.
661
+ 2025-04-29 16:22:16,196 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
662
+ 2025-04-29 16:22:16,209 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
663
+ 2025-04-29 16:22:16,212 - chatbot - INFO - Loaded existing vector database
664
+ 2025-04-29 16:22:16,215 - chatbot - INFO - All models and components initialized successfully
665
+ 2025-04-29 16:22:22,075 - conversation_flow - INFO - Initialized new session for user test_user_20250429162222
666
+ 2025-04-29 16:22:22,075 - chatbot - INFO - Session started for user test_user_20250429162222
667
+ 2025-04-29 16:23:35,362 - conversation_flow - INFO - Initialized new session for user test_user_20250429162335
668
+ 2025-04-29 16:23:35,362 - chatbot - INFO - Session started for user test_user_20250429162335
669
+ 2025-05-05 15:58:46,808 - chatbot - INFO - Using device: cuda
670
+ 2025-05-05 15:58:46,808 - chatbot - INFO - Loading emotion detection model
671
+ 2025-05-05 15:58:47,521 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
672
+ 2025-05-05 15:58:48,527 - chatbot - INFO - Loading LLAMA model: meta-llama/Llama-3.2-3B-Instruct
673
+ 2025-05-05 15:58:48,529 - chatbot - INFO - Loading base model: meta-llama/Llama-3.2-3B-Instruct
674
+ 2025-05-05 15:58:49,091 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
675
+ 2025-05-05 15:58:59,105 - chatbot - INFO - Loading tokenizer
676
+ 2025-05-05 15:58:59,889 - chatbot - INFO - Loading PEFT model from llama_fine_tuned
677
+ 2025-05-05 15:59:00,199 - chatbot - INFO - Successfully loaded PEFT model
678
+ 2025-05-05 15:59:00,213 - chatbot - INFO - Loading summary model
679
+ 2025-05-05 15:59:04,061 - chatbot - INFO - Summary model loaded successfully
680
+ 2025-05-05 15:59:04,061 - chatbot - INFO - Initializing FlowManager
681
+ 2025-05-05 15:59:04,061 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
682
+ 2025-05-05 15:59:04,070 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
683
+ 2025-05-05 15:59:06,850 - chatbot - INFO - Setting up FAISS vector database
684
+ 2025-05-05 15:59:06,855 - faiss.loader - INFO - Loading faiss with AVX2 support.
685
+ 2025-05-05 15:59:06,923 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
686
+ 2025-05-05 15:59:06,931 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
687
+ 2025-05-05 15:59:06,934 - chatbot - INFO - Loaded existing vector database
688
+ 2025-05-05 15:59:06,938 - chatbot - INFO - All models and components initialized successfully
689
+ 2025-05-05 15:59:36,221 - conversation_flow - INFO - Initialized new session for user test_user_20250505155936
690
+ 2025-05-05 15:59:36,222 - chatbot - INFO - Session started for user test_user_20250505155936
691
+ 2025-05-05 16:00:50,390 - conversation_flow - INFO - Initialized new session for user test_user_20250505160050
692
+ 2025-05-05 16:00:50,390 - chatbot - INFO - Session started for user test_user_20250505160050
693
+ 2025-05-05 16:11:01,134 - conversation_flow - INFO - Initialized new session for user test_user_20250505161101
694
+ 2025-05-05 16:11:01,134 - chatbot - INFO - Session started for user test_user_20250505161101
695
+ 2025-05-05 16:12:10,864 - conversation_flow - INFO - Initialized new session for user test_user_20250505161210
696
+ 2025-05-05 16:12:10,864 - chatbot - INFO - Session started for user test_user_20250505161210
697
+ 2025-05-05 16:12:21,792 - conversation_flow - INFO - Initialized new session for user test_user_20250505161221
698
+ 2025-05-05 16:12:21,792 - chatbot - INFO - Session started for user test_user_20250505161221
699
+ 2025-05-05 16:12:40,152 - conversation_flow - INFO - Initialized new session for user test_user_20250505161240
700
+ 2025-05-05 16:12:40,153 - chatbot - INFO - Session started for user test_user_20250505161240
701
+ 2025-05-05 16:13:00,354 - conversation_flow - INFO - Initialized new session for user test_user_20250505161300
702
+ 2025-05-05 16:13:00,356 - chatbot - INFO - Session started for user test_user_20250505161300
703
+ 2025-05-05 16:14:16,781 - conversation_flow - INFO - Initialized new session for user test_user
704
+ 2025-05-05 16:14:16,782 - chatbot - INFO - Session started for user test_user
705
+ 2025-05-05 16:17:17,077 - conversation_flow - INFO - Initialized new session for user test_user_20250505161717
706
+ 2025-05-05 16:17:17,077 - chatbot - INFO - Session started for user test_user_20250505161717
707
+ 2025-05-05 16:18:41,059 - conversation_flow - INFO - Initialized new session for user test_user_20250505161841
708
+ 2025-05-05 16:18:41,059 - chatbot - INFO - Session started for user test_user_20250505161841
709
+ 2025-05-05 16:20:03,629 - conversation_flow - INFO - Initialized new session for user test_user_20250505162003
710
+ 2025-05-05 16:20:03,629 - chatbot - INFO - Session started for user test_user_20250505162003
711
+ 2025-05-05 17:24:07,961 - chatbot - INFO - Using device: cuda
712
+ 2025-05-05 17:24:07,961 - chatbot - INFO - Loading emotion detection model
713
+ 2025-05-05 17:24:08,587 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
+ 2025-05-05 17:24:09,607 - chatbot - INFO - Loading LLAMA model: meta-llama/Llama-3.2-3B-Instruct
+ 2025-05-05 17:24:09,609 - chatbot - INFO - Loading base model: meta-llama/Llama-3.2-3B-Instruct
+ 2025-05-05 17:24:10,034 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
+ 2025-05-05 17:24:19,176 - chatbot - INFO - Loading tokenizer
+ 2025-05-05 17:24:19,872 - chatbot - INFO - Loading PEFT model from llama_fine_tuned
+ 2025-05-05 17:24:20,182 - chatbot - INFO - Successfully loaded PEFT model
+ 2025-05-05 17:24:20,186 - chatbot - INFO - Loading summary model
+ 2025-05-05 17:24:23,675 - chatbot - INFO - Summary model loaded successfully
+ 2025-05-05 17:24:23,675 - chatbot - INFO - Initializing FlowManager
+ 2025-05-05 17:24:23,675 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
+ 2025-05-05 17:24:23,681 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
+ 2025-05-05 17:24:26,228 - chatbot - INFO - Setting up FAISS vector database
+ 2025-05-05 17:24:26,232 - faiss.loader - INFO - Loading faiss with AVX2 support.
+ 2025-05-05 17:24:26,295 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
+ 2025-05-05 17:24:26,303 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
+ 2025-05-05 17:24:26,306 - chatbot - INFO - Loaded existing vector database
+ 2025-05-05 17:24:26,311 - chatbot - INFO - All models and components initialized successfully
+ 2025-05-05 17:24:31,883 - conversation_flow - INFO - Initialized new session for user test_user_20250505172431
+ 2025-05-05 17:24:31,884 - chatbot - INFO - Session started for user test_user_20250505172431
+ 2025-05-05 17:26:01,109 - conversation_flow - INFO - Initialized new session for user test_user_20250505172601
+ 2025-05-05 17:26:01,109 - chatbot - INFO - Session started for user test_user_20250505172601
+ 2025-05-05 17:27:08,325 - conversation_flow - INFO - Initialized new session for user test_user_20250505172708
+ 2025-05-05 17:27:08,325 - chatbot - INFO - Session started for user test_user_20250505172708
+ 2025-05-05 17:59:17,250 - chatbot - INFO - Using device: cuda
+ 2025-05-05 17:59:17,250 - chatbot - INFO - Loading emotion detection model
+ 2025-05-05 17:59:17,980 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
+ 2025-05-05 17:59:19,037 - chatbot - INFO - Loading LLAMA model: meta-llama/Llama-3.2-3B-Instruct
+ 2025-05-05 17:59:19,039 - chatbot - INFO - Loading base model: meta-llama/Llama-3.2-3B-Instruct
+ 2025-05-05 17:59:19,564 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
+ 2025-05-05 17:59:30,594 - chatbot - INFO - Loading tokenizer
+ 2025-05-05 17:59:31,249 - chatbot - INFO - Loading PEFT model from llama_fine_tuned
+ 2025-05-05 17:59:31,586 - chatbot - INFO - Successfully loaded PEFT model
+ 2025-05-05 17:59:31,591 - chatbot - INFO - Loading summary model
+ 2025-05-05 17:59:35,369 - chatbot - INFO - Summary model loaded successfully
+ 2025-05-05 17:59:35,369 - chatbot - INFO - Initializing FlowManager
+ 2025-05-05 17:59:35,369 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
+ 2025-05-05 17:59:35,376 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
+ 2025-05-05 17:59:37,837 - chatbot - INFO - Setting up FAISS vector database
+ 2025-05-05 17:59:37,841 - faiss.loader - INFO - Loading faiss with AVX2 support.
+ 2025-05-05 17:59:37,911 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
+ 2025-05-05 17:59:37,919 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
+ 2025-05-05 17:59:37,923 - chatbot - INFO - Loaded existing vector database
+ 2025-05-05 17:59:37,929 - chatbot - INFO - All models and components initialized successfully
+ 2025-05-05 18:00:55,372 - chatbot - INFO - Using device: cuda
+ 2025-05-05 18:00:55,373 - chatbot - INFO - Loading emotion detection model
+ 2025-05-05 18:00:56,108 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
+ 2025-05-05 18:00:57,131 - chatbot - INFO - Loading LLAMA model: meta-llama/Llama-3.2-3B-Instruct
+ 2025-05-05 18:00:57,133 - chatbot - INFO - Loading base model: meta-llama/Llama-3.2-3B-Instruct
+ 2025-05-05 18:00:57,876 - accelerate.utils.modeling - INFO - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
+ 2025-05-05 18:01:06,722 - chatbot - INFO - Loading tokenizer
+ 2025-05-05 18:01:07,409 - chatbot - INFO - Loading PEFT model from llama_fine_tuned
+ 2025-05-05 18:01:07,697 - chatbot - INFO - Successfully loaded PEFT model
+ 2025-05-05 18:01:07,701 - chatbot - INFO - Loading summary model
+ 2025-05-05 18:01:11,099 - chatbot - INFO - Summary model loaded successfully
+ 2025-05-05 18:01:11,100 - chatbot - INFO - Initializing FlowManager
+ 2025-05-05 18:01:11,100 - conversation_flow - INFO - Initialized FlowManager with 45 minute sessions
+ 2025-05-05 18:01:11,105 - sentence_transformers.SentenceTransformer - INFO - Load pretrained SentenceTransformer: sentence-transformers/all-MiniLM-L6-v2
+ 2025-05-05 18:01:14,042 - chatbot - INFO - Setting up FAISS vector database
+ 2025-05-05 18:01:14,045 - faiss.loader - INFO - Loading faiss with AVX2 support.
+ 2025-05-05 18:01:14,110 - faiss.loader - INFO - Successfully loaded faiss with AVX2 support.
+ 2025-05-05 18:01:14,117 - faiss - INFO - Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes.
+ 2025-05-05 18:01:14,121 - chatbot - INFO - Loaded existing vector database
+ 2025-05-05 18:01:14,127 - chatbot - INFO - All models and components initialized successfully
+ 2025-05-05 18:06:56,208 - conversation_flow - INFO - Initialized new session for user test_user_20250505180656
+ 2025-05-05 18:06:56,209 - chatbot - INFO - Session started for user test_user_20250505180656
+ 2025-05-05 18:08:27,012 - conversation_flow - INFO - Initialized new session for user test_user_20250505180827
+ 2025-05-05 18:08:27,012 - chatbot - INFO - Session started for user test_user_20250505180827
+ 2025-05-05 18:09:53,803 - conversation_flow - INFO - Initialized new session for user test_user_20250505180953
+ 2025-05-05 18:09:53,803 - chatbot - INFO - Session started for user test_user_20250505180953
requirements.txt ADDED
@@ -0,0 +1,26 @@
+ transformers==4.49.0
+ torch==2.2.0+cu118
+ sentence-transformers==3.4.1
+ langchain==0.3.21
+ langchain-community==0.3.20
+ langchain-core==0.3.47
+ langchain-huggingface==0.1.2
+ pydantic==2.10.6
+ pydantic-settings==2.8.1
+ fastapi==0.115.11
+ uvicorn==0.34.0
+ python-dotenv==1.0.1
+ pytest==7.4.0
+ gunicorn==21.2.0
+ accelerate==1.5.2
+ bitsandbytes==0.45.3
+ chromadb==0.6.3
+ datasets==3.4.1
+ faiss-cpu==1.10.0
+ huggingface-hub==0.29.3
+ peft==0.15.1
+ safetensors==0.5.3
+ tokenizers==0.21.1
+ tiktoken==0.9.0
+ starlette==0.46.1
+ websockets==15.0.1