Pulastya B committed on
Commit
c9ba5a9
·
1 Parent(s): 9c0055f

Fixed all the Huggingface issues

Files changed (1)
  1. README.md +27 -0
README.md CHANGED
@@ -34,6 +34,12 @@ An intelligent **multi-agent AI system** for automated end-to-end data science w
 - **Session Memory**: Maintains context across follow-up queries
 - **Error Recovery**: Graceful fallbacks and parameter validation
 - **Large Dataset Support**: Automatic sampling for 100K+ row datasets
+- **HuggingFace Export**: Export datasets, models, and outputs directly to your HuggingFace repos
+
+### 🔐 Authentication & Integration
+- **Supabase Auth**: Secure user authentication with email/password and OAuth
+- **HuggingFace Integration**: Connect your HF account to export artifacts
+- **Personal Token Support**: Use your own HF write tokens for private uploads
 
 ## 🏗️ Architecture
 
 
@@ -95,6 +101,12 @@ An intelligent **multi-agent AI system** for automated end-to-end data science w
 "What are the key insights from this analysis?"
 ```
 
+### HuggingFace Export
+1. **Connect** your HuggingFace account via Settings → Add your HF token
+2. **Generate** artifacts (datasets, models, visualizations)
+3. **Export** directly to your HuggingFace repos from the Assets sidebar
+4. **Share** your work with the ML community
+
 ## 🛠️ Tech Stack
 
 | Component | Technology |
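
Review note on the export steps in the hunk above: the following is a minimal sketch of how an export to the Hub could look, not the code in `huggingface_storage.py`. The helper names `resolve_repo` and `export_artifact` are hypothetical, and the mapping of `ds-agent-outputs` to a dataset-type repo is an assumption. The repo names themselves come from the README's "HuggingFace Integration" section, and `HfApi.create_repo` and `HfApi.upload_file` are real `huggingface_hub` calls.

```python
from pathlib import Path

# Repo names are taken from the README; the Hub repo type chosen for each
# artifact kind is an assumption made for this sketch.
REPOS = {
    "dataset": ("ds-agent-data", "dataset"),
    "model": ("ds-agent-models", "model"),
    "output": ("ds-agent-outputs", "dataset"),
}


def resolve_repo(username: str, kind: str) -> tuple:
    """Return (repo_id, repo_type) for an artifact kind (hypothetical helper)."""
    try:
        name, repo_type = REPOS[kind]
    except KeyError:
        raise ValueError(f"unknown artifact kind: {kind!r}") from None
    return f"{username}/{name}", repo_type


def export_artifact(token: str, username: str, kind: str, local_path: str) -> str:
    """Create the target repo if it is missing, then upload one file.

    Requires network access and a write token; hypothetical helper.
    """
    # Imported lazily so the naming helper above works without the package.
    from huggingface_hub import HfApi  # third-party: pip install huggingface_hub

    repo_id, repo_type = resolve_repo(username, kind)
    api = HfApi(token=token)
    # exist_ok=True makes repeated exports idempotent ("auto-creates" behaviour).
    api.create_repo(repo_id=repo_id, repo_type=repo_type, private=True, exist_ok=True)
    api.upload_file(
        path_or_fileobj=local_path,
        path_in_repo=Path(local_path).name,
        repo_id=repo_id,
        repo_type=repo_type,
    )
    return repo_id
```

A call such as `export_artifact(token, "alice", "model", "model.pkl")` would target `alice/ds-agent-models`; creating the repo as private is a choice made here, not something the README states.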
 
@@ -107,6 +119,8 @@ An intelligent **multi-agent AI system** for automated end-to-end data science w
 | **Hyperparameter Tuning** | Optuna with MedianPruner |
 | **Semantic Search** | Sentence-BERT (all-MiniLM-L6-v2) |
 | **Streaming** | Server-Sent Events (SSE) |
+| **Authentication** | Supabase Auth |
+| **Cloud Storage** | HuggingFace Hub API |
 
 ## 📁 Project Structure
 
 
@@ -117,6 +131,9 @@ src/
 ├── orchestrator.py # Main workflow orchestration (4500+ lines)
 ├── session_memory.py # Context persistence across queries
 ├── session_store.py # Session database management
+├── storage/
+│   ├── huggingface_storage.py # HuggingFace Hub integration
+│   └── artifact_store.py # Local artifact management
 ├── tools/
 │   ├── data_profiling.py # YData profiling, statistics
 │   ├── data_cleaning.py # Missing values, outliers
 
@@ -147,6 +164,10 @@ GROQ_API_KEY=your_groq_key # Alternative
 # Optional
 LLM_PROVIDER=mistral # mistral, gemini, or groq
 MAX_ITERATIONS=20 # Max workflow steps
+
+# Supabase (for authentication)
+SUPABASE_URL=your_supabase_url
+SUPABASE_ANON_KEY=your_supabase_anon_key
 ```
 
 ### HuggingFace Spaces
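
Review note on the environment block in the hunk above: a minimal sketch of how these variables might be read and validated at startup. The `load_settings` helper, its return shape, and the fallback defaults are hypothetical and not taken from the repository. Only the variable names and the example values (`mistral`, `20`) come from the README.

```python
import os
from typing import Mapping, Optional

# Provider names listed in the README comment; treated as the allowed set (assumption).
ALLOWED_PROVIDERS = {"mistral", "gemini", "groq"}


def load_settings(env: Optional[Mapping[str, str]] = None) -> dict:
    """Hypothetical helper: collect the README's environment variables into a dict."""
    env = os.environ if env is None else env

    # Falling back to the README's example values is an assumption.
    provider = env.get("LLM_PROVIDER", "mistral").strip().lower()
    if provider not in ALLOWED_PROVIDERS:
        raise ValueError(
            f"LLM_PROVIDER must be one of {sorted(ALLOWED_PROVIDERS)}, got {provider!r}"
        )

    max_iterations = int(env.get("MAX_ITERATIONS", "20"))
    if max_iterations < 1:
        raise ValueError("MAX_ITERATIONS must be a positive integer")

    # Treating the Supabase pair as optional is an assumption of this sketch.
    supabase_url = env.get("SUPABASE_URL")
    supabase_anon_key = env.get("SUPABASE_ANON_KEY")

    return {
        "llm_provider": provider,
        "max_iterations": max_iterations,
        "supabase_url": supabase_url,
        "supabase_anon_key": supabase_anon_key,
        "auth_enabled": bool(supabase_url and supabase_anon_key),
    }
```

Passing a plain dict instead of `os.environ` keeps the helper easy to exercise in tests; with both Supabase values missing, this sketch simply reports `auth_enabled` as `False` rather than failing.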
 
@@ -213,6 +234,12 @@ Real-time training progress with elapsed time:
 - ✅ **Tool Name Mapping**: Maps 8+ common hallucinated tool names
 - ✅ **NoneType Safety**: Validates all comparison operands
 
+### HuggingFace Integration
+- ✅ **One-Click Export**: Export datasets, models, and outputs to HuggingFace
+- ✅ **Personal Repos**: Auto-creates `ds-agent-data`, `ds-agent-models`, `ds-agent-outputs` repos
+- ✅ **Secure Tokens**: User tokens stored securely in Supabase
+- ✅ **Status Caching**: Efficient HF connection status checking
+
 ## 🐳 Docker Deployment
 
 ```dockerfile