Pulastya B committed on
Commit · c9ba5a9
1 Parent(s): 9c0055f
Fixed all the Huggingface issues
README.md
CHANGED
@@ -34,6 +34,12 @@ An intelligent **multi-agent AI system** for automated end-to-end data science w
 34  34   - **Session Memory**: Maintains context across follow-up queries
 35  35   - **Error Recovery**: Graceful fallbacks and parameter validation
 36  36   - **Large Dataset Support**: Automatic sampling for 100K+ row datasets
     37 + - **HuggingFace Export**: Export datasets, models, and outputs directly to your HuggingFace repos
     38 +
     39 + ### Authentication & Integration
     40 + - **Supabase Auth**: Secure user authentication with email/password and OAuth
     41 + - **HuggingFace Integration**: Connect your HF account to export artifacts
     42 + - **Personal Token Support**: Use your own HF write tokens for private uploads
 37  43
 38  44   ## 🏗️ Architecture
 39  45

@@ -95,6 +101,12 @@ An intelligent **multi-agent AI system** for automated end-to-end data science w
 95 101   "What are the key insights from this analysis?"
 96 102   ```
 97 103
    104 + ### HuggingFace Export
    105 + 1. **Connect** your HuggingFace account via Settings → Add your HF token
    106 + 2. **Generate** artifacts (datasets, models, visualizations)
    107 + 3. **Export** directly to your HuggingFace repos from the Assets sidebar
    108 + 4. **Share** your work with the ML community
    109 +
 98 110   ## 🛠️ Tech Stack
 99 111
100 112   | Component | Technology |

@@ -107,6 +119,8 @@ An intelligent **multi-agent AI system** for automated end-to-end data science w
107 119   | **Hyperparameter Tuning** | Optuna with MedianPruner |
108 120   | **Semantic Search** | Sentence-BERT (all-MiniLM-L6-v2) |
109 121   | **Streaming** | Server-Sent Events (SSE) |
    122 + | **Authentication** | Supabase Auth |
    123 + | **Cloud Storage** | HuggingFace Hub API |
110 124
111 125   ## Project Structure
112 126

@@ -117,6 +131,9 @@ src/
117 131   ├── orchestrator.py  # Main workflow orchestration (4500+ lines)
118 132   ├── session_memory.py  # Context persistence across queries
119 133   ├── session_store.py  # Session database management
    134 + ├── storage/
    135 + │   ├── huggingface_storage.py  # HuggingFace Hub integration
    136 + │   └── artifact_store.py  # Local artifact management
120 137   ├── tools/
121 138   │   ├── data_profiling.py  # YData profiling, statistics
122 139   │   ├── data_cleaning.py  # Missing values, outliers

@@ -147,6 +164,10 @@ GROQ_API_KEY=your_groq_key # Alternative
147 164   # Optional
148 165   LLM_PROVIDER=mistral  # mistral, gemini, or groq
149 166   MAX_ITERATIONS=20  # Max workflow steps
    167 +
    168 + # Supabase (for authentication)
    169 + SUPABASE_URL=your_supabase_url
    170 + SUPABASE_ANON_KEY=your_supabase_anon_key
150 171   ```
151 172
152 173   ### HuggingFace Spaces

@@ -213,6 +234,12 @@ Real-time training progress with elapsed time:
213 234   - ✅ **Tool Name Mapping**: Maps 8+ common hallucinated tool names
214 235   - ✅ **NoneType Safety**: Validates all comparison operands
215 236
    237 + ### HuggingFace Integration
    238 + - ✅ **One-Click Export**: Export datasets, models, and outputs to HuggingFace
    239 + - ✅ **Personal Repos**: Auto-creates `ds-agent-data`, `ds-agent-models`, `ds-agent-outputs` repos
    240 + - ✅ **Secure Tokens**: User tokens stored securely in Supabase
    241 + - ✅ **Status Caching**: Efficient HF connection status checking
    242 +
216 243   ## 🐳 Docker Deployment
217 244
218 245   ```dockerfile
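
The "Personal Repos" bullet added in this commit says the app auto-creates `ds-agent-data`, `ds-agent-models`, and `ds-agent-outputs` repos for each user. The commit does not show how `huggingface_storage.py` does this, so the following is only a minimal sketch of one way it could work. The helper name `ensure_user_repos`, the `REPO_TYPES` mapping, and the repo type chosen for each repo are assumptions made for illustration. The only library call relied on is `create_repo` with `exist_ok=True`, as provided by `huggingface_hub.HfApi`.

```python
from typing import Any, Dict, List

# Repo names come from the README bullet. The repo_type paired with each
# name is an assumption; the README does not state it.
REPO_TYPES: Dict[str, str] = {
    "ds-agent-data": "dataset",
    "ds-agent-models": "model",
    "ds-agent-outputs": "dataset",
}


def ensure_user_repos(api: Any, username: str, private: bool = True) -> List[str]:
    """Create the three per-user repos if they do not exist yet.

    `api` is expected to behave like `huggingface_hub.HfApi`; only its
    `create_repo` method is called. Returns the repo ids that were ensured.
    """
    ensured: List[str] = []
    for name, repo_type in REPO_TYPES.items():
        repo_id = f"{username}/{name}"
        # exist_ok=True makes the call idempotent, so repeated exports
        # do not fail when the repo is already there.
        api.create_repo(
            repo_id=repo_id,
            repo_type=repo_type,
            private=private,
            exist_ok=True,
        )
        ensured.append(repo_id)
    return ensured
```

With the real client this would be called as `ensure_user_repos(HfApi(token=user_token), "alice")` after `from huggingface_hub import HfApi`, where `user_token` is the user's own write token. Passing the client in as a parameter keeps the helper testable without network access.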