A newer version of the Gradio SDK is available: 6.13.0
title: AI Data Science Assistant
emoji: π€
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 5.33.2
app_file: app.py
pinned: false
π€ AI Data Science Assistant
A powerful AI assistant for data analysis, document Q&A, and general conversation - completely free and open-source!
π Features
π CSV Data Analysis
- Upload & Analyze: Drop your CSV files for instant analysis
- Smart Visualizations: Auto-generates 4 types of charts:
- Data types distribution (pie chart)
- Missing values analysis (bar chart)
- Numeric distributions (histograms)
- Correlation matrix (heatmap)
- Statistical Summary: Complete descriptive statistics for numeric columns
- Data Preview: Shows dataset shape, columns, and first 5 rows
π PDF Document Q&A (RAG)
- Upload PDFs: Process any PDF document for question answering
- Smart Retrieval: Uses RAG (Retrieval Augmented Generation) with FAISS vector database
- Context-Aware: Answers questions based on actual document content
- Source Attribution: References the source document in responses
- Persistent Memory: Remembers uploaded documents during your session
π¬ General AI Chat
- Claude-Style Responses: Clear, helpful, and honest communication
- No Hallucination: Says "I don't know" when uncertain
- Multi-Turn Conversations: Maintains conversation context
- Wide Knowledge: Helps with coding, explanations, creative tasks, and more
π Try It Now
Live Demo: https://huggingface.co/spaces/mrradix/ai-ds-assistant
π― Use Cases
For Data Scientists & Analysts
- Quick CSV exploration and visualization
- Statistical analysis without writing code
- Data quality assessment (missing values, distributions)
- Correlation analysis between variables
For Researchers & Students
- PDF document analysis and Q&A
- Extract insights from research papers
- Ask questions about uploaded documents
- Get explanations of complex concepts
For General Users
- AI-powered conversations
- Help with various tasks and questions
- Document analysis and summarization
- Data visualization assistance
π οΈ Technical Stack
Models (100% Open Source)
- Language Model:
google/flan-t5-base- Google's instruction-tuned T5 - Embeddings:
sentence-transformers/all-MiniLM-L6-v2- For document retrieval - Vector Store: FAISS - Efficient similarity search
Libraries & Frameworks
- Frontend: Gradio - Interactive web interface
- Data Processing: Pandas, NumPy - Data manipulation
- Visualization: Matplotlib - Chart generation
- AI/ML: Transformers, PyTorch - Model inference
- RAG Pipeline: LangChain - Document processing and Q&A
- PDF Processing: PDFMiner - Text extraction
π How to Use
1. CSV Analysis
- Go to the "π CSV Analysis" tab
- Upload your CSV file using the file uploader
- Click "π Analyze CSV"
- View the automatic analysis, statistics, and charts
2. PDF Q&A
- Switch to the "π PDF Q&A" tab
- Upload a PDF document
- Click "π€ Process PDF" and wait for confirmation
- Ask questions about the document content
- Get AI-powered answers based on the document
3. General Chat
- Visit the "π¬ General Chat" tab
- Type your question or message
- Get helpful, Claude-style responses
- Continue the conversation naturally
π¨ Screenshots
CSV Analysis Interface
The CSV tab provides instant data insights with professional visualizations and statistical summaries.
PDF Q&A Interface
Upload any PDF and ask questions - the AI will answer based on the actual document content.
General Chat Interface
Natural conversation interface for any topic or question.
π§ Local Installation
Want to run this locally? Here's how:
# Clone the repository
git clone https://huggingface.co/spaces/mrradix/ai-ds-assistant
cd ai-ds-assistant
# Install dependencies
pip install -r requirements.txt
# Run the application
python app.py
Requirements
- Python 3.8+
- 4GB+ RAM recommended
- GPU optional (will use CPU if not available)
π Key Advantages
β Completely Free
- No API keys required
- No usage limits
- Open-source models only
β Privacy-First
- All processing happens locally (in the Space)
- No data sent to external APIs
- Your documents stay private
β Production-Ready
- Robust error handling
- Professional UI/UX
- Mobile-responsive design
β Educational
- Learn about RAG systems
- Understand AI model deployment
- Explore data science workflows
π€ Contributing
This is an open-source project! Contributions are welcome:
- Report Issues: Found a bug? Open an issue
- Feature Requests: Have an idea? Let's discuss it
- Pull Requests: Improvements and fixes are appreciated
- Documentation: Help improve the docs
π License
This project is licensed under the MIT License - see the LICENSE file for details.
π Acknowledgments
- Google: For the Flan-T5 model
- Hugging Face: For the amazing model hub and Spaces platform
- LangChain: For the RAG framework
- Gradio: For the intuitive interface framework
- Open Source Community: For all the incredible libraries used
π§ Contact & Support
- Creator: @mrradix
- Issues: GitHub Issues
- Discussions: Space Discussions
β If you find this helpful, please give it a star and share with others!
Built with β€οΈ using open-source AI models