File size: 1,810 Bytes
31faeef
 
 
 
 
 
 
 
 
 
 
 
 
 
97380b5
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
---
title: Data Visualization App Using GenerativeAI
emoji: 🔥
colorFrom: indigo
colorTo: purple
sdk: gradio
sdk_version: 5.23.3
app_file: app.py
pinned: false
license: apache-2.0
short_description: Data Visualization app using generative AI
---


# Data Visualization & Cleaning AI

An intelligent data analysis tool that automatically processes CSV, Excel, JSON, and text files to generate insightful visualizations and cleaning recommendations.

## Features

- **Automatic Data Analysis**: Generates statistical summaries, detects outliers, and identifies data quality issues
- **AI-Powered Insights**: Leverages GPT and HuggingFace models to provide cleaning recommendations and analysis insights
- **Interactive Visualizations**: Creates distribution plots, correlation heatmaps, PCA visualizations, and more using Plotly
- **Smart Data Cleaning**: Offers tools to handle missing values, outliers, duplicates, and data type conversions
- **Multi-Format Support**: Works with CSV, Excel, JSON, and text files with intelligent separator detection

## Getting Started

1. Upload your data file in the Data Analysis tab
2. Click "Analyze Data" to generate visualizations and insights
3. Review the AI-powered cleaning recommendations
4. Apply cleaning operations in the Data Cleaning tab
5. Download your cleaned dataset for further analysis

## Technical Details

Built with:
- Pandas and NumPy for data processing
- Plotly, Matplotlib and Seaborn for visualizations
- scikit-learn for PCA and standardization
- OpenAI GPT models for intelligent recommendations
- HuggingFace models for additional insights
- Gradio for the interactive web interface

## Usage Notes

For full AI-powered features, configure your API keys in the Settings tab.
The app can also run in demo mode with reduced AI capabilities.