Spaces:
Build error
Build error
A newer version of the Streamlit SDK is available: 1.58.0
metadata
title: Autism Screening AI
emoji: π§
colorFrom: blue
colorTo: purple
sdk: streamlit
sdk_version: 1.41.0
app_file: streamlit_app.py
pinned: false
π§ AI-Powered Autism Screening System
Early detection of autism spectrum disorder (ASD) using machine learning and explainable AI.
π Project Structure
autism/
βββ data/ # Dataset & data fetching scripts
β βββ autism_screening.csv # Main dataset (704 records)
β βββ fetch_dataset.py # Download script
βββ notebooks/ # Jupyter notebooks
β βββ 01_eda_and_data_loading.ipynb
β βββ 02_model_training.ipynb
β βββ 03_explainability.ipynb
βββ models/ # Saved ML models
βββ results/ # Analysis outputs & visualizations
βββ README.md
π Quick Start
1. Get the Dataset
Option A: Download Automatically
cd data
python fetch_dataset.py
Option B: Download Manually
- Download from Kaggle (704 records)
- Or UCI ML Repository
- Save as
data/autism_screening.csv
Option C: Start with Sample Data
- A sample dataset will be created automatically if real data isn't found
2. Run the Analysis Notebook
# Make sure you're in the project root
jupyter notebook notebooks/01_eda_and_data_loading.ipynb
π What's Included
Notebook 1: EDA & Data Loading
- β Load 704-record autism screening dataset
- β Analyze class balance (autism vs. non-autism)
- β Check for missing values & data completeness
- β Statistical feature analysis
- β Quality assessment report
Notebook 2: Model Training (Coming)
- Build baseline model (Logistic Regression)
- Compare models (Random Forest, SVM, etc.)
- Cross-validation & performance metrics
- Train-test split strategy
Notebook 3: Explainability (Coming)
- SHAP values for feature importance
- Interpretable results for non-technical users
- Risk factor identification
- Confidence scoring
π― Dataset Info
Size: 704 adult screening records
Target: Binary classification (Autism: Yes/No)
Features: ~20-30 features based on screening questionnaires (AQ-10, etc.)
Class Distribution: Typically ~30% positive, ~70% negative
π Questionnaire Features
Common screening features include:
- Social attention & awareness
- Communication patterns
- Focused attention
- Imagination abilities
- Pattern recognition
- Memory for details
- Social relationships
- Anxiety levels
- Voice tone understanding
βοΈ Requirements
pandas
numpy
matplotlib
seaborn
scikit-learn
jupyter
shap (for explainability)
Install all at once:
pip install pandas numpy matplotlib seaborn scikit-learn jupyter shap
π Next Steps
- Load the data β Run Notebook 01
- Explore patterns β Check class balance & features
- Build models β Run Notebook 02
- Explain results β Run Notebook 03
- Deploy UI β Build Streamlit app (optional)
π Disclaimer
β οΈ This tool is for screening support only, not medical diagnosis.
- Always consult with healthcare professionals
- Intended for educational & awareness purposes
- Not a substitute for professional evaluation
π Resources
Ready to explore? Start with Notebook 01! π