Spaces:

AI4Research
/

scider

Running

App Files Files Community

scider / streamlit-client /README.md

harry-lu-0708

Rebrand to SciDER, fix subagent output display, disable Reasoning Bank for Streamlit

2b06ef9 2 days ago

preview code

raw

history blame contribute delete

9.15 kB

SciDER Streamlit Chat Interface

A ChatGPT-like web interface for running the complete SciDER workflow with real-time progress tracking and intermediate output visualization.

Features

🎨 Modern Chat Interface: Clean, intuitive interface similar to ChatGPT
💡 Full Workflow Support: Run the complete research pipeline with ideation agent
📊 Real-time Progress: Live updates showing progress for each agent and sub-agent
🔍 Intermediate Outputs: View outputs from each stage:
- Literature search results
- Paper analysis
- Research idea generation
- Novelty assessment
- Data analysis
- Experiment execution and revisions
⚙️ Configurable Settings: Control all workflow parameters through the UI
📁 Session Management: Save and manage research sessions

Installation

1. Install Dependencies

From the streamlit-client directory:

pip install -r requirements.txt

Or install streamlit directly:

pip install streamlit

2. Ensure SciDER is Set Up

Make sure you've completed the main SciDER setup from the parent directory:

cd ..
uv sync --extra mac  # or cpu/cu128 depending on your platform

3. Configure Environment

Ensure your .env file in the parent directory is properly configured with API keys:

cp ../.env.template ../.env
# Edit .env and add your API keys:
# OPENAI_API_KEY=...
# GEMINI_API_KEY=...

Usage

Running the Application

Basic Version (Simple Interface)

streamlit run app.py

Enhanced Version (With Real-time Progress)

streamlit run app_enhanced.py

The application will open in your default web browser at http://localhost:8501

Configuring Your Research

Research Query (Required)
- Enter your research topic or experimental objective
- Example: "transformer models for time series forecasting"
Research Domain (Optional)
- Specify the domain for better context
- Example: "machine learning", "computational biology"
Workspace Settings
- Set the workspace path where results will be saved
- Optionally provide a custom session name
Enable Workflow Stages
- Ideation: Always enabled - generates research ideas
- Data Analysis: Optional - analyzes your data and finds relevant papers/datasets
- Experiment Execution: Optional - generates and executes experimental code
Advanced Settings
- Adjust recursion limits for each agent
- Configure revision limits for experiments

Understanding the Output

Phase 1: Research Ideation (Always runs)

Literature Search: Searches for relevant papers
Paper Analysis: Analyzes found papers for insights
Idea Generation: Generates novel research directions
Novelty Check: Assesses novelty of generated ideas (0-10 score)
Ideation Report: Comprehensive research ideation summary

Outputs:

List of reviewed papers with abstracts
Generated research ideas
Novelty score and feedback
Research ideation summary

Phase 2: Data Analysis (Optional)

Data Exploration: Analyzes structure and characteristics of your data
Paper Subagent: Searches for relevant papers, datasets, and metrics
Summary Generation: Creates comprehensive data analysis report

Outputs:

Data structure analysis
Statistical summaries
Found papers (with relevance scores)
Found datasets
Found evaluation metrics
data_analysis.md file in workspace

Phase 3: Experiment Execution (Optional)

Coding Subagent: Generates experimental code
Execution Subagent: Runs the generated code
Summary Subagent: Analyzes results and metrics
Revision Loop: Iterates to improve experiments (up to max_revisions)

Outputs:

Generated code
Execution logs
Metrics and results
Revision summaries
Final experiment summary

Viewing Results

After workflow completion, results are displayed in organized tabs:

Summary: Complete workflow summary
Ideation: Research ideas, novelty assessment, papers reviewed
Data Analysis: Data insights, found papers/datasets/metrics
Experiments: Execution results, code, metrics from all revisions
Raw Data: Complete workflow state information

Saving Results

Click the "💾 Save Summary" button to save the complete workflow summary to:

<workspace_path>/workflow_summary.md

File Structure

streamlit-client/
├── app.py                    # Basic Streamlit interface
├── app_enhanced.py           # Enhanced interface with real-time progress
├── display_components.py     # Reusable UI components for displaying results
├── workflow_monitor.py       # Progress monitoring and callback system
├── requirements.txt          # Python dependencies
└── README.md                # This file

Configuration Options

Workspace Settings

workspace_path: Directory where all outputs are saved
session_name: Custom name for the session (auto-generated if not provided)

Data Workflow Settings

data_path: Path to your data file (CSV, JSON, etc.)
data_desc: Additional description or context about your data

Experiment Workflow Settings

repo_source: Local path or git URL to code repository
max_revisions: Maximum number of revision loops (1-10)

Advanced Settings

ideation_agent_recursion_limit: Max steps for ideation agent (default: 50)
data_agent_recursion_limit: Max steps for data agent (default: 100)
experiment_agent_recursion_limit: Max steps for experiment agent (default: 100)

Example Workflows

1. Research Ideation Only

Research Query: "self-supervised learning for protein structure prediction"
Research Domain: "computational biology"
Workspace: "./workspace/protein_research"

Enable Data Analysis: ☐
Enable Experiment Execution: ☐

This will:

Search literature on the topic
Analyze relevant papers
Generate novel research ideas
Assess novelty of ideas

2. Full Pipeline: Ideation + Data + Experiments

Research Query: "improve time series forecasting accuracy"
Research Domain: "machine learning"
Workspace: "./workspace/timeseries_exp"
Data Path: "./data/stock_prices.csv"

Enable Data Analysis: ☑
Enable Experiment Execution: ☑
Max Revisions: 5

This will:

Generate research ideas for improvement
Analyze your time series data
Search for relevant papers and benchmarks
Generate forecasting code
Execute experiments
Iterate to improve results

3. Data Analysis + Experiments (Skip Ideation for established research)

While ideation always runs, you can focus on data and experiments by providing a specific, well-defined query:

Research Query: "train LSTM model on this data with standard metrics"
Data Path: "./data/sensor_data.csv"

Enable Data Analysis: ☑
Enable Experiment Execution: ☑

Troubleshooting

Streamlit Not Found

pip install streamlit

Import Errors

Ensure you're running from the correct directory and that the parent SciDER package is accessible:

cd streamlit-client
export PYTHONPATH=..:$PYTHONPATH
streamlit run app_enhanced.py

API Key Errors

Check that your .env file in the parent directory contains valid API keys:

cat ../.env | grep API_KEY

Workflow Hangs

Check recursion limits - increase if the agent needs more steps
Monitor logs in terminal for errors
Check that data file path is correct and accessible

Port Already in Use

If port 8501 is already in use:

streamlit run app_enhanced.py --server.port 8502

Tips

Start Simple: Begin with ideation only to understand the workflow
Iterate: Use the revision system for experiments to improve results
Save Sessions: Use custom session names for important research
Review Intermediate Outputs: Check paper searches and data analysis before experiments
Monitor Progress: Watch the real-time updates to understand agent behavior

Advanced Usage

Custom Workflow Monitoring

You can extend workflow_monitor.py to add custom callbacks:

from workflow_monitor import get_monitor, PhaseType

monitor = get_monitor()

def my_callback(update):
    print(f"Phase: {update.phase}, Status: {update.status}")
    # Your custom logic here

monitor.add_callback(my_callback)

Integrating with Other Tools

The workflow results can be exported and used with other analysis tools:

from scievo.workflows.full_workflow_with_ideation import run_full_workflow_with_ideation

result = run_full_workflow_with_ideation(
    user_query="your query",
    workspace_path="./workspace",
)

# Access structured outputs
print(result.ideation_papers)
print(result.novelty_score)
print(result.data_summary)

Contributing

To improve the interface:

Add new visualization components in display_components.py
Enhance progress tracking in workflow_monitor.py
Improve the UI in app_enhanced.py

License

Same as parent SciDER project.