---
title: AI Writing Studio
emoji: ✍️
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.0.0
app_file: app.py
pinned: false
license: mit
short_description: AI writing revision with FLAN-T5 and rubric scoring
tags:
  - education
  - writing
  - nlp
  - text2text-generation
  - instruction-following
  - analysis
suggested_hardware: cpu-basic
suggested_storage: small
---

Writing Studio - HuggingFace Spaces Edition

Production-grade AI Writing Studio powered by FLAN-T5 for intelligent text revision.

About

AI Writing Studio is a production-grade educational writing assistant that provides real AI-powered text revision using instruction-following models:

  • 🤖 AI-Powered Revision using FLAN-T5 (instruction-tuned for text revision)
  • 📊 Real Rubric Scoring across 5 criteria (Clarity, Conciseness, Organization, Evidence, Grammar)
  • 🔍 Visual Diff Highlighting to see exactly what changed
  • 📝 5 Specialized Modes (General, Literature, Tech Comm, Academic, Creative)

🆕 What's New: FLAN-T5 Integration

Major Update: Replaced GPT-2 with FLAN-T5 for real AI-powered text revision.

What Changed:

  • FLAN-T5 now default model (instruction-following, actually revises text)
  • GPT-2 removed (only continues text, doesn't revise)
  • 🎯 Instruction-optimized prompts for better revision quality
  • 🚀 Automatic model detection (supports both T5 and GPT-2 pipelines)

Why This Matters: GPT-2 couldn't revise text—it only continued it with unrelated content. FLAN-T5 understands revision instructions and produces genuine improvements to your writing.

Trade-off: First load is ~60s instead of ~30s, but you get actual AI revision instead of gibberish!

Quick Start

  1. Open the app on HuggingFace Spaces
  2. Paste text (200-500 words recommended for first try)
  3. Choose revision mode (try "General" first)
  4. Click "✨ Revise & Analyze"
  5. Wait ~60s for first analysis (model loading)
  6. Compare original vs AI-revised text
  7. Review rubric scores and highlighted changes

Features

✨ AI-Powered Revision with FLAN-T5

Why FLAN-T5? FLAN-T5 is an instruction-tuned model specifically trained to follow revision instructions. Unlike GPT-2 (which only continues text), FLAN-T5 actually understands and executes revision tasks like:

  • Improving clarity and readability
  • Enhancing academic tone
  • Strengthening evidence and support
  • Refining technical precision
  • Enriching creative imagery

Real Text Revision: The AI doesn't just continue your text—it genuinely revises it based on the selected mode.

📊 Real Rubric Analysis

Unlike simple prototypes, this version includes actual analysis algorithms:

  • Clarity: Analyzes sentence length, complexity, and structure
  • Conciseness: Detects wordy phrases and redundancy
  • Organization: Checks paragraph structure and transitions
  • Evidence: Looks for supporting examples and data
  • Grammar: Basic error detection
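The idea behind the first two criteria can be sketched with simple heuristics. This is an illustrative sketch only — the actual `rubric_service.py` may use different phrase lists, baselines, and weights:

```python
import re

# Hypothetical wordy-phrase list; the real rubric uses its own.
WORDY_PHRASES = ["in order to", "due to the fact that", "at this point in time"]

def score_clarity(text: str) -> float:
    """Score 0-10: shorter average sentence length reads as clearer."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    if not sentences:
        return 0.0
    avg_words = sum(len(s.split()) for s in sentences) / len(sentences)
    # Deduct one point for every 5 words beyond a 15-word baseline.
    return max(0.0, min(10.0, 10.0 - max(0.0, avg_words - 15) / 5))

def score_conciseness(text: str) -> float:
    """Score 0-10: deduct a point per wordy phrase detected."""
    hits = sum(text.lower().count(p) for p in WORDY_PHRASES)
    return max(0.0, 10.0 - hits)
```

Each criterion returns a bounded numeric score, so the five results can be presented side by side in the rubric panel.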

📝 5 Specialized Revision Modes

Choose from instruction-tuned templates optimized for FLAN-T5:

  • General: Improve clarity and readability for everyday writing
  • Literature: Strengthen literary analysis with better evidence and terminology
  • Tech Comm: Enhance technical precision and professional tone
  • Academic: Improve formal tone, organization, and scholarly voice
  • Creative: Enhance imagery, voice, and reader engagement

🔍 Visual Diff Highlighting

See exactly what the AI changed with side-by-side comparison and highlighted differences.
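A word-level diff like this can be built on the standard library's `difflib`. A minimal sketch, assuming inline `[-…-]` / `{+…+}` markers (the real `diff_service.py` may render HTML instead):

```python
import difflib

def highlight_diff(original: str, revised: str) -> str:
    """Mark deletions as [-word-] and insertions as {+word+}."""
    orig_words, rev_words = original.split(), revised.split()
    out = []
    sm = difflib.SequenceMatcher(None, orig_words, rev_words)
    for op, i1, i2, j1, j2 in sm.get_opcodes():
        if op == "equal":
            out.extend(orig_words[i1:i2])
        if op in ("replace", "delete"):
            out.append("[-" + " ".join(orig_words[i1:i2]) + "-]")
        if op in ("replace", "insert"):
            out.append("{+" + " ".join(rev_words[j1:j2]) + "+}")
    return " ".join(out)
```

For example, `highlight_diff("the cat sat", "the dog sat")` yields `"the [-cat-] {+dog+} sat"`.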

🏭 Production Quality

  • Comprehensive error handling
  • Input validation and sanitization
  • Structured logging
  • Intelligent caching for faster responses
  • Type-safe configuration with Pydantic
  • Automatic model type detection
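The validation step might look like the following sketch. The limits mirror the settings documented below; `validation.py` may enforce additional checks:

```python
MAX_TEXT_LENGTH = 10_000  # matches the MAX_TEXT_LENGTH setting

def validate_input(text: str) -> str:
    """Return sanitized text, or raise ValueError for unusable input."""
    cleaned = text.strip()
    if not cleaned:
        raise ValueError("Input text is empty.")
    if len(cleaned) > MAX_TEXT_LENGTH:
        raise ValueError(f"Input exceeds {MAX_TEXT_LENGTH} characters.")
    return cleaned
```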

Usage

  1. Paste your text in the input box (up to 10,000 characters)
  2. Choose a revision mode matching your writing context (General, Literature, Tech Comm, Academic, Creative)
  3. Click "✨ Revise & Analyze" to get AI revision + rubric feedback
  4. Review results: Compare original vs revised text, check rubric scores, view highlighted changes

Tips

  • First analysis takes ~60 seconds (FLAN-T5 model loading) - this is normal!
  • Subsequent analyses are much faster (~5-10s) thanks to caching
  • Start with shorter texts (200-500 words) for quicker results
  • Try different revision modes to see how the AI adapts its approach
  • Use the rubric feedback to understand what improved
  • The diff view shows exactly what changed and why

Models

Default: google/flan-t5-base

Why FLAN-T5? FLAN-T5 (Fine-tuned Language Net) is an instruction-following model from Google Research, specifically designed to understand and execute text revision tasks. This is fundamentally different from GPT-2 style models:

| Feature | FLAN-T5 (Current) | GPT-2 (Previous) |
|---|---|---|
| Task Type | Instruction following | Text continuation |
| Can Revise Text? | ✅ Yes | ❌ No (only continues) |
| Understands Instructions? | ✅ Yes | ❌ No |
| Works with Revision Modes? | ✅ Yes | ❌ No |
| Model Size | ~250M parameters | ~124M parameters |
| First Load Time | ~60s | ~30s |
| Quality | High (task-specific) | Low (off-task) |

FLAN-T5 Advantages:

  • ✅ Actually revises text (not just continuation)
  • ✅ Follows mode-specific instructions (General, Academic, etc.)
  • ✅ Produces contextually appropriate output
  • ✅ Understands the task at hand

Why Not GPT-2? GPT-2 and distilgpt2 are autoregressive text generators trained only to continue text. When given revision instructions, they ignore them and generate unrelated continuations. FLAN-T5 was explicitly trained on instruction-following tasks, making it ideal for text revision.

Alternative Models (Advanced)

You can change the model in the UI, but these require more resources:

google/flan-t5-large (780M params)

  • Better revision quality
  • Requires CPU upgrade or GPU
  • ~2-3 minutes first load

google/flan-t5-xl (3B params)

  • Best quality revisions
  • Requires T4 GPU on HF Spaces
  • ~5 minutes first load

Performance

Hardware Recommendations

Free Tier (CPU Basic) ⭐ Recommended

  • Works well with google/flan-t5-base
  • First load: ~60 seconds (model download + initialization)
  • Subsequent analyses: ~5-10 seconds
  • Perfect for educational use and demos

CPU Upgrade

  • Handles google/flan-t5-large comfortably
  • First load: ~2-3 minutes
  • Subsequent: ~10-15 seconds
  • Better revision quality

T4 GPU ⚡ Best Performance

  • Runs google/flan-t5-xl smoothly
  • First load: ~5 minutes
  • Subsequent: ~3-5 seconds
  • Highest quality revisions

FLAN-T5 vs GPT-2 Performance

FLAN-T5-base is a larger model than distilgpt2, but the quality difference is dramatic:

  • FLAN-T5: Slower but actually revises text correctly
  • GPT-2: Faster but produces unusable output (wrong task)

The extra 30 seconds of load time is worth it for functional AI revision!

Optimization

The app includes production-grade optimizations:

  • Model caching: Loaded once, reused for all requests
  • Result caching: Same input = instant cached response
  • Intelligent pipeline selection: Automatically uses correct pipeline for model type
  • Lazy loading: Services initialized only when needed
  • Efficient text processing: Minimizes unnecessary operations
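The result cache can be sketched as an LRU keyed on a hash of the input and mode. This is a minimal illustration; the app's actual cache (controlled by `ENABLE_CACHE` and `CACHE_MAX_SIZE` below) may differ:

```python
import hashlib
from collections import OrderedDict

class ResultCache:
    """LRU cache: same (text, mode) pair returns the stored revision."""

    def __init__(self, max_size: int = 100):
        self.max_size = max_size
        self._store: "OrderedDict[str, str]" = OrderedDict()

    @staticmethod
    def key(text: str, mode: str) -> str:
        return hashlib.sha256(f"{mode}:{text}".encode()).hexdigest()

    def get(self, text: str, mode: str):
        return self._store.get(self.key(text, mode))

    def put(self, text: str, mode: str, result: str) -> None:
        k = self.key(text, mode)
        self._store[k] = result
        self._store.move_to_end(k)
        if len(self._store) > self.max_size:
            self._store.popitem(last=False)  # evict the oldest entry
```

Because the key includes the mode, revising the same text in a different mode is treated as a new request.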

Configuration

The app works out-of-the-box with sensible defaults optimized for FLAN-T5. To customize, you can set environment variables in your HuggingFace Space settings.

Available Environment Variables

# Model Configuration
DEFAULT_MODEL=google/flan-t5-base  # HuggingFace model ID (use FLAN-T5 variants)
MAX_MODEL_LENGTH=512               # Maximum model input/output length
DEFAULT_MAX_LENGTH=512             # Default generation length

# Application Settings
ENVIRONMENT=production             # Runtime environment (development/staging/production)
LOG_LEVEL=INFO                     # Logging level (DEBUG/INFO/WARNING/ERROR)
LOG_FORMAT=text                    # Log format (json/text) - text is easier on HF Spaces
MAX_TEXT_LENGTH=10000              # Maximum input text length

# Performance
ENABLE_CACHE=true                  # Enable result caching
CACHE_MAX_SIZE=100                 # Maximum cache entries
ENABLE_METRICS=false               # Disable metrics server on HF Spaces

# Features
ENABLE_DIFF_HIGHLIGHTING=true      # Enable visual diff view
ENABLE_RUBRIC_SCORING=true         # Enable rubric analysis
ENABLE_PROMPT_PACKS=true           # Enable revision mode selection
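These variables could be read as in the stdlib sketch below. The app itself uses Pydantic settings, so the field names and class shape here are illustrative:

```python
import os
from dataclasses import dataclass

@dataclass
class Settings:
    """Illustrative settings reader mirroring the variables above."""
    default_model: str = os.getenv("DEFAULT_MODEL", "google/flan-t5-base")
    max_text_length: int = int(os.getenv("MAX_TEXT_LENGTH", "10000"))
    enable_cache: bool = os.getenv("ENABLE_CACHE", "true").lower() == "true"

settings = Settings()
```

Unset variables fall back to the FLAN-T5-friendly defaults shown, so the Space works without any configuration.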

Troubleshooting

"Out of Memory" Error

Problem: Space crashes or shows an OOM error.

Solutions:

  • ✅ Stick with google/flan-t5-base on free tier (works well)
  • ✅ Reduce input text length (try 200-500 words)
  • ✅ Upgrade to the CPU Upgrade tier for larger models
  • ❌ Don't try flan-t5-large or flan-t5-xl without GPU

Slow First Load (~60 seconds)

This is normal! FLAN-T5-base is ~250M parameters.

  • First analysis: ~60s (model download + initialization)
  • Subsequent: ~5-10s (model cached in memory)
  • If it times out: Refresh and try again (HF Spaces issue)

"Model Loading Failed"

Problem: Error during model initialization.

Solutions:

  • Check model name spelling (must be exact HuggingFace ID)
  • Ensure internet connectivity for model download
  • Try default: google/flan-t5-base
  • Check HF Spaces logs for specific error

AI Revision Doesn't Make Sense

Problem: Revision output is garbled or off-topic.

Solutions:

  • ✅ Make sure you're using FLAN-T5 (not GPT-2!)
  • ✅ Try a different revision mode (General, Academic, etc.)
  • ✅ Check input text is clear and well-formed
  • ✅ Try shorter input text (model has 512 token limit)
  • Remember: FLAN-T5 base is small; larger models (flan-t5-large) give better results

"Text Generation Failed"

Problem: Error during AI revision generation.

Solutions:

  • Input too long (try shorter text)
  • Model timeout (refresh and retry)
  • Check HF Spaces status (temporary service issue)

Privacy

  • Text processed in-memory only
  • Results cached temporarily for speed
  • No long-term storage on HF Spaces
  • No user tracking

Technical Details

How FLAN-T5 Integration Works

The app automatically detects model type and uses the appropriate pipeline:

For FLAN-T5 models (text2text-generation):

# Detects 't5' or 'flan' in model name
pipeline("text2text-generation", model="google/flan-t5-base")

For GPT-2 models (text-generation):

# Fallback for text continuation models
pipeline("text-generation", model="gpt2")
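The two cases above can be combined into one dispatch helper. A minimal sketch of the detection logic — the function name `pick_task` is illustrative, not necessarily what `model_service.py` uses:

```python
def pick_task(model_name: str) -> str:
    """Choose the transformers pipeline task from the model name."""
    name = model_name.lower()
    if "t5" in name or "flan" in name:
        return "text2text-generation"  # seq2seq: follows instructions
    return "text-generation"           # causal LM: only continues text
```

The returned task string would then be passed to `transformers.pipeline(...)` along with the model name.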

Instruction-Following Prompts: FLAN-T5 requires structured instruction format:

Revise the following text to improve clarity, conciseness, and readability.
Make it clear and easy to understand while maintaining the original meaning.

Text: [user input]

Revised text:

This format tells FLAN-T5 exactly what to do, resulting in actual revisions instead of text continuation.
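Building a prompt in that format is simple string templating. A sketch with two example modes — the real `prompt_service.py` templates and mode names may differ:

```python
# Illustrative per-mode instructions; the app defines five modes.
MODE_INSTRUCTIONS = {
    "general": "improve clarity, conciseness, and readability",
    "academic": "improve formal tone, organization, and scholarly voice",
}

def build_prompt(text: str, mode: str = "general") -> str:
    """Wrap user text in a FLAN-T5 revision instruction."""
    instruction = MODE_INSTRUCTIONS.get(mode, MODE_INSTRUCTIONS["general"])
    return (
        f"Revise the following text to {instruction}. "
        "Maintain the original meaning.\n\n"
        f"Text: {text}\n\nRevised text:"
    )
```

The trailing `Revised text:` cue matters: it signals to the seq2seq model that its output should be the revision itself, not a continuation of the input.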

Architecture

Production-Grade Layered Design:

src/writing_studio/
├── core/
│   ├── analyzer.py      # Main orchestrator
│   ├── config.py        # Pydantic settings (FLAN-T5 defaults)
│   └── exceptions.py    # Custom error types
├── services/
│   ├── model_service.py    # FLAN-T5 pipeline management
│   ├── prompt_service.py   # Instruction-following prompts
│   ├── rubric_service.py   # Rule-based scoring algorithms
│   └── diff_service.py     # Visual diff generation
├── utils/
│   ├── logging.py       # Structured logging
│   ├── validation.py    # Input sanitization
│   └── metrics.py       # Prometheus metrics
└── app.py               # HuggingFace Spaces entry point

Source Code

Full source code available at: GitHub Repository

Local Development

git clone https://github.com/yourusername/writing-studio
cd writing-studio
pip install -r requirements.txt
python app.py

Contributing

Contributions welcome! See GitHub for:

  • Full documentation
  • Development setup
  • Testing guidelines
  • Code quality standards

License

MIT License - See LICENSE file

Acknowledgments

Support

Need help?