---
title: NLP Toolkit
emoji: 🛠️
colorFrom: blue
colorTo: indigo
sdk: gradio
sdk_version: 6.14.0
app_file: app.py
pinned: false
license: mit
tags:
- nlp
- sentiment-analysis
- text-summarization
- translation
- gradio
---
# 🛠️ NLP Toolkit: Text Analysis Tools
A comprehensive AI-powered tool for text processing, featuring sentiment analysis, abstractive summarization, and English-to-Ukrainian translation in a single, unified interface.
## 🎯 Target Audience
This tool is designed for students, researchers, journalists, and anyone working with large volumes of English text who needs to quickly extract emotional tone, generate concise summaries, or translate content into Ukrainian.
## 🚀 How to Use
1. **Select a Tool**: Navigate between the "Sentiment Analysis", "Summarization", and "Translation" tabs at the top.
2. **Input Text**: Type or paste your English text into the input box. You can also click on any of the **Examples** at the bottom to see the tool in action instantly.
3. **Run**: Click the primary action button (e.g., "Analyze Sentiment") or press Enter to get the results.
## 🧠 Models Used
| Task | Model Name | Language |
|---|---|---|
| **Sentiment Analysis** | `distilbert-base-uncased-finetuned-sst-2-english` | English (EN) |
| **Summarization** | `facebook/bart-large-cnn` | English (EN) |
| **Translation** | `Helsinki-NLP/opus-mt-en-uk` | EN ➡️ UK |
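
As a rough illustration of how the checkpoints in the table could be loaded, here is a minimal sketch. It assumes the `transformers` library is installed and that the installed version still provides the `sentiment-analysis`, `summarization`, and `translation` pipeline tasks. The names `MODELS` and `load_pipeline` are hypothetical and are not taken from `app.py`.

```python
# Task-to-checkpoint mapping copied from the table above.
MODELS = {
    "sentiment-analysis": "distilbert-base-uncased-finetuned-sst-2-english",
    "summarization": "facebook/bart-large-cnn",
    "translation": "Helsinki-NLP/opus-mt-en-uk",
}


def load_pipeline(task: str):
    """Build a Hugging Face pipeline for one of the three tasks (hypothetical helper).

    The import is deferred so the mapping can be inspected
    even when transformers is not installed.
    """
    if task not in MODELS:
        raise ValueError(f"Unknown task: {task!r}")
    from transformers import pipeline  # assumed dependency

    return pipeline(task, model=MODELS[task])
```

Calling `load_pipeline("sentiment-analysis")("I love this!")` should return a list of dictionaries with `label` and `score` keys. The first call downloads the model weights, so it may take a while.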
## ⚠️ Limitations & Disclaimers
* **Language Support**: The Sentiment Analysis and Summarization models are trained exclusively on English data. Text in other languages will produce unreliable results.
* **Text Length**: For the Summarization module, text is automatically truncated to the first 500 words to ensure stability and prevent memory overflows. A warning will appear if truncation occurs.
* **Translation Accuracy**: While the translation model is robust, it may struggle with highly specific technical jargon, slang, or complex idiomatic expressions.
* **HTML Safety**: All inputs are sanitized; however, the tool is intended for standard prose text.
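
The truncation and sanitization behavior described above could look roughly like the following sketch. It is an assumption about one reasonable approach, not the code in `app.py`: the helper names `truncate_words` and `sanitize` are hypothetical, words are split on whitespace, and sanitization is shown as plain HTML escaping.

```python
import html

MAX_WORDS = 500  # limit stated in the Text Length note above


def truncate_words(text: str, max_words: int = MAX_WORDS) -> tuple[str, bool]:
    """Keep the first max_words whitespace-separated words.

    Returns the (possibly shortened) text and a flag that the
    caller can use to show a truncation warning.
    """
    words = text.split()
    if len(words) <= max_words:
        return text, False
    return " ".join(words[:max_words]), True


def sanitize(text: str) -> str:
    """Trim surrounding whitespace and escape HTML special characters."""
    return html.escape(text.strip())
```

A caller might run `clean = sanitize(raw)` first, then `clean, was_truncated = truncate_words(clean)`, and display a warning when `was_truncated` is true.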