Spaces:

SharvNey
/

AI_Text_Classification

Sleeping

App Files Files Community

AI_Text_Classification / README.md

SharvNey

Update README.md

efa406c verified 5 months ago

preview code

raw

history blame contribute delete

1.64 kB

A newer version of the Gradio SDK is available: 6.5.1

Upgrade

metadata

title: Capstone Project
emoji: ⚡
colorFrom: gray
colorTo: green
sdk: gradio
sdk_version: 5.45.0
app_file: app.py
pinned: false
short_description: AI vs Human text classifier

🤖 AI vs Human Text Classifier (RoBERTa)

This project fine-tunes RoBERTa to classify text as either:

🧑 Human-Written
🤖 AI-Generated

It was developed as a Capstone Project to explore the power of transformer-based models in detecting AI-generated content.

📌 Project Overview

With the rapid rise of LLMs like GPT and other AI text generators, distinguishing between human-written and AI-generated text is becoming crucial in education, research, and online authenticity.
This project leverages RoBERTa, a transformer-based model, to build a binary text classifier.

🛠️ Features

Fine-tuned RoBERTa-base model
Binary classification: Human (0) vs AI (1)
Deployed with Gradio for easy interaction
Model hosted on Hugging Face Model Hub

📂 Dataset

The dataset used in training contains two columns:

Text → the input text sample
Generated → label (0 = Human, 1 = AI)

🚀 Training

The model was fine-tuned on Google Colab using the Hugging Face transformers library.

Steps:

Load dataset (Text, Generated)
Preprocess using Hugging Face AutoTokenizer
Fine-tune RoBERTa with Trainer API
Evaluate using Accuracy, Precision, Recall, F1-score

📊 Results

Validation accuracy achieved: ~99%

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference