Spaces:

davideuler
/

small-model-chatbot

Runtime error

App Files Files Community

small-model-chatbot / README.md

davideuler

main.py to app.py for HF

ba6e626 7 months ago

preview code

raw

history blame contribute delete

3.56 kB

	---
	title: Small Model Chatbot
	emoji: 😻
	colorFrom: indigo
	colorTo: green
	sdk: gradio
	sdk_version: 5.31.0
	app_file: app.py
	pinned: false
	license: mit
	short_description: Some small models chatbot
	---
	=======
	# Multi-Model Tiny Chatbot

	A lightweight, multi-model chat application featuring several small language models optimized for different tasks. Built with Gradio for an intuitive web interface and designed for local deployment.

	## 🌟 Features

	- Multiple Model Support: Choose from 4 specialized small language models
	- Lazy Loading: Models are loaded only when selected, optimizing memory usage
	- Real-time Chat Interface: Smooth conversational experience with Gradio
	- Lightweight: All models are under 200M parameters for fast inference
	- Local Deployment: Run entirely on your local machine

	## 🤖 Available Models

	### 1. SmolLM2 (135M Parameters)
	- Purpose: General conversation and instruction following
	- Architecture: HuggingFace SmolLM2-135M-Instruct
	- Best For: General Q&A, creative writing, coding help
	- Language: English

	### 2. NanoLM-25M (25M Parameters)
	- Purpose: Ultra-lightweight instruction following
	- Architecture: Mistral-based with chat template support
	- Best For: Quick responses, simple tasks, resource-constrained environments
	- Language: English

	### 3. NanoTranslator-S (9M Parameters)
	- Purpose: English to Chinese translation
	- Architecture: LLaMA-based translation model
	- Best For: Translating English text to Chinese
	- Language: English → Chinese

	### 4. NanoTranslator-XL (78M Parameters)
	- Purpose: Enhanced English to Chinese translation
	- Architecture: LLaMA-based with improved accuracy
	- Best For: High-quality English to Chinese translation
	- Language: English → Chinese

	## 🚀 Quick Start

	### Prerequisites

	- Python 3.8 or higher
	- 4GB+ RAM recommended
	- Internet connection for initial model downloads

	### Installation

	1. Run the application
	```bash
	uv run app.py
	```

	2. Open your browser
	- Navigate to `http://localhost:7860`
	- Select a model and start chatting!


	## 🎯 Use Cases

	### General Conversation
	- Use SmolLM2 or NanoLM-25M for general chat, Q&A, and assistance

	### Translation Tasks
	- Use NanoTranslator-S for quick English→Chinese translations
	- Use NanoTranslator-XL for higher quality English→Chinese translations

	### Resource-Constrained Environments
	- NanoLM-25M (25M params) for ultra-lightweight deployment
	- NanoTranslator-S (9M params) for minimal translation needs

	## 💡 Model Performance

	\| Model \| Parameters \| Use Case \| Memory Usage \| Speed \|
	\|-------\|------------\|----------\|--------------\|-------\|
	\| SmolLM2 \| 135M \| General Chat \| ~500MB \| Fast \|
	\| NanoLM-25M \| 25M \| Lightweight Chat \| ~100MB \| Very Fast \|
	\| NanoTranslator-S \| 9M \| Quick Translation \| ~50MB \| Very Fast \|
	\| NanoTranslator-XL \| 78M \| Quality Translation \| ~300MB \| Fast \|



	### Model Sources
	- SmolLM2: `HuggingFaceTB/SmolLM2-135M-Instruct`
	- NanoLM-25M: `Mxode/NanoLM-25M-Instruct-v1.1`
	- NanoTranslator-S: `Mxode/NanoTranslator-S`
	- NanoTranslator-XL: `Mxode/NanoTranslator-XL`

	## 📝 License

	This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

	## 🙏 Acknowledgments

	- [HuggingFace](https://huggingface.co/) for the Transformers library and model hosting
	- [Mxode](https://huggingface.co/Mxode) for the Nano series models
	- [Gradio](https://gradio.app/) for the amazing web interface framework