|
|
--- |
|
|
language: bn |
|
|
language_bcp47: |
|
|
- bn |
|
|
- bn-IN |
|
|
- bn-BD |
|
|
license: apache-2.0 |
|
|
base_model: microsoft/DialoGPT-medium |
|
|
tags: |
|
|
- bengali |
|
|
- bangla |
|
|
- transformer |
|
|
- causal-lm |
|
|
- instruction-following |
|
|
- nlp |
|
|
- text-generation |
|
|
- conversational-ai |
|
|
- educational |
|
|
- general-knowledge |
|
|
model-index: |
|
|
- name: Sheikh Bengali AI |
|
|
results: [] |
|
|
--- |
|
|
|
|
|
# Sheikh Bengali AI Model |
|
|
|
|
|
## Model Description |
|
|
|
|
|
**Sheikh** is a Bengali (Bangla) language AI model trained for instruction following and conversational tasks. Built on top of Microsoft's DialoGPT-medium, this model has been fine-tuned with Bengali instruction-following data to understand and generate responses in Bengali language. |
|
|
|
|
|
## Model Details |
|
|
|
|
|
- **Model Type:** Language Model, Text Generation |
|
|
- **Architecture:** GPT-2 based (DialoGPT-medium) |
|
|
- **Base Model:** microsoft/DialoGPT-medium |
|
|
- **Parameters:** 355M |
|
|
- **Language:** Bengali (Bangla) |
|
|
- **Training Data:** Alpaca Bangla instruction dataset |
|
|
- **Model Size:** 1.4GB |
|
|
- **License:** Apache 2.0 |
|
|
|
|
|
## Intended Use |
|
|
|
|
|
This model is designed for: |
|
|
- Bengali language text generation |
|
|
- Instruction following and question answering |
|
|
- Educational content creation |
|
|
- Cultural and historical knowledge responses |
|
|
- General conversational AI in Bengali |
|
|
|
|
|
## Usage |
|
|
|
|
|
```python |
|
|
from transformers import AutoTokenizer, AutoModelForCausalLM |
|
|
|
|
|
# Load the model |
|
|
tokenizer = AutoTokenizer.from_pretrained("megharudushi/Sheikh") |
|
|
model = AutoModelForCausalLM.from_pretrained("megharudushi/Sheikh") |
|
|
|
|
|
# Generate Bengali response |
|
|
input_text = "বাংলাদেশের রাজধানী কী?" |
|
|
inputs = tokenizer.encode(input_text, return_tensors="pt") |
|
|
outputs = model.generate(inputs, max_length=150, temperature=0.8) |
|
|
response = tokenizer.decode(outputs[0], skip_special_tokens=True) |
|
|
print(response) |
|
|
``` |
|
|
|
|
|
### Example Prompts |
|
|
|
|
|
- Educational: "গণিতের মৌলিক নীতি বলুন" |
|
|
- Cultural: "বাংলা সাহিত্যের বিখ্যাত কবি কারা?" |
|
|
- General: "স্বাস্থ্যকর থাকার উপায় বলুন" |
|
|
- Historical: "বাংলাদেশের স্বাধীনতার ইতিহাস বর্ণনা করুন" |
|
|
|
|
|
## Model Performance |
|
|
|
|
|
- Supports Bengali language understanding and generation |
|
|
- Trained on Bengali instruction-following dataset |
|
|
- Optimized for educational and conversational contexts |
|
|
- Cultural knowledge preservation for Bengali language |
|
|
|
|
|
## Limitations |
|
|
|
|
|
- Trained primarily on Bengali instruction data |
|
|
- May have limitations in very specialized domains |
|
|
- Performance depends on input quality and clarity |
|
|
- Model size limited by computational resources |
|
|
|
|
|
## Training Details |
|
|
|
|
|
- **Base Model:** microsoft/DialoGPT-medium |
|
|
- **Fine-tuning Data:** Alpaca Bangla dataset |
|
|
- **Training Approach:** Instruction following |
|
|
- **Language Focus:** Bengali (Bangla) language |
|
|
|
|
|
## Citation |
|
|
|
|
|
If you use this model, please cite: |
|
|
|
|
|
```bibtex |
|
|
@misc{SheikhBengaliAI, |
|
|
title={Sheikh Bengali AI Model}, |
|
|
author={megharudushi}, |
|
|
year={2025}, |
|
|
url={https://huggingface.co/megharudushi/Sheikh}, |
|
|
note={Bengali language instruction-following model based on DialoGPT-medium} |
|
|
} |
|
|
``` |
|
|
|
|
|
## License |
|
|
|
|
|
This model is released under the Apache 2.0 License. |
|
|
|
|
|
## Contributing |
|
|
|
|
|
This model is part of the Bengali AI initiative to make Bengali language AI more accessible to the community. |
|
|
|
|
|
--- |
|
|
|
|
|
**Created:** December 21, 2025 |
|
|
**Repository:** https://huggingface.co/megharudushi/Sheikh |
|
|
**Base Model:** microsoft/DialoGPT-medium |
|
|
**Language:** Bengali (Bangla) |
|
|
|