Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
12
3
21
Min Si Thu
jojo-ai-mst
Follow
suyeehlaing's profile picture
lwinmoe1997's profile picture
thantsan's profile picture
24 followers
·
13 following
minsithu
min-si-thu
AI & ML interests
Computer Vision NLP Audio
Recent Activity
posted
an
update
22 days ago
🇲🇲 Releasing the Myanmar Tuberculosis Instruction Dataset — a Myanmar–English parallel corpus for medical NLP in one of the lowest-resourced language settings in Southeast Asia. Most TB datasets are either structured clinical data or English-only research corpora. This one fills a different gap: instructional, guideline-based content in Burmese, formatted for instruction tuning and medical QA. ### What's inside - 2,043 instruction–response pairs - Myanmar–English parallel - 7 TB domains: treatment, diagnostics, drug management, MDR-TB, infection control, patient education, healthcare worker training - Sourced from WHO guidelines, Myanmar NTP protocols, and standard medical references - MIT licensed Useful for - Fine-tuning Myanmar-language medical LLMs - TB question answering - Translation evaluation in a medical domain - General low-resource medical NLP ``` from datasets import load_dataset ds = load_dataset("jojo-ai-mst/Myanmar-Tuberculosis-Guidelines-Instructions") ``` 👉 https://huggingface.co/datasets/jojo-ai-mst/Myanmar-Tuberculosis-Guidelines-Instructions Built by Min Si Thu and Khin Myat Noe. Feedback welcome — especially from anyone working on SEA medical AI or Burmese NLP. #MedicalAI #LowResourceNLP #Myanmar #Burmese #Tuberculosis #InstructionTuning
updated
a collection
22 days ago
Burmese Language Datasets
updated
a dataset
22 days ago
jojo-ai-mst/Myanmar-Tuberculosis-Guidelines-Instructions
View all activity
Organizations
jojo-ai-mst
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
2 datasets
over 1 year ago
uisp/tripitaka-siamrath
Viewer
•
Updated
Dec 14, 2024
•
476k
•
1.55k
•
8
saillab/alpaca-sanskrit-cleaned
Viewer
•
Updated
Sep 20, 2024
•
52k
•
102
•
1
liked
a dataset
almost 2 years ago
lmms-lab/OK-VQA
Viewer
•
Updated
Mar 9, 2024
•
5.05k
•
3.46k
•
8
liked
a model
almost 2 years ago
jinaai/jina-clip-v1
Feature Extraction
•
0.2B
•
Updated
Apr 8
•
74.4k
•
256
liked
a dataset
almost 2 years ago
liuhaotian/LLaVA-Instruct-150K
Preview
•
Updated
Jan 3, 2024
•
6.87k
•
598
liked
a model
almost 2 years ago
suno/bark
Text-to-Speech
•
Updated
Oct 4, 2023
•
19.3k
•
1.52k
liked
a dataset
about 2 years ago
jxu124/OpenX-Embodiment
Updated
Oct 16, 2024
•
25.8k
•
103
liked
13 models
over 2 years ago
facebook/wav2vec2-base-960h
Automatic Speech Recognition
•
94.4M
•
Updated
Nov 14, 2022
•
1.2M
•
397
SeaLLMs/SeaLLM-7B-v1
Text Generation
•
7B
•
Updated
Apr 15, 2024
•
24
cagliostrolab/animagine-xl-3.0
Text-to-Image
•
Updated
Dec 22, 2025
•
117k
•
776
Xenova/nllb-200-distilled-600M
Translation
•
Updated
Mar 23
•
4.72k
•
52
HuggingFaceH4/zephyr-7b-alpha
Text Generation
•
7B
•
Updated
Oct 16, 2024
•
4.11k
•
•
1.12k
TinyLlama/TinyLlama-1.1B-Chat-v1.0
Text Generation
•
1B
•
Updated
Mar 17, 2024
•
2.8M
•
•
1.59k
lmsys/fastchat-t5-3b-v1.0
Updated
Jun 29, 2023
•
176
•
369
microsoft/phi-2
Text Generation
•
3B
•
Updated
Dec 8, 2025
•
525k
•
•
3.45k
Locutusque/TinyMistral-248M-Instruct
Text Generation
•
0.2B
•
Updated
Dec 17, 2023
•
838
•
12
Felladrin/onnx-TinyMistral-248M
Text Generation
•
Updated
Jan 7, 2024
•
11
•
7
mistralai/Mistral-7B-Instruct-v0.1
Text Generation
•
Updated
Jul 24, 2025
•
342k
•
•
1.83k
EleutherAI/pythia-14m-deduped
Text Generation
•
39.2M
•
Updated
Feb 12
•
15.4k
•
29
EleutherAI/gpt-neo-125m
Text Generation
•
0.2B
•
Updated
Jan 31, 2024
•
455k
•
229
Load more