InfoBayAI/CT-Scan-Radiology-Reports-Without-Findings-Dataset Viewer • Updated 3 days ago • 2.6k • 30 • 2
InfoBayAI/CT-Scan-Radiology-Reports-With-Findings-Dataset Viewer • Updated 3 days ago • 6.3k • 22 • 2
InfoBayAI/Xray-OPG-Dental-Radiology-Reports-Without-Findings-Dataset Viewer • Updated 1 day ago • 5 • 28 • 2
InfoBayAI/Audio-to-Sentiment_Intelligence_Model Audio-Text-to-Text • 67M • Updated 18 days ago • 21 • 5
STEM & Non-STEM Q&A Datasets for LLM Training Collection Sample datasets from a 6.5M+ enterprise-grade Q&A corpus across STEM and Non-STEM domains, built for LLM training, instruction tuning, and evaluation. • 6 items • Updated about 12 hours ago • 1
Academic Textbook Corpora for LLM Training Collection Sample of a 2.6+ word textbook corpus across 39K+ books, 5K+ subjects, and 15 languages for LLM training and multilingual knowledge modeling. • 22 items • Updated about 12 hours ago • 1
Podcast Speech & Conversational Audio Datasets Collection Sample from a podcast audio dataset, designed for ASR, speech recognition, and conversational AI training using diverse, real-world spoken content. • 12 items • Updated about 12 hours ago • 1
Dual Channel Global Customer-Agent Interaction Datasets Collection Sample Datasets of dual-channel call center audio with separate agent and customer channels for ASR, diarization, and conversational AI training. • 24 items • Updated about 12 hours ago • 1
Healthcare AI Datasets for Clinical & LLM Training Collection Sample dataset from an enterprise-grade medical corpus built for clinical AI, diagnosis support, and healthcare LLM training. • 18 items • Updated about 12 hours ago • 1
Computer Vision & Multimodal Datasets Collection Sample dataset from multilingual image corpus covering medical, STEM, Non-STEM, automobile, and complex domains for computer vision and multimodal AI. • 18 items • Updated about 12 hours ago • 1