readctrl / data /new_exp /evaluation_results.json
shahidul034's picture
Add files using upload-large-folder tool
903b1a4 verified
[
{
"doc_id": 6,
"true_label": "proficient_health_literacy",
"pred_label": "proficient_health_literacy",
"reasoning": "The text is a highly detailed clinical case report written in specialized medical language, requiring advanced knowledge of hematology, coagulation disorders, and obstetric medicine. It uses complex terminology such as \"acquired haemophilia A,\" \"activated partial thromboplastin time (aPTT),\" \"factor VIII inhibitor,\" \"Bethesda units/mL,\" \"FEIBA,\" \"activated recombinant factor VII (rFVIIa),\" \"mixing test,\" and \"immunosuppression with cyclophosphamide.\" The narrative includes precise laboratory values (e.g., Hb 6.4 g/dL, AST 52 U/L, aPTT 91.2 s), imaging findings (e.g., \"subcutaneous collection with oedema\"), and treatment protocols (e.g., \"prednisone 50 mg PO at breakfast plus 10 mg at lunch\"). The sentence structure is dense, with multiple clauses and technical descriptions, and relies heavily on abbreviations and acronyms without explanation (e.g., FEIBA, rFVIIa, ANA, C3/C4). The clinical reasoning is sophisticated, involving differential diagnosis (vasculitis vs. acquired haemophilia), interpretation of coagulation tests, and management of a rare postpartum condition. The context section provides expert-level background on pathophysiology and treatment guidelines. This level of detail and terminology is inaccessible to individuals without formal medical training or advanced health literacy."
},
{
"doc_id": 5,
"true_label": "proficient_health_literacy",
"pred_label": "proficient_health_literacy",
"reasoning": "The text is written in a highly technical medical style, using specialized terminology such as \"membranous nephropathy II,\" \"nephrotic syndrome,\" \"immunosuppressants,\" \"ground-glass opacities,\" \"(1\u21923)-\u03b2-D-glucan,\" \"CD4 and CD8 T-lymphocyte counts,\" \"noninvasive mechanical ventilation,\" \"meropenem,\" \"micafungin,\" and \"ganciclovir.\" These terms are not commonly understood by the general public and require advanced medical knowledge. The narrative includes complex clinical reasoning, sequential treatment escalation, and interpretation of laboratory and imaging findings in the context of immunosuppression. Sentences are long and densely packed with clinical details, such as drug names, dosing regimens, timing of interventions, and laboratory values. The use of abbreviations (e.g., RSV, CRP, ESR, ICU, i.v., bid) without full explanation further increases the difficulty for non-experts. The case report format, with emphasis on diagnostic challenges, treatment decisions, and pathophysiological interpretation, is typical of professional medical literature. The complexity and depth of content clearly indicate that the text is intended for healthcare professionals or individuals with advanced health literacy."
},
{
"doc_id": 1,
"true_label": "proficient_health_literacy",
"pred_label": "proficient_health_literacy",
"reasoning": "The text employs highly specialized medical terminology such as \"developmental venous anomaly (DVA),\" \"Medusa head sign,\" \"T1 pre- and post-contrast,\" \"FLAIR,\" \"diffusion and ADC mapping,\" \"venous cortico-medullary vascular structures,\" \"central hyphocaptation,\" \"perilesional edema,\" and \"inferior anastomotic vein.\" These terms are not commonly understood by the general public and require advanced knowledge of neuroimaging and neurology. The description of MRI sequences and their findings is technical and detailed, including specific imaging characteristics (e.g., hyperintensity on T2/FLAIR, absence of diffusion restriction) that are interpreted only by trained radiologists. The clinical reasoning connects pregnancy-related hypercoagulability to the risk of thrombosis in a typically benign vascular variant, a nuanced concept requiring understanding of both obstetrics and cerebrovascular physiology. The use of abbreviations (e.g., DVAs, Clexane, Levetiracetam) without explanation, along with complex sentence structures and a focus on pathophysiological mechanisms, indicates the text is intended for a medical audience. While the narrative is clinically coherent and well-structured, it is inaccessible to individuals without significant medical training or health literacy."
},
{
"doc_id": 1,
"true_label": "low_health_literacy",
"pred_label": "intermediate_health_literacy",
"reasoning": "The text uses medical terminology such as \"head MRI scan,\" \"Medusa head,\" \"veins in her brain,\" \"blood clot,\" and \"blood moving slowly,\" which are not commonly understood by the general public. While some terms like \"seizure\" and \"trouble speaking\" are familiar, the phrase \"Medusa head\" is a highly specific radiological term referring to a congenital venous malformation (typically a dural arteriovenous fistula or vein of Galen malformation) that is not part of everyday language. The description of blood flow dynamics (\"moving slowly closer to the main vein\") introduces physiological concepts that require medical knowledge to interpret. Although the overall sentence structure is relatively simple and the narrative is clear, the presence of specialized diagnostic terminology and anatomical concepts (e.g., \"outer part of this vein pattern\") limits accessibility to individuals without health literacy or medical training. The explanation is simplified compared to a full clinical report, but still assumes familiarity with brain imaging and vascular anatomy. This suggests the text is at an intermediate health literacy level\u2014understandable to many patients with some medical context, but not to those with low health literacy."
},
{
"doc_id": 3,
"true_label": "low_health_literacy",
"pred_label": "low_health_literacy",
"reasoning": "The text uses simple, clear language appropriate for a general audience. Medical terms such as \"C-section,\" \"routine scan,\" and \"heart lump\" are used without technical jargon and are explained contextually. Sentences are short and direct, with a chronological narrative that is easy to follow. The focus is on observable facts\u2014timing of events, baby\u2019s health milestones, and the stability of the heart lump\u2014without introducing complex medical explanations or abstract concepts. There is no use of abbreviations or acronyms without explanation. The information is presented in a way that emphasizes reassurance and normal development, which supports understanding by individuals with limited medical knowledge. The overall tone is reassuring and accessible, typical of patient-friendly health communication."
},
{
"doc_id": 3,
"true_label": "proficient_health_literacy",
"pred_label": "proficient_health_literacy",
"reasoning": "The text uses highly specialized obstetric and pediatric cardiology terminology such as \"gravida V para IV,\" \"LNMP,\" \"fetal echocardiography,\" \"cardiac rhabdomyoma,\" \"tuberous sclerosis complex (TSC),\" \"neurosonography,\" \"LVOT and RVOT anatomy,\" \"hyperechoic masses,\" and \"left ventricular inflow obstruction.\" It includes precise measurements (e.g., \"18.2 \u00d7 8.3 mm\") and clinical descriptors (e.g., \"well-circumscribed,\" \"no size increment\") that require medical training to interpret. The narrative follows a detailed, clinical case report format, integrating prenatal, perinatal, and postnatal data with diagnostic reasoning and long-term follow-up. The use of abbreviations (e.g., CBC, Apgar, TSC, LVOT) without full explanation, along with complex sentence structures and technical assessments (e.g., \"anthropometric and neurobehavioral development remained normal\"), indicates the content is intended for healthcare professionals. While some terms may be familiar to informed patients, the overall complexity, precision, and integration of clinical decision-making make it inaccessible to individuals without advanced health literacy or medical background."
},
{
"doc_id": 4,
"true_label": "low_health_literacy",
"pred_label": "low_health_literacy",
"reasoning": "The text uses simple, clear language appropriate for a general audience. Medical terms such as \"throat growths,\" \"voice box,\" \"windpipe,\" \"breathing tube,\" and \"chest scan\" are explained in everyday terms. The analogy \"like trying to breathe through a pinched straw\" effectively conveys the sensation of airway narrowing in a way that is easy to understand. Descriptions of procedures (e.g., \"placed a breathing tube through a small hole in his neck\") are straightforward and avoid technical jargon. The treatment is explained in accessible terms: \"one dose of a medicine called bevacizumab\" and \"breathing therapy.\" The outcome is stated simply: \"He got better. The problem has not come back during follow-up.\" The sentence structure is short and direct, with minimal complexity. Overall, the language is patient-centered, avoids unnecessary medical terminology, and focuses on clear, relatable explanations of symptoms, diagnosis, treatment, and outcome\u2014characteristics of low health literacy content."
},
{
"doc_id": 4,
"true_label": "proficient_health_literacy",
"pred_label": "proficient_health_literacy",
"reasoning": "The text employs highly specialized medical terminology such as \"recurrent respiratory papillomatosis (RRP),\" \"koilocytotic atypia,\" \"HPV,\" \"microlaryngoscopy,\" \"ventricular bands,\" \"anti-VEGF therapy,\" \"bevacizumab,\" \"PICU,\" \"subcostal retraction,\" \"vesicular breath sounds,\" and \"angiogenesis.\" It references complex diagnostic imaging (e.g., \"chest tomography,\" \"head and neck tomography\"), detailed clinical findings (e.g., \"worsening stridor,\" \"moderate dysphonia\"), and advanced treatment strategies (e.g., intravenous bevacizumab, tracheostomy, excision of papillomatous lesions). The narrative structure is dense, integrating clinical presentation, laboratory results, imaging findings, surgical intervention, histopathology, and follow-up outcomes in a concise, technical manner. The use of abbreviations (RRP, HPV, VEGF, PICU, CRP) without full explanation and the assumption of familiarity with pediatric airway diseases and off-label drug use indicate that the content is intended for a medically trained audience. While the case is presented in a clinical context, the language and complexity exceed the comprehension level of individuals without advanced health literacy or medical training."
},
{
"doc_id": 5,
"true_label": "low_health_literacy",
"pred_label": "intermediate_health_literacy",
"reasoning": "The text uses medical terminology such as \"kidney disease,\" \"steroids,\" \"immune system,\" \"antibody level (IgG),\" \"T cells (CD4 and CD8),\" \"RSV antibodies,\" \"frosted glass\" opacities, \"inflammation,\" and \"breathing failure.\" While some terms are common in clinical settings, they are presented without explanation, assuming prior knowledge of immunology and pulmonary medicine. The sentence structure is relatively simple and direct, with short sentences and clear cause-effect relationships. However, the integration of specialized terms like \"CD4 and CD8\" and \"IgG\" without definition makes the text difficult for individuals without a basic understanding of immunology or medicine. The mention of \"RSV antibodies\" and the interpretation of a positive test in context of a weakened immune system requires knowledge of virology and diagnostic testing. Although the overall narrative is accessible in tone and structure, the reliance on technical terms without simplification or context limits comprehension for those with low or intermediate health literacy. The text is likely understandable to individuals with intermediate to advanced health literacy, particularly those with some medical background or experience with chronic illness."
},
{
"doc_id": 6,
"true_label": "low_health_literacy",
"pred_label": "intermediate_health_literacy",
"reasoning": "The text uses medical terminology such as \"lower back pain,\" \"blood in her urine,\" \"bruise under the skin,\" \"blood-clotting tests,\" \"factor eight,\" and \"blocker against a clotting protein,\" which may be unfamiliar to individuals without a medical background. However, the language is relatively simple and concise, with short sentences and clear explanations. The narrative describes symptoms, clinical presentation, and diagnostic testing in a straightforward manner, avoiding overly complex jargon or technical details. Terms like \"factor eight\" are used without extensive elaboration, but their meaning is contextually clear (related to clotting). The explanation of the tests\u2014mixing blood with normal blood and testing for a blocker\u2014provides intuitive insight into the diagnostic process. While some medical concepts are present, they are explained in accessible language. The overall structure is logical and easy to follow, suggesting the text is designed for a general audience or patients, though it still requires some basic understanding of medical conditions. The complexity is moderate, falling between low and high health literacy."
},
{
"doc_id": 7,
"true_label": "intermediate_health_literacy",
"pred_label": "intermediate_health_literacy",
"reasoning": "The text describes a neonatal surgical emergency involving a newborn with a congenital scrotal swelling, diagnosed with intravaginal testicular torsion leading to nonviable testis. While the language is clinical, it uses accessible terminology for a general medical audience and avoids excessive jargon. Terms like \"cesarean section,\" \"transillumination,\" \"Doppler ultrasound,\" \"orchidectomy,\" and \"orchidopexy\" are standard in pediatric urology but are explained contextually. The narrative is concise, sequential, and focused on key clinical findings and decisions. The explanation of the procedure and rationale (e.g., removing a nonviable testis and fixing the contralateral one to prevent future torsion) is clear and logical. Although some medical terms are present, they are either common in pediatric care or their meaning is evident from context. The overall sentence structure is straightforward, with short to medium-length sentences, and the information is presented in a way that could be understood by patients or caregivers with intermediate health literacy, especially when supported by a clinician. The text does not require advanced medical training to grasp the essential clinical picture and urgency."
},
{
"doc_id": 7,
"true_label": "low_health_literacy",
"pred_label": "intermediate_health_literacy",
"reasoning": "The text uses medical terminology such as \"scrotum,\" \"testicle,\" \"C-section,\" \"ultrasound scan,\" \"blood flow,\" \"hernia,\" \"twisted,\" \"fluid,\" and \"surgery,\" but explains them in plain, accessible language. Complex terms are either self-explanatory in context or described with simple analogies (e.g., \"like a kinked hose that stops flow\"). The narrative is chronological and straightforward, focusing on observable symptoms, physical exam findings, diagnostic tests, and treatment steps. Sentences are short and direct, with minimal jargon. The use of descriptive language\u2014such as \"looked well,\" \"skin was pink and warm,\" \"not sore when touched\"\u2014helps convey the child\u2019s condition in a way that is understandable to non-medical readers. The explanation of the ultrasound result (\"no blood was flowing to it\") and the surgical intervention (\"removed the right testicle,\" \"fixed the left testicle\") is clear and avoids technical complexity. The overall structure and language are appropriate for a patient or parent, indicating that the content is designed for intermediate health literacy."
},
{
"doc_id": 7,
"true_label": "proficient_health_literacy",
"pred_label": "proficient_health_literacy",
"reasoning": "The text uses highly specialized medical terminology such as \"congenital right scrotal swelling,\" \"intravaginal testicular torsion,\" \"Doppler ultrasonography,\" \"heterogeneous hypoechoic echotexture,\" \"rete testis,\" \"transillumination,\" \"orchidectomy,\" and \"orchidopexy.\" These terms are not commonly understood by the general public and require medical training for accurate interpretation. The sentence structure is dense and technical, with precise clinical descriptions and diagnostic reasoning (e.g., \"absent intratesticular color Doppler flow correlates with nonviability\"). The narrative integrates imaging findings, surgical intervention, and pathophysiological context, all presented in a concise, professional tone typical of a pediatric surgical case report. The use of abbreviations (e.g., ED, CT, Doppler) and lack of plain-language explanations further increase the complexity. While the clinical scenario is urgent and the outcome is clear, the language and depth of detail are inaccessible to individuals without advanced health literacy or medical background."
},
{
"doc_id": 8,
"true_label": "low_health_literacy",
"pred_label": "intermediate_health_literacy",
"reasoning": "The text uses medical terminology such as \"blood in his pee\" (hematuria), \"swelling\" (edema), \"seizures,\" \"very high blood pressure\" (hypertensive crisis), \"C3\" (complement component 3), \"strep infection,\" \"kidney filters were inflamed\" (glomerulonephritis), \"PRES\" (posterior reversible encephalopathy syndrome), \"brain swelling,\" \"low blood level\" (anemia), and \"steroid medicines.\" While some terms like \"seizures\" or \"high blood pressure\" are familiar to the general public, others such as \"C3,\" \"PRES,\" \"glomerulonephritis,\" and \"complement component\" are highly specialized and require medical knowledge to fully understand. The explanation of pathophysiology\u2014such as immune system attacking red blood cells leading to severe anemia\u2014is complex and not commonly understood by laypersons. However, the overall narrative is structured in a clear, chronological, and simplified way, using plain language where possible (e.g., \"blood in his pee,\" \"brain was affected by the very high blood pressure\"). The sentence structure is relatively short and direct, and the progression of events is easy to follow. Although the content involves serious and complex conditions, the language is accessible to someone with intermediate health literacy, especially with some medical exposure. The text avoids excessive jargon, explains key concepts in simple terms, and focuses on the patient\u2019s journey, symptoms, and recovery in a way that supports understanding."
},
{
"doc_id": 8,
"true_label": "proficient_health_literacy",
"pred_label": "proficient_health_literacy",
"reasoning": "The text is a highly detailed clinical case report written in advanced medical language, targeting a specialized audience such as physicians, pediatric nephrologists, or intensive care specialists. It uses extensive medical jargon including \"macroscopic glomerular haematuria,\" \"nephritic syndrome,\" \"hypertensive emergency,\" \"status epilepticus,\" \"PRES (posterior reversible encephalopathy syndrome),\" \"vasogenic edema,\" \"direct Coombs test,\" \"hypocomplementemia,\" \"C3 25 mg/dL,\" \"amlodipine,\" \"sodium nitroprusside infusion,\" and \"24-hour proteinuria 36.6 mg/m2/h.\" The narrative includes complex diagnostic reasoning, such as differentiating between autoimmune hemolytic anemia and other causes of anemia, interpreting imaging findings (e.g., MRI showing symmetric T2/FLAIR signal changes), and managing multi-system involvement (renal, neurological, hematologic). Sentence structures are long and densely packed with clinical details, procedural steps, and laboratory values. Abbreviations (e.g., GNAPE, AKI, Hct, RBC, CSF, EEG, IV, ICU, BP, GCS) are used without full explanation, assuming prior knowledge. The integration of pathophysiology (e.g., complement depletion in post-streptococcal GN), treatment escalation (e.g., sodium nitroprusside titration), and monitoring parameters (e.g., 25% BP reduction per day) further demonstrates the high cognitive and medical literacy required to understand the content. This level of detail and terminology is not accessible to patients or individuals without advanced medical training."
},
{
"doc_id": 10,
"true_label": "intermediate_health_literacy",
"pred_label": "proficient_health_literacy",
"reasoning": "The text uses specialized ophthalmological terminology such as \"visual acuity,\" \"optic nerve head swelling,\" \"choroidal bulging,\" \"subretinal fluid,\" \"RPE corrugations,\" \"retrobulbar nodular mass,\" \"gadolinium enhancement,\" and \"nodular posterior scleritis.\" These terms are not commonly understood by the general public and require medical training to interpret. The description of clinical findings and imaging results is concise but assumes familiarity with eye anatomy, MRI interpretation, and inflammatory eye diseases. The management decision\u2014starting oral prednisolone\u2014relies on a specific diagnosis that is not explained in plain language. While the sentence structure is relatively straightforward compared to other medical reports, the use of technical terms and the lack of explanation for key concepts (e.g., what \"RPE corrugations\" or \"gadolinium enhancement\" means) make the text difficult for individuals without medical knowledge. The content is clearly intended for healthcare professionals, particularly ophthalmologists or specialists in autoimmune diseases, indicating a high level of health literacy is required."
},
{
"doc_id": 10,
"true_label": "low_health_literacy",
"pred_label": "intermediate_health_literacy",
"reasoning": "The text describes a medical case with clinical findings and diagnostic reasoning, but uses terminology that is moderately complex for a general audience. Terms such as \"vision loss,\" \"swelling of the seeing nerve,\" \"fluid under the retina,\" \"wrinkles in the thin lining,\" \"MRI scan with contrast dye,\" \"nodular posterior scleritis,\" and \"prednisolone\" are specific and may not be familiar to individuals without some health literacy. However, the language is relatively straightforward compared to other examples, with simpler sentence structures and fewer technical abbreviations. Key concepts are explained in context (e.g., \"the back of his left eye showed swelling of the seeing nerve\" clarifies \"optic nerve swelling\"). The narrative avoids excessive jargon and uses plain language to describe symptoms and findings, making it accessible to individuals with intermediate health literacy. While some terms require medical knowledge, the overall clarity and explanatory nature of the text suggest it is intended for a broader audience than just specialists. The level of detail is sufficient for understanding the diagnosis and treatment but not so technical as to require expert knowledge."
},
{
"doc_id": 10,
"true_label": "proficient_health_literacy",
"pred_label": "proficient_health_literacy",
"reasoning": "The text is written in a highly technical medical style, using specialized ophthalmologic and radiologic terminology such as \"best-corrected distance visual acuity (BCDVA),\" \"subretinal fluid (SRF),\" \"retinal pigment epithelial (RPE) corrugations,\" \"multimodal imaging,\" \"OCT,\" \"indocyanine green angiography (ICGA),\" \"blue-autofluorescence (BAF),\" \"fluorescein angiography (FA),\" \"retrobulbar nodular enhancing mass,\" and \"transconjunctival lateral orbitotomy.\" It references specific equipment (OptoVue, Inc., Fremont, CA, USA; software version 2018.0.0.18), imaging findings with precise descriptions (e.g., \"geographic macular hypocyanescent area,\" \"speckled hyperfluorescent margins\"), and complex diagnostic work-up including autoimmune and infectious disease testing (PPD, serum ACE, C-ANCA). The narrative follows a detailed clinical reasoning path involving differential diagnosis (malignancy vs. inflammatory disease), surgical intervention, and treatment with corticosteroids. Sentence structures are complex and densely packed with clinical data. The use of abbreviations without explanation (e.g., OS, OD, BAF, SRF, FA) assumes familiarity with standard ophthalmic notation. This level of detail and jargon is typical of a specialist medical case report, intended for ophthalmologists, neuro-ophthalmologists, or other healthcare professionals with advanced training. The text would be extremely difficult for patients or individuals without a medical background to understand."
},
{
"doc_id": 12,
"true_label": "low_health_literacy",
"pred_label": "low_health_literacy",
"reasoning": "The text uses simple, everyday language to describe a medical situation. Terms like \"burning when she peed,\" \"had to pee very often,\" \"tried many treatments,\" and \"kept very clean\" are common, non-technical expressions that most people can understand. The description of symptoms and diagnostic steps (e.g., \"camera test,\" \"tiny bits floating in the urine,\" \"live larva in her urine\") is presented in a straightforward way without medical jargon. The explanation of the cause\u2014finding a larva in the urine\u2014is clear and directly linked to the symptoms. The advice given\u2014\u201ckeep very clean\u201d and \u201cdrink plenty of water\u201d\u2014is simple, actionable, and uses plain language. The overall structure is chronological and narrative, with no complex terminology or convoluted sentence structures. The content is accessible to individuals with limited medical knowledge, suggesting that the health literacy level required is low."
},
{
"doc_id": 12,
"true_label": "proficient_health_literacy",
"pred_label": "proficient_health_literacy",
"reasoning": "The text uses complex medical terminology such as \"dysuria,\" \"dribbling,\" \"erythema and hyperemia of the bladder mucosa,\" \"intravesical debris,\" \"praziquantel,\" \"ivermectin,\" \"myiasis,\" \"Sarcophaga spp.,\" \"polyethylene glycol irrigation,\" and \"24-hour urine collection.\" These terms are highly specialized and not commonly understood by the general public. The narrative describes a prolonged diagnostic journey involving multiple specialists, imaging, laboratory tests, and treatments, with clinical reasoning based on subtle findings (e.g., a live larva identified in a 24-hour urine sample). The use of terms like \"myiasis\" (infestation by fly larvae) and \"Sarcophaga spp.\" (a genus of flies) requires knowledge of parasitology and entomology. The sentence structure is dense and sequential, typical of clinical case reports, with minimal plain-language explanations. The case hinges on rare and obscure diagnoses, which are unlikely to be familiar to patients or individuals without advanced medical training. The complexity, combined with the specialized terminology and diagnostic process, indicates that the text is intended for a medically trained audience."
},
{
"doc_id": 17,
"true_label": "intermediate_health_literacy",
"pred_label": "proficient_health_literacy",
"reasoning": "The text uses highly specialized medical and surgical terminology such as \"robot-assisted total proctocolectomy,\" \"complete mesocolic excision,\" \"total mesorectal excision,\" \"D3 lymph node dissection,\" \"central vessel ligation,\" \"ileal pouch\u2013anal anastomosis,\" and \"Hugo RAS system.\" These terms are specific to advanced colorectal surgery and require familiarity with minimally invasive robotic procedures, anatomical dissection techniques, and oncological surgical standards. The description of the procedure in three distinct steps involves precise positioning (Trendelenburg, flat), anatomical landmarks (hepatic flexure, superior mesenteric artery), and technical details (transanal specimen removal, small umbilical incision) that are inaccessible to individuals without medical or surgical training. The use of abbreviations (e.g., RAS, D3, CT) without explanation and the focus on procedural efficiency (632 minutes, minimal blood loss) further indicate a professional audience. The sentence structure is concise but dense with technical content, lacking plain language simplification. Overall, the text is clearly intended for healthcare professionals, particularly surgeons and specialists in gastrointestinal oncology, making it inappropriate for patients or individuals with low to intermediate health literacy."
}
]