Spaces:
Runtime error
Runtime error
File size: 7,009 Bytes
9e118e4 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 | # School Data Model
## Field Reference
| Field | Status | Notes |
|-------|--------|-------|
| `id` | β
Included | |
| `school` | β
Included | |
| `DBA` | β
Included | |
| `address` | β
Included | |
| `slim_address` | β Excluded | Unnecessary given `address` |
| `latitude` | β
Included | |
| `longitude` | β
Included | |
| `grade_span` | π Merged | Merged with `grade_range` and `grades_filter` into single `grade_min` to `grade_max` field |
| `grade_range` | π Merged | See `grade_span` |
| `grades_filter` | π Merged | See `grade_span` |
| `provider_type` | β
Included | |
| `surround_care` | β
Included | Filter only applicable to nonBPS schools; indicative of after/before school programs |
| `aftercare` | β Excluded | Subset of `surround_care` |
| `extended_day` | β Excluded | Empty field |
| `curriculum` | β
Included | Categorical enumeration; only applicable for nonBPS |
| `hours_of_operation` | β
Included | |
| `phone_number` | β
Included | |
| `email` | β
Included | |
| `website` | β
Included | |
| `bps_quality_tier` | β Excluded | Better to refer to state_report_card and school_quality_framework |
| `programs` | β Excluded | Only relevant for nonBPS; contains semantic data but avoiding RAG for nonBPS for now |
| `facility_features` | β Excluded | Empty for all schools |
| `school_size` | β Excluded | Empty for all schools |
| `sports` | π RAG | Added to RAG for BPS schools; 106 unique values make it impractical as a filter, but useful for semantic queries |
| `tuition` | β
Included | 92% fill rate for nonBPS (BPS are free); usable as filter with caveat for empty values |
| `free_upk_hours` | β Excluded | Strict subset of `UPK` boolean |
| `eligibility` | β Excluded | LLM should not make eligibility decisions; directs user to choice tool |
| `separate_application` | β Excluded | Links are defunct |
| `headstart` | β
Included | Filter for nonBPS; directs to [Boston ABCD Head Start](https://bostonabcd.org/service/head-start-childrens-services/) for eligibility |
| `accepts_ccfa` | β
Included | Useful for nonBPS financial assistance; directs to [CCFA info](https://www.mass.gov/child-care-financial-assistance) |
| `language_programming_text` | π RAG and π Merged | for BPS we use as RAG, for nonBPS we union with `dual_language` and `language_programming_filter` to create new boolean called `has_language_program`|
| `dual_language` | π Merged | See `language_programming_text` |
| `language_programming_filter` | π Merged | See `language_programming_text` |
| `stem_steam` | β Excluded | Unreliable to use as filter.. better to rely on RAG from school mission/academic statements |
| `vocational_technology` | β Excluded | Same reasoning as `stem_steam` |
| `early_college_dual_enrollment` | π RAG | Semantic description of eligible colleges included in RAG |
| `international_baccalaureate` | β
Included | Very specific; only 3 schools offer it β model reports directly |
| `advanced_placement` | β
Included | Acts as filter; similar reasoning to `international_baccalaureate`; retrieval includes text description of available classes |
| `arts` | β Excluded | Same reasoning as `stem_steam` |
| `clubs` | β Excluded | Empty for all schools |
| `specialized_education_programs` | π RAG | Semantic descriptions of SPED programs |
| `uniform` | β
Included | High fill rate for BPS (good filter); lower for nonBPS (include confidence caveat in LLM response) |
| `after_school_program` | π RAG | Semantic descriptions |
| `overview_mission_statement` | π RAG | Core identity of each school |
| `unique_features` | π RAG | What sets the school apart |
| `arts_rooms` | β Excluded | Empty across all schools |
| `athletic_field` | β Excluded | Empty across all schools |
| `cafeteria` | β Excluded | Empty across all schools |
| `gymnasium` | β Excluded | Empty across all schools |
| `library` | β Excluded | Empty across all schools |
| `music_room` | β Excluded | Empty across all schools |
| `outdoor_classrooms` | β Excluded | Empty across all schools |
| `playground` | β Excluded | Empty across all schools |
| `pool` | β Excluded | Empty across all schools |
| `science_lab` | β Excluded | Empty across all schools |
| `partners` | π RAG | Lists partner programs; good for semantic search |
| `partners_link` | β Excluded | Informational for schools, not users |
| `UPK` | β
Included | Good filter for BPS and nonBPS; potential for additional UPK, LLM should concede the public data is scarce in this regard|
| `BPS_eligibility` | β Excluded | Directing users to online choice tool |
| `ADA` | β
Included | Clean filter for BPS; fewer nonBPS list it β retrieval should note this caveat to LLM |
| `tiers_text` | β Excluded | Never used in raw data |
| `transportation` | β Excluded | Directs user to public school transportation system link |
| `prek_bps_connector` | β Excluded | Never used in raw data |
| `special_admission_school` | π Merged | Merged with `special_admission_filter` and `special_admission_link`; includes link for more info |
| `special_admission_filter` | π Merged | See `special_admission_school` |
| `special_admission_link` | π Merged | See `special_admission_school` |
| `school_quality_framework` | β
Included | With `state_report_card`, directs users to compare school performance |
| `school_quality_tier` | β Excluded | Unnecessary given `school_quality_framework` and `state_report_card` |
| `preview_session_1` | β Excluded | Outdated |
| `preview_session_2` | β Excluded | Outdated |
| `preview_session_3` | β Excluded | Outdated |
| `point_of_contact` | β
Included | Important for retrieval |
| `school_leader` | β
Included | Important for retrieval |
| `before_school_program` | π RAG | Semantic descriptions |
| `ada_description` | π RAG | Semantic descriptions |
| `uniform_policy` | β Excluded | Unnecessary given `uniform` filter |
| `extra_curriculars_text` | π RAG | Common user question; good semantic content |
| `innovation_pathways` | β Excluded | Unused in raw data |
| `specialized_education_filter` | β Excluded | Redundant with `specialized_education_programs` |
| `announcement_text` | β Excluded | Unused |
| `other_academic_programs` | β Excluded | Unused |
| `show_eligibility_check` | β Excluded | Unused |
| `BuildCare` | β
Included | Filter for nonBPS schools only |
| `family_engagement_opportunities` | π RAG | Text descriptions; important for users curious about family engagement |
| `CTE_Pathways_TXT` | π RAG | semantic descriptions of career options |
| `state_report_card` | β
Included | See `school_quality_framework` |
---
## Status Key
| Symbol | Meaning |
|--------|---------|
| β
Included | Used as a structured filter or direct retrieval field |
| β Excluded | Not included in the data model |
| π Merged | Combined with other fields |
| π RAG | Included in semantic RAG embeddings | |