---
displayed_sidebar: developersSidebar
sidebar_position: 1
---

import ZoomableMermaid from '@site/src/components/ZoomableMermaid';

# Data Model & Entity Relationship Diagram

Comprehensive overview of all data entities extracted, processed, and uploaded to HuggingFace datasets.

## HuggingFace Dataset Structure

### Current Datasets Being Uploaded

```
open-navigator-data/
├── jurisdictions/  # 🏛️ Core jurisdiction data
│   ├── cities  # 19,000+ incorporated places
│   ├── counties  # 3,144 U.S. counties
│   ├── states  # 50 states + DC, territories
│   ├── school_districts  # 13,000+ districts (NCES data)
│   └── census_data  # Basic FIPS codes & census year reference
│
├── demographics/  # 👥 Comprehensive demographic data (U.S. Census)
│   ├── population  # Total population, age distribution
│   ├── race_ethnicity  # Race and ethnicity breakdowns
│   ├── income_economics  # Income, poverty, SNAP benefits
│   ├── education  # Educational attainment levels
│   ├── housing  # Housing units, ownership, values
│   ├── employment  # Unemployment, labor force participation
│   └── health_insurance  # Insurance coverage (uninsured, Medicaid, Medicare)
│
├── social/  # 📱 Social media presence
│   ├── twitter  # Twitter/X accounts
│   ├── facebook  # Facebook pages
│   ├── instagram  # Instagram accounts
│   └── linkedin  # LinkedIn pages
│
├── videos/  # 📹 Video & streaming platforms
│   ├── youtube_channels  # Government YouTube channels
│   ├── vimeo  # Vimeo accounts
│   └── livestreams  # Live meeting streams
│
├── platforms/  # 🖥️ Meeting management systems
│   ├── legistar  # Legistar URLs
│   ├── granicus  # Granicus links
│   ├── suiteone  # SuiteOne systems
│   └── civicplus  # CivicPlus platforms
│
├── domains/  # 🌐 Official government websites
│   ├── gsa_domains  # .gov domain registry
│   ├── municipal_websites  # City/county websites
│   └── state_portals  # State government sites
│
├── events/  # 📋 Meetings, Hearings & Public Events
│   ├── events  # Government meetings, public hearings, community forums, town halls
│   ├── event_participants  # Officials and organizations participating in events
│   ├── event_agenda_items  # Individual agenda topics discussed
│   ├── event_documents  # Agendas, minutes, presentations, handouts
│   ├── event_media  # Video recordings, livestreams, audio files
│   └── event_bills  # Bills discussed or considered at meetings
│
├── contacts/  # 👥 All People - Officials, Candidates, Donors, Constituents
│   ├── officials  # Elected and appointed officials (mayors, council members, legislators)
│   ├── official_roles  # Current and historical positions held
│   ├── official_contacts  # Email, phone, office addresses
│   ├── official_identifiers  # External IDs (Twitter, OpenStates, Ballotpedia)
│   ├── official_links  # Websites, social media profiles
│   ├── candidates  # Political candidates (House, Senate, President - FEC data)
│   ├── nonprofit_donors  # Nonprofit leadership political giving (FEC analysis)
│   └── constituents  # Donors, volunteers, members, beneficiaries (all people)
│
├── nonprofits/  # 🏢 Nonprofit organizations & churches
│   ├── irs_eobmf  # IRS EO-BMF bulk data (1.9M+ organizations) - PRIMARY SOURCE
│   ├── irs_nonprofits  # Legacy IRS 990 data (deprecated - use irs_eobmf)
│   ├── propublica_data  # ProPublica API (financials, NTEE codes)
│   ├── everyorg_data  # Every.org API (missions, causes, logos)
│   ├── nonprofit_990s  # Detailed Form 990 financials (yearly filings)
│   ├── congregations  # 🛐 Church & congregation data (ARDA, HIFLD, NCS)
│   ├── constituents  # 🤝 Donors, volunteers, members, beneficiaries (Microsoft CDM)
│   ├── donations  # 💝 Financial contributions and in-kind gifts (Microsoft CDM)
│   ├── campaigns  # 📣 Fundraising campaigns and appeals (Microsoft CDM)
│   ├── memberships  # 🎫 Member enrollment and renewals (Microsoft CDM)
│   ├── volunteer_activities  # 🙋 Volunteer hours and activities (Microsoft CDM)
│   ├── program_delivery  # 🎯 Programs and services delivered (Microsoft CDM)
│   └── program_outcomes  # 📊 Impact metrics and outcome measurements (Microsoft CDM)
│
├── grants/  # 💵 Grant funding transactions
│   ├── nonprofit_grants  # Grants to nonprofits (from 990 Schedule I)
│   ├── government_grants  # Government grants to orgs/jurisdictions
│   ├── foundation_grants  # Private foundation grants
│   └── federal_grants  # Federal funding programs
│
├── causes/  # 🎯 Cause & category taxonomy
│   ├── ntee_codes  # IRS NTEE classification system
│   └── everyorg_causes  # Every.org cause tags
│
├── budgets/  # 💰 Government budgets & finances
│   ├── city_budgets  # City/municipal budgets & spending
│   ├── county_budgets  # County budgets & expenditures
│   ├── state_budgets  # State government finances
│   ├── school_budgets  # School district finances (NCES F-33)
│   ├── bond_debt  # Municipal bonds & debt obligations
│   ├── budget_line_items  # 📊 Detailed budget categories & line items
│   └── budget_deltas  # 🔍 Budget-to-Minutes Delta analysis (political economy)
│
├── decisions/  # ⚖️ Policy decisions & political economy analysis
│   ├── policy_decisions  # Extracted decisions from meetings
│   ├── decision_frames  # Frame analysis (rhetoric patterns)
│   ├── decision_options  # Options considered & rejected
│   ├── decision_tradeoffs  # Tradeoffs discussed (cost vs benefit, etc.)
│   ├── stakeholder_positions  # 👥 Who spoke for/against (Influence Radar)
│   ├── decision_votes  # Detailed vote records per decision
│   ├── deferral_patterns  # 📅 Stalling detection: same topic tabled or deferred repeatedly (temporal analysis)
│   ├── deferral_instances  # Individual tabling events linked to patterns
│   └── keyword_density  # Quantitative indicators (grant/taxpayer/emergency)
│
├── elections/  # 📅 Election cycles & temporal analysis
│   ├── election_cycles  # Election dates & periods
│   └── election_influences  # Pre/post-election decision patterns
│
├── campaigns/  # 💰 Political campaign finance (FEC data)
│   ├── committees  # PACs, Super PACs, campaign committees
│   └── contributions  # Individual political contributions $200+
│
├── civic/  # 🗳️ Google Civic & Wikidata
│   ├── civic_divisions  # OCD divisions
│   ├── representatives  # From Google Civic API
│   ├── wikidata_entities  # Structured entities
│   └── dbpedia_resources  # Wikipedia infobox data
│
├── ballots/  # 🗳️ Ballot initiatives & referendums
│   ├── state_measures  # State propositions (fluoridation votes!)
│   ├── local_measures  # City/county ballot questions
│   └── election_results  # Historical voting outcomes
│
├── bills/  # 📜 Legislation & Lawmaking
│   ├── bills  # Bills and resolutions (1.5M+ from all 50 states)
│   ├── bill_actions  # Bill history (introduced, committee, floor vote, signed)
│   ├── bill_sponsorships  # Primary sponsors and co-sponsors
│   ├── bill_abstracts  # Bill summaries and descriptions
│   ├── bill_versions  # Different versions of bill text (introduced, amended, enrolled)
│   ├── bill_version_links  # Links to PDF/HTML bill text
│   ├── bill_documents  # Supporting documents (fiscal notes, amendments, analysis)
│   ├── bill_document_links  # Links to supporting documents
│   ├── bill_subjects  # Subject/topic tags per bill
│   ├── legislative_sessions  # Session identifiers (2023 Regular, 2024 Special, etc.)
│   ├── vote_events  # Roll call votes on bills
│   ├── vote_counts  # Vote tallies (yes, no, abstain, absent)
│   └── individual_votes  # Individual legislator votes (yes/no/abstain)
│
├── topics/  # 🎯 Advocacy causes & campaigns
│   ├── topic_definitions  # Validated survey questions from Roper Center
│   ├── survey_questions  # Public opinion question wording library
│   ├── jurisdiction_topics  # What each city is discussing
│   └── advocacy_alerts  # Opportunities for engagement
│
├── surveys/  # 📊 Public opinion research & polling data
│   ├── survey_providers  # Polling organizations (Gallup, Pew, Roper, etc.)
│   ├── survey_studies  # Individual survey studies/waves
│   ├── survey_variables  # Questions/items asked in surveys
│   ├── survey_responses  # Aggregate and individual response data
│   ├── ipoll_metadata  # Roper iPoll catalog metadata
│   └── survey_crosstabs  # Breakdowns by demographics, geography
│
├── factchecks/  # ✅ Fact-checking & claim verification
│   ├── claim_reviews  # Google Fact Check API (ClaimReview schema)
│   ├── politifact  # PolitiFact Truth-O-Meter ratings
│   ├── factcheck_org  # FactCheck.org verified claims
│   └── verified_claims  # Aggregated fact-check database
│
├── civic_tech/  # 💻 Open source projects & hackathons
│   ├── github_repositories  # Civic tech projects (GitHub API)
│   ├── project_metadata  # Code for America, USDR, Civic Tech Field Guide
│   ├── contributors  # Maintainers and core contributors
│   ├── project_issues  # Good first issues, contribution opportunities
│   ├── hackathons  # Civic hackathon events
│   ├── hackathon_projects  # Projects built at hackathons
│   ├── brigade_chapters  # Code for America brigade locations
│   └── project_funding  # GitHub Sponsors, grants, OpenCollective
│
├── community_solutions/  # 🌟 Community engagement & use cases
│   ├── engagement_spectrum  # Spectrum of Community Engagement to Ownership
│   ├── use_case_catalog  # Harvard Data-Smart City Solutions examples
│   ├── data_academies  # Brookings Institution training programs
│   ├── success_stories  # Real-world outcomes (Providence, Portland, Tempe)
│   ├── metric_templates  # Pre-built analytics for common challenges
│   └── workflow_guides  # Step-by-step community data workflows
│
├── analytics/  # 📊 Time dimensions & metric views
│   ├── date_dimension  # Date/time reference table (YYYY-MM-DD, day_of_week, fiscal_year)
│   ├── temporal_relationships  # Time-series joins for all entities
│   ├── metric_views  # Pre-computed analytics (advocacy, spending, nonprofit impact)
│   ├── aggregated_stats  # Monthly/quarterly/yearly rollups
│   └── dashboard_metrics  # Real-time dashboard data feeds
│
├── standards/  # 🌐 Schema.org, Popolo, CEDS, IATI exports
│   ├── schema_org_jsonld  # JSON-LD exports (Event, Person, Organization, Legislation, ClaimReview)
│   ├── popolo_exports  # Popolo-compliant JSON (Person, Organization, Membership, VoteEvent)
│   ├── ceds_aligned  # CEDS-compliant education data (Element IDs, Option Sets)
│   ├── ocd_divisions  # Open Civic Data division IDs
│   ├── iati_activities  # IATI Standard v2.03 XML (programs, grants, humanitarian aid)
│   └── rdf_triples  # RDF/Turtle semantic web exports
│
├── vocabulary/  # 🔧 OMOP-inspired concept & terminology (SYSTEM-INTERNAL)
│   ├── concept  # Master concept table (cities, causes, officials)
│   ├── vocabulary  # Vocabulary sources (OCD_ID, IRS_NTEE, US_Census)
│   ├── concept_class  # Concept classifications (City, County, 501c3, Mayor)
│   ├── concept_relationship  # Relationships (City → County, Topic → Legislation)
│   └── domain  # Domain groupings (Jurisdiction, Nonprofit, Policy)
│
└── exports/  # 📤 API-ready formatted exports
    ├── csv_bulk  # CSV downloads for all datasets
    ├── json_api  # REST API JSON responses
    ├── graphql_schema  # GraphQL schema definitions
    └── parquet_optimized  # Compressed Parquet (default format)
```

### Parquet File Naming Convention

**Rule:** Use underscores (`_`) consistently, NOT hyphens (`-`)

**Format:** `{category}_{subcategory}.parquet`

**Examples:**

```
✅ CORRECT (using underscores):
  jurisdictions_cities.parquet
  jurisdictions_counties.parquet
  jurisdictions_states.parquet
  jurisdictions_school_districts.parquet
  social_twitter.parquet
  social_facebook.parquet
  videos_youtube_channels.parquet
  events_events.parquet
  contacts_officials.parquet
  bills_bills.parquet
  nonprofits_organizations.parquet
  nonprofits_financials.parquet
  nonprofits_programs.parquet
  nonprofits_locations.parquet
  nonprofits_irs_eobmf.parquet
  nonprofits_constituents.parquet
  nonprofits_donations.parquet
  nonprofits_campaigns.parquet  # Nonprofit fundraising campaigns (NOT political)
  contacts_candidates.parquet  # Political candidates (FEC)
  contacts_nonprofit_donors.parquet  # Nonprofit leadership political giving (FEC analysis)
  contacts_constituents.parquet  # Donors, volunteers, members, beneficiaries
  campaigns_committees.parquet  # Political committees/PACs (FEC)
  campaigns_contributions.parquet  # Political contributions (FEC)
  nonprofits_memberships.parquet
  nonprofits_volunteer_activities.parquet
  nonprofits_program_delivery.parquet
  nonprofits_program_outcomes.parquet
  grants_federal_grants.parquet
  contacts_official_roles.parquet
  contacts_official_contacts.parquet
  contacts_official_identifiers.parquet
  contacts_official_links.parquet
  bills_bill_actions.parquet
  bills_bill_sponsorships.parquet
  bills_bill_abstracts.parquet
  bills_bill_versions.parquet
  bills_bill_version_links.parquet
  bills_bill_documents.parquet
  bills_bill_document_links.parquet
  bills_bill_subjects.parquet
  bills_legislative_sessions.parquet
  bills_vote_events.parquet
  bills_vote_counts.parquet
  bills_individual_votes.parquet
  events.parquet
  event_participants.parquet
  event_agenda_items.parquet
  event_documents.parquet
  event_media.parquet
  event_bills.parquet
  budgets_city_budgets.parquet
  surveys_national_polls.parquet
  surveys_roper_questions.parquet
  surveys_survey_providers.parquet
  surveys_survey_studies.parquet
  surveys_survey_variables.parquet
  surveys_survey_responses.parquet
  surveys_ipoll_metadata.parquet
  factchecks_claim_reviews.parquet
  factchecks_politifact.parquet
  analytics_date_dimension.parquet
  analytics_metric_views.parquet
  analytics_temporal_relationships.parquet
  standards_schema_org_jsonld.parquet
  standards_popolo_exports.parquet
  standards_ceds_aligned.parquet
  standards_iati_activities.parquet
  vocabulary_concept.parquet
  vocabulary_vocabulary.parquet
  vocabulary_concept_class.parquet
  vocabulary_concept_relationship.parquet

❌ INCORRECT (using hyphens):
  jurisdictions-cities.parquet
  social-twitter.parquet
  meetings-government-meetings.parquet
  surveys-national-polls.parquet
  factchecks-claim-reviews.parquet
  analytics-date-dimension.parquet
  standards-schema-org.parquet
```

**Why Underscores?**
- ✅ Python-friendly variable names (can use `data.jurisdictions_cities`)
- ✅ SQL-compatible column names
- ✅ Consistent with folder structure (`school_districts`, not `school-districts`)
- ✅ Better for programmatic access
- ✅ Avoids shell escaping issues

**Repository Name Exception:**
- HuggingFace repo: `CommunityOne/open-navigator-data` (hyphen is fine for URLs)
- File names inside repo: Use underscores (`jurisdictions_cities.parquet`)

## 🔄 Data Extraction Pipeline

### Phase 1: Discovery (Bronze Layer)

1. **Census Data** → Jurisdictions list
2. **GSA Domains** → Government websites
3. **NCES** → School districts with financial data (F-33 forms)
4. **IRS EO-BMF** → ALL 1.9M+ U.S. tax-exempt organizations (PRIMARY SOURCE)
5. **IRS TEOS** → Legacy nonprofit EINs (deprecated - use IRS EO-BMF)
6. **Census of Governments** → Municipal budgets & finances
7. **URL Discovery** → Meeting platforms, YouTube, budget PDFs
8. **Social Media** → Twitter, Facebook accounts

### Phase 2: Enrichment (Silver Layer)

1. **IRS EO-BMF** → Complete nonprofit registry with 28 data fields per organization
2. **ProPublica Nonprofit Explorer** → Enhanced financial data, detailed 990 filings
3. **Every.org API** → Nonprofit causes, missions, logos
4. **ARDA (Association of Religion Data Archives)** → Congregation characteristics, health ministries
5. **HIFLD Places of Worship** → Geospatial church locations (350K+ congregations)
6. **National Congregations Study** → Social service provision patterns
7. **NCES F-33 Finance Survey** → School district budgets, per-pupil spending
8. **Census Annual Survey** → State/local government finances
9. **Municipal Securities Rulemaking Board (EMMA)** → Bond debt data
10. **YouTube API** → Channel statistics
11. **Open States PostgreSQL Database** → Complete legislative data (~10 GB monthly dump)
    - **8,600+ people** (legislators, governors, mayors) across all 50 states + DC + Puerto Rico
    - **1.5M+ bills** with full text and history
    - **13M+ bill actions** (introduced, committee, amendments, floor votes, signed)
    - **7.2M+ sponsorships** (primary sponsors and co-sponsors)
    - **3.5M+ bill versions** (as introduced, committee substitute, enrolled, enacted)
    - **180K+ events** (legislative meetings, hearings, committee sessions)
    - **835K+ event participants** (who spoke, testified, or attended)
    - **524K+ agenda items** from meetings
    - **Vote events** with individual legislator positions
    - **Organizations** (legislative bodies, committees)
    - **Jurisdictions** (states, territories)
    - Updated monthly from https://data.openstates.org/postgres/monthly/
12. **OpenStates People Repository** → Current legislator contact info
    - GitHub repo: https://github.com/openstates/people
    - YAML files with email, phone, district offices
    - Social media profiles and website links
    - Updated daily via automated scrapers
13. **Wikidata SPARQL** → Entity relationships
14. **DBpedia** → Wikipedia structured data
15. **Google Civic** → Representatives
16. **OpenFEC API** → Political contributions, candidates, committees (campaign finance)
17. **GitHub API** → Civic tech projects, contributors, issues
18. **Civic Tech Field Guide** → Curated project taxonomy
19. **Code for America** → Brigade projects and hackathons
20. **Digital Public Goods Alliance** → DPG-certified open source projects

### Phase 3: Processing (Gold Layer)

1. **Meeting Extraction** → Agenda/minutes text
2. **Video Transcripts** → YouTube captions
3. **Document Analysis** → Keyword detection
4. **Relationship Mapping** → Entity connections
5. **Oral Health Filtering** → Topic classification
6. **Temporal Indexing** → Date dimension table, time-series relationships
7. **Metric View Creation** → Pre-computed analytics (advocacy activity, government spending, nonprofit impact)
8. **Schema.org JSON-LD** → Structured data exports (Event, Person, Organization, Legislation, ClaimReview)
9. **Popolo Compliance** → Open government standard exports (Person, Organization, Membership, VoteEvent)
10. **CEDS Alignment** → Education data mapping to NCES Element IDs and Option Sets

### New Dataset Categories Explained

#### 📊 Analytics Datasets

**Purpose:** Enable time-series analysis, trend detection, and dashboard metrics without complex SQL queries.
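
To make the date-dimension idea concrete, here is a minimal Python sketch of how rows of a calendar reference table might be derived. It is an illustration only, not the project's actual schema: `day_of_week` and `fiscal_year` come from the dataset tree, while the other column names, the `date_period` format, and the July 1 fiscal-year start are assumptions.

```python
from datetime import date, timedelta

# Assumption: fiscal years start on July 1 and are labeled by the calendar
# year in which they end. Real jurisdictions use different start months.
FISCAL_YEAR_START_MONTH = 7

# Explicit English names keep the output independent of the system locale.
DAY_NAMES = ("Monday", "Tuesday", "Wednesday", "Thursday",
             "Friday", "Saturday", "Sunday")


def date_dimension_row(d: date) -> dict:
    """Build one hypothetical date-dimension row for calendar date `d`."""
    quarter = (d.month - 1) // 3 + 1
    fiscal_year = d.year + 1 if d.month >= FISCAL_YEAR_START_MONTH else d.year
    return {
        "date": d.isoformat(),                   # YYYY-MM-DD
        "year": d.year,
        "quarter": quarter,
        "date_period": f"{d.year}-Q{quarter}",   # assumed period label format
        "day_of_week": DAY_NAMES[d.weekday()],   # weekday(): Monday == 0
        "fiscal_year": fiscal_year,
    }


def build_date_dimension(start: date, end: date) -> list:
    """Return one row per day from `start` through `end`, inclusive."""
    n_days = (end - start).days + 1
    return [date_dimension_row(start + timedelta(days=i)) for i in range(n_days)]


rows = build_date_dimension(date(2024, 1, 1), date(2024, 1, 31))
```

Precomputing a period label per day is what would let a query filter on a value such as `'2024-Q1'` instead of repeating date-range arithmetic in every join.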

| Dataset | Description | Refresh Frequency |
|---------|-------------|-------------------|
| `analytics_date_dimension` | Calendar reference table with fiscal years, quarters, day-of-week, holidays | Static (updated annually) |
| `analytics_temporal_relationships` | Pre-joined date keys for all time-based entities (meetings, votes, budgets, filings) | Daily |
| `analytics_metric_views` | Pre-computed analytics like advocacy_activity, government_spending, nonprofit_impact | Hourly |
| `analytics_aggregated_stats` | Monthly/quarterly/yearly rollups (meeting counts, budget totals, grant sums) | Daily |
| `analytics_dashboard_metrics` | Real-time feeds for dashboards (active meetings today, trending topics, hot ballot measures) | Every 5 minutes |

**Example Use Case:**

```sql
-- Instead of complex joins, use metric view:
SELECT * FROM analytics_metric_views
WHERE metric_name = 'advocacy_activity'
  AND jurisdiction_id = 'ocd-division/country:us/state:al/place:birmingham'
  AND date_period = '2024-Q1';
```

#### 🌐 Standards-Compliant Exports

**Purpose:** Maximum interoperability with civic tech platforms, search engines, and semantic web tools.

| Dataset | Standard | Use Case | Consumers |
|---------|----------|----------|-----------|
| `standards_schema_org_jsonld` | Schema.org JSON-LD | Google Search rich results, voice assistants | Google, Bing, Alexa, Siri |
| `standards_popolo_exports` | Popolo Project | Civic tech platform integration | mySociety, OpenNorth, Sunlight Foundation |
| `standards_ceds_aligned` | Common Education Data Standards | Education data exchange, NCES reporting | State education depts, Ed-Fi, IMS Global |
| `standards_ocd_divisions` | Open Civic Data IDs | Cross-platform jurisdiction referencing | Google Civic, Ballotpedia, Vote Smart |
| `standards_rdf_triples` | RDF/Turtle | Linked open data, knowledge graphs | DBpedia, Wikidata, SPARQL endpoints |

**Example Schema.org Export:**

```json
{
  "@context": "https://schema.org",
  "@type": "GovernmentOrganization",
  "name": "Birmingham City Council",
  "address": {
    "@type": "PostalAddress",
    "addressLocality": "Birmingham",
    "addressRegion": "AL"
  },
  "event": [{
    "@type": "Event",
    "name": "Regular City Council Meeting",
    "startDate": "2024-01-15T18:00:00-06:00"
  }]
}
```

#### ✅ Fact-Checking Datasets

**Purpose:** Verify claims made in meetings, legislation, and political speech.

| Dataset | Source | Fields | Update Frequency |
|---------|--------|--------|------------------|
| `factchecks_claim_reviews` | Google Fact Check API | claimReviewed, reviewRating, author, datePublished | Daily |
| `factchecks_politifact` | PolitiFact web scraping | claim, ruling, truth_o_meter, context | Daily |
| `factchecks_factcheck_org` | FactCheck.org API/scraping | claim, verdict, analysis, sources | Daily |
| `factchecks_verified_claims` | Aggregated + deduplicated | claim_text, consensus_rating, verification_sources | Daily |

**Integration with Meetings:**
- Cross-reference meeting transcripts with verified claims
- Flag unverified statements in legislative debates
- Track politician accuracy scores over time

## 📊 Complete Data Model (ERD)