File size: 2,367 Bytes
bfdd027
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
# Congress Public Records Slice

A neutral, review-oriented slice of House public-record linkages across financial disclosures, sector overlap, and community project funding recipient relationships.

## Key Counts

- Members: `438`
- Scored events: `3918`
- Public graph links: `7765`
- Recipient relationship links: `5367`
- Sector relationship links: `2398`
- Source artifacts in the public audit index: `48591`

## Required Caveats

- This release is a slice of public-record data, not a complete accounting of all potentially relevant data.
- Future releases may update or expand this slice as source recovery, parsing, and evidence linkage improve.
- This release does not assign guilt, wrongdoing, intent, or causality to any person or organization.
- The release shows public-record overlaps, timing, and linkage strength, not proof of illegality or corruption.
- Some rows remain review-tier or include unresolved official source references and should be read with those labels in mind.
- The public package includes verification summaries and SHA-backed artifact indexes, but it does not include the full internal raw corpus, so external verification is bounded by what is published here.

## Current Review Notes

- Recipient links still marked `needs_review`: `154`
- True parse failures still present in the source slice: `45`
- Source-unavailable rows still present in the source slice: `0`
- Public-facing source URLs are limited to stable artifact links; unresolved or unavailable refs remain represented by counts and labels.

## Included Public Files

- `members.csv`
- `scored_events.csv`
- `graph_links.csv`
- `recipient_link_quality_report.json`
- `source_quality_report.json`
- `provenance_coverage_report.json`
- `sample_cases.json`
- `network_graph/nodes.csv`
- `network_graph/edges.csv`
- `network_graph/graph_config.json`
- `evidence_audit/source_artifact_index.csv`
- `evidence_audit/scored_event_index.csv`
- `evidence_audit/scored_event_provenance.jsonl`
- `evidence_audit/claim_supporting_index.csv`
- `evidence_audit/claim_supporting_provenance.jsonl`
- `evidence_audit/consistency_report.json`

## Hugging Face Publishing Shape

- Dataset repo id: `cjc0013/cmp-data`
- Space repo id: `cjc0013/cmp`

This release is a slice of public-record data and may be updated in future releases.