GodsDevProject commited on
Commit
0453a1f
Β·
verified Β·
1 Parent(s): b33ad40

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +132 -34
README.md CHANGED
@@ -4,76 +4,174 @@ emoji: πŸ›οΈ
4
  colorFrom: blue
5
  colorTo: indigo
6
  sdk: gradio
7
- sdk_version: 6.3.0
8
  app_file: app.py
9
- pinned: false
10
  tags:
11
  - foia
12
- - public-records
13
  - government-transparency
 
14
  - journalism
15
  - open-data
 
 
 
 
16
  ---
17
 
18
  # Federal FOIA Intelligence Search
19
 
20
  **Public Electronic Reading Rooms Only**
21
 
22
- This Hugging Face Space provides a federated search interface across
23
- **publicly available U.S. Government FOIA Electronic Reading Rooms**.
 
 
24
 
25
  ---
26
 
27
  ## πŸ” What This Space Does
28
- - Searches **only public FOIA repositories**
29
- - Uses agency-published search endpoints
30
- - Enforces robots.txt and rate limits
31
- - Provides semantic clustering and citation export
 
 
32
 
33
  ---
34
 
35
  ## πŸ›οΈ Live Public Sources
36
- - CIA FOIA Reading Room
37
- - FBI Vault
38
- - NSA FOIA Library
39
- - Department of Defense FOIA
40
- - National Reconnaissance Office (NRO) Releases
41
- - DOJ / DHS / State (metadata links)
 
 
 
 
 
 
 
 
 
 
 
42
 
43
  ---
44
 
45
- ## πŸ“‚ Hosted Public Collections
46
- Some historically named programs (e.g. **AATIP**, **SAP**, **Special Activities**)
47
- do **not** have independent FOIA search systems.
 
 
48
 
49
- When included, they are:
50
- - Clearly labeled as **Hosted Public Releases**
51
  - Linked to the **original publishing agency**
52
- - Restricted to already-released documents
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
53
 
54
  ---
55
 
56
  ## βš–οΈ What This Space Does NOT Do
 
57
  ❌ No classified access
58
  ❌ No scraping behind authentication
 
59
  ❌ No intelligence analysis or inference
60
- ❌ No automated FOIA submission
 
61
 
62
  ---
63
 
64
- ## 🧾 Intended Users
65
- - Journalists
66
- - Researchers
67
- - Legal professionals
68
- - Oversight and compliance teams
 
 
 
 
 
 
 
69
 
70
  ---
71
 
72
- ## πŸ“œ Legal Basis
73
- All content accessed via:
74
- - 5 U.S.C. Β§ 552 (FOIA)
75
- - Agency electronic reading rooms
76
- - Public government websites
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
77
 
78
- This Space operates fully within
79
- Hugging Face content and safety policies.
 
4
  colorFrom: blue
5
  colorTo: indigo
6
  sdk: gradio
7
+ sdk_version: 4.44.0
8
  app_file: app.py
9
+ pinned: true
10
  tags:
11
  - foia
 
12
  - government-transparency
13
+ - public-records
14
  - journalism
15
  - open-data
16
+ - legal-tech
17
+ - oversight
18
+ license: mit
19
+ short_description: 'FOIA INTELLIGENCE SEARCH '
20
  ---
21
 
22
  # Federal FOIA Intelligence Search
23
 
24
  **Public Electronic Reading Rooms Only**
25
 
26
+ This Hugging Face Space provides a federated search and analysis interface for
27
+ **publicly released U.S. Government records made available under the Freedom of Information Act (FOIA)**.
28
+
29
+ It is designed for **journalists, researchers, legal professionals, oversight bodies, and the public**.
30
 
31
  ---
32
 
33
  ## πŸ” What This Space Does
34
+
35
+ - Searches **only publicly accessible FOIA Electronic Reading Rooms**
36
+ - Uses **official agency search endpoints or public landing pages**
37
+ - Enforces **robots.txt, rate limiting, and safe defaults**
38
+ - Provides **semantic search, clustering, and visualization**
39
+ - Generates **court-ready citations and FOIA request packets**
40
 
41
  ---
42
 
43
  ## πŸ›οΈ Live Public Sources
44
+
45
+ This Space currently supports live querying or indexed access to the following **public FOIA repositories**:
46
+
47
+ - **CIA FOIA Electronic Reading Room**
48
+ - **FBI Vault**
49
+ - **NSA FOIA Library**
50
+ - **Department of Defense FOIA Reading Room**
51
+ - **National Reconnaissance Office (NRO) Declassified Releases**
52
+ - **Department of Justice (DOJ) FOIA Library**
53
+ - **Department of Homeland Security (DHS) FOIA Library**
54
+ - **U.S. Department of State FOIA Reading Room**
55
+
56
+ All access is:
57
+ - Public
58
+ - Unauthenticated
59
+ - Non-privileged
60
+ - Read-only
61
 
62
  ---
63
 
64
+ ## πŸ“‚ Hosted Public Collections (Clearly Labeled)
65
+
66
+ Some historically named programs or collections (e.g. **AATIP**, **Special Access Programs**, **Special Activities**) do **not** operate independent FOIA portals.
67
+
68
+ When these appear in the interface, they are:
69
 
70
+ - Explicitly labeled as **Hosted Public Releases**
 
71
  - Linked to the **original publishing agency**
72
+ - Limited to **already-declassified, publicly released documents**
73
+ - Included for **historical research and transparency**
74
+
75
+ No restricted or classified systems are accessed.
76
+
77
+ ---
78
+
79
+ ## πŸ“Š Analytics & Visualization Features
80
+
81
+ - **Real-time agency coverage heatmap** (documents per agency)
82
+ - **Latency & health indicators** per source
83
+ - **Interactive semantic cluster graph** (Plotly)
84
+ - **Timeline views** for document release dates
85
+ - **Result deduplication and clustering**
86
+
87
+ ---
88
+
89
+ ## 🧠 Semantic Search
90
+
91
+ - Uses **sentence-transformers + FAISS**
92
+ - Supports:
93
+ - Semantic clustering
94
+ - Search-within-results
95
+ - Topic grouping
96
+ - No model training on classified or private data
97
+
98
+ ---
99
+
100
+ ## 🧾 Legal & Journalistic Tools
101
+
102
+ - **Court-ready Bluebook citation PDF export**
103
+ - **Journalist ZIP export** (documents + index)
104
+ - **FOIA request packet generator** (PDF / text)
105
+ - **FOIA exemption (b-code) labeling** (heuristic, informational only)
106
+
107
+ > ⚠️ FOIA requests are **generated only**.
108
+ > This Space does **not** submit requests on behalf of users.
109
 
110
  ---
111
 
112
  ## βš–οΈ What This Space Does NOT Do
113
+
114
  ❌ No classified access
115
  ❌ No scraping behind authentication
116
+ ❌ No bypassing agency safeguards
117
  ❌ No intelligence analysis or inference
118
+ ❌ No automated FOIA submissions
119
+ ❌ No user tracking beyond standard HF analytics
120
 
121
  ---
122
 
123
+ ## πŸ“œ Legal Basis
124
+
125
+ All content accessed through:
126
+
127
+ - **5 U.S.C. Β§ 552 (Freedom of Information Act)**
128
+ - Agency-maintained **Electronic Reading Rooms**
129
+ - Public U.S. Government websites
130
+
131
+ This Space operates entirely within:
132
+ - U.S. law
133
+ - Agency publication rules
134
+ - Hugging Face platform policies
135
 
136
  ---
137
 
138
+ ## πŸ›‘οΈ Safety & Governance
139
+
140
+ - Robots.txt enforcement per adapter
141
+ - Per-agency rate limits
142
+ - Per-agency kill switches
143
+ - Health monitoring & auto-disable
144
+ - Adapter compliance tests (CI)
145
+
146
+ ---
147
+
148
+ ## 🎯 Intended Use
149
+
150
+ This project is intended to support:
151
+
152
+ - Investigative journalism
153
+ - Academic and historical research
154
+ - Legal review and litigation support
155
+ - Government oversight
156
+ - Public transparency initiatives
157
+
158
+ ---
159
+
160
+ ## πŸ“¬ Disclaimer
161
+
162
+ This tool aggregates and analyzes **public information only**.
163
+ It does not provide legal advice, intelligence assessments, or official interpretations.
164
+
165
+ Users are responsible for verifying primary sources.
166
+
167
+ ---
168
+
169
+ ## πŸ“¦ Open Source & Transparency
170
+
171
+ All adapters, logic, and safety mechanisms are visible in the source code.
172
+ No hidden data sources or privileged access exist.
173
+
174
+ ---
175
 
176
+ **Federal FOIA Intelligence Search**
177
+ *Making public records easier to find, understand, and cite.*