Update app.py
app.py
CHANGED
@@ -188,7 +188,7 @@ This dataset is a derivative collection of public documents released by the U.S.
 You are solely responsible for complying with applicable law, institutional policies, and the terms of the original House release. If you plan to use this corpus in a public‑facing product or at scale, seek independent legal advice.
 
 ### The corpus contains:
-OCR noise
+OCR noise, misrecognized characters, broken formatting, redaction blocks, stamps, and markers inherited from the original scans. Therefore, some of it may not be formatted correctly. Feel free to contribute, to improve the data.
 """
 )
 