| | --- |
| | title: Entropy Harvester |
| | emoji: π |
| | colorFrom: red |
| | colorTo: red |
| | sdk: gradio |
| | sdk_version: 5.44.1 |
| | app_file: app.py |
| | pinned: false |
| | license: mit |
| | --- |
| | # Dataset Energy & Entropy Analyzer (Gradio) |
| |
|
| | A lightweight app that analyzes a CSV and reports: |
| | - Global compressibility (gzip ratio) |
| | - Per-column entropy (numeric via quantile-binning; categorical via counts) |
| | - Monotone runs and run-entropy (numeric columns) |
| | - Sortedness fraction (numeric columns) |
| | - 2D Pareto maxima (first two numeric columns) |
| | - kd-partition entropy approximation (first two numeric columns) |
| | - Overall **Harvestable Energy** score (0β1) |
| |
|
| | ## How it relates to "Harvestable Energy" & range-partition entropy |
| | - Lower entropy and higher compressibility imply more exploitable structure β higher harvestable energy. |
| | - kd-entropy approximates how many bits are needed to split spatial data into simple blocks (a proxy for range-partition entropy). |
| | - Run-entropy captures how many monotone runs are present (adaptive sorting lens). |
| | Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference |
| |
|