Add styled HTML version of research paper
Browse files- paper.html +125 -0
paper.html
ADDED
|
@@ -0,0 +1,125 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
<!DOCTYPE html>
|
| 2 |
+
<html lang="en">
|
| 3 |
+
<head>
|
| 4 |
+
<meta charset="UTF-8">
|
| 5 |
+
<meta name="viewport" content="width=device-width, initial-scale=1.0">
|
| 6 |
+
<title>AIDP Video Forge: GPU-Accelerated Video Processing on Decentralized Compute Networks</title>
|
| 7 |
+
<style>
|
| 8 |
+
:root { --primary: #10b981; --bg: #0f172a; --surface: #1e293b; --text: #e2e8f0; --muted: #94a3b8; }
|
| 9 |
+
* { margin: 0; padding: 0; box-sizing: border-box; }
|
| 10 |
+
body { font-family: 'Inter', -apple-system, system-ui, sans-serif; background: var(--bg); color: var(--text); line-height: 1.7; }
|
| 11 |
+
.container { max-width: 800px; margin: 0 auto; padding: 2rem 1.5rem; }
|
| 12 |
+
header { text-align: center; padding: 3rem 0 2rem; border-bottom: 1px solid #334155; margin-bottom: 2rem; }
|
| 13 |
+
h1 { font-size: 2rem; font-weight: 700; background: linear-gradient(135deg, #34d399, #10b981); -webkit-background-clip: text; -webkit-text-fill-color: transparent; margin-bottom: 1rem; }
|
| 14 |
+
.meta { color: var(--muted); font-size: 0.9rem; }
|
| 15 |
+
.meta a { color: var(--primary); text-decoration: none; }
|
| 16 |
+
h2 { font-size: 1.4rem; color: #34d399; margin: 2rem 0 1rem; padding-bottom: 0.5rem; border-bottom: 1px solid #334155; }
|
| 17 |
+
p { margin-bottom: 1rem; }
|
| 18 |
+
table { width: 100%; border-collapse: collapse; margin: 1rem 0 1.5rem; font-size: 0.9rem; }
|
| 19 |
+
th { background: #334155; padding: 0.6rem 0.8rem; text-align: left; font-weight: 600; }
|
| 20 |
+
td { padding: 0.6rem 0.8rem; border-bottom: 1px solid #334155; }
|
| 21 |
+
tr:hover td { background: rgba(16, 185, 129, 0.05); }
|
| 22 |
+
pre { background: var(--surface); border: 1px solid #334155; border-radius: 8px; padding: 1rem; overflow-x: auto; margin: 1rem 0; font-size: 0.85rem; }
|
| 23 |
+
code { font-family: 'JetBrains Mono', 'Fira Code', monospace; }
|
| 24 |
+
.badge { display: inline-block; background: var(--primary); color: white; padding: 0.25rem 0.75rem; border-radius: 999px; font-size: 0.8rem; margin: 0.25rem; }
|
| 25 |
+
.results-grid { display: grid; grid-template-columns: repeat(auto-fit, minmax(150px, 1fr)); gap: 1rem; margin: 1.5rem 0; }
|
| 26 |
+
.result-card { background: var(--surface); border: 1px solid #334155; border-radius: 12px; padding: 1.25rem; text-align: center; }
|
| 27 |
+
.result-card .value { font-size: 1.8rem; font-weight: 700; color: #34d399; }
|
| 28 |
+
.result-card .label { font-size: 0.8rem; color: var(--muted); margin-top: 0.25rem; }
|
| 29 |
+
.bibtex { background: #1a1a2e; border: 1px solid #334155; border-radius: 8px; padding: 1rem; font-size: 0.8rem; white-space: pre-wrap; font-family: monospace; }
|
| 30 |
+
footer { text-align: center; padding: 2rem 0; border-top: 1px solid #334155; margin-top: 3rem; color: var(--muted); font-size: 0.85rem; }
|
| 31 |
+
footer a { color: var(--primary); text-decoration: none; }
|
| 32 |
+
a { color: var(--primary); }
|
| 33 |
+
</style>
|
| 34 |
+
</head>
|
| 35 |
+
<body>
|
| 36 |
+
<div class="container">
|
| 37 |
+
<header>
|
| 38 |
+
<h1>AIDP Video Forge</h1>
|
| 39 |
+
<p style="font-size: 1.1rem; color: var(--muted); margin-bottom: 1rem;">GPU-Accelerated Video Processing on Decentralized Compute Networks</p>
|
| 40 |
+
<p class="meta">Matthew Karsten · <a href="https://github.com/ExpertVagabond">Purple Squirrel Networks</a> · February 2026</p>
|
| 41 |
+
<div style="margin-top: 1rem;">
|
| 42 |
+
<span class="badge">gpu-acceleration</span>
|
| 43 |
+
<span class="badge">nvenc</span>
|
| 44 |
+
<span class="badge">cuda</span>
|
| 45 |
+
<span class="badge">video-processing</span>
|
| 46 |
+
</div>
|
| 47 |
+
</header>
|
| 48 |
+
|
| 49 |
+
<h2>Abstract</h2>
|
| 50 |
+
<p>We present AIDP Video Forge, a GPU-accelerated video processing system leveraging decentralized compute networks. Our approach utilizes NVIDIA hardware encoding (NVENC) and CUDA-accelerated filters across distributed GPU nodes to provide <strong>10-20x faster video encoding</strong> compared to CPU-based methods. Through intelligent job orchestration and distributed batch processing, we achieve <strong>40-60% cost reduction</strong> versus centralized cloud GPU services while maintaining professional-grade video quality.</p>
|
| 51 |
+
|
| 52 |
+
<h2>Key Results</h2>
|
| 53 |
+
<div class="results-grid">
|
| 54 |
+
<div class="result-card"><div class="value">16x</div><div class="label">Faster than CPU</div></div>
|
| 55 |
+
<div class="result-card"><div class="value">58%</div><div class="label">Cost Reduction</div></div>
|
| 56 |
+
<div class="result-card"><div class="value">95.8</div><div class="label">VMAF Score</div></div>
|
| 57 |
+
<div class="result-card"><div class="value">37x</div><div class="label">Distributed (5 GPU)</div></div>
|
| 58 |
+
</div>
|
| 59 |
+
<table>
|
| 60 |
+
<tr><th>Metric</th><th>AIDP Video Forge</th><th>AWS MediaConvert</th><th>Improvement</th></tr>
|
| 61 |
+
<tr><td>Encoding Speed (4K)</td><td>2.8 min</td><td>3.2 min</td><td><strong>16x faster than CPU</strong></td></tr>
|
| 62 |
+
<tr><td>Cost per Hour</td><td>$0.25</td><td>$0.60</td><td><strong>58% cheaper</strong></td></tr>
|
| 63 |
+
<tr><td>Quality (VMAF)</td><td>95.8</td><td>96.0</td><td>Near-identical</td></tr>
|
| 64 |
+
<tr><td>Distributed (5 GPUs)</td><td>1.2 min</td><td>N/A</td><td><strong>37x faster than CPU</strong></td></tr>
|
| 65 |
+
</table>
|
| 66 |
+
|
| 67 |
+
<h2>Architecture</h2>
|
| 68 |
+
<pre><code>+----------------------------------------------------------+
|
| 69 |
+
| Video Forge |
|
| 70 |
+
+----------------------------------------------------------+
|
| 71 |
+
| Client (Web UI / CLI) |
|
| 72 |
+
| +-- Upload video -> Select processing -> Download |
|
| 73 |
+
+----------------------------------------------------------+
|
| 74 |
+
| Job Orchestrator |
|
| 75 |
+
| +-- Queue jobs -> Assign to AIDP nodes -> Aggregate |
|
| 76 |
+
+----------------------------------------------------------+
|
| 77 |
+
| AIDP GPU Workers |
|
| 78 |
+
| +-- FFmpeg + NVENC + CUDA filters |
|
| 79 |
+
+----------------------------------------------------------+</code></pre>
|
| 80 |
+
|
| 81 |
+
<h2>GPU Acceleration: NVENC vs CPU</h2>
|
| 82 |
+
<table>
|
| 83 |
+
<tr><th>Operation</th><th>CPU Method</th><th>GPU Method</th><th>Speedup</th></tr>
|
| 84 |
+
<tr><td>H.264 Encoding</td><td>libx264</td><td>h264_nvenc</td><td><strong>15-20x</strong></td></tr>
|
| 85 |
+
<tr><td>HEVC Encoding</td><td>libx265</td><td>hevc_nvenc</td><td><strong>20-30x</strong></td></tr>
|
| 86 |
+
<tr><td>Scaling</td><td>scale</td><td>scale_cuda</td><td>5-8x</td></tr>
|
| 87 |
+
<tr><td>Deinterlacing</td><td>yadif</td><td>yadif_cuda</td><td>8-10x</td></tr>
|
| 88 |
+
<tr><td>HDR Tone Map</td><td>zscale+tonemap</td><td>tonemap_cuda</td><td>15x</td></tr>
|
| 89 |
+
<tr><td>LUT Application</td><td>lut3d</td><td>CUDA texture</td><td>10x</td></tr>
|
| 90 |
+
</table>
|
| 91 |
+
|
| 92 |
+
<h2>Processing Speed Benchmark</h2>
|
| 93 |
+
<table>
|
| 94 |
+
<tr><th>Method</th><th>Time (10-min 4K)</th><th>Real-time Speed</th><th>Speedup</th></tr>
|
| 95 |
+
<tr><td>CPU (libx264)</td><td>45 minutes</td><td>0.22x</td><td>1x baseline</td></tr>
|
| 96 |
+
<tr><td>AWS MediaConvert (T4)</td><td>3.2 minutes</td><td>3.1x</td><td>14x faster</td></tr>
|
| 97 |
+
<tr><td><strong>AIDP Video Forge (RTX 3090)</strong></td><td><strong>2.8 minutes</strong></td><td><strong>3.6x</strong></td><td><strong>16x faster</strong></td></tr>
|
| 98 |
+
<tr><td><strong>Distributed (5 GPUs)</strong></td><td><strong>1.2 minutes</strong></td><td><strong>8.3x</strong></td><td><strong>37x faster</strong></td></tr>
|
| 99 |
+
</table>
|
| 100 |
+
|
| 101 |
+
<h2>Technical Contributions</h2>
|
| 102 |
+
<ol style="padding-left: 1.5rem; margin-bottom: 1.5rem;">
|
| 103 |
+
<li style="margin-bottom: 0.5rem;"><strong>Hardware Acceleration</strong>: Full NVENC/CUDA pipeline eliminating CPU bottlenecks</li>
|
| 104 |
+
<li style="margin-bottom: 0.5rem;"><strong>Distributed Processing</strong>: Intelligent job splitting across multiple GPU nodes</li>
|
| 105 |
+
<li style="margin-bottom: 0.5rem;"><strong>Cost Efficiency</strong>: 40-60% reduction vs centralized cloud GPU services</li>
|
| 106 |
+
<li style="margin-bottom: 0.5rem;"><strong>Quality Preservation</strong>: VMAF 95.8 — near-identical to reference encoding</li>
|
| 107 |
+
</ol>
|
| 108 |
+
|
| 109 |
+
<h2>Citation</h2>
|
| 110 |
+
<div class="bibtex">@techreport{karsten2026videoforge,
|
| 111 |
+
title={AIDP Video Forge: GPU-Accelerated Video Processing on Decentralized Compute Networks},
|
| 112 |
+
author={Karsten, Matthew},
|
| 113 |
+
institution={Purple Squirrel Networks},
|
| 114 |
+
year={2026},
|
| 115 |
+
month={February},
|
| 116 |
+
url={https://huggingface.co/purplesquirrelnetworks/aidp-video-forge-paper}
|
| 117 |
+
}</div>
|
| 118 |
+
|
| 119 |
+
<footer>
|
| 120 |
+
<p><a href="https://huggingface.co/purplesquirrelnetworks/aidp-video-forge-paper">View on Hugging Face</a> · <a href="https://huggingface.co/purplesquirrelnetworks/aidp-neural-cloud-paper">Companion: AIDP Neural Cloud</a></p>
|
| 121 |
+
<p style="margin-top: 0.5rem;">Built by <a href="https://github.com/ExpertVagabond">Purple Squirrel Networks</a></p>
|
| 122 |
+
</footer>
|
| 123 |
+
</div>
|
| 124 |
+
</body>
|
| 125 |
+
</html>
|