Spaces:

danielrosehill
/

Single-Shot-Brevity-Training

Running

danielrosehill Claude commited on Oct 27, 2025

Commit

50271b5

1 Parent(s): 82102ea

Create comprehensive Hugging Face Space for Single-Shot Brevity Training experiment

Enhanced the static Space to showcase the LLM brevity training experiment with:
- Complete HTML page detailing the problem, methodology, and findings
- Professional gradient design with responsive layout
- Key statistics and model performance comparisons
- Embedded visualization charts (bar chart and 4-panel analysis)
- Links to GitHub repository and resources
- Updated README with experiment overview and key findings
- Configured Git LFS for PNG image files

Generated with Claude Code (https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

Files changed (6) hide show

.gitattributes +1 -0
README.md +24 -1
index.html +116 -8
style.css +242 -14
verbosity_analysis.png +3 -0
verbosity_bar_chart.png +3 -0

.gitattributes CHANGED Viewed

@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+*.png filter=lfs diff=lfs merge=lfs -text

README.md CHANGED Viewed

@@ -8,4 +8,27 @@ pinned: false
 short_description: Using one example to train an LLM for informational brevity
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 short_description: Using one example to train an LLM for informational brevity
 ---
+# Single-Shot Brevity Training
+An experiment exploring how to train Large Language Models to provide concise, informative responses using a single example rather than abstract instructions.
+## Overview
+This Hugging Face Space showcases an approach to addressing LLM verbosity by demonstrating the desired response format with one concrete example in the system prompt.
+## Key Findings
+- Response lengths varied by **5.5x** across 14 tested models
+- Most concise: AI21 Jamba Large (295 words)
+- Most verbose: OpenAI GPT-OSS-120B (1,632 words)
+- Optimized examples achieved **60-75% word reduction**
+## Resources
+- [Full GitHub Repository](https://github.com/danielrosehill/Single-Shot-Brevity-Training) - Complete data, analysis, and system prompts
+- [Raw Response Data](https://github.com/danielrosehill/Single-Shot-Brevity-Training/tree/main/responses) - Baseline outputs from all models
+- [Optimized Examples](https://github.com/danielrosehill/Single-Shot-Brevity-Training/tree/main/optimized) - Demonstrating ideal brevity
+## Created By
+[Daniel Rosehill](https://danielrosehill.com) - Part of ongoing research in LLM optimization and prompt engineering

index.html CHANGED Viewed

@@ -3,17 +3,125 @@
 	<head>
 		<meta charset="utf-8" />
 		<meta name="viewport" content="width=device-width" />
-		<title>My static Space</title>
 		<link rel="stylesheet" href="style.css" />
 	</head>
 	<body>
-		<div class="card">
-			<h1>Welcome to your static Space!</h1>
-			<p>You can modify this app directly by editing <i>index.html</i> in the Files and versions tab.</p>
-			<p>
-				Also don't forget to check the
-				<a href="https://huggingface.co/docs/hub/spaces" target="_blank">Spaces documentation</a>.
-			</p>
 		</div>
 	</body>
 </html>

 	<head>
 		<meta charset="utf-8" />
 		<meta name="viewport" content="width=device-width" />
+		<title>Single-Shot Brevity Training | LLM Response Optimization</title>
 		<link rel="stylesheet" href="style.css" />
 	</head>
 	<body>
+		<div class="container">
+			<header>
+				<h1>Single-Shot Brevity Training</h1>
+				<p class="subtitle">Using One Example to Train LLMs for Informational Brevity</p>
+				<div class="links">
+					<a href="https://github.com/danielrosehill/Single-Shot-Brevity-Training" target="_blank" class="btn">View on GitHub</a>
+				</div>
+			</header>
+			<section class="card">
+				<h2>The Problem</h2>
+				<p>Large Language Models often generate excessively verbose responses, even when concise, informative answers would be more valuable. This experiment explores a simple yet effective approach to guide models toward brevity without sacrificing information quality.</p>
+			</section>
+			<section class="card">
+				<h2>The Approach</h2>
+				<p>Rather than abstract instructions like "be concise," this framework uses <strong>single-shot training</strong>: demonstrating the desired format with one concrete example in the system prompt.</p>
+				<h3>Two-Phase Methodology</h3>
+				<div class="phase">
+					<h4>Phase 1: Baseline Evaluation</h4>
+					<p>Tested 14 models using a standardized product recommendation prompt (power bank selection) without any brevity instructions to establish natural response lengths.</p>
+				</div>
+				<div class="phase">
+					<h4>Phase 2: Single-Shot Training</h4>
+					<p>Selected models received system prompts containing one optimized response example to guide future outputs toward similar brevity.</p>
+				</div>
+			</section>
+			<section class="card highlight">
+				<h2>Key Findings</h2>
+				<div class="stat-grid">
+					<div class="stat">
+						<div class="stat-number">5.5x</div>
+						<div class="stat-label">Difference between longest and shortest responses</div>
+					</div>
+					<div class="stat">
+						<div class="stat-number">794</div>
+						<div class="stat-label">Mean response length (words)</div>
+					</div>
+					<div class="stat">
+						<div class="stat-number">60-75%</div>
+						<div class="stat-label">Word reduction in optimized examples</div>
+					</div>
+				</div>
+				<h3>Model Response Length Comparison</h3>
+				<div class="chart-container">
+					<img src="verbosity_bar_chart.png" alt="Bar chart comparing word counts across 14 LLM models" class="chart-image">
+					<p class="chart-caption">Comparison of response lengths across 14 evaluated models</p>
+				</div>
+				<h3>Comprehensive Verbosity Analysis</h3>
+				<div class="chart-container">
+					<img src="verbosity_analysis.png" alt="Four-panel analysis of response verbosity characteristics" class="chart-image">
+					<p class="chart-caption">Multi-faceted examination of response characteristics and patterns</p>
+				</div>
+				<h3>Response Length Variation</h3>
+				<ul>
+					<li><strong>Longest:</strong> 1,632 words (OpenAI GPT-OSS-120B)</li>
+					<li><strong>Shortest:</strong> 295 words (AI21 Jamba Large)</li>
+					<li><strong>Standard deviation:</strong> 456 words</li>
+				</ul>
+				<h3>Most Concise Performers</h3>
+				<ol class="model-list">
+					<li><strong>AI21 Jamba Large</strong> - 295 words</li>
+					<li><strong>Mistral Large</strong> - 352 words</li>
+					<li><strong>Meta Llama 4 Maverick</strong> - 397 words</li>
+				</ol>
+				<h3>Most Verbose Performers</h3>
+				<ol class="model-list">
+					<li><strong>OpenAI GPT-OSS-120B</strong> - 1,632 words</li>
+					<li><strong>Google Gemini 2.5 Flash</strong> - 1,607 words</li>
+				</ol>
+			</section>
+			<section class="card">
+				<h2>Repository Contents</h2>
+				<ul>
+					<li><strong>Raw Response Data:</strong> Complete baseline outputs from all tested models</li>
+					<li><strong>Optimized Examples:</strong> Demonstrating ideal brevity (60-75% word reduction)</li>
+					<li><strong>Model-Specific System Prompts:</strong> Implementing single-shot training for practical application</li>
+					<li><strong>Statistical Analysis:</strong> Comprehensive comparison of response lengths and patterns</li>
+				</ul>
+			</section>
+			<section class="card">
+				<h2>Practical Applications</h2>
+				<p>This approach offers several benefits for LLM deployment:</p>
+				<ul>
+					<li><strong>Cost Reduction:</strong> Shorter responses mean fewer output tokens and lower API costs</li>
+					<li><strong>User Experience:</strong> Concise responses are faster to read and process</li>
+					<li><strong>Efficiency:</strong> One example is simpler than complex prompt engineering</li>
+					<li><strong>Reusability:</strong> The framework can be adapted to different use cases and domains</li>
+				</ul>
+			</section>
+			<section class="card">
+				<h2>Get Involved</h2>
+				<p>This is an open experiment exploring effective LLM training techniques. The repository includes all data, prompts, and analysis for transparency and reproducibility.</p>
+				<div class="links">
+					<a href="https://github.com/danielrosehill/Single-Shot-Brevity-Training" target="_blank" class="btn btn-primary">Explore the Repository</a>
+					<a href="https://github.com/danielrosehill/Single-Shot-Brevity-Training/issues" target="_blank" class="btn">Share Feedback</a>
+				</div>
+			</section>
+			<footer>
+				<p>Created by <a href="https://danielrosehill.com" target="_blank">Daniel Rosehill</a></p>
+				<p>Part of ongoing research in LLM optimization and prompt engineering</p>
+			</footer>
 		</div>
 	</body>
 </html>

style.css CHANGED Viewed

@@ -1,28 +1,256 @@
 body {
-	padding: 2rem;
-	font-family: -apple-system, BlinkMacSystemFont, "Arial", sans-serif;
 }
 h1 {
-	font-size: 16px;
-	margin-top: 0;
 }
 p {
-	color: rgb(107, 114, 128);
-	font-size: 15px;
-	margin-bottom: 10px;
-	margin-top: 5px;
 }
 .card {
-	max-width: 620px;
-	margin: 0 auto;
-	padding: 16px;
-	border: 1px solid lightgray;
-	border-radius: 16px;
 }
-.card p:last-child {
 	margin-bottom: 0;
 }

+* {
+	box-sizing: border-box;
+	margin: 0;
+	padding: 0;
+}
 body {
+	font-family: -apple-system, BlinkMacSystemFont, "Segoe UI", Roboto, "Helvetica Neue", Arial, sans-serif;
+	line-height: 1.6;
+	color: #333;
+	background: linear-gradient(135deg, #667eea 0%, #764ba2 100%);
+	min-height: 100vh;
+	padding: 2rem 1rem;
+}
+.container {
+	max-width: 900px;
+	margin: 0 auto;
+}
+header {
+	text-align: center;
+	margin-bottom: 3rem;
+	background: white;
+	padding: 2.5rem 2rem;
+	border-radius: 16px;
+	box-shadow: 0 10px 30px rgba(0, 0, 0, 0.15);
 }
 h1 {
+	font-size: 2.5rem;
+	margin-bottom: 0.5rem;
+	color: #2d3748;
+	font-weight: 700;
+}
+.subtitle {
+	font-size: 1.25rem;
+	color: #718096;
+	margin-bottom: 1.5rem;
+}
+h2 {
+	font-size: 1.8rem;
+	margin-bottom: 1rem;
+	color: #2d3748;
+	font-weight: 600;
+}
+h3 {
+	font-size: 1.3rem;
+	margin-top: 1.5rem;
+	margin-bottom: 0.75rem;
+	color: #4a5568;
+	font-weight: 600;
+}
+h4 {
+	font-size: 1.1rem;
+	margin-bottom: 0.5rem;
+	color: #4a5568;
+	font-weight: 600;
 }
 p {
+	color: #4a5568;
+	font-size: 1rem;
+	margin-bottom: 1rem;
+	line-height: 1.7;
 }
 .card {
+	background: white;
+	padding: 2rem;
+	border-radius: 12px;
+	box-shadow: 0 4px 6px rgba(0, 0, 0, 0.1);
+	margin-bottom: 2rem;
+}
+.card.highlight {
+	background: linear-gradient(135deg, #f6f8fb 0%, #ffffff 100%);
+	border: 2px solid #667eea;
+}
+.stat-grid {
+	display: grid;
+	grid-template-columns: repeat(auto-fit, minmax(200px, 1fr));
+	gap: 1.5rem;
+	margin: 2rem 0;
+}
+.stat {
+	text-align: center;
+	padding: 1.5rem;
+	background: white;
+	border-radius: 8px;
+	box-shadow: 0 2px 4px rgba(0, 0, 0, 0.05);
+}
+.stat-number {
+	font-size: 2.5rem;
+	font-weight: 700;
+	color: #667eea;
+	margin-bottom: 0.5rem;
+}
+.stat-label {
+	font-size: 0.9rem;
+	color: #718096;
+	line-height: 1.4;
+}
+.phase {
+	background: #f7fafc;
+	padding: 1.5rem;
+	border-radius: 8px;
+	margin-bottom: 1rem;
+	border-left: 4px solid #667eea;
+}
+ul, ol {
+	margin-left: 1.5rem;
+	margin-bottom: 1rem;
+}
+li {
+	margin-bottom: 0.5rem;
+	color: #4a5568;
+	line-height: 1.6;
 }
+.model-list {
+	background: #f7fafc;
+	padding: 1.5rem;
+	border-radius: 8px;
+	margin-bottom: 1rem;
+}
+.model-list li {
+	margin-bottom: 0.75rem;
+}
+.chart-container {
+	margin: 2rem 0;
+	text-align: center;
+}
+.chart-image {
+	max-width: 100%;
+	height: auto;
+	border-radius: 8px;
+	box-shadow: 0 4px 8px rgba(0, 0, 0, 0.1);
+	margin-bottom: 0.75rem;
+}
+.chart-caption {
+	font-size: 0.9rem;
+	color: #718096;
+	font-style: italic;
 	margin-bottom: 0;
 }
+.links {
+	display: flex;
+	gap: 1rem;
+	justify-content: center;
+	flex-wrap: wrap;
+	margin-top: 1.5rem;
+}
+.btn {
+	display: inline-block;
+	padding: 0.75rem 1.5rem;
+	background: #667eea;
+	color: white;
+	text-decoration: none;
+	border-radius: 8px;
+	font-weight: 600;
+	transition: all 0.3s ease;
+	box-shadow: 0 4px 6px rgba(102, 126, 234, 0.3);
+}
+.btn:hover {
+	background: #5a67d8;
+	transform: translateY(-2px);
+	box-shadow: 0 6px 12px rgba(102, 126, 234, 0.4);
+}
+.btn-primary {
+	background: #764ba2;
+	box-shadow: 0 4px 6px rgba(118, 75, 162, 0.3);
+}
+.btn-primary:hover {
+	background: #6b3f91;
+	box-shadow: 0 6px 12px rgba(118, 75, 162, 0.4);
+}
+footer {
+	text-align: center;
+	padding: 2rem;
+	background: white;
+	border-radius: 12px;
+	margin-top: 2rem;
+	box-shadow: 0 4px 6px rgba(0, 0, 0, 0.1);
+}
+footer p {
+	margin-bottom: 0.5rem;
+	color: #718096;
+}
+footer a {
+	color: #667eea;
+	text-decoration: none;
+	font-weight: 600;
+}
+footer a:hover {
+	text-decoration: underline;
+}
+@media (max-width: 768px) {
+	body {
+		padding: 1rem 0.5rem;
+	}
+	h1 {
+		font-size: 2rem;
+	}
+	.subtitle {
+		font-size: 1.1rem;
+	}
+	.card {
+		padding: 1.5rem;
+	}
+	header {
+		padding: 2rem 1.5rem;
+	}
+	.stat-grid {
+		grid-template-columns: 1fr;
+	}
+	.links {
+		flex-direction: column;
+	}
+	.btn {
+		width: 100%;
+		text-align: center;
+	}
+}

verbosity_analysis.png ADDED Viewed

Git LFS Details

SHA256: 388590e67529454a23612bcdc725dbc9153dc160a5640073a3c3807d92d7e198
Pointer size: 131 Bytes
Size of remote file: 814 kB

verbosity_bar_chart.png ADDED Viewed

Git LFS Details

SHA256: 6e9b74a00a48c9b59cdf05d5b7ac4c12157cb860c3b28ac5fe692f26e2b3d107
Pointer size: 131 Bytes
Size of remote file: 305 kB