Commit 2b72bd7 · Parent: a99a2cf
improved phrasing and banner
app/src/content/chapters/1-introduction.mdx
CHANGED

```diff
@@ -52,9 +52,9 @@ During SmolLM2 [@smollm2] training, the model was decent at coding and math but
 
 However, how to do synthetic data generation properly still resembles alchemy these days: Which model should you use? Which prompts work best and how many do you need? And how do you even scale this effectively?
 
-Here's
+Our goal is to turn this alchemy into chemistry: replace intuition with systematic, reproducible experiments. Here's how we go about it:
 <Sidenote>
-
+Lavoisier replaced phlogiston theory with precise measurements and repeatable experiments, earning him the title "father of modern chemistry".
 </Sidenote>
 
 We start by [setting up the problem](#rephrasing-the-web): what rephrasing is, which approaches exist, and what we want to test. Then we dive into the 90 [Experiments](#experiments) we ran to figure out which prompts, models, and datasets actually work. The [Analyses](#analyses) section zooms out to ask *why* things work the way they do. Next comes the [Infrastructure](#infrastructure) that made all of this possible, including detailed throughput benchmarking of popular models (super important for getting the most data for your bucks). Finally, we [put it all together](#applying-the-recipe-at-scale) into FinePhrase, our best configuration.
```
|
app/src/content/embeds/banner.html
CHANGED

```diff
@@ -685,7 +685,7 @@
       .attr('fill', isDark ? 'rgba(255,255,255,0.4)' : 'rgba(0,0,0,0.38)')
       .attr('font-size', subFS).attr('font-weight', 500)
       .attr('letter-spacing', '0.14em')
-      .text(`${numExperiments} EXPERIMENTS \u00B7 ${totalDocsB}B DOCUMENTS`));
+      .text(`${numExperiments} EXPERIMENTS \u00B7 ${totalDocsB}B DOCUMENTS \u00B7 1 PAGE \u2248 100M TOKENS`));
 
 // Legend
 const familyCounts = {};
```
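For illustration, here is a minimal sketch (plain Node.js, no D3) of the string the updated `.text(...)` call renders. The values `numExperiments = 90` (the 90 experiments mentioned in the introduction) and `totalDocsB = 2` are assumed stand-ins; the real variables are computed elsewhere in banner.html:

```javascript
// Hypothetical stand-in values; in banner.html these are computed from the data.
const numExperiments = 90; // the post reports 90 experiments
const totalDocsB = 2;      // placeholder, not taken from the diff

// Same template literal as the updated `.text(...)` call:
const label = `${numExperiments} EXPERIMENTS \u00B7 ${totalDocsB}B DOCUMENTS \u00B7 1 PAGE \u2248 100M TOKENS`;
console.log(label); // "90 EXPERIMENTS · 2B DOCUMENTS · 1 PAGE ≈ 100M TOKENS"
```

`\u00B7` is the middle dot (·) and `\u2248` the almost-equal sign (≈), so the banner subtitle gains a "1 PAGE ≈ 100M TOKENS" scale hint after this commit.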