joelniklaus (HF Staff) committed
Commit 1c77cab · 1 Parent(s): 62145f9

add baselines mixed with fw-edu-hq

app/src/content/assets/data/benchmark-results.csv CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:27dd686263a9217a306811036fd361d7616dc6231393f311387d1b5dd065f595
-size 1334642
+oid sha256:0359f44cbbe97ee8f7ea598152a5053a322a81af818de890606e0daa6c15fd3a
+size 1378100
app/src/content/chapters/3-experiments.mdx CHANGED
@@ -10,10 +10,17 @@ import FigRef from "../../components/FigRef.astro";
 {/* TODO: Integrate decay experiment as another analysis for proxy */}
 {/* TODO: share on a bunch of discords/slacks/hackernews/locallama */}
 {/* TODO: brainstorm better banner, be artsy */}
+{/* TODO: banner idea: 1T tokens = 8M books
+   5 cm per book = 400 km
+
+   Then you could stack the books on top of each other and show the distance on a map, for example. Or compare it with something.
+   Or make a dot for each book.
+*/}
 {/* TODO: improve the diagram for the infrastructure at the start of the section */}
 {/* TODO: final configuration for finephrase at the end of infra section: visualization of how many pages (500 tokens) (use page emojis flying from left to right) we can generate (real time), user can configure with a slider the number of GPUs */}
 {/* TODO: only explain datatrove additions when we need them (for generating the final finephrase) */}
 {/* TODO: move infrastructure section after analyses as precursor and explanation for finephrase */}
+{/* TODO: baselines mixed with fw-edu-hq usually improve upon just baselines, but not sure if/how to present this */}

 {/*
 Notes:
app/src/content/chapters/5-infrastructure.mdx CHANGED
@@ -451,6 +451,7 @@ With a trillion-parameter model you won't be generating billions of tokens per h
 Further improvement ideas:
 - add a second model below so we can compare. Suggest something cool for the numbers below.
 - Also add some animations (page turning, flapping books, bookshelves, books coming in and out)
+- Clean it up a bit to make it less cluttered
 */}

 To get an intuition for what these throughput numbers feel like, <FigRef target="inference-throughput" /> lets you pick a model and scale up the number of GPUs. Each page represents roughly 500 tokens of generated text. At high enough throughput, pages roll up into books (250 pages each), and books into bookshelves (250 books each).
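The page/book/bookshelf conversions described above (and the "1T tokens = 8M books = 400 km" banner idea in the TODO) follow from simple arithmetic. A minimal sketch, assuming the constants stated in the text (500 tokens per page, 250 pages per book, 250 books per shelf, 5 cm of spine per book); `token_breakdown` is a hypothetical helper, not part of the repo:

```python
# Constants taken from the diff's text; METERS_PER_BOOK (5 cm) is
# from the banner-idea TODO.
TOKENS_PER_PAGE = 500
PAGES_PER_BOOK = 250
BOOKS_PER_SHELF = 250
METERS_PER_BOOK = 0.05

def token_breakdown(tokens: int) -> dict:
    """Convert a token count into pages, books, shelves, and stack height."""
    pages = tokens // TOKENS_PER_PAGE
    books = pages // PAGES_PER_BOOK
    shelves = books // BOOKS_PER_SHELF
    return {
        "pages": pages,
        "books": books,
        "shelves": shelves,
        # Height of all books stacked spine-on-spine, in kilometers.
        "stack_km": books * METERS_PER_BOOK / 1000,
    }

result = token_breakdown(10**12)
# 1T tokens -> 2e9 pages, 8e6 books, 32,000 shelves, a 400 km stack,
# which matches the "1T tokens = 8M books" / "400 km" figures in the TODO.
print(result)
```

This confirms the banner math is internally consistent with the 500-tokens-per-page assumption used by the throughput figure.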