Eric Xu commited on
Commit
4fb07e1
·
1 Parent(s): ffa0abd

Be honest about Nemotron dataset coverage per domain

Browse files

Nemotron has age, sex, education, occupation, marital status, geography —
great for consumer/population evaluations but lacks B2B fields like company
size, seniority, budget, tech stack. Clarify which domains use built-in
data vs. LLM-generated personas.

Files changed (1) hide show
  1. README.md +10 -6
README.md CHANGED
@@ -30,12 +30,16 @@ Anything someone else evaluates.
30
 
31
  | What you're optimizing | Who evaluates it | What you learn |
32
  |----------------------|-----------------|---------------|
33
- | **Product** — landing page, pricing, positioning | Buyer personas across company sizes, roles, budgets | Which segments convert, which are blocked, and why |
34
- | **Resume** — CV + cover letter for a target role | Hiring managers at startups, enterprises, agencies | What stands out, what's a red flag, what to lead with |
35
- | **Pitch** — investor deck | VCs and angels at different stages and sectors | Whether the story lands, what questions they'd ask |
36
- | **Policy** — proposed regulation or internal change | Stakeholders: residents, businesses, employees | Who supports it, who opposes, what compromise works |
37
- | **Content** — blog post, video, talk proposal | Readers at different expertise levels | Whether it hits the right level, what's confusing |
38
- | **Profile** — dating, professional, public bio | Representative population sample | How different demographics perceive you |
 
 
 
 
39
 
40
  In each case, SGO tells you **where you stand**, **what's working**, **what's not**, and **what specific change would help the most** — broken down by audience segment.
41
 
 
30
 
31
  | What you're optimizing | Who evaluates it | What you learn |
32
  |----------------------|-----------------|---------------|
33
+ | **Product** — landing page, pricing | Buyer personas by company size, role, budget | Which segments convert, which are blocked, and why |
34
+ | **Resume** — CV + cover letter | Hiring managers at startups vs. enterprises | What stands out, what's a red flag, what to lead with |
35
+ | **Pitch** — investor deck | VCs and angels at different stages | Whether the story lands, what questions they'd ask |
36
+ | **Policy** — proposed regulation | Stakeholders by role, income, geography | Who supports it, who opposes, what compromise works |
37
+ | **Content** — blog post, video | Readers at different expertise levels | Whether it hits the right level, what's confusing |
38
+ | **Profile** — dating, professional bio | Population sample by age, education, occupation | How different demographics perceive you |
39
+
40
+ SGO ships with a 1M-person census-grounded dataset ([Nemotron-Personas-USA](https://huggingface.co/datasets/nvidia/Nemotron-Personas-USA)) that covers **age, sex, education, occupation, marital status, and US geography**. This works well out of the box for profiles, policy, and any consumer-facing evaluation.
41
+
42
+ For specialized domains (B2B buyers, hiring managers, VCs), the dataset doesn't have fields like company size, seniority, or budget — so SGO generates those personas via LLM with explicit stratification constraints. The results are still useful, but see [limitations](#limitations) on LLM-generated panels vs. census-grounded ones.
43
 
44
  In each case, SGO tells you **where you stand**, **what's working**, **what's not**, and **what specific change would help the most** — broken down by audience segment.
45