aimonp commited on
Commit
c8fe18f
·
verified ·
1 Parent(s): ab417f1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +34 -30
README.md CHANGED
@@ -18,60 +18,64 @@ license: other
18
 
19
  **Your AI teams are ready. Their data isn't.**
20
 
21
- DataFramer helps teams **generate, anonymize, augment, and simulate** realistic datasets for **testing, evaluations, and fine-tuning** of ML and AI systems—without relying on sensitive production data. It is built for **complex, enterprise data**: multi-file, multi-format, structured, unstructured, nested, and long-context workflows. :contentReference[oaicite:1]{index=1}
 
 
 
 
22
 
23
  ## What DataFramer is for
24
 
25
  DataFramer is designed for teams that need to:
26
 
27
- - create **synthetic datasets** from seed examples or from scratch
28
- - **anonymize sensitive data** while preserving structure and task-relevant fidelity
29
- - **augment and transform** existing datasets for broader coverage
30
- - **simulate edge cases, rare events, and scenarios** absent from historical data
31
- - build **evaluation datasets** that better reflect real-world model behavior :contentReference[oaicite:2]{index=2}
32
 
33
  ## Best-fit use cases
34
 
35
- - **LLM and AI evals**
36
- Generate eval sets with better coverage across normal cases, edge cases, rare events, and boundary conditions.
37
 
38
- - **Privacy-safe experimentation**
39
- Work with compliant alternatives when production data is restricted by privacy, security, or governance requirements.
40
 
41
- - **Testing for complex workflows**
42
- Create realistic data for claims, fraud, underwriting, patient workflows, document pipelines, and other enterprise systems.
43
 
44
- - **Model training and fine-tuning**
45
- Expand sparse datasets and improve diversity while preserving the structure, constraints, and relationships models depend on. :contentReference[oaicite:3]{index=3}
46
 
47
  ## Built for complex enterprise data
48
 
49
  DataFramer supports workflows involving:
50
 
51
- - long-form documents and PDFs
52
- - JSON, XML, CSV, Parquet, and other structured formats
53
- - nested and hierarchical records
54
- - multi-file and high-token samples
55
- - conversational, document, and domain-specific datasets :contentReference[oaicite:4]{index=4}
56
 
57
  ## Who uses it
58
 
59
  DataFramer is especially relevant for teams in **regulated and data-sensitive industries**, including:
60
 
61
- - financial services
62
- - insurance
63
- - healthcare
64
- - other enterprise environments with strict privacy, governance, and evaluation requirements :contentReference[oaicite:5]{index=5}
65
 
66
  ## Why teams use DataFramer
67
 
68
- - preserve **structure and constraints**, not just surface realism
69
- - generate data for **testing, evals, and fine-tuning**
70
- - improve **edge-case coverage**
71
- - reduce dependence on restricted production data
72
- - work with **complex, real-world data formats** that generic tools struggle to handle :contentReference[oaicite:6]{index=6}
73
 
74
  ## Learn more
75
 
76
- Website: `https://www.dataframer.ai`
77
- Docs: `https://docs.dataframer.ai`
 
18
 
19
  **Your AI teams are ready. Their data isn't.**
20
 
21
+ DataFramer helps teams **generate, anonymize, augment, and simulate** realistic datasets for **testing, evaluations, and fine-tuning** of ML and AI systems—without relying on sensitive production data.
22
+
23
+ It is built for **complex, enterprise data**: multi-file, multi-format, structured, unstructured, nested, and long-context workflows.
24
+
25
+ Learn more: https://www.dataframer.ai
26
 
27
  ## What DataFramer is for
28
 
29
  DataFramer is designed for teams that need to:
30
 
31
+ - create **synthetic datasets** from seed examples or from scratch
32
+ - **anonymize sensitive data** while preserving structure and task-relevant fidelity
33
+ - **augment and transform** existing datasets for broader coverage
34
+ - **simulate edge cases, rare events, and scenarios** absent from historical data
35
+ - build **evaluation datasets** that better reflect real-world model behavior
36
 
37
  ## Best-fit use cases
38
 
39
+ **LLM and AI evals**
40
+ Generate eval sets with better coverage across normal cases, edge cases, rare events, and boundary conditions.
41
 
42
+ **Privacy-safe experimentation**
43
+ Work with compliant alternatives when production data is restricted by privacy, security, or governance requirements.
44
 
45
+ **Testing complex workflows**
46
+ Create realistic data for claims, fraud, underwriting, patient workflows, document pipelines, and other enterprise systems.
47
 
48
+ **Model training and fine-tuning**
49
+ Expand sparse datasets and improve diversity while preserving the structure, constraints, and relationships models depend on.
50
 
51
  ## Built for complex enterprise data
52
 
53
  DataFramer supports workflows involving:
54
 
55
+ - long-form documents and PDFs
56
+ - JSON, XML, CSV, Parquet, and other structured formats
57
+ - nested and hierarchical records
58
+ - multi-file and high-token samples
59
+ - conversational, document, and domain-specific datasets
60
 
61
  ## Who uses it
62
 
63
  DataFramer is especially relevant for teams in **regulated and data-sensitive industries**, including:
64
 
65
+ - financial services
66
+ - insurance
67
+ - healthcare
68
+ - other enterprise environments with strict privacy and governance requirements
69
 
70
  ## Why teams use DataFramer
71
 
72
+ - preserve **structure and constraints**, not just surface realism
73
+ - generate data for **testing, evals, and fine-tuning**
74
+ - improve **edge-case coverage**
75
+ - reduce dependence on restricted production data
76
+ - work with **complex, real-world data formats**
77
 
78
  ## Learn more
79
 
80
+ Website: https://www.dataframer.ai
81
+ Docs: https://docs.dataframer.ai