Spaces:

dataframer
/

README

Configuration error

App Files Files Community

aimonp commited on Mar 14

Commit

89d9cd4

verified ·

1 Parent(s): c8fe18f

Update README.md

Browse files

Files changed (1) hide show

README.md +39 -47

README.md CHANGED Viewed

@@ -1,81 +1,73 @@
 ---
 tags:
-- synthetic-data
-- anonymization
-- data-augmentation
-- data-simulation
 - evaluation
 - testing
 - privacy
 - enterprise
 - regulated-industries
-- llm
 pretty_name: DataFramer
 license: other
 ---
 # DataFramer
-**Your AI teams are ready. Their data isn't.**
-DataFramer helps teams **generate, anonymize, augment, and simulate** realistic datasets for **testing, evaluations, and fine-tuning** of ML and AI systems—without relying on sensitive production data.
-It is built for **complex, enterprise data**: multi-file, multi-format, structured, unstructured, nested, and long-context workflows.
-Learn more: https://www.dataframer.ai
-## What DataFramer is for
-DataFramer is designed for teams that need to:
-- create **synthetic datasets** from seed examples or from scratch
-- **anonymize sensitive data** while preserving structure and task-relevant fidelity
-- **augment and transform** existing datasets for broader coverage
-- **simulate edge cases, rare events, and scenarios** absent from historical data
-- build **evaluation datasets** that better reflect real-world model behavior
 ## Best-fit use cases
-**LLM and AI evals**
-Generate eval sets with better coverage across normal cases, edge cases, rare events, and boundary conditions.
-**Privacy-safe experimentation**
-Work with compliant alternatives when production data is restricted by privacy, security, or governance requirements.
-**Testing complex workflows**
-Create realistic data for claims, fraud, underwriting, patient workflows, document pipelines, and other enterprise systems.
-**Model training and fine-tuning**
-Expand sparse datasets and improve diversity while preserving the structure, constraints, and relationships models depend on.
-## Built for complex enterprise data
-DataFramer supports workflows involving:
-- long-form documents and PDFs
-- JSON, XML, CSV, Parquet, and other structured formats
-- nested and hierarchical records
-- multi-file and high-token samples
-- conversational, document, and domain-specific datasets
-## Who uses it
-DataFramer is especially relevant for teams in **regulated and data-sensitive industries**, including:
-- financial services
-- insurance
-- healthcare
-- other enterprise environments with strict privacy and governance requirements
-## Why teams use DataFramer
-- preserve **structure and constraints**, not just surface realism
-- generate data for **testing, evals, and fine-tuning**
-- improve **edge-case coverage**
-- reduce dependence on restricted production data
-- work with **complex, real-world data formats**
 ## Learn more
-Website: https://www.dataframer.ai
-Docs: https://docs.dataframer.ai

 ---
 tags:
 - evaluation
 - testing
 - privacy
+- llm
 - enterprise
+- anonymization
+- data-augmentation
+- simulation
 - regulated-industries
+- insurance
 pretty_name: DataFramer
 license: other
 ---
 # DataFramer
+**Realistic, privacy-safe data for AI testing, evals, and fine-tuning.**
+DataFramer helps AI teams create realistic, diverse datasets for **testing, evaluations, and fine-tuning** without exposing sensitive production data.
+Built for **complex enterprise workflows**, DataFramer supports document-heavy, multi-file, structured, and unstructured data so teams can validate AI systems against real-world variability, not just clean demo cases.
+## Why teams use DataFramer
+AI projects often stall because the data needed for testing and evaluation is:
+- too sensitive to use directly
+- too limited to cover edge cases
+- too messy to recreate by hand
+- too unrealistic when manually mocked
+DataFramer helps teams generate better test and eval data so models can be assessed against the kinds of variation they will actually face in production.
 ## Best-fit use cases
+- **LLM and AI evaluations**
+  Build eval datasets with stronger coverage across common cases, rare cases, and edge cases.
+- **Privacy-safe testing**
+  Work with realistic data for testing and iteration without exposing sensitive production records.
+- **Complex workflow validation**
+  Test systems that depend on long documents, multi-file inputs, nested structures, and business-specific constraints.
+- **Fine-tuning and dataset expansion**
+  Expand sparse datasets with more realistic variation while preserving the patterns your models depend on.
+## Built for enterprise data
+DataFramer is designed for workflows involving:
+- long-form documents and PDFs
+- structured and semi-structured data
+- nested and hierarchical records
+- multi-file samples
+- high-variability real-world business inputs
+## Who it is for
+DataFramer is especially useful for teams in **regulated and data-sensitive environments**, including:
+- insurance
+- financial services
+- healthcare
+- enterprise AI teams working with restricted or hard-to-access data
 ## Learn more
+See product examples, use cases, and request access at:
+**https://www.dataframer.ai**