An open dataset containing more than 150B tokens of low-toxicity and business-oriented text.
Bradford Levy
bradfordlevy
AI & ML interests
None yet
Recent Activity
liked a dataset about 1 month ago: bigcode/the-stack-v2-train-smol-ids
updated a dataset over 1 year ago: bradfordlevy/BeanCounter
liked a dataset over 1 year ago: bradfordlevy/BeanCounter
Organizations
None yet