File size: 1,918 Bytes
35f16e5
 
 
 
 
 
 
ce72591
35f16e5
 
ce72591
 
5940348
ce72591
2356348
ce72591
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
30a5cb2
c63c241
 
 
 
f3dcf91
7b3b7e8
 
ce72591
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
---
title: README
emoji: 🚀
colorFrom: green
colorTo: blue
sdk: static
pinned: false
short_description: Description of HF Organization
---



![ChatGPT Image May 15, 2026, 11_56_18 PM (1)](https://cdn-uploads.huggingface.co/production/uploads/64e8ea3892d9db9a93580fe3/bcKSfhanoCR3l0fzNJSoH.jpeg)

 HFStack is an open-source organization focused on building reproducible ML infrastructure around the Hugging Face stack.

---

## Focus Areas

### Datasets & Storage

Building reproducible workflows around Hugging Face Datasets and HF Buckets.

### Trackio & Observability

Experiment tracking, artifact lineage, and reproducible evaluation pipelines using Trackio.

### Benchmarking & Runtime Systems

Inference benchmarking, optimization workflows, and runtime evaluation tooling.

### Orchestration & Integrations

Composable integrations with tools like Dagster and ecosystem-native ML workflows.

---

## Philosophy

HFStack focuses on the systems surrounding modern ML:

* reproducibility
* interoperability
* observability
* infrastructure simplicity

The goal is to make Hugging Face workflows easier to build and operationalize.

## Projects & Integrations

- `dagster-hf-datasets`: [Dagster-HF-Datasets](https://github.com/dagster-io/community-integrations/tree/main/libraries/dagster-hf-datasets) integrates Hugging Face datasets with Dagster for building reproducible, observable data pipelines.
Load datasets directly as Dagster assets, apply transformations, and publish results back to the Hub.

- `Open-Source AI Cookbook with Transformers and Optuna`: Contributed a [recipe](https://huggingface.co/learn/cookbook/en/optuna_hpo_with_transformers#hyperparameter-optimization-with-optuna-and-transformers) which showcases how  best hyperparameters to fine-tune
a lightweight BERT model for text classification on a subset of the IMDB dataset.

---

Contributions and ecosystem collaborations are welcome.