--- title: README emoji: 🚀 colorFrom: green colorTo: blue sdk: static pinned: false short_description: Description of HF Organization --- ![ChatGPT Image May 15, 2026, 11_56_18 PM (1)](https://cdn-uploads.huggingface.co/production/uploads/64e8ea3892d9db9a93580fe3/bcKSfhanoCR3l0fzNJSoH.jpeg) HFStack is an open-source organization focused on building reproducible ML infrastructure around the Hugging Face stack. --- ## Focus Areas ### Datasets & Storage Building reproducible workflows around Hugging Face Datasets and HF Buckets. ### Trackio & Observability Experiment tracking, artifact lineage, and reproducible evaluation pipelines using Trackio. ### Benchmarking & Runtime Systems Inference benchmarking, optimization workflows, and runtime evaluation tooling. ### Orchestration & Integrations Composable integrations with tools like Dagster and ecosystem-native ML workflows. --- ## Philosophy HFStack focuses on the systems surrounding modern ML: * reproducibility * interoperability * observability * infrastructure simplicity The goal is to make Hugging Face workflows easier to build and operationalize. ## Projects & Integrations - `dagster-hf-datasets`: [Dagster-HF-Datasets](https://github.com/dagster-io/community-integrations/tree/main/libraries/dagster-hf-datasets) integrates Hugging Face datasets with Dagster for building reproducible, observable data pipelines. Load datasets directly as Dagster assets, apply transformations, and publish results back to the Hub. - `Open-Source AI Cookbook with Transformers and Optuna`: Contributed a [recipe](https://huggingface.co/learn/cookbook/en/optuna_hpo_with_transformers#hyperparameter-optimization-with-optuna-and-transformers) which showcases how best hyperparameters to fine-tune a lightweight BERT model for text classification on a subset of the IMDB dataset. --- Contributions and ecosystem collaborations are welcome.