| --- |
| license: other |
| license_name: tabpfn-2.6-license-v1.0 |
| license_link: LICENSE |
| extra_gated_fields: |
| Organization: text |
| Role: |
| type: select |
| options: |
| - Field practitioners |
| - Researcher |
| - Student |
| Use-case: text |
| May we contact you about future updates?: checkbox |
| extra_gated_button_content: Agree to license terms and send request to access repo. |
| extra_gated_description: "Model weights released under\_`tabpfn-2.6-license-v1.0`. This license is designed to be permissive for research and internal evaluation. It *explicitly allows* testing, evaluation, and internal benchmarking, so an organization can download the model and run preliminary assessments on its own datasets.\nThe key restriction is that the model, its derivatives, and its outputs cannot be used for any commercial or production purpose. This includes, but is not limited to, revenue-generating products, competitive benchmarking for procurement, client deliverables, or using the model’s results for internal commercial decision-making.\nFor all production use cases, we offer a *Commercial Enterprise License*. This provides access to our proprietary high-speed inference engine, dedicated support, integration tooling, and other internal models. Please contact us at sales@priorlabs.ai for commercial licensing inquiries." |
| pipeline_tag: tabular-classification |
| tags: |
| - chemistry |
| - biology |
| - finance |
| - legal |
| - climate |
| - medical |
| --- |
| ### Model Overview |
| TabPFN-2.6 is a transformer-based foundation model that uses in-context-learning to solve tabular prediction problems in a forward pass. |
| Inference code can be found at [https://github.com/PriorLabs/tabPFN](https://github.com/PriorLabs/tabPFN). |
|
|
| ### Getting started |
| First, install the inference package: |
| ```{bash} |
| pip install tabpfn |
| ``` |
|
|
| Fitting a classifier and predicting looks like this: |
|
|
| ```{python} |
| from sklearn.datasets import load_breast_cancer |
| from sklearn.model_selection import train_test_split |
| from tabpfn import TabPFNClassifier |
| |
| # Load data |
| X, y = load_breast_cancer(return_X_y=True) |
| X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.5, random_state=42) |
| |
| # Initialize a classifier |
| clf = TabPFNClassifier() |
| clf.fit(X_train, y_train) |
| |
| |
| # Predict probabilities |
| prediction_probabilities = clf.predict_proba(X_test) |
| # Predict labels |
| predictions = clf.predict(X_test) |
| print("Accuracy", accuracy_score(y_test, predictions)) |
| ``` |
|
|
| For more examples (e.g. how to train a regressor), see the github repo: [https://github.com/PriorLabs/tabPFN](https://github.com/PriorLabs/tabPFN)! |
|
|
| ### Developers & Affiliations |
| Developed by Prior Labs. |
|
|
| ### Intended Use |
| Regression and classification tasks with ≤50 000 samples and ≤2000 features in structured tabular format. |
|
|
| ### Not Intended Use |
| - Not suitable for unstructured data (text, images); use API version for textual features. |
| - Not tested for >50 000 samples or > 2000 features. |
|
|
| ### Model Architecture |
| Transformer with TabPFNv2-like alternating attention with 24 layers. |
|
|
| ### Training Data and Priors |
| TabPFN-2.6 is trained purely on synthetic tabular tasks. |
|
|
| ### Performance Benchmarks |
| Evaluated on proprietary benchmark collection, TabArena, and RealCause (for a causal version), in each of which it yields new SOTA results. |
|
|
|
|
| ### Ethical Considerations |
| Having been trained purely on synthetic datasets, TabPFN-2.6 is free from dataset leakage from the pretraining stage. |
| However, like for any other tabular prediction method, when applied to high-risk use cases, users should ensure that the labelled data is free of biases. |
|
|
| ### Limitations |
| Performance can degrade when applied to >50000 data points and/or 2000 features. |
|
|
| ### Licensing |
| Model weights released under tabpfn-2.6-license-v1.0. |
|
|
| The license is designed to be permissive for research and limited internal evaluation. It *explicitly allows* testing, evaluation, and internal benchmarking, so an organization can download the model and run preliminary assessments on its own datasets. |
| The key restriction is that the model, its derivatives, and its outputs cannot be used for any commercial or production purpose. This includes, but is not limited to, revenue-generating products, competitive benchmarking for procurement, client deliverables, or using the model’s results for internal commercial decision-making. |
| For all production use cases, we offer a *Commercial Enterprise License*. This provides access to our proprietary high-speed inference engine, dedicated support, integration tooling, and other internal models. |
| Please contact us at sales@priorlabs.ai for commercial licensing inquiries. |
|
|
| ### Version |
| v1.0: initial release. |
|
|
| ### Citation |
| ``` |
| @misc{TabPFN-2.5,\ |
| title={TabPFN-2.5},\ |
| author={Léo Grinsztajn and Klemens Flöge and Oscar Key and Felix Birkel and Brendan Roof and Phil Jund and Benjamin Jäger and Adrian Hayler and Dominik Safaric and Simone Alessi, Felix Jablonski and Mihir Manium and Rosen Yu and Anurag Garg and Jake Robertson and Shi Bin (Liam) Hoo and Vladyslav Moroshan and Magnus Bühler and Lennart Purucker and Clara Cornu and Lilly Charlotte Wehrhahn and Alessandro Bonetto and Sauraj Gambhir and Noah Hollmann and Frank Hutter},\ |
| year={2025}\ |
| } |
| ``` |