Update README.md

6b03c75 verified 9 days ago

5.21 kB

	---
	license: other
	license_name: tabpfn-2.6-license-v1.0
	license_link: LICENSE
	extra_gated_fields:
	Organization: text
	Role:
	type: select
	options:
	- Field practitioners
	- Researcher
	- Student
	Use-case: text
	May we contact you about future updates?: checkbox
	extra_gated_button_content: Agree to license terms and send request to access repo.
	extra_gated_description: "Model weights released under\_`tabpfn-2.6-license-v1.0`. This license is designed to be permissive for research and internal evaluation. It explicitly allows testing, evaluation, and internal benchmarking, so an organization can download the model and run preliminary assessments on its own datasets.\nThe key restriction is that the model, its derivatives, and its outputs cannot be used for any commercial or production purpose. This includes, but is not limited to, revenue-generating products, competitive benchmarking for procurement, client deliverables, or using the model’s results for internal commercial decision-making.\nFor all production use cases, we offer a Commercial Enterprise License. This provides access to our proprietary high-speed inference engine, dedicated support, integration tooling, and other internal models. Please contact us at sales@priorlabs.ai for commercial licensing inquiries."
	pipeline_tag: tabular-classification
	tags:
	- chemistry
	- biology
	- finance
	- legal
	- climate
	- medical
	---
	### Model Overview
	TabPFN-2.6 is a transformer-based foundation model that uses in-context-learning to solve tabular prediction problems in a forward pass.
	Inference code can be found at [https://github.com/PriorLabs/tabPFN](https://github.com/PriorLabs/tabPFN).

	### Getting started
	First, install the inference package:
	```{bash}
	pip install tabpfn
	```

	Fitting a classifier and predicting looks like this:

	```{python}
	from sklearn.datasets import load_breast_cancer
	from sklearn.model_selection import train_test_split
	from tabpfn import TabPFNClassifier

	# Load data
	X, y = load_breast_cancer(return_X_y=True)
	X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.5, random_state=42)

	# Initialize a classifier
	clf = TabPFNClassifier()
	clf.fit(X_train, y_train)


	# Predict probabilities
	prediction_probabilities = clf.predict_proba(X_test)
	# Predict labels
	predictions = clf.predict(X_test)
	print("Accuracy", accuracy_score(y_test, predictions))
	```

	For more examples (e.g. how to train a regressor), see the github repo: [https://github.com/PriorLabs/tabPFN](https://github.com/PriorLabs/tabPFN)!

	### Developers & Affiliations
	Developed by Prior Labs.

	### Intended Use
	Regression and classification tasks with ≤50 000 samples and ≤2000 features in structured tabular format.

	### Not Intended Use
	- Not suitable for unstructured data (text, images); use API version for textual features.
	- Not tested for >50 000 samples or > 2000 features.

	### Model Architecture
	Transformer with TabPFNv2-like alternating attention with 24 layers.

	### Training Data and Priors
	TabPFN-2.6 is trained purely on synthetic tabular tasks.

	### Performance Benchmarks
	Evaluated on proprietary benchmark collection, TabArena, and RealCause (for a causal version), in each of which it yields new SOTA results.


	### Ethical Considerations
	Having been trained purely on synthetic datasets, TabPFN-2.6 is free from dataset leakage from the pretraining stage.
	However, like for any other tabular prediction method, when applied to high-risk use cases, users should ensure that the labelled data is free of biases.

	### Limitations
	Performance can degrade when applied to >50000 data points and/or 2000 features.

	### Licensing
	Model weights released under tabpfn-2.6-license-v1.0.

	The license is designed to be permissive for research and limited internal evaluation. It explicitly allows testing, evaluation, and internal benchmarking, so an organization can download the model and run preliminary assessments on its own datasets.
	The key restriction is that the model, its derivatives, and its outputs cannot be used for any commercial or production purpose. This includes, but is not limited to, revenue-generating products, competitive benchmarking for procurement, client deliverables, or using the model’s results for internal commercial decision-making.
	For all production use cases, we offer a Commercial Enterprise License. This provides access to our proprietary high-speed inference engine, dedicated support, integration tooling, and other internal models.
	Please contact us at sales@priorlabs.ai for commercial licensing inquiries.

	### Version
	v1.0: initial release.

	### Citation
	```
	@misc{TabPFN-2.5,\
	title={TabPFN-2.5},\
	author={Léo Grinsztajn and Klemens Flöge and Oscar Key and Felix Birkel and Brendan Roof and Phil Jund and Benjamin Jäger and Adrian Hayler and Dominik Safaric and Simone Alessi, Felix Jablonski and Mihir Manium and Rosen Yu and Anurag Garg and Jake Robertson and Shi Bin (Liam) Hoo and Vladyslav Moroshan and Magnus Bühler and Lennart Purucker and Clara Cornu and Lilly Charlotte Wehrhahn and Alessandro Bonetto and Sauraj Gambhir and Noah Hollmann and Frank Hutter},\
	year={2025}\
	}
	```