polodealvarado
/

polyencoder

Zero-Shot Classification

text-classification

Model card Files Files and versions

polyencoder / README.md

polodealvarado's picture

Upload folder using huggingface_hub

445afd8 verified 13 days ago

|

history blame contribute delete

1.5 kB

	---
	language:
	- en
	license: mit
	library_name: transformers
	pipeline_tag: zero-shot-classification
	tags:
	- zero-shot
	- multi-label
	- text-classification
	- pytorch
	metrics:
	- precision
	- recall
	- f1
	base_model: bert-base-uncased
	datasets:
	- polodealvarado/zeroshot-classification
	---

	# Zero-Shot Text Classification — polyencoder

	Learnable poly-codes with label-conditioned cross-attention.

	This model encodes texts and candidate labels into a shared embedding space using BERT,
	enabling classification into arbitrary categories without retraining for new labels.

	## Training Details

	\| Parameter \| Value \|
	\|-----------\|-------\|
	\| Base model \| `bert-base-uncased` \|
	\| Model variant \| `polyencoder` \|
	\| Training steps \| 1000 \|
	\| Batch size \| 2 \|
	\| Learning rate \| 2e-05 \|
	\| Trainable params \| 109,494,528 \|
	\| Training time \| 359.7s \|

	## Dataset

	Trained on [polodealvarado/zeroshot-classification](https://huggingface.co/datasets/polodealvarado/zeroshot-classification).

	## Evaluation Results

	\| Metric \| Score \|
	\|--------\|-------\|
	\| Precision \| 0.9463 \|
	\| Recall \| 0.9677 \|
	\| F1 Score \| 0.9569 \|

	## Usage

	```python
	from models.polyencoder import PolyEncoderModel

	model = PolyEncoderModel.from_pretrained("polodealvarado/polyencoder")

	predictions = model.predict(
	texts=["The stock market crashed yesterday."],
	labels=[["Finance", "Sports", "Biology", "Economy"]],
	)
	print(predictions)
	# [{"text": "...", "scores": {"Finance": 0.98, "Economy": 0.85, ...}}]
	```