Spaces:

nowhuggingface
/

LLM1-Fine-tuning-and-deployment

Sleeping

Add my folder

d733479 2 months ago

1.55 kB

	---
	title: Wilcoxon
	emoji: 🤗
	colorFrom: blue
	colorTo: green
	sdk: gradio
	sdk_version: 3.0.2
	app_file: app.py
	pinned: false
	tags:
	- evaluate
	- comparison
	description: >-
	Wilcoxon's test is a signed-rank test for comparing paired samples.
	---


	# Comparison Card for Wilcoxon

	## Comparison description

	Wilcoxon's test is a non-parametric signed-rank test that tests whether the distribution of the differences is symmetric about zero. It can be used to compare the predictions of two models.

	## How to use

	The Wilcoxon comparison is used to analyze paired ordinal data.

	## Inputs

	Its arguments are:

	`predictions1`: a list of predictions from the first model.

	`predictions2`: a list of predictions from the second model.

	## Output values

	The Wilcoxon comparison outputs two things:

	`stat`: The Wilcoxon statistic.

	`p`: The p value.

	## Examples

	Example comparison:

	```python
	wilcoxon = evaluate.load("wilcoxon")
	results = wilcoxon.compute(predictions1=[-7, 123.45, 43, 4.91, 5], predictions2=[1337.12, -9.74, 1, 2, 3.21])
	print(results)
	{'stat': 5.0, 'p': 0.625}
	```

	## Limitations and bias

	The Wilcoxon test is a non-parametric test, so it has relatively few assumptions (basically only that the observations are independent). It should be used to analyze paired ordinal data only.

	## Citations

	```bibtex
	@incollection{wilcoxon1992individual,
	title={Individual comparisons by ranking methods},
	author={Wilcoxon, Frank},
	booktitle={Breakthroughs in statistics},
	pages={196--202},
	year={1992},
	publisher={Springer}
	}
	```