GivingTuesday
/

schedule_o

Model card Files Files and versions

schedule_o / README.md

hassaanulhaq01's picture

Update README.md (#1)

54c4055 verified 2 months ago

|

history blame contribute delete

1.33 kB

	---
	Developed by: GivingTuesday Data Commons
	license: apache-2.0
	Model Type: Regex Classifier
	Training Data: 3.6k examples from the 990 database
	Accuracy: Weighted F1 score of 0.948 on a test dataset of 500 examples.
	language:
	- en
	---

	## Notebooks

	- [Schedule O classifier notebook](https://huggingface.co/GivingTuesday/schedule_o/blob/main/notebooks/schedule_o_classifier_notebook.ipynb)
	# Details

	This model (Refer to [Notebooks](https://huggingface.co/GivingTuesday/schedule_o/blob/main/notebooks/schedule_o_classifier_notebook.ipynb) section of the files in the current repository) classifies open-ended Schedule O text from IRS Forms 990 and 990-EZ into the specific part of the return that the filer is referencing. Given the filer’s narrative description of which section they are providing supplemental information for, the model returns a single standardized label from the following set: `I EZ`, `II EZ`, `III EZ`, `V EZ`, `III`, `V`, `VI`, `VII`, `IX`, `XI`, `XII`, or `Unknown`. This enables consistent tagging, aggregation, and analysis of Schedule O content across both Form 990 and Form 990-EZ filings.

	## Author
	The model was developed by: Zilun Lin - GivingTuesday Data Commons

	Note: In implementation, be sure to adjust any source and target table references to match your specific environment.