English
schedule_o / README.md
hassaanulhaq01's picture
Update README.md (#1)
54c4055 verified
metadata
Developed by: GivingTuesday Data Commons
license: apache-2.0
Model Type: Regex Classifier
Training Data: 3.6k examples from the 990 database
Accuracy: Weighted F1 score of 0.948 on a test dataset of 500 examples.
language:
  - en

Notebooks

Details

This model (Refer to Notebooks section of the files in the current repository) classifies open-ended Schedule O text from IRS Forms 990 and 990-EZ into the specific part of the return that the filer is referencing. Given the filer’s narrative description of which section they are providing supplemental information for, the model returns a single standardized label from the following set: I EZ, II EZ, III EZ, V EZ, III, V, VI, VII, IX, XI, XII, or Unknown. This enables consistent tagging, aggregation, and analysis of Schedule O content across both Form 990 and Form 990-EZ filings.

Author

The model was developed by: Zilun Lin - GivingTuesday Data Commons

Note: In implementation, be sure to adjust any source and target table references to match your specific environment.