--- pretty_name: HSClassify Micro Training Dataset license: pddl language: - en - th - vi - zh task_categories: - text-classification task_ids: - multi-class-classification size_categories: - 10K - Declared source chain in upstream metadata: - WCO HS nomenclature documentation - UN Comtrade data extraction API - Upstream data license: ODC Public Domain Dedication and License (PDDL) v1.0 Project-added synthetic texts and normalized labels are released under this project's MIT license. ## Limitations - Language balance is intentionally skewed toward English in the current snapshot. - Synthetic text patterns may not cover all commercial phrasing edge cases. - This dataset is for research/prototyping and is not legal customs advice.