Commit 65fda89 · 1 Parent(s): 2335271
Create README.md
The idea behind this dataset is to support gender classification of text written by present-day teenagers. Their language use differs considerably from the standard language and from that of other generations. The dataset is built from text messages and social media messages, so that gender-specific texts can be distinguished on the basis of stereotypical words. The notebook attached in the repository demonstrates how the dataset was constructed in its current format and also lists some typical words for both genders, which can serve as inspiration.