train.csv | 1 MB
test.csv | 1 MB
submission_template.csv | 1 MB
The dataset was collected and annotated at the Natural Language and Text Processing Laboratory of the Center for Computing Research, Instituto Politécnico Nacional, Mexico, by PhD candidate Maaz Amjad, who is a native Urdu speaker. Previously, Maaz obtained his Master's degree from the Moscow Institute of Physics and Technology (MIPT).
Contacts about the dataset:
During the competition, all questions about the dataset collection procedure should be addressed to Maaz at maazamjad@phystech.edu. (A paper detailing the dataset collection and preprocessing procedures, along with other dataset statistics, will be published at FIRE 2021.)
The training-test split was performed by a co-advisor of Maaz’s PhD thesis, Alisa Zhila (PhD). Please address any questions about the data split to alisa.zhila@gmail.com.