Ended 4 years ago
20 participants
73 submissions

Materials (3 MB)

train.csv (1 MB)
test.csv (1 MB)
submission_template.csv (1 MB)

Information on the Dataset for Subtask B

The dataset was collected and annotated at the Natural Language and Text Processing Laboratory of the Center for Computing Research, Instituto Politécnico Nacional, Mexico, by PhD candidate Maaz Amjad, a native Urdu speaker. Maaz previously obtained his Master's degree from the Moscow Institute of Physics and Technology (MIPT).

Contacts about the dataset: 
During the competition, all questions about the dataset collection procedure should be addressed to Maaz at maazamjad@phystech.edu. (A paper with details on the dataset collection and preprocessing procedures, along with other dataset statistics, will be published at FIRE 2021.)

The training-test split was performed by a co-advisor of Maaz's PhD thesis, Alisa Zhila (PhD). Please address any questions about the data split to alisa.zhila@gmail.com.
