Ended 4 years ago
558 participants
5518 submissions

Materials (596 MB)

Download all materials
Archive with data and baseline solution
596 MB


The participants are provided with several datasets:

General Data

  • clients.csv: general info about clients;
  • products.csv: general info about stock items;
  • purchases.csv: clients’ purchase history prior to communication.

Task-Specific Data

  • uplift_train.csv: a subset of clients for training. The column treatment_flg indicates if there was a communication. The column target shows if there was a purchase afterward;
  • uplift_test.csv: test subset to estimate uplift value;
  • uplift_sample_submission.csv: example submission file with predictions. Your submission should be in the same format.

Submissions format

For building the model, participants are provided with a training set of clients uplift_train.csv (see above) with information on performed communication (treatment_flg) and purchasing (target).

For each client in the test sample uplift_test.csv, evaluate the effectiveness of communication (uplift). Submit the result as a CSV file with two columns: client_id and uplift, for example:


Please note that the uplift’s absolute values are not important, as they are used only to rank clients.

Baseline solution is provided.

Making sense of uplift

There are three possible outcomes when we decide whether or not we should send a text to the client:

  • +1: the client made a purchase after we reached out, and probably she or he would not make a purchase if we didn’t reach out
  • 0: regardless of us reaching out, the client would make a purchase (or would not make a purchase, which is equally significant in this case)
  • -1: the client did not make the purchase after reaching out, but if there was no communication, she or he would.

Cookies help us deliver our services. By using our services, you agree to our use of cookies.