You will be working with log data (also known as process data or event stream data) for this competition. Each row of the data represents a single action a student takes within MATHia (e.g., requesting a hint, entering an answer, or clicking to advance to the next page). The following variables are included for each student action:
- Student ID
- School ID
- Class ID
- Session ID
- Time stamp
- Workspace name
- Problem name
- Step name
- Action taken (attempt, hint request, etc.)
- Input (i.e., the answer a student gave)
- Outcome (whether the answer was correct)
- Workspace progress (whether the student completed the workspace)
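As a rough illustration of how one row per student action might be represented in code, here is a minimal Python sketch. The column names (`student_id`, `student_input`, and so on), the CSV layout, and the sample values are all assumptions made up for this example; the actual competition files may use different headers, delimiters, and value encodings.

```python
import csv
import io
from dataclasses import dataclass


# Hypothetical field names mirroring the variables listed above;
# the real headers in the competition files may differ.
@dataclass
class Action:
    student_id: str
    school_id: str
    class_id: str
    session_id: str
    timestamp: str
    workspace: str
    problem: str
    step: str
    action: str
    student_input: str
    outcome: str
    workspace_progress: str


def read_actions(handle):
    """Yield one Action per CSV row, assuming the header matches the field names."""
    for row in csv.DictReader(handle):
        yield Action(**row)


# Two invented rows, for illustration only (not real MATHia data).
SAMPLE = (
    "student_id,school_id,class_id,session_id,timestamp,workspace,"
    "problem,step,action,student_input,outcome,workspace_progress\n"
    "s1,sch1,c1,sess1,2020-01-01 09:00:00,ws1,p1,step1,hint request,,hint,incomplete\n"
    "s1,sch1,c1,sess1,2020-01-01 09:00:30,ws1,p1,step1,attempt,42,correct,incomplete\n"
)

actions = list(read_actions(io.StringIO(SAMPLE)))
```

With a real file, `read_actions` would be given an open file handle instead of the in-memory sample.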
Each student has about 30,000 rows of data for the workspace used in this competition. We anticipate that the final dataset provided to participants will exceed 1,000,000 rows. Teams will receive training and validation datasets, followed by a test dataset at the end of the competition.
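Because the data is expected to run to more than a million rows, it can help to process it one row at a time rather than loading everything into memory. The sketch below counts rows per student while streaming through a CSV; the `student_id` column name and the tiny sample are assumptions for illustration, not the real file format.

```python
import csv
import io
from collections import Counter


def rows_per_student(handle, id_column="student_id"):
    """Count rows per student while reading the file one row at a time."""
    counts = Counter()
    for row in csv.DictReader(handle):
        counts[row[id_column]] += 1
    return counts


# Invented three-row sample; the column names are hypothetical.
sample = io.StringIO(
    "student_id,action\n"
    "s1,attempt\n"
    "s1,hint request\n"
    "s2,attempt\n"
)
counts = rows_per_student(sample)
```

The same function works on a file opened with `open(path, newline="")`, and memory use grows with the number of distinct students rather than the number of rows.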