Course Schedule
EDUC691: Core Methods in Educational Data Mining
Spring 2020
Professor Ryan Baker
Class 1: Introduction
September 14, 2020
Readings
- Baker, R.S. (2020) Big Data and Education. Ch. 1, V1.
- Baker, R.S.J.d., Yacef, K. (2009) The State of Educational Data Mining in 2009: A Review and Future Visions. Journal of Educational Data Mining, 1 (1), 3-17. [pdf]
- Baker, R., Siemens, G. (2014) Educational data mining and learning analytics. In Sawyer, K. (Ed.) Cambridge Handbook of the Learning Sciences: 2nd Edition, pp. 253-274. [pdf]
Slides: [pptx]
Assignments Due: NONE
Class 2: Regression in Prediction
September 21, 2020
Readings
- Baker, R.S. (2020) Big Data and Education. Ch. 1, V2.
- Pardos, Z.A., Baker, R.S., San Pedro, M.O.C.Z., Gowda, S.M., Gowda, S.M. (2014) Affective states and state tests: Investigating how affect and engagement during the school year predict end of year learning outcomes. Journal of Learning Analytics, 1 (1), 107-128[pdf]
Slides: [pptx]
Assignments Due: NONE
Class 3: Classification in Prediction
September 28, 2020
Readings
- Baker, R.S. (2020) Big Data and Education. Ch. 1, V3, V4.
- Hand, D. J. (2006). Classifier technology and the illusion of progress. Statistical science, 21(1), 1-14.[pdf]
- Zeng, Z., Chaturvedi, S., Bhat, S., & Roth, D. (2019). DiAd: Domain adaptation for learning at scale. In Proceedings of the 9th International Conference on Learning Analytics & Knowledge (pp. 185-194).[pdf]
Slides: [pptx]
Assignments Due: Basic: Classifier
Class 4: Behavior and Affect Detection
October 5, 2020
Readings
- Baker, R.S. (2020) Big Data and Education. Ch.1, V5. Ch. 2, V5. Ch. 3, V1, V2.
- Sao Pedro, M.A., Baker, R.S.J.d., Gobert, J., Montalvo, O. Nakama, A. (2013) Leveraging Machine-Learned Detectors of Systematic Inquiry Behavior to Estimate and Predict Transfer of Inquiry Skill. User Modeling and User-Adapted Interaction, 23 (1), 1-39. [pdf]
- Hutt, S., Grafsgaard, J. F., & D'Mello, S. K. (2019). Time to scale: Generalizable affect detection for tens of thousands of students across an entire school year. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (pp. 1-14).[pdf]
Slides: [pptx]
Assignments Due: Creative: Behavior Detection
Class 5: Diagnostic Metrics
October 12, 2020
Readings
- Baker, R.S. (2020) Big Data and Education. Ch. 2, V1, V2, V3, V4.
- Jeni, L. A., Cohn, J. F., & De La Torre, F. (2013). Facing Imbalanced Data--Recommendations for the Use of Performance Metrics. Proceedings of Affective Computing and Intelligent Interaction (ACII), 245-251.[pdf]
- Knowles, J. E. (2014). Of needles and haystacks: Building an accurate statewide dropout early warning system in Wisconsin. Madison, WI: Wisconsin Department of Public Instruction. [pdf]
- Kitto, K., Shum, S. B., & Gibson, A. (2018). Embracing imperfection in learning analytics. In Proceedings of the 8th International Conference on Learning Analytics and Knowledge (pp. 451-460). ACM.[pdf]
Slides: [pptx]
Assignments Due: Basic: Diagnostic Metrics
Class 6: Feature Engineering and Distillation
October 19, 2020
Readings
- Baker, R.S. (2020) Big Data and Education. Ch. 3, V3, V4, V5.
- Sao Pedro, M., Baker, R.S.J.d., Gobert, J. (2012) Improving Construct Validity Yields Better Models of Systematic Inquiry, Even with Less Information. Proceedings of the 20th International Conference on User Modeling, Adaptation and Personalization (UMAP 2012),249-260. [pdf]
- vlookup Tutorial 1
- vlookup Tutorial 2
- Pivot Table Tutorial 1
- Pivot Table Tutorial 2
Slides: [pptx]
Assignments Due: Creative: Feature Engineering
Class 7: Clustering
October 26, 2020
Readings
- Baker, R.S. (2020) Big Data and Education. Ch. 7, V1, V2, V3, V4, V5.
- Bowers, A.J. (2010) Analyzing the Longitudinal K-12 Grading Histories of Entire Cohorts of Students: Grades, Data Driven Decision Making, Dropping Out and Hierarchical Cluster Analysis. Practical Assessment, Research & Evaluation (PARE), 15(7), 1-18. [pdf]
- Lee, J., Recker, M., Bowers, A.J., Yuan, M. (2016). Hierarchical Cluster Analysis Heatmaps and Pattern Analysis: An Approach for Visualizing Learning Management System Interaction Data. Poster presented at the annual International Conference on Educational Data Mining (EDM). [pdf]
Slides: [pptx]
Assignments Due: Basic: Clustering
Class 8: Association Rule Mining and Sequential Pattern Mining
November 2, 2020
Readings
- Baker, R.S. (2020) Big Data and Education. Ch. 5, V3, V4.
- Merceron, A., Yacef, K. (2008) Interestingness Measures for Association Rules in Educational Data. Proceedings of the 1st International Conference on Educational Data Mining,57-66. [pdf]
- Bazaldua, D.A.L., Baker, R.S., San Pedro, M.O.Z. (2014) Combining Expert and Metric-Based Assessments of Association Rule Interestingness. Proceedings of the 7th International Conference on Educational Data Mining.[pdf]
- Kinnebrew, J. S., Loretz, K. M., & Biswas, G. (2013). A contextualized, differential sequence mining method to derive students' learning behavior patterns. JEDM-Journal of Educational Data Mining, 5(1), 190-219.[pdf]
Slides: [pptx]
Assignments Due: Basic: Sequential Pattern Mining
Class 9: Bayesian Knowledge Tracing
November 9, 2020
Readings
- Baker, R.S. (2020) Big Data and Education. Ch. 4, V1, V2.
- Corbett, A.T., Anderson, J.R. (1995) Knowledge Tracing: Modeling the Acquisition of Procedural Knowledge. User Modeling and User-Adapted Interaction, 4, 253-278. [pdf]
- Baker, R.S.J.d., Corbett, A.T., Aleven, V. (2008) More Accurate Student Modeling Through Contextual Estimation of Slip and Guess Probabilities in Bayesian Knowledge Tracing. Proceedings of the 9th International Conference on Intelligent Tutoring Systems, 406-415.[pdf]
- Sao Pedro, M.A., Gobert, J., Baker, R.S.J.d. (2012) Assessing the Learning and Transfer of Data Collection Inquiry Skills across Physical Science Microworlds. Paper presented at the American Educational Research Association Conference. [pdf]
Slides: [pptx]
Assignments Due: Basic: BKT
Class 10: Performance Factors Analysis and Deep Knowledge Tracing
November 16, 2020
Readings
- Baker, R.S. (2020) Big Data and Education. Ch. 4, V3, V6.
- Pavlik, P.I., Cen, H., Koedinger, K.R. (2009) Performance Factors Analysis -- A New Alternative to Knowledge Tracing. Proceedings of AIED2009.[pdf]
- Pavlik, P.I., Cen, H., Koedinger, K.R. (2009) Learning Factors Transfer Analysis: Using Learning Curve Analysis to Automatically Generate Domain Models. Proceedings of the 2nd International Conference on Educational Data Mining.[pdf]
- Khajah, M., Lindsey, R. V., & Mozer, M. C. (2016) How Deep is Knowledge Tracing? Proceedings of the International Conference on Educational Data Mining. [pdf]
- Yeung, C. K., & Yeung, D. Y. (2018). Addressing two problems in deep knowledge tracing via prediction-consistent regularization. In Proceedings of the Fifth Annual ACM Conference on Learning at Scale (p. 5-14). ACM.[pdf]
Slides: [pptx]
Assignments Due: Basic: PFA
Class 11: Knowledge Structure Discovery
November 23, 2020
Readings
- Baker, R.S. (2020) Big Data and Education. Ch. 7, V6, V7.
- Desmarais, M.C., Meshkinfam, P., Gagnon, M. (2006) Learned Student Models with Item to Item Knowledge Structures. User Modeling and User-Adapted Interaction, 16, 5, 403-434.[pdf]
- Desmarais, M. C., & Naceur, R. (2013). A matrix factorization method for mapping items to skills and for enhancing expert-based Q-Matrices. Proceedings of the International Conference on Artificial Intelligence in Education, 441-450. [pdf]
- Koedinger, K.R., McLaughlin, E.A., Stamper, J.C. (2012) Automated Student Modeling Improvement. Proceedings of the 5th International Conference on Educational Data Mining, 17-24.[pdf]
- Chen, P., Lu, Y., Zheng, V. W., Chen, X., & Yang, B. (2018). KnowEdu: a system to construct knowledge graph for education. IEEE Access, 6, 31553-31563.[pdf]
Slides: [pptx]
Assignments Due: Creative: Knowledge Structure
Class 12: Correlation Mining and Causal Mining
November 30, 2020
Readings
- Baker, R.S. (2020) Big Data and Education. Ch. 5, V1, V2.
- Rai, D., Beck, J.E. (2011) Exploring user data from a game-like math tutor: a case study in causal modeling. Proceedings of the 4th International Conference on Educational Data Mining, 307-313.[pdf]
- Rau, M. A., Scheines, R. (2012) Searching for Variables and Models to Investigate Mediators of Learning from Multiple Representations. Proceedings of the 5th International Conference on Educational Data Mining, 110-117. [pdf]
- Slater, S., Ocumpaugh, J., Baker, R., Scupelli, P., Inventado, P.S., Heffernan, N. (2016) Semantic Features of Math Problems: Relationships to Student Learning and Engagement. Proceedings of the 9th International Conference on Educational Data Mining, 223-230.[pdf]
Assignments Due: Basic: Correlation Mining
Class 13: Network Analysis
December 7, 2020
Readings
- Baker, R.S. (2020) Big Data and Education. Ch. 5, V5, V6. Ch. 8, V2.
- Dawson, S. (2008) A study of the relationship between student social networks and sense of community. Educational Technology & Society, 11(3), 224-238.[pdf]
- Gasevic, D., Zouaq, A., & Janzen, R. (2013). "Choose Your Classmates, Your GPA Is at Stake!": The Association of Cross-Class Social Ties and Academic Performance. American Behavioral Scientist [pdf]
- Barany, A., & Foster, A. (2020). Mapping Identity Exploration of Science Careers using Epistemic Networks. In Society for Information Technology & Teacher Education International Conference (pp. 1610-1619). Association for the Advancement of Computing in Education (AACE).[pdf]
Slides: [pptx]
Assignments Due: Basic: SNA
Class 14: Visualization
December 10, 2020
Special Day of Week Due to GSE Calendar
Readings
- Baker, R.S. (2020) Big Data and Education. Ch. 6, V1, V2, V3, V4.
- Martinez, R., Kay, J., Yacef, K. (2011) Visualisations for longitudinal participation, contribution and progress of a collaborative task at the tabletop. International Conference on Computer Supported Collaborative Learning, CSCL 2011, 25-32.[pdf]
- Milliron, M. D., Malcolm, L., & Kil, D. (2014). Insight and Action Analytics: Three Case Studies to Consider. Research & Practice in Assessment, 9, 70-89. [pdf]
- Vieira, C., Parsons, P., & Byrd, V. (2018). Visual learning analytics of educational data: A systematic literature review and research agenda. Computers & Education, 122, 119-135.[pdf]
Assignments Due: NONE
Class 15: Final Presentations
December 14, 2020
Longer Class Session
Readings
- Baker, R.S. (2020) Big Data and Education. Ch. 8, V5.
Assignments Due: Creative: Final Presentation