Data Quality and Data Programming

"Data cleaning and repairing account for about 60% of the work of data scientists."

Christian Kaestner

Required reading:

1
Data Quality and Data Programming "Data cleaning and repairing account for about 60% of the work of data scientists." Christian Kaestner Required reading: 🗎 Schelter, S., Lange, D., Schmidt, P., Celikel, M., Biessmann, F. and Grafberger, A., 2018. Automating large-scale data quality verification. Proceedings of the VLDB Endowment, 11(12), pp.1781-1794. 🗎 Nick Hynes, D. Sculley, Michael Terry. "The Data Linter: Lightweight Automated Sanity Checking for ML Data Sets." NIPS Workshop on ML Systems (2017)