Cleaning Data Using Rigorous Statistical Models

There are formal models and generalized software based on rigorous theory. I introduced the generalized software in two short courses at the University of London and later lectured on the methods when I was back in Cambridge at the Isaac Newton Institute.

This is my sixth keynote at a workshop at a major CS conference.

https://sites.google.com/site/dinaworkshop2015/invited-speakers

https://www.garudax.id/pulse/all-data-needed-cleaned-bill-winkler/

https://www.garudax.id/pulse/data-science-you-need-new-theory-whole-series-prior-software-winkler/

https://www.garudax.id/pulse/live-die-data-preparation-bill-winkler/

https://www.garudax.id/pulse/cleaning-data-can-80-95-analysis-bill-winkler/

The following provides an overview of some of the data-cleanup methods that generalize methods for analysis. A few algorithms were speeded up for factors of 100-1000+.

https://www.census.gov/library/working-papers/2018/adrm/rrs2018-05.html

To view or add a comment, sign in

More articles by Bill Winkler

Explore content categories