Cleaning Data Using Rigorous Statistical Models
There are formal models and generalized software based on rigorous theory. I introduced the generalized software in two short courses at the University of London and later lectured on the methods when I was back in Cambridge at the Isaac Newton Institute.
This is my sixth keynote at a workshop at a major CS conference.
The following provides an overview of some of the data-cleanup methods that generalize methods for analysis. A few algorithms were speeded up for factors of 100-1000+.