Documenting
Documenting is the process of tracking changes, additions, deletions and errors involved in your data cleaning effort.
Having a record of how a data set evolved does three very important things.
-
recover data-cleaning errors
-
inform other users of changes you've made
-
determine the quality of the data to be used in analysis
Documenting data-cleaning makes it possible to:
-
be transparent about your process
-
keep team members on the same page
-
demonstrate to project stakeholders that you are accountable.
A data analyst uses a changelog to access the information needed. A changelog is a file that contains a chronological list of modifications made to a project.