R BLOG POSTS

These posts are general blog posts on R and they are also published on R bloggers. Consequently, the language is in English.


Generating codebooks in R

A codebook is a technical document that provides an overview of and information about the variables in a dataset. The codebook ensures that the statistician has the complete background information necessary to undertake the analysis, and a codebook documents the data to make sure that the data is well understood and reusable in the future. Here we will show how to create codebooks in R using the dataMaid packages. The help pages for the datasets in R packages usually provide thorough information although the level of detail may vary quite substantially from dataset to dataset.

dataMaid: Your personal assistant for cleaning up the data cleaning process

As data analysts, we all have tasks that we enjoy more than others. Some like the exploratory analysis steps, some like statistical computing, while others enjoy visualizing and communicating the results of their analyses. But we have yet to meet a data analyst that is passionate about data cleaning, even though everyone is very much aware of the importance of a thorough, well-documented data cleaning. This first step of virtually any data analysis process is often unavoidable and key for smooth sailing through the rest of the data analysis.