Hello friends! Overplotting is one of common issue faced by analysts during EDA (Exploratory data analysis), data visualization and getting inference the results. We’ll see few methods to prevent it.
Today we’ll see about the dplyr package in R also known as grammar of data manipulation. In the last tutorial we saw about the ggplot2 which is another famous package in R. It consist of set of functions which helps you for different type of data manipulation. I’ll be using medical cost data set from Kaggle(data source). I’ve listed all possible documentation and resources to learn more about the dplyr package.
Recently I started learning Alteryx , Alteryx offers an end-to-end self service data analytics
software that empowers data analysts and scientists alike.