Exploratory Data Analysis of UCI Diabetes Dataset in R
This is a collaborative project
Exploratory Data Analysis (EDA) is an approach for data analysis that often uses statistical graphics and other data visualization methods to maximize insight into a data set. In this work, we started by importing the data into R. Then probed and explored the data by inspecting the variable types, outlying and possibly wrong records, and other issues. We checked if there are any missing values which we applied list-wise deletion, we inspected the frequency distribution of the target variable class and see, e.g., whether we have an unbalanced classification problem. Learn, like and feel free to leave a comment.