
A piece from me to you
Over the years, I have learned to Be PROACTIVE; Be RESPONSIBLE AND TAKE RESPONSIBILITY; NEVER ENTERTAIN IDLENESS; and LOVE
Exploratory Data Analysis (EDA) is an approach for data analysis that often uses statistical graphics and other data visualization methods to maximize insight into a data set. In this work, we started by importing the data into R. Then probed and explored the data by inspecting the variable types, outlying and possibly wrong records, and other issues. We checked if there are any missing values which we applied list-wise deletion, we inspected the frequency distribution of the target variable class and see, e.g., whether we have an unbalanced classification problem. Learn, like and feel free to leave a comment.
This project uses European Soccer Database which has more than 25,000 matches and more than 10,000 players for European professional soccer seasons from 2008 to 2016. The exploratory data analysis includes some steps for exploring and cleaning the dataset, some steps for feature engineering using Pearson’s correlation coefficient with key attributes and domain knowledge, and some steps for grouping similar clusters using unsupervised machine learning algorithm, K-Means aided by Principal Components Analysis. Learn, like and feel free to leave a comment below.
Machine learning has found application in various sectors of human endeavors of which marketing is not an exemption. Marketing is a key part of any business. It requires strategic steps to maximize its impact in today’s dynamic world using every available medium. In this work, we applied Logistic regression – a classification algorithm in machine learning to achieve this goal. Our goal is to achieve high true positive (TP) and reduce false negative or type 2 error. In other words, we want to have high tolerance on false positive or type 1 error. We used two methods in our choice of algorithm and evaluated them. Both methods gave accuracy of 0.70 but one method outshines the other in terms of error type reduction.
From this notebook, you will learn about logistic regression. Concretely, you will learn how to implement logistic regression for sentiment analysis on tweets. Given a tweet, you will decide if it has a positive sentiment or a negative one.