Statistics

Chi-Squared Test of Independence

Introduction Chi-Squared Test of Independence determines the association between categorical variables. This means that it says whether the variables are related to each other or independent. It’s also called Chi-Square Test of Association. The Chi-Squared Test uses a contingency table to determine the association. The contingency table contains the data which is classified according to…Read more

Hypothesis testing using T-Test

Introduction A T-test is a type of inferential statistic which is used to determine if there is a significant difference between the means of two groups which may be related in certain features. Types of T-test are: One sample T-test Two sample T-Test Paired T-Test Reference: https://github.com/krishnaik06/T-test-an-Correlation-using-python/blob/master/Hypothesis_Testing.ipynb https://www.tutorialspoint.com/scipy/scipy_stats.htm https://en.wikipedia.org/wiki/Student%27s_t-test

R-Squared and Adjusted R-Squared

Introduction R-Squared and Adjusted R-Squared are the key techniques to check the accuracy for a Regression problem. We will understand each in detail in the subsequent sections. There are various techniques to check the accuracy of different problems. In case of classification problems, we use the confusion matrix, F1-Score, Precision, Recall etc. You can check…Read more

EDA using Probability Density Function and Cumulative Distribution Function

Introduction In this post, we will discuss about 2 very important topics and how it helps in Exploratory data analysis — Probability Density Function and Cumulative Density Function. A continuous random variable distribution can be characterized through its Probability Distribution Function. We will understand this statement in greater detail in the subsequent section. Cumulative Density Function…Read more

Z-Score and Quantiles in Statistics

Introduction In this post we will get into detail of understanding Z-Score and what are its application with respect to Gaussian/Normal distribution . We will also discuss about Quantiles and implement it to see how a particular distribution is divided into different Quantiles. If we try to understand about Z-score in layman language, then it basically shows about…Read more

Types of distribution

Introduction In this post, we will be covering basics about the types of distribution. It will be a foundation for creating machine learning models going forward. Moreover, these fundamentals are also necessary for performing Exploratory Data Analysis (EDA).  There are various types of distribution that we could encounter. Some of them that we will be…Read more