Machine learning interview questions

1 . What is outliers , How to locate them , what are the treatment methods Outliear - A value that "lies outside " (is much smaller or longer than ) most of the value in set of data Example - 23, 29, 32, 3, 27, 83, 28 both 3 and 83 are outlier There are several methods for locating outliers in a dataset including Visualization - ploting the data on a graph and visually inspecting it can often reveal outliers Z- Scores - A z-score is a measure of how many standard deviation a data point with a z-score greater than or less than -3 are considered outliers Interquartile Range - The IQR is the difference between the 75th and 25th percentiles of a dataset . Data points that are more than 1.5 times the IQR below the first quartile or above the third quatile are considered outliers Mahalanobis distance - This methods calculate the distance of each data point from the me...