I'm looking for a recommendation on a normality test for frequency distributions. I am implementing it programatically. Essentially, the purpose of the test is to remove erroneous data from consideration in my model. I want to test for the normal distribution and remove sample points that do not lie within four standard deviations from the mean. Does anyone recommend an efficient test for this? K-S? Shapiro-Wilk? These seem too cumbersome for what I'm trying to accomplish. There must be an easier test.
BTW, the reason for the test is that some of the distributions are not Guassian, as the dataset may consist of combined normal distributions. In that case, I'll simply have to utilize Chebyshev's inequality, remove data within seven standard deviations, and further dissect the data from there.
BTW, the reason for the test is that some of the distributions are not Guassian, as the dataset may consist of combined normal distributions. In that case, I'll simply have to utilize Chebyshev's inequality, remove data within seven standard deviations, and further dissect the data from there.