When working out your own set of values — say, a data set of half-time score probabilities — do you guys crop out outliers, and to what degree? If you did, would the data set be more accurate or not?
There are times a median is more useful than an average. Play around with that. If you're using Excel, look into TRIMMEAN. But it is important not to let your numbers get warped by extreme results.
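If you're not in Excel, a trimmed mean is easy to roll yourself. A minimal Python sketch of what TRIMMEAN does (the scores here are made up for illustration): TRIMMEAN drops a given fraction of points in total, split evenly between the two tails, rounding the count trimmed down so it stays symmetric.

```python
from statistics import mean, median

def trimmean(values, percent):
    """Rough equivalent of Excel's TRIMMEAN: trim `percent` (0..1) of the
    points in total, half from each tail, then average what's left."""
    n = len(values)
    k = int(n * percent) // 2            # points to drop from each end
    trimmed = sorted(values)[k:n - k] if k else sorted(values)
    return mean(trimmed)

scores = [90, 94, 76, 89, 52]            # made-up ratings with one low outlier
print(mean(scores))                      # plain average, dragged down by the 52
print(trimmean(scores, 0.4))             # drops the top and bottom point
print(median(scores))                    # another robust option
```

The point is to compare all three side by side: the plain mean gets pulled toward the outlier, while the trimmed mean and the median barely move.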
Well, it depends on sample size, which is obvious and banal, but I wouldn't ignore outliers — just find another way to optimize your central tendency (as mentioned, maybe median, mode, etc.).
Don't remove apparent anomalies just because it works mathematically. Those anomalies are part of the system itself; they are variation within the system as a whole, just like every other number is variation within that same system.
Outliers can be a very serious problem especially if you are using regression. You must examine each to see if it is an erroneous point as it will "punch way over its weight" in the regression. Typically I regress with all points in and then with suspected outliers removed to see how much change results in the regression equation. Almost always I remove them.
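That fit-with/fit-without check is easy to script. A sketch with made-up data and a plain least-squares line (pure stdlib, no libraries assumed):

```python
def ols(xs, ys):
    """Return (slope, intercept) of a simple least-squares line."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    b = sxy / sxx
    return b, my - b * mx

xs = [1, 2, 3, 4, 5, 6]
ys = [2.1, 3.9, 6.2, 8.0, 9.9, 30.0]     # last point is a suspected outlier
slope_all, _ = ols(xs, ys)               # fit with every point in
slope_trim, _ = ols(xs[:-1], ys[:-1])    # fit with the suspect removed
print(slope_all, slope_trim)             # see how hard the outlier pulls the line
```

With this data the single bad point more than doubles the slope, which is exactly the "punching over its weight" effect: one point at the edge of the x-range can swing the whole regression equation.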
I don't think it's that simple, Maverick, though TRIMMEAN does exactly that.
Consider a team that (however you rate it) scores 90, 94, 76, and 89 in its last 4 games.
I wouldn't throw out the high and low scores; they're all fair info on the team's ability. I'd average them, or median/average them (that being a half-median, half-average blend).
But there are other data sets where the extreme results should be discounted or ignored, sure.
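For what it's worth, that 50/50 median-average blend is a one-liner. A sketch using the four game ratings from the example above:

```python
from statistics import mean, median

def blend(values):
    """Half median, half average: damps extreme games without discarding them."""
    return 0.5 * mean(values) + 0.5 * median(values)

ratings = [90, 94, 76, 89]               # the example team's last 4 games
print(mean(ratings), median(ratings), blend(ratings))
```

It keeps every data point in play but halves the pull that any single extreme game has on the final rating, sitting between the mean and the median.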