1. #1
    steve227
    steve227's Avatar Become A Pro!
    Join Date: 08-20-09
    Posts: 61
    Betpoints: 1413

    Question about outliers

    when working your own set of values for lets say a data set of values in any half time score probability do you guys crop out outliers and to what degree.if you did ,would data set be more accurate or not

  2. #2
    roasthawg
    roasthawg's Avatar Become A Pro!
    Join Date: 11-09-07
    Posts: 2,990

    I leave them in but it's something I go back and forth on.

  3. #3
    Pokerjoe
    Pokerjoe's Avatar Become A Pro!
    Join Date: 04-17-09
    Posts: 704
    Betpoints: 307

    There are times a median is more useful than an average. Play around with that. If you're using excel, look into TRIMMEAN. But it is important not to let your numbers get warped by extreme results.

  4. #4
    steve227
    steve227's Avatar Become A Pro!
    Join Date: 08-20-09
    Posts: 61
    Betpoints: 1413

    thanks for the info guys bol this year

  5. #5
    uva3021
    uva3021's Avatar Become A Pro!
    Join Date: 03-01-07
    Posts: 537
    Betpoints: 381

    well it depends on sample size, which is obvious and banal, but i wouldn't ignore outliers, just find another way to optimize your central tendency (as mentioned maybe median, mode, etc...)

    don't remove apparent anomalies because it works mathematically, for those anomalies are part of the system itself, and variation within the system as a whole just like every other number is variation within that same system

  6. #6
    Wrecktangle
    Wrecktangle's Avatar Become A Pro!
    Join Date: 03-01-09
    Posts: 1,524
    Betpoints: 3209

    Outliers can be a very serious problem especially if you are using regression. You must examine each to see if it is an erroneous point as it will "punch way over its weight" in the regression. Typically I regress with all points in and then with suspected outliers removed to see how much change results in the regression equation. Almost always I remove them.

  7. #7
    Maverick22
    Maverick22's Avatar Become A Pro!
    Join Date: 04-10-10
    Posts: 807
    Betpoints: 58

    Do like they do in scoring gymnastics.(or is it figure skating)?

    Throw out the worst. Throw out the best. And use the rest.

  8. #8
    Pokerjoe
    Pokerjoe's Avatar Become A Pro!
    Join Date: 04-17-09
    Posts: 704
    Betpoints: 307

    I don't think it's that simple, Maverick, though TRIMMEAN does exactly that.

    Consider a team that (however you do it) rates 90, 94, 76, and 89 in it's last 4 games.

    I wouldn't throw out the high and low scores. They're all fair info on the team's ability. I'd average them, or median/average them (that being a half median, half average).

    But there are other data sets where the extreme results should be discounted or ignored, sure.

Top