Z score/Z value

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • josie88
    SBR Rookie
    • 02-12-09
    • 48

    #1
    Z score/Z value
    I could use a little help here. I was reading Ganch's post:

    Sports betting and handicapping forum: discuss picks, odds, and predictions for upcoming games and results on latest bets.


    I understand the underlying theory but the math is still elusive.

    From Mike Orkin (sorry, the software he's talking about here is no longer available):

    Z-VALUES

    The z-value measures the likelihood that an observed event would occur due only to chance. The larger the z-value, the less likely it is that chance alone would cause the event. A z-value of 2 or larger is typically called "statistically significant" -- the likelihood is less than 5% that chance would cause such an event. A z-value of 3 or larger indicates that the likelihood is less than three in a thousand that chance would cause the observed event. The Optimizer allows you to save any situations with z-values of 2 or larger.

    Z-values are more useful than percentages in measuring the strengths of win-loss statistics because, for example, percentages don't take into account the total number of games played. For example a record of 2-0 versus the point spread gives a win-loss record of 100% and so does a record of 10-0, yet 10-0 is clearly a better record. On the other hand, the z-value of a 2-0 record is 1.41, whereas the z-value of a 10-0 record is 3.16, correctly indicating that 10-0 is the stronger result.
    If each team in a game is equally likely to cover the point spread, you can think of the game as a coin toss. Under this assumption, a 2-0 record is equivalent to tossing a coin twice and getting heads on each toss, which has probability 1 in 4. A 10-0 record is equivalent to tossing a coin ten times and getting heads on each toss, which has probability 1 in 1,024. This is reflected by z-values. The higher the z-value, the less likely it is that the record would arise due to chance.
    Technically, z-values can be either positive or negative, depending on the direction of the deviation from the average value. For example, a win-loss record of 10-0 has z-value = 3.16, and a win-loss record of 0-10 has z-value = -3.16. Because of this symmetry, the Optimizer displays only absolute z-values.
    When searching through large amounts of data, it's not a good idea to rely only on large z-values to identify predictable patterns. An important statistical law, known as the Lottery Principle, asserts that given enough opportunity, weird events will happen due to chance alone. When you toss a coin enough times, you'll eventually get ten heads in a row. Similarly, if you use software like the Optimizer to search through data, you will uncover seemingly strong situations that may be due only to chance. Separating the good stuff from random noise may require further work. For example, you could apply situations with large z-values to fresh data or look at them from different perspectives. The ability to detect predictable patterns in a sea of data can translate into long-term profit.


    Questions I have:
    1) Is Z score and Z value the same thing?
    2) How could you use Excel to get the same numbers Orkin has, i.e. a 10-0 z-value is 3.16? Is it possible to just have two inputs, wins and losses, and get the z value?
    3) In Ganch's example:
    Z(han. A) = (79 units - 0 units) / 121.696 units ≈ 0.6492
    Z(han. B) = (26.5 units - 0 units) / 18.28 units ≈ 1.4500
    Neither handicapper has a Z value above 2. Since Orkin asserts 2 is statistically significant, should we conclude that neither capper has proven themselves yet, regardless of how they compare to each other?
  • suicidekings
    SBR Hall of Famer
    • 03-23-09
    • 9962

    #2
    Ganchrow's assumption for his calculation is that an average bettor is a breakeven bettor, so therefore, the mean profit is zero units. It's a comparison of one person/team's performance to the mean, so you would need to determine the mean for the sample you're considering.

    If you're talking about handicapping, then a mean of zero is appropriate, so all you need to know is the performance of the capper (profit) and the standard deviation of the bets (the square root of the sum of the variances of each bet). If you want to use it for something else, you need to know the mean of the population, which might be 50% or may not, depending on what you're talking about.

    The Z score is representative of the capper's place on a Normal Distribution of a population of cappers (a bell curve). You can use the NORMDIST function in Excel to figure out how the bettor compares to the general population.

    For example:
    A bettor makes 2 bets.
    1) Bet 2 units at -110. Result: win 2 units.
    2) Bet 1 unit at +105. Result: lose 1 unit.
    Net profit = 1 unit

    The bettor has a net profit of 1 unit. To calculate his Z score, calculate the variance for each bet, and sum them up. Take the square root, and you have the standard deviation for the sample.

    Variance = (Bet Size)^2 * (Decimal Odds - 1)
    1) 2^2 * (1.91 - 1) = 3.64
    2) 1^2 * (2.05 - 1) = 1.05

    Sum of Variances = 3.64 + 1.05 = 4.69
    Standard Deviation = 4.69^0.5 = 2.16

    Z-Score = (Profit - Avg Profit) / St Dev = (1 - 0) / 2.16 = 0.463

    Then, if you want to see how that compares to other cappers, you can use the NORMDIST function in Excel.

    General = 1-NORMDIST(PROFIT,MEAN,ST.DEV,FALSE)
    Our Example = 1-NORMDIST(1,0,2.16,FALSE) = 32.2%

    This result says that 32.2% of bettors would have performed better, placing wagers under similar odds and similar wager sizes. So really, that's not that statistically significant. Compare that to a bettor with a z-score of 2, which in our example would be equivalent to the bettor placing both bets at +600 odds (and assuming the same result of 1 win, 1 loss), that would mean the bettor would have been outperformed only by 2.2% of the population.

    So to answer your questions
    1) Yes
    2) see above
    3) A Z-Score of 2 puts you in the 97th percentile of bettors, so depending on what your goals are, the standard of 2 may be a little higher than is reasonable (especially if you're an inexperienced bettor). However, the sample size is what makes it statistically significant more than anything, so a result drawn from a small sample doesn't have as high of a confidence interval. It's success over the long term that qualifies a capper's ability.
    Last edited by suicidekings; 10-29-09, 03:48 PM.
    Comment
    • josie88
      SBR Rookie
      • 02-12-09
      • 48

      #3
      Serious thanks, man.
      Comment
      • suicidekings
        SBR Hall of Famer
        • 03-23-09
        • 9962

        #4
        No problem. Good luck.
        Comment
        SBR Contests
        Collapse
        Top-Rated US Sportsbooks
        Collapse
        Working...