1. #1
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90

    Statistcs assistance needed

    I am trying to figure the statistical relevance and reliability of a bet selection methodology, but I need some help.

    All of the bets are on moneylines.

    60 bets were made using 1% as the betting unit, and 33 bets were made using 2% as the betting unit. After 93 bets I am up 20%.

    What, if any, conclusions can be made at his point about this strategy?

    Thanks in advance for your help.

  2. #2
    Ganchrow
    Nolite te bastardes carborundorum.
    Ganchrow's Avatar Become A Pro!
    Join Date: 08-28-05
    Posts: 5,011
    Betpoints: 1088

    Quote Originally Posted by Bullajami View Post
    I am trying to figure the statistical relevance and reliability of a bet selection methodology, but I need some help.

    All of the bets are on moneylines.

    60 bets were made using 1% as the betting unit, and 33 bets were made using 2% as the betting unit. After 93 bets I am up 20%.

    What, if any, conclusions can be made at his point about this strategy?

    Thanks in advance for your help.
    At what odds were each of the 93 bets placed?

    When you refer to 1% as the "betting unit" is that a "to-risk" amount or a "to-win" amount?

    Are you compounding your bets so that had you started with 100 units and then found yourself up 10 units, would 1% then imply a wager of 1 unit or of 1.1 units?

  3. #3
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90

    Quote Originally Posted by Ganchrow View Post
    At what odds were each of the 93 bets placed?

    When you refer to 1% as the "betting unit" is that a "to-risk" amount or a "to-win" amount?

    Are you compounding your bets so that had you started with 100 units and then found yourself up 10 units, would 1% then imply a wager of 1 unit or of 1.1 units?
    From -105 to +1000. Mostly around +130. Do you really need all 93 odds?

    Betting unit is risked amount.

    Yes, betting unit is based on compounding results.

  4. #4
    Ganchrow
    Nolite te bastardes carborundorum.
    Ganchrow's Avatar Become A Pro!
    Join Date: 08-28-05
    Posts: 5,011
    Betpoints: 1088

    Quote Originally Posted by Bullajami View Post
    Do you really need all 93 odds?
    That depends ... do you really need an answer?

  5. #5
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90


    120
    120
    160
    140
    120
    145
    110
    130
    115
    150
    160
    121
    145
    105
    135
    150
    140
    145
    120
    135
    -105
    120
    140
    165
    165
    130
    100
    130
    130
    110
    114
    170
    155
    115
    155
    155
    145
    380
    140
    132
    140
    165
    180
    145
    128
    165
    105
    115
    145
    160
    190
    -105
    -105
    135
    130
    120
    175
    170
    135
    210
    105
    120
    125
    145
    140
    135
    210
    135
    200
    125
    210
    125
    120
    100
    150
    150
    125
    105
    125
    150
    135
    135
    120
    140
    240
    130
    150
    195
    105
    1000
    140
    170
    140

    Last edited by Ganchrow; 05-25-08 at 01:54 PM. Reason: added [extra] tag

  6. #6
    Ganchrow
    Nolite te bastardes carborundorum.
    Ganchrow's Avatar Become A Pro!
    Join Date: 08-28-05
    Posts: 5,011
    Betpoints: 1088

    Quote Originally Posted by Bullajami View Post


    -snip-

    Which bets were at 1% and which were at 2%?

  7. #7
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90

    Bottom 60 at 1%.

  8. #8
    Ganchrow
    Nolite te bastardes carborundorum.
    Ganchrow's Avatar Become A Pro!
    Join Date: 08-28-05
    Posts: 5,011
    Betpoints: 1088

    I'm coming up with a p-value of about 11.8%. This corresponds to the probability that a player not paying vig and with no edge would have achieved the same or better results strictly by chance.

  9. #9
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90

    0.882 probability that my method is effective? Am I understanding you correctly?

    Can you show me how it's calculated so that I can run the numbers again when I have more data?

    Thank you for your assistance!

  10. #10
    Ganchrow
    Nolite te bastardes carborundorum.
    Ganchrow's Avatar Become A Pro!
    Join Date: 08-28-05
    Posts: 5,011
    Betpoints: 1088

    Quote Originally Posted by Bullajami View Post
    0.882 probability that my method is effective? Am I understanding you correctly?
    There's an 88.2% probability that a bettor with no advantage and paying no vig would have achieved results worse than that you obtained.

    That's slightly different than simply stating that there's an 88.2% probability that your method was effective.

    Also ths conclusion takes your results in a vacuum,, meaning that it's assuming that this is the only strategy you're investigating. If these simply corresponds to but a single strategy out of many that just happened to be effective than the 88.2% figure would be too high. Perhaps drastically.

    By tradition, 95% is generally considered the minimum desired level for statistical significance. Hence, even if the above were true these results would not be considered statistically significant (meaning that we could not reject the hypothesis were due to "luck". This does not imply the results were due to luck, just that a statistician would be unable to reject that possibility at the 95% confidence level.

    Quote Originally Posted by Bullajami View Post
    Can you show me how it's calculated so that I can run the numbers again when I have more data?
    Because of the compounding effect I simply ran a 5,000,000 trial Monte Carlo simulation, which is how I obtained the 11.8% value.

    I need to run off to wife's birthday party now, but at some later point I'll write a post detailing the process involved. It's rather straightforward.

  11. #11
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90

    I thank God that I have access to your insane genius.

  12. #12
    Ganchrow
    Nolite te bastardes carborundorum.
    Ganchrow's Avatar Become A Pro!
    Join Date: 08-28-05
    Posts: 5,011
    Betpoints: 1088

    Quote Originally Posted by Bullajami View Post
    I thank God that I have access to your insane genius.
    God had exactly zero to do with it -- I made a pact with the devil years ago.

    (That's why I have such bad breath.)

  13. #13
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90

    Didn't Pascal call making a pact with the Devil a bad bet?

    I should have studied more when I was younger, but I was always too horny.

    I appreciate you sharing your scholarly knowledge, your Monte Carlo simulation software, and for using Altoids when you post.

  14. #14
    Ganchrow
    Nolite te bastardes carborundorum.
    Ganchrow's Avatar Become A Pro!
    Join Date: 08-28-05
    Posts: 5,011
    Betpoints: 1088

    We're assuming zero-vig so the expectation on each bet is zero.

    The variance of a bet of size x at decimal odds of d is given by (d-1)*x2. So for example the variance of a 1-unit bet at +130 would be 12 * (-1) = 1.3, while the variance of a 2-unit bet at -105 would be 22 * (-1) ≈ 3.8095.

    If we ignore compounding the variance of a linear combination of independent bets is the sum of the individual variances. In the provided data set the total variance works out to be about 272.86 units2. The standard deviation is the square root of the variance, which in this case is (272.86 units2)½ ≈ 16.519 units.

    The realized return is 20 16.510 ≈ +1.2108 standard deviations better than expected, implying a p-value of =NORMSDIST(1.2108) ≈ 88.701% from the normal distribution and =1-TDIST(1.2108,93-1,1) ≈ 88.545% from the t-distribution with 92 (i.e., 93 bets - 1) degrees of freedom (where NORMSDIST and TDIST refer to MS Excel's cumulative standard normal and Student t-distribution functions, respectively).

    One problem with the above analysis, however, is that it completely discounts the effect of compounding. Nevertheless, because we're dealing with a relatively small number of bets, each at a relatively small percentage of bankroll, this effect is fairly small.

    Taking compounding into account, the only real procedural difference would be in the calculation of variance. Because the result of previous bets impact the results of future bets (insofar as if we win an earlier bet we'd be betting more on a later bet it's no longer correct that the total variance equals the sum of the variances.

    Rather than go into some long derivation, I'll just state the result here. The total variance is given by the sum of the variances plus the sum of the product of the variances taken 2 at a time plus the sum of the product of the variances taken 3 at a time ... plus the product of all N variances.

    This obviously represents a huge number of terms (2N - 1 terms for N bets, 9,903,520,314,283,042,199,192,993,792 terms for 93 bets), but if bets are sufficiently small we can approximate by only looking at terms up to only the second or third order. Given the 93 bets of the original problem, the total variance (up to the second order) would then be:

    Total Variance ≈ ΣNi=12i] + ΣNi=12i * ΣNj=i+12j ]]
    which works out to a standard deviation of 16.629%, implying a t-distribution (with 92 degrees of freedom) p-value of 88.391%. Were we to go to the third order we'd of course find a slightly higher σ (~16.630%) and hence a slightly lower p-value (~88.390%).

    This is faily close to the p-value of 88.215% that I obtained from my 40,000,000-trial Monte Carlo run. The difference stems from having approximated the distribution of results with the t-distribution.

  15. #15
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90

    I truly appreciate your time on this.

    If I had a larger sample size, what would the impact be? Just an estimate. For example, if I had 93 additional identical bets with identical results, for 186 total bets, what is the magnitude of impact on the p-value?

    I am trying to get a feel for when I meet the threshold for statistical relevance (p-value of 5%)


  16. #16
    VideoReview
    VideoReview's Avatar Become A Pro!
    Join Date: 12-14-07
    Posts: 107
    Betpoints: 270

    Quote Originally Posted by Bullajami View Post
    I appreciate you sharing your scholarly knowledge, your Monte Carlo simulation software,
    Ganchrow, I am wondering if at some point if it would make sense to add your Monte Carlo script and input options (allow cut and paste) to the SBR betting tools? This could prove quite useful to the forum and save you a great deal of time in running simulations for everyone.

  17. #17
    Justin7
    Justin7's Avatar Become A Pro!
    Join Date: 07-31-06
    Posts: 8,577
    Betpoints: 1506

    Bullajami,

    I'd be more interested in your methodology in selecting plays than your actual results.

  18. #18
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90

    Quote Originally Posted by Justin7 View Post
    Bullajami,

    I'd be more interested in your methodology in selecting plays than your actual results.
    Once it becomes statistically relevant I will post. Until then it's just another crazy idea of mine.

  19. #19
    Ganchrow
    Nolite te bastardes carborundorum.
    Ganchrow's Avatar Become A Pro!
    Join Date: 08-28-05
    Posts: 5,011
    Betpoints: 1088

    Quote Originally Posted by Bullajami View Post
    I truly appreciate your time on this.

    If I had a larger sample size, what would the impact be? Just an estimate. For example, if I had 93 additional identical bets with identical results, for 186 total bets, what is the magnitude of impact on the p-value?

    I am trying to get a feel for when I meet the threshold for statistical relevance (p-value of 5%)

    If you included 93 additional out-of-sample bets, at the same odds and bet sizes as the initial 93, that yielded identical results (and I'm taking "identical results" to imply a return of 120%2 - 1 = 44%), then this would result in a p-value of roughly 96.763%.

  20. #20
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90

    At the risk of wearing out my welcome, I'd like to ask that this simulation be run again, please. I have increased the sample size to 201 bets, all on moneyline dogs. In reality I had a combination of 1% and 2% flat bets (risked amount), but I converted to a model where I used all 2% flat bets (risked). On this model I show an 83.002% ROI.
    Can you tell me the p-value of my results? (or explain how I could calculate it myself in a language I understand?)
    165
    135
    155
    195
    135
    115
    210
    125
    195
    200
    125
    130
    115
    240
    110
    160
    120
    125
    140
    190
    190
    120
    125
    170
    160
    130
    185
    135
    140
    165
    190
    105
    110
    125
    135
    140
    130
    125
    160
    155
    155
    130
    140
    115
    120
    125
    135
    135
    125
    150
    120
    140
    135
    165
    105
    140
    140
    175
    145
    140
    120
    120
    135
    120
    110
    120
    190
    110
    130
    170
    140
    170
    100
    175
    165
    165
    130
    195
    210
    135
    120
    125
    140
    170
    125
    140
    115
    120
    125
    145
    135
    125
    145
    110
    165
    175
    105
    115
    106
    150
    118
    130
    110
    105
    190
    120
    110
    100
    130
    220
    155
    120
    120
    160
    140
    120
    145
    110
    130
    115
    150
    160
    121
    145
    105
    135
    150
    140
    145
    120
    135
    120
    140
    165
    165
    130
    100
    130
    130
    110
    114
    170
    155
    115
    155
    155
    145
    380
    140
    132
    140
    165
    180
    145
    128
    165
    105
    115
    145
    160
    190
    135
    130
    120
    175
    170
    135
    210
    105
    120
    125
    145
    140
    135
    210
    135
    200
    125
    210
    125
    120
    100
    150
    150
    125
    105
    125
    150
    135
    135
    120
    140
    240
    130
    150
    195
    105
    1000
    140
    170
    140

  21. #21
    dcbt
    dcbt's Avatar Become A Pro!
    Join Date: 04-04-08
    Posts: 185
    Betpoints: 48

    I'm glad this thread got revisited - I missed it the first time around, so it gave me a chance to set up a spreadsheet for my own testing. One question, though - when using 'rate of return,' are we talking about the traditional 'investing' rate of return - ie, if a bankroll goes from 1000 to 1200, that's a 20% ROR? Or, are you using it in the manner many gamblers do, that being a percentage of money bet? I assume the former, but just verifying... Thanks.

    (By the way, I tested your revised data with my template and came up with 99.21% NORMSDIST. I *think* it's accurate, as I was able to tie to Ganchrow's calc in his prior post, so hopefully I've got my template set up right - we'll find out if/when he confirms.)

  22. #22
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90

    Quote Originally Posted by dcbt View Post
    I'm glad this thread got revisited - I missed it the first time around, so it gave me a chance to set up a spreadsheet for my own testing. One question, though - when using 'rate of return,' are we talking about the traditional 'investing' rate of return - ie, if a bankroll goes from 1000 to 1200, that's a 20% ROR? Or, are you using it in the manner many gamblers do, that being a percentage of money bet? I assume the former, but just verifying... Thanks.

    (By the way, I tested your revised data with my template and came up with 99.21% NORMSDIST. I *think* it's accurate, as I was able to tie to Ganchrow's calc in his prior post, so hopefully I've got my template set up right - we'll find out if/when he confirms.)
    Gahhhhh! I was trying to be careful, too. But you are correct, the 83.002% figure is the growth of bankroll number, not the ROI number.

    ROI based on total amount wagered is 15.94%

    Good catch, thanks.

  23. #23
    dcbt
    dcbt's Avatar Become A Pro!
    Join Date: 04-04-08
    Posts: 185
    Betpoints: 48

    Quote Originally Posted by Bullajami View Post
    Gahhhhh! I was trying to be careful, too. But you are correct, the 83.002% figure is the growth of bankroll number, not the ROI number.

    ROI based on total amount wagered is 15.94%

    Good catch, thanks.
    With the 15.94%, I get 67.84% NORMSDIST now.
    Which one is supposed to be used for this calc, the amt based on total wagered, I take it?

  24. #24
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90

    Quote Originally Posted by dcbt View Post
    With the 15.94%, I get 67.84% NORMSDIST now.
    Which one is supposed to be used for this calc, the amt based on total wagered, I take it?
    Ganch the Great never asked for my ROI based on total wagered for part one of this thread, so I tend to think (and hope!) its the 83% number.

  25. #25
    Ganchrow
    Nolite te bastardes carborundorum.
    Ganchrow's Avatar Become A Pro!
    Join Date: 08-28-05
    Posts: 5,011
    Betpoints: 1088

    Quote Originally Posted by Bullajami View Post
    Gahhhhh! I was trying to be careful, too. But you are correct, the 83.002% figure is the growth of bankroll number, not the ROI number.

    ROI based on total amount wagered is 15.94%

    Good catch, thanks.
    I'm still a bit unclear as to your chosen terminology.

    Given the list of 201 prices above, then at 2 units per wager standard deviation would be 34.41 units.

    If you're saying you risked a flat bet of two units per wager (201×2 = 402 units) and then ended up 15.94%×402 = 64.08 units, then your p-value would be about 1-NORMSDIST(64.08/34.41) ≈ 1-96.87% ≈ 3.13%.

    If OTOH, you're saying you risked a flat bet of two units per wager (201×2 = 402 units) and then ended up 15.92 units, then your p-value would be about 1-NORMSDIST(15.92/34.41) ≈ 1-67.84% ≈ 32.16%, which is exactly the same as dcbt's figure.

    (Oh and btw, were you indeed +83.002 units as your earlier post may have mistakenly implied (or as dcbt and I may have misinterpreted), then dcbt's figure of 99.21% would have been correct).

    The p-value of course, corresponds to the probability (from the normal distribution in these cases) that a player would no edge whatsoever would have obtained results at least this positive. Note that this implicitly assumes no data mining. For example, were I test to 100 different independent models, then we'd fully expect to see a model with a p-value of 1%. This, however, would by itself be of little predictive use.

  26. #26
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90

    Thanks again. Sorry I have not been clear.

    I am saying that the bet size was consistently 2% of bankroll, compounded after each result.

    After betting on the 201 underdogs listed, the bankroll increased 83.002% from its original size.

    The 15.94% number comes from dividing the dollar amount of bankroll growth by the total dollar amount wagered.

    So, which p-value applies?

    Or, if I have screwed up this terminology as well, what I am really after is some statistical evidence that my method of selecting these bets is sound.
    Last edited by Bullajami; 08-07-08 at 05:28 AM.

  27. #27
    Ganchrow
    Nolite te bastardes carborundorum.
    Ganchrow's Avatar Become A Pro!
    Join Date: 08-28-05
    Posts: 5,011
    Betpoints: 1088

    Quote Originally Posted by Bullajami View Post
    Thanks again. Sorry I have not been clear.

    I am saying that the bet size was consistently 2% of bankroll, compounded after each result.

    After betting on the 201 underdogs listed, the bankroll increased 83.002% from its original size.

    The 15.94% number comes from dividing the dollar amount of bankroll growth by the total dollar amount wagered.

    So, which p-value applies?

    Or, if I have screwed up this terminology as well, what I am really after is some statistical evidence that my method of selecting these bets is sound.
    Let's start with an easy question.

    What would your returns had been had you been flat betting?

  28. #28
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90

    All bets identical size, 2% of original bankroll. After 201 bets, +38.83 units

  29. #29
    Ganchrow
    Nolite te bastardes carborundorum.
    Ganchrow's Avatar Become A Pro!
    Join Date: 08-28-05
    Posts: 5,011
    Betpoints: 1088

    Quote Originally Posted by Bullajami View Post
    All bets identical size, 2% of original bankroll. After 201 bets, +38.83 units
    OK I'm still a bit unclear on your terminology. Is 1 unit in this example 1% of initial bankroll or 2%?

    So in other words, after 201 flat bets of let's say 1 unit each, how many units did you wind up?

  30. #30
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90

    Hypothetical bankroll starting = $500
    Betting unit $10 on every bet
    201 bets, odds as posted above
    Final bankroll is $888.30
    +38.83 betting units

  31. #31
    Ganchrow
    Nolite te bastardes carborundorum.
    Ganchrow's Avatar Become A Pro!
    Join Date: 08-28-05
    Posts: 5,011
    Betpoints: 1088

    Quote Originally Posted by Bullajami View Post
    Hypothetical bankroll starting = $500
    Betting unit $10 on every bet
    201 bets, odds as posted above
    Final bankroll is $888.30
    +38.83 betting units
    So flat betting 1 unit per bet over the 201 listed bets, your standard deviation would be about 17.20 units.

    Since your return over the same period was +38.83 units, this would imply a standard score for of 38.3 17.20 ≈ 2.26.

    This corresponds to a (one-tailed Gaussian) p-value of =1-NORMSDIST(2.26) ≈ 1.200%, which, if these results were strictly obtained out-of-sample, would certainly cause me to sit up and take notice.

  32. #32
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90

    Quote Originally Posted by Ganchrow View Post
    So flat betting 1 unit per bet over the 201 listed bets, your standard deviation would be about 17.20 units.

    Since your return over the same period was +38.83 units, this would imply a standard score for of 38.3 17.20 ≈ 2.26.

    This corresponds to a (one-tailed Gaussian) p-value of =1-NORMSDIST(2.26) ≈ 1.200%, which, if these results were strictly obtained out-of-sample, would certainly cause me to sit up and take notice.
    Ummmm...
    Thank you very much for all of your help, but I would bet (full Kelly) that anyone who knows what you just wrote could probably have done the math for themselves.


    Pretend for a minute that the tank on my toilet is not stacked with calculus text books. In fact, just go ahead and presume that I have not opened a math book of any kind for more than 2 decades.

    Does my data support a picking system that should be profitable in the long term?

  33. #33
    dcbt
    dcbt's Avatar Become A Pro!
    Join Date: 04-04-08
    Posts: 185
    Betpoints: 48

    .
    Last edited by dcbt; 08-07-08 at 11:03 AM. Reason: deleted, will just wait for ganch to reply in case i'm misusing nomenclature

  34. #34
    Ganchrow
    Nolite te bastardes carborundorum.
    Ganchrow's Avatar Become A Pro!
    Join Date: 08-28-05
    Posts: 5,011
    Betpoints: 1088

    Quote Originally Posted by Bullajami View Post
    Ummmm...
    Thank you very much for all of your help, but I would bet (full Kelly) that anyone who knows what you just wrote could probably have done the math for themselves.


    Pretend for a minute that the tank on my toilet is not stacked with calculus text books. In fact, just go ahead and presume that I have not opened a math book of any kind for more than 2 decades.

    Does my data support a picking system that should be profitable in the long term?
    Perhaps I didn't understand your initial question.

    You asked:
    Quote Originally Posted by Bullajami View Post
    Can you tell me the p-value of my results?
    The answer, (i.e., the p-value), as I gave in my previous post, is 1.200%. YOu can feel free to ignore the rest.

    Is that value generally considered significant? Yes, very much so.

    But the real question isn't so much the raw value as how these numbers were obtained.

    For example if you looked at 1,000 different strategies, you'd expect, purely by chance, to find about 12 strategies of significance that great (p-values ~ 1.2%). Imputing too much meaning in these results is known as data dredging or sometimes data mining and carries with it a very negative connotation. (Indeed it's been the ruin of many a poor boy and God I know I'm one ...)

    The point is that if you came up with a single theory, tested it on a data set different from the one used to postulate said theory then these results would most definitely be worth pursuing. To the degree that these results are indeed the result of data dredging I'd be increasingly cautious of its conclusions.

    Anyway, I've made quite a few posts on the subject of data mining and in-sample vs. out-of-sample testing (many of them in response to questions posed by poster VideoReview) for which if you search around a bit you should be able to find.

    Let me know if this makes sense and/or if you have further questions.

  35. #35
    Bullajami
    Bullajami's Avatar Become A Pro!
    Join Date: 12-23-05
    Posts: 472
    Betpoints: 90

    Quote Originally Posted by Ganchrow View Post
    Perhaps I didn't understand your initial question.

    You asked:The answer, (i.e., the p-value), as I gave in my previous post, is 1.200%. YOu can feel free to ignore the rest.

    Is that value generally considered significant? Yes, very much so.

    But the real question isn't so much the raw value as how these numbers were obtained.

    For example if you looked at 1,000 different strategies, you'd expect, purely by chance, to find about 12 strategies of significance that great (p-values ~ 1.2%). Imputing too much meaning in these results is known as data dredging or sometimes data mining and carries with it a very negative connotation. (Indeed it's been the ruin of many a poor boy and God I know I'm one ...)

    The point is that if you came up with a single theory, tested it on a data set different from the one used to postulate said theory then these results would most definitely be worth pursuing. To the degree that these results are indeed the result of data dredging I'd be increasingly cautious of its conclusions.

    Anyway, I've made quite a few posts on the subject of data mining and in-sample vs. out-of-sample testing (many of them in response to questions posed by poster VideoReview) for which if you search around a bit you should be able to find.

    Let me know if this makes sense and/or if you have further questions.
    Muchas gracias, Senor.

    I think I get it now. Once again, thank you for the noggin loan and the patience.

    This was, in fact, out-of-sample data based on a theory/hunch that I had from previous betting experiences. Not the result of data mining, just some casual human observations I had made along the way. I partitioned off an inconsequential part of my BR and started betting my hunch/theory. Once it looked like the results were promising, I decided I had better try to prove it wasn't a fluke.

    Now that I have a system that seems likely to provide an edge (p-values ~ 1.2%) I believe the next step for me would be to quantify that edge so that I can properly size bets for maximum BR growth. Can I find an aggregate edge and apply it to all bets or will I be risking too much value by not establishing this for individual bets?

    Finally, where's the rosetta stone for Ganchrow posts? (As in, what reference(s) do I need to comprehend and learn this for myself?)


12 Last
Top