So, I'm getting ready to embark on developing a system. I've got a master's degree in electrical engineering (specifically communications), so I'll be putting a lot of the probability and random-processes knowledge I've gained (from studying noise in communications systems) to work here.
I've been thinking about this problem for a few years now, working on it sporadically, but I keep running into one fundamental problem: what to use as a metric?
As I start to model a team's % chance of winning any particular game, how do I know whether a change I make helps or hurts my system?
I have just two thoughts so far:
1) Bin my predictions: Create bins of winning percentage, where I take all the games in which I predict the favorite to win 50-52.5% of the time, 52.5-55% of the time, etc., and calculate the actual % of games the favorite (my favorite) wins for each of these bins. Ideally, each bin's actual winning percentage would fall right in the middle of the bin.
2) Actually track it. Paper bet the system and see if a change increases or decreases my profitability.
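Idea 1 can be sketched in a few lines. This is a minimal, hypothetical version (the function name and bin edges are my own illustration, not anything from a particular system): group games by predicted favorite win probability and compare each bin's prediction range to the actual win rate.

```python
def calibration_bins(predicted, outcomes, edges):
    """For each [lo, hi) probability bin, report how many games fell in it
    and the actual fraction of those games the favorite won.

    predicted: predicted favorite win probabilities (0-1)
    outcomes:  1 if the favorite won, 0 otherwise
    edges:     ascending bin edges, e.g. [0.50, 0.525, 0.55, ...]
    """
    rows = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        in_bin = [o for p, o in zip(predicted, outcomes) if lo <= p < hi]
        if in_bin:
            rows.append((lo, hi, len(in_bin), sum(in_bin) / len(in_bin)))
    return rows

# Toy example: six games, three bins of width 2.5%
preds = [0.51, 0.52, 0.54, 0.56, 0.51, 0.53]
wins = [1, 0, 1, 1, 0, 1]
for lo, hi, n, rate in calibration_bins(preds, wins, [0.50, 0.525, 0.55, 0.575]):
    print(f"{lo:.3f}-{hi:.3f}: n={n}, actual win rate={rate:.2f}")
```

If the model is well calibrated, each bin's actual win rate should sit near the bin's midpoint (subject to sample-size noise, which is where a z-score check comes in).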
The flaws I see in each are:
1) Relying on bins assumes I'm accounting for all of the predictive variables. Since of course I won't be, the games in each bin are no longer truly comparable, and the missing factors will distort the actual win % within each bin. What will happen is that I'll end up betting the games that happen to have a more favorable line (because their true win % is actually lower than the bin would suggest), and passing on the games with a less favorable line (because their true win % is actually higher than the bin suggests). This is exactly the opposite of what a winning system should do, so I'm thinking this is not such a good idea.
2) In order to gain a significant number of results, I'll be relying on the validity of posted lines. I've been logging my own lines since mid-June of last season, so I know those are accurate, but before that I'll be relying on covers.com lines. Also, it will be less obvious how any particular change affects the system, since I'll be relying on a single output (my return), which doesn't give much feedback about what I fixed or broke.
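For completeness, idea 2 can be sketched too. This is a hypothetical flat-stake paper-betting loop, assuming decimal odds and a simple "bet only when my probability beats the line's implied probability" rule (the name, staking plan, and edge rule are my illustration, not a prescribed method; real lines also carry vig, which this ignores):

```python
def paper_bet(records, stake=1.0):
    """Flat-stake paper betting: bet one unit on the favorite whenever the
    model's win probability exceeds the line's implied probability.

    records: iterable of (model_prob, decimal_odds, won) tuples, won = 1/0.
    Returns (number of bets, total profit, return on turnover).
    """
    profit, bets = 0.0, 0
    for model_prob, odds, won in records:
        implied = 1.0 / odds  # naive implied probability; ignores the vig
        if model_prob > implied:  # perceived positive-EV spots only
            bets += 1
            profit += stake * (odds - 1.0) if won else -stake
    roi = profit / (bets * stake) if bets else 0.0
    return bets, profit, roi

# Toy example: three games at 1.90 decimal odds
games = [(0.60, 1.90, 1), (0.40, 1.90, 0), (0.55, 1.90, 0)]
print(paper_bet(games))
```

The downside noted above still applies: this collapses everything into one number (return), so it tells you whether a change helped overall but not which part of the model it helped.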
So, I'm wondering if there are any ideas for better metrics that will guide me along in this process. Perhaps the answer is no, and that's what makes this so much fun.
(Note: I've read Ganch's post on Z-scores and fully intend to use it, and I've seen this thread as well; between them, maybe they give all the answers to this question that there are...)
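I haven't seen Ganch's exact formulation, but one common z-score check along these lines compares the actual number of wins to what the model's own probabilities predict, normalized by the binomial standard deviation (the function below is my own hedged sketch of that idea, not Ganch's method):

```python
import math

def win_total_z_score(probs, outcomes):
    """Z-score of actual wins versus the model's expected wins:
    z = (actual - expected) / sqrt(sum of p*(1-p)).
    A large |z| suggests the model's probabilities are systematically off.

    probs:    model win probabilities for each game
    outcomes: 1 if that side won, 0 otherwise
    """
    expected = sum(probs)
    variance = sum(p * (1 - p) for p in probs)  # binomial variance per game
    actual = sum(outcomes)
    return (actual - expected) / math.sqrt(variance)

# Toy example: four coin-flip games that all land wins -> z = 2
print(win_total_z_score([0.5, 0.5, 0.5, 0.5], [1, 1, 1, 1]))
```

The same test can be run per bin from idea 1, which gives per-bin feedback instead of one aggregate number.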