I posted this over at BC as a new poster. I doubt I'll get much response as most of those guys are vegas sharps who do less modeling and more action, but who knows...
Anyway in the past, I've always simply divided up the sports modeling work into two basic techniques:
Simulations - where every play, or set of plays are coded
Expected Value - where the game play is synthesized into a metadata set and a value or set of values are compared
In The Simulation world (sims), Monte Carlo (**) seems to be the norm. In the modeling and especially the Statistical world, ** is held in pretty high regard. Typically, ** has two difficulties, 1) it typically requires a very large db to derive its characteristics, and 2) the actual running of the sim can go to thousands of iterations, sometimes requiring a lot of time since a lot of code these days is built with interpreters instead of compilers.
In the Expected Value (EV) world, all sorts of sub-variations seem to exist. One of the earliest that I’ve seen was the Dudley Model described in Art Glantz’s “The Winner’s guide to Pro Football Betting.” While Dudley was simple enough to be easily constructed by using a hand calculator, it simply was not as efficient as a simple Yards per Point formula. As I’ve mentioned in other posts, we’ve had Elo (from the chess World) and the recursive Predictor (Jeff Sagarin calls this Pure Points) that are fairly well known.
The nice thing about EV methods is: it’s simple - doesn’t require a lot of data to build, fast running, and can be built on a spreadsheet so even the unwashed masses can do it.
Lately, I’ve been playing with Hybrids, models that have combinations of both sims and EV. While in the past I’ve used some market efficiency models, but lately I’ve been playing around with a Signal to Noise approach that has it’s derivation in the SONAR/RADAR world. Yeah OK, maybe it's because I’m working with the Navy and I’m always fascinated by weird new algorithms but there just might be something there.
So the point of this whole diatribe is, I’m just wondering if there are any other approaches that folks are working on that you’d (gasp!) care to mention in an open forum.
Anyway in the past, I've always simply divided up the sports modeling work into two basic techniques:
Simulations - where every play, or set of plays are coded
Expected Value - where the game play is synthesized into a metadata set and a value or set of values are compared
In The Simulation world (sims), Monte Carlo (**) seems to be the norm. In the modeling and especially the Statistical world, ** is held in pretty high regard. Typically, ** has two difficulties, 1) it typically requires a very large db to derive its characteristics, and 2) the actual running of the sim can go to thousands of iterations, sometimes requiring a lot of time since a lot of code these days is built with interpreters instead of compilers.
In the Expected Value (EV) world, all sorts of sub-variations seem to exist. One of the earliest that I’ve seen was the Dudley Model described in Art Glantz’s “The Winner’s guide to Pro Football Betting.” While Dudley was simple enough to be easily constructed by using a hand calculator, it simply was not as efficient as a simple Yards per Point formula. As I’ve mentioned in other posts, we’ve had Elo (from the chess World) and the recursive Predictor (Jeff Sagarin calls this Pure Points) that are fairly well known.
The nice thing about EV methods is: it’s simple - doesn’t require a lot of data to build, fast running, and can be built on a spreadsheet so even the unwashed masses can do it.
Lately, I’ve been playing with Hybrids, models that have combinations of both sims and EV. While in the past I’ve used some market efficiency models, but lately I’ve been playing around with a Signal to Noise approach that has it’s derivation in the SONAR/RADAR world. Yeah OK, maybe it's because I’m working with the Navy and I’m always fascinated by weird new algorithms but there just might be something there.
So the point of this whole diatribe is, I’m just wondering if there are any other approaches that folks are working on that you’d (gasp!) care to mention in an open forum.