Hi,
I am looking to improve my tennis model. It's pretty accurate, but still could be improved.
I have a lot of game data, but the data only contains game, set and wins. I haven't found a way to get serve data for low level tennis tournaments, so it doesn't seem workable to use ATP & WTA serve data for some players, but nothing for others. If there's a way to compensate for that, please let me know.
Anyway, I am reading 'Analytic Methods in Sports' and was looking at linear regression. The book is detailed on regression analysis, but not how to incorporate it into predicting a winner between two players. I've found quite strong correlations between winning matches, and winning games and sets. As you would expect. There's also strong, but not as strong correlations between just playing games and sets (win or lose) and winning matches. I guess the spuds only play a few games, then give up. The better players play at a level they can win and often? .
Does this mean that the regression information is useless? I have the regression equations for both men and women for sets won predicting matches won. I'm not sure it helps hower. I haven't found a way to integrate it into my existing model that uses scores and takes into accounts opponents.
Any tips on how you would go about using regression information in a model would be handy.
Thanks.
B.