Hi all,
I have been working on modeling MLB for most of a year and have a (primarily econometric) model that I can confidently say should be a long-term winner. Problem is, I'm not much of a programmer. I did take an Intro Programming course (IN JAVA) in my last semester of school (last spring) and I can program pretty well in Stata (the statistical program I use), but the real problem is updating the model every day with new data. I've written code to generate a prediction (with the two teams and pitchers as inputs) but I'll need to add data new game data every day.
My question is this: Do any of you/have any of you faced this issue? What is your solution? Do I pretty much need to learn to write a webcrawler using Perl to scrape data offline?
Any help/advice would be greatly appreciated.
I have been working on modeling MLB for most of a year and have a (primarily econometric) model that I can confidently say should be a long-term winner. Problem is, I'm not much of a programmer. I did take an Intro Programming course (IN JAVA) in my last semester of school (last spring) and I can program pretty well in Stata (the statistical program I use), but the real problem is updating the model every day with new data. I've written code to generate a prediction (with the two teams and pitchers as inputs) but I'll need to add data new game data every day.
My question is this: Do any of you/have any of you faced this issue? What is your solution? Do I pretty much need to learn to write a webcrawler using Perl to scrape data offline?
Any help/advice would be greatly appreciated.