1. #1
    roasthawg
    roasthawg's Avatar Become A Pro!
    Join Date: 11-09-07
    Posts: 2,990

    Regression software question?

    I'm looking for some cheap software to run multiple linear regressions that can handle a ton of variables and do backward stepwise automatically. The software I'm using now works alright but it's really slow and is having problems with the number of variables I'm inputting. Any ideas? Thanks.

  2. #2
    MrX
    MrX's Avatar Become A Pro!
    Join Date: 01-10-06
    Posts: 1,540

    You can use DataFit free for 30 days:

    http://www.oakdaleengr.com/

  3. #3
    roasthawg
    roasthawg's Avatar Become A Pro!
    Join Date: 11-09-07
    Posts: 2,990

    Quote Originally Posted by MrX View Post
    You can use DataFit free for 30 days:

    http://www.oakdaleengr.com/
    Thanks but it'll only take 20 independant variables... I need more.

  4. #4
    Dave Head
    Dave Head's Avatar Become A Pro!
    Join Date: 07-22-09
    Posts: 73

    hi roasthawg,

    gretl is the GNU Regression, Econometrics, and Time-series library. You can download the source code for free at:

    http://sourceforge.net/search/?type_...rds=regression

    You can also download a statistics macro from:

    http://www.ooomacros.org/user.php#106652

    Here's a package written java:

    http://openforecast.sourceforge.net/...sionModel.html
    Last edited by Dave Head; 11-17-09 at 03:27 PM. Reason: added more links

  5. #5
    roasthawg
    roasthawg's Avatar Become A Pro!
    Join Date: 11-09-07
    Posts: 2,990

    Quote Originally Posted by Dave Head View Post
    hi roasthawg,

    gretl is the GNU Regression, Econometrics, and Time-series library. You can download the source code for free at:

    http://sourceforge.net/search/?type_...rds=regression
    Interesting and thanks for the reply! Before I download this and mess with it myself maybe you could answer a few questions for me.

    1. Is this easy to use? Since all I want to do is run some simple mult. linear regressions I don't want to have to work to hard to learn the software.

    2. Can you do forward and backward stepwise automatically?

    3. Can it handle a lot of independent variables?

    Thanks a lot!

  6. #6
    Dave Head
    Dave Head's Avatar Become A Pro!
    Join Date: 07-22-09
    Posts: 73

    hi roasthawg,

    I've never used it myself. My experience with downloading free source code has been very good, though. Many of these packages come with executable binaries that don't need to be compiled.

    1. Easy to use? Probably not in this case. When I download free source code, I almost always have to re-write it and then compile it. Not that it has bugs, but it's not exactly what I want.

    2. Forward and backward stepwise automatically? I have no idea what you mean, and yes, I'm familiar with multiple regression.

    3. Can it handle a lot of independent variables? If not, you can just re-write it. You want 72 independent variables? no problem.

    By the way, the list of links that I gave earlier was in no way complete. You can do a Google or Yahoo search on "regression multiple download source" or something similar and get a lot more links.
    Last edited by Dave Head; 11-17-09 at 03:42 PM. Reason: typo

  7. #7
    roasthawg
    roasthawg's Avatar Become A Pro!
    Join Date: 11-09-07
    Posts: 2,990

    Thanks for your help Dave. The gretl seems a little out of my league as I don't know how to rewrite or compile source code. I've done a good deal of google searching without much luck so far... it's hard because I want a developed software package or excel add-on but I don't want to spend much money!

  8. #8
    maxdalury
    maxdalury's Avatar Become A Pro!
    Join Date: 05-28-09
    Posts: 67

    Use R its free can handle any amount of variables, but it won't be able to the stepwise automatically (not that I know of), unless you right your own function

  9. #9
    roasthawg
    roasthawg's Avatar Become A Pro!
    Join Date: 11-09-07
    Posts: 2,990

    Quote Originally Posted by maxdalury View Post
    Use R its free can handle any amount of variables, but it won't be able to the stepwise automatically (not that I know of), unless you right your own function
    Yeah, R would take a lot of effort for me to learn... not really worth it considering all I'm doing is linear regressions. I looked into R commander to see if it would simplify things for me but it was all looking too confusing to be worthwhile.

  10. #10
    maxdalury
    maxdalury's Avatar Become A Pro!
    Join Date: 05-28-09
    Posts: 67

    R commander is pretty easy to use. once its installed it really is easier than excel, although it doesn't have the full functionality of R but should suffice.

  11. #11
    dcbt
    dcbt's Avatar Become A Pro!
    Join Date: 04-04-08
    Posts: 185
    Betpoints: 48

    XLSTAT is an Excel add-in with a free 30 day trial. Not sure on the number of variables or your stepwise functionality, though. But I've used it for logit regression, and it's crazy easy to use, not to mention very fast. The hedge fund I used to work for used this software as well, so it must be decently powerful...

  12. #12
    Peregrine Stoop
    Peregrine Stoop's Avatar Become A Pro!
    Join Date: 10-23-09
    Posts: 869
    Betpoints: 779

    Quote Originally Posted by roasthawg View Post
    Thanks but it'll only take 20 independant variables... I need more.
    cannot imagine a sportsbetting situation w/ >20 independent variables

  13. #13
    Dave Head
    Dave Head's Avatar Become A Pro!
    Join Date: 07-22-09
    Posts: 73

    hi Peregrin Stoop,

    Quote Originally Posted by Peregrine Stoop View Post
    cannot imagine a sportsbetting situation w/ >20 independent variables
    A typical box score for a football game has at least 50 independent variables (not including individual stats).

  14. #14
    Peregrine Stoop
    Peregrine Stoop's Avatar Become A Pro!
    Join Date: 10-23-09
    Posts: 869
    Betpoints: 779

    I think you need to examine what "independent" means

  15. #15
    Dave Head
    Dave Head's Avatar Become A Pro!
    Join Date: 07-22-09
    Posts: 73

    Hi Peregrine Stoop

    Quote Originally Posted by Peregrine Stoop View Post
    I think you need to examine what "independent" means
    For example, given:

    total return yards = KO return yards + punt return yards + interception return yards

    total return yards is not independent
    KO return yards is independent
    punt return yards is independent
    interception return yards is independent

    Since there are 2 teams, that's 6 independent variables right there.

    Technically, stats regarding field goals are individual stats, but I classify them as team stats. So, I would include stats such as field goal yardage to the list.
    Last edited by Dave Head; 11-21-09 at 11:24 AM. Reason: added exception to individual stats

  16. #16
    marcoforte
    marcoforte's Avatar Become A Pro!
    Join Date: 08-10-08
    Posts: 140
    Betpoints: 396

    I use a medical research package called medcalc, $400. You can get a free trial. It does stepwise, backward and forward. Easy to use.

  17. #17
    Peregrine Stoop
    Peregrine Stoop's Avatar Become A Pro!
    Join Date: 10-23-09
    Posts: 869
    Betpoints: 779

    Quote Originally Posted by Dave Head View Post
    Hi Peregrine Stoop



    For example, given:

    total return yards = KO return yards + punt return yards + interception return yards

    total return yards is not independent
    KO return yards is independent
    punt return yards is independent
    interception return yards is independent

    Since there are 2 teams, that's 6 independent variables right there.

    Technically, stats regarding field goals are individual stats, but I classify them as team stats. So, I would include stats such as field goal yardage to the list.
    KO return yards is not independent from punt return yards
    LOL @ using interception return yards for any sort of predictive modeling

    I think you need to suss out your model some, or everything you obtain from it will be garbage

  18. #18
    Dave Head
    Dave Head's Avatar Become A Pro!
    Join Date: 07-22-09
    Posts: 73

    Hi Peregrin Stoop,

    Quote Originally Posted by Peregrine Stoop View Post
    KO return yards is not independent from punt return yards
    LOL @ using interception return yards for any sort of predictive modeling

    I think you need to suss out your model some, or everything you obtain from it will be garbage
    We're not talking about my model. We're talking about roasthawg's model. I agree that he probably won't want to use all of those variables, but that's his decision.

    Please explain the dependence between KO return yards and punt return yards, because I just don't see it. (Correlation or inverse correlation is not a dependence.)

  19. #19
    strixee
    I think, therefore I win
    strixee's Avatar Become A Pro!
    Join Date: 05-31-10
    Posts: 432

    What software would you recommend for 10-15 variables regression? My requirement is that it's for free (14-day trial is OK) and it can spit out something else than just a linear combination. A polynomial function of deg 4 is enough.

  20. #20
    mathdotcom
    mathdotcom's Avatar Become A Pro!
    Join Date: 03-24-08
    Posts: 11,689
    Betpoints: 1943

    R


  21. #21
    bztips
    bztips's Avatar Become A Pro!
    Join Date: 06-03-10
    Posts: 283

    Stepwise regression is nothing more than a data-mining technique for fools

  22. #22
    strixee
    I think, therefore I win
    strixee's Avatar Become A Pro!
    Join Date: 05-31-10
    Posts: 432

    Quote Originally Posted by mathdotcom View Post
    R

    I came across this software yesterday and it grabbed my attention, so I'll try it.

  23. #23
    uva3021
    uva3021's Avatar Become A Pro!
    Join Date: 03-01-07
    Posts: 537
    Betpoints: 381

    stata or R, no reason for anything else

    or use some sort of OOP

Top