How important is it in ensuring that assumptions for regression are not violated when sports modelling? It seems to me that all the variables that I'm investigating have assumption violations of one or more criteria.
Should I be transforming these variables to ensure the assumptions for regression are met? I've toyed with this already, but the predictive value (r squared) of these equations are lower than they were when I didn't worry about assumption violations. I'm not too sure of what to think of this.
Maybe I should be identifying and removing outliers to meet the regression assumptions?
For the record, I'm using a population of 447 cases.
Anyone have any thoughts?
Should I be transforming these variables to ensure the assumptions for regression are met? I've toyed with this already, but the predictive value (r squared) of these equations are lower than they were when I didn't worry about assumption violations. I'm not too sure of what to think of this.
Maybe I should be identifying and removing outliers to meet the regression assumptions?
For the record, I'm using a population of 447 cases.
Anyone have any thoughts?