For NCAA football, I have a classification tree algorithm using the past 10 years of team data and my own ranking system. I am using the following equation to setup confidence intervals for my measuring accuracy for specific leaf nodes in my system:
p= (2 * N * (measured accuracy) + Z^2 +/- (SQRT(Z^2 + (4 * N * (measured accuracy) - 4 * N * (m accuracy)^2))
-------------------------------------------------------
(2 * N * Z^2)
Z=1.96 for 95% confidence.
My issue is when the N (# of events) gets small. There are several situations where the accuracy is high (90-100%) but the N is between 10 and 20. I know that standard confidence interval equations are accurate when the N is larger (>30 minimum).
Does anyone have a program or method of calculating confidence intervals with small sample sizes? I am wary of putting down $$$ without confirming the signifigance of these smaller nodes.
RadBet
p= (2 * N * (measured accuracy) + Z^2 +/- (SQRT(Z^2 + (4 * N * (measured accuracy) - 4 * N * (m accuracy)^2))
-------------------------------------------------------
(2 * N * Z^2)
Z=1.96 for 95% confidence.
My issue is when the N (# of events) gets small. There are several situations where the accuracy is high (90-100%) but the N is between 10 and 20. I know that standard confidence interval equations are accurate when the N is larger (>30 minimum).
Does anyone have a program or method of calculating confidence intervals with small sample sizes? I am wary of putting down $$$ without confirming the signifigance of these smaller nodes.
RadBet