Assumptions in Correlation and Regression Models

Both correlation and regression models are based on the general linear model, $\hat{Y} = a + b_1X_1 + \cdots + b_pX_p$, but they differ with respect to whether the X variables are considered random or fixed. In the correlation model they are considered random; that is, the values of the X variables obtained in the sample and the number of cases obtained at each level of the X variables are random, so another sample from the same population would yield a different set of values of X and different probability distributions of X. In the (fixed) regression model the values of X and their distributions are assumed to be, in the sample, identical to those in the population.

Some have argued that the correlation coefficient is meaningless in a regression analysis, since it depends, in large part, on the particular fixed values of X obtained in the sample and the probability distribution of X in the sample (see Cohen & Cohen, 1975, p. 5). While this dependence of r on the distribution of X in the sample is certainly real, it does not, IMHO, necessarily follow that R and R² are not useful statistics in a regression analysis, as long as the reader understands that their values depend, in part, on the fixed values and distribution of X.

The fixed regression model fits best with experimental research where the researcher has arbitrarily chosen particular values of the X variables and particular numbers of cases at each value of each X. In this context, the fixed regression model is most often called the Analysis of Variance model. It is, however, common practice to apply the regression model to data where the X variables are clearly not fixed.

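The dependence of r on the chosen values of X can be illustrated with a small simulation. This is only a sketch under assumed values: the intercept, slope, error standard deviation, the two sets of X levels, and the helper names `simulate_y` and `slope_and_r` are all hypothetical and do not come from any of the cited sources. The same linear model generates Y in both designs; only the spread of the fixed X values differs.

```python
import random


def simulate_y(x_values, intercept=1.0, slope=2.0, error_sd=3.0, seed=0):
    """Generate Y = intercept + slope * X + normal error (assumed values)."""
    rng = random.Random(seed)
    return [intercept + slope * x + rng.gauss(0.0, error_sd) for x in x_values]


def slope_and_r(xs, ys):
    """Return the least squares slope and the Pearson correlation r."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sxx, sxy / (sxx * syy) ** 0.5


# Two fixed designs, 200 cases at each of three X levels.
narrow_x = [x for x in (4, 5, 6) for _ in range(200)]    # little spread in X
wide_x = [x for x in (0, 5, 10) for _ in range(200)]     # much more spread in X

narrow_slope, narrow_r = slope_and_r(narrow_x, simulate_y(narrow_x))
wide_slope, wide_r = slope_and_r(wide_x, simulate_y(wide_x))

print(f"narrow design: slope = {narrow_slope:.2f}, r = {narrow_r:.2f}")
print(f"wide design:   slope = {wide_slope:.2f}, r = {wide_r:.2f}")
```

In both designs the estimated slope should land near the assumed value of 2, while r should be clearly larger in the wide design. The regression coefficient estimates the same quantity either way; r reflects, in part, which X values the researcher happened to choose.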
When you use t or F to get a p value or a confidence interval involving ρ, you assume that the joint distribution of X and Y (or the Ys) is bivariate (or multivariate) normal. When the distribution is bivariate normal, it is also true that the marginal distributions of X and Y are both normal, the conditional distributions of X given Y and of Y given X are all normal, the variance in X does not change with Y, and the variance in Y does not change with X (see Winkler & Hays, 1975, pp. 644-652).

When you use t or F to get a p value or a confidence interval involving regression coefficients, you make no assumptions about the X variables. You do assume that the distribution of Y is normal at every value of X and that the variance in Y does not change with X. This assumption can be restated in terms of the error term: the distribution of the error term (the residuals) is normal at every value of X and constant in variance across values of X.

Pedhazur (1982) stated the assumptions of regression analysis as follows:

1. X is measured without error. Now there is an assumption that will always be violated.
2. The population means of Y|X are on a straight line. I consider this to be part of the tested null hypothesis; that is, we are testing a linear model.
3. X is fixed, Y is random.
4. For each observed Y, the mean error is, over very many replications, zero.
5. Errors associated with any one observed Y are independent of errors associated with any other observed Y. The Durbin-Watson statistic can be used to evaluate one form of non-independence: the situation where the value of a given case is related to the values of cases which are adjacent in the data stream. This is likely when cases are entered in temporal or geographical order. The Durbin-Watson statistic is available under Statistics in SPSS Regression, and its value is included in the Model Summary table in the output. It is often recommended that one need not worry if the Durbin-Watson statistic is within the range of 1.5 to 2.5.
6. The error variance is constant across values of X. If you encounter heteroscedasticity, you may be able to resolve it by transforming Y. You may also consider weighted least squares regression (http://www2.chass.ncsu.edu/garson/pa765/wls.htm) instead of ordinary least squares regression.
7. The errors are independent of X. This sounds like homogeneity of variance to me.

Pedhazur states that these assumptions are necessary for the obtained estimators to be best linear unbiased estimators. If t or F will be employed for tests of significance, it is also assumed that the errors are normally distributed.

References

Cohen, J., & Cohen, P. (1975). Applied multiple regression/correlation analysis for the behavioral sciences. Hillsdale, NJ: Lawrence Erlbaum.

Pedhazur, E. J. (1982). Multiple regression in behavioral research (2nd ed.). New York: CBS College Publishing. [especially Chapter 2]

Winkler, R. L., & Hays, W. L. (1975). Statistics: Probability, inference, and decision (2nd ed.). New York: Holt, Rinehart, & Winston.

Links

Bivariate Linear Correlation: http://core.ecu.edu/psyc/wuenschk/docs30/corr6430.doc
Bivariate Linear Regression: http://core.ecu.edu/psyc/wuenschk/docs30/regr6430.doc
Testing of Assumptions: http://www2.chass.ncsu.edu/garson/pa765/assumpt.htm#normal
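
The Durbin-Watson check mentioned under the independence-of-errors assumption can be sketched in a few lines. The statistic is d = Σ(e_t − e_{t−1})² / Σe_t², computed on the residuals in data order; values near 2 suggest no first-order autocorrelation, and values well below 2 suggest positive autocorrelation. The example below is an illustration only: it applies the formula to simulated error series rather than to residuals from a fitted regression, and the helper `ar1_errors`, the carry-over value 0.8, and the sample size are assumptions made for the demonstration.

```python
import random


def durbin_watson(residuals):
    """Return d = sum((e_t - e_{t-1})^2) / sum(e_t^2), residuals in data order."""
    numerator = sum(
        (residuals[t] - residuals[t - 1]) ** 2 for t in range(1, len(residuals))
    )
    denominator = sum(e ** 2 for e in residuals)
    return numerator / denominator


def ar1_errors(n, rho, seed):
    """Hypothetical helper: each error carries over a fraction rho of the previous one."""
    rng = random.Random(seed)
    errors = [rng.gauss(0.0, 1.0)]
    for _ in range(n - 1):
        errors.append(rho * errors[-1] + rng.gauss(0.0, 1.0))
    return errors


# rho = 0 gives independent errors; rho = 0.8 mimics cases entered in temporal order.
independent = ar1_errors(n=500, rho=0.0, seed=1)
autocorrelated = ar1_errors(n=500, rho=0.8, seed=1)

d_independent = durbin_watson(independent)
d_autocorrelated = durbin_watson(autocorrelated)

print(f"independent errors:    d = {d_independent:.2f}")
print(f"autocorrelated errors: d = {d_autocorrelated:.2f}")
```

With independent errors, d should fall inside the 1.5 to 2.5 range cited above; with strong positive carry-over between adjacent cases it should fall well below 1.5. In practice you would pass in the residuals from your own regression, sorted in the order in which the cases were collected.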

Karl L. Wuensch, ECU