ࡱ> 241` 0bjbj "Pjj*l6666666JRRRR$vDJ2$QSSSSSS$4 TwE6w6666QQ66= 0sJRfͭ=ҷ0_=JJ6666QUANTITATIVE VERSUS QUALITATIVE RESEARCH Quantitative Focus on broad trends Uses numbers Deductive logic: Theory > Hypothesis > Observation > Confirmation Qualitative Detail oriented Relies on thick description Inductive logic: Observe > Pattern > Tentative Hypothesis > Theory Qualitative grand tour question a few sub-questions does not limit inquiry the question changes Quantitative specific question and variables guiding hypothesis question remains unchanged Summary of Research Design and Communication of Quantitative Information: Question? Model/Conceptualize? Operationalize? Data Availability? Analysis? Ultimately, have you answered the question? FUNDAMENTALS OF STATISTICS, DESCRIPTIVE AND INFERENTIAL Types of Data: Primarydata you collect to answer your question Secondarydata collected by other people for other purposes Question to ask yourself: Does secondary data meet the needs of your study? Particularly UofA. Scales of Dataclassifies and tells us what can be done with the data. Nominalmeasures the presence or absence of a characteristic of the data. Example: house architectural styles on your block: Victorian, Tudor, Cape Cod, Modern, etc. Fourth best type of data: can only add and subtract. Ordinalmeasures the presence or absence of a characteristic of the data using an implied ranking or order. Categories must be exhaustive and mutually exclusive. (exhaustive means they cover all possible types; mutually exclusive that data points can only fit into one type). Example: Freshman, Sophomore, Junior, Senior; Small, Medium, Large (what about the fast food problem of a medium and large?) Third best type of data: can only add and subtract. Intervaluses equal, fixed units to measure a characteristic, but with no natural zero. Where natural zero implies the complete absence of the characteristic. Example: temperature Second best type of data: can only add and subtract. Ratiouses fixed units to measure a characteristic, but has a fixed natural zero. Example: income measured in dollars. Height in inches. Best type of data: can add, subtract, multiply and divide. Examples: Evaluate a class on a one to five Lickert Scale where one is poor and five is excellent. What scale of data is this? Ordinal N=100 students. Three circumstances: If the mean = 1.0 or 5.0 it is clear that all were agreed. BUT what if the mean was 3.0? What problem do you see? We cannot know what this means about the data. Did they all say 3 or some 5 and some 1? Example of interval scale: temperature. Is ten degrees twice as hot as five? Does that make any sense? Cant multiply/divide temperature. For proof: is five C multiplied by two the same as five F multiplied by two? What about Kelvin? Kelvin, with a real zero, is ratio scale. Example of ratio scale data: as a professor at UW, I make $18,000. My colleague at Oregon makes $36,000. Does he actually make twice as much as I do? In the summer, both of our salaries are $0. Does this reflect the absence of the characteristic dollars in our summer salary? Soratio data should be obtained at the data collection level whenever possible, depending on the analysis planned. The scale of data should be considered before applying statistical processes, in fact, as part of the research design. Dichotomous Variables Many social science variables are binary. Examples: Employed, unemployed. Go to war, or not. Gender. Dichotomous variables can be nominal, unordered categories; or they can be metric (the general term for interval and ratio scale) by assigning the value of 1 to one category and 0 to the other. This is called dummy-variable scoring. We will cover this more in multiple regression. DESCRIPTIVE STATISTICS Descriptive statistics are just thatdescriptive. They tell us about the data. Like administrative instead of analytical research. They are important because in order to make conclusions about datato bring it to the level of information, we need to know its characteristics. Several questions need to be addressed when describing a variable. Well cover all of these: 1. How it was measured, including UofA and scale of data. 2. How are the results distributed? (Upcoming topic, including displaying results) 3. What is a typical result? (measures of central tendency) 4. How typical is typical? What is the variation of results? 5. How does the sample compare to the broader population? (Starts us into inferential stats) Definitions: Population: all observations of a particular variable. Example: size of every U.S. cityobserve/measure ALL cities for a population. Sample: subset of the population. Example: selecting 50 cities from all US cities. Frequency Distribution, Sampling Distribution, Normal Curve FD is the distribution of observations in one sample. SD is the distribution of parameters of many samples. Example: grading curve, everyone is familiar, within one class of students, versus grading curve of all the classes (effectively many samples). Many observed phenomena generate random variables (in large samples) with essentially normal distributions. Normal means: Bell-shaped curve. Symmetrical around mean (equal number of values above and below). Spread determined by standard deviation. Special case: Standard normal distribution has =0 and =1 SAMPLING The process of selecting units (such as people or organizations) from a population of interest so that by studying the sample you can fairly generalize your results to the population from which those units were chosen. Types of sampling: Probability and non-probability Random (Probability Sampling, allows to judge how well sample reflects population): Simple Random: even odds of selection for every case. Systematic Random: random sampling by cluster (ex. pick one from the first hundred names, then one from the next hundred, and so forth) Stratified Random: population divided into strata or classes, from which samples are drawn Cluster Random: clustered by area, like city block, can be more efficient than random. Non-random (Non-Probability Sampling): Convenience, can be targeted. (purposive) Purposive Sampling: Expert, Heterogeneity, Snowballing Note random sampling versus random assignment! Descriptive Stats questions: What is a typical resultmeasures of central tendency: Typically thought of as the average. Example: Asking about salaries in a firm when you interview. Whats going on with these results? Are they lying to you? Ex. 2003 Bush tax cuts: 92 million people will receive an average tax cut of $1,000. In actuality, the tax cut was over $90,000 for those making over $1 million. The median family or average familys tax cut was $100. Source: Paul Krugman, lecture 10/8/03. Three major ways of measuring center: 1. Modecategory occurring with the greatest frequency. It must be used if you have nominal data because cant take the average of nominal data! Functions as a best guess of what the typical case is. Four Problems: Not very descriptive, may not be unique (bimodal distribution), highly affected by sampling distribution, Very sensitive to categorical classification. 2. MedianThe category in which the middle case falls. Appropriate for ordinal data, also good for metric data. If N is odd, its the middle, if N is even, median is between the two middle values. Not affected my extreme values. 3. Meanthe sum of all values divided by the number of cases. Used frequently on metric data, More stable over repeated samples than other measures of center, Other stats are based on deviation from mean (like variance). Three Problems: Can take on fractional values, even when they are meaningless to the data, Cant be computed when extreme categories of variable are open ended (ex. age, salary), Is highly sensitive to extreme values. There are other specific calculations for measures of central tendency, such as weighted means, and means of grouped data not covered here, not difficult to find, or do. Measures of Dispersion Rangehighest value minus lowest value in sample. Example: (10,20,20,40,80) Range = 80-10 = 70 Clearly of limited valueare the values spread or concentrated? Deviationhow much a value differs from the mean, measured in several ways:  EMBED Equation.3  (deviation of ith value equals the ith value minus the mean) To eliminate negative distances from the mean, we square the results in finding the sample variance:  EMBED Equation.3  We then use this to calculate the standard deviation, s, of the sample set:  EMBED Equation.3  Steps for computing standard deviation for a population: Calculate the mean  EMBED Equation.3  Subtract the mean from each value  EMBED Equation.3  Square these values  EMBED Equation.3  could use mu instead of X-bar Sum the total  EMBED Equation.3  EMBED Equation.3  Divide the sum by the total number of cases  EMBED Equation.3  Take the square root  EMBED Equation.3  For the SD of a sample, just use n-1 in denominator, xs lower case, and mu changes to x. Why n-1? Using n-1 gives a better prediction of the SD of the population. The denominator in this case refers to a concept called degrees of freedom. Degrees of freedom is the number of quantities that are unknown minus the number of equations linking them. Example: suppose you had two variables, x and y, where values were unknown. You could assign any value to x and y. You would have two degrees of freedom. If, though, you know that x+y=20, then you only have one degree of freedom because once you assign one value you know the other. 68,95,99 & 90, 95, 99 Rules: 68% of all measurements will fall within one standard deviation of the mean, 95% within two, and 99% within three standard deviations of the mean. More precisely: 90% of data falls within +-1.64 standard deviations from the mean, 95% w/in +/-1.96 std dev, and 99% w/in +-2.58 std deviations. Notes on Notation Sample Statistics Population Parameters Mean  EMBED Equation.3 (x-bar)  (mu) Variance s2 2 Standard deviation s  (sigma) Z score If you need to compare scores in different distributions that have different means and standard deviations, you can convert them to standardized, relative scores, called z scores:  EMBED Equation.3  which is expressed in standard deviations and thus follows the 68-95-99 rule. Example: A Wisconsin school district administers a test to its students and the mean of scores is 700 with a standard deviation of 100. An Illinois district offers a similar test to students, and it has a mean score of 100 and a standard deviation of 10. How does a score of 600 in Wisconsin compare to a score of 75 in Illinois?  EMBED Equation.3 =  EMBED Equation.3  standard deviation below the mean in Wisconsin,  EMBED Equation.3 = EMBED Equation.3  standard deviations below the mean in Illinois. 6KLXYj% B C T ] Z s %-}r} (XjV6] 5CJ\aJaJ CJaJaJCJ5\Y)6LY$;Ph~() & F8 & F:  & F: ! !0XYJ n C X  S  !h^h  !V^V & F' !VV^V` !S T ^ %ab{|} Eq !qr(YZ@  & F; ! & F; !6lO01`aSTWX K [  & F' ! ! & F? V^V` & F@ & F@ VV^V`V !!!B$Y$Z$_$$$$%I%J%]%^%_%`%%%&&&&&&&k&l&&&&&&&&&&& ' ''' '!'6'7'J'» jEHUjc;C CJUVaJ jEHUj,;C CJUVaJ jEHUjC CJUVaJ j/EHUjC CJUVaJ jUj6EHU]j C CJUVaJj6U]6]5\ 56\]2[ r .!g!!!!"<"}"""#n###A$B$Y$Z$  & F+ !  & F* !  & F) ! !  !h^h  & F( !Z$$$$$I%%%&&&&k&&&&&"'m'''(( )){)|)*  & FE ! !J'K'L'M'{'|'''''''''''''''(((((((*+++.,>,@,f,h,j,l,,,,,9.:.M.N.tj6;C CJUVaJH* j"EHUjYC CJUVaJ>*5\ j|EHUj7;C CJUVaJ jEHUjo;C CJUVaJ jEHUj+;C CJUVaJ jN EHUj;C CJUVaJ jU j EHUja;C CJUVaJ-***J++++.,,,,,-8.9...//N0O000^ !N.O.P.//0000000000O0P0c0d0e0f0g0h0{0|0}0~00 j!EHUj;C CJUVaJ jEHUjn;C UV jxEHUj;C CJUVaJ j5EHUje;C UV jU jEHU 1h/ =!"#$%/DdJ  C A? "2ڈ_uJWmDd`!eڈ_uJWV` H3xcdd``f 2 ĜL0##0KQ* Wt1MRcgbR C0nĒʂT/&`b]F"L 0ro.dabM-VK-WMcxvK{ f3ˈ+џ$SQ ?dB[1)gdo3HE&ܞ_ >@I9 @,\ n, H.ܞ`VrAc KS v 0y{Ĥ\Y\ qAc(\~d_DdDJ  C A? "2 1 F8"S |sd`!1 F8"S |: (+xڥS;KP>$}$ :TPGtBmj6ǂ?g!GREqPkxJՄ2]~XHo<[d2. aĩ5x,L]jӵit|u;4bh $8{`]c@>S`_pM>$ 3W(kŮYkbcE)pQ"˴#pu'$ 3<>I(ț&G7*ނt"ͬS*@-Mg~ME$D4-?xT}7󸨰Nu7=u W2j[uZޓ2Ė|G~_~ j0:p\yD)$SE.&D8:KGNjcMV?DdHJ   C A? "2g7uTE+}d`!ug7uTE+@"hCxڕRJ@]rX ^k"D' nR+kl &pnX̛7E!xZ;T#Bt*q G%EUF;K+ !`{٥28=8N&I"hl,Zuz{FB57̰>S[Xj$OO7U~@OėY'i^goC{ D?KOSumG`udt#ؑ(qQ߶jN囉,[+ecM@'i&OKb ],Dd@J   C A ? "2Dg6G} m [d`!g6G} mfH xcdd``$d@9`,&FF(`T A?dA뀘7$# A( XXbS% d1D YB2sSRsW|)bF:`0F0d"*.b;OF&&\ J$]0;]`q"7Dd@J   C A ? "2pz<#d+@ϿGL= d`!Dz<#d+@ϿG xcdd``~$d@9`,&FF(`TERcgbR  :aP5< %! @_L T̠1[)% dD YB2sSRsW|)bF:`0r1d"*a|P0@penR~CP>v${΀][98t v 0y{Ĥ\Y\ qAC:t@bEdGDdJ   C A ? "2 C\'l(HyYK d`!} C\'l(HyYKxڕK@ƿ{M uDt謸jQA -ۜ:"].*@|s. *p)S$U$1jUZW> 3&Dqm/YOۭ^;PW!)SC/Q){PweՖ:n*:\ ^B ݍCߚe sV:e5H[<7R;V,OÅdnp]^'j]"ϔK;,3qG5WY4='?o?BQULI+=äɞ\ ŦHg MtgDdTJ   C A ? "24yS  d`!4yS   ȽXJkxcdd``d!0 ĜL0##0KQ* Wä2AA?H1Zc@øjx|K2B* R\``0I3YDdJ  C A ? "2 Z1YAN=d`! Z1YAN=. xt]xcdd``$d@9`,&FF(`TS A?dbv@P  !"#$%&'(*+,-./0T3678:9;=<>@?ACBDEFHGIKJLMNPOQSR_^UVWXYZ[\]bcdefghijklmnopqrstuvwxyz{|}~Root EntryS FpFs5Data )n#WordDocumentR"PObjectPoolU@bspFs_11258334826F@bs@bsOle CompObjfObjInfo "%*-0369<ADHKNOSVY[\]^_`acdefgi FMicrosoft Equation 3.0 DS Equation Equation.39q-c4> d i =x i "2x FMicrosoft Equation 3.0 DS EqEquation Native I_1125817767 F@s@sOle CompObj fuation Equation.39q/^d= s 2 =(x i "2x) 2i=1n " n"1 FMicrosoft Equation 3.0 DS Equation Equation.39qObjInfo Equation Native  _1125819640 F`8s`8sOle  CompObjfObjInfoEquation Native F_1127991852'F@s@s/*xcD> s= s 2 FMicrosoft Equation 3.0 DS Equation Equation.39q8!dC XOle CompObjfObjInfoEquation Native 0_1127991907F `s `sOle CompObjfObjInfo FMicrosoft Equation 3.0 DS Equation Equation.39q8`, X"X FMicrosoft Equation 3.0 DS Equation Equation.39qEquation Native 8_1127992161"FzszsOle CompObj fObjInfo! Equation Native !T_1127992314$F`s`sOle #88,, X"X() 2 FMicrosoft Equation 3.0 DS Equation Equation.39q8`0A CompObj#%$fObjInfo&&Equation Native '#_11279926191)FssOle (CompObj(*)fObjInfo++Equation Native ,d FMicrosoft Equation 3.0 DS Equation Equation.39q8Hh X"X()  "  2 FMicrosoft Equation 3.0 DS Eq_1127992687.FPsPsOle .CompObj-//fObjInfo01uation Equation.39q8V(&Ԟ X"X()  "  2 N FMicrosoft Equation 3.0 DS Equation Equation.39qEquation Native 2r_1127992887,@3F s sOle 4CompObj245fObjInfo57Equation Native 8{_11258197378F0&s0&sOle :8_'  X"X()  "  2 N  FMicrosoft Equation 3.0 DS Equation Equation.39q/`{T+ 2xCompObj79;fObjInfo:=Equation Native >-_1127994678=FssOle ?CompObj<>@fObjInfo?BEquation Native CJ FMicrosoft Equation 3.0 DS Equation Equation.39q8.(qA Z=X"2XS8.pMVw Z=X"2X_1127994725;MBFMsMsOle EObjInfoACFEquation Native GJS FMicrosoft Equation 3.0 DS Equation Equation.39q8r8dC 600"700100="100/100="1.0_1127995537FFph!sph!sOle ICompObjEGJfObjInfoHLEquation Native M_1127994734KFP%sP%sOle PObjInfoJLQ8.pMVw Z=X"2XS FMicrosoft Equation 3.0 DS Equation Equation.39qEquation Native RJ_1127995052IDOF0*s0*sOle TCompObjNPUfObjInfoQWEquation Native X~1TableaSummaryInformation(TZ8bdC 75"10010="25/10="2.5Oh+'0 4@ \ h t)QUANTITATIVE VERSUS QUALITATIVE RESEARCHordUANHit-Enter-NowERit-it- Normal.dotoHit-Enter-NowER1t-Microsof5< %!  @_L ĺEf1HX@7ȝ>r  W&0;YlW|)bF:`222d4*1"v$s3@*4*g@47$37X/\!(?71;!2Aplqd2gx 5p#ν LA*t1e$)d RQ 3A\ДN}S v 0y{镑I)$5ウ6p(3XچWnDdJ  C A? "2GwOLzVRd`!GwOLzVRe `\rxڕR=OA8I$ 1ha:+Hh09PHB-4xh޽7vHD |9PaѦX.xZt d1ni1=VZ礪=@ohvEB {aa!t:߱>zk.NUj@H >#ZBlN nWY]Jw|߯?^EYL,}}b$s%o&LxjH?Š2 YA,a k#DLDL AtOA $37X/\!(?71!pŗ( f3߈k s* H&c G &?~P@penR~CPv;3Ѕ3 #J.hqCb nv0o8N+KRsA<.h2]wLNMnCDdt0  # A 23u5vz.Xyd`!3u5vz.XV  @2axmR1KPEctq\E: +hZ(**+P2cqr8wp'88){ E|ݻ/wA(X/A[ BʲLPo5+Agd*<όlSfooW1 {HgeUP߽EGRPhѓ}2ɏξ6:OapAһq %A_sV'Q wr2PEir6q׶>a_tT:wE44;̻nWYZ?Zq?lx";\V{P6"QQ,l9Ec"~WIYDd lJ  C A? " 2c~U.e[t/d`!c~U.e[t/]xmR1K@~R&)6"Eĭ5:IUA۩ &ZY8?C@AݥЃ|{w,-kk  LTXњ3W6 BfpO=S7ha?:&yD  ^F@_uO9o#0~4>~= iO)];1^/qK2!ysO]ۉc0 svlycRJ1j8D`e7t)s:$b}53\wv4MGq6dxCl9WuՅd%N2!3[ϑt)ZQ Y&CDdt0  # A23u5vz.Xd`!3u5vz.XV  @2axmR1KPEctq\E: +hZ(**+P2cqr8wp'88){ E|ݻ/wA(X/A[ BʲLPo5+Agd*<όlSfooW1 {HgeUP߽EGRPhѓ}2ɏξ6:OapAһq %A_sV'Q wr2PEir6q׶>a_tT:wE44;̻nWYZ?Zq?lx";\V{P6"QQ,l9Ec"~WIZDd lJ  C A? "2IiG#lߨX!d`!IiG#lߨj^xmJP9Wj:8NBqWЖNV(4Bt 8;8:"{*4p8/d9lJ!DZn\1ÚXʰ2>6nw5K8'@"Xvo h L|]Z l v\aCZB.L/?oDfsnI @ҖQp.X_5>y^l.|YU"\k %T+{&r% )Z )p`-2<_un.Z _/&wq0Z76JңL 8PT\:-z`;dWTBjj%;wB E"w Fn`QFNNhLIGzB;a(kJ<~hKhL`pEK>)t _/|YU"~}|Ra l "`hLL 8B'@ E=4b\qhWK