AP Notes, Outlines, Study Guides, Vocabulary, Practice Exams and more!

AP Statistics Review 2ICE Flashcards

Terms : Hide Images
95782980195 number summaryMin, Q1, Med, Q3, Max 25% of data between each number The five numbers that help describe the center, and spread of data0
9578298020z scorestd dev from mean -Number of standard deviations a score is above or below the mean (positive above, negative below1
9578298021standard deviationA statistical measure of how far away each value is, on average, from the mean. A measure of spread. Specifically, the typical distance the data points are from the mean.2
9578298022ParameterFrom the whole population- usually unknown We use statistics to estimate them:3
9578298023categorical dataData that can be placed into categories . For example "gender" is a categorical data and the categories are "male" and "female". Labels or names used to identify categories of like items4
9578298024quantitative dataNumbers that it makes sense to average age is quantitative5
9578298025bar grapha type of graph in which the lengths of bars are used to represent and compare data in categories A graph that uses horizontal or vertical bars to represent data.6
9578298027sampleA relatively small proportion of people who are chosen in a survey so as to be representative of the whole. a small part of a population that represents the whole A survey in star city representing the entire state of arkansas7
9578298028randomAssigning participants to experimental and control conditions by chance. Create equal groups Assigning subjects to experimental groups based on chance. pulling names or numbers out of a hat8
9578298029biasAny systematic failure of a sampling method to represent its population Any way that tampers with the accuracy of the sample9
9578298030Undercoverageleaves out people...no phone...no house.... using a list that leaves people out! When some groups in the population are left out of the process of choosing the sample10
9578298031nonresponsebias introduced to a sample when a large fraction of those sampled fails to respond When many people of a sample do not respond11
9578298032voluntary response biasBias introduced to a sample when individuals can choose on their own whether to participate in the sample.12
9578298033statisticApplication of mathematics to describing and analyzing data13
9578298034independent(statistics) a variable whose values are independent of changes in the values of other variables14
9578298035historgramgraphical representation of a frequency distribution using vertical bars but bars touch each other to indicate variables are related15
9578298036box plotA dsiplay that shows the distribution of values in a data set seperated into four equal-sized groups. A box plot is constructed from the five number summary of the data.16
9578298037scatterplotA graphed cluster of dots, each of which represents the values of two variables. The slope of the points suggests the direction of the relationship between the two variables. The amount of scatter suggests the strength of the correlation (little scatter indicates high correlation).17
9578298038correlationr ranging from -1 to +1 ( very linear) describes positive or negative describes how close to a line r = -.23 weak negative18
9578298039skewnessThe extent to which cases are clustered more at one or the other end of the distribution of a quantitative variable rather than in a symmetric pattern around its center19
9578298040variancecommons measure of spread about the mean as center std dev squared20
9578298041statistical significanceP value< alpha Reject the Ho Too unlikely to happen by chance alone.....something caused it!21
9578298042P-valueThe probability of finding the sample or more extreme if the Ho was true. Small P makes us believe the Ho is not true.22
9578298043empirical ruleThe rules gives the approximate % of observations w/in 1 standard deviation (68%), 2 standard deviations (95%) and 3 standard deviations (99.7%) of the mean when the histogram is well approx. by a normal curve23
9578298044lurking variableA variable that has an important effect on the relationship among the variables in a study but is not one of the explanatory variables studied24
9578298045null hypothesisHo: Hypothesis that predicts NO relationship between variables. The aim of research is to reject this hypothesis No effect No difference25
9578298046alternate hypothesisHa is the hypothesis to be considered as an alternative to the null hypothesis. The null hypothesis will be rejected in favor of the Ha only if the sample data strongly indicate that the null hypothesis is false. p< alpha26
9578298048probabilityA number with a value from 0 to 1 that describes the likelihood that an event will occur. example, if a bag contains a red marble, a white marble and a blue marble then the probability of selecting a red marble is 1/3.27
9578298049descriptive statisticsSOCS Shape Outliers Center Spread..... CONTEXT CONTEXT CONTEXT!28
9578298050meanA measure of center in a set of numerical data, computed by adding the values in a list and then dividing by the number of values in the list.29
9578298051medianA measure of center in a set of numerical data. The median of a list of values is the value appearing at the center of a sorted version of the list - or the mean of the two central values if the list contains an even number of values.30
9578298052modeMeasure of central tendency that uses most frequently occurring score.31
9578298053rangeDistance between highest and lowest scores in a set of data.32
9578298054dataFacts and statistics collected together for reference or analysis33
9578298055Q1A location measure of the data such that has one fourth or 25% of the data is smaller than it. Found by dividing the ordered data set in half (excluding the middle observation if n is odd) and finding the median of the lower half of the data.34
9578298056Q3A location to measeure when counting data to such as the median where instead of counting 50% it is 75% from the beginning of the sorted data35
9578298057minimum(n.) the smallest possible amount; (adj.) the lowest permissible or possible36
9578298058outlierA value much greater or much less than the others in a data set37
9578298059margin of errorIn statistical research, the range of outcomes we expect for a population, given the data revealed by a sample drawn from that population38
9578298061simple random samplesample from the whole group...no dividing A sample of size n selected from the population in such a way that each possible sample of size n has an equal chance of being selected.39
9578298062sampling distributionDistribution of sample proportions from sample to sample. A sampling distribution of a sample statistic for a fixed sample size n is the distribution of that statistic derived from every possible sample of size n for a given population. Bigger sample...tall and skinny (less variable)40
9578298063stratified random sampleA method of sampling that involves dividing your population into homogeneous subgroups( like freshmen, sophomore, junior, senior) and taking a simple random sample in each subgroup. a sampling design in which the population is divided into several groups, and random samples are then drawn from each stratum41
9578298064systematic sampleA sample drawn by selecting individuals systematically from a sampling frame ( like take every 10th person in line) A sample drawn by selecting individuals systematically from a sampling frame. When there is no relationship between the order of the sampling frame and the variables of interest, a systematic sample can be representative.42
9578298065cluster sampleIs obtained by selecting all individuals within a randomly selected collection or group of individuals.43
957829806610% rulea sample has to be less than 10% of the whole population Pop> 10 n (Independent assumption)44
9578298067ExtrapolationPredicting outside the domain of the data45
9578298069theoretical probabilityA probability obtained by analyzing a situation. If all of the outcomes are equally likely, you can find the theoretical probability of an event by listing all of the possible outcomes and then finding the ratio of the number of outcomes producing the desired event to the total number of outcomes. For example, there are 36 possible equally likely outcomes (number pairs) when two fair number cubes are rolled. of these six have a sum of 7, so the probability of rolling a sum of 7 is 6/36 or 1/646
9578298128binomial probability2 outcomes( success, failure) F ixed trials (n) I ndependent S ame prob each time (p) P(x = 3)= binompdf(n,p,3) P(x<3)= binomcdf(n,p,2)...stops at 247
9578298070block designThe subjects in an experiment are first divided into groups (called 'blocks') based on some common characteristic (such as gender) that is hypothesised to have an effect on the response. Randomization of treatments then happens within each block (each block is like its own mini-experiment)."48
9578298072double blindAn experiment in which neither the subjects nor the people who work with them know which treatment each subject is receiving Neither the subjects nor the people who have contact with them know which treatment a subject received49
9578298073placeboA fake treatment. A chemically inert substance that produces real medical benefits because the patient believes it will help her50
9578298074least squares regression linethe line with the smallest sum of squared residuals51
9578298075type I errorReject the Ho could result in Type 1 error We rejected the Ho...but it was right.52
9578298076type II errorFail tooooooo reject the Ho....but it was wrong53
9578298077matched pairsself match- you take both treatments (randomized) A....B or B...A match with someone else- one gets one treatment one gets the other54
9578298078conditional prababilityprobability given that something else has already occurred P(B/A)= P(A&B)/P(A)....with a tree diadram or another way: drawing a 3 red cards: (26/52)(25/51)(24/50)55
9578298079sample spaceSet of all possible outcomes of an experiment56
9578298080confounded variableA variable whose effect on the response variable cannot be separated from the effect of the explanatory variable on the response variable. (Note: Usually confounded variables are lurking variables but only a few lurking variables are also confounded.)57
9578298081marginal distributionIn the margin (ONLY USE TOTALS)58
9578298082coefficient of determinationr squared Represents the amount of variance accounted for by that correlation. % of variation in y explained by x59
9578298085bimodalA type of distribution, where there is two or more categories with an equal count or cases and with more cases than the other categories. A distribution with two modes60
9578298086experimentA kind of research in which the researcher controls all the conditions and directly manipulates the conditions, including the independent variable. A treatment Can establish cause and effect61
9578298087law of large numbers(statistics) law stating that a large number of items taken at random from a population will (on the average) have the population statistics62
9578298088extrapolationcalculation of the value of a function outside the range of known values63
9578298090IQRA measure of variability, based on dividing a data set into quartiles Q3-Q1 Spread Difference between upper and lower quartile of a boxplot64
9578298091Confidence intervalinterval that we are ____ confident captured the true mean: proportion or mean, or slope, or difference in proportion, or diff in means If 0 is not in the interval...we have sig evidence of a difference. we are 95% confident that etc.65
9578298092Standard Errormargin of error without the t or the z Standard deviation of sampling distribution66
9578298130Residualy - yhat ex. yhat = 40 + 10x ( grade for study) a student studies 5 hrs makes a 100 would have a residual of 100- (40 + 10(5))= 10.....the student made 10 more points than we predicted.67
9578298093Convenience sampleWhenever a sample is taken it gives an improper results because the sample was taken from a very convenient area instead of representing a population68
9578298094simulationA representation of a situation or problem with a similar but simpler model or a more easily manipulated model in order to determine experimental results.69
9578298095degrees of freedomn-1 x2 chart (R-1)(C-1) LINREG N-270
9578298096two way tableA table containing counts for two categorical variables. It has r rows and c columns. describes to categorical variables with row variable and column variable71
9578298097spreadThe visible variation in a sample distribution IQR STD DEV Range variance72
9578298098centerThe measure of the distance the mode is from the center of a distribution Mean Median73
9578298131shapeYou tell me.74
9578298132discrete random variablecountable75
9578298133central limit theoremAs the sample size gets larger ( say 30) the distribution of the sample becomes normal.76
9578298134standardized valuez score t score77
9578298135mutually exclusiveNothing in common disjoint never independent78
9578298099wording biasWhenever a bias is created in a sample by the way the survey is worded to favor one question79
9578298136causation80
9578298137z test81
9578298138t test82
9578298100chi squared goodness of fittests how well close the observes data is to what would be expected under the model. If a sign diff is found b/w the two then ob. data has not been generated by chance. nominal data Determine if scores from one variable match expectations for that distribution a gambler placed $1,000 into a game of greed in which he lost. He hopes to catch his opponent and bust him for loading the dice. He does this by choosing one dice to roll 36 times. He knows that the each side has an equal chance of landing face up. He hopes to get an outcome abnormal to this. Given the data below, can we prove that the dice are loaded83
9578298101frequency tableA grouping of qualitative data into mutually exclusive classes showing the number of observations in each class. A chart showing the number of times a specific event happens.84
9578298139simpsons paradox85
9578298106uniformA histogram doesn't appear to have any mode and in which all the bars are approximately the same height Evenly spaced consistent; unchanging; the same for everyone86
9578298107symetricWhen in a normal distribution both sides are identical87
9578298108time plotDisplays data that change over time. Often, successive values are connected with lines to show trends more clearly. Sometimes a smooth curve is added to the plot to help show long-term patterns and trends. Displays data that change over time.88
9578298109sstandard deviation of residuals89
9578298110r2overall measure of how successful the regression is in linearlly relating to y and x90
9578298111influential pointa point when omitted will give very different results91
9578298112censusWhen a survey has no sample but instead test or surveys the entire population92
9578298114convenience sampleChoosing a sample because it is convenient. failing to get a proper representation of the population because If you survey everyone on your soccer team who attends tonight's practice, you are surveying a convenience sample.93
9578298115response biasLying A police officer asking teenagers about drug use94
9578298116observational studyA study based on data in which no manipulation of factors has been employed. A study that observes characteristics of an existing population. usually a survey95
9578298118prospective studyan observational study in which subjects are followed to observe future outcomes96
9578298120control groupIn an experiment, the group that is not exposed to the treatment; contrasts with the experimental group and serves as a comparison for evaluating the effect of the treatment.97
9578298121blindingSubject does not know treatment For ex, study subjects may receive capsules with identical appearance and taste; however, the treatment group receives the active drug, whereas the control group receives the placebo.98
9578298122placebo effectExperimental results caused by expectations alone; any effect on behavior caused by the administration of an inert substance or condition, which is assumed to be an active agent.99
9578298123trialA performed experiment based upon the hypothesis you made.100
9578298124maximum(n.) the greatest possible amount or degree in a data sample the largest value in a set of data101
9580648396Assumptions for X2 TestRandom SRS Selection LSS E>5 Independent Pop > 10n102
95807340623 important experimental factors1. Control- blocking, blind, dblind, placebo 2. Random: # all and select at random(like random table) 3. Replication: Did you do enough to minimize results from chance?103

Need Help?

We hope your visit has been a productive one. If you're having any problems, or would like to give some feedback, we'd love to hear from you.

For general help, questions, and suggestions, try our dedicated support forums.

If you need to contact the Course-Notes.Org web experience team, please use our contact form.

Need Notes?

While we strive to provide the most comprehensive notes for as many high school textbooks as possible, there are certainly going to be some that we miss. Drop us a note and let us know which textbooks you need. Be sure to include which edition of the textbook you are using! If we see enough demand, we'll do whatever we can to get those notes up on the site for you!