AP Statistics Vocabulary Review Flashcards

Categorical: Use bar graphs, pie graphs, or segmented bar charts.
Marginal Distribution: In a two-way table, consider only one variable and use only the total row/column of the table.
Conditional Distributions: Describe the distribution of one variable for a specific value of the other (one row/column inside the table).
Quantitative Data: Use dotplots, stemplots, histograms, or boxplots for quantitative variables such as age or weight.
SOCS: Shape (skewed left, skewed right, symmetric, uniform, unimodal, bimodal); Outliers (discuss them if there are obvious ones); Center (mean or median); Spread (range, IQR, or standard deviation). Note: Also be on the lookout for gaps, clusters, or other unusual features of the data set.
Comparing Distributions: Address Shape, Outliers, Center, and Spread in context! YOU MUST USE comparison phrases like "is greater than" or "is less than" for Center and Spread.
Outlier Rule: Upper Cutoff = Q3 + 1.5(IQR); Lower Cutoff = Q1 - 1.5(IQR); IQR = Q3 - Q1.
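As a worked illustration of the Outlier Rule, here is a minimal Python sketch. The function names and the sample data are made up for this example, and it assumes the common textbook convention that Q1 and Q3 are the medians of the lower and upper halves of the sorted data (calculators and software may use slightly different quartile rules).

```python
from statistics import median

def outlier_cutoffs(data):
    """Return (lower_cutoff, upper_cutoff) from the 1.5 * IQR rule.

    Assumption: Q1 and Q3 are the medians of the lower and upper halves
    of the sorted data, leaving out the middle value when n is odd.
    Needs at least two data values.
    """
    values = sorted(data)
    half = len(values) // 2
    q1 = median(values[:half])    # median of the lower half
    q3 = median(values[-half:])   # median of the upper half
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr

def find_outliers(data):
    """List the values that fall below the lower or above the upper cutoff."""
    low, high = outlier_cutoffs(data)
    return [x for x in data if x < low or x > high]

# Hypothetical data set, for illustration only.
sample = [2, 4, 5, 6, 7, 8, 9, 10, 30]
print(outlier_cutoffs(sample))  # (-3.0, 17.0): Q1 = 4.5, Q3 = 9.5, IQR = 5
print(find_outliers(sample))    # [30]
```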
Interpret Standard Deviation: Measures spread by giving the "typical" distance that the observations (context) are away from the mean (context).
How does shape affect measures of center? In general: Skewed Left (Mean < Median); Skewed Right (Mean > Median); Fairly Symmetric (Mean ≈ Median).
Interpret a z-score: Describes how many standard deviations a value falls away from the mean of the distribution and in what direction.
Percentiles: The kth percentile of a distribution is the point that has k% of the values less than that point.
Linear Transformations: Adding "a" to every member of a data set adds "a" to the measures of position, but does not change the measures of spread or the shape. Multiplying every member of a data set by "b" multiplies the measures of position by "b" and multiplies most measures of spread by |b|, but does not change the shape.
The Standard Normal Distribution: A Normal distribution with a mean of 0 and a standard deviation of 1.
Using Normalcdf: Using boundaries to find area: normalcdf(min, max, mean, SD).
InvNorm: Using area to find a boundary: invNorm(area to the left as a decimal, mean, SD).
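For readers without a graphing calculator, the two calculator commands can be imitated with Python's standard-library `statistics.NormalDist`. This is a sketch, not the calculator's own implementation: the function names `normalcdf`, `invnorm`, and `z_score` are chosen here only to mirror the cards above.

```python
from statistics import NormalDist

def normalcdf(lower, upper, mean=0.0, sd=1.0):
    """Area under the Normal curve between two boundaries."""
    dist = NormalDist(mu=mean, sigma=sd)
    return dist.cdf(upper) - dist.cdf(lower)

def invnorm(area_left, mean=0.0, sd=1.0):
    """Boundary that has the given area (as a decimal) to its left."""
    return NormalDist(mu=mean, sigma=sd).inv_cdf(area_left)

def z_score(value, mean, sd):
    """Number of standard deviations a value lies above (+) or below (-) the mean."""
    return (value - mean) / sd

print(round(normalcdf(-1, 1), 4))   # about 0.6827 (within 1 SD of the mean)
print(round(invnorm(0.975), 2))     # about 1.96
print(z_score(130, 100, 15))        # 2.0
```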
Describing an association in a scatterplot: Address the following, in context: Direction, Outliers, Form, Strength.
Interpret r: Correlation measures the strength and direction of the linear relationship between x and y. r is always between -1 and 1. Close to zero = very weak; close to 1 or -1 = stronger; exactly 1 or -1 = perfectly straight line. Positive r = positive correlation; negative r = negative correlation.
Interpret LSRL Slope "b": For every one-unit increase in the x variable (context), the y variable (context) is predicted to increase/decrease by ____ units (context).
Interpret LSRL y-intercept "a": When the x variable (context) is zero, the y variable (context) is predicted to be ______.
What is a Residual? Residual = y - y-hat. It measures the difference between the actual (observed) y-value in a scatterplot and the y-value that is predicted by the LSRL using its corresponding x-value.
Interpreting a Residual Plot: If there is a leftover pattern, then the model used does not have the same form as the association (the model is not appropriate). If there is no leftover pattern in the residual plot, then the model is appropriate.
Interpret LSRL "y-hat": The "estimated" or "predicted" y-value (context) for a given x-value (context).
Extrapolation: Using an LSRL to make predictions for x-values outside the range of the explanatory variable in the data used to fit the line. Such predictions are often unreliable.
Interpret LSRL "s": s is the standard deviation of the residuals. It measures the typical distance between the actual y-values (context) and their predicted y-values (context) in a regression setting.
Interpret r-squared: __% of the variation in y (context) is accounted for by the LSRL of y (context) on x (context).
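The regression quantities on the preceding cards (slope b, y-intercept a, y-hat, residuals, s, and r-squared) can be computed by hand for a small data set. The sketch below uses the standard least-squares formulas and divides by n - 2 when computing s; the function names and the five data points are made up for illustration.

```python
from math import sqrt

def lsrl(xs, ys):
    """Return (a, b) for the least-squares line y-hat = a + b*x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    b = sxy / sxx            # slope
    a = y_bar - b * x_bar    # y-intercept
    return a, b

def regression_summary(xs, ys):
    """Return residuals, s (SD of the residuals), and r-squared."""
    a, b = lsrl(xs, ys)
    predicted = [a + b * x for x in xs]                          # y-hat values
    residuals = [y - y_hat for y, y_hat in zip(ys, predicted)]   # y - y-hat
    sse = sum(e ** 2 for e in residuals)
    y_bar = sum(ys) / len(ys)
    sst = sum((y - y_bar) ** 2 for y in ys)
    s = sqrt(sse / (len(xs) - 2))   # typical prediction error
    r_squared = 1 - sse / sst       # share of variation in y explained
    return residuals, s, r_squared

# Hypothetical data, for illustration only.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
a, b = lsrl(xs, ys)
residuals, s, r_squared = regression_summary(xs, ys)
print(round(a, 3), round(b, 3))          # 2.2 0.6
print(round(s, 3), round(r_squared, 3))  # 0.894 0.6
```

A residual plot is then just the residuals plotted against xs; with only five points no pattern can be judged, but the same code applies to larger data sets.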
Outliers in Regression: Any point that falls outside the pattern of the association should be considered an outlier.
Influential Points in Regression: A point that has a big effect on a calculation, such as the correlation or the equation of the least-squares regression line. Points separated in the x-direction are often influential.
SRS: A sample taken in such a way that every set of n individuals has an equal chance to be the sample actually selected.
Sampling Techniques: 1. SRS: Number the entire population and draw numbers from a hat (every set of n individuals has an equal chance of selection). 2. Stratified: Split the population into homogeneous groups and select an SRS from each group. 3. Cluster: Split the population into heterogeneous groups called clusters, and randomly select whole clusters for the sample. Ex. Choosing a carton of eggs actually chooses a cluster (group) of 12 eggs. 4. Census: An attempt to reach the entire population. 5. Convenience: Selects the individuals easiest to reach. 6. Voluntary Response: People choose themselves by responding to a general appeal.
Advantage of using a Stratified Random Sample Over an SRS: Stratified random sampling guarantees that each of the strata will be represented. When strata are chosen properly, a stratified random sample will produce better (less variable/more precise) information than an SRS of the same size.
Bias: A sampling method is biased if it consistently produces estimates that are too small or consistently produces estimates that are too large.
Experiment: Researchers impose a treatment upon the experimental units.
Observational Study: Researchers make no attempt to influence the results and cannot conclude cause-and-effect.
Confounding: Occurs when two variables are associated in such a way that their effects on a response variable cannot be distinguished from each other.
Why use a control group? It gives the researchers a comparison group to be used to evaluate the effectiveness of the treatment(s).
Blinding: A technique where the subjects do not know whether they are receiving a treatment or a placebo.
Experimental Designs: CRD (Completely Randomized Design): Units are allocated at random among all treatments. RBD (Randomized Block Design): Units are put into homogeneous blocks and randomly assigned to treatments within each block. Matched Pairs: A form of blocking in which each subject receives both treatments in a random order, or subjects are matched in pairs with one subject in each pair receiving each treatment, determined at random.
Benefit of Blocking: The reduction of the effect of variation within the experimental units.
Scope of Inference (Generalizing to a Larger Population): We can generalize the results of a study to a larger population if we used a random sample from that population.
Scope of Inference (Cause-and-Effect): We can make a cause-and-effect conclusion if we randomly assign treatments to experimental units in an experiment. Otherwise, Association is NOT Causation!
Interpreting Probability: The proportion of times the event would occur in a very large number of repetitions.
Law of Large Numbers: If we observe many repetitions of a chance process, the observed proportion of times that an event occurs approaches a single value, called the probability of that event.
Conducting a simulation: State: Ask a question about some chance process. Plan: Describe how to use a random device to simulate one trial of the process and indicate what will be recorded at the end of each trial. Do: Do many trials. Conclude: Answer the question of interest.
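The Law of Large Numbers and the State/Plan/Do/Conclude simulation steps can be tried out with a few lines of Python. This is only a sketch: the function name, the fixed seed, and the choice of a chance process with success probability 0.5 (like a fair coin) are assumptions made for the example.

```python
import random

def simulate_proportion(num_trials, p_success=0.5, seed=1):
    """Do: run many trials and record the proportion of successes.

    Plan: one trial draws a random number in [0, 1) and counts a success
    when the number is below p_success. The seed is fixed only so that
    repeated runs give the same result.
    """
    rng = random.Random(seed)
    successes = sum(1 for _ in range(num_trials) if rng.random() < p_success)
    return successes / num_trials

# Conclude: the observed proportion should settle near 0.5 as trials increase.
for n in (10, 100, 10_000, 100_000):
    print(n, simulate_proportion(n))
```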
Complementary Events: Two or more mutually exclusive events that together cover all possible outcomes. The sum of the probabilities of complementary events is 1.
Conditional Probability: The probability that one event happens given that another event is already known to have happened.
Two Events are Independent If...: P(B) = P(B|A) or P(B) = P(B|A^c), where A^c is the complement of A. Meaning: Knowing that event A has occurred (or not occurred) doesn't change the probability that event B occurs.
Two Events are Mutually Exclusive If...: P(A and B) = 0. Events A and B are mutually exclusive if they share no outcomes.
Interpreting Expected Value/Mean: If we were to repeat the chance process (context) many times, the average value of _____ (context) would be about _______.
Binomial Setting and Random Variable: Binary? Each trial can be classified as success/failure. Independent? Trials must be independent. Number? The number of trials (n) must be fixed in advance. Success? The probability of success (p) must be the same for each trial. X = number of successes in n trials.
Geometric Setting and Random Variable: Arises when we perform independent trials of the same chance process and record the number of trials it takes to get one success. On each trial, the probability p of success must be the same. X = number of trials needed to achieve one success.
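The cards above describe the binomial and geometric settings but not the probability formulas. As a hedged supplement, the sketch below uses the standard formulas P(X = k) = C(n, k) p^k (1 - p)^(n - k) for a binomial random variable and P(X = k) = (1 - p)^(k - 1) p for a geometric one; the function names are invented for this example.

```python
from math import comb

def binomial_pmf(n, p, k):
    """P(X = k): exactly k successes in n independent trials."""
    return comb(n, k) * p ** k * (1 - p) ** (n - k)

def geometric_pmf(p, k):
    """P(X = k): the first success occurs on trial k (k = 1, 2, 3, ...)."""
    return (1 - p) ** (k - 1) * p

# Exactly 5 heads in 10 tosses of a fair coin: 252/1024.
print(binomial_pmf(10, 0.5, 5))   # 0.24609375
# First success on the 3rd trial when p = 0.2: 0.8 * 0.8 * 0.2.
print(round(geometric_pmf(0.2, 3), 3))   # 0.128
```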
Parameter: Measures a characteristic of a population, such as a population mean μ or population proportion p.
Statistic: Measures a characteristic of a sample, such as a sample mean x-bar or sample proportion p-hat.
What is a sampling distribution? The distribution of a sample statistic in all possible samples of the same size. It describes the possible values of a statistic and how likely these values are.
What is the Central Limit Theorem (CLT)? If the population distribution is not Normal, the sampling distribution of the sample mean (x-bar) will become more and more Normal as n increases.
Unbiased Estimator: A statistic is an unbiased estimator if the mean of its sampling distribution equals the true value of the parameter being estimated. In other words, the sampling distribution of the statistic is centered in the right place.
4-Step Process Confidence Intervals: STATE: What parameter do you want to estimate, and at what confidence level? PLAN: Choose the appropriate inference method. Check conditions. DO: If the conditions are met, perform calculations. CONCLUDE: Interpret your interval in the context of the problem.
Interpreting a Confidence Interval: I am ___% confident that the interval from ___ to ___ captures the true ____.
Interpreting a Confidence Level: If many similar samples were taken, _____% of them would result in intervals that contain the true mean/proportion.
What factors affect the Margin of Error? The margin of error decreases when the sample size increases or the confidence level decreases.
Inference for Means (Conditions): Random: Data from a random sample(s) or randomized experiment. Normal: Population distribution is Normal or large sample(s) (n ≥ 30, or n1 ≥ 30 and n2 ≥ 30). Independent: Independent observations and independent samples/groups; 10% condition if sampling without replacement.
Inference for Proportions (Conditions): Random: Data from a random sample(s) or randomized experiment. Normal: At least 10 successes and failures (in both groups, for a two-sample problem). Independent: Independent observations and independent samples/groups; 10% condition if sampling without replacement.
4-Step Process Significance Tests: State: What hypotheses do you want to test, and at what significance level? Define any parameters you use. Plan: Choose the appropriate inference method. Check conditions. Do: If the conditions are met, perform calculations. Compute the test statistic and find the P-value. Conclude: Interpret the result of your test in the context of the problem.
Explain a P-value: Assuming that the null is true (context), there is a ___ probability of observing a statistic (context) as large as or larger than the one actually observed by chance alone.
Type I Error: Rejecting H0 when H0 is actually true.
Type II Error: Failing to reject H0 when Ha is true.
Power: Probability of finding convincing evidence that Ha is true when in reality Ha is true.
Factors that Affect Power: 1. Sample Size: To increase power, increase sample size. 2. Increase α: A 5% test of significance will have a greater chance of rejecting the null than a 1% test. 3. Consider an alternative that is farther away from μ0: Values of μ that are in Ha but lie close to the hypothesized value are harder to detect than values of μ that are far from μ0.
Chi-Square Tests (Conditions): Random: Data from a random sample(s) or randomized experiment. 10%: The sample must be ≤ 10% of the population. Large Counts: All expected counts are at least 5.
Types of Chi-Square Tests: 1. Goodness of Fit 2. Homogeneity 3. Independence
Goodness of Fit: Use to test the distribution of one group or sample as compared to a hypothesized distribution.
Homogeneity: Use when you have a sample from 2 or more independent populations or 2 or more groups in an experiment. Each individual must be classified based upon a single categorical variable.
Independence: Use when you have a single sample from a single population. Individuals in the sample are classified by two categorical variables.
Goodness of fit - degrees of freedom: df = k - 1, where k is the number of categories.
Chi-Square Homogeneity/Independence - degrees of freedom: df = (rows - 1)(columns - 1).
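To connect the degrees-of-freedom formulas with the Large Counts condition, here is a small Python sketch for a two-way table. It assumes the standard formulas expected count = (row total)(column total)/(table total) and chi-square = sum of (observed - expected)^2 / expected, which the cards themselves do not state; the function names and the 2x2 table are hypothetical.

```python
def expected_counts(table):
    """Expected counts for a two-way table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]

def chi_square_two_way(table):
    """Return (statistic, df, large_counts_ok) for homogeneity/independence."""
    expected = expected_counts(table)
    statistic = sum(
        (obs - exp) ** 2 / exp
        for obs_row, exp_row in zip(table, expected)
        for obs, exp in zip(obs_row, exp_row)
    )
    df = (len(table) - 1) * (len(table[0]) - 1)   # (rows - 1)(columns - 1)
    large_counts_ok = all(exp >= 5 for row in expected for exp in row)
    return statistic, df, large_counts_ok

def gof_degrees_of_freedom(num_categories):
    """Goodness of fit: df = k - 1."""
    return num_categories - 1

# Hypothetical observed counts, for illustration only.
observed = [[10, 20], [30, 40]]
print(expected_counts(observed))     # [[12.0, 18.0], [28.0, 42.0]]
print(chi_square_two_way(observed))  # statistic is about 0.794, df = 1
```

Finding the P-value from the statistic and df still requires a chi-square table or calculator, which this sketch leaves out.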
Inference for Regression (Conditions): Linear: True relationship between the variables is linear. Independent: Independent observations; 10% condition if sampling without replacement. Normal: Responses vary Normally around the regression line for all x-values. Equal Variance: Equal variance around the regression line for all x-values. Random: Data from a random sample or randomized experiment.
