AP Notes, Outlines, Study Guides, Vocabulary, Practice Exams and more!

AP Statistics Flash Cards Flashcards

Terms : Hide Images
9739292173Categorical vs Quantitative DataData are categorical if they fall into groups or categories and data are quantitative if they take on numeircal values where it makes sense to find an average. -Use bar graphs, pie graphs, or segmented bar charts for categorical variables -Use dotplots, stemplots, histograms or boxplots for quantitative variables such as age or weight.0
9739318373Marginal vs. Conditional DistributionsIn a two-way table, marginal distributions consider only one variable and use the total row/column of the table only. Conditional distributions describe the distribution of one variable for a specific value of the other (one row/column inside the table).1
9768900763SOCSShape - Skewed Left, Skewed Right, Symmetric, Uniform, Unimodal (one peak), Bimodal (two peaks) Outliers - Discuss them if they are obvious Center - Mean or median Spread - Range, IQR, or Standard Deviation2
9768932036Comparing DistributionsAddress SOCS in context! You must use comparison phrases like "is greater than", etc.3
9768943930Outlier RuleUpper Cutoff = Q3 + 1.5xIQR Lower Cutoff = Q1 - 1.5xIQR IQR = Q3-Q14
9768965302Interpret Standard DeviationStandard deviation measures spread by giving "typical" distance that the observations (context) are away from the mean (context)5
9768984020How does the shape affect measures of center?In general, Skewed Left (meanmedian) Fairly Symmetric (mean and media are close to equal)6
9769047552Interpret a z-scoreZ = (value - mean)/(standard deviation) A z-score describes how many standard deviations a value falls above/below the mean. Positive z-scores are above the mean, negative z-scores are below the mean.7
9769080965Linear Transformationsa+bx adding/subtracting "a" to every member of a data set adds "a" to the measures of center, but does not affect shape or spread. Multiplying every member of a data set by "b" multiplies measures of center by "b", measures of spread by absolute value of "b" and does not affect shape.8
9769129653PercentilesThe kth percentile of a distribution is the point that has k% of the values less than that point. For example, a student who has a test score in the 90th percentile has a score higher than 90% of all test takers.9
9769153310The Standard Normal DistributionThe standard normal distribution with mean=0 and standard deviation = 1. Table displays areas to the left of z-scores.10
9769179022Normalcdf and Invnorm calculator tipsUsing boundaries to find area (probability) normlacdf(lower,upper,mean,SD) Using area (probability/percentile) to find boundary: invnorm(area to the left, mean, SD)11
9769203852Describe an association in a scatterplotS - strength of relationship (strong, moderate, weak) O-Obvious Outliers? F - Form (linear or non-linear) A - Association (positive/negative)12
9769238178Interpret "r", the correlation coefficientr is always between -1 and 1 Close to zero = very weak Close to 1 or -1 = very strong Exactly 1 or -1 = perfectly straight line Positive r = positive correclation Negative r = negative correlation13
9769807531Interpret LSRL slopeFor every one unit change in the x variable (context) the y variable (context) is predicted to increase/decrease by "b" units (context)14
9769836926Interpret LSRL y-intercept "a"When the x-variable (context) is zero, the predicted y-value (context) is "a"15
9769854415What is a residual?residual = actual y (or observed) - predicted y the residual measures the difference between the actual y-value and the predicted y-value.16
9769881518Interpreting a residual plotIf there is a leftover pattern in the residual plot, then the model is not appropriate If there is no leftover pattern in the residual pot, then the model is appropriate.17
9769922575Interpret LSRL "y-hat"y-hat is the estimated or predicted value by the LSRL18
9769927316ExtrapolationUsing a LSRL to predict outside the domain of the explanatory variable. (can lead to ridiculous conclusions if the observed association does not continue)19
9769947749Interprete LSRL "s"s=______, the standard deviation of the residuals or a typical prediction error (context)20
9769961215Interpret r-squared_____% of variation in the y-variable (context) that is explained by the x-variable (context) using the LSRL model21
9769981213Outliers and influetial points in regressionAny point that falls outsied the pattern of association should be considered an outlier. An influential point has a big effect on calculations like correlation and slope. It's usually an outlier in the x-direction.22
9770011631LSRL computer printoutUse foot length to predict height: Predictor Coeff Constant 103.41 Foot Length 2.7469 S=7.95126 LSRL: Predicted Height = 103.41 + 2.7469xFoot length Typical predicted height will differ from the actual height by about 7.951 units23
9770125113SRSAn SRS (simple random sample) is a sample taken in such a way that every set of n individuals has an equal chance of being the sample selected.24
9770141603Using a random digits table to select a sampleLabel - give every member of the population a numerical label wit hteh same number of digits. Use as few digits as possible. Randomize - read consecutive groups of digits of the appropriate length from left to right across a line in the table. Ignore reapeats and groups not used as labels. Stop when you have n different labels. The individuals who correspond to the labels selected are your sample.25
9770180811Sampling TechniquesSRS - names in a hat Stratified - split the population into homogeneous groups, select SRSs from each group Cluster - Split the population into gropus(usually by location) called clusters, and randomly select whole clusters for the sample Census - An attempt to reach the entire population Convenience - Selects individuals in easiest to reach way Voluntary Response - People choose themselves by responding to a general appeal26
9786155280Advantage of using a stratified random sample over SRSStratified sampling guarantees that each of the strata will be represented. When strata are chosen properly, a stratified random sample will produce better (less variable/more precise) information than an SRS of the same size.27
9786170017BiasA sampling method is biased if it consistently produces estimates that are too small or too large28
9786182399Experiment vs. Observational StudyA study is an experiment ONLY if researchers impose a treatment upon the experimental units. In an observational study, researchers make no attempt to influence the results and cannot conclude cause-and-effect.29
9786214455ConfoundingTwo variables are confounded if it cannot be determined which variable is causing the change in the response variable. For example, if people who take vitamins on their own have less cancer, we cannot say for sure that the vitamins are causing the reduction in cancer. It could be other characteristics of vitamin takers, such as diet or exercise.30
9786238638Why use a control group?A control group gives the researchers a comparison group to be used to evaluate the effectiveness of the treatments. (context) It allows the researchers to measure the effect of the treatment (context) compared to no treatment at all.31
9786262154BlindingWhen the subjects in an experiment don't know which treatment they are receiving, they are blind. If the people interacting with the subjects and measuring the response variable don't know which subjects received treatments, they are blind. If both groups are blind, the study is double-blind.32
9786294825Experimental DesignsCompletely Randomized Design - Units are allocated at random among all treatments. Randomized Block Design - Units are put into homogenous blocks and randomly assigned to treatments within each block. Matched Pairs - A form of blocking in which each subject receives both treatments in a random order or subjects are matched in pairs with one subject in each pair receiving each treatment, assigned at random.33
9787372833Benefit of BlockingBlocking helps account for variability in the response variable (context) that is caused by the blocking variable (context). If there really is a difference in the effectiveness of the treatments, using an appropriate blocking variable will increase power (probability of finding convincing evidence that the treatments are not equally effective)34
9787405084Scope of Inference: generalizing to a larger populationWe can generalize the results of a study to a larger population if we used a random sample from that population.35
9787499877Scope of Inference: Cause-and-EffectWe can make a cause-and-effect conclusion if we randomly assign treatments to a experimental units in an experiment. Otherwise Association is NOT Causation!36
9787514940Interpreting ProbabilityThe probability of an event is the proportion of times the event would occur in a very large number of repetitions. Probability is a long-term relative frequency.37
9787530169Law of large numbersThe law of large numbers says that if we observe many repetitions of a chance process, the observed proportion of times that an event occurs approaches a single value, called the probability of that event.38
9787575213Complementary EventsTwo mutually exclusive events whose union is the sample space. For Example: -Rain/No Rain -Draw at least one heart/Draw NO hearts39
9787630720Conditional ProbabilityProbability that one event occurs given that another event is already known to have occurred. On Formula Sheet40
9787652535Two events are independent if41
9787661162Two events are mutually exclusive if42
9787683423Interpreting Expected Value/MeanIf we were to repeat the chance process many times, the average value of __________ (context) would be about __________ (context)43
9787701233Mean and Standard deviation of a discrete random variable44
9789257028Mean and standard deviation of a transformation of a random variable45
9789267128Mean and standard deviation of a difference of two random variables46
9789277664Mean and standard deviation of a sum of two random variables47
9789284235Binomial Setting and random variable48
9789323208Binomial Distribution (Calculator usage)49
9789330244Mean and standard deviation of a binomial random variable50
9789349225What is a sampling Distribution?A sampling distribution is the distribution of a sample statistic in all possible samples of the same sample size. It describes the possible values of the statistic and how likely these values are to occur. Contrast with the distribution of the population, and the distribution of a sample.51
9789408582What is the sampling distribution of p-hat (sample proportions)?52
9789421180What is the sampling distribution of x-bar (sample means)?53
9789429294What is the central limit theorem (CLT)?If the population distribution of a variable is not normal, the sampling distribution of the sample mean will become more and more normal as the sample size (n) increases.54
97894551664-Step Process Confidence Intervals55
9789468769Unbiased estimatorA statistic is an unbiased estimator of a parameter if the mean of its sampling distribution equals the true value of the parameter being estimated.56
9789490876Interpreting a confidence intervalI am ______% confident that the interval from ______ to _______ captures the true ___________ (parameter in context)57
9789501495Interpreting a confidence levelIf many, many samples are selected and many, many confidence intervals are calculated, about _____% of them will capture the true ________(parameter in context)58
9789602391Standard error vs margin of errorThe standard error of a statistic measures how far the value of the statistic typically differs from the true value of the parameter. The margin of error estimates how far we expect the parameter to differ from the statistic, at most.59
9789634878What factors affect the margin of error?The margin of error decreases when: -the sample size increases -the confidence level decreases60
9789651619Inference for means (conditions)61
9789661446Inference for proportions (conditions)62
97896700534-Step process Significance tests63
9789679273Explain a p-valueAssuming that the null hypothesis is true (context) there is a ______ probability of observing a statistic (context) as large or larger than the one actually observed by chance alone.64
9789704119Type I and Type II Error and Power65
9789728958Factors that affect Power66
9789739534Paired t-test, Identification hints67
9789748914Two sample t-test, Identification hints68
9789760216Chi-squared tests (conditions)69
9789768449Types of Chi-squared testsGoodness of Fit: Use to compare the distribution of a categorical variable in one population to a hypothesized distribution. Homogeneity: Use to compare distribution of a categorical variable for 2+ populations or treatments. Independence: Use to test the association between two categorical variables in one population.70
9789807469Chi-squared tests, df and expected counts71
9789816316Inference for regression with computer output72

Need Help?

We hope your visit has been a productive one. If you're having any problems, or would like to give some feedback, we'd love to hear from you.

For general help, questions, and suggestions, try our dedicated support forums.

If you need to contact the Course-Notes.Org web experience team, please use our contact form.

Need Notes?

While we strive to provide the most comprehensive notes for as many high school textbooks as possible, there are certainly going to be some that we miss. Drop us a note and let us know which textbooks you need. Be sure to include which edition of the textbook you are using! If we see enough demand, we'll do whatever we can to get those notes up on the site for you!