AP Statistics Exam Review Flashcards

5135097582	What is a dotplot?	A graphical display which shows "dots" for each point. It's good for categorical data- ie data classified into categories.	0
5135097583	What's the difference between categorical and quantitative data?	Categorical data fits into various categories; whereas, quantitative data has numerical values associated with it.	1
5135097584	What is a bar chart?	A display for categorical data which indicates frequencies or percents for each category.	2
5135097585	What are histograms?	Histograms are good for large quantitative data sets- either having numbers at the left/right of a bar to show the amount of data in-between each value or in the center of a bar to show the amount of data at a certain value. Sometimes, the axis will just be the frequency, but often, it can be the relative frequency (ie. amount/total).	3
5135097586	What do relative areas in histograms mean?	Relative areas correspond to relative frequencies (ie. if 10% of the area for a histogram is between 25-26, that means that 10% of the data falls between 25 and 26.	4
5135097587	What's a stemplot/stem and leaf plot?	It has stems which are some digit and leaves which are the other part of the number (for example depending on context 5\|7 could be 57, 5.7, or some other variant- that's why a key must always be included). It's good for looking at individual data in small data sets.	5
5135097588	What is important in analyzing visual data displays?	SOCS (Shape, Outlier, Center, Spread): Shape-How is the data shaped (skewed left/right, symmetric, bimodal, etc.)? Are there any clusters (subgroups which the data falls into)? Are there any gaps in the data set? Outliers: Are there any outliers within the data set? Center: Give the mean/median- the value which is the approximate midpoint of the data Spread- What is the range OR IQR (if it's easy to find) of the data set?	6
5135097589	What is a mode? How do modes relate to unimodal/bimodal data sets?	A mode is a major peak in the data (most repeated value). A unimodal data set has just one mode; whereas, a bimodal data set has two modes.	7
5135097590	What are some possible descriptions of shapes within distributions?	Symmetric- There is a vertical line of symmetry, splitting the graph into two equal parts. Skewed Right- Data decreases for higher values/has less area for higher values Skewed Left- Data slopes upwards from the left (less area for lower values). Bell shaped- symmetric with a center mound and tails going to the left/right. Uniform- Straight line across/data distribution stays constant.	8
5135097591	What is a cumulative relative frequency plot (or ogive), and how does it relate to skewness?	A CRF plot shows the percentage of data accumulated along the y axis by each value of the data along the x. For instance, (10,0.15) would mean that 15% of the data is less than or equal to 10. A distribution skewed to the left has a frequency plot which rises slowly at first and steeply later; whereas, a distributions skewed to the right has a relative frequency plot rising quickly at first and then slowing down later.	9
5135097592	What's the difference between descriptive statistics and statistical analysis?	Descriptive statistics means summarizing averages, shape of a distribution, etc. while statistical analysis means drawing inferences from limited data.	10
5135097593	What are the two main ways of measuring center?	The median (the middle number of a set when arranged in order). The mean (summing the values in a set and dividing by the number of quantities in that set)	11
5135097594	When does it make more sense to use the median over the mean?	When there are outliers which we want to minimize. We say the median is RESISTANT to outliers (which means it's not affected).	12
5135097595	What are the notations for mean of a population and mean of a sample?	The sample mean usually assumes a simple random sample. The mean is computed by ∑x/n.	13
5135097596	What are the ways of describing variability/dispersion of the measurements?	1) Range - difference between largest and smallest values. 2) IQR- difference between largest and smallest values after removing lower and upper quarters. There are two ways of computing this : way 1) simply take out upper and lower quarters of the data and subtract. 2) Find Q1 by taking the median of the lower half and Q3 by the median of the upper half (median itself must be included if there are an odd number of points). Then do Q3-Q1 to get the IQR. These should be equivalent if there are many data points. 3) Variance- an average of squared distances from the mean 4) Standard deviation- square root of the variance	14
5135097597	What is the rule for designating outliers?	Outliers are considered to be any value above Q3+1.5IQR OR any value below Q1-1.5IQR	15
5135097598	How is the variance calculated for a population?How is it calculated for a sample?	So for a population you sum up all of the squares of the deviations from the mean and divide by the number of terms. You do the same thing for a sample but divide by number of terms-1 due to degrees of freedom.	16
5135097599	How do you calculate the standard deviation?	Take the square root of the variance. The standard deviation shows how far values vary from the mean on average.	17
5135097600	What is a residual?	It is simply each value-the mean.	18
5135097601	What are the three main ways of measuring position?	1) Simple ranking- arrange the elements and note where each value falls in that order. 2) Percentile ranking- the percentage of values falling below yours (ie. number below your value/total). 3)z-score- how many standard deviations a value is away from the mean.	19
5135097602	Where do the quartiles and deciles lie in terms of percentile ranking?	Q1 has a percentile rank of 25% and Q3 has a percentile rank of 75%. The deciles have ranks of 10% and 90%.	20
5135097603	What is the formula for a z score?	This shows the number of standard deviations away from the mean. Also, if you're given a z score, the mean, and the standard deviation, you can solve for an x value.	21
5135097604	What is the empirical rule?	The empirical rule says that for symmetric, bell-shaped data, 68% of the data lies within one standard deviation of the mean, 95% lies within 2 standard deviations of the mean, and 99.7% of the data lies within 3 standard deviations of the mean.	22
5135097605	How is the empirical rule related to range?	The empirical rule can indicate arithmetic errors as the range should be somewhere between 4 times the standard deviation and 6 times the standard deviation.	23
5135097606	How does skewed data affect how the mean compares to the median?	If data is skewed to the left, the mean is usually lower than the median. If data is skewed to the right, the mean is usually higher than the median.	24
5135097607	What is a boxplot?	It gives a 5 number summary with a whisker out to the highest value, a line at Q3, a line for the median, a line at Q1, and a line out to the lowest value. Alternatively, outliers can be depicted as dots on the boxplot, and the lines just go to the highest/lowest values not considered to be outliers.	25
5135097608	What is the effect on mean, median, range, and standard deviation of adding a certain amount or multiplying by a certain amount to every value in the data set?	Adding: Changes the mean & median by that amount but doesn't change the range or standard deviation. Multiplying: Changes mean, median, range, and standard deviation all by that same factor.	26
5135097609	What are some graphical methods of comparing distributions?	1) Dotplots either above or next to each other for each distribution. 2)Double barcharts with bars next to each other to make the comparison. 3) Back to back stemplots with leaves going out to either side 4) Parallel boxplots with boxplots stacked on top of or next to one another 5) Cumulative frequency plots with both plots running next to one another.	27
5135097610	What is bivariate data?	Data that explores the relationship between two variables (x & y)	28
5135097611	What is a scatterplot?	A scatterplot shows (x,y) ordered pairs and helps to give a visual indication of the relationship between the two variables. One can see whether the variables are positively or negatively associated. Also, sometimes scatterplots might be labeled with some markings to show one category and some to show another (dots for men, x's for women, etc.). Clusters and outliers ought to be noed in scatterplot analysis.	29
5135097612	What is the correlation coefficient r?	It describes how well the data fits a linear trend. A positive r means a positive association, a negative r means a negative association and r's with higher absolute values indicate stronger relationships. r is not affected by which variable is called x or called y, and an r of 0 doesn't necessarily indicate no relationship (it could be a strong nonlinear relationship). The formula is the sum of the product of all x and y z scores divided by the sample size-1.	30
5135097613	What is r²?	r² is called the coefficient of determination and gives the percentage of variation in y explained by x. One must be careful when finding r from r² in terms of assigning positive/negative values.	31
5135097614	What is the least squares regression line?	It's the line that is the best fitting as it minimizes the squares of the residuals. It's equation can be determined as it goes through the mean of x (x bar) and the mean of y (y bar). The slope is determined by b1=r *(sy/sx) where sy is the standard deviation of y, and sx is the standard deviation of x.	32
5135097615	What is the equation for the line comparing z scores of y to z scores of x?	zy=rzx	33
5135097616	What's the difference between interpolation and extrapolation?	Interpolation is inside the scope of your data range which is good. Extrapolation is outside your data set and is risky as you don't know whether the linear trend will continue.	34
5135097617	What does y hat really indicate?	The mean prediction for each x value (there could be a variety of y values, so it simply gives the mean)	35
5135097618	What is a residual plot?	Observed-expected value gives the residuals. A residual plot gives the residuals on the y axis and the x values on the x.	36
5135097619	What is the mean and standard deviations of residuals?	The mean of the residuals is always 0. The standard deviation of residuals is given by the following formula: The standard deviation of residuals indicates a typical residual value. In computer output, it's given by S.	37
5135097620	What are you looking for in a residual plot?	Small, balanced residuals which don't show any kind of curve/pattern.	38
5135097621	What are outliers and influential points in regression?	Outliers deviate from the overall pattern. Influential points sharply change the slope of the regression line.	39
5135097622	How do you transform data to make it linear?	Sometimes a line isn't the best model, so you can apply a transformation to improve the trend. The most common transformations are either taking the log of all y values (resulting in an exponential model) or taking the log of all y and all x values (resulting in a power model).	40
5135097623	What can the correlation coefficient tell you about causation?	Absolutely nothing! You only know correlation not causation.	41
5135097624	In a computer output when the slope value is next to one of the variables, is that variable independent or dependent?	It's the independent variable!	42
5135097625	What are two way contingency tables?	They are tables which group data into different categories. For instance, you might want to compare severity of heart attacks to cholesterol level (so you might have severity of heart attacks as the row variable and cholesterol level as the column variable).	43
5135097626	What are marginal frequencies?	Marginal frequencies are the totals along the "margins" in two way contingency tables (ie. sum up each row and each column).	44
5135097627	What is the marginal distribution?	It's each marginal frequency divided by the total (and you can do this for each type- for instance you could get a marginal distribution for cholesterol level and another marginal distribution for heart attack severity). This information can be displayed in a bar chart.	45
5135097628	What are conditional relative frequencies?	Dividing each value by the marginal frequency of that row or column. So you could divide the number of non fatal heart attacks with low cholesterol by the total number of non fatal heart attacks. This information can be displayed in side by side bars in bar charts or alternatively by segmented bar charts in order to gauge association.	46
5135097629	What is perfect independence in two way contingency tables?	Perfect independence is when the conditional relative frequencies all match up. However, even if two variables are completely independent, they may not necessarily show perfect indepndence.	47
5135097630	What is Simpson's paradox?	Simpson's paradox is when the results from a combined grouping contradict the results for an individual group (due to lurking variables). Ie. if there are two doctors and you're comparing survival rates, you may initially conclude that one doctor is better than the other (based on combined survival rate). However, if you split these groups into good & bad condition of the patients that they're treating, you may come to the opposite conclusion.	48
5135097631	What is a census? What are the advantages/disadvantages of a census?	A census is a complete enumeration of the population. It's ideal because you manage to capture everybody. However, it can be very time consuming/costly. Also, it would be far better to take a sample and do it well then to conduct a poorly run census.	49
5135097632	What is a sample survey?	A sample survey just takes a part of the whole population to survey.	50
5135097633	What's necessary for a good sample survey?	Avoiding bias which is frequently achieved by randomization. Also, a large sample size gives more validity to the results (NOTE: It's the actual size not percentage- a group of 500 in a population of 100,000 is just as good as a group of 500 in a population of 1,000,000).	51
5135097634	What is an experiment	The researchers divide subjects into appropriate groups. Most often there is a treatment group which receives the treatment and a control group which does not (often receiving a placebo).	52
5135097635	What are the facets of a well designed experiment?	Double blindness which means that neither researchers nor subjects know what group the subjects are in. Control- conditions as similar as possible for all subjects other than their placed group Blocking- division into representative groups in order to make comparisons Randomization- randomize the group to minimize lurking variables Replication- repetition on a sufficient number of subjects Generalizability- ability to repeat in a variety of settings	53
5135097636	What is an observational study?	There isn't a decision about who goes to treatment or control groups (for instance you can't ask people to smoke more/less, so you simply ask people who already smoke that amount). Sample surveys are one example of observational study. However, experiments show cause/effect while observational studies do not as variables can become confounded with other variables.	54
5135097637	What is a simple random sample? What are some ways to get a simple random sample?	In a simple random sample, every participant has an equal chance of being selected. The best ways to generate a simple random sample are via random digit tables or having a computer generate random samples. One thing you have to be careful of is that you might not have a complete listing of the population in which case randomness is not ensured.	55
5135097638	Are other sampling techniques (stratified, cluster, etc.) just subsets of simple random sampling?	NO!!! In these techniques, every participant does not have equal chance of being selected.	56
5135097639	What is sampling error?	No matter how well designed a survey is, it still gives a sample statistic for a population parameter, so we're always bound to have some error. Generally, the chance of an error occurring is less when the sample size is larger unless the survey was badly conducted.	57
5135097640	What are some common types of biases?	Bias is defined as a tendency to favor certain members of a population. The following are the main types of bias: Household bias- only one member of a households responds, so large households are underrepresented. Nonresponse bias- people don't respond to surveys or are too difficult to contact, thus creating a source of bias. Quota sampling bias- interviewers are at liberty to pick people (ie. a specific percentage Catholic, a specific percentage African-American, etc.). Response bias- People may lie/be untruthful when responding, especially when they're not anonymous if their views are unsavory. Selection bias- for example a newspaper interviewed just people with cars and telephones in a presidential election and predicted a landslide victory for the wrong person due to the fact that the people owning cars and telephones were wealthy and tended to vote Republican. Size bias- For instance if you have a student pick a coin out of a bag to estimate the monetary value, throw a dart at a map, etc. This benefits large states, large coins, etc. Undercoverage bias- Inadequate representation- for instance there were phone surveys to landlines which left out people who only had cell phones. Another instance of this is convenience samples, like interviews at shopping malls which just target easy to reach people. Voluntary response bias- samples where individuals can volunteer or call in often benefit people who have strong opinons. Wording bias- if leading questions are used, then they may lead to biased answers.	58
5135097641	What are other sampling methods in addition to the simple random sample?	Systematic sampling- list the population in order, start at a random point and pick every tenth, hundredth, kth person from the list. This just result in a good sample as long as the list isn't ordered in any way related to the variables under consideration. Stratified sampling- the population is divided into homogeneous groups called strata, and random samples from all strata are chosen (ie. you could stratify by age, income level, race, etc.). You could also do proportional sampling by choosing the sample sizes from each strata in accordance to the proportion of the total population. Cluster sampling- the population is split into heterogeneous groups called clusters, and then, you take a random sample of clusters. For instance, you could randomly pick several high school classes to survey. Multistage sampling- there are two or more steps, each of which involves any of the other sampling techniques. For instance, some organizations randomly select nationwide locations, then randomly pick neighborhoods in each of these locations, then randomly pick households in each of these neighborhoods.	59
5135097642	What is an experiment vs. an observational study vs. a survey?	An experiment is when a treatment or change is assigned. An observational study is when we observe or measure something which is occurring. A sample survey is a particular type of observational study when we look at a sample.	60
5135097643	What are explanatory and response variables? What are treatments?	Explanatory variables (called factors) are what is being changed/tested and is believed to have an effect on the response variable (which is being measured). Treatments consist of factor-level combinations (for instance, you could have two factors and 3 levels of each factor for a total of 6 treatments).	61
5135097644	What is confounding? What are lurking variables? How can both of these effects be overcome?	Confounding is when there's uncertainty with regard to which variable is causing a given set of results (for instance if two or more variables are being altered). A lurking variable is a variable driving two other variables (for instance, those with higher shoe sizes have higher reading levels not because of their shoe size but because of the lurking variable of age). This can also be described as a common response in that the lurking variable and the measured variable seem to be producing the same response.	62
5135097645	What is a control group? What is the placebo effect? How can the placebo effect be minimized?	A control group is one which doesn't receive the treatment, and the treatment group receives the treatment. People can randomly be assigned to control & treatment groups in order to minimize confounding/lurking variables. The placebo effect is when people respond to any treatment (for instance, they might report that a sugar pill makes them feel much better). This can be overcome by either single-blinding in which the subjects don't know what they're receiving or double-blinding in which neither subjects nor researchers know what treatment they're receiving.	63
5135097646	What is randomized paired comparison design?	When you have one person who receives two different treatments or twins, one of whom receives one treatment, the other of whom receives the other.	64
5135097647	What are replication and blocking?	Replication is repeating the experiment sufficiently in order to decide whether the results are statistically significant or not. Blocking is basically the experimental version of stratification. It's dividing subjects up into representative groups called blocks with some characteristic. In that case, you're able to make more comparisons. Note: the paired comparison design is an example of blocking in which each pair is considered to be a block.	65
5135097648	What is probability?	The likelihood a particular event will occur. It is always between 0 and 1.	66
5135097649	What is relative frequency? How does it relate to probability?	Relative frequencies are the number of occurrences over the number of trials (for instance 12 rainy days out of 30=12/30). The more trials that are done, the more the relative frequency approaches a certain number. This is called the Law of Large Numbers.	67
5135097650	What is the probability an event will not occur/the probability of the complement?		68
5135097651	What does it mean for two events to be mutually exclusive? How do you find the probability of A or B occurring?	Two events being mutually exclusive means that both cannot occur. The probability of one or the other occurring for mutually exclusive events is simply P(A) + P(B)	69
5135097652	What is the rule for A or B occurring for two events which aren't mutually exclusive?	P(A∪B)=P(A)+P(B)-P(A∩B) where P(A∩B) denotes the probability of both events occurring.	70
5135097653	What does it mean for two events to be independent? What is the probability of two independent events occurring?	Independent events mean that one doesn't impact the other. To find the probability of two independent events occurring, you simply take the product of their separate probabilities. This can be extended to more than two independent events.	71
5135097654	What is conditional probability? What is its formula?	Conditional probability is the probability of something occurring given that something else has already occurred. Thus we have P(A\|B)=P(A∩B)/P(B) where P(A\|B) represents the probability of A given that B has occurred.	72
5135097655	How do you check for independence with conditional probabilities? When can events be both independent and mutually exclusive?	A and B are independent if P(A\|B)=P(A∩B)/P(B) = P(A)P(B)/P(B)=P(A). This can be used to check independence with probabilities. Mutually exclusive events are NOT independent except in one very special case. This is because mutually exclusive means that P(A∩B)=0, and independence means that P(A∩B)=P(A)P(B). Thus, the only way that both can be true is if P(A)=0 or P(B)=0.	73
5135097656	What is a good way to find probabilities (especially conditional probabilities)?	Via drawing tree diagrams	74
5135097657	What is a random variable? What is the concept of a discrete random variable? What is the concept of a continuous random variable?	A random variable is different numbers which take on different probabilities (for instance there might be a 0.5 chance of winning no prizes, a 0.25 chance of winning one prize, a 0.2 chance of winning two prizes, and a 0.05 chance of winning three prizes). A discrete random variable can only take on a countable number of values. A continuous random variable can take on all values in a given interval.	75
5135097658	What is a probability distribution for a random variable? What is a binomial random variable?	The probability distribution of a random variable is the chance that each outcome will occur. Binomial probabilities are situations where there are two outcomes, repeated a certain number of times. The probability must stay constant from occurrence to occurrence (ie. you could have lightbulbs with a probability of 0.1 of being defective and find out the probability a certain numbe are defective).	76
5135097659	What is the generic formula for a binomial probability? How do you find the probability of there being less than or more than a certain number of occurrences?	To find the probability of less than or more than a certain number of occurrences, you have to add together the probabilities of each occurrence happening.	77
5135097660	What is a geometric probability? How is it calculated?	A geometric probability is like a binomial probability except without a fixed number of trials. You want to find the probability that the first success is on x=K. The formula is as follows:	78
5135097661	What is the notation for something being in a binomial or geometric distribution?	Binomial: X∼B(n,p) where n is the number of trials and p is the probability of success. Geometric: X∼G(p) where p is the probability of success	79
5135097662	How do you simulate probabilities using a random digit table?	1) Set up a correspondence between outcomes and random numbers. 2) Give a procedure for choosing the random numbers. 3) Give a stopping rule 4) Note what is to be counted	80
5135097663	What is the generic formula for expected value, variance, and standard deviation of a given random variable?		81
5135097664	What are the formulas for expected value, variance, and standard deviation of binomial and geometric random variables?		82
5135097665	How do you perform a chi square goodness of fit test?	Ho: Distribution is as stated Ha: At least one value differs Degrees of freedom k-1 Test statistic chi square=Σ(observed-expected)²/expected Conditions: SRS All ≥1 ≤20% of expected values<5	83
5135097666	How do you perform a chi square independence or homogeneity test?	For independence: Ho: No relationship Ha: there is a relationship Conditions: SRS All ≥1 ≤20% of expected values<5 chi squared=(observed-expected)²/expected Expected counts: (column total)(row total)/n dfs=(row-1)(column-1) Chi Square homogeneity: Same as above except you're testing sameness instead of association/relationship	84
5135097667	How do you perform a 1 sample mean t or z test?	Ho: µ=a # Ha: µ≠a #, u>#,µ<# degrees of freedom: n-1 Conditions: 1)Representative data 2) Central Limit Theorem applies (sample size≥30 or distribution has normal boxplots). Test statistic: 1 sample mean z test is the exact same except we know σ and thus, the t becomes a z	85
5135097668	How do you perform a 2 sample mean t test?	Ho: µ1=µ2, Ha:µ1≠µ2, µ1>µ2,µ1<µ2 Conditions: 1)Representative data 2) Central Limit Theorem applies (sample size≥30 or distribution has normal boxplots). 3) Both groups are independent Find dfs on calculator Test statistic:	86
5135097669	How do you do a matched pairs t test?	dfs=pairs-1 Conditions: 1)Representative data 2) Central Limit Theorem applies (sample size≥30 or distribution has normal boxplots). 3) All pairs are independent Test statistic: where n represents the number of pairs	87
5135097670	How do you perform a 1 sample proportion z test?	Ho: p=a #, Ha: p≠a# OR p>a# OR p10n test statistic:	88
5135097671	How do you perform a 2 sample proportion z test?	Ho: p1=p2 Ha: p1≠p2,p1>p2,p110n, for both groups	89
5135097672	How do you perform a t test for slope?	Ho: B=0, Ha: B≠0, degrees of freedom #ordered pairs-2 Conditions: 1)SRS 2) Linear scatterplot 3) Residual plot indicating linear trend 4) Normally distributed residuals	90
5135097673	What is the meaning/form of a confidence interval?	A confidence interval is formed by an estimate±margin of error. The confidence level is the success rate for the method- the proportion of times repeated application of the method would capture the true population parameter.	91
5135097674	How do you find the required number of people for a given confidence level for an interval?	Set your t* or z* times the standard error equal to the margin of error required and solve for n. Note: assume the same number for 2 sample intervals and assume p hat and q hat are both 0.5 for proportions.	92
5135097675	How do you construct a 1 sample proportion z interval?	Conditions: Conditions: 1)SRS, 2) Normality: np≥10, nq≥10, 3)Independence: population>10n	93
5135097676	How do you construct a 2 proportion z interval:	Conditions: 1)SRS, 2) Normality: np≥10, nq≥10, 3)Independence: population>10n for both groups + groups independent to 1 another. If both ends are positive, p1>p2, both negative p1	94
5135097677	How do you construct a t-interval for slope	degrees of freedom #ordered pairs-2 Conditions: 1)SRS 2) Linear scatterplot 3) Residual plot indicating linear trend 4) Normally distributed residuals Interval: b±t*SEb	95
5135097678	What is the formula for Standard Error for a t-interval for slope?		96
5135097679	How do you construct a 1 sample mean t or z interval?	degrees of freedom: n-1 Conditions: 1)Representative data 2) Central Limit Theorem applies (sample size≥30 or distribution has normal boxplots). Test statistic: 1 sample mean z interval is the exact same except we know σ and thus, the t becomes a z	97
5135097680	How do you construct a 2 sample mean t interval?	Conditions: Conditions: 1)Representative data 2) Central Limit Theorem applies (sample size≥30 or distribution has normal boxplots). 3) Both groups are independent Find dfs on calculator Make comparison of means to one another in conclusion	98
5135097681	How do you construct a matched pairs t interval?	dfs=pairs-1 Conditions: 1)Representative data 2) Central Limit Theorem applies (sample size≥30 or distribution has normal boxplots). 3) All pairs are independent Test statistic: where n represents the number of pairs	99
5135097682	How do you test independence of two random variables?	If for all x and y values P(x\|y)=P(x) (this is equivalent to P(X∩Y)=P(x)P(y)	100
5135097683	What is the mean of the sum of two random variables?	µ=µ1±µ2	101
5135097684	What is the variance of the sum of two random variables? What is the condition for this to apply?	σ²=σ₁²+σ₂² . Two things to note: 1) Unlike means, variances are ALWAYS added regardless of whether the random variables are being added or subtracted 2) Random variables MUST BE independent in order to add the variances	102
5135097685	What are the properties of the normal curve?	It is a bell-shaped & symmetric curve for which the mean is the same as the median . There is one standard deviation to each point of inflection (points where the slope is steepest and concavity changes). The mathematical formula for the the normal curve is y=e^-z² where y represents the relative height above the z-score (relative height means the proportion of the height above the mean).	103
5135097686	How do you use the normal distribution to approximate the binomial?	You take a unit interval centered at the desired value. For instance, to determine the binomial probability of 8 successes, you can determine the normal probability of being between 7.5 and 8.5. If you want to determine the probability of at most 155 people supporting budget cuts in a survey of 250 people given that 60% of the population support budget cuts, you ca find the normal probability of being less than or equal to 155.5.	104
5135097687	When is the normal distribution a good approximation to the binomial?	When both np and n(1-p)=nq are greater than 10.	105
5135097688	What are some ways to check normality?	1) Draw a picture (dotplot, boxplot, stemplot, etc.). You could also use a normal probability plot for which a diagonal straight line shows normality.	106
5135097689	What is a population parameter? What is a statistic? What is a sampling distribution?	A population parameter is something that describes the whole population (for instance µ or σ). A statistic is based upon a sample (like s). The probability distribution of a statistic is a sampling distribution. This distribution is unbiased if its mean is equal to the population parameter.	107
5135097690	How do you use the distribution of sample means or sample proportions to calculate probabilities?	Use their distributions in addition to normality (or the the t distribution). For proportions the distribution is a mean of p and a standard deviation of sqrt (pq/n). For means the distribution is a mean of µ and standard deviation of σ/√n. For two proportions or means, simply add the variances of each and squareroot the sum.	108
5135097691	What does the Central Limit theorem state?	If n≥30, the distribution of sample means will fall into an approximately normal distribution with mean equal to µ and standard deviation of σ/√n (or s/√n for a sample).	109
5135097692	What is the t distribution?	The t distribution is for when we don't know the population standard deviation (which we don't the majority of the time). It has n-1 degrees of freedom- it has more room at the tail ends but as n approaches infinity becomes more and more normal. We need n greater than or equal to 30; alternatively, we have to check for skewness within our sample.	110
5135097693	What is the chi square distribution?	It is another distribution with degrees of freedom- it is skewed to the right but becomes more bell shaped/symmetric as the sample size increases. It has its peak at degrees of freedom -2 (except for df=1 for which it peaks at 0).	111
5135097694	What is standard error?	When we estimate using sample statistics instead of population parameters.	112
5135097695	What are type 1 errors, type 2 errors and power?	The type one error- α is the probability of rejecting a true null hypothesis. The type two error- β is the probability of failing to reject a false null hypothesis. The power (1-β) is the probability of rejecting a false null hypothesis.	113

Class Notes

Social Science

Math

Science

Fine Arts

Test Prep

Textbook Notes

Members Only

Forum

Blogs

Textbook Request

AP Statistics Exam Review Flashcards

Primary tabs

Need Help?

Need Notes?

About Course-Notes.Org

You are here

AP Statistics Exam Review Flashcards

Primary tabs

Need Help?

Need Notes?

About Course-Notes.Org