2012583830 | Standardizing | We ________ to eliminate units | 0 | |
2012583831 | Standardized Value | Value found by subtracting the mean and dividing by the standard deviation. | 1 | |
2012583832 | Shifting | Adding a constant to the mean, the median, and the quartiles, but does not change the standard deviation or IQR. | 2 | |
2012583833 | Rescaling | Multiple each data value by a constant multiplies both the measures of position and the measures of spread by that constant. | 3 | |
2012583834 | Normal Model | A useful family of models for unimodel, symmetric distributions. | 4 | |
2012583835 | Parameter | A numerically valued attribute of a model. | 5 | |
2012583836 | Statistic | A value calculated from data to summarize aspects of the data. | 6 | |
2012583837 | Z-score | Tells how many standard deviations a value is from the mean. | 7 | |
2012583838 | Boxplot | Displays the 5-number summary as a central box with whiskers that extend to the non-outlying data values. | 8 | |
2012583839 | Far Outlier | If a point is more than 3.0 IQR from either end of the box in a boxplot. | 9 | |
2012583840 | Comparing Distributions | Consider: shape, center, spread | 10 | |
2012583841 | Comparing Boxplots | Compare Shapes; Compare Medians; Compare IQRS; Check for outliers | 11 | |
2012583842 | Timeplot | Displays data that change overtime. | 12 | |
2012583843 | Standard Deviation | Square Root of the Var. | 13 | |
2012583844 | Variance | The sum of squared dev. from the mean, divided by the count minus 1. | 14 | |
2012583845 | Resistant | A calculated summary is said to be ________ if outliers have only a small effect on it. | 15 | |
2012583846 | Mean | Found by summing all the data values and dividing by the count. | 16 | |
2012583847 | 5 Number Summary | Reports the min., Q1, the median, Q3 and the max. | 17 | |
2012583848 | Percentile | The # that falls above i% of the data. | 18 | |
2012583849 | Interquartile Range (IQR) | The difference between the 1st and 3rd Quartiles. | 19 | |
2012583850 | Range | The difference between the lowest and highest values in a data set. Range = Max-Mir | 20 | |
2012583851 | Median | Middle value, if it is not an even #, you take the average of the 2 middle #'s. | 21 | |
2012583852 | Outliers | Extreme values that don't appear to belong with the rest of the data. Any point more than 1.5 IQR from either end of the box in a Boxplot. | 22 | |
2012583853 | Skewed | Distribution is _________ if it's not symmetric and 1 tail stretches out farther than the other. | 23 | |
2012583854 | Tails | The parts that typically trail off on either side. | 24 | |
2012583855 | Symmetric | 2 Halves on either side of the center look approximately like mirror images of each other. | 25 | |
2012583856 | Uniform | A distribution that's roughly flat. | 26 | |
2012583857 | Unimodal | 1 mode | 27 | |
2012583858 | Bimodal | 2 modes | 28 | |
2012583859 | Multimodal | More than 2 modes | 29 | |
2012583860 | Mode | A hump or local high point in the shape of the distribution of a var. | 30 | |
2012583861 | Spread | A numerical summary of how tightly the values are clustered around the center. Measures: IQR, Standard Dev. | 31 | |
2012583862 | Center | The place in the distribution of a variable that you'd point to if you wanted to attempt the impossible by summarizing the entire distribution with a single #. Measures: Mean, Median | 32 | |
2012583863 | Shape | To describe the _____ of a distribution, look for: single vs. mult. modes; symmetry vs skewness; outliers and gaps. | 33 | |
2012583864 | Dotplot | Graphs a dot for each case against a single axis. | 34 | |
2012583865 | Stem and Leaf Display | Shows quantitative data values in a way that sketches the distribution of the data. | 35 | |
2012583866 | Gap | A region of the distribution where there are no values. | 36 | |
2012583867 | Histogram | Uses adjacent bars to show the distribution of a quantitative var. | 37 | |
2012583868 | Frequency Table (Relative Frequency Table) | Lists the categories in a categorical var. and gives the count of percentages of each categories observation. | 38 | |
2012583869 | Distribution | The _____________ of a var. gives: possible values of the variance; the relative frequency of each value. | 39 | |
2012583870 | Area Principle | In a statistical display, each data value should be represented by the same amount of area. | 40 | |
2012583871 | Bar Chart | Shows a bar whose area represents the count (or percentage) of observations for each category of a categorical variance. | 41 | |
2012583872 | Pie Chart | Show how a "whole" divides into categories by showing a wedge of a circle whose area corresponds to the proportion in each category. | 42 | |
2012583873 | Contingency Table | Displays counts and, sometimes, percentages of individuals falling into named categories on 2 or more var. | 43 | |
2012583874 | Marginal Distribution | In a contingency table, the distribution of either var. alone. | 44 | |
2012583875 | Conditional Distribution | The distribution of a var. restricting the who to consider only a smaller group of individuals. | 45 | |
2012583876 | Independence | Variables are ________ if the conditional distribution of one variables is the same for each category of the other. | 46 | |
2012583877 | Segmented Bar Chart | Displays the conditional distribution of a categorical var. within each category of another var. | 47 | |
2012583878 | Simpson's paradox | When averages are taken across different groups, they can appear to contradict the overall averages. | 48 | |
2012583879 | Context | Tells who was measured, what was measured, how the data were collected, where the data was collected, and when and why the study was performed. | 49 | |
2012583880 | Data | Systematically recorded info., whether #'s or labels, together with its contact. | 50 | |
2012583881 | Data Table | An arrangement of data in which each row represents a case and each column represents a variable. | 51 | |
2012583882 | Case | Individual about whom or which we have data. | 52 | |
2012583883 | Population | All the cases we wish we knew about. | 53 | |
2012583884 | Sample | The cases we actually examine in seeking to understand the much larger population. | 54 | |
2012583885 | Variable | Holds info about the same characteristic for many cases. | 55 | |
2012583886 | Units | A quantity or amount adopted as a standard of measurement, such as dollars, hours, or grams. | 56 | |
2012583887 | Categorical Variable | A variable that names categories (words/numbers) | 57 | |
2012583888 | Quantitative Variable | A variable in which the numbers act as numerical values - always have units. | 58 | |
2027682233 | Random Phenomenon | If we know what outcomes could happen, but not which particular valves will happen. | 59 | |
2027682234 | Trial | A single attempt or realization of a random phenomenon. | 60 | |
2027682235 | Outcome | The value measured, observed, or reported for an individual instance of that trial. | 61 | |
2027682236 | Event | A collection of outcomes. | 62 | |
2027682237 | Sample Space | The collection of all possible outcome values. | 63 | |
2027682238 | Law of Large Numbers | States that the long run-run relative frequency of repeated independent events gets closer and closer to the true relative frequency as the number of trials increases. | 64 | |
2027682239 | Independence | If one event occurs it does not change the probability thta that the other event occurs. | 65 | |
2027682240 | Empirical Probability | The probability comes from the long-run relative frequency of the event's occurence. | 66 | |
2027682241 | Theoretical Probability | When the probability comes from a model. | 67 | |
2027682242 | Personal Probability | When the probability is subjective and represents your personal degree of belief. | 68 | |
2027682243 | Observational Study | A study based on data in which no manipulation of factors has been employed. | 69 | |
2027682244 | Retrospective Study | An observational study in which subjects are selected and then their previous conditions or behaviors are determined. | 70 | |
2027682245 | Prospective Study | An observational study in which subjects are followed to observe future outcomes. | 71 | |
2027682246 | Experiment | Manipulates factor levels to create treatments. Randomly assigns subjects to these treatment levels. Compares the responses of the subject groups across treatment levels. | 72 | |
2027682247 | Factor | A variance whose levels are manipulated by the experiment. | 73 | |
2027682248 | Response | A variance whose values are compared across different treatments. | 74 | |
2027682249 | Experimental Units | Individuals on whom an experiment is performed. | 75 | |
2027682250 | Level | The specific values that the experimenter chooses for a factor. | 76 | |
2027682251 | Treatment | The process, intervention, or other controlled circumstance applied to randomly assigned experimental units. | 77 | |
2027682252 | Priciples of Experimental Design | Control; Randomize; Replicate; Block | 78 | |
2027682253 | Control Group | The experimental units assigned to a basseline treatment level. | 79 | |
2027682254 | Placebo Effect | The tendency of many human subjects to show a response even when adminstered a placebo. | 80 | |
2027682255 | Blinding | Any individual associated with an experiment who is not aware of how subjects have been allocated to treatment groups. | 81 | |
2027682256 | Placebo | A treatment known to have no affect. | 82 | |
2027682257 | Confounding | Levels of one factor are associated with the levels of another factor in such a way that their effects cannot be separated. | 83 | |
2027682258 | Sample Survey | A study that asks questions of a sample drawn from some population in the hope of learning something about the entire population. | 84 | |
2027682259 | Bias | Any systematic failure of a sampling method. | 85 | |
2027682260 | Randomization | The best defense against bias; each individual is given a fair, random chance of selection. | 86 | |
2027682261 | Sample Size | Number of individuals in a sample represents the population. | 87 | |
2027682262 | Census | Sample that consists of the entire population. | 88 | |
2027682263 | Population Parameter | Numericlaly valued attribute of a model for a population. | 89 | |
2027682264 | Representative | A sample is said to be ___________ if the stats computed from it accurately reflect the corresponding population parameters. | 90 | |
2027682265 | Simple Random Sample (SRS) | A sample in which each set of "n" elements in the population has an equal chance of selection. | 91 | |
2027682266 | SRS | Simple Random Sample | 92 | |
2027682267 | Sampling Frame | List of individuals from whom the same is drawn. | 93 | |
2027682268 | Sampling Variability | The natural tendency of randomly drawn samples to differ, one from another. | 94 | |
2027682269 | Stratified Random Sample | A sampling design in which the population is divided into several subpopulations, or strata, and random samples are then drawn from each stratum. | 95 | |
2027682270 | Cluster Sample | A sampling design in which entire groups are chosen at random. | 96 | |
2027682271 | Multistage Sample | Sampling schemes that combine several sampling methods. | 97 | |
2027682272 | Systematic Sample | A sample drawn by selecting individuals systematically from a sampling frame. | 98 | |
2027682273 | Pilot | A small trial run of a survey to check whether questions are clear. | 99 | |
2027682274 | Voluntary Response Bias | Bias introduced to a sample when individuals can choose on their own whether to participate in the sample. | 100 | |
2027682275 | Convenience Sample | Consists of the individuals who are conveniently available to sample. | 101 | |
2027682276 | Undercoverage | A sampling scheme that biases the sample in a way that gives a part of the population less representation. | 102 | |
2027682277 | Nonresponse Bias | Bias introduced when a large fraction of those sampled fails to respond. | 103 | |
2027682278 | Response Bias | Anything in a survey design that influences response. | 104 | |
2027682279 | Random | If we know the possible values it can have, but not which particular value it takes. | 105 | |
2027682280 | Simulation | Models a real-world situation by using random-digit outcomes to mimic the uncertainty of a response variance of interest. | 106 | |
2027682281 | Simulation Component | A component uses equally likely random digits to model simple random occurrences whose outcomes may not be equally likely. | 107 | |
2027682282 | Trial (Chapter 11) | The sequence of several componets representing events that we are pretending will take place. | 108 | |
2027682283 | Re-expression | We _______ data by taking the logarithm, the square root, the reciprocal, or some other mathematical operation on all values of a variance. | 109 | |
2027682284 | Ladder of Powers | Places in order the effects that many re-expressions have on the data. | 110 | |
2027682285 | Correlation Coefficient | Numerical measure of the direciton and strength of a line or association. | 111 | |
2027682286 | Scatterplot | Shows relationship between two quantitative variables measured on the same cases. | 112 | |
2027682287 | Lurking Variable | A variable other than x and y that simultaneously affects both variables, accounting for the correlation between the two. | 113 | |
2027682288 | Model | An equation of formula that simplifies and represents reality. | 114 | |
2027682289 | Linear Model | An equation of a line. To interpret a linear model, we need to know the variables and their units. | 115 | |
2027682290 | Predicted Value | The value of y^ found for a given x-value in the data. This is found by substituting the x-value in reg. equation. | 116 | |
2027682291 | Residuals | Difference between data values and the corresponding values predicted by the regression model. Observed Value minus predicted value (e= y-y^) | 117 | |
2027682292 | Least Squares | Specifics the unique line that minimizes the variance of the residuals or, equivalently, the sum of the squared residuals. | 118 | |
2027682293 | Regression to the mean | Because correlation is always less than 1.0 in magnitude, each predicted y^ tends to be fewer standard deviation from its mean than its corresponding x was from its mean. | 119 | |
2027682294 | Intercept | The intercept b (little o), gives a starting value in y-units. It's the y^ - value when x = 0. | 120 | |
2027682295 | Extrapolation | Although linear models provide an easy way to predict values of y for a given value of x, it is unsafe to predict for values of x far from the ones used to find the linear model equation. | 121 | |
2027682296 | Leverage | Data points whose x-value are far from the man of x, are said to exert _____________ on a linear model. | 122 | |
2027682297 | Influential Point | If omitting a point from the data results in a very different regression model. | 123 | |
2031495160 | Disjoint(mutually exclusive) | 2 events share no outcomes in common. | 124 |
Statistics Flashcards
Primary tabs
Need Help?
We hope your visit has been a productive one. If you're having any problems, or would like to give some feedback, we'd love to hear from you.
For general help, questions, and suggestions, try our dedicated support forums.
If you need to contact the Course-Notes.Org web experience team, please use our contact form.
Need Notes?
While we strive to provide the most comprehensive notes for as many high school textbooks as possible, there are certainly going to be some that we miss. Drop us a note and let us know which textbooks you need. Be sure to include which edition of the textbook you are using! If we see enough demand, we'll do whatever we can to get those notes up on the site for you!