130 INDEX causation, 46 center of a his<strong>to</strong>gram, dataset, <strong>or</strong> distribution, 15 Central Limit The<strong>or</strong>em, CLT, 110–114, 121 classes, in a his<strong>to</strong>gram, 12 Clemens, Samuel [Mark Twain], ix CLT, Central Limit The<strong>or</strong>em, 110–114, 121 coin biased, 64 fair, 64, 69, 70, 117 complement, E c , 56–58, 60 conditional probability, P (A | B), 67, 123 confidence interval, 110, 111 confidence interval f<strong>or</strong> μ X <strong>with</strong> confidence level L, 115, 116 confidence level, 115, 116 confirmation bias, 120 confounded, 102, 103 continuous random variable, 69, 90 control group, 100, 102 CORREL, c<strong>or</strong>relation coefficient in spreadsheets, 41 c<strong>or</strong>relation coefficient, r, 36 c<strong>or</strong>relation is not causation but it sure is a hint, 46 countably infinite, 69 critical value, z ∗ L , 115 data, not <strong>the</strong> plural of anecdote, 52 dataset, 7 default significance level, 120 definition, in ma<strong>the</strong>matics, 2 democracy, 96 density function, f<strong>or</strong> a continuous random variable, 74, 77, 112 dependent variable, 33 deterministic, 34 direction of a linear association, 35 disaggregation of experimental results, 105 discrete random variable, 69 disjoint events, 57, 59, 62, 63 Disraeli, Benjamin, ix distribution, 15, 70, 73, 112 do no harm, 104 double-blind experiment, 101 Empirical Rule, 83 empty set, ∅,56 epidemiology, 117 equiprobable, 65 ethics, experimental, 91, 104 even number, definition, 2 event, 55, 57–63 Excel [Microsoft], 41, 83, 113, 121–123 expectation, 72 expected value, 72 experiment, 99, 102, 103 experimental design, 52, 91 experimental ethics, 52, 91 experimental group, 100, 102 experimental treatment, 99 explanat<strong>or</strong>y variable, 33 extrapolation, 47 failure <strong>to</strong> reject H 0 , 118, 119, 123 fair coin, 64, 69, 70, 117 fair, in general, 65 fake news, ix false positive, 124 finite probability models, 63 first quartile, 22 first,donoharm,104 five-number summary, 27 free will, 104 frequency, 7 relative, 7 frequentist approach <strong>to</strong> probability, 53, 123 Gallup polling <strong>or</strong>ganization, 96 Gauss, Carl Friedrich, 78 Gaussian distribution, 78 genetics, 117 “given,” <strong>the</strong> known event in conditional probability, 67 Great Recession, 20 Hippocrates of Kos, 104 Hippocratic Oath, 104, 105 his<strong>to</strong>gram, 12, 13, 32 relative frequency, 14 <strong>How</strong> <strong>to</strong> Lie <strong>with</strong> <strong>Statistics</strong>, ix Huff, Darrell, ix Hygieia, 104
INDEX 131 hypo<strong>the</strong>sis, 124 alternative, H a , 118–123 null, H 0 , 118–121, 123, 124 hypo<strong>the</strong>sis test, 110, 111, 117, 121–123 imperfect knowledge, 66 income distribution, 20 independent events, 65, 67, 112, 117, 124 independent variable, 33 individual in a statistical study, 5 inferential statistics, 110 inf<strong>or</strong>med consent, 105 insensitive <strong>to</strong> outliers, 20, 23, 25, 26 Insert Trend Line, display LSRL in spreadsheet scatterplots, 42 Institutional Review Board, IRB, 106 inter-quartile range, IQR,23,25 interpolation, 43 intersection, ∩, 56,57,60,61 IRB, Institutional Review Board, 106 Kernler, Dan, 83 Law of Large Numbers, 94 leaf, in stemplot, 11 least squares regression line, LSRL, 40 left-skewed his<strong>to</strong>gram, dataset, <strong>or</strong> distribution, 21 LibreOffice Calc, 41, 42, 47, 83, 113, 121–123 lies, ix lies, damned, ix linear association, 35 lower half data, 22 LSRL, least squares regression line, 40 lurking variable, 102, 103 margin of err<strong>or</strong>, 116 mean, 18–21, 25, 31, 112, 122 population, 18, 93–95, 110–112, 114–116, 118, 120–122 sample, 18, 19, 23, 40, 93–95, 112–116, 118, 120–124 media, 28 median, 18, 20, 21, 23, 25, 27, 31, 124 Microsoft Excel, 41, 83, 113, 121–123 mode, 17, 19, 23, 31 MS Excel, 41, 83, 113, 121–123 multi-variable statistics, 2 multimodal his<strong>to</strong>gram, dataset, <strong>or</strong> distribution, 15 mutually exclusive events, 57 negative linear association, 35 news, fake, ix non-deterministic, 34 NORM.DIST, <strong>the</strong> cumulative N<strong>or</strong>mal distribution in spreadsheets, 83, 113, 121–123 N<strong>or</strong>mal distribution <strong>with</strong> mean μ X and standard deviation σ X , 77, 112 n<strong>or</strong>malcdf, <strong>the</strong> cumulative N<strong>or</strong>mal distribution on a TI-8x calculat<strong>or</strong>, 82, 121, 122 N<strong>or</strong>mally distributed <strong>with</strong> mean μ X and standard deviation σ X , N(μ X ,σ X ), 78, 112, 114, 115 not, f<strong>or</strong> an event, 56 null hypo<strong>the</strong>sis, H 0 , 118–121, 123, 124 objectivity, 52 observational studies, 103 observational study, 99, 102 one-tailed test, 122 one-variable statistics, 2 <strong>or</strong>, f<strong>or</strong> events, 56 outcome of an experiment, 55 outlier, 20, 25, 26, 28 bivariate, 45 p-value of a hypo<strong>the</strong>sis test, 118, 119, 123 Panacea, 104 parameter, population, 93–95, 110, 122, 124 personally identifiable inf<strong>or</strong>mation, PII, 105 pho<strong>to</strong>n, 54 pie chart, 9 pig, yellow, 17 PII, personally identifiable inf<strong>or</strong>mation, 105 placebo, 100 Placebo Effect, 100, 104 placebo-controlled experiment, 101 population mean, μ X , 18, 93–95, 110–112, 114, 118, 120–122 population of a statistical study, 5, 93, 112, 122, 124 population parameter, 93–95, 110, 122, 124 population prop<strong>or</strong>tion, 93–95 population size, N,6
- Page 1:
Lies, Damned Lies, or Statistics: H
- Page 5 and 6:
Contents Release Notes Preface iii
- Page 7:
CONTENTS vii 5.3.2. Informed Consen
- Page 10 and 11:
x PREFACE of this book to help you
- Page 12 and 13:
The first instinct of the scientist
- Page 15 and 16:
CHAPTER 1 One-Variable Statistics:
- Page 17 and 18:
1.2. VISUAL REPRESENTATION OF DATA,
- Page 19 and 20:
1.2. VISUAL REPRESENTATION OF DATA,
- Page 21 and 22:
1.3. VISUAL REPRESENTATION OF DATA,
- Page 23 and 24:
1.3. VISUAL REPRESENTATION OF DATA,
- Page 25 and 26:
1.3. VISUAL REPRESENTATION OF DATA,
- Page 27 and 28:
1.4. NUMERICAL DESCRIPTIONS OF DATA
- Page 29 and 30:
1.4. NUMERICAL DESCRIPTIONS OF DATA
- Page 31 and 32:
1.4. NUMERICAL DESCRIPTIONS OF DATA
- Page 33 and 34:
1.5. NUMERICAL DESCRIPTIONS OF DATA
- Page 35 and 36:
1.5. NUMERICAL DESCRIPTIONS OF DATA
- Page 37 and 38:
1.5. NUMERICAL DESCRIPTIONS OF DATA
- Page 39 and 40:
1.5. NUMERICAL DESCRIPTIONS OF DATA
- Page 41 and 42:
EXERCISES 31 EXERCISE 1.5. Twenty s
- Page 43 and 44:
CHAPTER 2 Bi-variate Statistics: Ba
- Page 45 and 46:
2.2. SCATTERPLOTS 35 2.2. Scatterpl
- Page 47 and 48:
2.3. CORRELATION 37 Many electronic
- Page 49 and 50:
CHAPTER 3 Linear Regression Quick r
- Page 51 and 52:
3.1. THE LEAST SQUARES REGRESSION L
- Page 53 and 54:
3.2. APPLICATIONS AND INTERPRETATIO
- Page 55 and 56:
3.3. CAUTIONS 45 3.3. Cautions 3.3.
- Page 57 and 58:
3.3. CAUTIONS 47 3.3.3. Extrapolati
- Page 59:
EXERCISES 49 Exercises EXERCISE 3.1
- Page 62 and 63:
It is something of an aphorism amon
- Page 64 and 65:
54 4. PROBABILITY THEORY the flippi
- Page 66 and 67:
56 4. PROBABILITY THEORY There is o
- Page 68 and 69:
58 4. PROBABILITY THEORY Sometimes,
- Page 70 and 71:
60 4. PROBABILITY THEORY Venn diagr
- Page 72 and 73:
62 4. PROBABILITY THEORY A ∩ B, w
- Page 74 and 75:
64 4. PROBABILITY THEORY The final
- Page 76 and 77:
66 4. PROBABILITY THEORY 4.2. Condi
- Page 78 and 79:
68 4. PROBABILITY THEORY nor are {1
- Page 80 and 81:
70 4. PROBABILITY THEORY In fact, l
- Page 82 and 83:
72 4. PROBABILITY THEORY FACT 4.3.8
- Page 84 and 85:
74 4. PROBABILITY THEORY addition,
- Page 86 and 87:
76 4. PROBABILITY THEORY FACT 4.3.1
- Page 88 and 89:
78 4. PROBABILITY THEORY If X is a
- Page 90 and 91: 80 4. PROBABILITY THEORY Let’s ma
- Page 92 and 93: 82 4. PROBABILITY THEORY One nice t
- Page 94 and 95: 84 4. PROBABILITY THEORY In order t
- Page 96 and 97: 86 4. PROBABILITY THEORY What we kn
- Page 98 and 99: 88 4. PROBABILITY THEORY EXERCISE 4
- Page 100 and 101: 90 4. PROBABILITY THEORY EXERCISE 4
- Page 102 and 103: 92 5. BRINGING HOME THE DATA guidel
- Page 104 and 105: 94 5. BRINGING HOME THE DATA Simila
- Page 106 and 107: 96 5. BRINGING HOME THE DATA The ha
- Page 108 and 109: 98 5. BRINGING HOME THE DATA Note a
- Page 110 and 111: 100 5. BRINGING HOME THE DATA and s
- Page 112 and 113: 102 5. BRINGING HOME THE DATA 5.2.5
- Page 114 and 115: 104 5. BRINGING HOME THE DATA 5.3.
- Page 116 and 117: 106 5. BRINGING HOME THE DATA In th
- Page 119 and 120: Part 3 Inferential Statistics
- Page 121 and 122: CHAPTER 6 Basic Inferences The purp
- Page 123 and 124: 6.1. THE CENTRAL LIMIT THEOREM 113
- Page 125 and 126: 6.2. BASIC CONFIDENCE INTERVALS 115
- Page 127 and 128: 6.3. BASIC HYPOTHESIS TESTING 117 6
- Page 129 and 130: 6.3. BASIC HYPOTHESIS TESTING 119 N
- Page 131 and 132: 6.3. BASIC HYPOTHESIS TESTING 121 M
- Page 133 and 134: The two hypotheses then are 6.3. BA
- Page 135 and 136: EXERCISES 125 Exercises EXERCISE 6.
- Page 137: Bibliography [Gal17] Gallup, Presid
- Page 142 and 143: 132 INDEX population standard devia