Example: bachelor of science

Statistical Analysis Handbook - StatsRef

Dr M J de SmithStatistical AnalysisHandbookA comprehensive Handbook ofstatistical concepts, techniques andsoftware toolsStatistical Analysis HandbookA comprehensive Handbook of Statistical concepts, techniques andsoftware toolsby Dr M J de SmithAll rights reserved. No parts of this work may be reproduced in any form or by any means - graphic, electronic, ormechanical, including photocopying, recording, taping, or information storage and retrieval systems - without the writtenpermission of the that are referred to in this document may be either trademarks and/or registered trademarks of the respectiveowners. The publisher and the author make no claim to these every precaution has been taken in the preparation of this document, the publisher and the author assume noresponsibility for errors or omissions, or for damages resulting from the use of information contained in this document orfrom the use of programs and source code that may accompany it.

with internal and external hyperlinks (shown underlined). In this Handbook we utilize both forms of link, with external links often referring to a small number of well-established sources, including MacTutor for bibliographic information and a number of other web resources, such as Eric Weisstein's Mathworld and the statistics page of

Tags:

  Analysis, Handbook, Statistical, Statistical analysis handbook, Link, Underlined

Information

Domain:

Source:

Link to this page:

Please notify us if you found a problem with this document:

Other abuse

Transcription of Statistical Analysis Handbook - StatsRef

1 Dr M J de SmithStatistical AnalysisHandbookA comprehensive Handbook ofstatistical concepts, techniques andsoftware toolsStatistical Analysis HandbookA comprehensive Handbook of Statistical concepts, techniques andsoftware toolsby Dr M J de SmithAll rights reserved. No parts of this work may be reproduced in any form or by any means - graphic, electronic, ormechanical, including photocopying, recording, taping, or information storage and retrieval systems - without the writtenpermission of the that are referred to in this document may be either trademarks and/or registered trademarks of the respectiveowners. The publisher and the author make no claim to these every precaution has been taken in the preparation of this document, the publisher and the author assume noresponsibility for errors or omissions, or for damages resulting from the use of information contained in this document orfrom the use of programs and source code that may accompany it.

2 In no event shall the publisher and the author be liablefor any loss of profit or any other commercial damage caused or alleged to have been caused directly or indirectly by Analysis Handbook 2014 Dr M J de Smith : cover image: Polar bubble plot (source: MatPlotLib library, Python)Rear cover image: Florence Nightingale's polar diagram of causes of mortality, by date (source: Wikipedia)Citation: de Smith M J (2015) StatsRef : Statistical Analysis Handbook - a web-based statisticsresource. The Winchelsea Press, Winchelsea, UK 2014 Dr M J de Smith : of ContentsPart 1-4 How to use this 1-5 Intended audience and 1-6 Suggested 1-11 Notation and 1-14 Historical 1-20An applications-led disciplinePart IIStatistical 2-17 The Statistical 2-25 Misuse, Misinterpretation and 2-36 Sampling and sample 2-46 Data preparation and 2-48 Missing data and data 2-53 Statistical 2-54 Statistics in Medical 2-57 Causation.

3 2-61 Conduct and reporting of medical research .. 2-64 Randomized controlled 2-67 Case-control 2-69 Cohort 2-71 Meta analysisPart IIIS tatistical 3-3 Probability 3-5 Odds .. 3-6 Risks .. 3-8 Frequentist probability theory .. 3-12 Bayesian probability theory .. 3-17 Probability distributions .. 3-19 Statistical 3-22 Computational 3-28 Hypothesis 3-31 Types of 3-33 Statistical significance 2014 Dr M J de Smith : 3-36 Confidence 3-40 Power and 3-41 Degrees of 3-42 Non-parametric analysisPart IVDescriptive 4-3 Counts and specific 4-5 Measures of central 4-12 Measures of 4-22 Measures of distribution 4-26 Statistical 4-28 MomentsPart VKey functions and 5-3 Key 5-10 Measures of Complexity and Model 5-14 MatricesPart VIData transformation and 6-3 Box-Cox and Power 6-5 Freeman-Tukey (square root and arcsine) 6-8 Log and Exponential 6-11 Logit 6-13 Normal transform (z-transform)Part VIIData 7-3 Graphics and 7-19 Exploratory Data AnalysisPart VIIIR andomness and 8-4 Random 8-12 Random 8-18 Runs 8-19 Random 8-28 Markov 8-34 Monte Carlo 8-35 Monte Carlo Integration.

4 8-39 Monte Carlo Markov Chains (MCMC) Part IXCorrelation and autocorrelation9-2 2014 Dr M J de Smith : 9-3 Pearson (Product moment) 9-12 Rank 9-16 Canonical 9-20 Temporal autocorrelation .. 9-26 Spatial autocorrelation Part XProbability 10-6 Discrete 10-7 Binomial distribution .. 10-11 Hypergeometric distribution .. 10-14 Multinomial distribution .. 10-16 Negative Binomial or Pascal and Geometric distribution .. 10-18 Poisson distribution .. 10-24 Skellam distribution .. 10-25 Zipf or Zeta distribution .. 10-26 Continuous univariate 10-27 Beta distribution .. 10-29 Chi-Square distribution .. 10-32 Cauchy distribution .. 10-34 Erlang distribution .. 10-36 Exponential distribution .. 10-39F distribution .. 10-41 Gamma distribution .. 10-43 Gumbel and extreme value distributions .. 10-47 Normal distribution .. 10-52 Pareto distribution .. 10-54 Student's t-distribution (Fisher's distribution).

5 10-58 Uniform distribution .. 10-60von Mises distribution .. 10-65 Weibull distribution .. 10-67 Multivariate 10-71 Kernel Density EstimationPart XIEstimation and 11-4 Maximum Likelihood Estimation (MLE).. 11-9 Bayesian estimationPart XIIC lassical 12-3 Goodness of fit 12-5 Anderson-Darling .. 12-7 Chi-square test .. 12-10 Kolmogorov-Smirnov .. 12-13 Ryan-Joiner .. 12-14 Shapiro-Wilk .. 12-16 Jarque-Bera .. 12-17 Lilliefors .. 12-18Z-tests 2014 Dr M J de Smith : 12-19 Test of a single mean, standard deviation known .. 12-22 Test of the difference between two means, standard deviations known .. 12-23 Tests for proportions, p .. 12-26 Test of a single mean, standard deviation not known .. 12-28 Test of the difference between two means, standard deviation not known .. 12-30 Test of regression coefficients .. 12-32 Variance 12-33 Chi-square test of a single variance .. 12-35F-tests of two variances.

6 12-37 Tests of homogeneity .. 12-38 Bartlett's M 12-40 Levene-Brown-Forsythe 12-41 Fligner-Killeen 12-42 Wilcoxon rank-sum/Mann-Whitney U 12-46 Sign testPart XIIIC ontingency 13-4 Chi-square contingency table 13-6G contingency table 13-7 Fisher's exact 13-10 Measures of 13-11 McNemar's testPart XIVD esign of 14-9 Completely randomized 14-10 Randomized block 14-12 Latin squares .. 14-15 Graeco-Latin squares .. 14-16 Factorial 14-17 Full Factorial designs .. 14-19 Fractional Factorial designs .. 14-21 Plackett-Burman designs .. 14-23 Regression designs and response 14-25 Mixture designsPart XVAnalysis of variance and 15-10 Single factor or one-way ANOVA .. 15-15 Two factor or two-way and higher-way ANOVA .. 15-21 Non-Parametric 15-22 Kruskal-Wallis ANOVA .. 15-25 Friedman ANOVA test .. 15-27 Mood's Median 2014 Dr M J de Smith : XVIR egression and 16-7 Least 16-12 Ridge 16-13 Simple and multiple linear 16-27 Polynomial 16-29 Generalized Linear Models (GLIM).

7 16-31 Logistic regression for proportion 16-34 Poisson regression for count 16-39 Non-linear 16-43 Smoothing and Generalized Additive Models (GAM).. 16-45 Geographically weighted regression (GWR).. 16-50 Spatial series and spatial 16-57 SAR models .. 16-64 CAR models .. 16-70 Spatial filtering models Part XVIITime series Analysis and 17-7 Moving 17-12 Trend 17-18 ARMA and ARIMA (Box-Jenkins) 17-27 Spectral analysisPart 18-3 Distribution 18-25 Statistical 18-27 Test Datasets and data 18-39 Tests 18-40 Tests and confidence intervals for mean values .. 18-41 Tests for proportions .. 18-42 Tests and confidence intervals for the spread of datasets .. 18-43 Tests of randomness .. 18-44 Tests of fit to a given distribution .. 18-45 Tests for cross-tabulated count data .. 18-46R Code 18-47 Scatter Plot: Inequality .. 18-48 Latin Square ANOVA .. 18-51 Log Odds Ratio Plot .. 18-52 Normal distribution plot.

8 18-53 Bootstrapping 2014 Dr M J de Smith : 2014 Dr M J de Smith : IntroductionThe definition of what is meant by statistics and Statistical Analysis has changed considerably overthe last few decades. Here are two contrasting definitions of what statistics is, from eminentprofessors in the field, some 60+ years apart:"Statistics is the branch of scientific method which deals with the data obtained by counting ormeasuring the properties of populations of natural phenomena. In this definition 'naturalphenomena' includes all the happenings of the external world, whether human or not." ProfessorMaurice Kendall, 1943, p2 [MK1]"Statistics is: the fun of finding patterns in data; the pleasure of making discoveries; the import ofdeep philosophical questions; the power to shed light on important decisions, and the ability toguide in business, science, government, medicine, " Professor David Hand[DH1]As these two definitions indicate, the discipline of statistics has moved from being grounded firmly inthe world of measurement and scientific Analysis into the world of exploration, comprehension anddecision-making.

9 At the same time its usage has grown enormously, expanding from a relatively smallset of specific application areas (such as design of experiments and computation of life insurancepremiums) to almost every walk of life. A particular feature of this change is the massive expansion ininformation (and misinformation) available to all sectors and age-groups in society. Understanding thisinformation, and making well-informed decisions on the basis of such understanding, is the primaryfunction of modern Statistical objective in producing this Handbook is to be comprehensive in terms of concepts and techniques(but not necessarily exhaustive), representative and independent in terms of software tools, andabove all practical in terms of application and implementation. However, we believe that it is nolonger appropriate to think of a standard, discipline-specific textbook as capable of satisfying everykind of new user need.

10 Accordingly, an innovative feature of our approach here is the range offormats and channels through which we disseminate the material - web, ebook and in due course,print. A major advantage of the electronic formats is that the text can be embedded with internal andexternal hyperlinks. In this Handbook we utilize both forms of link , with external links often referringto a small number of well-established sources, notably MacTutor for bibliographic information and anumber of other web resources, such as Eric Weisstein's Mathworld and the statistics portal ofWikipedia, for providing additional material on selected topics. The treatment of topics in this Handbook is relatively informal, in that we do not providemathematical proofs for much of the material discussed. However, where it is felt particularly usefulto clarify how an expression arises, we do provide simple derivations.


Related search queries