Читать книгу Statistics - David W. Scott - Страница 17
Problems
Оглавление1 1.1 A frequency histogram of continuous data is constructed by counting the number of data points that fall into equally spaced bins of width . is called the bin width. Typically the bin edges are 0, , , , and so on. If the bin count in the th bin is denoted by , then the frequency histogram is defined as(1.1) Show that the total area of the frequency histogram is , where . Hint: the histogram is made up of rectangular blocks of width and height .A probability histogram is defined to have total area of one. Show that the following definition of a histogram has area equal to one:(1.2)
2 1.2 One of the most famous epidemiological cases occurred in 1854 when Dr. John Snow successfully tracked down the source of an outbreak of cholera in the London suburb of SoHo. He mapped the households of some 500 victims over a 10‐day period that lived within a quarter of mile of each other. However, many tens of thousands had died of cholera in England during the prior two decades. Dr. Snow believed contaminated water was a primary cause. Just as in the Space Shuttle example, there are choices of an appropriate time interval and the geographical extent that can influence our conclusions. Using the descriptions and maps conveniently assembled at http://www.ph.ucla.edu/epi/snow/snowcricketarticle.html,discuss the evidence and choices that were and could have been made. Hint: these data have been conveniently collected in CRAN Library HistData by Friendly (2018). Look at the help file for dataset snow and its example code.
3 1.3The Tukey power transformation of a variable is for any non‐zero . To better understand why the is used in place of when , we consider the linear re‐expression of the Tukey transformation given by the formula (Box and Cox (1964))(1.3) Since (1.3) is when , use l'Hôpital's rule to find the limit transformation as . The scatter diagram using either formula for fixed non‐zero will be visually identical. Formula (1.3) is referred to as the Box–Cox transformation; see Figure 1.7.Sometimes the transformation is used in place of when and can take on the value 0. In this case, the original and transformed values of 0 are both 0. Try this form on the body–brain data and compare to Figure 1.4.Figure 1.7 Box–Cox transformation on natural and log scales.