Читать книгу The Big R-Book - Philippe J. S. De Brouwer - Страница 235
Tidy Data
ОглавлениеTidy data is in essence data that is easy to understand by people and is formatted and structured with the following rules in mind.
1 a tibble/data-frame for each dataset,
2 a column for each variable,
3 a row for each observation,
4 a value (or NA) in each cell (a “cell” is the intersection between row and column).
The concept of tidy data is so important that we will devote a whole section to tidy data (Section 17.2 “Tidy Data” on page 275) and how to make data tidy (Chapter 17 “Data Wrangling in the tidyverse” on page 265). For now, it is sufficient to have the previous rules in mind. This will allow us to introduce the tools of the tidyverse first and then later come back to making data tidy by using these tools.