Читать книгу Using Stata for Quantitative Analysis - Kyle C. Longest - Страница 6

Оглавление

Preface

Motivation and Purpose

The motivation for this book came from a series of personal experiences. First, as a graduate student, I remember literally lying awake at night dreading the idea of using a computer program to conduct statistical analyses. The first statistics course I took required Stata to complete the assignments and the final research project. This necessity was so overwhelming at the time, in part, because there did not seem to be any straightforward, concise texts explaining the basics of Stata. During my time in graduate school, I came to be very familiar with Stata, even to the point that I developed a serious passion for both learning Stata and teaching it to students who were facing the same fears I once did. When I first began teaching a course on quantitative analysis, I was hoping to use Stata as a significant portion of the classroom experience and requirements. Yet in a somewhat mirrored experience from when I was a student, I soon realized that there still was not a manageable introductory text on the use of Stata for quantitative research.1 Thus, I sought to contribute to filling this void by providing a straightforward, applied introduction to using Stata.

1 Assuredly, there are several very good and effective texts on learning Stata. Virtually all of these, however, are aimed at experienced users or are so detailed and long that they are not helpful for a typical classroom in which teaching Stata is not the primary purpose.

This book will be most beneficial to readers who are novices when it comes to Stata but are at least in the early stages of learning strategies for conducting quantitative analysis. It does assume that the reader has a working knowledge of basic statistical techniques and terminology. The organization and coverage of the book is guided by the content and ordering of topics found in most introductory social statistics textbooks. In this manner, it can serve as an excellent companion, for either a class or a self-learner, to such a textbook.

To be clear, this book should not be used to learn statistics or quantitative analysis. Some basic assumptions and explanations are provided, but these should not be used in place of a more thorough coverage of each of the analytic strategies. The statistical grounding for this book is based primarily on Frankfort-Nachmias and Leon-Guerrero’s Social Statistics for a Diverse Society (2010). The definitions and interpretations of the specific measures and tests are based on those presented in this text. Of course, any inaccuracies or mistakes are solely mine.

Also, this book does not attempt to cover every aspect of each Stata command that is introduced. More experienced users undoubtedly know shortcuts or alternative methods for the techniques that are presented. The given description has been geared to introduce complete novice users to Stata. This targeted audience requires that the explanation start with the basics before jumping into the advanced features. The presented commands and procedures are discussed because they are the most simplified strategies that effectively accomplish the pertinent goals.

About the National Study of Youth and Religion

The data for this book come from the National Study of Youth and Religion (NSYR). The NSYR is a longitudinal, nationally representative telephone survey of U.S. young adults. There are three waves of data, all of which are publically available.

The variables that are used in the examples throughout this book come from the second follow-up survey of 2,532 young adults completed in the fall of 2007. At the time of this survey, the respondents were all between the ages of 18 and 24. Each respondent completed a computer-assisted telephone interviewing survey that lasted approximately an hour. This data set covers a broad array of topics, making it possible, across examples, to use variables pertinent to several disciplines. For example, it contains several standard self-esteem measures of interest to psychologists, a wide array of questions on religion useful for sociologists, numerous questions on finances (e.g., debt) applicable to economics, and measures of substance use behaviors that would be pertinent to social work or health researchers. The full data set and documentation can be downloaded from the Association of Religion Data Archives (www.thearda.com/Archive/Files/Descriptions/NSYRW3.asp).

The first wave of the survey sampled 3,290 U.S. English- and Spanish-speaking teenagers, ages 13 to 17. The sampling and survey were conducted from July 2002 to August 2003 using random-digit dialing, drawing on a sample of randomly generated telephone numbers representative of all noncellular phone numbers in the United States. The overall response rate of 57% for the first survey is lower than desired, but it is similar to other current nationally based surveys using similar methodologies. Further comparisons of the NSYR data with 2002 U.S. Census data on households and with nationally representative surveys of adolescents—such as Monitoring the Future, the National Household Education Survey, and the National Longitudinal Study of Adolescent Health—confirm that the NSYR provides a nationally representative sample of U.S. teenagers aged 13 to 17 years and their parents without identifiable sampling or nonresponse biases (for details, see Smith & Denton, 2005). The follow-up sample used in the data sets comes from this initial sample of 3,290 teens. To obtain more information regarding the technical details and documentation of the NSYR, please visit www.youthandreligion.org/.

A Note on Versions

All the commands and examples for this book were produced using Stata 15 for Windows. The primary commands and options are similar for older versions, dating back until at least Stata 9. There were, however, a few changes between Stata 11 and Stata 12 and then a few more substantial ones with Stata 13. Most of these changes do not affect the actual functionality but rather deal with convenience and appearance. In fact, most of the substantive differences that the new users would encounter fall under the topics covered in Chapter 1.

For users of Stata 13 or prior, I would suggest obtaining the second edition of this text as it presents the material specifically for this version. It also includes the introductory topics specifically for both Stata 12 and Stata 11. Again the key differences exist in the material presented in Chapter 1. So all users should be able to rejoin this edition at Chapter 2. From that point on, all of the commands and strategies are equivalent across versions (although the appearance of the screenshots may be slightly different).

The vast majority of the commands presented are similar for Stata for Mac as well. The appearance and wording of some icons as well as the pathways for the point-and-click menus may be slightly different for a Mac operating system.

A Note on Notation

Certain text in this book will be presented in a slightly different font. Generally, anything that you enter into or that comes out of Stata will be denoted with the typewriter (i.e., Courier New) font. This font will be used to indicate variable names in a particular data set, such as gender or ids. It will also be used to show the display from the Stata Results window (if the actual screenshot is not shown).

This font will be used to denote a command that is entered into the Command window to perform a given operation. Additionally, if these commands are presented by themselves within a sentence, they will be set apart by a dash pre and post (e.g., -replace-) so that they are not confused with a variable name.

The majority of this book discusses the syntax command interface (i.e., the Command window) aspect of Stata. But there will be times when the menu, point-and-click interface is described. Menus (e.g., File), clickable buttons (e.g., OK), or keys on the keyboard (e.g., Enter) will be denoted with the bold font.

Finally, Stata is a case-sensitive program, meaning that all commands and variable names must be typed exactly as they are shown. For the purposes of this book, this sensitivity means that at times, the capitalization may not follow typical grammatical conventions. For example, if a variable name starts a sentence and that variable name is lowercase, then that sentence will start with a lowercase letter.

References

Frankfort-Nachmias, C., & Leon-Guerrero, A. (2010). Social statistics for a diverse society (6th ed.). Thousand Oaks, CA: Pine Forge Press.

Smith, C., & Denton, M. L. (2005). Soul searching: The religious and spiritual lives of American teenagers. New York, NY: Oxford University Press.

Using Stata for Quantitative Analysis

Подняться наверх