Читать книгу GIS Research Methods - Steven J. Steinberg - Страница 93
Develop a hypothesis
ОглавлениеResearch is guided by a question or issue that a researcher or group of researchers thinks is worth investigating or needs to be addressed in some way. Regardless of your motivation, your research question is best guided by a hypothesis. A hypothesis is simply an idea about the research that seems reasonable and that you think explains the situation at hand, or it might be the underlying basis of debate about the topic. In other words, the hypothesis of your research is an educated guess about what you might find, but the validity of your hypothesis has yet to be proven.
When developing your hypothesis, think about the most important aspects of your proposed topic. A hypothesis is an educated guess about relationships you might find between variables. Once you can state your ideas about the research or analysis in one sentence, you will know that you have narrowed your topic enough to create a hypothesis. Your hypothesis should clearly state the main idea or the crux of your research project. In other words, a hypothesis is simply a statement that conveys what you might find or the state of some set of events. A hypothesis presents something that can be tested, explored, and further investigated.
The following hypotheses incorporate a spatial question and can be effectively analyzed in a GIS:
A greater percentage of poor people than of wealthier people live in polluted environments.
Newcomers who move to a rural area tend to cluster in areas with people of similar socioeconomic background.
Rates of AIDS are lower among individuals with a college education.
Individuals with strong social ties in their local communities are more likely to participate in local governance.
A final comment on the hypothesis is that it establishes what you believe to be important about your research project. As a researcher, you can think about how GIS might add to the understanding of your hypothesis by identifying key variables (figure 3.8).
Figure 3.8 A chemical factory releases smoke into the atmosphere. Nickolay Khoroshkov/Shutterstock.com.
Every hypothesis should have a single dependent and one or more independent variables. In the first hypothesis listed previously, a GIS could be used to take census data overlaid with pollution data from the Environmental Protection Agency’s Toxic Release Inventory database. Socioeconomic information would come from the census data, and you could define (according to whatever criteria you deem appropriate) what areas are polluted (figure 3.9).
Figure 3.9 Facilities that are registered on the Environmental Protection Agency’s Toxic Release Inventory are recorded as point locations in the database. Information regarding the specific pollutants released is also recorded. Here these are coded on an ordinal scale from low toxicity (white circles) to high toxicity (black circles). Assume that the polygons on the map represent the census tracts with a specified percentage of the population classified as poor (as defined from census data). We could use the GIS to determine if there is a statistical difference based solely on the point locations of facilities releasing toxic waste (do they fall in the census tract or not?) or based on other spatial concepts such as nearness or adjacency.
If you consider the spatial distribution of the pollution sources in figure 3.9, several things might be apparent. First, the majority of the mid- and high-pollution facilities are somewhat clustered to the left of center on the map in polygons 1, 2, and 7. It would be logical to assume that if the center of the map (polygon 4) were the historic city center, industrial facilities, particularly the older, higher-polluting facilities, would have grown up around the city center. Over time, newer, cleaner facilities might be expected to grow on the edges of the city. Second, you will notice that facilities are generally located on the left side of the map, with no facilities sited on the right side of the map (polygons 3, 5, and 8). Last, we might note that although the central polygon (polygon 4) has no points inside it, and polygon 7 has only one light polluter, several other sources of pollution are just across the line in the adjacent polygons (polygons 1, 2, and 6). We will further discuss this example as we go through the remaining stages of research. As an exercise, you might also consider the other three hypotheses presented earlier or others you are interested in exploring. What considerations might be important as you operationalize this for your spatial analysis of the data in each of these situations?