The purpose of geographic information systems is to represent and analyze geographic phenomena, including entities. An important part of any entity is the set of relationships it has with other entities; such relations are crucial to the nature of geography and geographic inquiry. Relationships are commonly stored in spatial databases in a number of ways, and a variety of tools are available to analyze them.
Plewe, B. (2025). Relationships between Geographic Phenomena. The Geographic Information Science & Technology Body of Knowledge (Issue 2, 2025 Edition), John P. Wilson (ed.). DOI: 10.22224/gistbok/2025.2.3.
Relationships are a crucial part of any ontology or conceptual model of geographic entities, because they are a crucial part of real-world geographic phenomena and of geographic inquiry. Part of what makes location (and thus geography) important is that things in the world are connected to each other in various ways. In fact, it has long been argued that spatial relationships are even more fundamental to understanding geography than the absolute metric space that forms the basis of GIS (Hartshorne 1949, 120; O’Sullivan 2024, 24). Spatial cognition and spatial language are replete with relations (Kitchen and Blades 2002); if someone asks you where you live, you will not give a measured coordinate, but will describe your home as it relates to other places: “it is in country A, in city B, which is near city C; south of landmark D, along street E.”
The importance of relationships has long been enshrined as half of the core geographic concept of site and situation, in which situation is the set of characteristics of a place in relation to other geographic entities (Murphey 1966, 12; Bednarz et al 2024, 9). In fact, many core concepts of geography and spatial thinking are founded on relationships, such as distance, interaction, networks, hierarchies, proximity, connectivity, movement, and diffusion (NCGE 2012). These concepts form the basis for many of the tools and techniques of spatial analysis (O’Sullivan and Unwin 2003, 35-49).
As a random example, perhaps we want to describe Tarrant County, Texas. Among its potentially useful characteristics would be several connections to other geographic entities (i.e., its situation):
These relationships are quite different from each other, and would be used in different ways in geography and geospatial techniques and technologies. We can group the wide variety of relationships into several categories:
Some of these relationship types have received extensive research in geography and geographic information science (especially topology), while others are barely recognized and do not even have a standard term (e.g., definitional, fiat). However, all of them are very common in geographic discourse and geospatial data. Because spatial topological and metric relationships, and the modeling thereof, are well covered elsewhere in the GIS&T Body of Knowledge, this text will focus less on these two types and more on the others.
These types can be organized in several ways. Some are primarily spatial, while others are primarily temporal (see Relationships between Space and Time). Similarity and Functional relationships are not necessarily measured in either dimension, so we might call them adimensional (unless we are studying their spatial patterns, such as with geostatistics).
Some are inherent or identity-based, when the existence of either or both entities are dependent on the relationship (Hornsby and Egenhofer 2000); others are contingent, when there is not such dependence (i.e., they “just happen” to be related). The types can thus be grouped as follows:
Table 1. Relationships among Entities
Spatial | Temporal | Adimensional | |
---|---|---|---|
Contingent-Qualitative | Topological | Time-Topological | Similarity Function |
Contingent-Quantitative | Metric, Similarity (geostatistics) | Time-Metric | Degree of Similarity, Functional Amount |
Inherent | Meronomic, Definitional | Genealogical, Causal, Time0-Meronomic | Taxonomic, Instaniation, Fiat |
These various types of relationships are not independent of one another. In fact, much of geographic inquiry involves the search for patterns and causal processes between them. For example, how does the distance between two places influence the amount of functional interaction (e.g., trade, migration) between them (Hanks 2011, 319)? Does that level of interaction influence the degree of similarity (e.g., having a shared language or religion), or could the causal arrow point the other way?
Given the ubiquity of relationships, it should not be surprising that they have been studied in many fields beyond geography and geographic information science. In mathematics, the disciplines of set theory, graph theory, topology, algebra, and formal logic all have significant models of relationships that can be applied to geographic situations.
The philosophical discipline of ontology, the attempt to understand the nature of real-world phenomena, has studied relationships at length, especially mereology, the study of part-whole relationships (Simons 1987, Herre 2010). In computing, ontological philosophies are encoded as formal ontologies and data models, both documentations of how phenomena should be represented in a system (or in general), of which relationships form a crucial part (see Foundational Ontologies). For example, the SUMO general formal ontology (one of several attempts to formally classify all phenomena, not just geography) recognizes several types of “relations,” including meronomic, topological, causal, metric, and temporal.
Ontologies that focus on geographic phenomena have also recognized the importance of relationships. For example, the spatio-temporal ontology of Bittner et al (2006) focuses on meronomy and taxonomy, but acknowledges the existence of topological, temporal, and metric relationships. Tambassi (2016) discusses topology and mereology as requisite components of any geo-ontology.
One useful formal theory that comes from mathematics (primarily set theory) classifies relationship types according to basic logical deductions that can be made from them. This includes three types (Worboys & Duckham 2004, 95):
Because relationships are an important part of domains that produce data, not just geography, they have also been studied in computer science. An important tool that our discipline has adopted is the use of data modeling tools to design a database, such as Entity-Relationship (E-R) diagrams (Chen 1976) and the Unified Modeling Language (UML) class diagram (OMG 2017). These diagrams visually depict the kinds of things that will be represented in a database, and the real-world relationships between them that will also be stored (see Figure 2).
As with many aspects of ontology, it is easy to see the above theorizing as an esoteric philosophical exercise with little practical value. However, relationships of all of these types are ubiquitous in geospatial data, so their intentional management has practical value.
As a case study, let us look at one of the most prevalent types of GIS database, the cadastre (also known as a parcel fabric). In the United States, such data is almost always maintained by county governments, while other countries maintain land records at the provincial (e.g., Canada, Australia, Germany, Switzerland) or national (e.g., most of Europe) level. This type of GIS data is so ubiquitous because it is the basis for several crucial functions, including the assessment of property taxes, urban planning, land use zoning, recording land transactions, and the guarantee of title (i.e., proving that person A owns the property so they can legally sell it to person B).
To accomplish these purposes, a typical parcel GIS needs to store several relationships:
These relationships may be stored in a number of ways in a spatial database, which is usually based on a relational database model (see Relational Database Management Systems (DBMSs) and their Spatial Extensions).
In addition to the relational databases that have traditionally been the basis of GIS, other architectures have become increasingly popular in recent years (see NoSQL Databases). One example, the graph database is especially relevant because it is fundamentally organized around the storage of relationships as edges between nodes, the entities (O’Sullivan 2024, 41). Another similar architecture, the semantic web based on the Resource Description Framework (RDF) data model, also uses a model that lends itself to the explicit representation of relationships. RDF stores data as a set of subject-predicate-object statements or triples; for a relationship, the subject and object are both entities, such as “parcel:12-2-10-23 rel:created_from parcel:12-2-10-1.”
Topological relationships are not usually explicitly stored in GIS databases when they are purely contingent. However, some kinds of inherent relationships can manifest as topology, especially definitional (boundary line features should line up with the edges of polygons) and meronomic (parts should be within their wholes), and it is often useful to represent these in geospatial data. GIS&&T Body of Knowledge topics The Topological Model (forthcoming) and Spatial Network Modeling address the various techniques for modeling topological data.
While explicit storage in a database is the common solution for representing inherent relationships, it is not as common for contingent relationships, largely for the simple reason that they may not be known a priori. Instead, they are usually discovered using the tools of spatial analysis. One of the oldest tools for doing this is polygon overlay (first introduced in the late 1960s), which uses two or more polygon datasets to create a new one based on the topological relationships between their features (see Overlay). In a similar fashion, spatial query searches for features based on their topological relationships to other features (see Spatial Queries), and spatial join merges the attribute tables of two layers based on their topological relationships. Some current GIS software also allows for spatial joins based on contingent metric relationships (i.e., distance).
Perhaps no branch of GIS analysis encapsulates more of the scope of spatial relationships than network analysis, commonly used for managing networks ranging from telecommunications to utilities to transportation (O’Sullivan 2024, 145). Network datasets require information about topology to know how elements are connected to each other. The analysis of movement and flow through the network is typically based on the functional and metric relationships between places (nodes) in the network, such as finding the shortest route from point A to point B.
The acknowledgment of spatial relationships (especially spatial autocorrelation) is what differentiates spatial statistics and geostatistics from other forms of statistical analysis. The fact that an observed value at a sample location in space is rarely independent of its surroundings (including nearby sample data) breaks one of the fundamental assumptions of most statistical methods; spatially aware methods such as kriging and geographically weighted regression take advantage of spatial similarity and other relationships to produce more accurate results (Fotheringham et al 2002; Wu and Kemp, 2019; Hoffman and Kedron, 2023; Sachdeva and Fotheringham, 2020).
During the Quantitative Revolution of the 1950s and 1960s, geographers adopted several mathematical models of relationships from physics, such as the inverse square law that models the strength of everything from light to gravity based on distance. In geography, it has been found that many kinds of quantitative relationships between two entities, especially similarity and functional integration, are strongly negatively correlated with the the distance between them, a concept commonly called distance decay (see Proximity and Distance Decay). That is, if one person lives further from a store than another person, they are less likely to shop there, assuming all other factors such as brand preference are the same. Unlike physical relationships such as gravity and magnetism, most geographic relationships do not follow this pattern exactly, but enough for it to be a useful basis for modeling. These kinds of relationships are generally studied using the techniques of spatial interaction modeling (see Spatial Interaction).
At first glance, relationships may not seem central to the world of GIS, with its basis in absolute coordinate locations and its focus on how to represent and analyze distinct entities and property fields. Frankly, there is not an extensive GIScience literature studying them as a distinct subject matter; rather, they are often taken for granted as a mundane aspect of geographic phenomena and geospatial data.
However mundane, they are ubiquitous and crucial to understanding and managing our geography. Any exercise to design GIS databases or analytical procedures is well served by being intentional about how to model relationships.
List the relationship characteristics of a given geographic phenomenon.
Describe the types of relationships that would be important to model for a given geographic task.
Determine the appropriate data models and architecture(s) for modeling the particular types of relationships needed in a newly designed geospatial database.
Design and carry out analytical and modeling procedures that effectively incorporate and evaluate relationships.