Where should a retail business occur or locate within a region? What would that trade area look like? Should a retail expansion occur and how would that affect sales of other nearby existing locations? Would a new retail location have the right demographic or socio-economic customer base to be profitable? These are important questions for retailers to consider. Within the evolving landscape of GIS, there is more geospatial data than ever before about the potential customer. In retail, the application of maps and mapping technology is growing to include commercial real estate, logistics, and marketing to name a few. There has been an increased momentum across commercial applications for geospatial technologies delivered in an easy to comprehend format for a variety of end users.
Lucas, B. (2018). GIS&T and Retail Business. The Geographic Information Science & Technology Body of Knowledge (1st Quarter 2018 Edition), John P. Wilson (ed). DOI:10.22224/gistbok/2018.1.12.
Business Intelligence (BI) - the application of geographic knowledge, information, and techniques to assist businesses in making better decisions (Rice, 2016).
Locational Intelligence (LI) - the broad application of geographic knowledge, information, and techniques in the service of improved decision making for organizations of all kinds, including governments and non-profit agencies, as well as for-profit businesses (Rice, 2016).
Market/Trade Area - the geographic area from which a business/retailer location generates the majority (usually 75%) of its customers (Kures, 2011).
2. Importance of GIS&T in Retail
In the world of real estate, it has been said that the key to a successful site is location, location, location (Time Magazine, 1952). Most retail organizations use GIS technologies as a tool to integrate their location strategy. Many organizations go beyond the standard data analysis to do additional spatial analysis to understand the customer base. GIS allows organizations to expand beyond the basic real estate questions, by understanding spatial behavior patterns, understanding supply chain/logistics networks, or project specific potential changes based on store improvements through predictive analytics.
3. Factors that Influence a Trade Area
A trade area is generally defined as the farthest distance a customer is willing to travel to purchase retail goods and services (Thrall, 2002). The size and shape of a trade area depends on the goods and services offered and the proximity to competing retail markets (Kures, 2011). A trade area analysis provides the foundation for understanding the geographic extent and characteristics of a retailer’s customer, and evaluating market penetration (Myles, 2003).
Trade areas most often extend beyond municipal boundaries, depending on a retail cluster’s strength (pulling power) in relation to competing retailer’s in other cities (Myles, 2003). A retail cluster may serve a number of different trade areas depending on the types of products/merchandise sold or customer markets served (Jones, 1990).
Factors that determine the size of a trade area include a community’s population, the retailer draw, and the proximity to other competing retailers that draw on the same customer base (Kures, 2011). Trade area factors include:
Retail trade areas generally fall under two categories: A local/convenience trade area (i.e. mini-mart) or a comparison/destination trade area (i.e. super-regional shopping mall or a car dealership) (Myles, 2003).
Within a local trade area, there can be a mix of customer types who shop at a retailer (Myles, 2003). Some common market segments include:
5. Defining a Trade Area using Geographic Data
Trade areas by definition are geographic (Thrall, 2002). A trade area defines where potential customers live or work and how far they are likely to travel to a particular shopping area. Geographic data, such as highways, distance, and physical barriers, are useful in defining trade areas (Myles, 2003). This includes Simple Rings, Data-Driven Rings, Drive-Time Rings, Thiessen Polygons, Network Partitioning, and Reilly's “law” of retail gravitation/Huff Model.
5.1 Simple Rings
Simple rings are one of the easiest methods in approximating a trade area. Simple Rings can be drawn from any point location (i.e. a retail site or shopping district), and provide a simple approximation of a trade area (Jones, 1990). Figure 1 illustrates 0.5, 1.0, and 1.5 mile rings around McDonald’s locations in the Tri-Cities area in Washington. This is appropriate for this size community, as a 1.5 mile trade area is typical for most suburban oriented McDonald’s locations.
Figure 1. Simple rings of expanding sizes.
This approach works for communities that have a fairly uniform geography with a connected road network and minimal physical barriers (Kures, 2011). When defining a trade area for a small community surrounded by other similar-sized communities that are spaced 40 miles apart (Iowa is a good example), a ring radius of 15-20 miles may be a reasonable proxy for a trade area. However, there are a few caveats to consider when using Simple Rings, as they cannot weigh the pulling power of a retailer or recognize travel barriers. Subsequently, rings are useful for only simple analysis. The Simple Rings weakness is demonstrated with a physical barrier bisecting the area between Pasco and Richland/Kennewick.
5.2 Data-Driven Rings
Data-Driven Rings are based on a set of specific criteria of a retailer such as sales per square foot, the total volume of sales, store size (usually measured in gross or net square feet), or the number of competing stores (Caliper, 2017). These rings can be used to define trade areas by adjusting the size of the ring based on the retailers’ metrics, such as population needed to serve a location. The greater the data value, the larger the ring, which in turn affects the size of a trade area. Figure 2 illustrates Data-Driven Rings around locations, using an 8,000 person buffer (i.e. a hypothetical McDonald’s franchise requirement) to delineate what that trade area might look like. Overlapping franchise trade areas in the Richland area could present a problem with some locations cannibalizing sales from nearby locations.
Figure 2. Data-driven rings, capturing 8,000 people within each ring.
While Data-Driven Rings may be useful in comparing competitive shopping districts, they may not have a direct relationship with a trade area defined by customer origin or based on actual customer location data. Furthermore, data such as total sales or sales per square foot may be difficult to obtain.
5.3 Thiessen Polygons
The Thiessen Polygons method is named in honor of Alfred Thiessen, who pioneered this method of creating areas out of spatially distributed data (Jones, 1990). This method assumes that the customer will travel to the closest retailer based on a “straight-line” distance (Kures, 2011). Depending on the location, each polygon defines an area of influence around each location point, so that any location inside the polygon is closer to that location than any of the other location points. The trade area is shaped by lines drawn exactly halfway between each of the competing retailer locations. Figure 3 illustrates Thiessen Polygons, and what an “equal” trade area might look like.
Figure 3. Thiessen Polygons
As illustrated, the Thiessen Polygon method primary weaknesses is recognizing physical barriers. The McDonalds locations in Kennewick off of State Route 240 has a trade area that extends into West Pasco along the Columbia River. While this location would be the closest location to some West Pasco customers, it is not a realistic location for a customer in West Pasco to patronize. This customer would most likely patronize the location off of I-182, as the Columbia River physical barrier is too circuitous and lengthy.
5.4 Drive-Time Rings
Drive-Time Rings provide a practical method in determining a trade area. This method relies heavily on travel time using the built-in road networks to determine the extent of a trade area (Kures, 2011). Travel time is calculated using distances along actual streets and highways, combined with their respective allowed travel speeds (Caliper, 2017). Drive-Time Rings are important, as customers often make retail shopping decisions based on the street and highway network when deciding on a location to shop. Figure 4 illustrates the Drive-Time analysis using 3, 6, 9, 12, and 15 minute travel times around three Walmart locations.
Figure 4. Drive-Time Rings
Since the road network is being used to derive the Drive-Time Rings, physical barriers are able to be taken into consideration. This example uses the point locations of the three Walmart stores to validate if a majority of the region is adequately served by the three locations and if each location is no more than 12-15 minutes from a customer’s home or work place. Not shown are the locations of Walmart’s competitors, such as Target or Kmart.
One could also calculate specific demographic and market characteristics around specific locations via an overlay (Caliper, 2017). The strength of Drive-Time Rings is the ability to describe the convenience of a location in terms of travel time versus a distance measurement, which can be useful in determining a delivery time or determine an appropriate delivery fee. For a store located in a high density neighborhood or congested area, travel time may be a more accurate description of a trade area.
5.5 Network Partitioning
While similar to Drive-Time Rings, Network Partitioning allows the user to create zones or territories based on the street network, with each road section (link) assigned to the closest or most expedient driving distance or time (Caliper, 2017). Network Partitioning is often used by municipalities to determine the placement of fire stations by dividing a city into zones based on the response time from all of the fire stations (Caliper, 2017). Each zone would be comprised of the streets for which its fire station has the fastest response time. The same concept can be applied to retail GIS.
The primary difference between Network Partitions and Drive-Time Rings, is that Network Partitions can be weighted by a value assigned to the point feature used in the analysis (Caliper, 2017). Figure 5 illustrates Network Partitioning bands around three Walmart locations, using the square footage of each store as the weighting field. In this example, all three locations have a tire & battery center and pharmacies; however, if one of the locations did not have a tire & battery center, that specific attraction could be used as part of a weighed algorithm to give one location a slight preference over another location.
Figure 5. Network Partitioned Bands.
As illustrated, the primary weakness of this method is when the Network Partition band extends across a physical barrier, the physical barrier will generally nudge people to the location on their side of the physical barrier.
5.6 Reilly's "Law" of Retail Gravitation and the Huff Model
Gravity modeling provides an additional method for examining competition and potential shopping patterns around a retail location (Kures, 2011). Other trade area approximation methods discussed do not offer any prediction capabilities. The gravity model attempts to predict the likelihood that a customer will shop at a given location based on distance and the attraction that location offers (Kures, 2011). The model can account for the distribution and attractiveness of competing locations, along with distance the customer will need to travel (Church, 2008).
An early attempt at predicting shopping potential was in 1931 by William J. Reilly. According to Reilly's "law," customers are willing to travel longer distances to larger retail centers given the higher attraction they present to the customer (Reilly 1931). Based on Reilly's formulation, the attractiveness of a center becomes the size (mass) within the physical law of gravity (Reilly 1931).
Equation 1.
Like most other modeling, Reilly’s “law” assumes the geographic study area is flat and there are no physical barriers, to modify a customer's decision of where to travel to buy goods (Church, 2008). Alternatively, Reilly’s “law” also assumes customers are otherwise indifferent to surroundings, including socioeconomic issues such as crime, safety or sales tax rates, as these are not taken into account (Church, 2008).
In 1963, David Huff took Reilly's "law," to the next level, and added a probabilistic feature to the model (Huff, 1964). The Huff Model has been applied in predicting customer spatial behavior. The probability (Pij) that a customer located at i will choose to shop at store j is calculated according to the following equation (Huff 1964).
Equation 2.
As shown, the equation calculates the probability of a person living in one area and may shop in another area, as well as the ability to attract customers from surrounding areas. The α parameter is an exponent to which a store’s attractiveness value is raised, to account for nonlinear behavior of the attractiveness variable (Esri, 2008). The β parameter models the rate of decay in the drawing power as potential customers are located further away from the store (Esri, 2008). An increasing exponent would decrease the relative influence of a store on more distant customers.
When using the Huff Model, the trade area is expressed as a continuous line of probabilities for each location in the analysis (Church, 2008). The point of indifference becomes the point of equal probability that a customer will visit one location or another (Huff, 1964). The advantage of the Huff's Model is that it leaves room for customer choice.
Figure 6 illustrates the model without a parameter estimation or customer spotting data. A typical value for an attractiveness parameter α is 1, and for the distance decay parameter β value of 2. With the default values, this method is generally better than other methods based on the spatial monopoly concept (Jones, 1990).
Figure 6. Huff Model probability analysis using store square footage as a measure of attraction.
As illustrated, store square footage was used as an attractiveness variable, which can be easily determined. A customer probability surface grid size will vary depending on the geographic extent and interpolation efficiently. A potential customer is assumed to be located within every grid cell, so an even distribution of population. The probability of a customer patronizing a selected location is positively related (directly proportional) to the attractiveness of location (store size) and negatively related (inversely proportional) to the distance between the location and the customer, given the presence of competing locations (Huff, 1963). The Huff Model does not take of physical barriers or the road network into consideration.