A similar CLV management framework was also be implemented at a fashion retailer in a B2C setting. Like the IBM case study, this case study also highlights the power of the CLV metric and its related strategies in maximizing the retailer's profitability. This also showcases that CLV scores can be measured and effectively applied by the retailer to maximize both customer and store profitability. For instance, by performing segmentation, profile, and impact analysis in conjunction with the CLV score of the customers, retailers can uncover valuable customer-level insights thereby enabling them to deploy various customer management strategies. Similarly, CLV computation of customers by different stores can enable retailers to uncover some interesting and often counter-intuitive store-level insights leading to store management strategies.
The focal firm mentioned in this study is a fashion retailer, whose name is not revealed for confidentiality reasons. The retailer, which sells apparel, shoes, and accessories for both men and women, has a chain of 30 stores across the USA, with a relatively larger concentration of stores on the east and west coasts. Each of the stores was more or less of the same size and located in regions having similar demographics, thus there is no need to normalize the store's profit potential for these factors.
In essence, the key objective of the retailer was to maximize its profitability. Specifically, the study sought to answer the following research questions for the fashion retailer in particular and retailers in general [4]:
1. What is the right metric to manage customer programs, for example, customer loyalty programs? Can CLV outperform traditional metrics?
2. How can the CLV concept be applied to measure and manage customer value?
3. How can the CLV concept be applied to manage store performance?
The customer data from the fashion retailer were utilized for this study. The total dataset comprised 303 431 customers after excluding 4% outliers from the original dataset that were consistently very high- or very low-value purchasers and probably would skew the sample's distribution. These 303 431 customers were then divided into two cohorts, as shown in Figure 8.5. Cohort 1, called ‘Model Building Sample,’ consisted of customers who made at least one purchase prior to December 31, 2003. Cohort 2, called ‘Model Validation Sample,’ consisted of customers who made at least one purchase prior to December 31, 2004 and who are not included in Cohort 1. Cohort 1 was used to calibrate the model and Cohort 2 was used to validate the calibrated model. There were 242 745 customers in Cohort 1 and 60 686 customers in Cohort 2.
The process adopted to develop the CLV-based framework for maximizing the retailer profits is illustrated in Figure 8.6. The idea was to use CLV as a basis for comparing the profit each customer brings to the firm. The aggregate profits all customers bring to a firm constitute its overall profit level. Therefore, to maximize the firm's overall profitability, the task was to implement marketing strategies that can maximize the profitability of each individual customer, or maximize the CLV score of each customer.
The framework provided in Figure 8.6 can be classified into three main stages: (1) model development, (2) model implementation, and (3) tactics and strategy recommendations. The following subsection reviews all these three stages and explains the process adopted in implementing this approach.
A clear picture of the process to implement the CLV management framework at the fashion retailer is provided in Figure 8.7. There are three main stages. In the first stage, just like the IBM case study, a CLV model was proposed and demonstrated as the most suitable model to measure future profitability for the retailer. In the second stage, based on the proposed CLV model, the CLVs of individual customers were measured, based on which the analyses of customers and stores were conducted. In the last stage, tactics and strategic recommendations were made based on the result of findings in the previous stages.
The model development stage involves two phases:
In this phase, the specific goals were to: (1) demonstrate the shortfall of traditional metrics, and (2) propose the CLV model as the more suitable model to measure the future profitability of a firm. The backward-looking characteristics of the traditional metrics were addressed, with specific reference to poor loyalty and profitability relationships shown by past research. Also, the forward-looking characteristic of the CLV metric was highlighted because it incorporated the future values/profitability that a customer can bring to a firm. Finally, the CLV metric was suggested as the superior metric to measure customer value and firm profitability.
In this phase, the desired output was the development of the CLV computation model. The always-a-share approach was adopted due to the non-contractual nature of the retailing sector. Based on this approach, a CLV model was formulated using discounted future value methodologies, which incorporated predictions of three key components, namely, purchase frequency, contribution margin, and marketing cost, as follows:
where:
GCi,t = the gross contribution from customer i on purchase occasion t
ci,m,l × xi,m,l = the marketing cost (unit marketing cost × number of contacts) to customer i, in channel m, in time period l
frequencyi = the purchase frequency, computed as 12/expinti because months are used as the units of analysis (expinti is the expected interpurchase time for customer i)
r = discounted rate for money
n = number of years to forecast
Ti = total number of purchases made by customer i
i = index for customer
l = index for time period
t = index for time
m = index for channel
In this stage, the suggested CLV model was used to compute the actual CLV score of each individual customer, and conduct analyzes of the customers and stores, with the goal of maximizing the profitability of the retailer. Specifically, it involves the three following phases:
This phase involved the prediction of the three main components of CLV model – purchase frequency, contribution margin, and marketing cost – by using the generalized gamma distribution, panel-data regression, and discounted future value methodologies respectively. Below is the detailed explanation of each step of this phase.
To get the desired output, prediction of the purchase frequency for each customer, the purchase frequency was modeled using the likelihood function, generalized gamma distribution, and inverse generalized gamma distribution.
This model is given by
where:
f(tij | α, λi, γ) = the density function for the generalized gamma distribution, in other words the probability of the jth purchase for customer i occurring at time period t, given α, λi, γ.
S(tij | α, λi, γ) = the survival function for the generalized gamma distribution, in other words the probability of the jth purchase for customer i occurring at a time period greater than t, given α, λi, γ.
cij = the censoring indicator, where
cij = 1: if the jth interpurchase time for the ith customer is not right-censored
cij = 0: if the jth interpurchase time for the ith customer is right-censored
Φijk = the probability of observation j for the ith customer belonging to subgroup k
α, λi, γ = the parameters of the generalized gamma distribution
According to Equation 8.10, purchase frequency was computed as 12/expinti because months were used as the units of analysis, where expinti was expected interpurchase time for customer i, or the expected time until the next purchase. Given that the generalized gamma distribution was used to model the interpurchase time and the likelihood function in Equation 8.11, this expected time until the next purchase would be given by
where:
α, γ = the parameters that establish the shape of the interpurchase time distribution
λi = the individual specific purchase rate parameter, which in the generalized gamma distribution is assumed to be a random draw from an inverse generalized gamma distribution (IGG), specifically
where:
υ, γ = the shape and scale parameters of the IGG distribution respectively
γ = common parameters to both the generalized gamma distribution and the IGG distribution
k = the number of subgroups the population was assumed to consist of
Φik = the probability of a customer belonging to each subgroup, and also provided the mass point (i.e., weight) for each subgroup. Φik was modeled as a probit function of the antecedents and covariates of the purchase frequency. Specifically, the link function can be represented as , where xij are the antecedents and covariates of purchase frequency for customer i on purchase occasion j, and βi are the customer-specific response coefficients.
The model framework presented in Equation 8.12 resembles a hierarchical Bayes formulation of the concomitant continuous mixture model. In order for the issue of endogeneity to be addressed, the one-period lagged value for all antecedents and covariates was used in the analysis [5, 6]. In addition, the performance of the proposed model framework was also evaluated with several alternative model formulations such as: (a) the simple generalized gamma distribution model, (b) the continuous mixture model, and (c) the latent class mixture model. Upon evaluation, the proposed model framework outperformed all the alternative model specifications. Finally, to account for any extraneous factors not accounted for by the antecedent and covariate set, the logarithm of the lagged interpurchase time was used. The specification of the model allowed the individual customer-level coefficients to be estimated for the influence of the various covariates on the probability of a customer belonging to a particular subgroup, hence the interpurchase times.
The finding showed a weak correlation between loyalty and observed future profitability. As given in Table 8.6, the correlation coefficients between observed future profitability and consistency of purchase frequency and RFM were both less than 0.3, while the correlation coefficient between observed future profitability (OFP) and relationship duration was the highest, 0.37. This result reinforced the argument that the retailer cannot afford to use traditional loyalty metrics to manage the customer relationship, which were backward looking. To manage both loyalty and profitability simultaneously, the retailer needed to adopt a forward-looking metric such as the CLV metric.
The gross contribution for each customer was modeled using the panel-data regression method. Gross contribution margin was defined as the revenue that the customer provides to the firm, whenever a purchase is made, minus the cost of goods sold. While the cost of goods sold does not change very much over time, the revenue a customer provides is likely to change over time. As a result, the gross contribution model was given by
where:
GCij = the gross contribution for customer i on purchase occasion j, measured in dollars
Xi,j−1 = the independent variable relevant to customer i on purchase occasion j − 1
ei,j = the error term
n = the number of independent variables
k = the index for independent variables
i = the index for the customer
β0, βj = the coefficients
The performance of the proposed model framework, Equation 8.13, was evaluated with various alternative functional forms for gross contribution margin model, including (a) using log-margin as a dependent variable, and (b) a polynomial specification for the lagged dependent variables to allow for nonlinearity. The specification of Equation 8.13 provided the best in-sample fit and predictive accuracy. Any issues related to endogeneity were addressed by using lagged endogenous variables wherever necessary. The results were tested for multicollinearity using the variance inflation factor and eigenvalues of the principal component analysis of the predictors.
The result of the gross contribution model is provided in Table 8.7. This model provided a good model fit (R2 = 0.71) and the mean absolute deviation (MAD) was 65 for the estimation sample and 70 for the holdout sample. The existence of a unit root in the gross contribution provided by a customer was rejected by the Dickey–Fuller test. On comparison, the proposed model outperformed the log-linear model. In addition, no significant nonlinear effects were observed for any of the parameters in the dataset.
Variables | Standardized coefficients |
Lagged contribution margins | 0.83a |
Lagged cumulative revenue from other channels | 0.76a |
Amount ($) spent on product category A on previous purchase occasion | 0.61b |
Time elapsed since last purchase | 0.11a |
Square of time elapsed since last purchase | −0.88b |
a Significant at α = 0.01 level. | |
b Significant at α = 0.05 level. |
Table 8.8 summarizes the results for the frequency model. The model had a log marginal likelihood (LMD) of 3.4 × 104 and MAD = 2.4.
Variables | Parameter estimates |
Cross-purchase | 6.9a |
Number of distinct channels | 6.1b |
Number of returns | 3.1a |
Number of returns square | −1.7b |
Log of lagged interpurchase time | −5.2a |
a Posterior sample values between the 2.5th and 97.5th percentile do not contain zero. | |
b Posterior sample values between the 0.5th and 99.5th percentile do not contain zero. |
Table 8.9 provides a comparison of the performance of the above-proposed frequency model and alternative competing models described in earlier sections. As illustrated in Table 8.9, the proposed frequency model provided a better LMD and forecasting performance than the competing models. In essence, the concomitant mixture framework appears to be better suited to model abrupt changes in the interpurchase times, which was observed in the dataset.
There are several ways to compute the marketing cost for each customer. For example, it may be calculated as an aggregate measure by dividing the total marketing budget by the number of customers. A more sophisticated approach would entail calculating the total marketing cost separately for each customer based on the various marketing channels expected to be used to interact with that customer. This approach to computing the marketing costs is given by
where:
MCi = the total marketing cost for customer i
ci,m,l = the unit marketing cost for customer i, in channel m, in time period l
m = the marketing channel (for this retailer it was the Web and catalog)
r = the discount rate
l = the index for time
n = the number of years to forecast
Equation 8.14 gives retailers the means to calculate the marketing cost not only on an individual customer basis, but also on the basis of individual marketing channels adopted for the same customer. This is very useful given the fact that direct marketing costs could vary widely across customers and across communication channels. For example, digital communications would cost much less than direct mailing. Furthermore, with the increasing trend of multi-channel marketing to the same customer, computing marketing costs as shown in Equation 8.14 can help retailers accurately arrive at a fair estimate of the expected marketing cost per customer. Accordingly, the expected marketing cost was discounted by l years to arrive at the present value of market cost.
For the purpose of computation, it was assumed that the direct marketing cost for each customer will be the same for the next three years. This assumption was applied because (1) the customer historical database shows that the historical direct marketing cost per customer remained more or less constant for the retailer, and (2) the retailer's directives were followed strictly. According to the retailer, the best prediction of future marketing cost of each customer over the next three years could be taken as three times the direct marketing cost in the most recent year (i.e., 2004). In this particular case study, the available marketing cost in the retailer's database was the total marketing cost (across all channels) for each customer.
After the three main components of the CLV model were predicted, Equation 8.10 was applied to calculate the CLV score of each individual customer.
The desired output of this phase was to answer the following questions:
Here, the CLV scores computed in Phase 3 were ranked in descending order and grouped into deciles so that each decile represents 10% of the customer base. Figure 8.8 shows the distribution of CLV scores across 10 deciles.
The result provided interesting insights. Most companies believe in the Pareto principle, or the 80:20 rule, which indicates that the top 20% (or 30%) of the customers typically generate 80% (or 70%) of the revenue or profits [7]. However, contrary to this belief, as shown in Figure 8.8 the top 20% (decile 1) were actually accounting for 95% of profits. This is because the bottom 30% had negative CLV.
Based on the above distribution of CLV scores, the entire customer group was divided into three segments: high CLV (comprising deciles 1 and 2), medium CLV (comprising deciles 3, 4, and 5) and low CLV segments (comprising deciles 6, 7, 8, 9, and 10). It should be noted that this form of segmentation is sensitive to the CLV scores of individual customers, thus the relative size of the segments will be subject to change over time if there are any changes to the CLV distribution.
In this section, the impact of CLV drivers on the three segments of high, medium, and low CLV was measured in the three following steps.
First, the key drivers of CLV (as outlined in Table 8.10) were selected to assess their impacts on CLV for the high CLV segment of customers. Second, the logistic regression method was used to determine which of these variables in the dataset (variables related to demographic, life style, and shopping behavior (DLSB)) varied significantly across the high and low CLV segments. In this case study, the medium CLV segment was excluded from the analysis to highlight the main difference between a high CLV and a low CLV customer. However, a similar procedure can be extended for medium CLV customers also if need be.
The logistic regression method employed here is
(8.15)
where:
Probi = the probability of customer i to be in high or low CLV segment
Yi = the outcome of regressing Xik on high/low CLV segment indicator
Xik = the ‘K’ DLSB (predictor) variables measured for customer i
β0, βk = the coefficients estimated from the data
This logistic regression method offered a simple procedure to estimate the coefficients by employing maximum likelihood.
Third, this model was run separately for the high and low CLV segments. In the model for the high CLV segment, the dependent variable was set as 1 if the customer belonged to the high CLV segment, or else 0. Similarly, in the model for the low CLV segment, the dependent variable was set as 1 if the customer belonged to the low CLV segment, or else 0. The findings from this analysis were then used to draw corresponding implications for customer profile analysis.
Finally, to assist the retailer's managers and develop strategies to increase the CLV of each customer, the impact of changing the values of some of the drivers of CLV in Table 8.10 was measured. High CLV customer segment and certain customer behavior-related variables were chosen for this measurement. Specifically, the magnitude of each driver was increased by 15% and the corresponding increase in CLV in each customer was evaluated. In practice, managers can make a similar impact on the value of each driver through appropriate marketing interventions, that is, promoting cross-buying behavior by offering customized incentives to customers to purchase across different product categories. The results presented in Figure 8.9 show that a 15% change in these key drivers significantly impacts the CLV of customers. These findings provide useful implications for customer and store management strategies.
Based on the findings from the impact analysis, several group-level differences were identified. For instance, most high CLV customers were professionally employed and married females in the age range of 30–49 years old with at least one child, having high household income and the store's loyalty card. They also shopped in many channels and lived relatively close to the store. Low CLV customers, on the other hand, were primarily single men, in the age range of between 24 and 44 years old, having relatively low income and not necessarily the store's loyalty card. They shopped in one channel and lived further away from the retailer.
One advantage of this profile analysis is that it could help the retailer put a ‘face’ to the CLV score of the customer, which would give useful insights in acquiring new customers. This method also overcomes the shortcomings of most CLV studies, which do not include demographics and product usage variables [8].
The desired output of this phase was the CLV of a store. Customers of the retailer shop at several of the 30 store chains nationwide. Thus, it was possible to calculate the CLV of a store based on the CLVs of customers who have ever purchased from that store. This phase was divided into two smaller steps:
The CLV score of customers by store was measured as the weighted average CLV of all customers shopping at that store. For example, suppose that customer 1 made 80% of his/her purchases from store A, 10% from store B, and 10% from store C. This information was used to assign customer value weights to each store. That is, store A can be assigned 80% of CLV from customer 1, 20% from customer 2, 0% from customer 3 (who did not shop at store A at all), and so on. The same procedure was then applied for all the stores.
The CLV of a store was measured as the weighted sum of the net present value of the CLVs of all customers that shop at that store minus the present value of the rent for the store. It should be noted that this method was based on the current set of customers, and did not take into account new customers that might be acquired by the store in the future.
After all values of store CLVs were computed, all the stores were ranked according to both their past three-year profitability and future three-year profitability (store CLVs) (see Table 8.11). There was no need to normalize the stores' profit potential for the difference in size, location, and demographics, as they were more or less similar in terms of these factors. Finally, Spearman's correlation coefficients between the past and future profitability ranks were computed to draw corresponding implications for store management tactics and strategies.
In this final stage, the result findings were used to suggest suitable tactics and strategy recommendations. The stage includes two phases:
The following managerial implications were suggested for the management of the retailer:
The retailer was advised to incorporate CLV metrics to store performance in order to increase its overall profitability. Two main management strategies were recommended: