Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

4 Firms

In the third stage of the life cycle, most individuals work within firms or some other hierarchical organization. In this section we describe how field experiments have contributed to knowledge on how workers and employees behave in such settings. Following the research areas described in Tables 2 and 3, we frame our discussion on the following research themes: (i) the effects on monetary incentives on worker behavior; (ii) the interplay between monetary and non-monetary incentives; (iii) aspects of the employment relationship, such as gift-exchange between employers and employees, and the link between employer monitoring and employee shirking behavior.³⁹^,⁴⁰

4.1 Monetary incentives

A core principle in economics is that incentives matter. The role of monetary incentives within firms and organizations has been long studied in sociology and management literatures. With the application of contract theory to behavior within firms (Hart and Holmstrom, 1987) and the development of personnel economics (Lazear, 1995), such questions are now integrated within mainstream labor economics. For economists, the basic questions have been: (i) how do workers respond to a given set of incentives?; (ii) what are the optimal set of incentives an employer should provide?⁴¹

An earlier generation of empirical studies exploited firm’s personnel data to measure the productivity effects of compensation schemes on individual workers. An econometric challenge facing these studies is that observed incentive contracts might well be endogenous to firm’s performance (Milgrom and Roberts, 1990; Ichniowski et al., 1997). With such multiple underlying changes, mapping the evidence, however cleanly identified is the change in behavior, to any underlying theory is less clear cut.⁴²

Field experiments introduce exogenously timed variation in incentive structures that are orthogonal to other management practices. This opens up the possibility to identify the causal impact of monetary incentives on the behavior of individual workers, and on firm performance as a whole. Combining personnel files from human resource departments within the firm, with primary data collection that is inherent in field experimentation, allows researchers to examine the effect of monetary incentives on a range of margins of worker behavior, capturing both the intended and unintended consequences of incentive provision.

There are good theoretical reasons for collecting such extensive information on worker behaviors when evaluating the response to incentives. For example, multi-tasking theory suggests that when monetary incentives are provided based on a subset of tasks that the firm can directly measure performance in, workers may reallocate their effort away from other tasks they are engaged in, their employer is affected by, but their compensation is not based on (Holmstrom and Milgrom, 1991). Similarly, if the provision of incentives alters the distribution of pay across workers in the same tier of the firm hierarchy, this might alter worker’s behavior towards co-workers, say through cooperation or sabotage (Lazear, 1989). Finally, there might be ways in which workers can game against any incentive scheme. All such unintended consequences of monetary incentives need to be accounted for to both accurately understand how workers respond to incentives and to begin to think through the optimal incentive design.⁴³

Employers might not collect such information ex ante. Hence the need to engage in primary data collection efforts to complement the rich information available in firm’s personnel files. Field experiments—that involve close cooperation between researchers and firm management—are well placed to advance in this direction. Ultimately, as witnessed in some of the field experiments described below, this allows a closer mapping between the evidence and underlying theory, and to draw implications for optimal incentive provision.

4.1.1 Theoretical framework

To understand some of the theoretical questions and empirical challenges faced in this literature, it is instructive to first reconsider Lazear’s (2000) original analysis of the Safelite Glass Corporation, a large auto-glass firm in which the primary task of worker’s at the bottom-tier of the firm’s hierarchy is to install automobile windshields. Lazear used non-experimental methods to estimate the productivity effects of the firm moving from a compensation scheme in which workers were paid an hourly wage scheme, to one in which they were paid a piece rate for each windshield installed, with a minimum guarantee. This pioneering work brings to the fore many of the issues that have influenced all the subsequent literature, and allows us to highlight the specific issues that field experiments help address.

The model is as follows. Worker’s utility depends on income and effort , with .⁴⁴ Worker’s output depends on effort and her ability, , so with output assumed to be observable and . For any given output , there is a unique effort level that achieves this, denoted . It is then straightforward to see that so that higher ability workers need exert less effort to achieve a given output. If workers choose not to work at any firm, their outside option from leisure is denoted . Hence the lowest ability worker that would accept employment at a firm with a required output level and wage , is denoted and is such that,

(27)

All workers of higher ability earn rents from employment over leisure. Similarly, suppose a worker of a given ability could take up employment at another firm offering a wage-minimum effort pair (). Hence with inter-firm competition there might exist an upper cutoff in ability, , such that,

(28)

where workers of ability higher than prefer to take the alternative employment contract.

This framework makes clear that incentive structures will affect two types of behavior, an idea developed in more detail in Lazear (2005). First, there will be change in effort exerted by individual workers in response to monetary incentives. This is referred to as the “incentive effect”. Second, the compensation scheme will induce a differential composition of workers within the firm over time. Some workers will prefer to join this firm from other employers. These changes in workforce composition can be thought of as the “selection effect” of monetary incentives.

The incentive effect can be easily understood graphically. Figure 9 shows the relationship between output, , and compensation for the two schemes relevant for Lazear’s study: (i) a fixed hourly wage subject to a minimum output requirement , resulting in total compensation ; (ii) a linear piece rate scheme with a minimum guarantee of . As Fig. 9 shows, for output levels between and the worker receives under both compensation schemes, and for output higher than earns strictly more under the piece rate scheme.⁴⁵

Figure 9 Compensation before and after at Safelite.

On the incentive effect, the model makes clear that moving from the fixed hourly wage scheme to the piece rate scheme does not cause the output of any individual to fall, and causes average output to rise. Low ability workers, indicated with a solid indifference curve in Fig. 9, remain indifferent between the two schemes and would produce output at point A under both. Higher ability workers, indicated with a dashed indifference curve in Fig. 9, would prefer to increase their effort and move to point B. This is because the piece rate scheme allows higher ability workers to raise their utility through increased compensation that more than offsets any increase in their effort. As a result, the dispersion in worker effort and output rises as long as there is at least one worker that chooses to produce more than .⁴⁶

On the selection effect, under plausible conditions, the average ability of workers rises with the move to piece rates. This is because low ability individuals remain indifferent between working for this firm under either incentive scheme. If they were willing to work for the firm under the fixed wage scheme, they should remain willing to do so under piece rates all else equal. On the other hand, high ability workers might be attracted to this form from other firms that for example, have higher minimum output standards or pay piece rates but at a lower rate . In short, theory predicts that there should be no change in the number of low ability workers who are willing to work at the firm, but that piece rates attract high ability workers so the right tail of the ability distribution in the firm should thicken.

As described in more detail below, existing field experiments have focused on identifying the incentive effects, and the research designs used have been less amenable to pin down these types of selection effect. Yet it is important to emphasize the need for future research to provide credible research designs to uncover both effects.⁴⁷

In Lazear’s study, he documents the total effect of the change in monetary incentives to workers was around a 44% increase in worker productivity, defined to be the number of windshields installed by the worker per eight hour day. Around half the increase was due to incentive effects, namely a change in effort of the same worker as he moved from a fixed hourly wage to a piece rate scheme. However, the other half was entirely due to the selection effect, namely productivity changes due to endogenous changes in the composition of workers in response to change in monetary incentives. A legacy of Lazear’s study is to show that both motives likely underlie why firms choose to alter their output based incentive structures in the first place. A carefully crafted field experiment that begins to measure whether and how the compensation policies of a given firm have such spillover effects on other firms that compete for similar workers, would open up a rich research agenda tieing together the study of within-firm compensation policies on equilibrium wage-setting behavior in labor markets.⁴⁸

Linking to the design of field experiments

The model provides a series of implications that have impinged on the first generation of field experiments over the last decade. First, given worker heterogeneity, changes in compensation scheme will nearly always affect average effort and output, as well as the dispersion of effort and output. Given the linkage between performance and pay, this inevitably results in changes in the distribution of earnings across workers at the same tier of the firm hierarchy. Hence linking pay to performance might have unintended negative consequences on worker and firm performance as a result of such increased earnings inequality. This might manifest itself in the form of workers reducing cooperation with co-workers (Fehr and Schmidt, 1999; Charness and Rabin, 2002). Field experiments are particularly adept at detecting and quantifying such unintended consequences because researchers are engaged in primary data collection, and the firm would have no incentive to collect such information ex ante as part of its personnel files, especially if pay for performance type compensation schemes have not been previously implemented. Some of the field experiments described below have collected qualitative evidence from workers to explore these channels, in additional to using personnel files to measure the direct productivity effects of incentives.

Second, given the selection effect of monetary incentives, there is inevitably a change in workers’ peer group over time. The composition of peers, or their social ties with each other, play no role in the standard neoclassical model in which worker preferences are only defined over their own income and effort. There are good reasons to probe this assumption in the field. First, if peer effects determine workplace behavior because they alter the marginal return to worker’s effort, then understanding how workers respond to changes in monetary incentives requires an understanding of the mechanisms underlying such peer effects. Second, extending the neoclassical model to take into account such peer effects or social concerns, as has been done in Kandel and Lazear (1992), Lazear (1989), Rotemberg (1994) and Fershtman et al. (2003), has implications for many aspects of firm behavior including the optimal design of incentives. Such concerns are a recurring theme in the series of natural field experiments conducted by Bandiera et al. that are described below.

Third, the model highlights that under fixed hourly wage schemes, there should not be much heterogeneity in workers output or effort, despite workers being heterogeneous in ability. This does not fit the evidence very well. For example, in Lazear’s study there was considerable dispersion in worker productivity even under the fixed hourly wage scheme. To better explain behavior in fixed wage settings, theory suggests workers might respond to other forms of non-monetary or implicit incentives, such as gift-exchange motives where workers exert more effort in response to employers paying higher than market clearing wages, the ability to shirk (Shapiro and Stiglitz, 1984; Macleod and Malcomson, 1989), or promotion prospects and career concerns (Dewatripont et al., 1999). Such mechanisms might also predict how workers sort across firms (Stiglitz, 1975). Some of these aspects are highlighted by the field experiments discussed below on the employment relationship.

Fourth, the model implies that low ability workers should not leave the firm with the move to piece rates. If they were willing to work for the firm under the fixed wage scheme, they should remain willing to do so under piece rates all else equal. On the other hand, given output differences among workers under price rates, management can more easily identify low ability workers when pay is tied to their performance. Over a longer time period, this might lead to them being fired. This type of selection effect caused by employer learning the true ability of workers, has not been studied by field experiments.⁴⁹

Finally, given worker heterogeneity, the first best for the firm would be to set a worker specific piece rate, . This would be chosen to equate the marginal benefit of effort to its marginal cost. However, we generally observe firms being constrained to offer bottom-tier workers the same compensation scheme. This may be because of legal, technological or informational constraints (Lazear, 1989; Bewley, 1999; Encinosa et al., 1997; Fehr et al., 2004). To overcome this, one hypothesis is that firms can get closer to this first best by linking managers’ pay to the firm’s performance. Managers then have greater incentives to target their effort to specific workers, and from the worker’s point of view it is then as if they face an individual specific incentive scheme. This idea is developed in the natural field experiment by Bandiera et al. (2007) described below, which then links managerial incentives to pay inequality among workers.

4.1.2 Evidence from the field

In the decade following the wave of studies using personnel data and insider econometrics to understand responses to monetary incentives (Ichniowski and Shaw, forthcoming), field experiments have begun to exploiting exogenous and randomly timed variation in compensation pay to both reinforce these existing results using non-experimental methods, as well as providing new insights.

Among the first of these studies was Shearer’s (2004) natural field experiment, designed to estimate the productivity gains moving from a piece rate to a fixed wage scheme for tree planters in British Columbia, Canada. In contrast to Lazear (2000), this setting is one in which tree planters are usually paid a piece rate with no guaranteed minimum, and fixed wages are rarely used ex ante. Workers were randomly assigned at the start of a work day, to plant under one of the incentive schemes. Hence a within worker comparison can be exploited to estimate the incentive effects. To reduce the likelihood of this comparison confounding other effects unrelated to the compensation schemes in place, work took place on fields of similar conditions, and there was a constant length of the work day. The incentive effect estimate is then based on a total of 120 observations on daily worker productivity, 60 under each scheme. The relatively small sample size—nine male workers were randomly selected from the firm—reflects the difficulty researchers initially faced in real world settings in convincing firms to randomly assign individuals to alternative compensation schemes. As more field experiments are conducted, some of these constraints are being eased. For example, some of the field experiments described below are based on data on hundreds of workers.⁵⁰

Shearer’s field experiment reveals the incentive effect of having a piece rate rather than a fixed wage compensation schemes to be a 20% productivity increase. The magnitude of this is comparable to Lazear’s (2000) findings—moving from a fixed wage scheme—based on non-experimental data, although this is wholly by chance There is no reason a priori to expect the behavioral response of workers to these two incentive schemes to be of the same magnitude given the very different types of worker involved, nature of the production function, and that the piece rate was not the same across settings. In line with theory, Shearer finds the standard deviation of output across workers was higher under piece rates. Overall, it was found that unit costs under piece rates were around 13% lower than under fixed wages.

To shed more light on how workers would have responded in a slightly different environment and to alternative compensation schemes, Shearer then develops and estimates a structural model. In terms of altering the economic environment, the structural model is used to shed light on what would have been the productivity gains if management was imperfectly informed about planting conditions. This yields comparable estimates of the productivity gains of moving to piece rates. To shed light on alternative compensation schemes, Shearer explores how workers would have responded to an efficiency wage scheme. The efficiency wage that would induce effort levels equal to those observed under fixed wages in the field experiment, is calculated and its implied unit costs are compared with those achieved under piece rates. This exercise suggests that fixed wages would lead to a 2.7% increase in unit costs relative to piece rates.

Shearer (2004) uses a close to best practice methodology in combining estimates from a field experiment with structural modelling within the same setting. This combination, following from discussions in Heckman and Smith (1995) and Keane and Wolpin (1997), first identifies the existence and magnitude of important causal effects using reduced form evidence. To then move away from such black-box findings, the researcher then uses structural modelling to posit an underlying behavioral mechanism behind the effects, assess the sensitivity of the estimates to slight alterations in the economic environment, and to make headway in understanding the optimal compensation structure. Of course, the validity of the structural model can itself be tested by exploring whether it predicts the responses observed to the exogenous variation engineered by the field experiment.

Another example of how field experiments can and should learn from other methodologies is at the heart of the natural field experiment of Hossain and List (2009). They use theoretical insights on framing effects from behavioral economics that have previously found empirical support in laboratory experiments (Kahneman and Tversky, 1979; Thaler, 1980; Samuelson and Zeckhauser, 1988; Ellingsen and Johannesson, 2008), to see if, in the field, framing manipulations affect worker responses to bonus incentives. Their setting is a high tech Chinese firm producing consumer electronics, where workers are organized into both individual and team production. They find that bonuses framed as both “losses” and “gains” increase productivity, for both individuals and teams. Teams respond more to bonuses posed as losses than as comparable bonuses posed as gains. The comparable effects for individuals are of the same sign but are not statistically significantly different to each other. Team productivity is enhanced by 1% purely due to the framing manipulation. Neither the framing nor the incentive effect lose their importance over the six month study period. Nor are there any detrimental effects on the quality of work as measured by product defect rates.

On a practical note, the results highlight that conditional on bonuses being provided, framing matters, and as framing can be adjusted almost costlessly, there are simply ways in which firms can further enhance productivity responses to monetary incentives. Theoretically, these results from the field provide an example of the prevalence of loss aversion in a natural labor market setting. As such the results provide external validity to laboratory evidence, and should be seen to provide a strong argument in favor of field and laboratory experiments being complements, not substitutes.

4.2 Non-monetary incentives

Organizations use a variety of non-pecuniary based incentives to motivate their employees. We discuss three forms of non-monetary incentive: status goods, feedback, and social incentives.⁵¹

Under status incentive schemes, employees are given some positional good, such as an “employee of the month” job title. The notion that individuals crave status has been long studied (Veblen, 1934; Friedman and Savage, 1948; Duesenberry, 1949; Frank, 1985) and more recently formalized in the context of organizations providing status incentives in Moldovanu et al. (2007) and Besley and Ghatak (2008). They emphasize that for status incentives to be effective, the positional good must be valued by employees, it must be scarce, and its allocation rule rewards the deserving.

Recent evidence on these effects have been found in laboratory settings (Ball et al., 2001; Brown-Kruse et al., 2007) but few field experiments in which researchers have worked closely with a firm to exogenously vary such status rewards. One exception is Greenberg (1988), who reports results from a field experiment based on 198 employees in the underwriting department of a large insurance company. These employees were randomly assigned on a temporary basis to the offices of either higher, lower, or equal-status co-workers while their own offices were being refurbished. Relative to those workers reassigned to equal-status offices, those reassigned to higher status offices raised their performance, and those reassigned to lower status offices lowered their performance. The size of these performance changes were directly related to the magnitude of the status changes encountered. The results are interpreted as providing real world evidence on equity theory for non-monetary rewards.

In the future, we envisage field experiments being designed that randomly vary the first two margins on effective status incentives described above: how valued the positional good is, and its scarcity. In contrast, a field experiment that randomly allocated such positional goods might not be as informative, unless it was clearly related to some reallocation that would have occurred in any case, such as in Greenberg’s clever study described above. Otherwise, such random allocations would not be representative of the kinds of allocation rule employers actually use, and so cloud the interpretation of any such results.⁵²

A second class of non-monetary incentive in organizations relates to the provision of feedback. While there is a long tradition in psychology on feedback effects (Thorndike, 1913), economists have only recently begun to investigate its causes and consequences. Much of this research has focused on the theory of optimal feedback provision as mid-term reviews (Lizzeri et al., 2002; Ederer, 2008). Theory indicates that feedback on past performance can affect current performance either directly if past and current performances are substitutes or complements in the agent’s utility function, labelled a preference effect, or indirectly by revealing information on the marginal return to current effort, labelled a signaling effect.⁵³ The direct preference effect is relevant if, for instance, agents are compensated according to a performance target or fixed bonus scheme, so that being informed of high levels of past performance induces the agent to reduce her current effort relative to her past effort, and still meet her overall performance target. The indirect signaling effect is relevant if, for instance, the agent’s marginal return to effort depends on her ability and this would be unknown if feedback were not provided.⁵⁴

Both these mechanisms imply the effect of feedback is heterogeneous across individuals and it might increase or reduce current effort, so the socially optimal provision of feedback remains ambiguous. Research in psychology also suggests feedback effects are heterogeneous and may crowd in or crowd out intrinsic motivation (Meyer et al., 1965; Beer, 1990; Gibbs, 1991).⁵⁵

While there is a growing empirical literature on the effect of feedback in laboratory settings (Bandiera et al. (2009a,b) provide one such analysis, in which the focus is on the provision of feedback to teams. This study is described in detail below.⁵⁶

The third class of non-monetary incentives relate to changes in behavior induced because of the presence and identity of co-workers—namely social relations in the workplace. The idea that there exists an interplay between social relations and monetary incentives in the workplace goes back to the old Hawthorne studies mentioned earlier and have been long considered in the organizational and business sociology literatures (Mas and Moretti, 2009; Bandiera et al., 2010, forthcoming).

In a series of natural field experiments, Bandiera et al. provide evidence on the effect of incentives on individual and firm performance within the same firm. These field experiments engineer exogenously timed variation in the incentive structures faced by workers in the firm. The common thread running through these studies is to provide evidence on the interplay between monetary and non-monetary incentives in the workplace. The specific form of non-monetary incentives considered are those arising from social relations in the workplace, so that workers behavior, and response to monetary incentives, might differ depending on the nature of the social ties they have with co-workers, their superiors, and their subordinates. Given that this form of non-monetary incentive is what field experiments have predominantly focused on, we first develop a framework that makes precise how such incentives can be incorporated into an otherwise standard model, and then map this framework to the empirical evidence from the field.

4.2.1 Theoretical framework

Suppose worker ’s payoff depends on three components. First, she derives some benefit from exerting effort towards a productive task. This benefit, , reflects in part how her effort maps into income through the monetary compensation scheme. To cover a wide range of compensation schemes including absolute performance evaluation incentive schemes such as piece rates, relative performance evaluation schemes such as rank order tournaments, or team incentives, these benefits will in general also depend on co-workers’ effort, . Second, the worker faces a convex cost of effort, , where workers are of heterogeneous ability, . Finally, we assume worker places some weight on the utility of co-worker , . In turn, such social preferences might depend on the existence or strength of the social tie between individuals and . This third component of the worker’s payoff function generates social incentives.⁵⁷ Workers simultaneously choose their efforts to maximize their total payoff,

(29)

The first order condition is⁵⁸,

(30)

The monetary compensation scheme determines the marginal benefit of effort, . As the worker has social incentives, she takes account of the fact that on the margin, her effort also affects the benefits that accrue to others, . As mentioned above, the precise sign of this social interaction depends on the nature of peer effects between workers that are socially connected, and the monetary compensation scheme in place.

The theoretical predictions of such models generate a wide range of behavioral responses. For example, working alongside friends might make work more enjoyable, generate contagious enthusiasm among friends, provide positive role models, or generate incentives to compete to be the best in the network of friends. All such mechanisms, that effectively increase the net benefits of effort, imply workers exert more effort in the presence of their friends relative to themselves when they work in the absence of their friends. Alternatively, working with friends might create contagious malaise, or lead to low effort norms within friends or co-workers more generally. All such mechanisms, that effectively decrease the net benefits of effort, imply workers exert less effort in the presence of their friends. Finally, the presence of friends might have heterogeneous effects across workers in that some exert more effort in the presence of their friends relative to when they work solely with non-friends, and others exert less effort. For example, friends or co-workers may conform to a common norm (Bernheim, 1994), or workers might be averse to pay inequality within their network (Fehr and Schmidt, 1999; Charness and Rabin, 2002). In either case, relative to when they work only with non-friends— (i) low ability workers exert more effort in the presence of their friends, and; (ii) high ability workers exert less effort in the presence of their friends. These aspects are highlighted by the field experiments discussed below on non-monetary incentives.

The field experiments in Bandiera et al. are designed to engineer exogenous variation in the incentives faced by workers to identify , corresponding to a similar reduced form parameter as in Lazear (2000) and Shearer (2004). They then combine this variation with primary data collected on social networks and plausibly exogenous variation in the assignment of friends as co-workers over time, to identify social incentives as embodied in . In these experiments, the authors examine the effects of social incentives both within and across tiers of the firm hierarchy. Namely in some studies and are co-workers engaged in the same tasks, and in other studies the pair correspond to a manager and her subordinate. Moreover, they study cases in which: (i) individual effort hurts co-workers , as in the case of relative incentive schemes; (ii) where it benefits them , as in the case of team incentives; and (iii) where it has no effect , as in the case of a piece rate scheme. We now summarize the main insights from these natural field experiments.

4.2.2 Evidence from the field

Social incentives among bottom tier workers

The firm studied in Bandiera et al. is a leading UK producer of soft fruit. Managerial staff belongs to three classes. The first class consists of a single general manager whom we refer to as the Chief Operating Officer (COO), the second comprises ten field managers, and the bottom-tier of the firm hierarchy consists of workers whose main task is to pick fruit. Field managers are responsible for field logistics, most importantly to assign workers to rows of fruit within the field and to monitor workers. Managerial effort can therefore be targeted to individual workers and is complementary to worker’s effort. The main task of the COO is to decide which workers are selected to pick fruit each day, and which are assigned to non-picking tasks. The field experiments described below together provide insights on behavior at each tier of the firm’s hierarchy.

In each natural field experiment, the researchers worked closely with the CEO of the firm to engineer exogenously timed changes in monetary incentives to workers or managers. The same workers and managers are observed under both incentive schemes and therefore it is possible to control for time invariant sources of heterogeneity across workers, such as their ability, and across managers, such as their management style.⁵⁹ The most important remaining empirical concern is that the estimates of such changes might still reflect naturally occurring time trends in productivity. This is addressed using a battery of tests in each paper. In addition, the time span of study allows the authors to check in each case whether the behavioral response to incentives is long-lasting, or whether they reflect Hawthorne effects, as discussed earlier, whereby individuals respond in the short run to any change in their workplace environment. Being able to use field experiments to estimate short and long run responses to changes in management practice is a theme we will return to below when we present field experimental evidence on gift-exchange in firms, and contrast the evidence from the field and the laboratory.⁶⁰

In each natural field experiment, the authors collected primary data on the social networks of each individual worker. With such a precise mapping of the structure of friendship networks in the firm, personnel data providing workers productivity over time, and the field experiment on monetary incentives, the authors are able to shed light on the interplay between monetary and social incentives in this setting.

Finally, they have daily information on the pool of workers available to pick fruit. This allows them to precisely identify the effect of monetary incentives on the selection of workers from this pool. The entire pool of workers is observed in this context because individuals are hired seasonally from Eastern Europe, and they live on the farm for the duration of their stay. This margin of selection—driven by the COO’s demand for workers—from the firm’s internal labor market proves to be an important margin of response to some changes in incentives, particularly in relation to changes in managerial incentives. Still, these field experiments, like Shearer (2004), are silent on the selection effect highlighted by Lazear (2000) in relation to workers choice of which firm to supply their labor to.

Another obvious similarity between Shearer (2004) and Bandiera et al. is that they study agricultural environments in which worker productivity is easy to measure, comparable across workers at the same moment in time, and comparable within a worker over time. The fact that worker productivity is measured electronically with little measurement error, also makes analysis of the impact of the field experiment on the distribution of productivity, again as highlighted by Lazear (2000), particularly amenable to quantile regression methods for example. However, it remains true that settings in which worker’s output is hard to measure, verify or compare, which might represent the bulk of tasks in the modern service based economy, remain relatively unexplored in field experiments.

In Bandiera et al. (2005) the natural field experiment exogenously changes the monetary incentives to the bottom-tier workers whose primary task is to pick fruit. The study compares the behavior of these workers under a relative incentive scheme to a piece rate scheme. The comparison is revealing because under relative incentives individual effort imposes a negative externality on co-workers’ pay whereas under piece rates individual effort has no effect on others’ pay. The difference in workers’ performance under the two schemes, if any, then provides evidence on whether and to what extent workers internalize the externality they impose on their colleagues. To see this, the framework above is tailored to this specific field experiment as follows.

Consider a group of workers, each worker exerts effort which determines her productivity. The cost of effort is assumed to be . Under relative incentives the benefit from pay depends on the worker’s productivity relative to all her co-workers, , where . The relative scheme has the key characteristics that an increase in worker ’s effort—(i) increases her pay; (ii) increases average effort and hence imposes a negative externality by reducing the pay of co-workers. The effort choice under relative incentives then depends on whether workers have social incentives and therefore internalize this externality. Assuming worker places the same social weight on all co-workers, so , the equilibrium effort for worker solves,

(31)

Assuming worker chooses her effort taking the effort of others as given, the Nash equilibrium effort for worker solves,

(32)

Under piece rates, individual effort is paid at a fixed rate per unit and worker chooses her effort as follows,

(33)

The equilibrium effort level solves the first order condition,

(34)

As worker ’s effort does not affect her co-workers’ pay, her optimal choice of effort is independent of . To compare effort choices under the two schemes, evaluate (34) at so that for a given , the pay per unit of effort is the same under both incentive schemes. The first order condition under piece rates then is,

(35)

so the difference between the first order conditions (32) and (35) can be ascribed to two sources. The first is the externality worker imposes on others under relative incentives, the magnitude of which depends on . When worker ’s productivity is lower under relative incentives compared to piece rates. Second, by exerting more effort, each worker lowers the pay she receives for each unit of effort under relative incentives. This effect, captured by the term, also reduces productivity under relative incentives but is negligible in large groups.

The main results from Bandiera et al. (2005) are then as follows. First, the reduced form estimates suggest that the exogenously timed switch from relative incentives to piece rates had a significant and permanent impact on worker productivity. For the average worker, productivity increased by at least 50% moving from relative incentives to piece rates. As in the earlier literature, both the mean and dispersion of productivity significantly increase with the move to piece rates. The productivity gains achieved under piece rates are not found to be at the expense of a lower quality of picking.

The authors then assess whether this productivity change is consistent with the standard assumption that workers ignore the externality they impose on others under the relative scheme (), or whether they fully internalize it (). To do this they use the structural model above, imposing a functional form assumption on and a production function linking effort to observed output, to calibrate the first order conditions of the workers’ maximization problem to compute an estimate of each worker’s cost parameter, , under each incentive scheme and behavioral assumption. Since worker’s ability is innate, they ought to find the same implied distributions of costs across workers under both incentive schemes if the underlying behavioral assumption is correct.

Calibration of the first order conditions for worker’s efforts reveals that the observed change in productivity is too large to be consistent with the assumption that workers ignore the negative externality they impose on others. At the same time, the observed change in productivity is also too small to be consistent with the assumption that workers maximize the welfare of the group and fully internalize the negative externality. The authors then uncover the distribution of social weights across workers that would explain the productivity increases. To do so they assume the true cost of effort of each worker is that derived under piece rates, and then substitute into the first order condition (32). They find the data is consistent with the average worker placing a weight of on the benefits accruing to all other co-workers, assuming they place a weight of one on their own benefits.

Further analysis combines the experimental variation induced by the change in incentive scheme, with non-experimental variation of the assignment of workers to work alongside their friends on some days but not on other days. The field experiment method allows the collection of primary data on social networks of each worker on the farm. This reveals that under relative incentives workers internalize the externality more when the share of their personal friends in the group is larger and this effect is stronger in smaller groups. In line with the interpretation that social preferences explain the difference in productivity across the two schemes, the relationship among workers does not affect productivity under piece rates. Finally, they find that productivity under relative incentives was significantly lower only when workers were able to monitor each other. Given that monitoring is necessary to enforce collusion while it does not affect altruism, they take this finding to support the hypothesis that workers are able to sustain implicit collusive agreements when relative incentives are in place. Hence, building on a large body of evidence from laboratory settings, this evidence from the field suggests workers behave as if they have social preferences but do not, in structural form, have social preferences that make them unconditionally altruistic towards others.

The results beg the question of why, given the large gains to productivity and profits, of the move to piece rates, were relative incentives ever employed in the first place. The farm management suggested the relative scheme was mainly adopted to difference out common shocks that are a key determinant of workers productivity in this setting. While this is in line with the predictions of incentive theory, the superiority of relative incentives relies on the assumption that workers ignore the externality their effort imposes on others.⁶¹ This assumption on worker behavior is not supported by this field experiment. Relative incentives led to lower productivity because workers internalized the negative externality to some extent. The results of this natural field experiment then speak directly to Lazear’s (1989) observation on how rarely workers are compensated according to rank-order tournaments, and point to new and interesting directions for theory to develop on the optimal provision of incentives under more robust assumptions on worker preferences.

Social incentives among managers

While the evidence from field experiments discussed thus far has focused on the monetary incentives provided to bottom-tier workers, Rosen’s (1982) magnification principle implies the incentives provided higher up in the firm hierarchy can have larger effects on firms’ performance. Bandiera et al. (2007) present evidence from a field experiment in the same setting as previously described to explore this issue.

They examine the effects of providing bonuses to managers based on the average productivity of their subordinates. They extend the framework above to highlight that, as in most firms, in their context managers can affect worker productivity through two channels—(i) they can take actions that affect the productivity of existing workers, and, (ii) they can affect the identity of the workers selected into employment. A simple theoretical framework indicates that, when workers are of heterogeneous ability and managers’ and workers’ effort are complements, the introduction of managerial performance pay makes managers target their effort towards the most able workers. This is labeled a “targeting effect” of managerial incentives. In addition, the introduction of managerial performance pay makes managers select the most able workers into employment. This is labeled as a “selection effect” of managerial incentives.

As in Lazear’s framework, such targeting and selection effects influence both the mean and the dispersion of workers’ productivity. Mean productivity unambiguously rises as managers target the most able workers and fire the least able. The effect on the dispersion is however ambiguous. On the one hand, targeting the most able workers exacerbates the natural differences in ability and leads to an increase in dispersion. On the other hand, if only more able and hence more similar workers are selected into employment in the first place, the dispersion of productivity may fall, depending on the underlying distribution of ability across workers.

They key findings from Bandiera et al. (2007) are as follows. First, the introduction of managerial performance pay increases both the average productivity and the dispersion of productivity among lower-tier workers. The average productivity increases by 21 percent and the coefficient of variation increases by 38 percent.

Second, the increase in the mean and dispersion of productivity is due to both targeting and selection effects. The analysis of individual productivity data reveals that the most able workers experience a significant increase in productivity while the productivity of other workers is not affected or even decreases. This suggests that the targeting effect is at play—after the introduction of performance pay, managers target their effort towards more able workers. The individual data also provides evidence of a selection effect. More able workers, namely those who had the highest productivity when managers were paid fixed wages, are more likely to be selected into the workforce when managers are paid performance bonuses. Least able workers are employed less often and workers at the bottom of the productivity distribution are fired.⁶²

Third, the selection and targeting effect reinforce each other, as workers who experience the highest increase in productivity are also more likely to be selected into employment. The introduction of managerial performance pay thus exacerbates earnings inequality due to underlying differences in ability both because the most able workers experience a larger increase in productivity and because they are selected into employment more often.

Finally, they evaluate the relative importance of the targeting and selection effects through a series of thought experiments. They find that at least half of the 21 percent increase in average productivity is driven by the selection of more productive workers. In contrast, the change in dispersion is nearly entirely due to managers targeting the most able workers after the introduction of performance pay. Namely, the dispersion of productivity would have increased by almost the same amount had the selection of workers remained unchanged. The reason is that the distribution of ability across workers is such that even when the least able workers are fired, the marginal worker selected to pick is still of relatively low ability. Hence there remains considerable heterogeneity in productivity among selected workers.

These findings shed some light on why firms provide performance related pay to managers in the first place. While such incentive schemes are obviously designed to increase unobservable managerial effort, these results suggest another more subtle reason for their use. This stems from the general observation that firms are typically constrained to offer bottom-tier workers the same compensation scheme. This may be because of legal, technological or informational constraints (Lazear, 1989; Bewley, 1999; Encinosa et al., 1997; Fehr et al., 2004). To the extent that bottom-tier workers are of heterogeneous ability, however, offering the same compensation scheme to all of them will be sub-optimal. When managers’ pay is linked to firm’s performance, their interests become more aligned with those of the firm and they have greater incentives to target their effort to specific workers in order to offset the inefficiency that arises because of the common compensation scheme. From the worker’s point of view it is then as if they face an individual specific incentive scheme. This opens a broad research agenda to examine whether firms are indeed more likely to offer managers performance pay in settings where lower tier workers are of heterogeneous ability, managers are able to target their effort towards specific workers, and workers are offered the same compensation scheme.

The findings from this field experiment also highlight the interplay between the provision of managerial incentives and the earnings inequality among lower-tier workers. Such a linkage exists whenever managers can target their efforts towards some workers and away from others, and managers choose which individuals are selected into the workforce. Hence that there might be an important interplay between managerial incentives and earnings inequality among workers highlights a possible link between two important trends in labor markets over the past twenty years that have previously been unconnected in the economics literature—the rising use of managerial performance pay, and the rising earnings inequality among observationally similar workers.⁶³

In Bandiera et al. (2009a,b), the authors use the same introduction of managerial bonuses to understand whether managers favor workers they are socially connected to. In general, social connections between managers and workers can help or harm firm performance. On the one hand, social connections may be beneficial to firm performance if they allow managers to provide non-monetary incentives to workers, or help reduce informational asymmetries within the firm. On the other hand, managers may display favoritism towards workers they are socially connected with, to the detriment of other workers and overall firm performance.⁶⁴

In this experiment, as managerial compensation becomes more closely tied to firm performance, we would expect managers to utilize social connections to a greater extent if indeed, such connections are beneficial for firm performance. On the other hand, if social connection are bad for the firm, we might expect managers to reallocate their effort across workers in response to managerial incentives, towards high ability workers, and away from workers they are socially connected to. To be precise, if the managers’ behavior towards connected workers changes once their interests are more closely aligned with the firm’s, their previous behavior under fixed wages could have not been maximizing the firm’s average productivity.

To measure social connections the authors use a survey they designed to exploit three sources of similarity between managers and workers—whether they are of the same nationality, whether they live in close proximity to each other on the farm, and whether they arrived at a similar time on the farm. The underlying assumption is that individuals are more likely to befriend others if they are of the same nationality, if they are neighbors, or if they share early experiences in a new workplace.⁶⁵

The main findings are as follows. First, when managers are paid fixed wages, the productivity of a given worker is 9% higher when he is socially connected to his manager, relative to when he is not. As workers are paid piece rates, this translates into the same proportionate change in earnings. Second, when managers are paid performance bonuses that tie their pay to the average productivity of workers they manage, being socially connected to the manager has no effect on workers’ productivity.

Third, the introduction of managerial performance pay significantly decreases the productivity of low ability workers when they are connected to their manager relative to when they were connected to their manager and she was paid a fixed wage. The introduction of managerial performance pay increases the productivity of high ability workers, especially when they are not connected to their managers. These findings indicate that when managers face low powered incentives, they favor the workers they are socially connected to, regardless of the workers’ ability. In contrast, when they face high powered incentives, managers favor high ability workers regardless of the workers’ connection status.

Fourth, an increase in the level of social connections between managers and workers has a detrimental effect on the firms’ average productivity when managers are paid fixed wages and has no effect when managers are paid performance bonuses. In this setting, social connections are therefore detrimental for the firm because their existence distorts the allocation of managerial effort in favor of lower ability workers.

This natural field experiment paper contributes to the growing empirical evidence on the interplay between social networks and individual and firm performance. In particular, the design allows the authors to identify not only whether social connections matter within the firm, but also exploit the exogenous variation in incentives to understand whether they are to the benefit or detriment of the firm.

Feedback

In a final natural field experiment from this setting, Bandiera et al. (2010, forthcoming) present evidence to evaluate the effect of performance feedback and monetary prize tournaments, when the workforce is organized in teams. Hence in this set-up workers effort imposes a positive externality on their team members, . They compare the effects of these forms of non-monetary and monetary incentives relative to when teams are paid piece rates, and analyze their effect on two outcomes: how workers sort into teams and team productivity.

This field experiment provides important contributions to the literature along three margins. First, despite the pervasiveness of teams in the workplace, field evidence on team incentives is scarce.⁶⁶ The existing evidence from individual reward schemes provides limited guidance because the margins along which individuals and teams can respond to incentives differ. Specifically, in addition to changes in individual effort, changes in team incentives can lead to changes in team composition. To the extent that workers effort depends on the identity of their team members because of social incentives, changes in team composition can affect the productivity of the individual teams and of the firm as a whole. ⁶⁷

Second, tournaments are widely used to provide incentives across diverse organizations such as salespeople competing for bonuses, managers competing for promotions, and politicians competing for vote shares (Bull et al., 1987; Baker et al., 1988). While several studies have tested whether the response to variation in tournament structure is consistent with theoretical predictions, field evidence on the comparison of monetary prize tournaments against alternative monetary and non-monetary incentive mechanisms is scarce.⁶⁸

Third, whenever tournaments are in place, workers inevitably receive some information on their relative performance. This information might have direct effects on productivity if individuals have concerns for their relative position or status (Moldovanu et al., 2007; Besley and Ghatak, 2008), inequality aversion (Fehr and Schmidt, 1999; Charness and Rabin, 2002) or conformity (Bernheim, 1994). The field experiment allows the authors to de-couple the effect of feedback from the effect of monetary prize tournaments. As the provision of feedback is almost costless, measuring its contribution to the overall tournament effect can lead to considerable cost savings if most of the positive effect of tournaments on productivity is actually due to worker responses to feedback.⁶⁹

In the experiment, at the beginning of the season, teams were paid piece rates based on their aggregate productivity. Halfway through the season teams were additionally provided feedback by posting daily histograms of each team’s productivity. This feedback makes precise the absolute productivity of each team, and their ranking relative to all other teams. Halfway through the remaining part of the season a monetary prize for the most productive team each week was introduced, in addition to the provision of feedback, and conditional on teams being paid according to piece rates.

When workers first arrive at the farm they are assigned to a team by the general manager for their first week. Thereafter workers are free to choose their own team members at a team exchange that takes place every week. A team is formed only if all its members agree. Hence in this setting workers have two choice variables: how much effort to exert into picking, and team composition.

The field experiment is again closely tied to an underlying model. This makes precise two key forces that drive team formation: workers’ ability and social connections. As individual earnings are increasing in the ability of team members, workers have incentives to assortatively match by ability. On the other hand, workers might prefer to form teams with friends because this might limit free-riding within teams (Rosen (1986); Hamilton et al. (2003).⁷⁰ To the extent that workers are not socially connected to colleagues of similar ability, a trade-off emerges. The theoretical framework then makes precise how the introduction of feedback and prizes affect this trade-off.

The key empirical results from the field experiment are as follows. First, the introduction of feedback and of monetary prizes leads to significant changes in team composition. Relative to the piece rate regime, the share of team members connected by social ties is lower and team members’ ability levels are more similar under the feedback and tournament regimes.

Second, the feedback and tournament schemes have opposite effects on average productivity. Relative to the piece rate regime, the introduction of feedback significantly reduces average team productivity by 14%. The further introduction of a monetary prize tournament, conditional on the provision of feedback, significantly increases productivity by 24%. As made precise in the theoretical framework, the reduction in average productivity when feedback is provided is consistent with workers being better off sorting into teams on the basis of ability rather than friendship as feedback increases the strength of incentives faced, and the firm being worse off because it no longer harnesses the ability of socially connected workers to ameliorate free-riding within the team. Hence the endogenous formation of teams under feedback reduces the firm’s productivity overall. In contrast, the tournament incentives are sufficiently high-powered so the increase in worker’s effort more than offsets any increase in free-riding within teams. Hence the firm’s overall productivity rises.

Third, the dispersion of productivity increases under both regimes because both effects are heterogeneous as indicated by the theoretical framework. Quantile regression results show that the introduction of feedback reduces the productivity of teams at the bottom of the conditional productivity distribution compared to piece rates, while it has no effect on teams above the 40th percentile. In contrast, the introduction of prizes increases the productivity of teams at the top of the conditional productivity distribution compared to piece rates, while it has no effect on teams below the 30th percentile.

Fourth, focusing on the teams that remain intact after each change in incentives, the authors evaluate the effect of feedback and prizes on effort, holding constant team composition. They find that while the effect of feedback on team productivity is positive the magnitude appears small. This emphasizes that the documented negative effect of feedback is primarily due to the endogenous changes in team composition caused by the provision of feedback, rather than changes in behavior of the same team. In contrast the additional introduction of monetary prizes increases team productivity by 25% for teams that choose to remain intact. Hence the provision of monetary prizes affects firm performance through both the endogenous changes in team composition and changes in behavior within the same team.

Finally, the authors present qualitative evidence from a worker survey they conducted. As highlighted at the start of this section, this type of primary data collection that is inherent in field experiments, allows the authors to shed light on other margins of behavior between workers that might be affected by the monetary and non-monetary incentives provided, but that the firm does not collect data on ex ante. This survey data reveals that relative to the piece rate regime, during the tournament regime significantly fewer workers report pushing their team members to work hard or giving team members instructions. This is consistent with workers being better matched by ability and having fewer social connections with their team members under the tournament regime, so that peer pressure within the team becomes less effective.

By exploring changes in behavior on a range of dimensions, this evidence from the field highlights new directions for research in understanding how agents react to monetary and non-monetary incentives in workplaces characterized by team production where teams form endogenously.

4.3 The employment relationship

The neoclassical labor market model emphasizes workers behave opportunistically. For example, in the model sketched above from Lazear (2000), when workers compensation is not tied to their performance, as under a fixed hourly wage scheme, all workers exert the minimum effort required to achieve the minimum output requirement, . There is thus no variation in workers in their output or pay. We now explore the insights field experiments have provided on the existence and nature of such opportunistic behavior in real world settings. We do so through examples related to gift exchange in shirking.

4.3.1 Gift exchange

The standard labor market model assumes in equilibrium firms pay market clearing wages and workers provide minimum effort. This prediction does not receive uniform support empirically. There are numerous cases where employers are observed paying above the market equilibrium wage Akerlof (1982), and where workers exert more than the minimum effort level, as we have already discussed in relation to Lazear (2000) and in many other studies on employee performance under fixed wages. This has led to the development of the gift-exchange model which is based on the assumption of their being a positive association between wages and worker effort (Akerlof, 1982; Akerlof and Yellen, 1988, 1990). In this class of model, employers offer higher than market clearing wages, and workers are viewed to positively reciprocate by providing higher than the minimum required effort.

Clearly such theories are hard to test using non-experimental data: there might be a host of unobservable factors that create a correlation between wages and worker effort. Hence, there has been a large body of evidence established in laboratory settings on gift-exchange in firm settings, which began with Fehr et al. (1993). In this original study, they constructed a labor market equilibrium with excess labor supply so that the equilibrium wage was low. Employees also had no pecuniary incentive to raise the quality of their work above the minimum required level, so the best response of employers was to pay the low equilibrium wage. Contrary to the prediction, the majority of employers attempted to induce employees to invest greater effort by offering them higher than market-clearing wages. On average, this high wage was reciprocated by greater employee effort. Overall, it was profitable for employers to offer high wage contracts.

Gneezy and List (2006) use a natural field experiment then look for evidence of gift-exchange in similar real world environments in which equilibrium wages are low and workers earnings are not tied to their performance. In moving from the lab to the field, one important comparative static to evaluate is how behavior changes with the duration of the task. In other words, are the types of positive reciprocity observed by workers in the lab, a long run phenomena. The psychology literature provides two reasons why the duration of tasks might matter. First, there is the distinction between hot and cold decision making (Loewenstein and Schkade, 1999; Loewenstein, 2005). Second, there can be adaptation of behavior over time (Gilbert et al., 1998).

Two subject pools were utilized for the field experiments. In each a between subject design was used. The first field experiment recruited undergraduate students to participate in an effort to computerize the holdings of a small library at the university. The task was to enter data regarding the books into a computer database. In the no-Gift treatment, individuals were offered a flat wage of $12 per hour. In the Gift treatment, once the task was explained to participants, they were surprisingly paid $20 per hour rather then $12 per hour as advertised. In total 19 workers were hired for six hours each; 10 were randomly assigned to the no-Gift treatment. The second field experiment was part of a door to door fundraising drive to support a university research center. Fundraising solicitors were recruited. All solicitors were told they would be paid $10 per hour, and those in the Gift treatment were surprisingly told they would actually receive $20 per hour. In total 23 solicitors were employed over two days, with 10 being randomly assigned to the no-Gift treatment.⁷¹

The main results are as follows. First, in line with earlier evidence from laboratory settings, there are signs of significant gift exchange in the first few hours of the task, as measured both by effort in the library task and money raised in the fundraising task. For example, in the library task, effort is around 25% higher for those in the Gift treatment.

Second, there are significant falls in effort over time. After a few hours, there are no longer any significant differences in effort between the no-Gift and Gift treatments in either task. Figure 10 illustrates clearly how any positive reciprocity by workers is a short run phenomena in these two settings. Overall, the results suggest that with the same budget, the employer would have been better off just paying the market clearing wage as in the no-Gift treatments.

Figure 10 Gift exchange and the duration of tasks.

While the results go against the standard gift-exchange explanation of the positive association of wages and effort, they are in line with survey evidence on wage rigidity.⁷² For example Bewley (1999) considers why wages are downwardly rigid during a recession. He reports that managers are worried that wage cuts might result in decreases in morale that would subsequently result in poor worker performance when the economy recovered, if not immediately. This highlights the importance of fairness considerations in cases of negative reciprocity. With respect to positive reciprocity, as in this field setting, Bewley’s evidence is less conclusive. He argues that morale is less important when considering wage increases, but finds that one main consideration when determining raises is the effect on employee turnover once the recession ends. Bewley’s work suggests that there appears to be little connection between increasing pay and productivity, except to the extent that higher wages make it possible to attract, and retain, higher quality workers. This ties back to the earlier discussion of Lazear (2000) and the subsequent work on monetary incentives in firms, where it is thought there are qualitatively large selection effects of incentives driven by changes in workforce composition. Again, more evidence on these channels related to employee turnover are required to test more precisely a fuller set of theoretical predictions.

4.3.2 Shirking

Field experiments had also provided insights on research questions related to employee shirking behaviors. The standard economic framework emphasizes employees are rational shirkers: they will slack when the marginal benefits of doing so outweigh the marginal costs. Firms respond to such behaviors by choosing compensation and monitoring policies to reduce shirking. As emphasized above, this view that workers will behave so opportunistically when the marginal returns on their effort are low, is often contradicted by empirical evidence and predictions on behavior in such settings from the psychological and sociological literatures (Pfeffer, 1996; Kreps, 1997; Baron and Kreps, 1999). While we have earlier studied the role of compensation schemes and wage setting behavior to raise employee effort, we now focus on the effect employer monitoring has on worker behavior.

If employees are rational cheats then, conditional on a given incentive pay arrangement, a reduction in monitoring will lead to an increase in shirking. The most powerful sanction available to employers is typically dismissal. Thus, an increase in shirking resulting from reduced monitoring should be greatest among individuals for whom the ongoing employment relationship is least valuable.

As with most of the research questions posed in this chapter, establishing credible empirical evidence to support the theory is not straightforward. In this case there are two concerns that have plagued the non-experimental literature. First, shirking behavior is by its nature hard to detect. Moreover, the ability of the econometrician to detect shirkers might itself be endogenously related to the employer’s monitoring practices. Second, there might be unobserved factors, such as other hiring policies, that cause there to be a correlation between monitoring and shirking.

A carefully designed natural field experiment of Nagin et al. (2002) addresses both challenges. The setting was a telephone solicitation company, with employees dispersed across 16 call centers. At each call center, telephone solicitors were paid according to the same piece rate incentive scheme, one in which salary increased with the number of successful solicitations. This piece rate, together with imperfect information on the outcome of pledges, created incentives for employees to falsely claim that they had solicited a donation.

To curb opportunistic behavior, the employer monitored for false donations by calling back a fraction of those who had responded positively to a solicitation. Employees were informed when hired that their activities would be checked by “callbacks” made by management. The results of each week’s callbacks were communicated to both employees and their immediate supervisors, and the bad calls were deducted from each individual’s weekly incentive pay. Stronger sanctions for bad calls were not generally imposed on employees because the number of bad calls was understood to be a noisy indicator of cheating. For example, donors might sometimes change their mind after agreeing to pledge money. ⁷³

To see if the costs of implementing this monitoring system could be reduced, the company conducted a controlled field experiment. This experiment was “double blind” in the sense that neither the employees nor their immediate supervisors were aware of departures from “business as usual.” In the experiment, the employer varied the fraction of bad calls that were reported back to employees and supervisors at each call center. To more precisely estimate the true rate of fictitious pledges, the firm simultaneously increasing the true callback rate from 10% to 25% of pledges. By working closely with the firm, the researchers were able to collect primary survey data on employee attitudes toward the job, their expected job tenure, and the perceived difficulty of finding another, comparable job. These relate closely to the underlying theory of rational shirking behavior. This information is used to test whether those for whom the job was most valuable were also the employees least likely to engage in opportunistic behavior.

The main results are as follows. First, a significant fraction of employees behave according to the predictions of the rational cheater model. In particular, employees respond to a reduction in the perceived cost of opportunistic behavior by increasing the rate at which they shirk. Using the survey data collected, the authors find the employees who responded to reductions in monitoring tended to be those who perceived the employer as being unfair and uncaring. On the other hand, there is no evidence that individuals with good outside options increased shirking by more than other workers when the rate of monitoring declined. Second, a substantial proportion of employees do not appear to respond at all to manipulations in the monitoring rate. As with responses to monetary and non-monetary incentives documented above, there is considerable heterogeneity in how workers respond to employer monitoring. This underlying heterogeneity highlights the need to balance the need to reduce the shirking behavior of some workers inclined to rationally cheat, against those that are unlikely to do so under normal circumstances.

4.4 Moving forward

Economists have only recently begun to exploit field experiments in firms. This nascent literature has already highlighted the strengths of this methodology in being closely linked to testing alternative theories of individual behavior, of utilizing field experiments and structural modelling to make inference on the optimal design of incentive schemes, and to collect primary data to check for non-expected responses on other margins such as the quality of work, or to probe specific tests of the theory. We conclude by highlighting a few key areas for future work to consider.

First, the set of field experiments discussed have focused primarily, although not exclusively, on job tasks in which productivity is easy to observe, measure, compare across workers and time, and the quality of work performed is relatively easily monitored by management and assignable to individuals. Yet many jobs in the economy, or at higher tiers of firms’ hierarchies, do not share such characteristics, and more research is required in such settings where performance is evaluated more subjectively, and might therefore be subject to influence activities (Milgrom, 1988), or favoritism (Prendergast and Topel, 1996). As primary data collection is part of the field experimenter’s arsenal, this approach might especially help to shed light on these types of evaluations and incentive structures.

Second, most field experiments have been implemented to evaluate the effects of one time changes in management practices. Standard theory suggests history does not matter and that these effects should be equal and opposite to changing incentives the other way. This would be relatively straightforward to test, conditional on being able to control for natural time effects on behavior. A rejection of the standard model might then imply there can be persistent effects of short run changes in management practice. Such effects might operate through habit formation or reference point effects for example, that have been found using non-experimental data from real world settings (Mas, 2006).

Third, given the progression of field experiments exploring the effects of incentives on bottom-tier workers, and then to managers in the middle tier of the firm hierarchy, it is natural to ask whether field experiments might in the future extend to understanding executive pay. The last two decades have seen a surge in the popularity of performance pay for individuals in executive and managerial positions, from CEOs down to middle and lower management (Hall and Liebman, 1998; Hall and Murphy, 2003; Oyer and Schaefer, 2004). However as yet there remains mostly an unwillingness of organizations to experiment in relation to such high stakes positions.⁷⁴

Broader methodological issues remain to be borne in mind with regards to field experiments in firms. First, there are concerns over whether the set of firms and organizations that allow field experiments to be conducted within them, are selected in some way. For example, those firms that are most likely to gain from changes in management practices might be most amenable to field experiments on these dimensions. Given the potential for such non-random selection, field experiments ought to be designed to precisely measure differential effects, and less weight given to the levels effects.

Second, this body of field experiments offers an intriguing insight into whether firms choose their management practices optimally. Certainly, Shearer’s (2004) study highlights why the firm was using piece rates and not fixed wages. For the firm studied in Bandiera et al., in each case the firm followed up on the results of the field experiment by maintaining the incentives that were introduced. However we have to be careful that while field experiments have focused on the effects of carefully engineered interventions on productivity, the firm chooses practices to maximize discounted profits. Productivity increases need to translate in profit increases. An example of this is in the study by Freeman and Kleiner (2005) on a US shoe manufacturer, who find that the move from piece rates to hourly wages reduced productivity, but increased the quality of work to such an extent that profits rose overall. Clearly, there remains scope for experimentation within firms to help them learn the optimal behaviors, and for this to have a large impact economy-wide, and perhaps go some way to explaining large productivity differences across otherwise observationally similar firms.

5 Households

Much of an individual’s life cycle is spent in some form of partnership or family union. Despite widespread social changes in family structure in Western economies, families and multi-member households remain a key building block of society. Understanding how households make decisions has implications for many of the choices we have already touched upon, such as educational choices for children, labor market participation and labor supply. Shedding light on the household decision making processes also has profound implications for understanding whether, and how, policies such as income transfers and the regulation of marriage and divorce marriage markets, shape these outcomes.

The benchmark model of household behavior has been the unitary model, pioneered by Samuelson (1956) and Becker (1981). While this generates a rich set of predictions for price and income effects on household behaviors, it remains silent on how conflicts between spouses are resolved. Modelling household decision making as the outcome of a bargaining process provides a natural way in which to introduce conflicts (Manser and Brown, 1980; McElroy and Horney, 1981; Chiappori, 1988). Hence, where these approaches differ is in whether households maximize according to a common or dictatorial set of preferences—the unitary approach—or whether they seek to maximize a weighted sum of household member preferences—the basis of the bargaining approach. On the other hand, a key feature of both modelling frameworks is that households are assumed to make efficient decisions.

Households might reasonably be expected to reach efficient outcomes because they have repeated and long term interactions, in strategic environments characterized by perfect information, and have the ability to communicate costlessly. Nevertheless, a more recent strand of the literature has developed that takes seriously the idea that either household members behave non-cooperatively within marriage (Lundberg and Pollak, 2003; Basu, 2006; Mazzocco, 2004, 2007; Rasul, 2008). In each case, household decisions can then be inefficient.

There are two long-standing strands of the empirical literature on household decision making that stem from these views of the world. First, there have been a number of attempts to uncover whether households bargain efficiently, as is implied by both the unitary and collective choice models. Many of these tests take the form of examining patterns of household demand and consumption (Browning and Chiappori, 1998) or testing for the equality of the marginal product of labor of household members across economic activities (Udry, 1994; Akresh, 2005). A first generation of field experiments on households has begun to shed light on this issue.⁷⁵

Second, there is an older strand of the literature that uses non-experimental approaches to test for the assumption on whether households pool income, consistent with the predictions of the unitary framework, or whether the identity of the income earner matters for outcomes (Thomas, 1990, 1994; Hoddinott and Haddad, 1995; Duflo, 2003; Duflo and Udry, 2004; Rangel, 2006). Rather surprisingly given the roots of field experiments in the social experiments of the 1970s, relatively fewer field experiments have been conducted to help test the specific predictions of either unitary or collective bargaining frameworks.

A parallel stream of literature relates to the use of social experiments to evaluate conditional cash transfer programs, which was touched upon earlier. Two notable studies that have used data from the PROGRESA intervention in rural Mexico are Attanasio et al. (2006) and Todd and Wolpin (2006). These both combine the experimental variation in PROGRESA transfers across randomly assigned villages with structural estimation of a household’s dynamic behavior to shed light on outcomes under alternative policy designs.

5.1 Efficiency

Ashraf (2009) presents evidence from a framed field experiment to understand how information and communication affect household financial decisions. The experiment was conducted with a sample of current or former clients of a rural bank in the Philippines. The main decision each subject had to make was over whether to spend or save income received during the experiment. More precisely, subjects had to choose how to allocate 200 pesos received between: (i) direct deposits into their own or a joint account; (ii) committed consumption using redeemable gift certificates. Each subject was randomly assigned, with his or her spouse, to one of three treatments that varied the privacy of information spouses had, and the ability of spouses to communicate with each other. 149 married couples are involved in the experiment.⁷⁶

In the first treatment, subjects are separated from their spouses at the outset of the experiment. This treatment is referred to as “private information without pre-play communication”. Under this treatment spouses have no information on whether and how much income is received by the spouse, what decisions they have made, or the outcomes obtained. In the second treatment spouses learn each others’ payoffs and choice sets. In this treatment, referred to as “public information without pre-play communication”, spouses make simultaneous decisions and so cannot communicate nor observe each others decisions ex ante. In the final treatment the procedure is as in the previous treatment except that spouses are able to communicate before making their decisions, and their decisions are observable to each other. This is referred to as the “public treatment”.

Clearly, in the absence of a field experiment, trying to uncover and exploit plausibly exogenous sources of variation in the information available to spouses or their ability to communicate is difficult, and likely to be correlated to factors that affect outcomes directly. Hence this research design allows economists to more carefully scrutinize causal changes in behavior along dimensions that are theoretically important, yet empirically almost impossible to measure in the absence of a field experiment. However, as with the other settings considered throughout, field experiments conducted with households raise important issues that need to be taken into consideration when interpreting results.

First, in common with laboratory experiments, behavior in framed field experiments might not mimic behavior in the real world. Ashraf (2009) addresses this concern by running the experiment in conjunction with a rural bank that all participants were familiar with, and by designing treatments that capture real world differences in communication and information across households in this setting. Second, households are engaged in repeated interactions outside of the context of the field experiment. Hence behavior within an experiment can be undone, or potentially reinforced, by behavior outside of the experiment. To try and address this issue, Ashraf provides payoffs in the form of person-specific gift certificates. Both methodological issues need to be considered in all field experiments with households.

Ashraf’s (2009) results shed light on the interplay between information, communication and gender in household decision making. Relative to field experiments in other settings, when the experimenter is engaged in primary data collection with households it is of even greater importance to understand societal norms of behavior within marriage. For example, in the Philippines, women are typically in charge of the financial management of the household, making key decisions on budgeting and allocation. Understanding the context in which the experiment takes place is crucial for designing treatments that reflect real world trade-offs subjects face, and to closely align experimental designs with a theoretical framework. Of course, the cost of this precision in any given context is the limited ability, all else equal, to extrapolate findings to households operating under very different norms.

The three main results are as follows. First, men are found to be more likely to deposit money into their own account under the private treatments, and are more likely to commit it to consumption under the public treatment. Second, the differences in behavior by gender are subtle. A subset of women—those whose husbands normally control household savings decisions—behave in the same way as men whose wives normally control household savings decisions. Third, communication between spouses at the time of decision making induces the majority of men to place the income into their spouses account rather than consume it or put it into their own account. To understand these results, Ashraf discusses a framework of income monitoring within the household where observability of income and communication at the time financial decision making, significantly change the monitor’s ability to enforce contracts. The results can then be understood as spouses responding strategically to changes in information and communication and contract enforceability. They suggests a specific channel through which asymmetric information can create inefficient outcomes in financial decision making, by providing incentives to hide one’s additional income from one’s spouse. ⁷⁷

5.2 Moving forward

Labor economists have sought to explain a far richer set of research questions than just those related to behavior within households. Foremost among these other issues has been the research into the causes and consequences of the formation and dissolution of households. Field experiments have recently begun to help explore issues related to the formation of households or partnerships in the first place. Two examples are Fisman et al. (2006) and Fisman et al. (2008) who conduct a framed field experiment to measure differential preferences in dating across genders and races, respectively. To do so, both studies analyze individual choices of subjects in an experimental speed dating game. ⁷⁸

On differential preferences across genders, Fisman et al. (2006) find that women place greater weight on the intelligence and the race of partner, while men respond more to physical attractiveness. Moreover, men do not value women’s intelligence or ambition when it exceeds their own.

On racial preferences, Fisman et al. (2008) find that there is a strong asymmetry in racial preferences across genders: women of all races exhibit strong same race preferences, while men—of all races —do not. Second, subjects’ background influences their racial preferences: subjects that come from locations that are measured to be more racially intolerant, using data from the General Social Survey and World Values Surveys, reveal stronger preferences for same race preferences. This is despite the subject pool being drawn from individuals that currently reside away from home, and attend a top US university. Third, those exposed to other races in early life—as measured by the fraction of individuals of a given race in the zip code where the subject grew up—are less willing to date someone from this race, suggesting that familiarity might reduce racial tolerance. Finally, physically more attractive individuals are less sensitive to the race of potential partners in the experiment.

This experimental approach provides a nice complement to other non-experimental studies applying structural methods to estimate similar preference parameters in the context of online dating services (Hitsch et al., 2010). Given the growth in availability of online data in economic research, perhaps in the near future we will witness research methods combining field experiments with interventions akin to audit studies that were previously discussed in relation to the economics of discrimination.

As yet though, on many aspects of the formation and dissolution of families, few research designs have credibly exploited experimental sources of variation from which to identify causal effects. The nature of questions involved might mean these sets of research questions remain outside the domain of field experiments.

6 Concluding Remarks

Given that complexities of markets severely constrain the ability of traditional economic tools to examine behavioral relationships, it is not surprising that economists have increasingly turned to experimental methods. Within this recent trend is a relatively new approach—field experiments—which have dramatically risen in popularity over the past several years. Since field experiments will likely continue to grow in popularity as scholars continue to take advantage of the settings where economic phenomena present themselves, we view this study as an opportunity to step back and discuss a few of the areas within labor economics wherein field experiments have contributed to our economic understanding. Our central task is to highlight what we view to be the central advantages of the field experimental approach: (i) using economic theory to design the null and alternative hypotheses; (ii) engineering exogenous variation in real world economic environments to establish causal relations and learn the mechanisms behind them; and (iii) engaging in primary data collection and often working closely with practitioners.

A second goal of this study is to draw attention to a methodological contribution of field experiments: complementing other empirical approaches and allowing an exploration of the generalizability of behaviors across settings, such as lab and field behavior. When taking account of the stock of evidence, it becomes clear how field experiments can play an important role in the discovery process by allowing one to make stronger inference than can be achieved from lab or uncontrolled data alone. In this way, the various empirical approaches should be thought of as strong complements—much like theory and empirical modeling—and combining insights from each of the methodologies will permit economists to develop a deeper understanding of our science.

References

Abdulkadiroglu, A., Angrist, J.D., Dynarski, S.M., Kane, T.J., Pathak, P., 2009. Accountability and flexibility in public schools: evidence from Boston’s charters and pilots. National Bureau of Economic Research Working Paper No. 15549

D. Acemoglu. Technical change, inequality, and the labor market. Journal of Economic Literature. 2002;40:7-72.

G.A. Akerlof. The theory of social custom, of which unemployment may be one consequence. Quarterly Journal of Economics. 1980;94:749-775.

G.A. Akerlof. Labor contracts as a partial gift exchange. Quarterly Journal of Economics. 1982;97:543-569.

G.A. Akerlof, J.L. Yellen. Fairness and unemployment. American Economic Review Papers and Proceedings. 1988;78:44-49.

G.A. Akerlof, J.L. Yellen. The fair wage-effort hypothesis and unemployment. Quarterly Journal of Economics. 1990;105:255-283.

G.A. Akerlof, R. Kranton. Identity and the economics of organizations. Journal of Economic Perspectives. 2005;19:9-32.

Akresh, R., 2005. Understanding Pareto inefficient intrahousehold allocations. Institute for the Study of Labor Discussion Paper 1858

P. Alabarran, O. Attanasio. Limited commitment and crowding out of private transfers: evidence from a randomised experiment. Economic Journal. 2003;113:C77-C85.

A.A. Alchian, H. Demsetz. Production, information costs, and economic organizations. American Economic Review. 1972;62:777-795.

J. Alevy, M. Haigh, J.A. List. Information cascades: evidence from a field experiment with financial market professionals. Journal of Finance. 2007;62:151-180.

J.G. Altonji, R. Blank. Race and gender in the labor market. O. Ashenfelter, D. Card, editors. Handbook of Labor Economics. vol. 3C. Elsevier Science B.V.; 1999:3143-3259.

J.G. Altonji, C. Pierret. Employer learning and statistical discrimination. Quarterly Journal of Economics. 2001;CXVI:293-312.

M.L. Anderson. Multiple inference and gender differences in the effects of early intervention: a reevaluation of the Abecedarian, Perry preschool, and Early Training projects. Journal of the American Statistical Association. 2008;103:1481-1495.

J. Andreoni, R. Petrie. Public goods experiments without confidentiality: a glimpse into fund-raising. Journal of Public Economics. 2004;88:1605-1623.

J.D. Angrist, A.B. Krueger. Empirical Strategies in Labor Economics. Handbook of Labor Economics. vol. 3, Part A. Elsevier; 1999:1277-1366. (Chapter 23)

J.D. Angrist, A.B. Krueger. Instrumental variables and the search for identification: from supply and demand to natural experiments. Journal of Economic Perspectives. 2001;15:69-85.

J.D. Angrist, K. Lang. Does school integration generate peer effects? Evidence from Boston’s Metco program. American Economic Review. 2004;94(5):1613-1634.

J.D. Angrist, V. Lavy. Using Maimonides’ rule to estimate the effect of class size on scholastic achievement. Quarterly Journal of Economics. 1999;114:533-575.

J.D. Angrist, V. Lavy. New evidence on classroom computers and pupil learning. Economic Journal. 2002;112:735-765.

J.D. Angrist, V. Lavy. The effects of high stakes high school achievement awards: evidence from a randomized trial. American Economic Review. 2009;99(4):1384-1414.

J.D. Angrist, E. Bettinger, E. Bloom, E. King, M. Kremer. Vouchers for private schooling in colombia: evidence from a randomized natural experiment. American Economic Review. 2002;92:1535-1558.

J.D. Angrist, E. Bettinger, M. Kremer. Long-term educational consequences of secondary school vouchers: evidence from administrative records in Colombia. American Economic Review. 2006;96:847-862.

Angrist, J.D., Barski, S.M., Kane, T.J., Pathak, P., Walters, C.R., 2010. Who benefits from KIPP? National Bureau of Economic Research Working Paper No. 15740

Aoyagi, M., 2007. Information feedback in a dynamic tournament. mimeo, Osaka University

G.R. Arabsheibani, A. Marin, J. Wadsworth. Gay pay in the UK. Economica. 2005;72:333-347.

P. Arcidiacono. Affirmative action in higher education: how do admission and financial aid rules affect future earnings? Econometrica. 2005;73:1477-1524.

K. Arrow. The theory of discrimination. In: O. Ashenfelter, A. Rees, editors. Discrimination in Labor Markets. Princeton, NJ: Princeton University Press, 1972.

O. Ashenfelter. Nonparametric estimates of the labor-supply effects of negative income tax programs. Journal of Labor Economics. 1990;8:S396-S415.

N. Ashraf. Spousal control and intra-household decision making: an experimental study in the philippines. American Economic Review. 2009;99:1245-1277.

Attanasio, O., Barr, A., Cardenas, J.C., Genicot, G., Meghir, C., 2009. Risk pooling, risk preferences, and social networks. mimeo, Oxford University

Attanasio, O., Meghir, C., Santiago, A., 2006. Education choices in Mexico: using a structural model and a randomised experiment to evaluate progresa. IFS Working Paper

Autor, D., Katz, L., Kearney, M., 2005. Rising wage inequality: the role of composition and prices. mimeo, Harvard University

Azmat, G., Iriberri, N., 2009. The importance of relative performance feedback information: evidence from a natural experiment using high school students. mimeo, Universitat Pompeu Fabra

G. Baker, M.C. Jensen, K.J. Murphy. Compensation and incentives: practice vs. theory. Journal of Finance. 1988;43:593-616.

S. Ball, C. Eckel, P.J. Grossman, W. Zame. Status in markets. Quarterly Journal of Economics. 2001;116:161-188.

O. Bandiera, I. Barankay, I. Rasul. Social preferences and the response to incentives: evidence from personnel data. Quarterly Journal of Economics. 2005;120:917-962.

O. Bandiera, I. Barankay, I. Rasul. Incentives for managers and inequality among workers: evidence from a firm level experiment. Quarterly Journal of Economics. 2007;122:729-774.

Bandiera, O., Barankay, I., Rasul, I., 2009a. Team incentives: evidence from a field experiment, mimeo, University College London

O. Bandiera, I. Barankay, I. Rasul. Social incentives in the workplace. Review of Economic Studies. 2010;77(April):417-459.

Bandiera, O., Larcinese, V., Rasul, I., 2009b. Blissful ignorance? Evidence from a natural experiment on the effect of individual feedback on performance. mimeo, LSE

Bandiera, O., Larcinese, V., Rasul, I., 2010. Heterogeneous class size effects: new evidence from a panel of university students. Economic Journal (forthcoming)

Banerjee, A.V., Suraj, J., Kremer, M., Lanjouw, J., Lanjouw, P., 2001. Promoting school participation in rural rajasthan: results from some prosepctve trials. mimeo, Massachusetts Institute of Technology

A.V. Banerjee, S. Cole, E. Duflo, L. Linden. Remedying education: evidence from two randomized experiments in India. Quarterly Journal of Economics. 2007;122:1235-1264.

J. Banks, R. Blundell, A. Lewbel. Quadratic Engel curves and consumer demand. Review of Economics and Statistics. 1997;79:527-539.

C. Barnard. The Functions of the Executive. Cambridge: Harvard University Press; 1938.

J.N. Baron, D.M. Kreps. Strategic Human Resources: Frameworks for General Managers. New York: John Wiley and Sons, Inc.; 1999.

J.N. Baron, J. Pfeffer. The social psychology of organizations and inequality. Social Psychology Quarterly. 1994;57:190-209.

A. Barr, P. Serneels. Reciprocity in the workplace. Experimental Economics. 2009;12:99-112.

Barrera-Osorio, F., Bertrand, M., Linden, L., Perez-Calle, F., 2008. Conditional cash transfers in education: design features. Peer and Sibling Effects: Evidence from a Randomized Experiment in Colombia, The World Bank: Impact Evaluation Series No. 20

K. Basu. Gender and say: a model of household behavior with endogenously-determined balance of power. Economic Journal. 2006;116:558-580.

G.S. Becker. The Economics of Discrimination, second ed. Chicago: University of Chicago Press; 1957.

G.S. Becker. A Treatise on the Family. Cambridge: Harvard University Press; 1981.

Beer, M., 1990. Performance Appraisal. mimeo, Harvard Business School

J.R. Behrman, Y. Cheng, P.E. Todd. Evaluating preschool programs when length of exposure to the program varies: a nonparametric approach. Review of Economics and Statistics. 2004;86:108-132.

Benabou, R., Tirole, J., 2000. Self-confidence and social interactions. NBER Working Paper 7585

R. Benabou, J. Tirole. Intrinsic and extrinsic motivation. Review of Economic Studies. 2003;70:489-520.

N. Berg, D. Lien. Measuring the effect of sexual orientation on income: evidence of discrimination? Contemporary Economic Policy. 2002;20:394-414.

D. Bernheim. A theory of conformity. Journal of Political Economy. 1994;102:841-877.

Berry, J., 2009. Child control in education decisions: an evaluation of targeted incentives to learn in India. mimeo, Massachusetts Institute of Technology

M. Bertrand, S. Mullainathan. Are Emily and Greg more employable than Lakisha and Jamal? A field experiment on labor market discrimination. American Economic Review. 2004;94:991-1013.

M. Bertrand, A. Schoar. Managing with style: the effect of managers on firm policies. Quarterly Journal of Economics. 2003;118:1169-1208.

T.J. Besley, M. Ghatak. Competition and incentives with motivated agents. American Economic Review. 2005;95:616-636.

T.J. Besley, M. Ghatak. Status incentives. American Economic Review Papers and Proceedings. 2008;98:206-211.

T.F. Bewley. Why Wages Don’t Fall During a Recession. Cambridge: Harvard University Press; 1999.

Bhaskar, V., 2002. Relative performance evaluation and limited liability. mimeo, University of Essex

Bhattacharya, D., Dupas, P., 2010. Inferring welfare maximizing treatment assignment under budget constraints. NBER Working Paper 14447

A. Bjorklund, H. Regner. Experimental evaluation of european labour market policy. G. Scmid, J. O’Reilly, editors. International Handbook of Labour Market Policy and Evaluation. 1996:89-115. (Chapter 3)

Blanes-i-Vidal, J., Nossol, M., 2009. Tournaments without prizes: evidence from personnel records. mimeo LSE

N. Bloom, J. Van Reenen. Human resource management and productivity. New Developments and Research on Labor Markets. first ed. O. Ashenfelter, D. Card, editors. Handbook of Labor Economics. vol. 4B. Elsevier; 2011:1697-1767. (Chapter 19)

R. Blundell, M. Costa-Dias. Alternative approaches to evaluation in empirical microeconomics. Portuguese Economic Journal. 2002;1:91-115.

H. Bleakley. Disease and development: evidence from hookworm eradication in the American South. Quarterly Journal of Economics. 2007;122:73-112.

G.J. Bobonis, E. Miguel, C. Puri-Sharma. Anemia and school participation. Journal of Human Resources. 2006;41:692-721.

M.L. Bognanno. Corporate tournaments. Journal of Labor Economics. 2001;19:290-315.

P. Bohm. Revealing demand for an actual public good. Journal of Public Economics. 1984;24:135-151.

Boozer, M., Goldstein, M.P., 2003. Poverty measurement and dynamics. mimeo, Yale University

D. Bramel, R. Friend. Hawthorne, the myth of the docile worker, and class bias in psychology. American Psychologist. 1981;36:867-878.

J. Brown-Kruse, M.B. Cronshaw, D.J. Schenk. Theory and experiments on spatial competition. Economic Inquiry. 2007;31:139-165.

M. Browning, P.-A. Chiappori. Efficient intra-household allocations: a general characterisation and empirical tests. Econometrica. 1998;66:1241-1278.

M. Browning, C. Meghir. The effects of male and female labor supply on commodity demands. Econometrica. 1991;59:925-951.

C. Bull, A. Schotter, K. Weigelt. Tournaments and piece rates: an experimental study. Journal of Political Economy. 1987;95:1-33.

Burgess, S., Propper, C., Slater, H., Wilson, D., 2005. Who wins and who loses from school accountability? The distribution of educational gain in English secondary schools. CMPO Working Paper Series 05/128

R. Butler. Task-involving and ego-involving properties of evaluation: effects of different evaluation conditions on motivational perceptions, interest, and performance. Journal of Educational Psychology. 1987;79:474-482.

C.B. Cadsby, F. Song, F. Tapon. Sorting and incentive effects of pay for performance: an experimental investigation. Academy of Management Journal. 2007;50:387-405.

P. Cahuc, F. Postel-Vinay, J.-M. Robin. Wage bargaining with on-the-job search: a structural econometric model. Econometrica. 2006;74:323-364.

J. Cameron, W.D. Pierce. Reinforcement, reward, and intrinsic motivation: a meta-analysis. Review of Educational Research. 1994;64:363-423.

S.V. Cameron, J.J. Heckman. The dynamics of educational attainment for black, hispanic, and white males. Journal of Political Economy. 2001;109:455-499.

S.V. Cameron, C. Taber. Estimation of educational borrowing constraints using returns to schooling. Journal of Political Economy. 2004;112:132-182.

D. Card. The causal effect of education on earnings. Handbook of Labor Economics. first ed. O. Ashenfelter, D. Card, editors. Handbook of Labor Economics. vol. 3. Elsevier; 1999:1801-1863. (Chapter 30)

D. Card, A.B. Krueger. School quality and black-white relative earnings: a direct assessment. Quarterly Journal of Economics. 1992;107:151-200.

J. Carpenter, E. Seki. Do social preferences increase productivity? Field experimental evidence from fisherman in Toyama bay. Economic Inquiry. 2010. doi:10.1111/j.1465-7295.2009.00268.x

A. Case, A. Deaton. School inputs and educational outcome in South Africa. Quarterly Journal of Economics. 1999;114:1047-1084.

K.K. Charles, J. Guryan. Prejudice and Wages: an empirical assessment of Becker’s The Economics of Discrimination. Journal of Political Economy. 2008;116:773-809.

G. Charness, M. Rabin. Understanding social preferences with simple tests. Quarterly Journal of Economics. 2002;117:817-869.

Charness, G., Yang, C.-L., 2008. Endogenous group formation and public goods provision: exclusion, exit, mergers, and redemption. UC Santa Barbara Department of Economics Working Paper 13-08

Charness, G., Kuhn, P., 2010. Lab labor: what can labor economists learn from the lab? NBER Working Paper 15913

G. Charness, P. Kuhn. Lab labor: What can labor economists learn from the lab? In: O. Ashenfelter, D. Card, editors. Handbook of Labor Economics, vol. 4A. Elsevier; 2011:229-331.

G. Charness, M.-C. Villeval. Cooperation and competition in intergenerational experiments in the field and laboratory. American Economic Review. 2009;99:956-978.

Z. Chen, F. Woolley. A Cournot-Nash model of family decision making. Economic Journal. 2001;111:722-748.

P.-A. Chiappori. Rational household labor supply. Econometrica. 1988;56:63-90.

P.-A. Chiappori, B. Salanie. Testing contract theory: a survey of some recent work. In: M. Dewatripont, L.P. Hansen, S.J. Turnovsky, editors. Advances in Economics and Econometrics Theory and Applications: Eighth World Congress Volume 1. Cambridge: Cambridge University Press, 2003.

M. Cipriani, A. Guarino. Herd behavior in financial markets: an experiment with financial market professionals. Journal of the European Economic Association. 2009;7:206-233.

D. Clark. The performance and competitive effects of school autonomy. Journal of Political Economy. 2009;117:745-783.

S. Coate, G.C. Loury. Will affirmative-action policies eliminate negative stereotypes? American Economic Review. 1993;83:1220-1240.

J.S. Coleman, et al. Equality of Educational Opportunity. Washington, DC: US Government Printing Office; 1966.

C. Cornwell, D.B. Mustard, D.J. Sridhar. The enrollment effects of merit-based financial aid: evidence from Georgia’s HOPE program. Journal of Labor Economics. 2006;24:761-786.

J.B. Cullen, B.A. Jacob, S.D. Levitt. The effect of school choice on participants: evidence from randomized lotteries. Econometrica. 2006;74:1191-1230.

F. Cunha, J.J. Heckman. The economics of psychology of inequality and human development. Journal of the European Economic Association. 2009;7:320-364.

D.J. Cooper, J.H. Kagel, W. Lo, Q.L. Gu. Gaming against managers in incentive systems: experimental results with Chinese students and Chinese managers. American Economic Review. 1999;89:781-804.

J. Currie. Early childhood education programs. Journal of Economic Perspectives. 2001;15:213-238.

J. Currie, D. Thomas. Does head start make a difference? American Economic Review. 1995;85(3):341-364.

J. Currie, D. Thomas. School quality and the longer term effects of head start. Journal of Human Resources. 2000;35:755-774.

A.S. Deaton. Price elasticities from survey data: extensions and Indonesian results. Journal of Econometrics. 1990;44:281-309.

Dobbie, W., Fryer, R.G., 2009. Are high quality schools enough to close the achievement gap? Evidence from social experiment in Harlem. National Bureau of Economic Research Working Paper No. 15473

De Giorgi, G., Pellizzari, M., Woolston, W.G., 2009. Class-size and class heterogeneity. mimeo, Stanford University

E.L. Deci, R. Koestner, R.M. Ryan. Extrinsic rewards and intrinsic motivation in education: reconsidered once again. Review of Educational Research. 2001;71:1-27.

Delfgaauw, J., Dur, R., 2004. Incentives and workers’ motivation in the public sector. Tinbergen Institute Discussion Papers 04-060/1

M. Dewatripont, I. Jewitt, J. Tirole. The economics of career concerns, Part II: application to missions and accountability of government agencies. Review of Economic Studies. 1999;66:199-217.

M. Ding, R. Grewal, J. Liechty. Incentive-aligned conjoint analysis. Journal of Marketing Research. 2005;42:67-83.

R. Dingwall. Ethics and Ethnography. Sociological Review. 1980;28:871-891.

A. Dixit. Incentives and organizations in the public sector: an interpretative review. Journal of Human Resources. 2002;37:696-727.

Dohmen, T.J., Falk, A., 2006. Performance pay and multi-dimensional sorting: productivity, preferences and gender. Institute for the Study of Labor Discussion Paper No. 2001

Doolittle, F., Traeger, L., 1990. Implementing the national JPTA study. Department of Labor, Washington DC

Dubois, P., Ligon, E., 2004. Incentives and nutrition for rotten kids: intrahousehold food allocation in the Phillipines. mimeo, UC Berkeley

E. Duflo. Grandmothers and granddaughters: old age pension and intra-household allocation in South Africa. World Bank Economic Review. 2003;17:1-25.

Duflo, E., Udry, C., 2004. Intrahousehold resource allocation in Cote d’Ivoire: social norms. Separate Accounts, and Consumption Choices, NBER Working Paper No. 10498

E. Duflo, W. Gale, J. Iebman, P. Orszag, E. Saez. Saving incentives for low- and middle-income families: evidence from a field experiment with H&R block. Quarterly Journal of Economics. 2006;121:1311-1346.

Duflo, E., Dupas, P., Kremer, M., 2009. Peer effects, teacher incentives and the impact of tracking: evidence from a randomized evaluation in Kenya. National Bureau of Economic Research Working Paper No. 14475

F.W. Dunford. Random assignment: practical considerations from field experiments. Evaluation and Program Planning. 1990;13:125-132.

J.S. Duesenberry. Income, Saving, and the Theory of Consumer Behavior. Cambridge, MA: Harvard University Press; 1949.

Ederer, F., 2008. Feedback and Motivation in Dynamic Tournaments, mimeo, MIT

Ederer, F., Fehr, E., 2007. Deception and incentives: how dishonesty undermines effort provision. IZA Discussion Paper 3200

T. Ellingsen, M. Johannesson. Pride and prejudice: the human side of incentive theory. American Economic Review. 2008;98:990-1008.

Encinosa, W.E., Gaynor, M.S., Rebitzer, J.B., 1997. The sociology of groups and the economics of incentives: theory and evidence on compensation systems. National Bureau of Economic Research Working Paper 5953

P. Engle, M. Black, J. Behrman, M. Cabral De Mello, P. Gertler, L. Kapiriri, R. Martorell, M. Young. Strategies to avoid the loss of developmental potential in more than 200 million children in the developing world. The Lancet. 2007;369:229-242.

D. Epple, R.E. Romano. Competition between private and public schools, vouchers, and peer-group effects. American Economic Review. 1998;88:33-62.

D. Epple, R.E. Romano, H. Sieg. Admission, tuition, and financial aid policies in the market for higher education. Econometrica. 2006;74:885-928.

R. Ehrenberg, M. Bognanno. The incentive effects of tournaments revisited: evidence from the european PGA tour. Industrial Labor Relations Review. 1990;43:74-89.

T. Eriksson. Executive compensation and tournament theory: empirical tests on Danish data. Journal of Labor Economics. 1999;17:262-280.

Eriksson, T., Poulsen, A., Villeval, M.-C., 2008a. Feedback and incentives: experimental evidence. IZA Discussion Paper 3440

T. Eriksson, S. Teyssier, M.-C. Villeval. Self-selection and the efficiency of tournaments. Economic Inquiry. 2008;47:530-548.

R. Ferber, W.Z. Hirsch. Social Experimentation and Economic Policy. London: Cambridge University Press; 1982.

E. Fehr, A. Falk. Wage rigidity in a competitive incomplete contract market. Journal of Political Economy. 1999;107:106-134.

E. Fehr, U. Fischbacher. Why social preferences matter–the impact of non–selfish motives on competition, cooperation and incentives. Economic Journal. 2002;112:C1-C33.

E. Fehr, K.M. Schmidt. A theory of fairness, competition and cooperation. Quarterly Journal of Economics. 1999;114:817-868.

E. Fehr, K.M. Schmidt. Fairness, incentives, and contractual choices. European Economic Review. 2000;44(4–6):1057-1068.

E. Fehr, G. Kirchsteiger, A. Riedl. Does fairness prevent market clearing? An experimental investigation. Quarterly Journal of Economics. 1993;108:437-459.

E. Fehr, L. Goette, D. Huffman. Loss aversion and labor supply. Journal of the European Economic Association. 2004;2:216-228.

M.M. Ferreyra. Estimating the effects of private school vouchers in multidistrict economies. American Economic Review. 2007;97:789-817.

Fershtman, C., Hvide, H.K., Weiss, Y., 2003. Cultural Diversity, Status Concerns and the Organization of Work. CEPR Discussion Paper No. 3982

Figlio, D., Getzler, L., 2002. Accountability, Ability and Disability: Gaming the System. National Bureau of Economic Research Working Paper 9307

D.N. Figlio, L.W. Kenny. Individual teacher incentives and student performance. Journal of Public Economics. 2007;91:901-914.

D.N. Figlio, C.E. Rouse. Do accountability and voucher threats improve low-performing schools? Journal of Public Economics. 2006;90(1–2):239-255.

R.A. Fisher. The arrangement of field trials. Journal of the Ministry of Agriculture of Great Britain. 1926;33:503-513.

R. Fisman, S. Iyengar, E. Kamenica, I. Simonson. Gender differences in mate selection: evidence from a speed dating experiment. Quarterly Journal of Economics. 2006;121:673-697.

R. Fisman, S. Iyengar, E. Kamenica. Racial preferences in dating: evidence from a speed dating experiment. Review of Economic Studies. 2008;75:117-132.

P. Francois. Public service motivation as an argument for government provision. Journal of Public Economics. 2000;78:275-299.

R.H. Frank. Choosing the right pond: human behavior and the quest for status. New York: Oxford University Press; 1985.

R.H. Franke, J.D. Kaul. The Hawthorne experiments: first statistical interpretation. American Sociological Review. 1978;43:623-643.

Freeman, R.B., 1987. Labour Economics. The New Palgrave Dictionary of Economics, first edition

R.B. Freeman, M.M. Kleiner. The last American shoe manufacturers: decreasing productivity and increasing profits in the shift from piece rates to continuous flow production. Industrial Relations. 2005;44:307-330.

Freeman, R.B., Gelber, A.M., 2008. Prize Structure and Information in Tournaments: Experimental Evidence. mimeo, Harvard University

M. Friedman, L.J. Savage. The utility analysis of choice involving risk. Journal of Political Economy. 1948;56:279-304.

R.G. Fryer. Racial inequality in the 21st century: the declining significance of discrimination. New Developments and Research on Labor Markets. first ed. O. Ashenfelter, D. Card, editors. Handbook of Labor Economics. vol. 4B. Elsevier; 2011:855-971. (Chapter 10)

E.A.M. Gale. The Hawthorne studies—a fable for our times? Quarterly Journal of Medicine. 2004;97:439-449.

E. Garces, D. Thomas, J. Currie. Longer-term effects of Head Start. American Economic Review. 2002;92:999-1012.

R.S. Gibbons. Piece-rate incentive schemes. Journal of Labor Economics. 1987;5:413-429.

R.S. Gibbons, K.J. Murphy. Relative performance evaluation for chief executive officers. Industrial Labor Relations Review. 1990;43:30-52.

Gibbs, M., 1991. An Economic Approach to Process in Pay and Performance Appraisals. mimeo, University of Chicago GSB

D.T. Gilbert, E.C. Pinel, T.D. Wilson, S.J. Blumberg, T.P. Wheatley. Immune neglect: a source of durability bias in affective forecasting. Journal of Personality and Social Psychology. 1998;75:617-638.

R. Gillespie. Manufacturing Knowledge: A History of the Hawthorne Experiments. New York: Cambridge University Press; 1991.

Gine, X., Karlan, D., Zinman, J., 2007. The Risk of Asking: Measurement Effects from a Baseline Survey in an Insurance Takeup Experiment. Working paper, World Bank

P. Glewwe, M. Kremer. Schools, teachers, and education outcomes in developing countries. In: Handbook of the Economics of Education. Elsevier; 2006:945-1017. (Chapter 16)

Glewwe, P., Kremer, M., Ilias, N., 2003. Teacher Incentives. National Bureau of Economic Research Working Paper No. 9671

P. Glewwe, M. Kremer, S. Moulin, E. Zitzewitz. Retrospective vs. prospective analyses of school inputs: the case of flip charts in Kenya. Journal of Development Economics. 2004;74:251-268.

U. Gneezy, J.A. List. Putting behavioral economics to work: testing for gift exchange in labor markets using field experiments. Econometrica. 2006;74:1365-1384.

Gneezy, U., List, J.A., Price, M.K., 2010. Clean Evidence of Statistical Disrimination. Working Paper, University of Chicago

U. Gneezy, A. Rustichini. Pay enough or don’t pay at all. Quarterly Journal of Economics. 2000;115:791-810.

C. Goldin, C. Rouse. Orchestrating impartiality: the impact of “blind” auditions on female musicians. American Economic Review. 2000;99:715-741.

Goldstein, M.P., Udry, C.R., 1999. Gender and Land Resource Management in Southern Ghana. mimeo, Yale University

S. Grantham-McGregor, C. Powell, S. Walker, J. Himes. Nutritional supplementation, psychosocial stimulation, and mental development of stunted children: the Jamaican study. The Lancet. 1991;338:1-5.

S. Grantham-McGregor, Y. Cheung, S. Cueto, P. Glewwe, L. Richter, B. Strupp. Developmental potential in the first 5 years for children in developing countries. The Lancet. 2007;369:60-70.

J.R. Green, N.L. Stokey. A comparison of tournaments and contracts. Journal of Political Economy. 1983;91:349-364.

J. Greenberg. Equity and workplace status: a field experiment. Journal of Applied Psychology. 1988;73:606-613.

D. Greenberg, M. Shroder, M. Onstott. The social experiment market. Journal of Economic Perspectives. 1999;13:157-172.

D. Greenberg, M. Shroder. The Digest of Social Experiments. Washington: The Urban Institute Press; 2004.

T. Groves, Y. Hong, J. Mcmillan, B. Naughton. Autonomy and incentives in Chinese state enterprises. Quarterly Journal of Economics. 1994;109:183-209.

B. Hall, J. Liebman. Are CEOs really paid like bureaucrats? Quarterly Journal of Economics. 1998;113:653-691.

B. Hall, K.J. Murphy. The trouble with stock options. Journal of Economic Perspectives. 2003;17:49-70.

B.H Hamilton, J.A. Nickerson, H. Owan. Team incentives and worker heterogeneity: an empirical analysis of the impact of teams on productivity and participation. Journal of Political Economy. 2003;111:465-497.

E.A. Hanushek. The economics of schooling: production and efficiency in public schools. Journal of Economic Literature. 1986;24:1141-1177.

E.A. Hanushek. School resources. In: Handbook of the Economics of Education. Elsevier; 2006:865-908. (Chapter 14)

E.A. Hanushek. Some US evidence on how the distribution of educational outcomes can be changed. In: Schools and the Equal Opportunity Problem. MIT Press; 2007:159-190. (Chapter 7)

E.A. Hanushek, M.E. Raymond. The effect of school accountability systems on the level and distribution of student achievement. Journal of the European Economic Association. 2004;2:406-445.

G. Harrison, J. List. Field experiments. Journal of Economic Literature. 2004;152:1009-1055.

O. Hart, B. Holmstrom. The theory of contracts. In: T. Bewley, editor. Advances in Economic Theory. Cambridge: Cambridge University Press, 1987.

J.S. Hastings, J.M. Weinstein. Information, school choice, and academic achievement: evidence from two experiments. Quarterly Journal of Economics. 2008;123:1373-1414.

J.A. Hausman, D.A. Wise. Attrition bias in experimental and panel data: the Gary income maintenance experiment. Econometrica. 1979;47:455-473.

J. Hausman, D. Wise, editors. Social Experimentation. Chicago: University of Chicago Press for National Bureau of Economic Research; 1985:1-55.

J.J. Heckman. Randomization and social policy evaluation. In: C.F. Manski, I. Garfinkel, editors. Evaluating Welfare and Training Programs. Cambridge: Harvard University Press, 1992.

Heckman, J.J., Masterov, D., 2005. The Productivity Argument for Investing in Young Children. National Bureau of Economic Research Working Paper Number 13016

J.J. Heckman, P. Siegelman. The urban institute audit studies: their methods and findings. In: M. Fix, R. Struyk, editors. Clear and Convincing Evidence: Measurement of Discrimination in America. Washington, DC: The Urban Institute Press, 1993.

J.J. Heckman, J.A. Smith. Assessing the case for social experiments. Journal of Economic Perspectives. 1995;9:85-110.

J.J. Heckman, S.H. Moon, R. Pinto, P.A. Savelyev, A. Yavitz. The rate of return to the high/scope Perry preschool program. Journal of Public Economics. 2010;94:114-128.

Heckman, J.J., Moon, S.H., Pinto, R., Savelyev, P., Yavitz, A., 2010. Analyzing social experiments as implemented: a reexamination of the evidence from the high scope Perry preschool program. Quantitaive Economics 1 (1) (forthcoming)

G. Hitsch, A. Hortacsu, D. Ariely. Matching and sorting in online dating. American Economic Review. 2010;100:130-163.

J. Hoddinott, L. Haddad. Does female income share influence household expenditures? Evidence from Côte d’Ivoire. Oxford Bulletin of Economics and Statistics. 1995;57:77-96.

B. Holmstrom. Moral hazard in teams. Bell Journal of Economics. 1982;13:324-340.

B. Holmstrom, P. Milgrom. Multitask principal-agent analyses: incentive contracts, asset ownership, and job design. Journal of Law, Economics and Organization. 1991;7:24-52.

R. Homan. The Ethics of Social Research. London: Longman; 1991.

Hossain, T., List, J.A., 2009. The Behavioralist Visits the Factory: Increasing Productivity Using Simple Framing Manipulations. NBER Working Paper 15623

V.J. Hotz. Designing an evaluation of JTPA. In: C.F. Manski, I. Garfinkel, editors. Evaluating Welfare and Training Programs. Cambridge: Harvard University Press, 1992.

C.M. Hoxby. The effects of class size on student achievement: new evidence from natural population variation. Quarterly Journal of Economics. 2000;116:1239-1286.

C.M. Hoxby. Does competition among public schools benefit students and taxpayers? American Economic Review. 2000;90:1209-1238.

Hoxby, C.M., 2000c. Peer Effects in the Classroom: Learning from Gender and Race Variation. National Bureau of Economic Research Working Paper No. 7867

Hoxby, C.M., Muraka, S., 2009. Charter Schools in New York City: Who Enrolls and How They Affect Their Students’ Achievement. National Bureau of Economic Research Working Paper No. w14852

Hoxby, C.M., Weingarth, G., 2006. Taking Race Out of the Equation: School Reassignment and the Structure of Peer Effects. mimeo, Harvard University

C.T. Hsieh, M. Urquiola. The effects of generalized school choice on achievement and stratification: evidence from Chile’s voucher program. Journal of Public Economics. 2006;90(8-9):1477-1503.

C. Ichniowski, G. Prennushi, K. Shaw. The effects of human resource management practices on productivity. American Economic Review. 1997;86:291-313.

Ichniowski, C., Shaw, K., 2008. Insider econometrics: a roadmap to estimating models of organizational performance. In: Gibbons, R., Roberts, J. (Eds.) Handbook of Organizational Economics (forthcoming)

Jacob, B.A., 2002. Accountability, Incentives and Behavior: The Impact of High-Stakes Testing in the Chicago Public Schools. National Bureau of Economic Research Working Paper 8968

B.A. Jacob. Public housing, housing vouchers, and student achievement: evidence from public housing demolitions in Chicago. American Economic Review. 2004;94:233-258.

B.A. Jacob. Accountability, incentives and behavior: the impact of high-stakes testing in the Chicago public schools. Journal of Public Economics. 2005;89:761-796.

D. Jones, T. Kato. The productivity effects of employee stock–ownership plans and bonuses: evidence from Japanese panel data. American Economic Review. 1995;85:391-414.

R. Jowell, P. Prescott-Clarke. Racial discrimination and white-collar workers in Britain. Race. 1970;11:397-417.

C. Juhn, K.M. Murphy, B. Pierce. Wage inequality and the rise in returns to skill. Journal of Political Economy. 1993;101:410-442.

D. Kahneman, A. Tversky. Prospect theory: an analysis of decisions under risk. Econometrica. 1979;47:313-327.

E. Kandel, E. Lazear. Peer pressure and partnerships. Journal of Political Economy. 1992;100:801-813.

R. Katkar, D.H. Reiley. Public versus secret reserve prices in eBay auctions: results from a Pokémon field experiment. Advances in Economic Analysis and Policy. 6, 2006. Article 7

L.F. Katz. Efficiency wage theories: a partial evaluation. NBER Macroeconomics Annual. 1986;1:235-276.

L.F. Katz, J.R. Kling, J.B. Liebman. Moving to opportunity in Boston: early results of a randomized mobility experiment. Quarterly Journal of Economics. 2001;116:607-654.

Keane, M.P., 2010. A structural perspective on the experimentalist school. Journal of Economic Perspectives 24 (2), 47–58 (forthcoming)

M. Keane, K.I. Wolpin. The career decisions of young men. Journal of Political Economy. 1997;105:473-522.

A.N. Kluger, A.S. Denisi. Effects of feedback intervention on performance: a historical review, a meta-analysis, and a preliminary feedback intervention theory. Psychological Bulletin. 1996;199:254-284.

C. Knoeber, W. Thurman. Testing the theory of tournaments: an empirical analysis of broiler production. Journal of Labor Economics. 1994;12:155-179.

M.S. Kramer, S.H. Shapiro. Scientific challenges in the application of randomised trials. Journal of the American Medical Association. 1984;252:2739-2745.

M. Kremer. Randomized evaluations of educational programs in developing countries: some lessons. American Economic Review. 2003;90:102-106.

M. Kremer. Expanding educational opportunity on a budget: lessons from randomized evaluations. In: Improving Education Through Assessment, Innovation, and Evaluation, American Academy of Arts and Sciences: Project on Universal Basic and Secondary Education. Cambridge: MIT Press; 2006.

Kremer, M., Moulin, S., Namunyu, R., 2002. Unbalanced Decentralization. mimeo, Harvard University

M. Kremer, E. Miguel, R. Thornton. Incentives to learn. Review of Economics and Statistics. 2009;91(3):437-456.

D.M. Kreps. The interaction between norms and economic incentives intrinsic motivation and extrinsic incentives. American Economic Review Papers and Proceedings. 1997;87:359-364.

A.B. Krueger. Experimental estimates of education production functions. Quarterly Journal of Economics. 1999;114:497-532.

A.B. Krueger. Economic considerations and class size. Economic Journal. 2003;113:F34-F63.

A.B. Krueger, P. Zhu. Another look at the New York City school voucher experiment. American Behavioral Scientist. 2004;47(5):658-698.

A. Kruglanski. Endogenous attribution and intrinsic motivation. In: D. Greene, M.R. Lepper, editors. The Hidden Costs of Reward. Hillsdale, NJ: Erlbaum Pub., 1978.

H.F. Ladd. Evidence on discrimination in mortgage lending. Journal of Economic Perspectives. 1998;12:41-62.

V. Lavy. Evaluating the effect of teachers’ group performance incentives on pupil achievement. Journal of Political Economy. 2002;110:1286-1317.

V. Lavy. Performance pay and teacher’s effort, productivity, and grading ethics. American Economic Review. 2009;99:1979-2011.

Lavy, V., Paserman, D., Schlosser, A., 2008. Inside the Black Box of Ability Peer Effect: Evidence from Variation of Low Achievers in the Classroom. National Bureau of Economic Research Working Paper No. 14415

E.P. Lazear. Personnel Economics. Cambridge, Mass: MIT Press; 1995.

E.P. Lazear. Pay equality and industrial politics. Journal of Political Economy. 1989;87:1261-1284.

E.P. Lazear. Performance pay and productivity. American Economic Review. 2000;90:1346-1361.

E.P. Lazear. Educational production. Quarterly Journal of Economics. 2001;116:777-803.

E.P. Lazear. Output-based pay: incentives or sorting?. S.W. Polachek, editor. Research in Labor Economics–Accounting for Worker Well-Being. vol. 23. 2005:1-25.

E.P. Lazear, S. Rosen. Rank order tournaments as optimum labor contracts. Journal of Political Economy. 1981;89:841-864.

E.P. Lazear, K.L. Shaw. Personnel economics: the economist’s view of human resources. Journal of Economic Perspectives. 2007;21:91-114.

Lazear, E.P., Malmendier, U., Weberz, R.A., 2009. Sorting and Social Preferences. mimeo, Stanford University

G.E Ledford, E.E. Lawler, S.A. Mohrman. Reward innovations in Fortune 1000 companies. Compensation and Benefits Review. 1995;27:76-80.

Levine, D.K., Pesendorfer, W., 2002. The Evolution of Cooperation Through Imitation. mimeo, UCLA

S.D. Levitt, J.A. List. What do laboratory experiments measuring social preferences reveal about the real world? Journal of Economic Perspectives. 2007;21:153-174.

S.D. Levitt, J.A. List. Viewpoint: on the generalizability of lab behavior to the field. Canadian Journal of Economics. 2007;40:347-370.

S.D. Levitt, J.A. List. Field experiments in economics: the past, the present, and the future. European Economic Review. 2009;53:1-18.

Levitt, S.D., List, J.A., 2010. Was there really a hawthorne effect at the hawthorne plant? an analysis of the original illumination experiments. American Economic Journal: Applied Economics (forthcoming)

Levitt, S.D., List, J.A., Sadoff, S., 2009. Checkmate: exploring backward induction among chess players. American Economic Review (forthcoming)

Levitt, S.D., List, J.A., Sadoff, S., 2010. The Effect of Financial Incentives on High School Achievement: Evidence from Randomized Experiments. Working Paper

E. Ligon. Risk-sharing and information in village economies. Review of Economic Studies. 1998;65:847-864.

J.A. List. Do explicit warnings eliminate the hypothetical bias in elicitation procedures? evidence from field auctions for sportscards. American Economic Review. 2001;91(5):1498-1507.

J.A. List. Preference reversals of a different kind: the ‘more is less’ phenomenon. American Economic Review. 2002;92(5):1636-1643.

J.A. List. Testing neoclassical competitive market theory in the field. Proceedings of the National Academy of Science. 2002;99(24):15827-15830.

J.A. List. Using random th price auctions to value non-market goods and services. Journal of Regulatory Economics. 2003;23:193-205.

J.A. List. Does market experience eliminate market anomalies? Quarterly Journal of Economics. 2003;118:41-71.

J.A. List. Young, selfish and male: field evidence of social preferences. Economic Journal. 2004;114:121-149.

J.A. List. The nature and extent of discrimination in the marketplace: evidence from the field. Quarterly Journal of Economics. 2004;119:49-89.

J.A. List. Neoclassical theory versus prospect theory: evidence from the marketplace. Econometrica. 2004;72:615-625.

J.A. List. Field experiments: a bridge between lab and naturally occurring data. Advances in Economic Analysis and Policy. 6, 2006. Article 8

List, J.A., Livingstone, J., 2010. Exploring the Link between Market Power and the Nature and Magnitude of Discrimination. Working Paper, University of Chicago

J.A. List, M.K. Price. Conspiracies and secret price discounts in the marketplace: evidence from field experiments. Rand Journal of Economics. 2005;36:700-717.

J.A. List, D.R. Reiley. Field experiments. In Steven N. Durlauf, Lawrence E. Blume, editors: The New Palgrave Dictionary of Economics, second ed., Palgrave Macmillan, 2008.

List, J.A., Sadoff, S., Wagner, M., 2010. So You Want to Run an Experiment, Now What? Some Simple Rules of Thumb for Optimal Experimental Design. NBER Working Paper No. 15701

J.A. List, D.L. Millimet, P.G. Fredriksson, W. Warren McHone. Effects of environmental regulations on manufacturing plant births: evidence from a propensity score matching estimator. Review of Economics and Statistics. 2003;85(4):944-952.

Lizzeri, A., Meyer, M., Persico, N., 2002. The Incentive Effects of Interim Performance Evaluations CARESS. Working Paper 02-09

G. Loewenstein. Hot-cold empathy gaps and medical decision making. Health Psychology. 2005;24:49-56.

G. Loewenstein, D. Schkade. Wouldn’t it be Nice? Predicting Future Feelings. In: D. Kahneman, E. Diener, N. Schwartz, editors. Well Being: The Foundations of Hedonic Psychology. Russell Sage Foundation, 1999.

C.O Longnecker, H.P. Sims, D.A. Gioia. Behind the mask: the politics of performance appraisal. The Academy of Management Executive. 1987;1:183-193.

D. Lucking-Reiley. Using field experiments to test equivalence between auction formats: magic on the internet. American Economic Review. 1999;89:1063-1080.

J. Ludwig, D. Miller. Does head start improve children’s life chances? Evidence from a regression discontinuity design. Quarterly Journal of Economics. 2007;122:159-208.

S. Lundberg, R. Pollak. Efficiency in marriage. Review of Economics of the Household. 2003;1:153-167.

J.L. Lusk, J.A. Fox. Value elicitation in laboratory and retail environments. Economics Letters. 2003;79:27-34.

W.B. Macleod, J.M. Malcomson. Implicit contracts, incentive compatibility, and involuntary unemployment. Econometrica. 1989;57:447-480.

J. Malcolmson. Work incentives, hierarchy, and internal labor markets. Journal of Political Economy. 1984;92:486-507.

U. Malmendier, G. Tate. CEO overconfidence and corporate investment. Journal of Finance. 2005;60:2661-2700.

M. Manser. Existing labor market data: current and potential research uses. In: J. Haltiwanger, M. Manser, R. Topel, editors. Labor Statistics Measurement Issues. Chicago: The University of Chicago Press, 1999.

M. Manser, M. Brown. Marriage and household decision-making: a bargaining analysis. International Economic Review. 1980;21:31-44.

Manski, C.F., 1995. Learning About Social Programs from Experiments with Random Assignment of Treatments. University of Wisconsin-Madison: Institute for Research on Poverty, Discussion Paper 1061-95

C.F. Manski, I. Garfinkel. Introduction. In: C.F. Manski, I. Garfinkel, editors. Evaluating Welfare and Training Programs. Cambridge: Harvard University Press, 1992.

A. Mas. Pay, reference points, and police performance. Quarterly Journal of Economics. 2006;121:783-821.

A. Mas, E. Moretti. Peers at work. American Economic Review. 2009;99:112-145.

Maxfield, M., Schirm, S., Rodriguez-Planas, N., 2003. The Quantum Opportunities Program Demonstration: Implementation and Short-Term Impacts. Mathematica Policy Research Report 8279–093

E. Mayo. The Human Problems of an Industrial Civilization. New York: Macmillan; 1933.

M. Mazzocco. Saving, risk sharing, and preferences for risk. American Economic Review. 2004;94:1169-1182.

M. Mazzocco. Household intertemporal behaviour: a collective characterization and a test of commitment. Review of Economic Studies. 2007;74:857-895.

M. McElroy, M. Horney. Nash-bargained decisions: towards a generalization of the theory of demand. International Economic Review. 1981;22:333-349.

H.H. Meyer, E. Kay, J.R. French. Split roles in performance appraisal. Harvard Business Review. 1965:21-29.

B.D. Meyer. Natural and quasi-natural experiments in economics. Journal of Business and Economic Statistics. 1995;13:151-161.

E. Miguel, M. Kremer. Worms: identifying impacts on education and health in the presence of treatment externalities. Econometrica. 2004;72:159-217.

P.R. Milgrom. Employment contracts, influence activities, and efficient organization design. Journal of Political Economy. 1988;96:42-60.

P.R. Milgrom, J. Roberts. The efficiency of equity in organizational decision processes. American Economic Review Papers and Proceedings. 1990;80:154-159.

Moffitt, R.A., 1981. The Negative Income Tax: Would it Discourage Work? Monthly Labor Review

R.A. Moffitt. Econometric Methods for Labor Market Analysis. Handbook of Labor Economics. Elsevier; 1999. (Chapter 24)

B. Moldovanu, A. Sela, X. Shi. Contests for status. Journal of Political Economy. 2007;115:338-363.

E. Moretti. Workers’ education, spillovers and productivity: evidence from plant-level production functions. American Economic Review. 2004;94:656-690.

Muralidharan, K., Sundararaman, V., 2007. Teacher Incentives in Developing Countries: Experimental Evidence from India. mimeo, Harvard University

D. Nagin, J.B. Rebitzer, S. Sanders, L.J. Taylor. Monitoring, motivation, and management: the determinants of opportunistic behavior in a field experiment. American Economic Review. 2002;92:850-873.

H.R. Nalbantian, A. Schotter. Productivity under group incentives: an experimental study. American Economic Review. 1997;87:314-341.

B.J. Nalebuff, J.E. Stiglitz. Prizes and incentives: toward a general theory of compensation and competition. Bell Journal of Economics. 1983;14:21-43.

T. Nechyba. Mobility, targeting and private school vouchers. American Economic Review. 2000;90(1):130-146.

M. Niederle, L. Vesterlund. Do women shy away from competition? Do men compete too much? Quarterly Journal of Economics. 2007;122:1067-1101.

B.A. Olken. Monitoring corruption: evidence from a field experiment in Indonesia. Journal of Political Economy. 2007;115:200-249.

M.T. Orne. On the social psychological experiment: with particular reference to demand characteristics and their implications. American Psychologist. 1962;17:776-783.

G.H. Orcutt, A.G. Orcutt. Incentive and disincentive experimentation for income maintenance policy purposes. American Economic Review. 1968;58:754-773.

P. Oyer, S. Schaefer. Why do some firms give stock options to all employees? An empirical examination of alternative theories. Journal of Financial Economics. 2004;76:99-133.

P. Oyer, S. Schaefer. Personnel economics: hiring and incentives. New Developments and Research on Labor Markets. first ed. O. Ashenfelter, D. Card, editors. Handbook of Labor Economics. vol. 4B. Elsevier; 2011:1769-1823. (Chapter 20)

H.J. Paarsch, B. Shearer. The response of worker effort to piece rates: evidence from the British Columbia tree-planting industry. Journal of Human Resources. 1999:643-667.

H.J. Paarsch, B. Shearer. Piece rates, fixed wages, and incentive effects: statistical evidence from payroll records. International Economic Review. 2000;41:59-92.

J.M. Pahl. The allocation of money and the structuring of inequality within marriage. The Sociological Review. 1983;31:237-262.

P. Peterson, W. Howell, P. Wolf, D. Campbell. School vouchers: results from randomized experiments. In: The Economics of School Choice. University of Chicago Press; 2003:107-144.

J. Pfeffer. Competitive Advantage Through People: Unleashing The Power of The Work Force. Cambridge, Mass: Harvard Business Press; 1996.

E. Phelps. The statistical theory of racism and sexism. American Economic Review. 1972;LXII:659-661.

Pigou, A.C., 1920. The Economics of Welfare

E. Plug, W. Vijverberg. Schooling, family background, and adoption: is it nature or is it nurture. Journal of Political Economy. 2003;111:611-641.

J.M. Podolny, J.N. Baron. Resources and relationships: social networks and mobility in the workplace. American Sociological Review. 1997;62:673-693.

C. Prendergast. The provision of incentives in firms. Journal of Economic Literature. 1999;37:7-63.

Prendergast, C., 2001. Selection and Oversight in the Public Sector with the Los Angeles Police Department as an Example. National Bureau of Economic Research Working Paper No. 8664

C. Prendergast, R.H. Topel. Favoritism in organizations. Journal of Political Economy. 1996;104:958-978.

Puma, M., Burstein, N., Merrell, K., Silverstein, G., 1990. Evaluation of the Food Stamp Employment and Training Program: Final Report, Bethesda, Md. Abt Associates, Bethesda, MD

Punch, M., 1985. The Politics and Ethics of Fieldwork. Sage, London

Raaum, O., Torp, H., 1993. AMO-kurs: Hvem søker, hvem får plass - og hvem får jobbetterpå? Søkelys på arbeidsmarkedet

M. Rabin, J.L. Schrag. First impressions matter: a model of confirmatory bias. Quarterly Journal of Economics. 1999;114:37-82.

M.A. Rangel. Alimony rights and intrahousehold allocation of resources: evidence from Brazil. Economic Journal. 2006;116:627-658.

I. Rasul. Household bargaining over fertility: theory and evidence from Malaysia. Journal of Development Economics. 2008;86:215-241.

Reback, R., 2005. School Accountability and the Distribution of Student Achievement. mimeo, Columbia University

M. Rege, K. Telle. The impact of social approval and framing on cooperation in public good situations. Journal of Public Economics. 2004;88:1625-1644.

P.A. Riach, J. Rich. Field experiments of discrimination in the market place. Economic Journal. 2002;112:F480-F518.

P.A. Riach, J. Rich. An experimental investigation of sexual discrimination in hiring in the English labor market. B. E. Journal of Economic Analysis & Policy. 6, 2006. Advances Article 1

B. Rice. The Hawthorne effect: persistence of a flawed theory. Psychology Today. 1982;16:71-74.

S.G. Rivkin, E.A. Hanushek, J.F. Kain. Teachers, schools, and academic achievement. Econometrica. 2005;73:417-459.

Robinson, J., 2008. Limited Insurance Within the Household: Evidence from a Field Experiment in Western Kenya. mimeom, UC Santa Cruz

J. Rockoff. Field experiments in class size from the early twentieth century. Journal of Economic Perspectives. 2009;23:211-230.

F.J. Roethlisberger, W. Dickson. Management and the Worker. Cambridge: Harvard University Press; 1939.

S. Rosen. Authority, control, and the distribution of earnings. Bell Journal of Economics. 1982;13:311-323.

S. Rosen. The theory of equalizing differences. In: Orley Ashenfelter, Richard Layard, editors. Handbook of Labor Economics, vol. 1. Amsterdam: North-Holland; 1986.

P. Rosenbaum, D. Rubin. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70:41-55.

M.R Rosenzweig, K.I. Wolpin. Natural ‘natural experiments’ in economics. Journal of Economic Literature. 2000;38:827-874.

Ross, H.L., 1970. An Experimental Study of the Negative Income Tax, Child Welfare, December

J.J. Rotemberg. Human relations in the workplace. Journal of Political Economy. 1994;102:684-717.

C. Rouse. Private school vouchers and student achievement: an evaluation of the Milwaukee parental choice program. Quarterly Journal of Economics. 1998;133(2):553-602.

D. Roy. Quota Restriction and Goldbricking in a Machine Shop. American Journal of Sociology. 1952;57:427-442.

A. Rozan, A. Strenger, M. Willinger. Willingness-to-pay for food safety: an experimental investigation of quality certification on bidding behavior. European Review of Agricultural Economics. 2004;31:409-425.

D.B. Rubin. Comment: Neyman (1923) and causal inference in experiments and observational studies. Statistical Science. 1990;5:472-480.

P.A. Samuelson. Social indifference curves. Quarterly Journal of Economics. 1956;70:1-22.

W. Samuelson, R. Zeckhauser. Status quo bias in decision making. Journal of Risk and Uncertainty. 1988;1:7-59.

R. Sausgruber. A note on peer effects between teams. Experimental Economics. 2009;12:193-201.

T.P. Schultz. School subsidies for the poor: evaluating the Mexican Progresa poverty program. Journal of Development Economics. 2004;74:199-250.

Seabright, P., 2002. Blood, Bribes, and the Crowding-out of Altruism by Financial Incentives. mimeo, Toulouse University

A. Sen. Development as Freedom. New York: Knopf; 1999.

Sethi, R., Somanathan, E., 1999. Preference Evolution and Reciprocity. mimeo, University of Michigan

C. Shapiro, J.E. Stiglitz. Equilibrium unemployment as a worker discipline device. American Economic Review. 1984;74:433-444.

B.S. Shearer. Piece rates, fixed wages and incentives: evidence from a field experiment. Review of Economic Studies. 2004;71:513-534.

P. Slovic, S. Lichtenstein. Comparison of Bayesian and regression approaches to the study of information processing in judgment. Organizational Behavior and Human Performance. 1971;6:649-744.

Snow, C.E., 1927. Research on Industrial Illumination. The Tech Engineering News November, 257–282

J. Sobel, I. Takahashi. A multi-stage model of bargaining. Review of Economic Studies. 1983;50:411-426.

A.R. Soetevent. Anonymity in giving in a natural context–a field experiment in 30 churches. Journal of Public Economics. 2005;89:2301-2323.

J. Splawa-Neyman. On the application of probability theory to agricultural experiments. Essay on principles, Section 9. Statistical Science. 1923;5:465-472. Translated and edited bv D. M. Dabrowska and T. P. Speed from the Polish original, which appeared in Roczniki Nauk Rolniczyc, Tom X (1923): 1-51 (Annals of Agricultural Sciences)

J. Splawa-Neyman. Contributions of the theory of small samples drawn from a finite population. Biometrika. 1923;17:472-479. The note on this republication reads, These results with others were originally published in La Revue Mensuelle de Statistique, Publ. Parl’office Central de Statistique de la Republique Polonaise, Tom. vi. pp. 1–29, 1923

M. Stanley. College education and the mid-century GI bills. Quarterly Journal of Economics. 2003;118:671-708.

P. Stafford. Forestalling the demise of empirical economics: the role of microdata in labor economics research. O. Ashenfelter, R. Layard, editors. Handbook of Labor Economics. Elsevier; 1986. (Chapter 7)

B. Stevenson, J. Wolfers. Marriage and divorce: changes and their driving forces. Journal of Economic Perspectives. 2007;21:27-52.

J.E. Stiglitz. Incentives, risk, and information: notes towards a theory of hierarchy. Bell Journal of Economics. 1975;6:552-579.

D. Street. Fisher’s contributions to agricultural statistics. Biometrics. 1990;46:937-945.

C. Taber, B.A. Weinberg. Labour economics (new perspectives). In Steven N. Durlauf, Lawrence E. Blume, editors: The New Palgrave Dictionary of Economics, second ed., Palgrave Macmillan, 2008.

R. Thaler. Toward a positive theory of consumer choice. Journal of Economic Behavior and Organization. 1980;1:39-60.

D.A. Thomas. The impact of race on managers’ experiences of developmental relationships. Journal of Organizational Behavior. 1990;11:479-492.

D.A. Thomas. Like father, like son or like mother, like daughter: parental education and child health. Journal of Human Resources. 1994;29:950-989.

Thorndike, E.L., 1913. Educational Psychology. Oxford: Columbia University

P. Todd, K.I. Wolpin. Assessing the impact of a school subsidy program in Mexico: using a social experiment to validate a dynamic behavioral model of child schooling and fertility. American Economic Review. 2006;96:1384-1417.

A.S. Tsui, C.A. O’Reilly. Beyond simple demographic effects: the importance of relational demography in superior-subordinate dyads. Academy of Management Journal. 1989;32:402-423.

C. Udry. Risk and insurance in a rural credit market: an empirical investigation in northern Nigeria. Review of Economic Studies. 1994;61:495-526.

Ulph, D.T., 1988. A General Non-cooperative Nash Model of Household Consumption Behaviour. University of Bristol Working Paper 88/205

D. Vandegrift, A. Yavas, P.M. Brown. Incentive effects and overcrowding in tournaments: an experimental analysis. Experimental Economics. 2007;10:345-368.

E. Van Den Steen. Rational overoptimism (and other biases). American Economic Review. 2004;94:1141-1151.

T. Veblen. The Theory of the Leisure Class: An Economic Study of Institutions. New York: Modern Library; 1934.

J. Vollmann, R. Winau. Informed consent in human experimentation before the Nuremberg code. British Medical Journal. 1996;313:1445-1449.

R. Vollmeyer, F. Rheinberg. A surprising effect of feedback on learning. Learning and Instruction. 2005;15:589-602.

R.A. Weber. Managing growth to achieve efficient coordination in large groups. American Economic Review. 2006;96:114-126.

D. Weichselbaumer. Sexual orientation discrimination in hiring. Labour Economics. 2003;10:629-642.

M.A. Wesolowski, K.W. Mossholder. Relational demography in supervisor-subordinate dyads: impact on subordinate job satisfaction, burnout, and perceived procedural justice. Journal of Organizational Behavior. 1997;18:351-362.

K.T. Williams, C.A. O’Reilly. Demography and diversity in organizations: a review of 40 years of research. Research in Organizational Behavior. 1998;20:77-140.

White, M., Lakey, J., 1992. Restart effect: does active labour market policy reduce unemployment? Policy Studies Institute

S. Wuchty, B.E. Jones, B. Uzzi. The increasing dominance of teams in production of knowledge. Science. 2007;316:1036-1039.

J. Yinger. Evidence on discrimination in consumer markets. Journal of Economic Perspectives. 1998;XII:23-40.

D.J. Zimmerman. Peer effects in academic outcomes: evidence from a natural experiment. Review of Economics and Statistics. 2003;85:9-23.

¹ Tel.: +1-773-702-9811; fax: +1-773-702-8490.

² Tel: +44-207-679-5853; fax: +44-207-916-2775.

³ In this Handbook, Charness and Kuhn (2011, 2010) provide a useful discussion of extant laboratory studies in the area of labor economics.

⁴ For a more complete discussion see List and Reiley (2008).

⁵ Frederick Taylor’s seminal book, The Principles of Scientific Management, published in 1911, which creatively considered techniques to shorten task time, was also an important stimulus for the Industrial Psychology field.

⁶ A relay was a switching device activated in the telephone exchange as each number was dialed, and was a fairly mind-numbing task: assemble a coil, armature, contact springs, and insulators by fastening them to a fixture with four screws. On average, it was roughly one minute’s worth of work.

⁷ There are many other evaluations as well. For example, a controversial article written by Bramel and Friend (1981), heavily laced with Marxist ideology, takes a conspiratorial view of industrial psychologists and argues that the Hawthorne effect is simply the result of “capitalist bias among modern industrial psychologists.”

⁸ Derivative of this path-breaking experiment were two experiments run alongside the relay experiment. Both were started in August of 1928; one was a second relay experiment, the other a mica splitting experiment. In the second relay experiment, five women workers were subjected to variations in a small group incentive program from August 1928 to March 1929. In the mica splitting experiment, the researchers began by secretly monitoring the output of five women at their regular department workstations. Their job was to split, measure, and trim mica chips that were to be used for insulation. After observing the workers secretly, they moved the women to a special test room where, unlike their cohorts, they received 10-minute rest breaks at 9:30 a.m. and 2:30 p.m.

⁹ The success of the relay assembly experiments led to in-depth surveys (from 1928-1931) and one final experiment in the Hawthorne plant—the “bank wiring” experiment, designed by Mayo and others from 1931-1932. The researchers began by examining the productivity of 14 men who assembled telephone terminals. They then moved these men to a special test room, without introducing any other changes in work or pay conditions. Despite the move to a separate experimental setting, the men’s output did not increase.

¹⁰ This, and the subsequent subsections, draw from Harrison and List (2004), List (2006), and Levitt and List (2009). There are many definitions of social experiments in the economics literature. Ferber and Hirsch (1982, p. 7) define a social experiment in economics as “... a publicly funded study that incorporates a rigorous statistical design and whose experimental aspects are applied over a period of time to one or more segments of a human population, with the aim of evaluating the aggregate economic and social effects of the experimental treatments.” Greenberg and Shroder (2004) define a social experiment as having at least the following four features: (i) random assignment, (ii) policy intervention, (iii) follow-up data collection, and (iv) evaluation.

¹¹ As the Editors pointed out, the basic idea of a negative income tax was a part of the liberal party platform in the 1940s, and it is usually argued that it was designed by Juliet Rhys-Williams, an amazing advocate for women in that period.

¹² The negative income tax rate works as follows. Assume that John is randomly inserted into the 100% guaranteed income ($3300), 50% negative tax rate treatment. What this means is that when the policy binds, for each $1 that John’s family earns on its own, they receive $0.50 less in federal benefits. Thus, if John’s family earns $2000 in year one, they would receive $1000 less in program benefits, or $2300, resulting in a total income of $4300. In this case, if in any year John’s family earns $6600 or more, program benefits are zero.

¹³ We emphasize large scale because there were a handful of other social experiments—such as the Perry Preschool Project begun in 1962—that preceded the New Jersey Income Maintenance experiment (Greenberg et al., 1999), and that are still being evaluated today (Heckman et al., forthcoming). A prevalent type of social experimentation in recent years is the paired-audit experiments to identify and measure discrimination. These involve the use of “matched pairs” of individuals, who are made to look as much alike as possible apart from the protected characteristics. These pairs then confront the target subjects, which are employers, landlords, mortgage loan officers, or car salesmen. The majority of audit studies conducted to date have been in the fields of employment discrimination and housing discrimination (Riach and Rich, 2002).

¹⁴ The original negative income tax experiment led to three other early experiments on income maintenance, which drew samples from rural areas of North Carolina and Iowa (1970-72); Seattle and Denver (1970-78); and Gary, Indiana (1971-74). These experiments went beyond studying urban husband-wife couples that were studied in the New Jersey income maintenance experiment. For instance, the North Carolina/Iowa study was conducted by the Institute of Research on Poverty to explore behavior among the rural poor. Only one and two parent black households were studied in the Gary, IN test. The Seattle-Denver study represented the most comprehensive, including blacks, Chicanos, and whites who had either one or two parents in the household. By and large, the evidence gathered in these studies reinforced the main result in the New Jersey study, but these new studies highlighted additional insights that were important for policy making, such as in differences between male and female labor force participation, unemployment duration, and welfare participation.

¹⁵ An early social experiment in Europe was the study of Intensified Employment Services in Eskilstuna, Sweden. In 1975, a small-town employment office received a personnel reinforcement for three months and split a group of 410 unemployed job seekers who had been registered at the office for at least three months into a treatment group () and a control group (). The control group received normal service and used the services of the office for an average of 1.5 hours over the course of the experiment, while the treatment group used office services for an average of 7.5 hours, allowing office personnel to work more intensely on the individual problems of the treatment subjects. The findings were that the percent of workers with a job at the end of the experiment, unemployment spells during the experiment, and earnings were all favorably influenced by the employment services studied. A discussion of this study, as well as other European social experiments in labor market policy can be found in Bjorklund and Regner (1996) and the various Digests of Social Experiments due to Greenberg, and Shroder. Two of the more famous examples are the Norwegian Training Experiment (Raaum and Torp, 1993) and the Restart Programme in the United Kingdom (White and Lakey, 1992).

¹⁶ For example, whereas over 80% of social experiments from 1962-74 tested new programs, since 1983 only roughly 33% did so (Greenberg et al., 1999).

¹⁷ There is a growing body of evidence from laboratory settings on how individuals self-select into treatments when allowed to do so. See Lazear et al. (2009) for a recent such study, and the discussion in Charness and Kuhn (2011, 2010).

¹⁸ Problems of attrition are well known and detailed discussions can be found in Hausman and Wise (1979) and the various chapters in Manski and Garfinkel (1992).

¹⁹ Note that the development field experiments that have arisen recently often have to confront this issue directly when making inference from their studies—even though subjects might not know that they are randomized, a survey is used to measure the outcomes so repeated interactions are a certainty. One paper that attempts to quantify the effects is due to Gine et al. (2007). In a similar spirit, Muralidharan and Sundararaman (2007) present evidence from a randomized control trial on educational interventions in India. They also present evidence to distinguish the effects of the intervention from the mere effects of being part of an observational study per se.

²⁰ In this sense, field experiments parallel the research approach that exploits “natural experiments” (Meyer, 1995; Rosenzweig and Wolpin, 2000; Angrist and Krueger, 2001), the difference being that in a field experiment the researcher actually controls the randomization herself, whereas in the natural experiment approach the researcher attempts to find sources of variation in existing data that are “as good as randomly assigned.” In addition, the close involvement of the researcher from the outset allows for primary data collection to perhaps directly help shed light on the underlying mechanisms driving causal effects.

²¹ Several aspects of the approach are not discussed in this discussion. For example, for these conditions to hold the appropriate conditioning set, , should be multi-dimensional. Second, upon estimation of the propensity score, a matching algorithm must be defined in order to estimate the missing counterfactual, , for each treated observation. The average treatment effect on the treated () is given by,

where the outer expectation is over the distribution of . These and other issues are discussed in List et al. (2003).

²² Harrison and List (2004) discuss in detail whether student subjects exhibit different behaviors in laboratory environments that individuals drawn from other subject pools. A parallel trend in laboratory settings has been the use of “real-effort” experiments, as discussed in Charness and Kuhn (2011, 2010).

²³ Of course, this is just a select sampling of the work of this sort, for a more comprehensive list please see www.fieldexperiments.com.

²⁴ Relatedly, there is a recent but steadily expanding literature in statistics and economics on how experimental evidence on treatment effect heterogeneity may be used to maximize gains from social programs. One example is Bhattacharya and Dupas (2010) who study the problem of allocating a binary treatment among a target population based on observables, to maximize the mean social welfare arising from an eventual outcome distribution, when a budget constraint limits what fraction of the population can be treated.

²⁵ This is especially so if we compare field experiments to laboratory experiments that utilize student subject pools. Even by changing the subject pool slightly, as in artefactual field experiments, replicability becomes an issue as more still needs to be understood on the self-selection into experiments of such non-standard subjects (Charness and Kuhn, 2011, 2010).

²⁶ More detailed discussions of how the study of labor economics has evolved over time can be found in Freeman (1987) and Taber and Weinberg (2008).

²⁷ To understand the magnitude of this change, we note that Stafford (1986) finds that among the 759 papers published in six leading journals between 1965 and 1983, virtually none was based on microdata with individual firms or establishments as the unit of analysis.

²⁸ The numbers do not include papers and proceedings volumes.

²⁹ Earlier reviews of trends in published papers in labor economics include Stafford (1986), Manser (1999), and Moffitt (1999).

³⁰ The total number of papers reported in Table 2 is not quite reflected in the totals recorded in Table 3. This is because in Table 3 we sometimes record a paper in more than one column if it utilizes a range of empirical techniques. For example, the total number of non-theory papers by subfield and method in Table 3 is greater than total non-theory papers found in Table 2 (224 > 219) because five papers used multiple methods and so were counted twice.

³¹ For related reviews of the literature, see the excellent work of Card (1999), and in this Handbook, the chapter by Fryer (2011).

³² Rockoff (2009) presents an overview of a substantial, but overlooked, body of field experiments class size that developed prior to World War II.

³³ There is evidence that the first five years of life are critical for lifelong development. Hence resource poor or un-stimulating environments early in life are likely to detrimentally impact children’s cognitive, motor, social-emotional development, and their health status (Grantham-McGregor et al., 1991; Heckman and Masterov, 2005; Engle et al., 2007; Grantham-McGregor et al., 2007). As adults, they are more likely to have high fertility rates and are less likely to provide adequate stimulation and resources for their own children, thus contributing to the intergenerational transmission of poverty and economic inequality (Sen, 1999). The current debate, to which social experiments have contributed, focuses on understanding the types of intervention that might be effective for the child and their families, and cost-effective from society’s viewpoint.

³⁴ Several theoretical papers suggest that school vouchers will lead to overall welfare gains, increased stratification, and efficiency gains (Epple and Romano, 1998; Ferreyra, 2007; Nechyba, 2000; Rouse, 1998; Figlio and Rouse, 2006; Hsieh and Urquiola, 2006; Epple et al., 2006; Arcidiacono, 2005).

³⁵ A comprehensive summary of the regression-based literature on discrimination are contained in Altonji and Blank (1999) and Yinger (1998).

³⁶ Weekly earnings figures are taken from the Current Population Survey. They are for all employed people over age 25 that reported weekly earnings above zero. Data before 1979 is taken from the May supplement of the CPS. After 1979 data is taken from the CPS Annual Earnings File. Earnings from the May supplement for 1969-1972 were reported in ranges. The midpoint of each range was assumed to be the actual earnings for each individual.

³⁷ As far as the law is concerned, both types of discrimination—taste based and statistical—are illegal. For example, in credit markets, the Equal Credit Opportunity Act (Sec. 701, as amended in March 1976) states that it “shall be unlawful for any creditor to discriminate against any applicant, with respect to any aspect of the credit transaction…on the basis of race, color, religion, national origin, sex or martial status, or age…”. The law implies that while it is allowed to differentiate among customers based on characteristics of the customer (e.g., credit history) or the product that are linked to the expected return of the transaction, it is illegal to use the customer’s membership in a group to distinguish among customers. In other words, firms should make decisions about the customer as if they had no information regarding the customer’s race, sex, etc. This, for example, is true regardless of whether race is or is not a good proxy for risk factors in the credit market (Ladd, 1998).

³⁸ The interested reader should also see the recent special Symposium issue on Discrimination in Product, Credit, and Labor Markets that appeared in the Journal of Economic Perspectives Spring (1998).

³⁹ The field experiment approach shares many of the characteristics of the insider econometrics approach to understand the causes and consequences of behavior within a firm (Ichniowski and Shaw, forthcoming). However a key distinction is that field experiments explicitly rely on exogenous variation created with the specific influence of researchers in order to identify causal effects. Clearly, not every intervention that a researcher could design and implement is socially useful—there is little value added in implementing practices that firms are never otherwise observed engaging in. However, this does not preclude the fact that carefully designed interventions can help researchers to uncover causal relations and the mechanisms behind them.

⁴⁰ Our discussion focuses predominantly on natural field experiments within firms. There also exists a separate branch of artefactual field experiments where subject pools are drawn from manufacturing workers (Barr and Serneels, 2009), fishermen (Carpenter and Seki, 2010) and employees in large firms (Charness and Villeval, 2009).

⁴¹ Many of the wider literature related to the research questions we touch upon, such as incentive pay and teams, are discussed in greater detail in the Chapter on Human Resource Management by Bloom and Van Reenen (2011), also in this Handbook. They summarize the evidence from across countries showing the increasing use of performance pay over time. In the Chapter on Personnel Economics in this Volume by Oyer and Schaefer (2011), further issues related to incentive pay and firm hires is discussed at greater length.

⁴² Due to these empirical challenges, it is not surprising that much of the early evidence testing theories in personnel economics originated from laboratory environments. For example, Bull et al. (1987) provide evidence from the lab on the predictions of rank order tournament theory; Fehr and Fischbacher (2002) review the experimental evidence on social preferences in workplace environments. The wider availability of personnel data and ever closer links being forged between researchers and firms has allowed the literature in field experiments within firms to flourish.

⁴³ Charness and Kuhn (2011, 2010) review the extensive evidence from laboratory settings on sabotage.

⁴⁴ There is a long-standing idea in psychology that rewards may hinder performance (Kruglanski, 1978). There is some evidence on this from laboratory settings where offering small amounts of monetary compensation is found to decrease effort relative to paying nothing (Gneezy and Rustichini, 2000), and where explicit incentives sometimes result in worse compliance than incomplete labor contracts (Fehr and Falk, 1999; Fehr and Schmidt, 2000). This might either be because small monetary incentives crowd out intrinsic motivation, an idea formalized by Benabou and Tirole (2000), or because the individual is reluctant to signal his willingness to accept low wages. We do not know of field evidence that examines such non-monotonic effects of monetary incentives on effort.

⁴⁵ Firms typically provide workers some insurance by allowing their output to occasionally fall below the required minimum , but a worker that consistently fails to meet this performance threshold is likely to be fired or assigned to another task.

⁴⁶ Whether workers exert more or less effort in response to a higher piece rate of course depends on the balance of income and substitution effects. Evidence from the lab and field in Gneezy and Rustichini (2000) suggested that the relationship between piece rates and effort was U-shaped with low piece rates eliciting less effort than a zero piece rate. One explanation would be that small levels of financial compensation crowd out workers’ intrinsic motivation to exert effort.

⁴⁷ Laboratory experiments have begun to explore in more detail the selection effects of incentives (Charness and Kuhn (2011, 2010).

⁴⁸ This links together with recent development in structural estimation of search and matching models in labor markets. For example, Cahuc et al. (2006) develop and estimate an equilibrium model with strategic wage bargaining and on-the-job search. An important innovation on their paper is that when an employed worker receives an outside job offer, a three-player bargaining process is started between the worker, her/his initial employer and the employer which made the outside offer. They use the model to examine wage determination in France using matched employer-employee data from 1993 to 2000. They find that inter-firm competition is quantitatively important for wage determination, and raising wages above reservation levels.

⁴⁹ A related concern has been on the existence of rachet effects in response to pay for performance (Gibbons, 1987), whereby workers deliberately underperform to keep the piece rate high. Such ratchet concerns have been documented in firms where productivity shocks are uncommon such as shoe making (Freeman and Kleiner, 2005) and bricklaying (Roy, 1952). Cooper et al. (1999) present evidence from an artefactual field experiment on Chinese students and managers on such ratchet concerns, that might be of particular concern in planned economies.

⁵⁰ The small sample size used in Shearer (2004) also reflects the nature of tree planting firms. They typically employ less than 100 planters.

⁵¹ This list is not meant to be exhaustive. We focus on these because field experiments have provided insights on these margins to a greater extent than for other types of non-monetary incentive such as those discussed in Francois (2000), Dixit (2002), Prendergast (2001), Benabou and Tirole (2003), Seabright (2002), Delfgaauw and Dur (2004), Akerlof and Kranton (2005), and Besley and Ghatak (2005).

⁵² There are field experiments on charitable giving that have exogenously varied the visibility of donations to assess whether such status concerns or prestige motives drive giving behavior Soetevent (2005), something that has been found to be the case in laboratory settings of public goods games (Andreoni and Petrie, 2004; Rege and Telle, 2004). Echoing some of the results below on gift-exchange in the field and the lab, Soetevent (2005) finds evidence that for some times of charitable cause, contributions increase when they can be socially recognized, but that this effect diminishes over time.

⁵³ The organizational behavior and psychology literatures have also emphasized the signaling effects of feedback, as well as other related comparative statics such as how individuals change strategies in response to feedback (Vollmeyer and Rheinberg, 2005), and the specific type of information that should be conveyed in feedback (Butler, 1987; Cameron and Pierce, 1994).

⁵⁴ Two strands of the economics literature have explored aspects of the signaling effect of feedback. The first strand focuses on whether individuals update their priors in response to feedback consistent with Bayes’ rule (Slovic and Lichtenstein, 1971). The second strand focuses on whether agents react more to positive than negative feedback because of self serving biases such as confirmatory bias (Rabin and Schrag, 1999), or overconfidence (Malmendier and Tate, 2005; Van Den, 2004).

⁵⁵ On the heterogeneous effects of feedback, the meta-analysis of Kluger and Denisi (1996) covering 131 studies in psychology with 13,000 subjects finds that two thirds of studies report positive feedback effects. On the optimal provision of feedback, when the agent knows her ability so that there is no indirect signaling effect of feedback, whether feedback should be optimally provided or not is sensitive to the specification of the agent’s cost of effort function (Lizzeri et al., 2002; Aoyagi, 2007). More general results have been derived when agents learn their ability through feedback and ability is complementary to effort (Ederer, 2008).

⁵⁶ A separate branch of the literature has focused on the strategic manipulation of feedback by the principal (Malcolmson, 1984; Gibbs, 1991; Aoyagi, 2007), of which there is anecdotal evidence from the field (Longnecker et al., 1987) and laboratory (Ederer and Fehr, 2007). Evidence from field experiments on feedback remains scarce.

⁵⁷ Social preferences can be thought of as a reduced form representation of a number of models. They depict behavior consistent with reciprocity or altruism (Fehr and Schmidt, 1999), or the evolutionary equilibrium of a repeated Prisoner’s Dilemma game in which workers learn which strategies to play (Levine and Pesendorfer, 2002; Sethi and Somanathan, 1999). In the field experiment reported in Bandiera et al. (2005), they attempt to distinguish between models in which workers’ preferences display altruism towards others, and models in which workers behave as if they are altruistic because, for instance, they play trigger strategies to enforce implicit collusive agreements.

⁵⁸ The model would be complicated if there were also knowledge spillovers such that effort exerted by worker reduced the cost of effort of worker . While such knowledge spillovers have been found in workplace settings (Moretti, 2004; Ichniowski et al., 1997) we abstract from them here.

⁵⁹ Hence this empirical strategy is informed by the evidence that individual “styles” of managers affect firm performance over and above firm level characteristics themselves (Bertrand and Schoar, 2003; Malmendier and Tate, 2005).

⁶⁰ Bandiera et al. study the behavior of nearly all the workers in the firm for each field experiment. However, given the experiment takes place in one firm, to avoid contamination effects across treated and control groups, all workers were simultaneously shifted from one incentive scheme to the other. In contrast, Shearer (2004) exogenously varied the incentive scheme workers were in on each day. In non-experimental studies such as Lazear (2000) on individual pay and Hamilton et al. (2003) on team pay, workers might have had some say on which compensation scheme they would be paid under.

⁶¹ See Lazear and Rosen (1981), Green and Stokey (1983) and Nalebuff and Stiglitz (1983). Relative performance evaluation may also be preferred to piece rates as it lowers informational rents to high types (Bhaskar, 2002), and reduces incentives of workers to exert effort in influence activity (Milgrom, 1988).

⁶² The results from this natural field experiment has implications for environments outside the workplace. For example, the provision of teacher incentives based on the average performance of students may have important consequences for the distribution of test scores among students, and the composition of students, and possibly teachers, admitted into schools. For example, Burgess et al. (2005) find that the introduction of school accountability based on test pass rates improved the performance of students in the middle of the ability distribution, at the expense of both high achieving and low achieving students. Similarly, Hanushek and Raymond (2004) and Reback (2005) provide evidence on the distributional consequences on student achievement under the No Child Left Behind policy. Finally, Jacob (2002) and Figlio and Getzler (2002) provide evidence on the selection effect. They show that the introduction of accountability schemes lead to an increase in grade retention and special educational placement in Chicago and Florida public schools, respectively.

⁶³ Residual, or within-group wage inequality, is a sizeable contributor of the growth in overall wage inequality in the US. This has been argued to have increased throughout the 1970s and 1980s (Juhn et al., 1993), and into the 1990s (Acemoglu, 2002; Autor et al., 2005).

⁶⁴ Both the positive and negative effects of social connections have been stressed in the organizational behavior and sociology literatures. Examples of such work includes that on the effect of manager-subordinate similarity on subjective outcomes such as performance evaluations, role ambiguity, and job satisfaction (Tsui and O’Reilly, 1989; Thomas, 1990; Wesolowski and Mossholder, 1997), and on how social networks within the firm influence within firm promotions (Podolny and Baron, 1997).

⁶⁵ Lazear (1989), Kandel and Lazear (1992), and Rotemberg (1994) develop models incorporating social concerns into the analysis of behavior within firms. While they emphasize that individuals have social concerns for others at the same tier of the firm hierarchy, their analysis is equally applicable across tiers of the hierarchy. Bewley (1999) offers extensive evidence from interviews with managers arguing that concerns over fair outcomes for workers and the morale of employees are important determinants of their behavior.

⁶⁶ More than 70% of major US firms use some form of team based rewards (Ledford et al., 1995). Lazear and Shaw (2007) cite evidence that between 1987 and 1996, the share of large firms that have more than a fifth of their employees in problem solving teams rose from 37 to 66%. The percentage of large firms with workers in self-managed teams rose from 27 to 78% over the same period. In academia, Wuchty et al. (2007) document the increased use of team production in research across disciplines.

⁶⁷ There is only a small literature on selection into teams in laboratory settings (Weber, 2006; Charness and Yang, 2008), although there is a far more extensive lab-based literature on team production, as reviewed in Charness and Kuhn (2011, 2010).

⁶⁸ The empirical literature on tournament theory comprises two distinct branches. The first tests whether a particular compensation scheme has a tournament structure. Two specific predictions have been explored—(i) the wage spread should be positively related to the number of workers at the lower job level; (ii) the wage structure should be convex as in Rosen (1986). These tests typically use data from the market for CEOs (Knoeber and Thurman, 1994; Eriksson, 1999; Bognanno, 2001), or sports (Ehrenberg and Bognanno, 1990). There are few existing field studies—on either individuals or teams—exploring tournament incentives to other incentive schemes such as piece rates or feedback.

⁶⁹ Evidence from the laboratory has tended to focus on feedback to individuals (Freeman and Gelber, 2008). One exception is Sausgruber (2009) who provides experimental evidence on the effects on team performance when told about the performance of one other team, holding team composition constant.

⁷⁰ In line with this, Rotemberg (1994) develops a model showing how altruism between co-workers may endogenously form in the workplace to facilitate cooperation among workers engaged in team production. Empirically, Hamilton et al. (2003) provide non-experimental evidence from the introduction of team production in a garment firm. They find the most able workers sorted first into teams despite a loss in earnings in many cases, suggesting non-pecuniary benefits associated with teamwork.

⁷¹ In all such experiments, it is important to design the set-up to be able to distinguish gift-exchange from the alternative explanation of why there should be a positive relationship between wages and effort—efficiency wages. This hypothesis postulates employers pay above market-clearing wages to motivate workers to increase their effort level so as to avoid being fired, which reduces employer monitoring (Katz, 1986). Hence in both field experiments subjects were made aware that this was a one-time recruitment opportunity.

⁷² Clearly this debate on the existence of positive reciprocity in the field remains in need of further study. Charness and Kuhn (2011, 2010) describe more of the related evidence from the laboratory and other field settings.

⁷³ Olken (2007) presents evidence from a natural field experiment on the effects of top-down monitoring relative to grassroots participation on reducing corruption on road projects organized by village committees in Indonesia. Top-down monitoring via government audits is found to be the far more effective means of reducing corruption.

⁷⁴ A similar set of issues arise for field experiments in public economics. In particular, understanding why individuals give to fundraisers or charitable causes. Large scale field experiments have so far focused on how to induce members of the public or those with affinity to the fundraising organization to give. However a disproportionate amount of funds raised come from a few very wealthy donors. No field experiments have been run on them.

⁷⁵ Tests based on demand patterns exploit the fact that utility maximization by a single consumer subject to a linear budget constraint implies Slutsky symmetry, namely the restriction of symmetry on the matrix of compensated price responses. This prediction is typically rejected in household data (Deaton, 1990; Browning and Meghir, 1991; Banks et al., 1997; Browning and Chiappori, 1998). Browning and Chiappori (1998) derive the counterpart to the Slutsky matrix for multi-member households solely under the assumption of efficient within-household decision making, consistent with Nash bargaining models. They show the assumption of efficiency generates testable restrictions on household demand functions, and distinguish the collective model from both the unitary and the entirely unrestricted case.

⁷⁶ As in any framed field experiment or laboratory experiment, subjects need to be recruited. Framed field experiments that aim to replicate natural settings—say by working in conjunction with local organizations—might provide data from which to assess whether participants differ from those that choose not to participate. As discussed earlier, the nature of self-selection into experiments is a phenomenon that is only beginning to be understood (Lazear et al., 2009). Equally important, given the relatively small sample sizes inherent to many framed field experiments, it is crucial to be clear on how large a sample would be required to detect statistically different observable characteristics between participants and non-participants.

⁷⁷ This strand of field experiments is growing for example, Robinson (2008) presents evidence from a framed field experiment on 142 households in Kenya to test whether intra-household risk sharing arrangements are efficient, and if not, whether limited commitment caused by contractual incompleteness partially explains behavior.

⁷⁸ Stevenson and Wolfers (2007) provide a recent overview of the most pressing issues that are being addressed in research in the economics of the family.

*We gratefully acknowledge financial support from ELSE. We thank the editors, Orley Ashenfelter and David Card for comments. We thank Alec Brandon, David Herberich, Dana Krueger, Richard Murphy, Yana Peysakhovich and László Sándor for excellent research assistance. All errors remain our own.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for Handbook of Labor Economics

Create new playlist

Sign In

Sign Up