Economic analysis of law seeks to answer two basic questions about legal rules. Namely, what are the effects of legal rules on the behavior of relevant actors? And are these effects of legal rules socially desirable? In answering these positive and normative questions, the approach employed in economic analysis of law is that used in economic analysis generally: the behavior of individuals and firms is described assuming that they are forward looking and rational, and the frame work of welfare economics is adopted to assess social desirability.
The field of economic analysis of law may be said to have begun with Bentham (1789, 1827,1830), who systematically examined how actors would behave in the face of legal incentives and who evaluated outcomes with respect to a clearly stated measure of social welfare (utilitarianism).Bentham’s writings contain significant and extended analysis of criminal law and law enforcement, some analysis of property law, and a substantial treatment of the legal process. His work was left essentially undeveloped until the 1960s and early 1970s, when interest in economic analysis of law was stimulated by four important contributions: Coase’s (1960) article on externalities and legal liability, Becker’s (1968) article on crime and law enforcement, Calabresi’s articles and culminating book (1970) on accident law, and Posner’s (1972) general textbook on economic analysis of law and his establishment of the Journal of Legal Studies. As this survey will indicate, research in economic analysis of law has been active since the 1970s and is accelerating.1 The field, however, is far from mature; one indication is the lack of empirical work on most topics.
Our focus here will be analytical, and we will cover five basic legal subjects.2 The first three are the central areas of civil law. We begin with liability for accidents, which can be understood as addressing the problem of probabilistic externalities. Second, we discuss property law, which concerns the nature and justification of property rights, how they are acquired and transferred,
how conflicts in the use of property are resolved, and related topics. Third, we examine contract law, including the formation of contracts, their interpretation, and remedies for their breach. The following section concerns civil litigation, that is, the bringing of lawsuits by private actors to enforce their rights in the areas of law that we have just discussed.
The field of law and economics is presented in several, mainly informal books, Law (1998) and the Encyclopedia of Law and Economics (forthcoming). Journals specializing in law and economics include the Journal of Legal Studies, the Journal of Law and Economics, the Journal of Law, Economics, and Organization, and the International Review of Law and Economics. Also, a professional organization, the American Law and Economics Association, is now well established.
The sections on these subjects can be read largely independently of each other. Not treated in our survey are various, more particular areas of law than the five we have mentioned; omitted areas include antitrust law, corporate and securities law, bankruptcy and commercial law, banking law, international trade law, and tax law. Also, excluded from this survey are problems addressed by the literatures on public choice and positive political theory.
1.2 Objective of the Research:
There are 5 general objectives of the research- in general and more specifically about processes- may attempt to achieve, They are :- description, explanation, forecasting, control, modelling.
These objectives are not completely independent from each other for the explanation of a phenomenon relies in part on its description, its forecast requires a detailed explanation, and so on. But researchers may concentrate on one or the other aspect. Most important, the objective pursued will affect the tools and techniques employed for the analyses.
The two most frequent objectives are description and explanation, Description is moist often an exploratory phase undertaken using graphical representations and statistical measures that are not inferential, while explanation involves precise hypotheses to be confronted and employs inferential statistical tests.
Modelling is the latest, broadest objective [Bossel Bosse11994]. It requires that the descriptive and explanatory phases brought sufficient information and knowledge about the system, so to build a model that synthetically gathers the various variables in a coherent and parsimonious way.
Control is an objective rarely set in psychological research (for it brings important ethical considerations), and forecasting is just a little more frequent. We will not address these tow objectives in this work.
Actually the objective of research is to acquire a range of discipline-specific skills in resolving many kinds of problems including finding, understanding and applying rules and principles. The cardinal objectives of the research can be followings:-
- I) To develop understanding of doctrinal and empirical approaches to the study of “ Economic Analysis of Law”;
- II) To find out relation between Economics and law;
III) To understand Justice system and its application in Economics;
- IV) To equip with a basic understanding of empirical research methods;
- V) To enable critically to evaluate the use made by academic writers of such methods and data;
- VI) To encourage understanding of the importance of empirical research in debates concerning the topic;
VII) To equip with advanced research skills in using different databases and the internet;
VIII) To help prepare for my final-year dissertation.
1.3 Research Methodology:
The course Research Method lies at the core of our Talent program. Alongside the other courses of the Talent program, this course aims to provide the theoretical understanding and training in the skills and aptitudes necessary for pursuing (Hons.) level research in the discipline of law. The course provides:
- an introduction in the (meta-) theoretical aspects of research (Primary);
- a training in ‘research action ‘ Secondary).
Both primary and secondary methods methods have been applies in the present research (Economic Analysis of Law). For being successful in primary method, a number of relevant books, articles, encyclopedias etc have been collected. And in secondary method, specialist commentary services with research questions have been taken.
1.4 Historical Background of “Economic Analysis of Law”
Ronaled Coase  brought economic analysis of law to the attention of the general legal academy; by the late 1970s, his work had provoked a vigorous controversy within the legal academy. That controversy has work had provoked a vigorous controversy within the legal academy, That controversy has usually defined the debate around the philosophical foundations of economic analysis of law. Posner made two claims: (I) Common law legal rules are, in fact, efficient; and (II) Legal rules ought to be efficiency.
In both claims, “efficient” means maximization of the social willingness-to-pay. In the course of the controversy, two other claims were articulated in Kornhauser [1984, 1958]: (III) Legal processes select for efficient rules; and (IV) individuals respond to legal rules economically. (In this third claim, “efficient” means “pareto e Kornhauser identified this last, behavioral claim as central to the enterprise. A fifth claim is also implicit in the literature: (V) on the best interpretation of law, efficient. “common law doctrines promote efficiency1. Notice that (V) differs from (I) in important respects. According to (V), an
economic interpretation fits a doctrine not because, as asserted in (I) the legal rules in fact induce efficient behavior but because the rule would induce efficient behavior within the view of the world that seems to underlie the judicial decision (I) is an empirical claim that requires the analyst to determine whether the actual behavior induced by legal rules is efficient; it requires knowledge of how individuals do, in fact, behave and of which behavior in ten real world would, in fact, be efficient. (V) requires only knowledge of the content of judicial opinions; the decision. (V) might be true even though legal rules induced inefficient behavior in the real world because the announced legal rule might be efficient within the implicit model used by judges.
These five claims do not correspond directly to traditional questions in the philosophy of law. The evaluative claim (II) that legal rules ought to be efficient would, if directed to judges, qualify as a theory of adjudication, one of the central concerns of Anglo-American philosophy of law. Central philosophic questions concerning the concept of law, of its normatively, and the obligation to obey the law, however, are not directly addressed. The behavioral claim as well as the evolutionary claim (III) and the positive claim (II), by contrast, concern empirical issues that philosophers of law generally neglect. Nevertheless, the controversy within the legal academy has generally regarded economic analysis of law as providing a comprehensive theory of law that challenges traditional approaches to law. Indeed, an explanation of the vehemence of the controversy should identify differences in fundamental views concerning law.
- Liability for Accidents
Legal liability for accidents (a branch of tort law) is a means by which society can reduce the risk of harm by threatening potential injurers with having to pay for the harms they cause. Liability is also frequently viewed as a device for compensating victims of harm, but we will emphasize that insurance can provide compensation more cheaply than the liability system. Thus, we will view the primary social function of the liability system as the provision of incentives to prevent harm. There are two basic rules of liability. Under strict liability, an injurer must always pay for harm due to an accident that he causes.
Under the negligence rule, an injurer must pay for harm caused only when he is found negligent, that is, only when his level of care was less than a standard of care chosen by the courts, often referred to as due care. (There are various versions of these rules that depend on whether victims’ care was insufficient, as we will discuss below.) In fact, the negligence rule is the dominant form of liability; strict liability is reserved mainly for certain especially dangerous activities (such as the use of explosives). Our discussion of liability begins by examining how liability rules create incentives to reduce risk. The allocation of risk and insurance will then be considered, and following that, the factor of
administrative costs. Then we take up a number of important topics bearing on liability: the magnitude of liability (damages), causation, and the judgment proof problem (assets insufficient to pay for harm). Finally, we consider the subjects of product liability and intentional torts.3
In order to focus on liability and incentives to reduce risk, we assume in this section that parties are risk neutral. Further, we suppose that there are two classes of parties, injurers and victims, and that they are strangers to one another, or at least are not in a contractual relationship.
3A comprehensive economic treatment of accident law is contained in Shavell (1987a), which this section largely follows. See also Landes and Posner (1987a) and Calabresi (1970), an early, innovative, informal economic analysis of
liability. Under the negligence rule, suppose that the due care level x^ is set equal to x*, meaning that an injurer will have to pay h if x < x* but will not have to pay anything if x x*. Then it can be shown that the injurer will choose x*: clearly, the injurer will not choose x greater than x*, for that will cost him more and he will escape liability by choosing merely x*; and he will not choose x < x*, for then he will be liable (in which case the analysis of strict liability shows that he would not choose x < x*).
For example, injurers might be drivers and victims pedestrians, or injurers might be polluting firms and victims affected residents. To begin with, we assume that accidents are unilateral in nature: only injurers can influence risks. Then we consider bilateral accidents, in which victims as well as injurers affect risks. We
also examine two types of action that parties can take that alter risk: first we consider their level of care (such as driving speed) and then their level of activity (number of miles driven).
2.1.1. Unilateral accidents and the level of care. Here we suppose that injurers alone can reduce risk by choosing a level of care. Let x be expenditures on care (or the money value of effort devoted to it) and p(x) be the probability of an accident that causes harm h, where p is declining in x. Assume that the social objective is to minimize total expected costs, x + p(x)h, and let x* denote the optimal x. Under strict liability, injurers pay damages equal to h whenever an accident occurs, and they naturally bear the cost of care x. Thus, they minimize x + p(x)h; accordingly, they choose x*. Thus, under both forms of liability, injurers are led to take optimal care. But note that under the negligence rule, courts need to be able to calculate optimal care x* and to be able to observe actual care x, in addition to observing harm. In contrast, under strict liability courts do not need to do the former two; they only need to observe harm.4
It should also be noted that, under the negligence rule with due care x^ equal to x*, negligence would never actually be found, because injurers are induced to choose x* and thus would be exonerated if they were sued after causing an accident. Findings of negligence may occur, however, under a variety of modifications of our assumptions. Courts might make errors in observing injurers’ actual level of care so that an injurer whose true x is at least x* might mistakenly be found negligent because his observed level of care is below x*. Similarly, courts might err in calculating x* and thus might set due care x^ above x*. If so, an injurer who chooses x* would be found negligent (even though care is accurately observed) because x^ exceeds x*. As emphasized by Craswell and Calfee (1986), the possibility of errors in the negligence determination leads injurers to choose incorrect levels of care; one possibility is that they would take excessive care in order to reduce the risk of being found negligent by mistake.5 There exist other explanations for findings of negligence as well, including the possibility that individuals may not know x* and thus take too little care, the judgment-proof problem, which may lead an actor to choose to be negligent (see section 2.6), and the inability of a person to control his behavior perfectly at every moment or of a firm to control its employees.
4Compare the discussion of corrective taxes versus regulation in section 3.6.2.
5This might explain the phenomenon of “defensive medicine,” on which see Danzon (1985) and, for empirical evidence, Kessler and McClellan (1996). Whether there is a tendency toward excessive care depends upon the degree of legal error and on whether injurers who are found negligent are held responsible for all harm caused or only the incremental harm attributable to their negligence. On the latter, see Grady (1983) and Kahan (1989).
2.1.2. Bilateral accidents and levels of care. We now assume that victims also choose a level of care y, that the probability of an accident is p(x,y) and is declining in both variables, that the social goal is to minimize x + y + p(x,y)h, and that the optimal levels of care x* and y* are positive.6 Under strict liability, injurers’ incentives are optimal conditional on victims’ level of care, but victims have no incentive to take care because they are fully compensated for their losses. However, the usual strict liability rule that applies in bilateral situations is strict liability with a defense of contributory negligence, meaning that an injurer is liable for harm only if the victim’s level of care was not negligent, that is, his level of care was at least his due care level y^. If victims’ due care level is set by the courts to equal y*, then it is a unique equilibrium for both injurers and victims to act optimally: victims can be shown to choose y* in order to avoid having to bear their losses, and injurers will choose x* since they will in fact be liable, as victims will not be negligent.7
Under the negligence rule, optimal behavior, x* and y*, is also the unique equilibrium. Injurers can be shown to choose x* to avoid being liable, and since victims will therefore bear their losses, they will choose y*.8 Two other variants of the negligence rule are negligence with the defense of contributory negligence (under which a negligent injurer is liable only if the victim is not negligent) and the comparative negligence rule (under which a negligent injurer is only partially liable if the victim is also negligent). These rules are also readily shown to induce optimal behavior in equilibrium. Thus, all of the negligence rules, and strict liability with the defense of contributory
negligence, support optimal levels of care x* and y* in equilibrium, assuming that due care levels are chosen optimally. Courts need to be able to calculate optimal care levels for at least one party under any of the rules, and in general this requires knowledge of the function p(x,y). The main conclusions of this and the last section were first proved by Brown (1973).9
6In some early, less formal literature on accidents, for example, Calabresi (1970), reference is made to the notion of the “least-cost avoider,” the party — injurer or victim — who can avoid an accident at the lower cost. The idea of a leastcost avoider relies on the assumption that each party can undertake a discrete amount of care, independently sufficient to prevent an accident.
7That this equilibrium is unique follows from three observations: (1) Victims never have an incentive to take care y exceeding y* (for once they take due care they will be compensated for their losses). (2) Victims will not choose y less than y*, for if they do so, they will bear their own losses, injurers will take no care, and victims thus will minimize y + p(0,y)h. But y + p(0,y)h = 0 + y + p(0,y)h > x* + y* + p(x*,y*)h > y*, implying that victims must be better off choosing due care y* than any y < y*. (3) Because in equilibrium victims thus take due care, injurers choose x to minimize x + p(x,y*)h, which is minimized at x*
.8Uniqueness is demonstrated by the following: (1) Injurers will not take excessive care in equilibrium. (2) If injurers take inadequate care, victims will take no care, so injurers will minimize x + p(x,0)h, which exceeds x* + y* + p(x*,y*)h, which exceeds x*. Thus, injurers are better off taking care of x*. (3) Since injurers must choose x* in equilibrium, victims will choose y*.
9Diamond (1974) proved closely related results shortly afterward. See also Green (1976), who focuses on the case of heterogeneous injurers and victims. The failure of the negligence rule to control the level of activity arises because negligence is defined here (and for the most part in reality) in terms of care alone
2.1.3. Unilateral accidents, level of care, and level of activity Now let us reconsider unilateral accidents, allowing for injurers to choose their level of activity z, which is interpreted as the number of times they engage in their activity (or if injurers are firms, the scale of their output). Let b(z) be the benefit (or profit) from the activity, and assume the social object is to maximize b(z) – z(x + p(x)h); here x + p(x)h is assumed to be the cost of care and expected harm each time an injurer engages in his activity. Let x* and z* be optimal values. Note that x* minimizes x + p(x)h, so x* is as described above in section 2.1.1, and z* is determined by b(z) = x* + p(x*)h, which is to say, the marginal benefit from the activity equals the marginal social cost, comprising the sum of the cost of optimal care and expected accident losses (given optimal care).
Under strict liability, an injurer will choose both the level of care and the level of activity optimally, as his object will be the same as the social objective, to maximize b(z) – z(x + p(x)h), because damage payments equal h whenever harm occurs. Under the negligence rule, an injurer will choose optimal care x* as before, but his level of activity z will be socially excessive. In particular, because an injurer will escape liability by taking care of x*, he will choose z to maximize b(z) – zx*, so that z will satisfy b(z) = x*. The injurer’s cost of raising his level of activity is only his cost of care x*, which is less than the social cost, as it also includes p(x*)h. The excessive level of activity under the negligence rule will be more important the larger is expected harm p(x*)h from the activity.
2.1.4. Bilateral accidents, levels of care, and levels of activity. If we consider levels of care and of activity for both injurers and victims, then none of the liability rules that we have considered leads to full optimality (assuming that activity levels are unobservable). As just explained, the negligence rule induces injurers to engage excessively in their activity. Similarly, strict liability with a defense of contributory negligence leads victims to engage excessively in their activity (the number of times they expose themselves to risk), as they do not bear their losses given that they take due care. The reason that full optimality cannot be achieved is in essence that injurers must bear accident losses to induce them to choose the right level of their activity, but this means that victims will not choose the optimal level of their activity, and conversely.10 The distinction between levels of care and levels of activity was first emphasized in Shavell (1980c), where the results of this and the last section were shown.
2.1.5. Empirical evidence on the effect of liability on safety. Only a modest amount of empirical work has been undertaken on the effect of liability on accident risks. See Dewees, Duff, and Trebilcock (1996) for a general survey of the literature that exists, and, among others, Devlin (1990), Landes (1982), and Sloan (1994) on liability and auto accidents, Danzon (1985) and Kessler and McClellan (1996) on liability and adverse medical outcomes, and Higgins (1978), Priest (1988), and Viscusi (1991) on liability and product safety.
2.2. Risk-bearing and Insurance
We consider next the implications of risk aversion and the role of insurance in the liability system, on which see Shavell (1982a). Several general points may be made.
First, the socially optimal resolution of the accident problem obviously now involves not only the reduction of losses from accidents, but also the protection of risk-averse parties against risk. Note that risk bearing is relevant for two reasons: not only because potential victims may face the risk of accident losses, but also because potential injurers may face the risk of liability. The former risk can be mitigated through first-party insurance, and the latter through liability insurance.
Second, because risk-averse individuals will tend to purchase insurance, the incentives associated with liability do not function in the direct way discussed in the last section, but instead are mediated by the terms of insurance policies. To illustrate, consider strict liability in the unilateral accident model with care alone allowed to vary, and assume that insurance is sold at actuarially fair rates. If injurers are risk averse and liability insurers can observe their levels of care, injurers will purchase full liability insurance coverage and their premiums will depend on their level of care; their premiums will equal p(x)h. Thus, injurers will want to minimize their costs of care plus premiums, or x + p(x)h, so they will choose the optimal level of care x*. In this instance, liability insurance eliminates risk for injurers, and the situation reduces to the previously analyzed risk-neutral case. 10However, there exist ways to induce fully optimal behavior using tools other than conventional liability rules.
For example, if injurers have to pay the state for harm and victims bear their own losses, both victims and injurers will choose levels of care and of activity optimally. On the possibility of such decoupling of what injurers pay and what victims receive, see note 102.
If, however, liability insurers cannot observe levels of care, ownership of full coverage could create severe moral hazard, so would not be purchased. Instead, as we know from the theory of insurance, the typical amount of coverage purchased will be partial, for that leaves injurers an incentive to reduce risk. In this case, therefore, the liability rule results in some direct incentive to take care because injurers are left bearing some risk after their purchase of liability insurance. But injurers’ level of care will still tend to be less than first best.
This last situation in which liability insurance dilutes incentives leads to our third point, concerning the question whether the sale of liability insurance is socially desirable. (We note that because of fears about incentives, the sale of liability insurance was delayed for decades in many countries and that it was not allowed in the former Soviet Union; further, in this country liability insurance is sometimes forbidden against certain types of liability, such as against punitive damages.) The answer to the question is that sale of liability insurance is socially desirable, at least in basic models of accidents and some variations of them. In the case just considered, the reason is evident. Injurers are made better off by the presence of liability insurance, as they choose to purchase it. Victims are indifferent to its purchase by injurers because victims are fully compensated under strict liability for any losses they sustain. In particular, it does not matter to victims that the likelihood of accident may rise due to the sale of liability insurance. This argument must be modified in other cases, such as when the damages injurers pay are less than harm. In that circumstance, the sale of liability insurance may not be socially desirable. See section 2.6.
Fourth, consider how the comparison between strict liability and the negligence rule is affected by considerations of risk-bearing. It is true that the immediate effect of strict liability is to shift the risk of loss from victims to injurers, whereas the immediate effect of the negligence rule is to leave the risk on victims (injurers will tend to act non-negligently). However, the presence of insurance means that victims and injurers can substantially shield themselves from risk. Of course, as was just discussed, insurance coverage may be incomplete due to moral hazard; this makes risk-bearing of some relevance to the comparison of liability rules, but which rule becomes more favorable is not obvious. Finally, as we stated at the outset of section 2, the presence of insurance implies that the liability system cannot be justified primarily as a means of compensating risk-averse victims against loss. Rather, the justification for the liability system must lie in significant part in the incentives that it creates to reduce risk. To amplify, although both the liability system and the insurance system can compensate victims, the liability system is much more expensive than the insurance system (see the next section).11 Accordingly, were there no social need to create incentives to reduce risk, it would be best to dispense with the liability system and to rely on insurance to accomplish compensation.12
11Also, victim compensation through liability generally implies that possibly risk-averse injurers bear risk. In some contexts, such as auto accidents, one supposes that injurers are not substantially less risk-averse than victims.
12Some jurisdictions have implemented “no-fault” (essentially, first-party insurance) regimes for automobile accidents. See Dewees, Duff, and Trebilcock (1996). Also, there are intermediate schemes, like workers’ compensation, that provide compensation and charge experience-rated premiums to injurers to instill incentives to reduce risk. See Moore and Viscusi (1990)
2.3. Administrative Costs
2.3.1. Administrative costs of the liability system. The administrative costs of the liability system are the legal and other costs (notably the time of litigants) involved in bringing suit and resolving it through settlement or trial. These costs are substantial; a number of estimates suggest that on average, administrative costs of a dollar or more are incurred for every dollar that a victim receives through the liability system. In contrast, the administrative cost of receiving a dollar through the insurance system is often below fifteen cents.13The factor of administrative costs affects the comparison between the forms of liability. On one hand, we would expect the volume of cases — and thus administrative costs — to be higher under strict liability than under the negligence rule. This is because, under strict liability, a victim can collect whether or not the injurer was at fault, whereas under the negligence rule fault must be established, so that in many cases of accident there will be no suit or, if there is a suit, it will be dropped after little has been spent.14 On the other hand, given that there is a case, we would anticipate administrative costs to be higher under the negligence rule than under strict liability, because under the negligence rule due care will be at issue.
2.3.2. Administrative costs and the social desirability of the liability system. The existence of administrative costs and their significant magnitude raises rather sharply the question whether it is worthwhile for society to bear them to gain the benefits of the liability system — the incentives to reduce risk. Unfortunately, it is quite possible for suits to be attractive for private parties to bring even if the social benefits of the liability system are small and make it socially undesirable.
2.4. Magnitude of Liability: Damages
The magnitude of the payment a liable party must make is known as damages, because it is normally set equal to the harm the victim has sustained. In this section, we discuss various issues relating to damages.
2.4.1. Basic theory. As a general matter, damages should equal harm under strict liability for incentives to be optimal in the unilateral model of accidents. Clearly, for injurers to be led to choose optimal levels of care, their expected liability must equal expected harm p(x)h, meaning that damages d should equal h. Likewise, for their levels of activity to be optimal, the same must be true.15
We should add that this point essentially carries over to the situation, not yet considered, where the magnitude of harm is stochastic.
14Farber and White (1991) provide evidence that many medical malpractice cases are dropped after discovery, when plaintiffs learn that the defendant probably was not negligent. Relatedly, Ordover (1978) analyzes a model in which victims are uncertain about injurers’ negligence; the result is that some victims of negligence may not sue and others who are not victims of negligence might sue.
15In the bilateral model, damages equal to harm would also be optimal under a rule of strict liability with a defense of contributory negligence if victims’ activity level is not variable. If their activity level is variable, then optimal damages may well be less than harm, for this will induce victims to moderate their level of activity.
Under the negligence rule, analysis of the optimal magnitude of damages is somewhat different. Recall that if damages equal harm h, injurers will be induced to take care of x* (assuming that due care x^ = x*). It is also the case that damages higher than h would induce injurers to take care of x*: this will increase the incentive to be non-negligent, to choose x*, but it will not lead injurers to take excessive care because they can escape liability merely by taking care of x*. Moreover, it can be shown that damages somewhat below h will also induce due care because, by taking due care rather than slightly less care under the negligence rule, injurers do not
just reduce liability slightly but avoid liability altogether.16 Thus, optimal damages are not unique but range from a level somewhat below h to any greater level. When, however, one introduces the possibility of uncertainty in the negligence determination (see section 2.1.1), the situation becomes more complicated. For example, we noted that error in the negligence determination might lead injurers to take excessive care to reduce the risk of being found negligent by mistake. If so, a level of damages exceeding h would only exacerbate this problem, and it might be beneficial for d to be lower than h.17
To sum up, we can say that in simple cases damages should equal harm under strict liability and under the negligence rule, although there are complications, such as that concerning uncertainty in the negligence determination. In fact, the law generally does impose damages equal nto harm, but subject to some exceptions (which we will note in sections 2.4.3 and 2.4.5).
2.4.2. Nonpecuniary elements of loss. Accidents often involve nonpecuniary losses, such as pain and suffering. To provide injurers with proper incentives to reduce accidents, they should pay for all nonpecuniary harms that they cause. However, it may be better for the state to receive these payments than for victims to receive them. Victims would often not elect to insure against
nonpecuniary losses because these losses would not create a need for money, that is, raise their marginal utility of wealth.18 Parents usually would not insure against the death of a child, for example, as this frequently would not generate a need for money, however devastating the loss would be for the parents. Thus, as initially proposed by Spence (1977), liability for pecuniary losses accompanied by an appropriate fine for nonpecuniary losses may be socially desirable.19
16 This point depends upon the particular formulation of the negligence rule (whether a person who takes less than due care is responsible for all harm caused or only the increment to harm resulting from x falling below x^). See Kahan (1989).
17An additional issue is that erroneous findings of liability tend to remedy the problem of excessive levels of activity under the negligence rule, raising the possibility that setting damages above h would be desirable.
18 For empirical evidence, see Viscusi and Evans (1990).
19 Alternatively, victims might enter into contracts under which insurers would receive pain and suffering recoveries in exchange for a reduction in premiums on other coverage.
2.4.3. Punitive damages. When an injurer’s behavior departs substantially from what is appropriate, damages in excess of harm, so-called punitive damages, may be imposed. If imposition of such damages causes expected liability to exceed expected harm, injurers will be induced to take excessive precautions, at least under strict liability, and they will also reduce their levels of activity undesirably.20
higher should be damages when the party is found liable. Accordingly, a firm that dumps toxic wastes at night, or an individual who tries to conceal a bad act, should have to pay punitive damages, but not an injurer who in a noticeable way causes harm. On these points and others, see, for example, Cooter (1989), Diamond (1997), and Polinsky and Shavell (1998a).
2.4.4. Accuracy of damages. Much expense is incurred in litigation about the magnitude of a victim’s harm, which raises the question of what the social value of greater accuracy is and whether the private value of accuracy is different from the social value. As stressed in Kaplow and Shavell (1996b), the private value of accuracy about harm generally exceeds the social value.
To explain, there is social value in establishing harm accurately if and only if injurers know the harm they might cause when they choose their level of care. For example, if an injurer anticipates that the atypically large harm he might cause will be accurately measured, he will exercise an appropriately high degree of care, as is socially desirable.21 However, injurers often lack (and could not reasonably obtain) considerable information about the harm they might cause when they decide on their precautions. Drivers, for example, know relatively little about how much harm a potential victim would suffer in an accident (the seriousness of injuries, the magnitude of lost earnings). Thus, drivers’ incentives to avoid accidents would be largely the same if, instead of using precise measurements of harm, courts employed rough averages (based, perhaps, upon
abbreviated litigation over damages or upon figures from a table). Nevertheless, victims and injurers have very strong incentives to spend to establish damages accurately in court. A victim will always be willing to spend up to a dollar to prove that harm is a dollar higher, and an injurer will always be willing to spend up to a dollar to prove that harm is a dollar lower.
20 Under a perfectly operating negligence rule, punitive damages would not affect injurers’ behavior, as explained in section 2.4.1. But if there is uncertainty in the negligence determination, the problem of excessive precautions may be exacerbated by punitive damages; also, punitive damages may reduce injurers’ activity levels (although this effect may be desirable
21Relatedly, a prospective injurer’s incentive to acquire information about the harm he may cause (whether it will be atypically large) will be greater when he knows that harm will be accurately determined.
2.4.5. Components of loss that are difficult to estimate.
Some components of loss are hard to estimate, for example, the decline in profits caused by a fire at a store (as opposed to the cost of repairing the store) or certain non npecuniary harms, and the law sometimes excludes such difficult-to-measure elements of loss from damages. This legal policy might be justified when the cost of ascertaining a component of loss outweighs the value of the improvement in incentives that its inclusion would accomplish. However, the cost of estimating a component of loss would be low if rough estimates were used (and the analysis of the last section suggests that this often would not much compromise incentives to reduce risk). Therefore, the policy of excluding components of loss that are hard to evaluate may be unwarranted.
2.5.1. Basic requirement of causation. A fundamental principle of liability law is that a party cannot held liable unless he was the cause of losses. For example, if cancer occurs in an area where a firm has polluted, the firm will in principle be liable only for the cancer that it caused, not for cancer due to other carcinogens.
This principle is clearly necessary to achieve social efficiency under strict liability, because otherwise incentives would be distorted. Socially desirable production might be rendered unprofitable if the firm were held responsible for all cases of cancer.
Under the negligence rule, restricting liability to accidents caused by an actor may be less important than under strict liability: if negligent actors were held liable for harms they did not cause, they would only have greater reason to act non-negligently, but would not take excessive precautions if there were no uncertainty surrounding the negligence determination. In the
presence of such uncertainty, however, relaxation of the causation requirement might adversely affect incentives. Further, under both liability rules, absence of the causation requirement might raise the volume of litigation and thus administrative costs. On the basic causation requirement and incentives, see originally Calabresi (1975) and Shavell (1980a).22
22On the causation requirement under the negligence rule, see also Grady (1983) and Kahan (1989), who study restriction of liability to losses that are in excess of the possibly positive losses that the actor would have caused had he not been negligent. The possibility that a party would not be said to be the proximate cause of losses on account of coincidence (as opposed to the freak character of losses) is illustrated by the following case: a
2.5.2. Uncertainty over causation. In many situations there is uncertainty about causation. For example, it may not be known which manufacturer out of many sold the product that resulted in injury, or whether harm was due to the defendant firm or to background factors (was cancer attributable to a firm’s pollutant or to unknown environmental carcinogens?). The traditional approach of the law is to hold a defendant liable if and only if the probability that the defendant was the cause of losses exceeds 50%. This approach can lead either to inadequate or to excessive incentives to reduce risk. For example, a firm that supplies only 20% of the market demand will escape liability for any harm caused by its product (assuming that harm cannot be traced to particular firms). Consequently, the firm will have no liability-related incentive to take precautions. If, however, a firm’s market share exceeds 50%, the firm will be liable for all harms due to the product that it and other firms sell, for it will always be correctly said to be more likely than not the cause of harm. Thus, the firm’s liability burden will be socially excessive (under strict liability). These potential problems of inadequate and of excessive incentives may arise under any liability criterion based on a threshold probability of causation; they are not unique to a 50% threshold. Essentially this point has been made frequently, and it is formally developed in Shavell (1985b).
The legal system has recently adopted (in limited settings) the approach of imposing liability in proportion to the likelihood of causation. Under this approach, a firm supplying 20% of the market would be liable for 20% of harm in every case. Note, therefore, that the firm’s liability bill would be the same under this regime as it would be if it paid for all the harm in the 20% of cases it truly caused — implying that its incentives would be socially appropriate. That the proportional liability principle engenders optimal incentives (without there being a need to establish causation in particular cases) is an advantage of the principle relative to the traditional threshold probability criterion. See Rosenberg (1984) and Shavell (1985b).
2.5.3. Proximate causation. Even if a party is a cause of losses, he may still escape liability under tort law because he was not the “proximate cause” of losses, where proximately-caused losses are, mainly, those that came about in an ordinary manner and that were not the product of coincidence. Liability often is not found for freak accidents, such as where a dog imbibes nitroglycerin left at a mining site and then explodes, injuring nearby persons. Allowing parties to escape liability for such unusual accidents is sometimes thought not to undermine incentives, on the ground that no one could have foreseen such accidents. This argument, however, is subject to the criticism that courts may find it difficult to discriminate between accidents that can and cannot
be foreseen. Moreover, the argument leads to the reductio ad absurdum that there should never be liability: any accident may be viewed as extraordinarily unlikely (of essentially zero probability) if it is described in sufficient detail. speeding bus happened to be at just the “right” point on its route to be struck by a falling tree; the bus company escaped liability for the injuries to passengers even though they would not have occurred but for the excessive speed of the bus. Allowing parties to escape liability for such coincidental accidents might not affect precautions, however. One presumes that the probability of a bus being struck by a falling tree is independent of its speed, so that imposing liability would not affect the speed at which buses are driven. On proximate causation, see Calabresi (1975) and Shavell (1980a).
2.6. Judgment-Proof Problem
The possibility that injurers may not be able to pay in full for the harm they cause is known as the judgment-proof problem and is of substantial importance, for individuals and firms often pose risks significantly exceeding their assets (a person of modest means could cause a devastating fire; a small firm’s product could cause many deaths).
First, if there is another party who has some control over the behavior of
the party whose assets are limited, then the former party can be held vicariously liable for the losses caused by the latter. Thus, holding a large contractor liable for the accidents caused by a small subcontractor or an employer for accidents caused by its employees will induce the former to control the risks posed by the latter.23 See Kornhauser (1982) and Sykes (1981).
Second, parties with assets less than a specified amount could in some contexts be prevented from engaging in an activity. However, such minimum asset requirements are a somewhat blunt instrument for alleviating the incentive problems under consideration. A third response to inadequate incentives, one closely related to asset requirements, is regulation of liability insurance. See Shavell (forthcoming). One form of insurance regulation would mandate purchase of (perhaps full) coverage.24 This approach would be especially appealing when insurers can observe the precautions taken by injurers. An opposite form of insurance regulation would prohibit purchase of liability insurance. This could improve incentives to take care if insurers cannot observe injurers’ precautions, because in that case insurance coverage would dilute incentives to take care when these incentives are inadequate to begin with.
23Imposing liability on corporations for behavior of their judgment-proof subsidiaries or requiring that liability of shareholders be unlimited (at least with respect to third-party tort victims) might serve a similar function. See Hansmann and Kraakman (1991). Also, liability might be imposed on parties who supply services to potentially judgment-proof entities and are in a position to monitor them, such as accountants, lawyers, and lenders. See Kraakman (1986) and Pitchford (1995).
24Many jurisdictions require liability insurance of those who drive, although the required amount is usually small perhaps $20,000); the result is that there is an excessive incentive to drive (which may be greatest among the young, who also impose the largest risks). Fourth, the use of Pigouvian taxes equal to expected harm may help to alleviate the judgment-proof problem. When harm will be caused with a low probability, the expected harm will be much less than actual harm; hence, parties with limited assets may be able to pay the appropriate tax on risk-creating behavior even though they could not pay for the harm itself.
A fifth way of correcting for dilution of incentives is for the state to regulate parties’ behavior directly, such as with traffic laws or by insisting that food and drugs meet certain safety requirements. Regulation, however, may involve inefficiency because regulators’ limited knowledge of risk and the cost and ability to reduce it. (We discuss regulation further in the section 3.6.2.)
A final way of mitigating dilution of incentives is resort to criminal liability. A party who would not take care if only his assets were at stake might be induced to do so for fear of imprisonment.
2.7. Product Liability
We have not yet considered accidents where the victims are customers of the injurer (or more generally where victims are in some contractual relationship with injurers). In this case, the role of liability in providing incentives may be attenuated or even nonexistent. The reason, obviously, is that firms producing risky products may be unable to sell them or may have to accept a reduction in price commensurate with the risk of loss attaching to the products.
If customer knowledge of product risk is perfect, then firms’ incentives to reduce risk will be optimal even in the absence of liability. For example, if the expected losses caused by a product risk are $100, the firm will have to accept a $100 lower price than otherwise, so it will be willing to spend up to $100 to eliminate the risk. Therefore, liability is not needed to generate incentives toward safety.
If, however, customer knowledge of risk is imperfect, liability is potentially useful in reducing risk.25 In the absence of liability, firms that increase safety generally will be unable to obtain an increase in the price fully reflecting the reduction in risk.26 (Indeed, in the extreme case where customers cannot observe anything about the true risk, firms would have no incentive to reduce
it.) Therefore, the prospect of liability for product-caused harms will increase incentives to reduce risk. Also, imposing liability will result in prices that reflect the full costs of products, leading to more efficient purchasing decisions.
A question concerning liability is whether court-determined liability or customer-selected liability, namely, warranties, is likely to be better.27 The answer depends on the nature of customers’ information or lack thereof and on other factors.
25See, for example, Goldberg (1974). Another potentially useful policy is supplying information about risk to customers, on which see Magat and Viscusi (1992) and Viscusi and Magat (1987).
26If a sufficient fraction of customers are informed about risk, producers may be induced to offer better products to all customers. See, for example, Schwartz and Wilde (1979).
27See generally Priest (1981) and Rubin (1993).
Suppose instead that customers misperceive risk. Then their selection of warranties may be skewed, as emphasized by Spence (1977). For example, if customers believe the risk of a product failure causing a loss of $10,000 to be 1% when it is really 5%, then a warranty would not be purchased: a seller of a full warranty would have to charge $500 for it, but the perceived expected value of it would be only $100. In this circumstance, it might be better for the courts to impose liability because that would create incentives to reduce risk. producer liability might dilute their incentives to do so (assuming that defenses such as contributory negligence are unsuccessful because of difficulties in observing customers’ behavior). Also, the administrative costs of liability are high. Thus, whether imposition of liability will improve social welfare, given customers’ ability to purchase warranties, involves a complicated weighing of considerations, and courts’ ability to do this is not clear.
2.8. Liability versus Other Means of Controlling Accidents
Liability is only one means of controlling harm-causing behavior; safety regulation and Pigouvian taxes are among the alternatives, as we indicated in section 2.6. For a general comparison of methods of controlling harm, see our discussion of regulating externalities in section 3.6.2.
2.9. Intentional Torts
To this point, we have examined liability for accidents, but we have not dealt explicitly with so-called intentional torts, such as assaulting someone or stealing his property (which also are crimes).28 See Landes and Posner (1981). An intentional tort may be defined as a harm than an injurer causes in which either of two things are true: the injurer acted in a manner that caused harm to occur with a very high probability, or the injurer obtained utility from the victim’s suffering itself. It would be possible to apply the foregoing analysis of accidents to intentional torts without modification. The conclusions reached did not depend on the magnitude of the probability of harm or on the source of benefits to injurers. However, both of these aspects of intentional harms suggest changes in assumptions that could alter our analysis and conclusions.
First, in situations where harm would be very likely to occur, bargaining between injurers and victims would often be possible.29 If so, it may be desirable to forbid injurers from harming victims unless they obtain consent in advance, presumably in exchange for payment.
Second, where injurers derive utility directly from the fact that harm is suffered by victims, some analysts suggest that injurers’ utility should not count in assessing social welfare. If so, deterrence becomes more valuable. Also, it may be optimal to deter some harmful acts even when the injurer’s benefit exceeds the victim’s loss, which calls for damages greater than harm, or for supplemental sanctions, notably imprisonment. (One suspects, however, that with most such intentional torts, injurers’ benefits rarely exceed victims’ losses.)
28On crime, see section 6.4.
29We have implicitly ignored bargaining, except in our product liability discussion, because with most accidents — such as automobile accidents — bargaining with potential victims would be infeasible.
- Property Law
We begin our discussion by reviewing reasons why property rights should exist and by describing instances of their emergence. Then we consider the major questions addressed by property law: the division and form of property rights, public property, the acquisition and transfer of property, and conflicts in the use of property (externalities). Last, we examine the subject of intellectual property. Many of the topics in this section have not been formally analyzed.
3.1. Justifications for Property Rights
A time-honored and fundamental question is why should there be any property rights in things.30 That is, in what respects does the protection of property and the ability to transfer property promote social welfare? One justification for the protection of property is that it furnishes incentives to work, a common example being that people would not grow crops unless they could keep the product of their labor. Similarly, property rights provide incentives to maintain and improve durable things: repair buildings, to fertilize and irrigate land, to conserve renewable resource stocks.31
Another justification for property rights is that, were they absent, individuals would spend time and effort trying to take things from each other and protecting things in their possession, and they would often find themselves involved in conflict. Enforcement of property rights by the state, while involving its own costs, reduces these serious disadvantages that would be incurred in the absence of property rights. A related benefit of enforcing property rights is that it protects people against risk.In addition, it is important that a system of property rights allows for things to be transferred freely. Most obviously, if things can be traded, they will tend to be allocated to those who value them most.32 Moreover, the ability to transfer things is indirectly necessary to our enjoyment of economies of mass production and specialization of labor, for when a large quantity of a good is produced by a single entity, the output ultimately will have to be distributed, which is to say, transferred, to many other individuals, and the entity will also often need to obtain inputs from other parties. In addition, transferability of property (particularly of land) allows it to be used effectively as collateral, thus enabling credit markets to function.33
30A related question concerns how such rights should be protected. See Calabresi and Melamed (1972) and Kaplow and Shavell (1996a).
31Problems with conserving renewable resources that arise in the absence of property rights are often referred to as the tragedy of the commons. See Gordon (1954), Hardin (1968), Libecap (1998), and Ostrom (1998).
32We also note that protecting the security of property rights promotes the transfer of property: without protection of property rights, prospective buyers would not be inclined to buy things that might subsequently be stolen, and prospective sellers would be wary of making their ownership of valuable possessions known to others.
33Empirically-oriented literature on the various benefits of property rights includes Alston, Libecap, and Schneider (1996), Atwood (1990), Besley (1995, 1998), and Feder and Feeny (1991). Early writing about property rights — by Bentham (1830), Blackstone (1765-1769), and Hobbes (1651), among others — stressed the justifications involving incentives to work and avoidance of strife.
can be provided by paying workers on the basis of effort, even if a state enterprise owns what they produce. (Indeed, employees of profit-maximizing firms in private-enterprise economies are generally motivated by pay rather than by the literal ability to sell what they produce.) And the benefits of avoiding strife and theft might be enjoyed just as much under a centrally planned economy as under a market economy. The arguments for the social value of the marketenterprise system over central planning are different from those justifying the existence of property rights per se. (The arguments favoring market systems are based largely on the informational burdens that central planners face, problems of corruption, and the like.)
3.2. Emergence of Property Rights
We would expect property rights to emerge from a background of no rights or only poorly established rights when the various advantages of their existence substantially outweigh the costs of establishing and maintaining the rights.34 Property rights will be likely to arise in these circumstances because, if many individuals recognize that they will probably be better off under a regime with property rights, pressures will be brought to bear to develop them. Various examples of the emergence of property rights have been studied. Umbeck (1981) examines property rights during the California Gold Rush. When gold was discovered in California in 1848, property rights in land and minerals were largely undetermined and there were virtually no authorities to enforce the law. Almost immediately, however, arrangements were made to protect property rights in gold-bearing land and river beds. This encouraged individuals to pan for gold, to build sluices, and otherwise to invest to extract gold; it also curbed wasteful efforts to grab land and gold from one another.
An additional example of historical interest is the establishment by the Indians of the Labrador Peninsula of rights in land where none had existed. Demsetz (1967) connects this 17th century event to the increased value of furs. He suggests that without property rights in land, overly intensive hunting of fur-bearing animals (especially beaver) would have taken place and the stock of animals would have been depleted.
34We note that property rights can be established and enforced by the state or informally, through social norms. On the latter, see Ellickson (1989, 1991) and Sethi and Somanathan (1996). Property rights also might be enforced by private organizations, such as the Sicilian mafia. See Gambetta (1993).
A more recent instance of the emergence of property rights concerns resources of the sea, as described in Biblowit (1991) and Eckert (1979). For most of history, there were no property rights in the ocean’s fisheries because fish were in inexhaustible supply for all practical purposes, but fish populations have come under strain with the use of modern fishing methods. To provide incentives to preserve fisheries, it has come to be accepted that countries have property rights in fish found in their coastal waters. Also, property rights have recently been established in the sea bed to foster exploration and extraction of oil and mineral resources. Another important example of the appearance of property rights concerns rights to the electromagnetic spectrum; assignmentof these rights prevents garbling of signals and encourages investment in programming and transmission as well as trade of rights to high-value users. See DeVany, et al. (1969) and McMillan (1994).35
3.3. Division and Form of Property Rights
3.3.1. Division of property rights. From a conceptual viewpoint, what we speak of somewhat loosely as property rights can be divided into more basic rights, composed of particular possessory rights and rights to transfer these rights. A possessory right in a thing is the right to use it in a specified way at a named time and under a particular contingency. A right to transfer a possessory right is the right to give or sell a possessory right to another person. Thus, what we commonly conceive of as “ownership” of something (say, land) entails both a large swath of possessory rights (rights to build on land, plant on it, and so forth, under most contingencies, and into the infinite future) and associated rights to transfer them.
In fact, property rights in things are generally held in substantially agglomerated bundles, but there is also significant partitioning of rights contemporaneously, according to time and contingency, and according to whether the rights are possessory or are for transfer.
For example, an owner of land may not hold complete possessory rights, in that others may possess an easement giving them the right of passage upon his land, or the right to take timber, or the right to extract oil if found (thus a contingent right). A rental agreement constitutes a division of property rights over time; wills provide for future and often contingent division of rights (depending on the survival of beneficiaries). Trust arrangements, such as those under which an adult manages property for a child, divide possessory rights and rights to transfer.
35Literature on the emergence of property rights is surveyed in a general discussion of property rights and economic activity in Libecap (1986).
The division of possessory rights may be valuable when different parties derive different benefits from them, because gains can then be achieved if rights are allocated to those who obtain the most from them. There are, however, several disadvantages to the division of possessory rights or too fine a division of the rights. Individuals may wish to exercise the same rights at the same time (a person with a right of passage may wish to use a path that is currently blocked by the owner’s use); externalities and related conflicts may arise (a person with a right of passage might trample crops). In addition, logistical problems may impede the division of rights (consider the problem of many individuals trying to share the use of a single automobile).
3.3.2. Consolidated form of property rights and the theory of the firm. Ownership of separate productive assets is often consolidated; namely, it is held by a single entity, the firm. The question of what constitutes the benefits of this form of ownership was initially posed by Coase (1937) and has subsequently been developed by, among others, Williamson (1975, 1985), Klein, Crawford, and Alchian (1978), Grossman and Hart (1986), and Hart (1995).36 Here, we review the main factors that bear upon the relative advantages of separate versus consolidated holding of assets by firms.37
First, consolidated ownership of assets reduces transaction costs because internal transfers of goods and services may be accomplished by command, eliminating the need for negotiation and bookkeeping expense.38 Such reduction of transaction costs, however, often could be obtained as nwell by separate owners if they entered into long-term supply contracts, honored standing orders, and the like.
Second, consolidated ownership may lead to a dilution of incentives to work, in comparison to the situation where each individual owns the assets he uses in production. Firms can combat this incentive problem in two familiar ways: if they can observe individuals’ efforts, they can penalize shirking; if not, they can tie compensation to measures of output.39 Of course, both methods have costs. (Interestingly, the latter may re-introduce transactions costs, such as if transfer pricing is required to compute a manager’s contribution to the firm’s profits.)
36See also Alchian and Demsetz (1972), Hart (1989), Hart and Moore (1990), Holmstrom and Tirole (1989), and Jensen and Meckling (1976). For a discussion of different forms of consolidated ownership (including employee-owned firms, cooperatives, and nonprofits), see Hansmann (1996).
37The subject of consolidated versus separate ownership of productive assets could be viewed as falling under the heading of division of property rights (separate ownership being division of consolidated ownership), but we find distinguishing the two subjects helpful.
38This savings may involve some sacrifice. For example, information on the profitability of separate functions may be lost (unless there is internal transfer pricing, which may involve transaction costs similar to those of market exchange).
39Ellickson (1993), among others, suggests that most communal farming efforts have failed because individual rewards were not linked to effort or output, which led to widespread shirking. It may be observed as well that modern firms succeed despite their often large size through monitoring of workers’ behavior (which may be more feasible with the use of mechanized technology) and use of performance pay.
Third, consolidated ownership enables a firm to avoid breakdowns in bargaining that would occur under separate ownership due to asymmetric information. For example, under separate ownership, the seller of a factor input might overestimate its value to the next-stage producer and demand too much for it, stymieing an efficient transfer. Under consolidated ownership, efficient transfers can be ordered.40 Alternatively, however, separate owners could contract in advance for transfers to occur at a predetermined price.
Fourth, consolidated ownership may help to alleviate problems of inadequate investment in assets. An asset owner may not have a sufficient incentive to make a relationship-specific investment (upgrading a plant for producing a factor input) because he anticipates that his gains will be partially expropriated by the owner of a complementary asset at the time when he is to put his asset to use. But if both assets are owned by the same party, the problem of expropriation of the gains from the relationship-specific investment in the first asset will be mitigated, and investment in it should be more efficient. However, the other individual’s incentive to invest in what otherwise would have been his asset may be dulled if the first party owns both assets; thus, consolidated ownership does not necessarily improve investment incentives overall. Additionally, it may sometimes be possible under separate ownership of assets to guarantee that investments in them be sufficient by making a contract to that effect; but this requires that investments be observable.
We close by noting that the distinction between consolidated ownership of assets by firms and separate ownership is blurred because, as we have mentioned, under separate ownership together with contractual arrangements, it is often possible to replicate the advantages of firms.
Indeed, separate ownership combined with sufficiently encompassing contracts may be indistinguishable from the consolidation of ownership of assets by firms. Conversely, firms themselves can be understood to consist of a set of contracts (a corporation is a particular contract among its shareholders).
40The manager might know that the transfer is efficient without knowing the precise cost and/or value of the factor input, for the cost distribution may be below the value distribution. When distributions are, instead, substantially overlapping, a manager will not know whether a transfer is efficient, and in this case bargaining between separate parties may well promote efficiency.
3.4. Public Property
Before continuing with our analysis of property rights, we consider briefly an important class of property, that owned by the public. We review the justifications for public property and then two methods of acquisition of such property: by purchase and by unilateral public taking.
3.4.1. Justifications for public property.
The main justifications for public property concern problems with private supply. The government builds and maintains roads, for example, because private supply often would not be forthcoming due to difficulties that would be faced in collecting for road use. And even if roads were privately supplied, suppliers would charge tolls, raising problems of monopoly pricing and wasteful expenditures on toll-collecting. Problems with private supply, however, do not constitute an argument for public ownership of goods, only for public financing of them or for public regulation of private suppliers. A road could be constructed, maintained, and owned by a private party paid by the state. And when private ownership might involve problems of monopoly pricing, government regulation is an alternative to direct ownership. These observations underlie the growing attention to privatization of public property and of government activities. The comparative virtues of public versus private ownership depend on the relative abilities of the government and of the private sector to operate efficiently and maintain quality.41
3.4.2. Acquisition of public property: purchase versus compensated takings. The state may acquire property through purchase or through exercise of the state’s power of eminent domain, which is to say, by taking the property. In the latter case, the law typically provides that the state must compensate property owners for the value of what has been taken from them, and it will be assumed that this is the case until the next section.
The difference between purchases and compensated takings is that the amount owners receive is determined by negotiation in the former case but unilaterally by the state in the latter case. Because of possible errors in governmental determinations as well as concerns about the behavior of government officials, purchase would ordinarily be superior to compensated takings.
An exception, however, arises where the state needs to assemble many contiguous parcels, such as for a road. Here, acquisition by purchases might be delayed or prevented by hold-out problems, making the power to take advantageous.
41See, for example, Hart et al. (1997), Shleifer (1998), and Viscusi et al. (1995, pp. 468-70)
3.4.3. Compensation for takings. Assuming that there is a reason for the state to take property, consider the effects and desirability of a requirement that the state pay compensation to property holders. As emphasized by Blume et al. (1984), payment of compensation to property owners creates a potential moral hazard: it leads them to invest excessively in property.
A second effect of compensation for takings is that risk-averse property owners will bear less risk.42 But were takings not compensated, insurance against takings would be likely to emerge.
Moreover, private insurance would naturally alleviate the problem of excessive investment in property.43
Third, payment of compensation also may alter the incentives of public authorities to take property by reducing possible problems of overzealousness and abuse of authority. However, requiring compensation may also exacerbate potential problems of too little public activity (public authorities do not directly receive the benefits of takings). Therefore, it is not clear whether a compensation requirement would improve the incentives of public authorities. For further discussion of these various issues about compensation for takings, see Kaplow (1986a, 1992a).44
3.5. Acquisition and Transfer of Property
We return now to the subject of private property and consider a number of topics relating to its acquisition and its transfer.
3.5.1. Acquisition of unowned property. Wild animals and fish, long lost treasure, certain mineral and oil deposits, and, historically, unclaimed land, constitute primary examples of unowned property that individuals may acquire. The law has to determine under what conditions a person will become a legal owner of such previously unowned property, and a general legal rule is that anyone who finds, or takes into his possession, unowned property becomes its owner. Under this finders-keepers rule, incentives to invest in capture (such as to hunt for animals or explore for oil) are optimal if only one person is making the effort. However, if, as is typical, many individuals seek unowned property, they will invest socially excessively in search: for one person’s investment or effort usually will not simply increase the total probability of success, but rather will come at least partly at the expense of other persons’ likelihood of finding unownedn property.45
42It should be noted, however, that many property owners — namely, firms with diversified ownership — are not very risk averse.
43Notably, insurance premiums would be based on the value of property, so further investments would raise premiums.
44On the topic of compensation for loss in value of property due to regulation (as opposed to the physical taking of property), see Fischel (1995) and Miceli and Segerson (1996).
45This problem is similar to the tragedy of the commons. See Gordon (1954) and Hardin (1968).
Various aspects of the law governing the acquisition of property may be regarded as ameliorating this problem of excessive search effort under the finders-keepers rule.46 Notable examples are that regulations may limit the quantities that can be taken of fish and wild animals, the right to search for oil and minerals on the ocean floor may auctioned off, and oil extraction may be “unitized” (assigned to one party).47
3.5.2. Loss and recovery of property. When property is lost by its owner and is found by another person, the question arises whether the original owner should retain property rights or the finders-keepers rule should apply. The general stance of the law is that original owners maintain their property rights in lost things (unless they abandon them). This beneficially discourages original owners from socially excessive investment in preventing losses: a farmer might otherwise invest in an expensive fence to prevent his cattle from straying, which might be inefficient because often his private loss would not constitute a social loss (someone would be likely to find the strays).
Moreover, original owners usually can either search themselves or efficiently organize recovery efforts by others (including by offering rewards). If, however, original owners cannot do this, the finders-keepers rule does have the advantage of inducing recovery effort, even though the rule tends to encourage races to find the effectively unowned property.
3.5.3. Acquisition of stolen property and problems of establishing valid title. A basic difficulty associated with sale of property that a legal system must solve is establishing validity of ownership or “title.” How does the buyer know whether the seller has good title, and how does the buyer obtain good title? If these questions are not readily answered, sales transactions are impeded, and theft may be encouraged.
One route that legal systems may take involves the use of registration systems: lists of items and their owners. Important examples are registries of land, ships, motor vehicles, and many financial instruments. Presuming that an item is recorded in a registry, it will be easy for a buyer to check whether the seller holds good title to it, and the buyer will obtain title by having his name recorded in the registry as the new owner. a general deterrent against theft. (An individual contemplating registration will not take into account that, as the proportion of registered property rises, thieves anticipate that it will be more difficult to sell stolen property and thus are discouraged from theft.)
46For a survey of relevant literature, see Lueck (1998).
47Other laws limit indirectly how much property can be taken by individuals by giving them title only if they make productive use of the property that they find. This was true of homestead laws that gave land to individuals who worked it and of water rights regimes that gave priority to the extent that water supplies were regularly used. Such rules, however, create excessive incentives to exploit property. For most goods, however, registries do not exist because of the expense of establishing and maintaining them relative to the value of the goods and of the deterrence of theft.
These rules have different effects on incentives for theft. Notably, under the bona fide purchaser rule, theft is made attractive because thieves will often be able to sell their property to buyers (who will be motivated to “believe” that the sale is bona fide); the buyers can use the now validly-held property or resell it.
Another social cost of the bona fide purchaser rule is that original owners will spend more to protect their property against theft because theft will be more frequent and, when it occurs, owners will be less likely to recover their property. (These costs of protection, note, are analogous to those arising under the rule allowing finders of lost property to keep it.) Finally, under the bona fide purchaser rule, buyers will not have an incentive to expend effort determining whether there exists a third party original owner. This is an advantage in the direct sense that it reduces transaction costs, but it also compromises deterrence of theft.
3.5.4. Involuntary transfer of property: adverse possession. The legal doctrine of adverse possession effectively allows involuntary transfer of land (and some other types of property): a person who is not the owner of land becomes its legal owner if he takes possession of it and uses it openly and continuously for at least a prescribed period, such as ten years. Some have suggested that a rationale for the rule is that it permits the transfer of land from those who would leave it idle to those who will use it productively. But this overlooks the possibility that there may be good reasons for allowing land to remain idle (perhaps it will be built upon later, and thus an investment in it now would be a waste). A historical justification for the rule is that, before reliable land registries existed, it allowed a landowner to establish good title to a buyer relatively easily: the seller need only show that he was on the land for the prescribed period. Another advantage of the rule is that it reduces disputes that would arise where structures turn out to encroach on neighboring parcels.48
3.5.5. Constraints on sale of property. Legal restrictions are often imposed on the sale of goods and services, including taxation and the outright banning of sale. One standard justification for such policies is externalities. See section 3.6. The other standard justification for legal restrictions on sale is lack of consumer information. For instance, a drug may not be sold without a prescription because of fear that buyers would not use it appropriately. Here, though, one must compare the alternative of the government supplying relevant information to consumers (say that the drug has dangerous side effects, or that it should only be taken with the advice of a medical expert).49
48On adverse possession, see Netter (1998).
49For further discussion, see section 4.1.9 on legal overriding of contracts
3.5.6. Gifts. The making of gifts, including bequests, is the major way in which property changes hands other than by sale. Gifts are, as one would expect, rather freely permitted because, like sales, they typically make both parties better off.50 It should be observed that, in the absence of a state subsidy, the level of giving may well fall short of the socially optimal level because a donor’s private incentive to make a gift does not take into full account the donee’s benefit. See
Kaplow (1995b).51 In addition, some gifts, particularly to charities, may support public goods or accomplish redistribution, which may provide a further ground for subsidy.52 In fact, the law does favor certain types of giving by conferring tax advantages on donees (and, in the case of charities, on donors). On the other hand, heavy gift and estate taxes are levied on large donative transfers to individuals. Another issue concerning gifts is that a person may want to make a transfer in the future, in which case issues concerning contracts to give gifts arise. This subject will be discussed below in section 4.3.2 on donative contracts.
3.6. Conflicts in the Use of Property: Externalities
3.6.1. Socially optimal resolution of externalities. When individuals use property, they may cause externalities, namely, harm or benefit to others. As a general matter, it is socially desirable for individuals to do more than is in their self-interest to reduce detrimental externalities and to increase beneficial externalities.
It should be noted, as emphasized by Coase (1960), that the socially optimal resolution of harmful externalities often involves the behavior of victims as well as that of injurers (and similarly with regard to generators of positive externalities and beneficiaries). Where victims can do things to reduce the amount of harm (install air conditioning to avoid pollution) more cheaply than injurers, it is optimal for victims to do so. Moreover, victims can sometimes alter their locations to reduce their exposure to harm. When the latter possibility is not incorporated into the analysis of externalities (suppose that victims are assumed to continue to live adjacent to a hazardous waste site), what is referred to as the optimal resolution of externalities may only be conditionally optimal.
3.6.2. Resolution of externalities through state intervention. We now consider various means of government intervention, along the lines of Shavell (1984a, 1984c, 1993a).53 For convenience, we confine our attention to the case of harmful externalities, and we will assume (until the next section) that parties affected by externalities cannot bargain with the generators of externalities.
51See also Friedman (1988) on the gift externality.
52See, for example, Atkinson (1976) on redistribution and charitable contributions.
53See also Bovenberg and Goulder (forthcoming, chapter __ of this Handbook) on environmental taxation.
Under direct regulation, the state restricts permissible behavior. It might impose a quantity constraint (a fisherman may be required to limit his catch to alleviate depletion of the fishery) or other behavioral constraints (a factory may be required to use a smoke scrubber). Closely related to state regulation is privately-initiated regulation through use of the legal injunction, whereby a
potential victim can enlist the power of the state to force a potential injurer to take steps to prevent harm or to cease his activity.
Society can also make use of financial incentives to induce injurers to reduce harmful externalities. Under the Pigouvian tax, a party pays the state an amount equal to the expected 50There are some limits on disinheriting one’s immediate family and other rules that prevent individuals from controlling the use of their gifts long into the future (the rationale for which is not entirely clear).
An additional type of financial incentive is a subsidy, an amount paid by the state to a party equal to the reduction in expected harm from some benchmark level that he accomplishes.
There is also liability — a privately-initiated means of providing financial incentives — as we discussed in section 2. Under strict liability, a party who causes harm has to pay the victim for his losses. (Such liability differs from the corrective tax because payment is to the victim rather than to the state, and also because injurers pay for actual harm rather than for expected harm.) Under
the negligence rule, an injurer must pay the victim only if the injurer failed to take a cost-effective precaution.
In fact, liability and regulation are the preeminent tools that society uses to control externalities; the use of corrective taxes and subsidies is unusual. Since Pigou (1932), who first emphasized the problem of externalities, economists have focused on corrective taxes and regulation, essentially ignoring liability. We will now sketch some factors bearing on the relative desirability of these methods of controlling externalities. The review of factors will show that any
of the methods (or a combination) could be the best, depending on the context.
One factor of relevance is the quality of the state’s information. If the state has complete information about acts, that is, it knows the injurer’s benefit or cost of precautions along with the victim’s harm, then all of the approaches allow achievement of optimality. But if the state’s information is imperfect, it will not be able to calculate which actions (such as installing a smoke scrubber) are desirable and thus sometimes will err. However, if the state knows the expected harm, it can induce injurers to act optimally under the corrective tax or a rule of strict liability because the injurer, who is presumed to know the cost of a precaution, will then appropriately balance the cost against the reduction in expected harm that would be brought about.54
54This advantage, as it applies to the comparison between strict liability and the injunction, is suggested by Calabresi and Melamed (1972) and is further explored in Kaplow and Shavell (1996a) and Polinsky (1980b).
We emphasize that this basic informational argument favoring Pigouvian taxes or strict liability over regulation or the negligence rule extends to the case where the state is uncertain about the magnitude of harm. The reason, essentially, is that under the former rules, the state only needs to estimate expected harm (as the injurers themselves implicitly supply complete information about the costs of precaution when making their decisions). By contrast, under regulation and the negligence rule the state must estimate both expected harm and precautioncosts. Because the state’s effectively available information is strictly better under the corrective tax or strict liability, it can achieve a superior outcome. (This point holds notwithstanding Weitzman’s argument suggesting that quantity regulation may be superior to correctivetaxation.55) An implication is that the use of pollution taxes is superior to the use of tradeable pollution permits because, under the latter, the government sets the total quantity of pollution using its own estimate of abatement costs rather than implicitly relying on firms’ information.56
A second factor is the information available to victims. For many externalities, victims have better information than the state about who is causing harm or about its extent — because they actually suffer the harm — so they are the most appropriate enforcement agents, suggesting the desirability of the liability tool or the injunction. In other instances, however, victims may be unaware of the harm or its cause, making the state a better enforcer. State enforcement, such as by regulation or by corrective taxes based upon statistical evidence of expected harm, avoids the need to identify, say, which pollutants ultimately harmed which victims.
A third factor concerns the level of activity of an injurer (how much a firm produces, how many miles a person drives), as opposed to the precautions an injurer takes given the level of activity (whether a firm uses a smoke scrubber while producing, whether a person exercises care when driving). Regulation and the negligence rule are most often concerned with precautions taken but not with the level of activity: a factory may be required by regulation to install smoke scrubbers but not to reduce its output. Thus injurers may not have incentives to moderate their level of activity although that would be desirable (their activity may result in harm despite the exercise of optimal precautions — even with smoke scrubbers, some pollution will result).
55Weitzman’s (1974) conclusion that regulation could be superior to taxation rests on his assumption that the state must, in advance, set a corrective tax rate that is independent of the quantity of pollution. Yet, when the marginal harm
depends on the quantity of pollution, the optimal tax rate depends on the quantity of pollution. See Roberts and Spence (1976). Kaplow and Shavell (1997) emphasize that taxes that depend on quantity are usually feasible to implement and are superior to quantity regulation.
56To be sure, tradeable permit regimes are themselves superior to quantity constraints imposed at the level of individual firms because trading allows a given total pollution target to be reached at minimum cost.
By contrast, under the corrective tax and strict liability, injurers pay for harm done, so that they will optimally moderate their level of activity (as well as efficiently choosing their level of precautions).
A fourth pertinent factor, noted above, is the ameliorative behavior of victims. Under regulation, corrective taxation, and other approaches that do not compensate victims for their harm, victims have a natural incentive to take optimal precautions (or to relocate) because they bear their residual losses; they will want to take any precaution (install air filters to reduce pollution) whose cost is less than the reduction in harm it accomplishes. Under a strict liability rule, however, a victim might not have such an incentive because he would be compensated for his losses. But under a negligence rule, victims are not compensated if injurers have behaved properly, and, under strict liability, compensation might be given only to victims who took optimal precautions (if this can be determined).57
Still another factor is administrative costs, the costs borne by the state in applying a legal rule and the legal and related costs borne by the affected parties (aside from direct costs, such as the costs of precautions). Liability rules possess a general administrative cost advantage over regulation in that under liability rules, administrative costs are incurred only if harm is done. This advantage may be significant when the likelihood of harm is small. Nevertheless, administrative costs will sometimes be lower under other approaches. For example, compliance with a regulation may readily be detected in some circumstances (determining whether factory smokestacks are sufficiently high would be easy) and also may be accomplished through random monitoring, saving enforcement resources. Also, imposing corrective taxes might be inexpensive. Notably, suppose that they are levied at the time of the purchase of a product. Last, the ability of injurers to pay for harm is of relevance. For liability rules to induce potential injurers to behave appropriately, injurers must have assets sufficient to make the required payments; otherwise they will have inadequate incentives to reduce harm, as discussed in section 2.6. Where the inability to pay is a problem, bonding requirements may be helpful, and regulation may become more appealing (although it may need to be enforced through the threat of nonmonetary, criminal sanctions). In addition, corrective taxes have an advantage over liability rules when harm is probabilistic because, under the corrective tax, an injurer would pay only the expected harm (with certainty) rather than the actual harm (if there is a 1% chance of causing$1,000,000 of harm, the payment would be only $10,000). Many firms that would be able to pay the tax and thus have correct incentives would not be adequately deterred under a liability rule, on account of their inability to pay for harm when it actually occurs.
57For further discussion of this aspect of liability rules, see section 2.1.
3.6.3. Resolution of externalities through bargaining by affected parties. Parties affected by unregulated externalities will sometimes have the opportunity to make mutually beneficial agreements with those who generate the externalities. In the classic example, if a factory’s pollution causes harm of $1,000 that can be prevented by installing a smoke scrubber that costs $100, then, in the absence of any legal obligation on the factory, one would expect a
potential victim of pollution to pay the factory to install the scrubber. An agreement for any amount between $100 and $1,000 would be mutually beneficial. Let us first consider this possibility and then evaluate its significance.
If it is posited that there are no obstacles to reaching a mutually beneficial agreement concerning externalities, then that will occur. This tautology is one version of the Coase Theorem; Coase (1960) stressed the point that externality problems could be remedied through private bargains. A closely related version of the Coase Theorem asserts that the outcome as to the externality — whether a smoke scrubber is installed or pollution is generated — does not depend on the legal rule that applies. For example, if the scrubber costs $100 and there is no law that controls pollution, a bargain as we have described it will come about and the scrubber will be installed; and likewise if there is a law that leads to installation of the scrubber, the same will happen.58 The outcome, however, might be affected by the legal rule because of the level of wealth of parties. Most obviously, the potential victims might not have assets sufficient to pay for the scrubber, in which case the scrubber would not be installed unless a legal rule leads to this; moreover, legal rules may affect the distribution of wealth and thus the demand for goods, including that of being free from pollution.59 There are, however, many obstacles to bargaining. Bargaining may fail to occur when victims are numerous and face collective action problems in coming together. This is often the situation with respect to victims of industrial pollution. Similarly, in important contexts, bargaining will be impractical because victims will not know in advance who will injure them; this is the case for automobile accidents and most other accidents between strangers.
58Similarly, if there is a law permitting victims to enjoin factories from polluting but pollution does less harm than it costs to prevent, the factory would pay the victim to forgo the injunction, resulting in the same outcome — pollution — as would occur with no regulation of pollution.
59The outcome following from a legal rule might also be affected by an “endowment effect,” wherein individuals’ valuations depend on whether or not they originally enjoy legal protection. See Kahneman, Knetsch, and Thaler (1990).
3.7. Property Rights in Information
Legal systems accord property rights in information, including inventions, books, movies, television programs, musical compositions, computer software, chip design, created organisms, and trademarks. The generation and use of such information, and therefore the law governing it, is growing increasingly important in modern economies. We divide our review of this subject into three parts: First, we discuss certain information like an invention that can be used repeatedly to produce something; here we discuss patent, copyright, and trade secret law. Second, we examine diverse other types of information, such as where oil is likely to be located, and its legal protection. Third, we consider labels of various types and their protection under trademark law.
3.7.1. Inventions, compositions, and other intellectual works of repeat value. The classic forms of intellectual works that receive legal property rights protection are inventions and literary, musical, or other artistic compositions. The well-known description of socially optimal creation and use of such intellectual works is as follows. First, it is socially optimal for an intellectual work, if created, to be used by all who place a value on it exceeding the marginal cost of producing or disseminating the good (or service) embodying it; thus a new mechanical device should be used by all who place a value on it exceeding the cost of its manufacture, and a book by all who value it more highly than its printing cost.
Second, an intellectual work should be created if the cost of doing so is less than its total value to the public, net of production cost. Given this description of social optimality, the advantages and disadvantages of property rights in intellectual works are apparent. In the absence of property rights, a creator of an intellectual work will obtain profits from it only for a limited period — until competitors are able to copy the creator’s work. Thus, the generation of intellectual works is likely to be suboptimal.
But if there exist property rights, whereby a creator of an intellectual work obtains a monopoly in goods embodying the work, incentives to produce the works will be enhanced (although they will still be less than ideal because innovators do not capture all of the surplus that their works create).60 The major drawback to intellectual property rights, however, is that monopoly pricing leads to socially inadequate production and dissemination of intellectual works.61 This problem can be severe where the monopoly price is much higher than the cost of production.
60Kitch (1977) emphasizes a somewhat different view, under which patent rights are often granted at an early stage of invention, and the rights allow their holders to develop the inventions into commercially viable products.
61Relatedly, subsequent innovators whose inventions depend on prior patented works will need to obtain licenses from existing patent-holders, and hold-up problems may arise. See Chang (1995), Green and Scotchmer (1995), and Heller and Eisenberg (1998).
A good example is computer software, which may be sold for hundreds of dollars a copy even though its cost of dissemination is essentially zero. Another problem (with patent rights in particular) is the race to be the first to develop intellectual works. Given that the rights are awarded to whoever is first, a socially wasteful degree of effort may be devoted to winning the race, for the private award of the entire monopoly profits may easily outweigh the social value of creating a work before a competitor does.62
Patent law and copyright law are the most familiar forms of legal intellectual property right protection.63 The extent of protection afforded by each body of law is partial in various dimensions, however, so that they might be considered to represent a compromise between providing incentives to generate intellectual works and mitigating the monopoly problem. Patents and copyrights are limited in time (usually 20 years for patents, and the author’s lifetime plus 50 years for copyrights) and also in scope.64 As an example of the latter, the copyright doctrine of fair use often allows a person to copy short portions of a copyrighted work. This probably does not deny the copyright holder significant revenues (a person would be unlikely to purchase a book just to read a few pages), and the transaction costs of the copier having to secure permission would be a waste and might discourage his use.
A distinct form of legal protection is trade secret law, comprising various doctrines of contract and tort law that serve to protect not only processes, formulas, and the like that might be protected by patent or copyright law, but also other commercially valuable information such as customer lists. An example of trade secret law is the enforcement of employment contracts stipulating that employees not use employer trade secrets for their own purposes.
A party can obtain trade secret protection without having to incur the expenses and satisfy the legal tests necessary for patent or copyright protection. Also, trade secret protection is not limited in duration (Coca-Cola’s formula has been protected for over a century). However, trade secret protection is in some respects weaker than patent protection; notably, it does not protect against reverse engineering or independent discovery. On the economics of trade secret law, see Friedman, Landes, and Posner (1991).
62The economic literature on intellectual property, focusing on patents, is discussed in Scherer and Ross (1990) and Tirole (1988); see also the historical review in Machlup (1958) and Reinganum’s (1989) survey on the timing of innovation.
An interesting and basic alternative to property rights in information is for the state to offer rewards to creators of information and for information that is developed to be made available to all who want it.65 Thus, under the reward system, an author of a book would receive a reward from the state for the writing of the book — possibly based on sales of the book — but anyone who wanted to print it and sell it could do so. Like the property rights system, the reward system encourages creation of information because the creator gains from producing intellectual works.
But unlike the property rights system, the reward system results in the optimal dissemination of information because the intellectual works are placed in the public domain; anyone may use them for free. Hence, the reward system may seem to be superior to the property rights system. A major problem with the reward system, however, is that the state needs information about the value of innovations to determine rewards. We note that, to some degree, society does use a system akin to the reward system in that it gives grants and subsidies for basic research and for other intellectual works. But society does this largely when these intellectual works do not have direct commercial value.
63See Besen and Raskind (1991) and Landes and Posner (1989).
64See Gilbert and Shapiro (1990), Kaplow (1984), Klemperer (1990), and Scotchmer (1996, 1998).
65See Kremer (1998), Shavell and van Ypersele (1998), and Wright (1983).
3.7.2. Other types of information. There are many types of information different from what we have discussed above. One type of information is that which can be used only a single time, for example, where oil is located under a particular parcel of land. With regard to this type of information, there is sometimes no need for property rights protection. If the party who possesses the information can use it himself (to extract the oil), then once he does so, the issue of others learning it becomes moot — there will be no further value to the information. To the degree, though, that the party is unable to use the information directly (perhaps he cannot conveniently purchase drilling rights), his having property rights in the information might be valuable and beneficially induce the acquisition of information.66 Moreover, we observe that giving property rights in the information will not undesirably reduce the use of information when the optimal use of it is only once. In fact, the legal system usually does furnish property rights protection in such information as where oil is located through trade secret law and allied doctrines of tort and contract law.67
Another type of information is that relevant to future market prices. Here, the private and the social value of gaining such information can diverge, as emphasized by Hirshleifer (1971). For example, a person who first learns that a pest has destroyed much of the cocoa crop and that cocoa prices are therefore going to rise can profit by buying cocoa futures. The social value of his information inheres principally in any beneficial changes in non-financial behavior that it brings about. For example, an increase in cocoa futures prices might lead candy producers to reduce wastage of cocoa or to switch from chocolate production to production of another kind of candy. But the profit that a person with advance information about future cocoa prices makes can easily exceed its social value (suppose he obtains his information only an hour before it would otherwise become available, so that it has no social value) or fall short of its social value (suppose that he obtains information early on, but that his profits are low because he has limited funds to invest in futures).
Hence, it is not evident whether it is socially desirable to encourage acquisition of such information about price movements by giving individuals property rights in the information. The law does not generally discourage such information acquisition (but an exception is regulation of trading based on insider information), and the law often encourages acquisition through trade secret protection.68
66In addition, firms may need to be able to prevent employees from diverting a firm’s benefit to themselves.
67See also our discussion of disclosure in section 4.1.2 on contract law.
68On insider trading, see Leland (1992) and Scott (1998).
Last, consider information of a personal nature about individuals. The cost of acquiring this information is the effort to snoop, although the information is sometimes adventitiously acquired, so costless. The social value of the information involves various complexities. The release of information of a personal nature to the outside world generally causes disutility to those persons exposed and utility for others, the net effect of which is ambiguous. Further, a person’s behavior may be affected by the prospect of someone else obtaining information about him: he may be deterred from socially undesirable behavior (such as commission of crimes) or from desirable but embarrassing-if-publicly-revealed behavior, and he may make costly efforts to conceal his behavior. Thus, there are reasons why the acquisition and revelation of personal information are socially undesirable, and reasons as well why they might be socially beneficial. The law penalizes blackmail and in this way attempts to discourage profit from acquisition of personal information. But otherwise the law does not generally retard the acquisition of personal information, and it also extends limited property rights in such information; notably, an individual who wants to sell to a publisher personal information he has obtained usually can do so.69.As this brief discussion has illustrated, the factors bearing on the desirability of protecting property rights in information vary significantly according to the type of information and call for analysis quite different from that concerning information of repeat value that we considered above.
3.7.3. Information valuable as labels. Many goods and services are identified by labels. The use of labels has substantial social value because the quality of goods and services may be hard for consumers to determine directly. Labels enable consumers to make purchase decisions on the basis of product quality without going to the expense of independently determining their
quality (if this is even possible). A person who wants to stay at a high quality hotel in another city can choose such a hotel merely by its label, such as “Ritz Hotel”; the consumer need not directly investigate the hotel. In addition, sellers will have an incentive to produce goods and services of quality because consumers will recognize quality through sellers’ labels. The existence of property rights in labels — that is, the power of holders of the rights to prevent other sellers from using holders’ labels — is necessary for the benefits of labels to be enjoyed.
In view of the social value of property rights in labels, it is not surprising that the legal system allows such rights, according to trademark law. Also, trademarks are of potentially unlimited duration (unlike patents or copyrights), which makes sense because the rationale for their use does not wane over time.
69See Ginsburg and Shechtman (1993), Posner (1993b), and Shavell (1993c).
The private and social functions of contracts and of contract law are examined here. In section 4.1 the basic theory of contracts is considered, in section 4.2 production contracts (which have been the focus of a substantial literature) are analyzed, and in section 4.3 several other types of contract are discussed.
4.1. Basic Theory
4.1.1. Definitions and framework of analysis. A contract is a specification of the actions that named parties are supposed to take at various times, as a function of the conditions that then obtain. The actions usually comprise delivery of goods, performance of services, and payments of money, and the conditions include uncertain contingencies, past actions of parties, and messages sent by them. A contract is said to be complete if the list of conditions on which the actions are based is exhaustive, that is, if the contract provides explicitly for all possible conditions. Otherwise, a contract will be referred to as incomplete. Typically, incomplete contracts do not include conditions which, were they easy to include, would allow both parties to be made better off in an expected sense. It should be noted that an incomplete contract may well not have literal gaps in
that it covers all conditions at least by implication. Consider, for example, a contract stating merely that a specified price will be paid for a named quantity of wheat. Although this contract is incomplete because it does not mention many contingencies that might affect the buyer or the seller of wheat, it has no gaps, as it stipulates what the parties are to do (pay a price, deliver wheat) in all circumstances. A contract is Pareto efficient if it is impossible to modify in a manner that raises the expected utility of both of the parties; such a contract will sometimes be referred to simply as efficient or as mutually beneficial.
Contracts are assumed to be enforced by a tribunal, which will usually be interpreted to be a state-authorized court, but it could also be another entity, such as an arbitrator or the decision making body of a trade association or a religious group. (Reputation and other non-legal factors may also serve to enforce contracts but will not be examined here.70)
70See, for example, Bernstein (1992, 1998), Charny (1990), Greif (1998), and Klein and Leffler (1981).
Enforcement refers to actions taken by the tribunal when parties to the contract decide to come before it. Tribunals may impose money sanctions — so-called damages — for breach of contract or insist on specific performance of a contract — require parties to do what a contract specifies (for example, convey land). Tribunals may also fill gaps, settle ambiguities, and override terms in contracts.
4.1.2. Contract formation. The formation of contracts is of interest in several respects. Search effort. Parties expend effort in finding contracting partners, and it is apparent that their search effort will not generally be socially optimal. On one hand, they might not search enough: because the surplus gained when one party locates a contract partner will generally be divided between them in bargaining, the private return to search may be less than the social return.
On the other hand, parties might search more than is socially desirable because of a negative (“common pool”) externality associated with discovery of a contract partner: when one party finds and contracts with a second, other parties are thereby prevented from contracting with that party.71 Both of these externalities arise in Diamond and Maskin (1979), who examine a specific model of search and contracting. Although policies to promote or to discourage search might be desirable, one wonders whether social authorities could obtain the information needed to determine the nature of problems with search effort.
Mutual assent and legal recognition of contracts.
A basic question that a tribunal must answer is at what stage of interactions between parties does a contract become legally recognized, that is, become enforceable. The general legal rule is that contracts are recognized if and only if both parties give a clear indication of assent, such as signing their names on a document. This rule obviously allows parties to make enforceable contracts when they so desire. Moreover, because the rule requires mutual assent, it protects parties against becoming legally obligated against their wishes. Thus, it prevents the formation of what would be undesirable contracts, and it means that search for contracting partners will not be chilled due to the risk of unwanted legal obligations.
However, certain legal doctrines sometimes result in parties becoming contractually bound without having given their assent; there exist cases in which a party became contractually bound when the other party with whom he was negotiating made substantial investments in anticipation of contract formation. This legal policy not only may result in undesirable contracts, it may also induce wasteful early investment as a strategy to achieve contract formation. It is true that early investment is sometimes efficient, but a party who wants to make early investment could attempt to advance the time of contract formation or make a preliminary contract about the matter. See Bebchuk and Ben-Shahar (1996), Craswell (1996), Katz (1996), and Wils (1993).
71Compare our discussion in section 3.5.1 of excessive incentives to search for unowned property.
Offer and acceptance. Mutual assent sometimes is not simultaneous; one party will make an
If an offeror is held to his terms, offerees will often be led to invest effort in investigating contractual opportunities. Otherwise offerees might be extorted by offerors if the offerees expressed serious interest after investigation. The anticipation of such offeror advantage-taking would reduce offerees’ incentive to engage in investigation and thus diminish mutually beneficial contract formation. Hence, it may be in offerors’ and society’s interests for offered terms to be enforced for some period of time. Yet offerors’ circumstances may change, making it privately and socially advantageous for them to alter contract terms. On this and other issues concerning offer and acceptance, see Craswell (1996) and Katz (1990b, 1993).
Disclosure. The law may impose an obligation to disclose private information at the time of contract formation.72 Such a legal duty is beneficial in the respect that disclosed information may be desirably employed by the buyer; suppose, for instance, that he learns from the seller that the basement of his new house leaks and thus decides not to store valuables there. However, as initially emphasized by Kronman (1978a), a disclosure obligation discourages parties from investing in acquisition of information. For example, a company might decide against conducting aerial surveys to determine the mineral-bearing potential of land if it would be required to disclose
its findings to sellers of land, as sellers would then demand a price reflecting the value of the land.The social welfare consequences of the effect of a disclosure obligation on the motive to acquire information, analyzed in Shavell (1994), depend on whether the information is socially valuable or mere foreknowledge, on whether the party acquiring information is the buyer or the seller, and on inferences that would be made from silence.73
Duress and emergency. Even if both parties have given their assent, a contract will not be recognized if it was made when one of the parties was put under undue pressure, as when he is physically or otherwise threatened by another. This legal rule has virtues similar to those of laws against theft; it reduces individuals’ incentives to expend effort making threats and to defend against them.In addition, contracts may not be legally recognized if they are made in emergency situations, such as when the owner of a ship in distress promises to pay an exorbitant amount for rescue. Non enforcement in such situations beneficially provides victims with implicit insurance against having to pay high prices, but it also reduces incentives for rescue (yet rescue incentives might tend to be excessive, for the general reasons that there is excessive fishing effort).74
72For discussions of various ways that asymmetry of information affects the contract terms that parties will agree to when there is no compulsory disclosure, see Ayres and Gertner (1989), Bebchuk and Shavell (1991), Spier (1992b), and Stole (1992).
73On inferences from silence in other contexts, see Fishman and Hagerty (1990), Grossman (1981), and Milgrom (1981).
74On rescue, see Landes and Posner (1978).
4.1.3. Why contracts and their enforcement are valuable to parties. At the most general level, parties make contracts when they have a need to make plans. They want contracts enforced to ensure that promised payments are made and to prevent opportunistic behavior that otherwise might occur over the course of the contractual relationship and stymy fulfillment of their plans. There are two basic contexts in which parties make enforceable contracts.
The first is that concerning virtually any kind of financial arrangement. The necessity of contract enforcement here is transparent. For example, because borrowers would not be forced to repay loans in the absence of contract enforcement, loans would be unworkable without enforcement. In financial arrangements, there is often a party who extends credit to another for some time period, and contract enforcement prevents his credit from being appropriated, which would render the arrangements impossible. In addition, financial contracts that allocate risk would generally be rendered useless without enforcement, even though there might not be an initial extension of credit, because once the risky outcome became known, one of the parties would not wish to honor the contract.
The second context in which parties make enforceable contracts involves the supply of custom or specialized goods and services — those which cannot simply be purchased on a spot market in a simultaneous exchange for money. The need for enforcement of agreements for supply of custom goods and services inheres mainly in averting what is often described as the holdup problem (discussed further in sections 4.2.1 and 4.2.2). To illustrate, consider a buyer who wants a custom desk which would be worth $1,000 to him and would cost $700 for a seller to produce. In the absence of contract enforcement, the buyer will not pay the seller in advance (for the seller could walk away with what he receives).
The buyer will pay the seller only after the seller makes the desk. But at that point, the seller’s production cost is sunk and he is vulnerable to holdup; the situation is that he has a desk which, being custom-made, has little or no alternative value.75 The outcome of bargaining between him and the buyer might thus be a price lower than the seller’s cost of $700; say the price is $500. If so, and the seller anticipates receiving only the $500 price, he will not produce the desk. This is true even though production and sale at a price between $700 and $1,000, such as $800, would be mutually beneficial for the seller and the buyer.
75Similar forms of holdup would arise in the absence of contract enforcement where parties want to convey property that already exists, such as land; for instance, a seller might worry about being held up by the buyer if he waits and forgoes a present opportunity to sell his land to a new party who makes a bid for it
Enforcement of the buyer’s promise to pay $800 for the desk on delivery, or of the seller’s promise to produce and deliver the desk (if the buyer paid the price of $800 in advance), is thus desirable for the parties.
More broadly, enforcement of contracts will stimulate all manner of investments which, like the seller’s expenditure on production, have specific value in a contractual relationship.
Enforcement will lead buyers to train workers to use new contracted-for equipment, sellers to engage in research to reduce production costs, and so forth. In the absence of contract enforcement, there would be too little investment in these things, for, at the final stage of negotiation for performance and for payment, each side would be subject to holdup by the other, so would tend to obtain only a part of the surplus created by its investment.
The foregoing idea of contract enforcement as a cure for holdup-related underinvestment was initially stressed in the economics literature by Klein, Crawford, and Alchian (1978), Grout (1984), and Williamson (1975). However, the general notion that contract enforcement is privately and socially desirable because it fosters production and trade is made (usually with little articulation) by most writers on contract law and, one supposes, has always been appreciated.
See, for example, Farnsworth (1982, pp. 16-17) and Pound (1959, pp. 133-134).
4.1.4. Incomplete nature of contracts and their less-than-rigorous enforcement.
Although enforceable contracts are desirable, they are observed to be substantially imperfect. They are significantly incomplete, leaving out all manner of variables and contingencies that are of potential relevance to contracting parties, and they also often fail to employ included variables in a
mutually beneficial manner. Moreover, contracts are not enforced rigorously, despite the seeming strength of the reasons for contract enforcement: penalties for violation of contractual obligations are often modest, and breach is not an uncommon event. There are three important reasons for the incompleteness of contracts. The first is the cost of writing more complete contracts.
Parties may not include variables in a contract, or not in a detailed, efficient way, due to the cost of evaluating, agreeing upon, and writing terms. (In particular, parties will tend not to specify terms for low probability events, because the expected loss from this exclusion will be minimal, whereas the cost of including the terms is borne with certainty.)
The second reason for incompleteness is that some variables (effort levels, technical production difficulties) cannot be verified by tribunals.76 Of course, many such variables can be made verifiable (effort could be made verifiable through videotaping), but that would involve expense.
The third reason for the incompleteness of contracts is that the expected consequences of incompleteness may not be very harmful to contracting parties. Incompleteness may not be harmful simply because a tribunal might interpret an imperfect contract in a desirable manner. In addition, as we shall see, the prospect of having to pay damages for breach of contract may serve
as an implicit substitute for more detailed terms. Furthermore, the opportunity to renegotiate a contract often furnishes a way for parties to alter terms in the light of circumstances for which contractual provisions had not been made. Finally, in some settings parties’ concern for their reputation may induce them to refrain from opportunistic behavior. That contracts are less than rigorously enforced is intimately related to their incompleteness.
For incomplete contracts not to disadvantage parties, tribunals must be able to reinterpret or override imperfect contractual terms rather than always enforce these terms as written. Also, for damage measures to be employed beneficially by parties, notably for parties to be able to escape from contractual obligations when performance and renegotiation are difficult, damages for breach must not be excessive. Additionally, for parties to avoid bearing high risks in the form of
payments that they would be induced to make when renegotiating imperfect contractual terms, the damages for breach must again not be severe. These points will be expanded in the discussion below of contract interpretation, remedies for breach, and renegotiation.
4.1.5. Interpretation of contracts. Contractual interpretation, which includes a tribunal’s filling gaps, resolving ambiguities, and overriding literal language, can benefit parties by easing their drafting burdens or reducing their need to understand contractual detail.77 For example, if it is efficient to excuse a seller from having to perform if his factory burns down, the parties need not incur the cost of specifying this exception in their contract, assuming that they can trust the tribunal to interpret their contract as if the exception were specified.78
76The problem of unverifi ability of variables is diminished by the possibility that parties can plan in their contract to use a tribunal of experts in their area, such as individuals in the same business as the contracting partners. In many
industries, this practice is common.
77On various aspects of contract interpretation, see, for example, Ayres and Gertner (1989), Hadfield (1994), and Schwartz (1992).
78Another example, where it may be efficient for a tribunal to override particular terms that appear in contracts, is when a seller offers only a detailed, fine-print contract in conjunction with the sale of an inexpensive good or service. Because it would be irrational for consumers to read such contracts, sellers would have incentives to include inefficient, onesided terms if such terms would be enforced. See Katz (1990c). The extent to which such contracts will be problematic will depend on the fraction of consumers who are informed about contract terms and shop among competing sellers. See Schwartz and Wilde (1979).
It may be worthwhile elaborating somewhat by viewing contract interpretation more formally, as a function that transforms the contract individuals write into the effective contract that the tribunal will enforce. Given a method of interpretation, parties will choose contracts in a constrained-efficient way. Notably, if an aspect of their contract would not be interpreted as they want, the parties would either bear the cost of writing a more explicit term that would be respected by the tribunal, or else they would not bear the cost of writing the more explicit term and accept the expected loss from having a less than efficient term. The best method of contract interpretation will take this reaction of contracting parties into account and can be regarded as implicitly minimizing the sum of the costs the parties bear in writing contracts and the losses resulting from inefficient enforcement.79
4.1.6. Damage measures for breach of contract. When parties breach a contract, they often have to pay damages in consequence. The damage measure, the formula governing what they should pay, can be determined by the tribunal or it can be stipulated in advance by the parties to the contract.80 One would expect parties to specify their own damage measure when it would better serve their purposes than the measure the tribunal would employ, and otherwise to allow the tribunal to select the damage measure. In either case, we now examine the functioning and utility of damage measures to contracting parties (assuming here that there is no renegotiation of contracts). Clearly, the prospect of payment of damages is an incentive to perform contractual obligations, and thus generally promotes enforcement of contracts and the goals of the parties, as
discussed in section 4.1.3. As emphasized in section 4.1.4, however, damages for breach in fact are not chosen to be so high that they virtually guarantee performance of contracts as written. Under the commonly employed expectation measure, damages equal the amount that compensates the victim of breach for his losses; these damages are often quite willingly paid by a party who commits a breach.
Why are damages not chosen to be so high as to guarantee performance? An important explanation is that parties do not always want performance of the less-than-complete contracts that they write. For example, suppose that a contract is very incomplete: it merely states, “The seller will produce a custom desk for the buyer and receive full payment of $800 in advance.”
79The determination of the optimal method of interpretation may involve subtleties. For example, according to the optimal method, a term might not be interpreted in the way that is best in the majority of transactions. Suppose that term A is best in the majority of transactions and that the parties to these transactions can include A explicitly, at little cost on a percontract basis, because they are repeat players. Suppose that term B is best only in the minority of transactions, but that for the parties to these transactions to include B explicitly will not be cheap on a per-contract basis because they are not repeat players. Then the optimal method of interpretation would make B the default term even though it is best only in a minority of transactions.
80A contractual provision that states a particular amount of damages is referred to as a liquidated damages clause.
The buyer and the seller do not really want the desk always to be produced. It is readily shown that, had they made a Pareto efficient complete contract, they would have specified that there should be performance if and only if the production cost is less than the $1,000 value of the desk to the buyer. (For instance, in a complete contract, they would have jointly decided against a contractual term specifying performance when the production cost is $2,000, for the seller would have been willing to reduce the contract price sufficiently to induce the buyer to strike the term.)
Now if the incomplete contract calling for the desk always to be produced is enforced by the expectation measure of damages of $1,000, the seller will behave exactly as he would have under the Pareto efficient complete contract, that is, he will perform if and only if the production cost is less than $1,000. Higher damages than the expectation measure might induce performance when it is inefficient, and lower damages might lead to breach when that is inefficient. Indeed, for this reason, the parties would often agree to choose the expectation measure over other measures of damages.
This understanding of damage measures as a device to induce the behavior that the parties would have specified in more complete contracts sheds light on the notion held by some legal commentators and philosophers that contract breach is immoral, that it constitutes the breaking of a promise. That belief is often incorrect, it is submitted, and might fairly be considered to be the opposite of the truth. The view that a contract breach is the breaking of a promise overlooks the point that the contract that is breached is generally an incomplete contract, and that the “breach” constitutes behavior that the parties truly want and would have provided for in a complete contract. In the example of the simple incomplete contract calling for a desk to be produced, the seller who finds that his production cost would be $2,000 will commit breach under the expectation measure. But in so doing, he will be acting precisely as would have been set out in a complete contract, and it is that contract which is best regarded as the promise between the parties that ought to be kept.
The point that a moderate damage measure, and in particular the expectation measure, is desirable because it induces performance if and only if the cost of performance is relatively low was apparently first clearly stated (informally) in Posner (1972), who emphasized the social efficiency of the measure. Shavell (1980b) formally demonstrated this and also stressed the mutual desirability of the measure for contracting parties and its role as a substitute for more complete contracts.81
81Two other writers, Birmingham (1970) and Barton (1972), adumbrate these points, although the meaning of their articles is at times obscure. See also Diamond and Maskin (1979), who consider damage measures in analyzing search behavior
Several more comments should be made about damage measures and incentives. First, damage measures influence the motive of contracting parties to make reliance investments (so called because the investments are made relying on contract performance). Reliance investments are illustrated by the earlier-noted instance of a buyer training workers to use a contracted-for machine or by advertising the contracted-for appearance of an entertainer. Under the expectation measure, there is a tendency for reliance investment to exceed the Pareto efficient level: the buyer will treat an investment like advertising as one with a sure payoff — either he will receive performance or receive expectation damages, a form of insurance — whereas the actual return to investment is uncertain, due to the possibility of breach (advertising will be a waste if the entertainer does not appear). This tendency toward overreliance due to the receipt of contract damages was initially noted in Shavell (1980b), and stands in contrast to the problem of inadequate reliance investment associated with lack of contract enforcement. The issue of reliance investments has been elaborately analyzed, as will be described in section 4.2.2.
A second comment is that the value of damage measures as an incentive toward efficient performance would not exist if renegotiation of contracts in problematic contingencies would always result in efficient performance. But, as will be discussed below, it seems plausible that renegotiation would not always result in efficiency.
An important function of damage measures which is quite distinct from their incentive role concerns risk-spreading and compensation. Notably, because the expectation measure compensates the victim of a breach, the measure might be mutually desirable as a form of insurance if the victim is risk averse. However, the prospect of having to pay damages also constitutes a risk for a party who might commit breach (such as a seller whose costs suddenly rise), and he might be risk averse as well. The latter consideration may lead parties to want to lower damages (see Polinsky 1983) or to avoid use of damages as an incentive device, by writing more detailed contracts (for instance, the parties could go to the expense of specifying in the contract that a seller can be excused from performance when his costs are high).82 A full consideration of damage measures and efficient risk allocation would also take into account whether the risk that a party bears is detrimental or beneficial,83 whether the risk is monetary or non-monetary,84 and whether the parties can obtain insurance.
82When parties do not so specify in advance, certain legal doctrines may serve this function. See Joskow (1977), Posner and Rosenfield (1977), and Sykes (1990).
83For example, if a party wants to breach because he has a superior opportunity, optimal damages might be higher, although adjusting damages in the case of beneficial risks is not likely to matter as much on risk-bearing grounds.
84For example, if the victim’s loss is non-monetary, such as the loss due to failure of musicians to appear at a
wedding, financial compensation in the form of damages may not constitute an optimal form of insurance. See section 2.4.2.
4.1.7. Specific performance as a remedy for breach. As observed at the outset, an alternative to use of a damage measure for breach of contract is specific performance: requiring a party to satisfy his contractual obligation.85 Specific performance can be accomplished with a sufficiently high threat or by exercise of the state’s police powers, such as by a sheriff removing a person from the land that he promised to convey. (Note that if a monetary penalty can be employed to induce performance, then specific performance is equivalent to a damage measure with a high level of damages.)
It is apparent from what has been said about incomplete contracts and damage measures that parties should not want specific performance of many contracts that they write, for they do not wish their incomplete contracts always to be performed. It is therefore not surprising that, in fact, specific performance is not used as the remedy for breach for most contracts for production of goods and for provision of services. Additionally, it may be observed that specific performance might be peculiarly difficult to enforce in these contexts because of problems in monitoring and controlling parties’ effort levels and the quality of production.
However, specific performance does have advantages for parties in certain contexts, such as in contracts for the transfer of things that already exist, like land, and specific performance is the usual legal remedy for sellers’ breaches of contracts for the sale of land. This point is discussed briefly below, in section 4.3.1. On specific performance and its general comparison to damage remedies, see Bishop (1985), Kronman (1978b), Schwartz (1979), Shavell (1984b), and Ulen (1984). (Specific performance also is examined in some of the articles on production contracts cited in section 4.2.2.)
4.1.8. Renegotiation of contracts. Parties often have the opportunity to renegotiate their contracts when problems arise. Indeed, the assumption that they will do this has appeal because, having made an initial contract, the parties know of each other’s existence and of many particulars of the contractual situation. For this reason, much of the economics literature on contracts assumes that renegotiation always occurs when inefficiency would otherwise result; see, for example, Hart (1987), Hart and Holmstrom (1987), and Rogerson (1984).
Nevertheless, in many circumstances contracts will not be renegotiated because parties are not in contact with each other when difficulties are experienced and one party would benefit from acting quickly.
85Some economists have employed the term “specific performance” in an unconventional sense, to refer to
enforcement of all provisions in a contract, including any damage measure named in it. Thus, they would say that a contract is specifically performed when the parties name expectation damages in their contract and parties who breach are thus required pay these damages.
A problem may occur during the course of production and the producer may have to decide on the spot whether to abort the process or proceed at greater cost. Or a new bid may be heard and have to be immediately answered. Furthermore, even if the parties are in contact with one another, asymmetric information may lead to breakdowns in renegotiation.
In any event, let us assume that successful renegotiation tends to occur and consider how it affects the welfare of contracting parties. Plainly, renegotiation often allows parties to avert Pareto inefficient breach decisions. For example, if damages exceeding the expectation measure or specific performance were the remedy for breach, a seller might be led to perform when his production cost exceeds the value of performance to the buyer. To avoid this inefficient outcome, the seller might pay the buyer to release him from his obligation to perform. That renegotiation may result in performance if and only if it is efficient means, as we noted, that damage measures for breach are not necessary to accomplish this, and also helps to explain why contracts lack detail.
But even if renegotiation tends to occur, it may represent only a partial substitute for explicit contractual terms or for appropriate damage measures for breach. One reason (see section 4.2.3) is that renegotiation cannot affect actions that are taken before the time of renegotiation, which influence the likelihood of nonperformance; renegotiation can only affect future decisions about
breach. Another reason involves the allocation of risk-bearing. Consider, for instance, the substantial risks borne by a producer who may have to purchase a release from an obligation to perform when his production costs would be extremely high. Such risks could be mitigated by use of a clause excusing him from performance or by a damage measure such as expectation.
Additionally, the prospect of renegotiation affects the incentives of parties to invest in the contractual relationship. A party’s level of reliance investment will be inefficient if renegotiation results in the extraction of part of the surplus that the party’s reliance investment creates. Yet renegotiation is influenced by, among other elements, the damage measure that applies for breach; and if the damage measure is appropriately chosen, the damage measure together with renegotiation may, in principle, spur desirable reliance investment; see section 4.2.2.
One presumes that the ability to renegotiate is usually desirable for contracting parties, because it allows them to improve their situation when difficulties arise and to write simpler contracts than otherwise.
Thus, we would expect that parties will want their renegotiated contracts enforced, and the law generally does enforce renegotiated contracts. However, the ability to renegotiate can also work to the detriment of parties because they might thereby be prevented from committing themselves to particular outcomes in their initial contract. See Jolls (1997) and the literature cited therein, especially Fudenberg and Tirole (1990). Nevertheless, the law usually prevents parties from binding themselves not to renegotiate, even though that could in theory be done.86
4.1.9. Legal overriding of contracts. A basic rationale for legislative or judicial overriding of contracts is the existence of externalities. Contracts that are likely to harm third parties are often not enforced, for example, agreements to commit crimes, price-fixing compacts, liability insurance policies against fines, and certain simple sales contracts (such as for machine guns).87 Another general rationale for nonenforcement of contracts is to prevent a loss in welfare to one or both of the parties to contracts. This concern may motivate nonenforcement when a party is incompetent, lacks relevant information, or is in an emergency situation (see section 4.1.2).
The rationale also applies in the context of contract interpretation by tribunals; as discussed in section 4.1.5, contract interpretation may amount to overriding terms of contracts, and this may promote the welfare of contracting parties by allowing them to write simpler contracts.88
86It is true that parties will not usually be able to bind themselves against engaging in renegotiation, for they could ordinarily renegotiate in secret. However, as Jolls (1997) observes, one of the parties will usually prefer that the original contract be enforced, so that if the courts stand ready to enforce the original contract, renegotiation cannot result in a new contract. For example, in the standard principal-agent contract, after the agent exerts effort, the principal and the agent will have an incentive to arrange for the agent to be paid a constant amount. But if this were contemplated, then after the output is realized, the agent would have an incentive to assert the original contract if his pay would be higher according to it, and the principal would assert the original contract if he could pay less under it.
87See also Posner (1995), who suggests that such contractual limits as usury laws, which constrain consumers’ ability to borrow, might be justified by a type of externality: when high-risk borrowers fail, they may become eligible for social welfare programs, imposing costs on taxpayers.
88Also, at least in theory, nonenforcement of contracts might also be beneficial to parties where they would be led to include terms constituting wasteful signals of unobservable characteristics. See Aghion and Hermalin (1990).
Additionally, contracts sometimes are not enforced because they involve the sale of things said to be inalienable, such as human organs, babies, and voting rights. In many of these cases, the inalienability justification for lack of enforcement can be recognized as involving externalities or the welfare of the contracting parties.89
4.2. Production Contracts
In this section, the literature on production contracts is discussed. The first case considered is that where symmetrically informed, risk-neutral parties enter into contracts, and the only variables of concern are the value of performance and production cost. Then the case where parties make reliance investments to raise the value of the contract during the contract period is examined. Finally, several other issues, including risk-bearing and asymmetric information, are reviewed. Throughout, when remedies for breach are discussed, one can imagine them either to be chosen by the parties or by the courts.
4.2.1. Value of performance and production cost. Assume that a risk-neutral buyer and a risk-neutral seller have met; the seller faces uncertain production cost c, which he will learn before he decides whether to produce; v is the certain value of performance to the buyer; and the parties are symmetrically informed. The Pareto efficient outcome is for the seller to produce if and only if c < v. (That is, in a complete contract, with terms for all contingencies, performance would be required if and only if c < v; a change in the contract price would compensate a party for agreeing to alter a term from any initially considered contract under which performance does not occur if and only if c < v.)
In the absence of contract enforcement, then (amplifying on section 4.1.3) there would be too little production because the buyer would only pay the seller for actual delivery of the good and cannot guarantee the price. In particular, supposing that the seller would obtain a fraction á of the surplus from a transaction (á reflects bargaining strength), he would obtain a price of áv.
(After the seller produces the good, the surplus from the transaction would be v, presuming for
simplicity that the custom good has no alternative value for the seller.) Thus, the seller would decide to produce only when c < áv, rather than whenever c < v.
Suppose now that there is contract enforcement and that the parties are not able to renegotiate before the seller decides whether to produce (an assumption that is relaxed below). If c is verifiable by the tribunal, the parties could write a complete contract specifying performance if and only if c < v. The parties would want a damage measure d for breach of this contract to be sufficiently high to induce performance when c < v, and thus any d exceeding c would work.
89See generally Rose-Ackerman (1985) and Trebilcock (1993).
If c is not verifiable, the parties are able to write an incomplete contract specifying “The seller shall deliver the good to the buyer, who will pay price p at the outset,” accompanied by damages d for seller breach. Under such a contract, the seller will perform when c < d and will commit breach otherwise.90 If the expectation measure is employed, that is, d = v, the seller will perform if and only if c < v, so that performance will be efficient.91 If damages d exceed v, there will be excessive performance, as there will be if there is specific performance. If d is less than v, there will be too little performance. The points of these paragraphs were, as noted, emphasized in
Posner (1972) and Shavell (1980b). If, instead, it is assumed that the buyer and the seller can renegotiate their contract after c becomes known but before the seller decides whether to produce, then, given symmetric information, it is natural to suppose that there will always be Pareto efficient performance, regardless of d. Let us also note that if the buyer’s value v is uncertain as well as the seller’s production cost c, the major difference in the outcome is that, since v cannot be prescribed as damages in the contract, v must be verifiable for the expectation measure d = v to be applied by the tribunal (c still need not be verifiable).92
4.2.2. Reliance investment during the contract period. Now assume that parties can make investments during the period of the contract that affect its value v or the production cost c. Such investments are, as noted, sometimes called reliance investments, since they are made in anticipation of contractual performance. We will begin with the case in which just one party invests before discussing the case where both sides invest.
90Because we assume that the price p is paid at the outset, the seller faces cost c if he performs and will compare it to damages of d that he would have to pay if he breaches. If the price were to be paid only at the time of performance, then
the seller would perform if and only if c – p < d. Hence, the performance that is induced under d if the price is paid at the outset will be achieved under d¢ = d – p if the price is paid only at performance.
91A related issue concerns post-breach mitigation behavior of the buyer: efficiency requires that if there is a breach, the buyer should mitigate the consequences of breach by searching for alternative suppliers and the like. Let z be mitigation expenditure of the buyer to raise his post-breach value, say w(z). Efficiency requires the buyer to choose z to maximize w(z) – z; let z* be the optimal value of z. If y is the gross value of seller performance to the buyer, then we can define v, the net value of performance, as v = y – (w(z*) – z*). Thus, expectation damages for breach should equal this v, not the gross value y.
And if damages equal v, then the buyer will choose z* if he is the victim of a breach, and the net value of performance will actually be v. On this issue of mitigation of the consequences of breach, see, for example, Wittman (1981).
92However, if c is verifiable and v is not, Pareto efficient performance can be achieved by constructing the contract so that the buyer will commit breach by refusing to pay for performance when performance would be inefficient. Specifically, let the price p be paid at performance, and let damages for buyer breach be d = p – c, the seller’s profits. Then the buyer will breach and refuse performance whenever v – p < -(p – c), or when v < c. (If, as is realistic, it is assumed that p – c cannot be negative, then the parties can choose p high enough that it always exceeds c (assuming c is bounded), with the buyer being compensated for the high p through an up-front rebate.) The parties’ ability to determine who will make the breach decision, as described here, is emphasized in Edlin (1996).
93We comment in note 95 below on another case of reliance investment: where the party who chooses the reliance investment is the same party who faces uncertainty, such as where the seller chooses r to lower his production cost and faces uncertainty about his production cost.
In the absence of contract enforcement, there will be too little production, as before; it will occur only when c < áv(r). But now, in addition, the buyer will choose an incorrect value of r because he will only obtain a fraction 1 – á of the value created by investment.94 Assume next that there is contract enforcement and that the parties do not renegotiate before the seller’s production decision (we relax this assumption below). This is the setting analyzed in Shavell (1980b), who first studied reliance investment. If c and r are verifiable by the tribunal, the parties can write a contract specifying efficient performance (when c < v) and also specifying r*; again, they would want the contract enforced by a damage measure high enough to ensure performance, and any such measure of damages would serve their purposes.
Now assume that c and r are not verifiable, that the parties write a simple contract specifying “The buyer will pay price p at the outset and the seller will deliver the good to him,” and consider what occurs under different damage measures. If the expectation measure is employed, that is, d = v(r), the seller will perform when c < v(r), so that performance will be efficient. However, as
the buyer will always receive v(r) (either he obtains performance, worth v(r), or damages of that amount), he will choose r to maximize v(r) – r. Consequently, the buyer will select an inefficiently high r; the problem is that the buyer does not take into account that investment does not have any value when performance does not occur.95 Under a sophisticated expectation measure based on efficient investment, namely d = v(r*), however, investment as well as performance can be shown to be efficient.96
94Specifically, he will choose r to maximize (1-á)v(r)G(áv(r)) – r, so the first-order condition determining r is (1-á)v(r)G(áv(r)) + (1-á)v(r)áv(r)g(áv(r)) = 1, or (1-á)v(r)[G(áv(r)) + áv(r)g(áv(r))] = 1. Although one might expect r to be
less than r*, it is apparent from the latter first-order condition that there is a possibility that the r chosen would exceed r*. The reason is that increasing r raises the probability that the buyer will obtain performance from the seller.
95We observe that the problem of an inefficiently high r does not arise under the expectation measure where the seller makes the reliance investment to lower his production cost and also faces uncertainty about it. Specifically, suppose
that production cost is c(r,è), where è is an uncertain state of nature, cè > 0, and c r < 0. In this case, the seller will choose the efficient r; the explanation in essence is that then the seller obtains the benefit of his reliance only when there is performance. The efficient r is that maximizing
è(v,r) (v – c(r,è))g(è)dè – r, 0 where è(v,r) is the è such that c(r,è) = v. Thus, r* is determined by -cr(r,è))g(è)dè = 1. Now under the expectation measure,
the seller will perform when c < v and pay v otherwise. Thus, the seller chooses r to maximize è(v,r) p – c(r,è)g(è)dè – v(1 – G(è(v,r))) – r, 0 and, differentiation of this yields the same condition as that which determines r*. This point is noted in Shavell (1980b).
96If d = v(r*), the seller will perform when c < v(r*), so the buyer will maximize v(r)G(v(r*)) – v(r*)(1 – G(v(r*))) – r. Accordingly, r will be determined by v(r)G(v(r*)) = 1, and this condition is clearly satisfied at r*. The explanation is that the buyer’s choice of r affects his return only when he obtains performance.
Next assume that the parties do renegotiate after the reliance investment is made and before the seller decides about production, so that, assuming symmetric information, there will always be efficient performance. This version of the model of production contracts was originally studied by Rogerson (1984). Here, damage remedies may influence investment through their effect on the outcome of renegotiation. To illustrate, consider what would occur under specific performance. Under this remedy, as suggested earlier, there will be renegotiation in which the seller pays the buyer to be allowed not to perform whenever c > v(r), since then performance would be inefficient. In particular, the assumption is that the seller would pay the buyer v(r) + (1 – á)(c – v(r)) to be allowed not to perform; for v(r) is needed to compensate the buyer for not receiving performance, 1 – á is the buyer’s share of the surplus from renegotiation, and c – v(r) is that surplus. Anticipating this, the buyer can be shown to choose an r exceeding the efficient level.97
The nature of the results about reliance investment in the case with renegotiation are very close tom those where there is no renegotiation. Indeed, they are identical under the expectation measure, essentially because there is no renegotiation under the expectation measure; thus, with d = v(r), investment will be excessive because the buyer will always be compensated for his investment. Furthermore, under the sophisticated expectation measure based on efficient investment, d = v(r*), investment will be efficient. See Spier and Whinston (1995).. For example, under the expectation measure, investment will tend to be excessive for both parties, but performance will be efficient. Much recent literature, beginning with Hart and Moore (1988), has focused on this general situation, assuming that parties can renegotiate after reliance investments are made and è is revealed, and that they will always then agree on efficient production decisions because information is symmetric. The literature in question furthermore usually supposes that none of the variables (costs, values of performance, reliance investments) are verifiable by the tribunal. Thus, a contract can depend only on what is recorded in it, certain subsequent communications between the parties, whether there has been performance, and, if not, who committed breach.
97In general, the buyer will choose an r in between the excessive level he would choose under the expectation measure (determined by v(r) = 1) and r*. This can be explained as follows. If the buyer’s fraction of surplus is 0, he will receive v(r) whether or not there is renegotiation, so his situation will be the same as under the expectation measure. If the buyer’s fraction of surplus is 1, he will clearly choose r*. This suggests what can be shown, that if the buyer’s fraction of surplus is positive and less than 1, he will choose an r exceeding r* and less than the r chosen under the expectation measure. Of note are a number of results establishing the existence of contracts that will produce efficient outcomes, that is, in both parties choosing efficient levels of reliance investment (performance will always be efficient). Aghion, Dewatripont, and Rey (1994) and Chung (1991) demonstrate the efficiency result using a contract in which one party is effectively given the right to make a single take-it-or-leave-it offer to the other in renegotiation. It is evident that this party will invest efficiently, as he can extract in bargaining the full marginal return from his investment.
For instance, if the buyer has the right to make an offer and is paying the seller to perform, he will pay only the minimum needed to induce the seller to do so, and will obtain any increase in value v(r,è) due to his having chosen a higher r. Less apparent is how the other party is given an incentive to invest efficiently; that is accomplished by properly choosing the quantity of the good or the probability of delivery. (For instance, if the named quantity of the good is chosen to be higher than is likely to be efficient, the buyer will usually pay the seller to agree to lower the quantity. The amount the buyer will pay must compensate the seller for the profits he would have
made at that higher contracted-for quantity. But the profits the seller would have made at that quantity will depend on his investment in lowering production costs — thereby giving the seller an incentive to invest in lowering his production costs, and an incentive that is greater the higher the contracted-for quantity.) Also, Noldeke and Schmidt (1995) establish that a simple option contract will induce efficient investments for reasons that are closely related to those just reviewed. Additionally, Edlin and Reichelstein (1996) and Hermalin and Katz (1993) adduce contracts leading to efficiency under somewhat different conditions, and Rogerson (1992) shows that efficiency can be achieved under wide circumstances, but assuming that parties can commi not to renegotiate their contracts. Cooperative reliance investments. It has been assumed above that a reliance investment benefits directly only the party who makes it. Another possibility is that a reliance investment benefits the other party to the contract; importantly, suppose that a seller’s investment raises
product quality and thus value for the buyer. Such cooperative reliance investment is studied in Che and Chung (1999). As they emphasize, when cooperative investment cannot be verified by courts, then under the expectation measure, there will be too little investment (in contrast to the usual case under the expectation measure, where investment is excessive). Indeed, there will be no investment if the seller who makes a cooperative investment will not benefit directly or in damages he receives in the event of breach. Moreover, there is no contract that will result in efficient cooperative investment (again in contrast to the usual case); this point is stressed in Che and Hausch (1999), who also demonstrate that contracting offers no advantage over no contracting in a wide set of circumstances.98
98For recent attempts to provide a unified theoretical framework for the various problems concerning reliance discussed in section 4.2.2, see Maskin and Moore (forthcoming) and Segal and Whinston (1998). Asymmetric information. Another factor about production contracts that we have not examined is asymmetric information between the parties. When parties are asymmetrically
4.2.3. Further considerations. Risk-bearing. We have not discussed in this section on production contracts the allocation of risk among possibly risk-averse contracting parties, about which several comments should be made. First, if all variables are verifiable by a tribunal, the presence of risk-averse parties does not affect when it is Pareto efficient to perform; it continues to be efficient to perform if and only if c < v. However, efficiency requires that the resulting risk be allocated appropriately; for instance, if the seller is risk averse and the buyer risk neutral, the seller would be insured against fluctuations in c by the buyer’s paying him c plus a constant. In addition, the level of efficient reliance investment will generally be affected by considerations of risk-bearing. Second, when variables of relevance are not verifiable, then damage measures and other mechanisms that may be employed to induce efficient behavior when parties are risk-neutral have to be reconsidered. For instance, the expectation measure imposes risk on the party who might breach and pay these damages; if that party is risk averse, the expectation measure would become less attractive relative to lower measures of damages. Furthermore, as we earlier noted, renegotiation does not generally lead to efficient risk bearing, even though it may lead to efficient performance.informed, renegotiation of contracts might not be successful, so that it becomes more important that the initial contract induces efficiency. Hermalin and Katz (1993) show that efficiency can be achieved under certain types of asymmetry of information using a relatively complicated mechanism in the contract.
New entrants. We have not examined the possibility that new buyers would appear and bid for the seller’s good (a similar possibility is that new sellers would appear and make offers to the buyer). In this regard, it should be noted that it is Pareto efficient for the initial contracting parties that a sale be made to a new buyer if and only if his bid exceeds the contract buyer’s valuation.
Moreover, the contracting parties will want to maximize the amount that they can extract from a new buyer if he purchases the good. This observation raises the possibility that the buyer and the seller may wish to set damages for seller breach at a high level in order to induce a new party to bid more (which he would have to do to make it in the seller’s interest to commit breach). Such
an incentive of contracting parties to set damages at high levels can, though, result in too little breach and sale to new parties; thus, at least in principle, the incentive in question is a ground for tribunals not to enforce the high damage level specified by the contracting parties. This point was first made in Diamond and Maskin (1979) and has been refined in a number of articles; see Aghion and Bolton (1987) and Chung (1992).
However, Spier and Whinston (1995) observe that threeway renegotiation would seem to vitiate the advantage to the contracting parties of setting high damages. Yet they emphasize another reason (concerning induced reliance investment) that the parties will, after all, benefit from setting high damages. Precautions and probabilistic breach. It has been supposed throughout that breach occurs when a party decides not to perform, but often breach does not occur in this way: rather a party chooses a level of precaution which affects the likelihood of performance, and a random factor then determines whether breach or performance results. For example, a shipper’s care in packing dishes affects the likelihood that they will arrive unbroken, and a chance event (a jolt) determines whether they arrive broken or unbroken. In this setting, the conclusions reached about damage measures in the absence of renegotiation continue to apply: the expectation measure results in efficient precautions, the buyer’s reliance investment is excessive, and so forth. The very issue of
renegotiation is made moot because the precautions are chosen before breach might occur (if the dishes arrive broken, it is too late for renegotiation). See Cooter (1985), Craswell (1988), and Kornhauser (1983).
4.3. Other Types of Contract
4.3.1. Contracts for transfer of possession. A different contractual context from production is where something that already exists is to be conveyed to a buyer. Examples include contracts for transfer of real estate, goods in inventory, and durable goods. Here a major uncertainty of interest concerns bids by new parties. With regard to these bids, the points just discussed concerning new entrants apply; the parties would like for there to be a sale to a new buyer when he will pay more than the contract buyer’s valuation, and so forth. It is of interest to explore why contracting parties often adopt specific performance as the remedy for breach of contracts for transfer of possession, even though damage measures are commonly employed for other types of contract. Initially, suppose that the contract buyer and the contract seller have equal access to bids from new parties. Then the buyer’s always receiving the good does not result in any loss of opportunity to sell to a new party willing to bid a high amount;
that is, specific performance does not suffer from any clear disadvantage relative to damage measures that would allow the seller to breach and sell to a new party. Moreover, specific performance offers the parties an advantage over damage measures. Namely, because under specific performance it will always be the buyer who will be bargaining with a new party, the good will never be sold to a new party bidding less than the buyer’s valuation. In contrast, such a sale could occur if the seller might pay damages, commit breach, and bargain with a new party (suppose that bargaining is not three-way, involving the contract buyer as well). And, after such a sale, the buyer would have to obtain the good through repurchase from the new party, but in general this will be at a higher price than the seller obtained — meaning that some of the surplus would be shared with the new party. (Although the contracting parties would be worse off if the buyer repurchases at a higher price, society would not be worse off as the good would still be allocated to the user who places the highest value on it.) See Shavell (1984b) and Bishop (1985). The foregoing advantage of specific performance in preventing inefficient sales to new parties is clearly reduced if the buyer does not have equal access to bids from new parties (suppose that the seller is a dealer and the buyer is not). Also, the use of specific performance might increase transaction costs, if the new party turns out to purchase only after delivery of the good to the buyer.m Notice too that some of the disadvantages of specific performance in the production context are less significant in the present context of transfer of possession. In production contracts, specific performance imposes a possibly large risk of loss on sellers whose production costs might be very high; here, specific performance only reallocates a beneficial risk (of a sale at a high price) from seller to buyer. In addition, enforcement of specific performance in the context of contracts for transfer of possession is often easier than in the production context, where enforcement might involve policing the quality of production or services.
4.3.2. Donative contracts. An important category of contractual arrangement is donative, concerning gifts. Assuming that the motivation for gifts is altruism,99 a basic question is why a donor would want to defer his gift rather than make it immediately (in which case no contract would be required). The answers include the possibilities that the donor may face liquidity
constraints and that he may wish to wait for resolution of uncertainties concerning, among other factors, his own needs and future income, and the donee’s needs, future income, and subsequently revealed character. However, if the donee knows about the altruism of the donor, a contract may not be necessary to induce donee reliance activity; if so, a contract would be disadvantageous for the donor. On these issues, see Goetz and Scott (1980) and Shavell (1991a); and see also Posner (1977b) and Posner (1997).
99Other motives for gift giving exist (such as obtaining utility from expressions of gratitude from donees); some have similar implications to those of altruism.
4.3.3. Additional types of contract. In this section, mention has not been made of many additional types of contract, including principal-agent contracts, even though they have been studied, often intensively, in the economics literature. The omission of such contracts is explained in part by convention (by what is and is not considered to be a law and economics topic) and in part by the relative inattention that has been paid to contract enforcement.
In this section, we consider civil litigation, that is, the bringing of lawsuits by private actors to enforce their rights in the areas of civil law that we have just discussed. Until now, we have largely assumed that the operation of the legal system is frictionless, in the sense that the bringing and adjudication of lawsuits is without cost. We now analyze the implications of the expense involved in the operation of the legal system.
We begin with what may be called the basic theory of litigation: the choice of a party who has suffered a loss whether to sue; if suit is brought, the choice of the litigants whether to settle with each other or instead go to trial; and the choice of litigants, before or during trial, of how much to expend on litigation. Then we discuss various extensions to the basic theory of litigation, including nuisance suits, the shifting of legal fees, lawyers and agency problems in litigation, and legal discovery. We subsequently consider the provision of legal advice, the appeals process, alternative dispute resolution, and the formulation of legal rules.
5.1.1. Private incentive to sue.
As a general matter, the plaintiff will sue when the cost of suit cP is less than the expected benefits from suit. The expected benefits from suit incorporate potential settlements and trial outcomes, but in this section we usually assume for simplicity that if suit is brought, the plaintiff obtains as a judgment a certain amount h equal to harm suffered. Thus the plaintiff will sue if and only if his litigation cost, cP, is less than h. (Obviously, if there is only a probability p of winning this amount, the plaintiff, if risk neutral, would sue if and only if cP < ph; and if the plaintiff is risk averse, he would be less likely to sue.) The effect on the private incentive to sue of many variations in the legal environment is straightforward to identify, as we will note below.
5.1.2. Socially optimal suit versus the private incentive to sue. The private incentive to bring suit is fundamentally misaligned with the socially optimal incentive to do so, given the social costs and social benefits of suit. The deviation between the privately-motivated and socially appropriate level of suit could be in either direction. The general reasons for these conclusions may be understood as follows.
On one hand, there is a divergence between social and private costs that can lead to socially excessive suit. Specifically, when a plaintiff contemplates bringing suit, he bears only his own costs; he does not take into account the defendant’s costs or the state’s costs that his suit will engender. of suit, its deterrent effect on the behavior of injurers. But the plaintiff does consider his private benefit, the gain he would obtain from prevailing. This private gain is not a social benefit but instead is a transfer from the defendant; it could be either larger or smaller than the social benefit. The contrast between the socially optimal and private incentive to sue is initially examined in Shave1l (1982b).100
It should be clear from out discussion that the point that the private and social incentives to bring suit may diverge is robust. On one hand, it will always be the case that the private cost of use of the system will be less than the social. And, on the other hand, the private benefits from suit will be what the plaintiff will win from suit, usually money, whereas the social benefits from suit will ordinarily be different: they will always include deterrence benefits and may also include compensation of victims (if insurance is unavailable) and the setting of precedent. These benefits litigants either will not take into account or will tend to weigh differently from their social importance.101
5.1.3. Implications of the social and private divergence. The main implication of the social and private divergence is that state intervention may be desirable, either to correct a problem of excessive suit — notably, by taxing suit or barring it in some domain — or a problem of inadequate suit — by subsidizing suit in some way.102 For the state to determine optimal policy, however, requires it to determine the effects of suit on injurer behavior and weigh them against the social costs of suit. It is thus not correct for the state to base policy on some simple, even though superficially appealing, criterion, and notably, whether the plaintiff’s expected gains from suit would have exceeded the aggregate litigation costs.
100See also subsequent examination of the issue in Menell (1983), Kaplow (1986b), Rose-Ackerman and Geistfeld
101Whereas this section was concerned with the implications of litigation costs for the frequency of suit, another litigation-cost related issue has to do with the level of precautions taken by injurers. The optimal level of precautions should reflect not only the direct harm that would be caused by an accident, but also litigation costs. To induce this higher level of
precautions, injurers who are sued should pay not only the harm, but also (perhaps as a penalty) litigation costs borne by the plaintiff and by the state. On this issue, see Hylton (1990), Polinsky and Rubinfeld (1988a), and Shavell (1997, 1999). 102Another response to the problem of inadequate suit on account of small stakes, when there are many victims so that aggregate stakes are substantial, is the use of class actions. See Dam (1975). Yet another approach is to encourage suits with pro-plaintiff fee-shifting (see section 5.4.2) or damage multipliers (see Kaplow 1993), or to use “decoupling” — that is, a system in which damages are raised (to enhance deterrence) and plaintiffs’ recoveries are raised (if suit must be encouraged, but not necessarily by the amount of damages) or are lowered (if there otherwise would be excessive suits), on which see Polinsky and Che (1991) and Shavell (1997, 1999).
It should also be emphasized that the importance of the private-social divergence in incentives to sue may be substantial. This is suggested by the fact that the costs of use of the legal system are high; indeed, legal costs may on average actually equal the amounts received by those who sue.103 Hence, the incentives created by the legal system must be significant to justify its use.
5.2. Settlement versus Trial
Assuming that suit has been brought, we now take up the question whether parties will reach a settlement or go to trial.104 A settlement is a legally enforceable agreement, usually involving a payment from the defendant to the plaintiff, in which the plaintiff agrees not to pursue his claim further. If the parties do not reach a settlement, we assume that they go to trial, that is, that some tribunal determines the outcome of their case. In fact, the vast majority of cases settle.105 We discuss here two different models describing whether settlement occurs and then consider the socially optimal versus the private decision whether to settle.
5.2.1. Exogenous beliefs model. One model of settlement versus trial presumes that parties have each somehow come to a belief about the probability of the trial outcome; let pP represent the probability of the plaintiff prevailing in his opinion, and pD be that same probability in the defendant’s opinion. Let w be the amount that would be won (for simplicity assume that they agree about w). Assume also that the parties are risk neutral. The plaintiff’s expected gain from trial, net of litigation costs, is pPw – cP. This is the minimum amount he would accept as a settlement, rather than go to trial. The defendant’s expected loss from trial, including his litigation costs, is pDw + cD; this is the maximum amount he would pay in settlement rather than go to trial. Hence, a settlement is possible if and only if pPw – cP pDw + cD, in which case the settlement amount will be in the settlement range [pPw – cP, pDw + cD]. Note that if the parties agree on the probability p, the settlement range will be positive and
cP + cD in length. A settlement range does not exist, and trial will occur, when pPw – pDw > cP + cD. This means that the expected award in the plaintiff’s opinion exceeds the expected award in the defendant’s opinion by more than the sum of litigation costs. Thus, trial will tend to occur when the plaintiff is sufficiently more optimistic about winning than the defendant believes he
103See section 2.3.1.
104Cooter and Rubinfeld (1989), Daughety (forthcoming), and Hay and Spier (1998) survey this general topic; Farmer and Pecorino (1996) review the asymmetric information literature on settlement versus trial. 105Recent data on state courts show that, in fiscal year 1992, over 96% of civil cases did not go to trial; see Ostrom and Kauder (1996). Similarly, recent data on federal courts demonstrate that, for fiscal year 1995, almost 97% of federal civil cases were resolved without trial; see Administrative Office (1995). These figures, however, overstate the settlement rate because many of the cases not tried were dismissed by a court rather than being settled. On the other hand, many disputes are settled before any complaint is filed.
106Lowenstein et al. (1993) and Mnookin (1993) suggest that litigant overoptimism is plausible.
The model under discussion originated with Friedman (1969), Gould (1973), Landes (1971), and Posner (1973), and was further elaborated in Shavell (1982c). It has the virtue of clarifying several basic intuitions: that settlement is fostered by litigation cost savings and by risk aversion, and that trial might result when plaintiffs expect to gain more than defendants expect to lose.
The model also helps to explain the striking predominance of settlement in actuality. First, lawyers, who are experts on the law, are typically advising both litigants, and much information is acquired and comes to be shared by the opposing sides; we should thus expect beliefs of the two sides to be similar. Second, the costs of trial tend to be substantial.
These observations suggest that a settlement range typically exists and thus that settlement would be likely to occur.107 The model has the additional virtue of being simple and easy to manipulate, because it focuses on the calculation of the settlement range.108 The model is unsatisfying, however, in two respects. It does not explain the origin of parties’ beliefs.
And it does not include a description of rational bargaining between the parties; thus, it does not explain whether there will be a settlement when there is a positive settlement range or the amount of any settlement within the range.
107The observations also raise interesting questions about the timing of settlement — will it occur early or late? (In fact, many cases settle early, but many also settle late, on the eve of trial or even during trial.) A reason for settlement to be delayed is that at the outset of settlement negotiations, information may be disparate; but, as noted, over time, as information is acquired and shared, the parties’ beliefs tend to converge. A reason for settlement to occur early, however, is that this maximizes the parties’ savings in litigation costs. To express the point differently, as time passes, more litigation costs are sunk, meaning that the savings from settlement are lowered, tending to decrease the chances of settlement. For analysis of the timing of settlement (in a model of asymmetric information), see Spier (1992a).
108For example, the larger the possible judgment amount w, the greater the chance of trial, for a larger judgment magnifies the effect of differences of opinion in the likelihood of trial outcomes (pPw – pDw, when positive, is increasing in
w). However, larger judgments tend to reduce the likelihood of trial if litigants are risk averse.
5.2.2. Asymmetric information model. A second type of model of settlement versus litigation presumes that there is asymmetry of information between litigants and includes an explicit account of bargaining. The simplest of such models is that of Be bchuk (1984), in which there is one-sided asymmetry of information and bargaining consists of a single take-it-or-leave-it settlement offer made by the party without private information.109 Suppose, for example, that the defendant has private information about the probability p that the plaintiff will win at trial (perhaps the defendant possesses private information bearing on whether he will be found negligent).110
The plaintiff makes a settlement offer x, knowing that low p defendants will reject his offer and high p defendants will accept; specifically, if pw + cD < x, the defendant will reject and the plaintiff will therefore obtain only pw – cP, but if pw + cD x, the defendant will accept and pay x. The plaintiff, who knows the probability distribution over p, chooses x to maximize his expected payoff from settlement or trial.111 The higher his offer x, the more he will obtain if his offer is accepted, but the greater is the likelihood of rejection and thus of his bearing trial costs.
At the optimal offer for the plaintiff, there will generally be a positive probability of trial and of settlement. Furthermore, it can be shown that the higher are litigation costs, the more likely is settlement, and that risk aversion increases the likelihood of settlement. This model, note, is roughly consistent with the previous one of section 5.2.1 in the sense that trial results due to disparate beliefs (arising out of the asymmetry of information). In particular, the plaintiff’s opinion of the probability of winning is the mean probability E(p) over the distribution of defendants, and trial will occur if the defendant’s p is sufficiently low in the distribution. In addition, the comparative statics of the present model are similar to that of the previous one (for instance, as just noted, higher litigation costs make settlement more likely). The primary virtues of asymmetric information models are twofold.
109Asymmetric information models of trial versus settlement have been refined and extended in various ways. See, for example, Daughety and Reinganum (1994) (in which asymmetry of information is two-sided), Hay (1995) (in which unobservable case preparation contributes to asymmetry of information), Reinganum and Wilde (1986) (in which the informed party makes the offer, and the uninformed party makes an inference from it), Schweizer (1989) (in which asymmetry of information is two-sided), and Spier (1992a) (in which there are multiple rounds of bargaining and, as discussed in note 107, the focus is on the timing of settlement). For a useful survey of asymmetric information models of litigation, see Farmer and Pecorino (1996), and for a general survey of asymmetric information models of bargaining, see Kennan and Wilson (1993). For an empirical investigation of litigation that emphasizes asymmetric information, see Farber and White (1991).
110Asymmetric information could also concern the magnitude of the judgment or factors independent of the trial itself, such as parties’ degree of risk aversion, their short-run need for funds, their tastes for litigation, and, as already noted, unobservable aspects of case preparation.
111Specifically, the plaintiff’s expected payoff as a function of x is 0z(pw – cP)f(p)dp + (1 – F(z))x, where z =(x – cD)/w), f is the density of p, and F is the cumulative distribution of p.
First, they include an explicit account of bargaining and thus of the probability of settlement and the magnitude of the settlement offer. (But the ability to predict the probability of settlement and the magnitude of the settlement offer is to some extent specious. Under the bargaining models studied, essentially arbitrary modeling choices are made over such matters as who makes the offer, the informed or the uninformed party; these choices substantially influence the probability of settlement and the settlement offers.)112 Second, the models explain differences of opinion that give rise to trial in terms of differences in possession of information. (However, the models do not explain why there should be such differences in information, given the incentives for sharing of information and its orced disclosure through legal discovery; we discuss these issues below in sections 5.4.7 and 5.4.8.)
5.2.3. Socially optimal versus privately-determined settlement. The private and the social incentive to settle may diverge for reasons related to those explaining the difference between the private and the social incentive to sue (see section 5.1.2).113 First, because the parties involved in litigation do not bear all the costs of a trial — the salaries of judges and ancillary personnel, the forgone value of juror time, implicit rent on court buildings — the parties save less by settling than society does, which suggests that the private incentive to settle is socially inadequate.114 Second, when there is asymmetric information, parties will fail to settle — and thus litigation costs will be incurred — when their demands turn out to have been too aggressive.
But their desire to obtain from each other a greater share of their litigation cost savings does not itself translate into any social benefit. Third, the prospect of settlement may reduce deterrence because defendants gain from settlement. This need not, however, be socially undesirable because settlement lowers the real total social cost of harmful acts, making less deterrence appropriate.115
Also, the division of surplus in settlement may affect deterrence. Fourth, the prospect of settlement may increase deterrence because it lowers plaintiffs’ expected litigation costs and thus increases the chance of suit. These latter factors are not, of course, taken into account by the parties to settlement negotiations.
112For an attempt to address this problem, see Daughety and Reinganum (1993).
113The normative question concerning the social versus the private value of settlement has received little attention relative to the positive question of when parties will settle. On the normative question, see Polinsky and Rubinfeld (1988b), Shavell (1997, 1999), and Spier (1997).
114In addition, the parties generally do not properly take into account each others’ costs when there is asymmetric information. For example, the adverse consequence of rejecting an offer and going to trial involves incurring one’s own trial costs but not the other side’s.
115Indeed, settlement reduces ex post costs by the sum of the plaintiff’s, defendant’s, and court’s costs, but deterrence is reduced only by the fraction of the savings in the plaintiff’s and defendant’s costs that is captured by the defendant in settlement bargaining.
Finally, by averting trial, settlement may have other effects on social welfare. For example, trials may reveal socially valuable information (such as about product hazards that consumers could guard against) or lead to new precedents. These are also factors that parties may ignore or treat inappropriately (a firm might have a socially perverse incentive to avoid trial to conceal information about product hazards). The state can act to correct a divergence between private and social incentives to settle. A factor that should be stressed in considering optimal social policy is that if settlements were to reduce deterrence undesirably, this does not imply that trial should be fostered; deterrence could be enhanced by raising damages to induce settlements for greater amounts or by imposing a tax on defendants (regardless of whether they settle). Trial is desirable only when there is no less costly way to raise social welfare, and a conjecture is that the usual social problem is that there are too many trials, not too few.116
5.3. Litigation Expenditures
5.3.1. Private incentives to spend on litigation. Here we focus on litigant expenditures given that suit has been brought. (We should note that litigation expenditures are made prior to trial as well as during trial; indeed, most are incurred in cases that settle.) Suppose that each litigant’s expenditures are made noncooperatively, as in Braeutigam, Owen, and Panzar (1984), Katz (1987, 1988), and Posner (1973).117 Under this assumption, a plaintiff will make litigation expenditures as long as this raises his expected return from settlement or trial (net of litigation costs), and a defendant will make such expenditures as long as this lowers his expected total outlays. The effects of each litigant’s expenditures will generally depend on what the other does; indeed, the two will often be spending to rebut one another.118
5.3.2. Social versus private incentives to make litigation expenditures. There are several sources of divergence between social and private incentives to spend during litigation Such expenditures have a negative social value. Third, even if expenditures do improve the accuracy of outcomes, they may not be socially optimal in magnitude. By analogy to what we stressed in section 5.1.2, the parties decide on their expenditures based on how they influence the litigation outcome, without regard to the influence (if any) on incentives. This could lead to expenditures that are too great or too small, relative to what is socially correct.119
116In fact, courts attempt to promote settlement in a variety of ways.
117We are abstracting from the possibility that the parties might be able to enter into agreements to limit litigation expenditures.
118One may contrast a system in which a single authority, perhaps the tribunal, makes decisions about litigation expenditures. This is done to an extent in many European countries for criminal proceedings. In the United States, federal trial court judges occasionally use special masters or court-appointed experts to perform similar functions.
119See generally Kaplow (1994a)
An important instance of the possibility that expenditures could be socially excessive concerns the assessment of damages. See Kaplow and Shavell (1996b). Suppose that the presently estimated harm deviates from the truth by $100. Then one of the litigants will be willing to spend up to $100 to prove the correct amount (it will be the defendant if the estimate exceeds the correct level, and the plaintiff if the estimate is too low). It can be shown that the social value
of the more accurate estimate tends, however, generally to be lower than $100, because the social value of accuracy is based on its effects on incentives. Indeed, there will sometimes be no beneficial incentive effect from more accurate assessment of harm, such as when errors (in the absence of additional expenditures) are unbiased and not predictable ex ante by potential injurers.
In particular, potential injurers, at the time they choose their precautions, will often know only a probability distribution of possible harm, so litigation expenditures ex post that provide a precise assessment of a particular victim’s actual harm would not affect incentives.120 Because private and social incentives to spend on litigation may diverge, it may be beneficial for expenditures to be either controlled or encouraged.
Expenditures on determining whether a party is liable (as opposed to the magnitude of damages) could also be socially excessive or inadequate.121 To illustrate the latter possibility, suppose that the cost of establishing that a defendant was negligent exceeded the amount of harm suffered. Plaintiffs would not have an incentive to make the necessary expenditure, with the result that negligence might not be discouraged. But if the deterrent effect of liability were significant, that result would be undesirable. (Suppose that deterrence would eliminate most negligently caused harm, so that ex post litigation costs would not often have to be incurred; see section 5.1.2.)122
5.4. Extensions of the Basic Theory
We consider here various extensions to the basic theory discussed above; for the most part, these extensions are concerned with the description of litigation rather than with its normative analysis.123
120See section 2.4.4. If compensation of risk-averse victims were important due to the unavailability of insurance (see section 2.2), more accurate compensation will have social value, although parties’ incentives to make litigation expenditures would still tend to be excessive because the insurance benefit of avoiding a $1 error in compensation is less than the maximum of $1 that a party would be willing to expend to correct the error. See Kaplow (1994b).
121The determination of liability in the context of law enforcement (see section 6) is analyzed in Kaplow and Shavell (1994a).
122In the light of our analysis in section 5.1.2, it should be clear that cases opposite to those illustrated in the text — that is, inadequate incentives to prove damages or excessive incentives to establish liability — can also arise.
123For empirical studies that bear on the theory of litigation, see, for example, Hughes and Snyder (1995), Ramseyer and Nakazato (1989), and Viscusi (1986a, 1988).
5.4.1. Nuisance suits. A nuisance suit is often defined as a suit that the plaintiff
brings even though he would not actually pursue his case to trial, because the expected award he would obtain is less than the trial cost; in this sense a nuisance suit is a negative expected value suit. We should first point out that we cannot infer that nuisance suits should not be brought: as we stressed in section 5.1.2, it is quite possible that the social deterrence benefits of a type of suit make it desirable to bring even though litigation costs exceed the expected judgment.124 (Nor, as we emphasized, can it be assumed that non-nuisance suits — positive expected value suits — ought to be brought.)
A major question about nuisance suits, and the one to which primary attention has been given, is why they are brought in view of their negative expected value. One important explanation concerns asymmetric information: that plaintiffs who are not willing to go to trial are not identifiable to defendants and ride on the coattails of plaintiffs who would be willing to go to trial. As a consequence, the plaintiffs who are unwilling to go to trial are able to settle for a positive amount with defendants; see Bebchuk (1988) and Katz (1990a). Another possibility, not premised on asymmetric information, is that a plaintiff can initiate a suit at low cost and, although he would lose if the defendant undertook substantial litigation effort, he would prevail if the defendant did not. In this case, the defendant might prefer to settle to avoid paying defense costs; see Rosenberg and Shavell (1985).
An additional reason concerns the point that, as plaintiffs spend continuously on litigation, their willingness to go to trial increases (because the amount that they would then save by not going forward diminishes); see Bebchuk (1996).125
124This is obviously not to deny that some types of nuisance suits are undesirable. For example, where plaintiffs would not prevail because their cases are fictitious, their bringing of suits would tend to distort incentives as well as waste resources on litigation.
125There is also a literature on how nuisance suits (or, relatedly, suits with a low probability of success) might be discouraged. See Bebchuk and Chang (1996), Katz (1990a), and Polinsky and Rubinfeld (1993, 1996).
5.4.2. Shifting of legal fees. Thus far, we have assumed that parties bear their own legal costs, a regime referred to as the American rule. By contrast, under the English rule, the loser pays the legal costs of both sides. Fee-shifting may also be one-way, favoring the plaintiff (that is, shifted only to the defendant, if the plaintiff wins) or favoring the defendant (shifted only to the plaintiff, if the defendant wins).126 Fee-shifting has clear implications for the incentive to sue; for example, under the English rule, suit is encouraged, relative to a regime of no fee-shifting, if the plaintiff’s probability of winning is sufficiently high, because then his expected costs of trial fall.
Fee-shifting may increase the chance of trial, essentially because it accentuates differences in litigant estimates of the expected gains and losses from trial; see Posner (1977a) and Shavell (1982c). Under the English rule (the effects under one-way fee shifting are similar), if the plaintiff and the defendant are each optimistic about winning, then each will be optimistic about passing on his legal expenses to the other, which tends to reduce the settlement range and increase the chances of trial.127 However, fee-shifting tends to raise the amounts the parties will spend at trial, as a party’s expenditure will only be a cost to him with a probability rather than with certainty.128 This attenuates the rise in the chance of trial. (The increase in litigation costs is, of course, significant in itself.129) Also, fee-shifting makes trial riskier, so that if parties are risk averse, it can reduce the chance of trial.
A variant of simple fee-shifting is an offer-of-settlement scheme, according to which fees are shifted only if a settlement proposal is rejected and the amount actually awarded differs in a specified way from the rejected proposal. For instance, if a defendant rejects a plaintiff’s offer and the actual trial award exceeds that offer, fees might be shifted to the defendant. The effects of such schemes on settlement are complex and not readily summarized. See Bebchuk and Chang (1997), Miller (1986), and Spier (1994).
126For a description of the use of fee shifting, see Derfner and Wolf (1995). Fee-shifting favoring only plaintiffs is used to stimulate suits where the private incentive is thought to be inadequate, whereas fee-shifting favoring only defendants is usually proposed as a means to discourage frivolous litigation.
127The same conclusion holds, for closely related reasons, in the asymmetric information model of Bebchuk (1984).
128See Braeutigam, Owen, and Panzar (1984), Hause (1989), and Katz (1987).
129In Katz’s (1987) simulation, the English rule increases costs by 125 percent.
5.4.3. Additional elements of trial outcomes. We have assumed that the only outcome of a trial is a judgment paid by the defendant and received by the plaintiff, but there are other possibilities. First, a trial outcome may have implications for a litigant beyond the immediate judgment. For example, a firm may believe that a loss at trial would invite a string of future law suits; thus, a loss would be more costly for it than the judgment.130 This would tend to make
settlement more likely, as it would raise the amount the defendant firm would be willing to pay in settlement. Second, a litigant may care whether a trial is held per se: a plaintiff might, say, wish the defendant to be exposed to public scrutiny. This would make trial more likely. Or a party might want to avoid a trial because it would result in the airing of embarrassing facts or the disclosure of valuable business information, which would tend to make trial less likely. Third, in whether cases that go to trial are representative of the underlying population of cases, and notably, whether the likelihood of plaintiff victory at trial or the amounts won are typical of the cases that settled. This question is important because, often, the most readily available data is on cases that go to trial, whereas the great majority of cases settle. As Priest and Klein (1984) first emphasized, the cases that go to trial may be quite different from settled cases.
For example, if in 99% of cases defendants would be found liable for a certain amount, but in 1% of cases defendants would prevail, then, if plaintiffs cannot distinguish the two groups, they will likely insist on a settlement amount that the former defendants would pay and the latter would reject. Hence, defendants would win all cases that go to trial, which would be wholly unrepresentative of the cases that settled. In general, cases that go to trial are not representative of the underlying population of cases, and the proper manner of making inferences is complex.132
5.4.5. Lawyers as agents of litigants. Because clients and their lawyers are in a principal and agent relationship, the general problems of principals and agents are relevant. Consequently, to the degree that clients cannot observe lawyers’ effort levels and lack legal expertise, a fee arrangement linked to lawyers’ performance might have joint value to them, but it would impose risk on lawyers (although it would simultaneously reduce clients’ risk, and many clients — particularly individuals or small entities — may be risk averse).133 130See, for example, Che and Yi (1993).
131See Shavell (1993b). For instance, suppose that for each parent in a custody dispute, the value of custody is equivalent to $1,000,000, each parent believes custody would be awarded with probability 50%, and the cost of trial for each is $10,000. Then to induce either parent to settle and give up the opportunity of custody, an offer of at least $490,000 would have to be made, yet neither parent may have assets nearly equal to that amount. 132Priest and Klein (1984) suggested that cases that go to trial would be won by plaintiffs approximately 50% of the time, regardless of the underlying population of cases. This somewhat surprising conclusion of theirs is correct given their assumptions; but it is not borne out in fact, and does not hold under general assumptions about the population of cases and bargaining over settlement and trial. See Eisenberg (1990), Eisenberg and Farber (1997), Hylton (1993), Shavell (1996), Waldfogel (1995b), and Wittman (1985).
133Another problem in the agency relationship is that, at the time of contracting, the client may not know the
lawyer’s quality, and there may also be asymmetric information regarding the strength of the client’s case.
It frequently are paid a fraction of the amount they obtain for their clients under a so-called on tingent fee agreement.134 In addition, lawyers are implicitly rewarded on the basis of performance in the sense that they (and their firms) acquire reputations, so that their future business depends on performance. Lawyers’ conduct is also controlled to some extent by the threat of suit by clients for malpractice, by court-mandated penalties, and by bar association
discipline. See Wilkins (1992). Principal-agent problems that are specific to the legal context arise in the decisions to sue and to settle versus go to trial. See Miller (1987). For example, when lawyers are paid on a contingent fee basis, they might have perverse incentives to favor not bringing suits or to settle, because their own gain would be only a fraction of the total gain from winning. See Danzon (1983) and Hay (1996).135 When lawyers are paid on an hourly basis, it is often said that they have an excessive incentive to sue and to reject settlement offers in favor of trial. (This claim, however, assumes that their hourly rate exceeds their opportunity costs; if, for example, additional, more profitable work comes into the office after the hourly rate is set, then hourlycompensated lawyers may have an excessive incentive to settle.)
5.4.6. Insurers as agents of litigants. Insurers often play a role in litigation. In accident suits, for example, plaintiffs may own medical or disability insurance policies with (subrogation) clauses giving their insurers the right to bring suit and conduct litigation, and defendants frequently hold liability insurance policies that give insurers a role in litigation. Conflicts may arise between litigants and their insurers as their agents in litigation when the coverage ceiling is less than the amount at stake in litigation. See Meurer (1992) and Sykes (1994). To illustrate, suppose that there is a 20% chance that trial would result in a finding of liability and losses are $500,000; also assume that the defendant’s liability coverage ceiling is $150,000. The liability insurer would prefer to reject a settlement offer of $75,000, even though the offer falls below the expected judgment of $100,000. (If the settlement offer is accepted, the insurer pays $75,000 for sure, whereas if there is a trial the insurer makes a payment of $150,000 only 20% of the time, which has an expected cost of $30,000.) By similar reasoning, a plaintiff’s insurer would tend to want to settle for less than plaintiffs would like, which would increase the chance of settlement. Note, however, that reputational interests of insurers as well as the possibility of renegotiation between insurers and insureds serve to mitigate their conflicts of interest.
134See generally Rubinfeld and Scotchmer (1998) on contingent fees. We note that payment arrangements that are contingent upon outcomes may be common because lawyers who nominally charge hourly rates may submit higher bills when successful and may trim their bills when they lose.
135Hay (1997) discusses how bifurcated contingent fees (which pay a higher rate if there is no settlement) can help to address this problem. Additional principal-agent problems arise in the context of class actions, where many plaintiffs are joined in a class and there is a free-rider difficulty with regard to supervision of attorneys. See Coffee (1986) and Macey and Miller (1991).
5.4.7. Voluntary sharing of information. In the discussion of settlement versus litigation in section 5.2, we assumed that the information of parties was somehow exogenously determined: either information was in the background and formed parties’ perhaps disparate beliefs, or else information was explicitly presumed to be asymmetric. However, litigants in general have strong motives to share information. See Shavell (1989a). Most obviously, parties will want to share favorable information in order to foster settlement and to improve its terms. A plaintiff, for example, would want to show the defendant information establishing that his losses were in fact higher than the defendant otherwise believes; in this way, the plaintiff can induce the defendant to pay more in settlement and perhaps avoid an impasse leading to trial. Likewise, a defendant would want to show the plaintiff evidence pointing toward his lack of responsibility, in order to convince the plaintiff to accept a lower settlement offer. In addition, parties will want to reveal information to avoid negative inferences that would be made from their silence. If a plaintiff says nothing about the magnitude of his losses, the defendant will be likely to infer that the plaintiff is withholding information that his losses are lower than average, and if this inference is made, the defendant will not be willing to make an average offer. Both this incentive to avoid negative inferences and the incentive to reveal favorable information tend to produce significant voluntary disclosure and help to explain the high rate of settlement.136
Nevertheless, some information will not be shared, and this helps to explain why some cases do not settle. First, a party may decide against disclosing information because revealed information can often be countered at trial if the opposing side has foreknowledge of it. Second, information may be difficult to share, even though a party wants to do that. For instance, a plaintiff might know that his business losses from a breach of contract will be high, but not be able to demonstrate this during settlement negotiations (because, say, experts will have to be hired for trial to verify the losses). Another difficulty faced by a party who wants to reveal favorable information is that it may consist of the absence of unfavorable information. (For example, if the defendant was not drinking before a traffic accident, his favorable information may be the nonexistence of anyone who saw him drinking, and he may have no way to demonstrate this.137) Third, information may not be shared because it is unfavorable and the negative inference drawn from silence is not too strong. Note that the negative inference from silence will be weakened to the extent that some parties do not disclose favorable information for the first two reasons just given.
136Farber and White (1991) find that many malpractice cases settle after plaintiffs obtain information from defendants.
137If the case does not settle, the plaintiff may ultimately be able to verify the defendant’s claim implicitly: investigations may fail to locate any person who saw the defendant drinking (whereas if there really is a witness, there is some probability that the witness would be located)
5.4.8. Required disclosure of information — legal discovery. The courts may require that a litigant disclose certain information to the other side; this practice is known as discovery. It is commonly believed that discovery significantly increases the likelihood of settlement because it reduces differences in parties’ information. But, as just emphasized, there may well be substantial voluntary sharing of information, so the influence of compulsory disclosure will not be so great and is in fact nonexistent in a natural model of disclosure. See generally Shavell (1989a), and see also Hay (1994).138
Discovery will, nevertheless, tend to increase the rate of settlement and also will affect the terms of settlements. First, when parties would otherwise withhold favorable information to disable the opponent from countering it at trial, discovery will force disclosure, which in turn will
make settlement more likely. Second, when parties would otherwise withhold unfavorable information (because the negative inference from so doing would not be too strong), discovery will mandate disclosure and lead to settlement on less favorable terms. It should be noted, however, that such parties with unfavorable information would have settled in the absence of discovery.139 Settlement will increase overall because, when those with unfavorable information are required to disclose it, more generous offers will be made to those who remain silent in the face of discovery (perhaps those with favorable information who cannot verify the strength of their cases). Third, the prospect of legal sanctions for false statements may make more credible parties’ insistence that they lack certain unfavorable information (such as the assertion that there is no witness who could testify to the party having been drinking before an accident); this would encourage settlement of such cases.140
Discovery may also be used strategically. Obeying discovery requests is often expensive because significant time and resources may be needed to produce the desired information.
138Other models of discovery are Sobel (1989) and Mnookin and Wilson (1998); because these articles do not compare outcomes when there is discovery with outcomes with voluntary sharing of information, they are hard to interpret. See also Cooter and Rubinfeld (1994), which considers discovery, but in a model without an explicit treatment of asymmetric information.
139Those who withhold unfavorable information when there is no compulsory discovery seek to mimic others with favorable information who remain silent (whether because they strategically withhold information or because they cannot credibly verify their favorable situation). Accordingly, they receive settlement offers that reflect the average characteristics of the silent group; being those in the group with the least favorable cases, they will be the ones who settle, on terms that are better than they can expect if they were to disclose their unfavorable information. See Shavell (1989a).
140There may, however, be limitations on the feasibility of enforcement of discovery obligations. If a side fails to divulge unfavorable information, often this will not come to light (because the case may settle beforehand or because, even if there is a trial, the other side may never learn the truth in any event). Accordingly, very high sanctions for misrepresentations and possibly selective investigation (perhaps by the state) of the veracity of discovery responses may be necessary, although the present system does not follow either course.
5.4.9. Criminal adjudication. The analysis of suit and settlement for criminal adjudication (see, for example, Landes (1971) and the literature cited in section 6.3.8 on plea bargaining) is in some respects similar to that for civil adjudication, but there are differences in parties’ incentives that are worthy of note. First, in criminal cases the complaining party is a public prosecutor.
Accordingly, litigation decisions will not be based on a simple comparison of litigation costs and the expected gain because the prosecutor neither directly bears these costs nor benefits monetarily from winning (costs are borne by the state and there is no actual recovery). Instead, a prosecutor’s decisions will be dictated by the complex of factors determining his salary and his professional future. Second, a criminal defendant is often impecunious and will have been assigned a public defender. Not having to pay for his defense, such a defendant will not save legal expenses by settling, making him less willing to settle than otherwise. But those who serve as public defenders or are appointed to represent indigent defendants will often have limited budgets and receive low compensation, so they may exert less effort than what defendants would demand were they not liquidity constrained. Also, criminal defendants, and especially first-time defendants, may not care so much about the magnitude of punishment as about the fact of a criminal conviction or about having to spend some time incarcerated.
5.4.10. Additional aspects of legal procedure. There are many aspects of legal procedure that merit study but which we do not examine here, due mainly to their having received only limited treatment in the literature. Topics include the burden of proof,141 rules of evidence (and tribunals’ making inferences from evidence),142 the use of juries,143 the behavior of judges,144 summary adjudication,145 class actions,146 sequential versus joint adjudication of multiple issues in a single case,147 the sharing of liability among multiple defendants,148 and the advantages of the adversarial system of adjudication (in which each side substantially controls its litigation activity) versus the European inquisitorial model (in which the tribunal controls much litigation activity).149
141See Davis (1994), Hay and Spier (1997), Kaplow (1994a), Posner (1973), Rubinfeld and Sappington (1987), and Sobel (1985).
142See Daughety and Reinganum (1995), Froeb and Kobayashi (1996), Lewis and Poitevin (1997), Schrag and Scotchmer (1994), and Shavell (1989b).
143See Klevorick, Rothschild, and Winship (1984) (jury deliberations) and Schwartz and Schwartz (1996) (peremptory challenges).
144See Cohen (1992), Elder (1987), Higgins and Rubin (1980), Kimenyi et al. (1993), Kornhauser (1992a, 1992b), Posner (1993a), Ramseyer (1998), and Rasmusen (1994).
145See Posner (1986) on summary jury trials.
146See Che (1996), Dam (1975), and Miller (1998).
147See Landes (1993).
148See Easterbrook et al. (1980), Klerman (1996), Kornhauser and Revesz (1994), and Polinsky and Shavell (1981).
149See, for example, Langbein (1985) on the German system
5.5. Legal Advice
Because legal advice is costly, individuals must make decisions whether or not to obtain it, and questions about its social desirability also arise. In discussing the topic of legal advice, it is useful to consider separately ex ante legal advice — obtained when a party is contemplating an action — and ex post legal advice — secured after a party has acted or someone has been harmed, which is to say, at the stage of possible or actual litigation.
A notable difference between the types of advice is that ex ante advice can channel behavior directly in conformity with law, whereas ex post advice comes too late to accomplish that (although it has indirect effects on behavior). Ex ante legal advice was first studied from an economic perspective in Shavell (1988) and Kaplow and Shavell (1992); ex post legal advice was initially investigated from this standpoint in Kaplow and Shavell (1989, 1990).150
5.5.1. Ex ante legal advice: when acts are contemplated.
Advice has private value to a party who is considering taking some action with a possible legal consequence if the advice might lead him to alter his decision. The private value of legal advice is just an instance of the conventional definition of the expected value of information to a decisionmaker, as presented for instance in Raiffa (1968).
The social, as opposed to the private, value of ex ante legal advice inheres in the social desirability of advice-induced changes in parties’ behavior. In general, advice has positive social value because it promotes adherence to legal rules. The specific nature of the comparison between the social and the private values of legal advice depends on the form of liability. When liability is strict, the private value of legal advice is the same as its social value.
conclusion follows essentially because a party’s liability burden equals the harm he causes. When, however, liability is based on negligence, the private value of legal advice can be shown to exceed its social value. The explanation is in part that if a person avoids negligence because of advice, his liability saving will generally be larger than the reduction in expected harm he accomplishes, for he will escape liability entirely even though his non-negligent behavior might still cause harm.
150Legal advice was further studied in Bundy and Elhauge (1991, 1993) and Fischel (1998).
5.5.2. Ex post legal advice: at the stage of litigation. The private value of ex post legal advice resides in the possibility that the advice will lead a party to change his decisions about suit, settlement, and trial.
In considering the social value of ex post legal advice, observe first that because such advice is, by its nature, imparted to parties only after they have acted, it cannot have aided them initially in conforming with the law. A firm that does not know whether discharging a chemical waste into a river will violate an antipollution statute obviously cannot be led to behave appropriately by learning what the law is after it decides about discharging the chemical. This simple but fundamental observation means that ex post advice does not raise social welfare in the direct way that ex ante advice does. Nonetheless, ex post advice certainly may influence behavior and social welfare. Ex post advice that defendants obtain in the course of a lawsuit may affect social welfare by lowering sanctions for those who knowingly violate the law, that is, ex post advice may dilute deterrence of undesirable conduct. Lawyers may lower expected sanctions by advantageous use of legal strategy and, importantly, by counseling defendants on the selection of evidence to present and to suppress. Given that individuals anticipate that their expected sanctions for causing harm will be reduced due to the subsequent availability of legal advice, fewer individuals will be deterred from engaging in undesirable behavior. Thus, legal advice may have negative social value, a point that was early emphasized by Bentham (1827). This reasoning, however, is incomplete, in part because the state may be able to raise overall sanctions to offset the dilution of deterrence due to advice.
5.5.3. Other aspects of legal advice. Subversion of the law. One issue that we have not mentioned is that advice may directly subvert the law. Lawyers may lower the effective magnitude of sanctions by helping clients to hide assets, and lawyers may also decrease the likelihood of sanctions if they have knowledge of enforcement strategies (such as how the tax authorities choose whom to audit). Of course, lawyers are not supposed to thwart law enforcement, but they have an economic incentive to do so and can fairly easily avoid punishment for it (lawyers give advice in private and can phrase their advice in hypothetical but readily understood terms). From the social perspective, legal advice that frustrates law enforcement I obviously undesirable.
Confidentiality of legal advice. The legal system protects the confidentiality of communications between lawyers and their clients under wide circumstances. Confidentiality of legal advice will benefit clients when there is a positive probability that disclosure of advice would lower its value. This would usually be true of advice about the selection of evidence to present in litigation: such advice generally would be robbed of effectiveness if it were disclosed to the opposing side and the court. Confidentiality is also of obvious importance to those obtaining advice subversive of the law. By contrast, confidentiality often should not matter to parties obtaining advice about the legality of an act or about magnitude or likelihood of sanctions, because disclosure of such advice will usually not disadvantage them. (This is because they seek advice with the intention of following the law.151) Still, whatever is the character of strictly legal advice, maintaining the confidentiality of much business or personal information about clients themselves will frequently be of importance to the clients.
Because protection of confidentiality can benefit clients (and never is a disadvantage to them), it encourages clients to consult with and reveal information to their lawyers. This in itself is sometimes thought to imply that confidentiality is socially desirable. That reasoning, however, is mistaken: confidentiality is socially desirable only if the legal advice that confidentiality encourages is socially desirable, and as has been explained above, that may not be the case. Confidentiality of legal work product. The legal system also protects the confidentiality of legal work product (documents and other records of lawyers’ effort) that they generate on behalf of clients in connection with litigation. The protection of work product is accomplished principally by denying opposing litigants the right to legal discovery of it.
As Easterbrook (1981) stressed, protection of work product encourages lawyers to engage in research on their clients’ cases, for much of the value of the research would be lost if it became immediately known to the other side. But whether protection of work product is socially desirable is not evident a priori; for it depends on whether or not the legal advice that the work product supports is socially desirable.152 A further complication is that, even when the advice is socially desirable, the private value of advice, and thus the amount of work product, may be socially excessive.
151See Shavell (1988).
152To illustrate, investigation may be necessary to determine or document facts that will improve accuracy, but the social value of greater accuracy may or may not exceed the cost of investigation. Also, with work product protection, two parties may engage in duplicative efforts.
The appeals process — the process whereby a litigant disappointed with the decision of a first-order tribunal can seek reconsideration before a higher tribunal — is a widely observed feature of adjudication; in virtually all legal systems today, there exists a fairly general right of appeal of trial court decisions. An important social justification for the appeals process concerns correction of error. See Shavell (1995b). Suppose that litigants possess information about the occurrence of error and that appeals courts can frequently verify it. Then litigants may be induced to bring appeals when errors are likely to have been made but not otherwise, because of the costs of appeals. This outcome may be fostered by imposition of fees for bringing appeals, so as to discourage appeal when decisions were likely to have been correct.153 In other words, if there is an appropriate price for pursuing appeals, the appeals process can harness the information that litigants have about the occurrence of error and tend to remedy it. When this process functions well, appeals not only result in error correction, they also do so cheaply, for the legal system is burdened with reconsidering only the subset of cases in which errors were more probably made. This may render society’s investment in the appeals process inexpensive in comparison to the alternative it has of improving the accuracy of the trial process (by investing in the length and quality of trial court adjudication). Under that alternative approach, extra expenditure would be required in all cases rather than only in the subset of cases that are appealed. The appeals process, in other words, may be an economical way of correcting error by taking advantage of litigants’ information that it has occurred.
5.7. Alternative Dispute Resolution
When parties need to resolve a dispute, they may turn not only to the state-sanctioned method of dispute resolution, namely, trial before a court, but also to arbitration and other forms of alternative dispute resolution (ADR).154 In examining ADR, it is helpful to distinguish between ex ante agreements to employ ADR — arrangements made before disputes arise — and ex post resort to ADR — use of ADR after disputes have arisen. See Shavell (1995a).155
5.7.1. Ex ante ADR agreements. Ex ante ADR agreements may be adopted because they are to the mutual benefit of the parties to a contract.156 In particular, ADR may lower the cost of resolving disputes or reduce risk.
153In fact, however, public fees for appeal are nominal, although private costs may be nontrivial.
154We focus on binding ADR; nonbinding ADR, such as mediation, is often used to foster settlement.
155On other issues raised by ADR, see Landes and Posner (1979). It should also be mentioned that there is a
literature considering arbitration alone, not arbitration as an alternative to state-authorized litigation. See, for example, Ashenfelter (1989, 1992), Ashenfelter and Bloom (1984), and Farber (1980).
156We observe that to obtain many of the benefits noted in the text, the agreement to use ADR must be made ex ante; if the parties wait until a dispute arises, it will often be in the interest of one of the parties to refuse t accept ADR.
Second, ADR may engender superior incentives for the parties through greater accuracy of results. Suppose, for instance, that substandard performance of a contract would be correctly assessed by expert arbitrators under ADR but not by courts. Then the parties to the contract might well prefer to adopt ADR because it would induce good performance, thereby raising the willingness of the promisee to pay for the contract. Third, ADR may result in improved incentives to engage in disputes or to refrain from that. For example, it may be that the number of disputes brought under the legal process would be excessive, dissipating substantial resources of the parties without instigating mutually desirable changes in behavior; thus an ADR agreement that would serve to limit the number of disputes would be advantageous.
Because ex ante ADR agreements made by knowledgeable parties raise their well-being, it seems that ex ante ADR agreements should ordinarily be enforced by the legal system, as they are in fact. It is sometimes suggested that society should go further and subsidize ADR. A subsidy might be justified on second-best grounds, because the state already subsidizes ordinary litigation by not charging litigants for its full costs. It would seem, however, that the optimal solution is to remove the latter subsidy, unless it is justified on the ground of inadequate private incentives to sue.
5.7.2. Ex post ADR agreements. Parties will tend to make ex post ADR agreements in order to reduce dispute resolution costs and risk. On this account, ex post ADR would also tend to be socially desirable. A full evaluation of ex post ADR, however, must recognize other effects, notably, how the prospect that parties would adopt ADR ex post would affect their ex ante behavior. The proper analysis is similar to that bearing on the private versus the social value of
settlements, in section 184.108.40.206
5.8. Formulation of Legal Rules
Economic analysis of the operation of the legal system often takes the legal rules being enforced as given. The formulation of legal rules itself, however, raises interesting economic issues.158 One issue concerns the optimal level of detail of rules. On one hand, greater detail allows better tailored control of behavior. On the other hand, greater detail involves higher compliance and litigation costs. Moreover, it cannot be assumed that parties will become informed of the precise content of more detailed rules. See Diver (1983), Ehrlich and Posner (1974), and Kaplow (1995a).159
157Indeed, parties’ adoption of ADR can be seen as a form of out-of-court settlement because use of ADR means that there will be no trial and instead the parties will be bound by the alternative they have chosen.
158For a survey of the literature, see Kaplow (1998).
159The subject of legal complexity has received particular attention in the context of the income tax. See, for example, Blumenthal and Slemrod (1992), and Kaplow (1996).
Another issue is whether rules should be formulated fully ex ante, or instead should be incompletely specified initially and fully articulated only ex post, during adjudication of particular disputes. Fuller ex ante specification is more costly for the state, but may provide greater predictability for parties and hence induce better behavior, and it also may reduce adjudication costs. See Diver (1983), Ehrlich and Posner (1974), and Kaplow (1992c). Full ex ante specification of legal rules tends to be advantageous when the governed behavior is frequent and has common characteristics, essentially because of economies of scale (the rule is formulated only once). A closely related subject is the issuance of precedents by courts; for example, major disagreements about issuing precedents concern the degree to which details of rulings beyond those necessary to decide the case before a court should be specified and when courts should take the opportunity to announce new legal rules or modify existing ones.160 Additional issues are presented by the frequent need to modify legal rules. New rules, if fully and immediately applicable, will typically affect the returns to previous investments. The prospect of such application of new rules imposes risk on actors and also affects their investment decisions, but the latter effect tends to be efficient when the legal reform reflects certain economically relevant information or changed circumstances.161 See generally Kaplow (1986a, 1992a).
5.9. Relevance to General Incentive Schemes
In closing, we suggest that the topic of the operation of the legal system should in substantial respects be viewed as a basic one in the theory of incentives. This is because incentive schemes often require that parties come before authorities who apply rules — that is, the incentive schemes — and this adjudication process is costly. (Even if the adjudication is informal, it will involve expense.) Therefore, many of the issues that we addressed are of relevance. Notably, questions arise concerning private versus system-appropriate motives to come before authorities who apply rules, for individuals will not take into account total adjudication costs nor the incentive effects of adjudication (such as whether the bringing of employee complaints will induce better behavior in a firm). Also, many of the more particular issues that we considered about litigation and the design of legal procedure — including settlement, discovery, and appeals — have general analogs in other incentive systems.162
160See also Landes and Posner (1976), who analyze the body of precedent as a capital stock that depreciates over time161For example, suppose that the government learns that a type of emission is harmful and, accordingly, imposes some sort of regulation or corrective tax. It will tend to be efficient for the new rule to apply to preexisting sources of the emissions because the prospect of such application will induce actors to take into account the probability that the emissions will turn out to be harmful. (Compare the discussion of compensation for government takings in section 3.4.3.)
162The discussion in the text concerning the cost and process of verifying variables in incentive schemes obviously bears on whether parties should include variables that analysts might treat as unverifiable but that they understand to be verifiable, to some degree of accuracy, if the parties incur sufficient costs.
- Law Enforcement
In this section we consider the theory of public enforcement of law — the use of hired agents (inspectors, tax auditors, police) to detect violators of legal rules and to impose sanctions. Outside the scope of our discussion are many other factors that affect compliance with the law, including public programs (such as job training for low-income individuals, which affects their opportunity cost of crime) and private behavior (such as the carrying of guns and the use of locks to prevent theft).163
We begin by noting the justification for using public law enforcement rather than relying exclusively on private suits. Then, we analyze basic issues concerning the optimal probability, magnitude, and form of sanctions and the rule of liability. Next, we examine a variety of extensions of the central theory, including accidental harms, error, marginal deterrence, repeat offenders, self-reporting, and incapacitation. Finally, we briefly discuss criminal law in the light of the theory.
Before proceeding, we observe that economically-oriented analysis of public law enforcement dates from the eighteenth century contributions of Montesquieu (1748), Beccaria (1770), and, especially, Bentham (1789), whose investigation of deterrence was sophisticated and expansive. But, curiously, after Bentham (1789), the subject of law enforcement lay essentially dormant in economic scholarship until the late 1960s, when Gary Becker (1968) published a highly influential article, which has led to a voluminous literature.164
6.1. Rationale for Public Enforcement A basic question is why there is a need for public enforcement of law in the light of the availability of private suits brought by victims. The answer depends very much on the locus of information about the identity of injurers. When victims of harm naturally possess knowledge of the identity of injurers, allowing private suits for damages will motivate victims to sue and thus harness the information they have for purposes of law enforcement. This may help to explain why the enforcement of contractual obligations and of accident law is primarily private.165
163On other public policies, see, for example, Donohue and Siegelman (1998) and Wilson and Herrnstein (1985). On private behavior and crime, see, for example, Ayres and Levitt (1998), Cook et al. (1995), Lott and Mustard (1997), and Shavell (1991c).
164For surveys and references, see Garoupa (1997), Mookherjee (1997), and Polinsky and Shavell (1998b) (which is similar to this section).
165It may not be the case, however, that private incentives to bring suit are optimal, as we discussed in section 5.1.2.
When victims do not know who caused harm and penalizing wrongdoing is difficult, society tends to rely instead on public investigation and prosecution; this is broadly true of crimes and of many violations of environmental and safety regulations. Even in contexts where sanctioning violators is difficult, however, we should ask why society cannot rely on inducements to private parties — rewards of some type — to supply information or otherwise to help in sanctioning. One difficulty with such private enforcement is that if a reward is available to anyone, there might be wasteful effort devoted to finding violators (akin to excessive fishing activity).
Another problem is that the best technologies for finding liable parties often require coordination of many individuals, sometimes on a vast scale. Additionally, it may be advantageous for expensive information systems (fingerprint records, data banks on offenders) to be developed and maintained, even though their benefits would be hard for the private sector to capture fully; such enforcement technologies may constitute natural monopolies. An additional obstacle to private enforcement is that force (or the threat of it) may be needed to gather information, capture violators, and prevent reprisal, yet the state for various reasons may not want to permit private parties to use force. Thus, there appear to exist arguments favoring public enforcement when effort is required to identify and sanction violators.166
6.2. Basic Theory of Enforcement
Suppose that an individual (or a firm) chooses whether to commit an act that causes harm with certainty (see section 6.3.1 on uncertain harm). If he commits the act, he obtains some gain and also faces the risk of being caught, found liable, and sanctioned. The rule of liability could be either strict — under which the individual is definitely sanctioned — or fault-based — under which he is sanctioned only if his behavior fell below a fault standard.167 The sanction that he suffers could be a monetary fine, a prison term, or a combination of the two.
Whether an individual commits a harmful act is determined by an expected utility calculation.
166The comparison between public and private enforcement has received modest attention in the literature. See Becker and Stigler (1974), Landes and Posner (1975), and Polinsky (1980a); see also Friedman (1995) and Shavell (1993a).
167Fault-based liability is often employed in accident law (the negligence rule) and in many regulatory scheme which penalize only parties that fail to meet regulatory standards). On reflection, criminal law may be seen to be faultbased; it only punishes certain harmful acts whose characteristics make them almost always undesirable He will commit the act if that would raise his expected utility, taking into account the gain he would derive and the probability, form, and level of sanction that he would then face.168 We will usually first examine the case in which individuals are risk neutral with respect to sanctions, but we will also consider other possibilities. We assume, as is conventional, that fines are socially costless to employ because they are mere transfers of money, whereas imprisonment involves positive social costs because of the expense associated with the operation of prisons and the disutility due to imprisonment (which is not naturally balanced by gains to others).169 We also assume that the higher is the probability of detecting and sanctioning violators, the more resources the state must devote to enforcement.170 Social welfare generally is presumed to equal the sum of individuals’ expected utilities. An individual’s expected utility depends on whether he commits a harmful act, on whether he is sanctioned, on whether he is a victim of someone else’s harmful act, and on his tax payment, which will reflect the costs of law enforcement, less any fine revenue collected. If individuals are risk neutral, social welfare can be expressed simply as the gains individuals obtain from committing their acts, less the harms caused, and less the costs of law enforcement. (The assumption that individuals’ gains are always credited in social welfare could be relaxed without affecting most of our conclusions. The principal difference that altering the assumption would make is that more acts would be treated as socially undesirable and that optimal sanctions and enforcement effort would thus be higher.) The enforcement authority’s problem is to maximize social welfare by choosing enforcement expenditures, or, equivalently, a probability of detection, and also the level of sanctions, their form (a fine, prison term, or combination), and the rule of liability (strict or fault-based). 6.2.1. Optimal enforcement given the probability of detection.
Then the optimal fine f is h/p, the harm divided by the probability of detection, for then the expected fine equals the harm. This fine is optimal because, when the expected fine equals the harm, an individual will commit a harmful act if, but only if, the gain he would derive from it exceeds the harm he would cause.
168We assume that the probability and magnitude of the sanction are known. See, however, Bebchuk and Kaplow (1992) on the case where individuals misperceive the probability of sanctions, Kaplow (1990b) on the case where individuals may make expenditures to acquire information about sanctions, and Sah (1991) on the process by which individuals learn about actual levels of enforcement. For empirical evidence on offenders’ knowledge of expected sanctions, see Wilson and Herrnstein (1985).
169Imposing fines may, in fact, be costly, due to the need for adjudication and fine collection. Were we to take this into account, the main effect on our conclusions would be that the optimal expected sanction would be higher because harmful acts would cause not only direct harm but also, if detected, additional administrative costs.
170We note that when the method of enforcement involves investigating particular acts after they have been If individuals are risk averse, one might expect the optimal fine to be lower than in the riskneutral case for two reasons.
First, because risk-averse individuals are more easily deterred than risk-neutral individuals, the fine does not need to be as high as before to achieve any desired degree of deterrence. Second, lowering the fine reduces the bearing of risk by individuals who commit the harmful act. However, lowering the fine also increases the number of individuals who commit the harmful act and hence bear risk.171
Next assume that imprisonment is the form of sanction, so that social costs will be incurred in imposing sanctions. In this case, there is not a simple formula for the optimal imprisonment term. See Polinsky and Shavell (1984). The optimal term could be such that there is either underdeterrence or overdeterrence, compared to socially ideal behavior. On one hand, a relatively low imprisonment term, implying underdeterrence, might be socially desirable because it means that imprisonment costs are reduced for those individuals who commit harmful acts. On the other hand, a relatively high term, implying overdeterrence, might be socially desirable because it means that imprisonment costs are reduced due to fewer individuals committing harmful acts.172 (For reasons that we will discuss below and because of factors outside the model, our conjecture is that overdeterrence is unlikely to be optimal.)
Now consider the combined use of fines and imprisonment. Here, the main point is that fines should be employed to the maximum extent feasible before resort is made to imprisonment. In other words, it is not optimal to impose a positive imprisonment term unless the fine is maximal. (The maximal fine might be interpreted as the wealth of an individual.) The rationale for this conclusion is that fines are socially costless to impose, whereas imprisonment is socially costly, so deterrence should be achieved through the cheaper form of sanction first. This point is noted by Bentham (1789, p. 183) and Becker (1968); see also Polinsky and Shavell (1984).
Then deterrence and the amount of harm will be unchanged, but the cost of imposing the imprisonment sanction will fall, raising social welfare. Hence, it must be optimal for the fine to be maximal before imprisonment is used.173 (It can be shown that this argument holds regardless of individuals’ attitudes toward risk in either fines or imprisonment.) Fault-based liability. As we explained in section 2.4.1 on accident law, damages equal to harm, in excess of harm, or even somewhat less than harm, will be sufficient to induce optimal behavior under fault-based liability.
171An additional complication, which might favor a higher optimal fine when individuals are risk averse, is that individuals who commit a harmful act might obtain such great benefits that they would be wealthier (and thus have a lower marginal utility of wealth) than others even if they paid a fine equal to h/p.
172See also Kaplow (1990a), who notes that extreme sanctions (zero or the maximal sanction) may well be optimal in the standard model.
173See Levitt (1997b) on why it may be optimal to rely more on imprisonment when offenders’ wealth cannot be observed. For empirical evidence on the use of fines versus imprisonment, see Lott (1992) and Waldfogel (1995a).
See section 2.1.1. (Moreover, for reasons we discuss in section 6.3.2 below, strict liability encourages better decisions by injurers regarding their level of participation in harm-creating activities.) 6.2.2. Optimal enforcement including the probability of detection. We now consider the optimal system of enforcement when expenditures on enforcement, and hence the probability of detection, are allowed to vary. Consideration of this issue originated with Becker (1968). Strict liability. Assume first that the sanction is a fine and that individuals are risk neutral. Then the optimal level of the fine is maximal, fm, and the optimal probability is low (in a sense to be described). The explanation is that if the fine were not maximal, society could save enforcement costs by simultaneously raising the fine and lowering the probability without affecting the level of deterrence: if f < fm, then raise the fine to fm and lower the probability from p to (f/fm)p; the expected fine is still pf, so that deterrence is maintained, but expenditures on enforcement are reduced, implying that social welfare rises. Becker (1968) suggested this result (although much of his analysis implicitly presumes that the fine is not maximal); Carr-Hill and Stern (1979) and Polinsky and Shavell (1979) note it explicitly. The optimal probability is low in that there is some underdeterrence: the optimal p is such that the expected fine pfm is less than the harm h. See Polinsky and Shavell (1984). The reason for this result is that if pfm equals h, behavior will be ideal, meaning that the individuals who are just deterred obtain gains just equal to the harm. These are the individuals who would be led to commit the harmful act if p were slightly reduced. Decreasing p, in turn, must be socially beneficial because these individuals cause no net social losses (because their gains essentially equal the harm), but reducing p saves enforcement costs.
If individuals are risk averse, the optimal fine may well be less than maximal, as shown in Polinsky and Shavell (1979). This is because the use of a very high fine would impose a substantial risk-bearing cost on individuals who commit harmful acts.174 For further discussion of the optimal fine when individuals are risk-averse, see Kaplow (1992b).175
174A more particular explanation involves reconsidering the argument that we used in the risk-neutral case. If the fine f is less than fm, it is still true that f can be raised to fm and p lowered so that prospective violators’ expected utility remains constant; hence, everyone’s behavior will be unchanged. However, because of risk aversion, this adjustment implies that pf falls, meaning that fine revenue falls. (The reduction in fine revenue reflects the disutility caused by imposing greater risk on risk-averse individuals.) If individuals are sufficiently risk averse, the decline in fine revenue associated with greater risk-bearing could more than offset the savings in enforcement expenditures from reducing the probability of detection, implying that taxes would have to rise to make up the shortfall; accordingly, social welfare would be lower.
175Another reason that optimal fines may not be maximal is that higher sanctions may induce violators to expend additional resources to avoid punishment. See Malik (1990). Further reasons are discussed below.
Next, assume that the sanction is imprisonment and that individuals are risk neutral in imprisonment, that is, the disutility of a year of imprisonment is the same for each additional year.176 Then the optimal imprisonment term is maximal for essentially familiar reasons: if the imprisonment term is raised and the probability of detection lowered so as to keep the expected sanction constant, neither individual behavior nor the costs of imposing imprisonment are affected (by construction, the expected prison term is the same), but enforcement expenditures fall. See Shavell (1991b). Thus, not only are there greater savings in enforcement expenditures, but also the social costs of imposing imprisonment sanctions decline because the expected prison term falls. See Polinsky and Shavell (1999a).
Last, suppose that individuals are risk preferring in imprisonment (the disutility of each year of imprisonment falls with the number of years in prison). This possibility seems particularly important: the first years of imprisonment may create special disutility, due to brutalization of the prisoner or due to the stigma of having been imprisoned at all, and potential offenders may have unusually high discount rates. In this case, the optimal sanction may well be less than maximal: if the sanction were raised, the probability that maintains deterrence could not be lowered proportionally, implying that the expected prison term would rise. See Polinsky and Shavell (1999a).
Now consider the situation when both fines and imprisonment are employed as sanctions. Recall that under the optimal enforcement policy, the fine must be maximal, for otherwise it cannot be desirable to employ imprisonment. The main point we wish to add is that, unlike when imprisonment is used alone, the optimal imprisonment term may not be maximal even if individuals are risk neutral or risk averse in imprisonment. The basic reason is that, if the imprisonment term is raised and the probability of detection is lowered so as to keep deterrence constant, there will be relatively greater reliance on imprisonment than on fines, which is more socially costly.177 fault-based liability. The least expensive way to accomplish compliance with the fault standard is to use the highest possible sanction and, given this sanction, the lowest probability of detection that deters individuals who would be at fault. The reason is that, if all individuals who would be at fault are deterred, the only cost incurred is associated with the setting of the probability; this cost is minimized by using the maximal sanction and a correspondingly low probability.
176We did not discuss individuals’ attitudes toward the risk of imprisonment above because the points we made there did not depend on this consideration.
177Reducing the probability reduces the expected disutility attributable to fines (which are constant in nominal amount, at the maximum level); to keep deterrence constant, expected disutility attributable to imprisonment must rise. See Shavell (1991b).
6.3. Extensions of the Basic Theory
6.3.1. Accidental harms. We initially assumed that individuals consider committing acts that cause harm with certainty. In many circumstances, however, individuals cause harms only by accident — harm occurs only with a probability. For instance, if someone drives while intoxicated, he only creates a likelihood of a collision; or if a firm stores toxic chemicals in a substandard tank, the firm only creates the probability of a harmful spill. Essentially all that we have said above applies in a straightforward manner when harms are accidental. There is, however, an additional issue that arises when harm is uncertain: a sanction can be imposed either on the basis of the commission of a dangerous act that increases the chance of harm — storing chemicals in a substandard tank — or on the basis of the actual occurrence of harm — only if the tank ruptures and results in a spill. In principle, either approach can achieve optimal deterrence: when individuals are risk-neutral, the sanction for committing a dangerous act would equal the expected harm, and the sanction for causing harm would simply equal the magnitude of the harm itself.
Several factors are relevant to the choice between act-based and harm-based sanctions. See Shavell (1993a). First, act-based sanctions, being based only on expected harm, need not be as high to accomplish a given level of deterrence, and thus offer an underlying advantage over harm-based sanctions because of limitations in parties’ assets. See section 2.6. Such lower sanctions will also be beneficial when parties are risk averse. Second, act-based sanctions and harm-based sanctions may differ in the ease with which they can be applied. In some circumstances, act-based sanctions may be simpler to impose (it might be easier to determine whether an oil shipper properly maintains its vessels’ holding tanks than to detect whether one of the vessels leaked oil into the ocean). In other circumstances, harm-based sanctions may be
more readily applied (it may be easy to identify that a truck exploded but may be difficult to detect a truck illegally carrying explosives). Third, calculation of the appropriate sanction may be less difficult in one context or the other: actual harm may be apparent when it occurs, whereas the probability may be difficult to assess at the time of an act; or expected harm may be statistically determinable but identifying actual harm (for example, tracing particular pollutants to particular victims) may be nearly impossible. 6.3.2. Level of activity. We have been assuming that the sole decision that an individual makes is whether to act in a way that causes harm when engaging in some activity. In many contexts, however, an individual also chooses whether to engage in that activity, or, more generally, at what level to do so. Thus, as we discussed in section 2.1.3 on liability for accidents, individuals decide both how carefully to drive and how much to drive. Similarly, firms decide on a pollution technology and a level of production. And, as we observed previously, even parties who act with appropriate care may impose harm; hence, their activity levels will tend to be optimal only if they bear the cost of that residual harm. Thus, under strict liability, choices about activity levels tend to be correct, but under fault-based liability, parties generally will participate in activities to a socially excessive extent. An important application of this point concerns safety and environmental regulation. Such regulation is typically framed in terms of standards that have to be met, but which, if met, free regulated parties from liability. Under such regulation, levels of regulated activities tend to be excessive. 6.3.3. Enforcement error. Errors of the two classic types can occur in law enforcement: an individual who should be found liable might mistakenly not be found liable, and an individual who should not be found liable might mistakenly be found liable. Let the probabilities of these errors be å1 and å2, respectively, for an individual who has been detected. Thus, an individual will commit the wrongful act when his gain g net of his expected fine if he does commit it leaves him better off than paying the expected fine if he does not commit it, namely, when g – p(1 – å1)f > -på2f, or, equivalently, when g > (1 – å1 – å2)pf. The first point to note is that, as emphasized in Png (1986), both types of error reduce deterrence: the term (1 – å1 – å2)pf is declining in both å1 and å2. The first type of error diminishes deterrence because it lowers the expected fine if an individual violates the law. The second type of error, when an individual is mistakenly found liable, also lowers deterrence because it reduces the marginal benefit of complying with the law. Because errors dilute deterrence, they reduce social welfare. Specifically, to achieve any level of deterrence, the probability p must be higher to offset the effect of errors. Also, when sanctions are socially costly, greater sanctioning costs may be incurred to achieve a given level of deterrence.178 See generally Kaplow and Shavell (1994a). Now consider the optimal choice of the fine. Given any probability of detection, the dilution in deterrence caused by errors requires a higher fine to restore deterrence. If the probability and the fine are variable, then, as before, the optimal fine is maximal for the now familiar reason.
Next, consider the possible risk aversion of individuals. As we emphasized, the optimal fine under strict liability may well be less than maximal when individuals are risk averse, in part because lowering the fine from the maximum level reduces the bearing of risk. Introducing the possibility of errors may do not violate the law are subject to the risk of having to pay a fine.179 Indeed, because the number of persons who do not violate the law often would far exceed the number who do, the desire to avoid imposing risk on the former group can lead to a substantial reduction in the optimal fine. The possibility of error has analogous effects on our analysis of nonmonetary sanctions.
178First, sanctions will sometimes be imposed on those who did not commit the harmful act. Second, to maintain a given gap in disutility from sanctions due to committing the harmful act, the expected sanction for actual injurers must rise as well.
179See Block and Sidak (1980).
The effect of error on the performance of fault-based liability was already noted in section 6.2.1. Finally, observe that, although we have treated the probabilities of error as fixed, they can be influenced by procedural choices: generally, increasing resources devoted to investigation and adjudication tends to decrease errors, and adjusting the burden of proof affects the tradeoff between the two types of errors. Because both types of error reduce deterrence and increase the imposition of socially costly sanctions for a given level of deterrence, expenditures made to reduce errors may be socially beneficial.
See Kaplow and Shavell (1994a). 6.3.4. General enforcement. Enforcement sometimes is general in the sense that several different types of violations will be detected by an enforcement agent’s activity. For example, a police officer waiting at the roadside may notice a driver who litters as well as one who goes through a red light or who speeds, and a tax auditor may detect a variety of infractions when he examines a tax return.
To analyze such situations, suppose that a single probability of detection applies uniformly to all harmful acts, regardless of the magnitude of the harm. (The contrasting assumption is that enforcement is specific, meaning that the probability is chosen independently for each type of harmful act.180) The main point that we want to make is that in contexts in which enforcement is general, the optimal sanction rises with the severity of the harm and is maximal only for relatively high harms. See Shavell (1991b); Mookherjee and Png (1992) is closely related. To explain, assume that liability is strict, the sanction is a fine, and injurers are risk neutral. Let f(h) be the fine given harm h. Then, for any given general probability of detection p, the optimal fine schedule is h/p, provided that h/p is feasible; otherwise — for high h (all h such that h/p > fm) — the optimal fine is maximal. This schedule is obviously optimal given p because it implies that the expected fine equals harm, thereby inducing ideal behavior, whenever that is possible. The question remains whether it would be desirable to lower p and raise fines to the maximal level for the range of relatively low-harm acts for which h/p is less than maximal. The answer is that if p is reduced for the relatively low-harm acts (and the fine raised for them), then p, being general, is also reduced for the high-harm acts for which the fine is already maximal, which raises the extent of underdeterrence of these acts. The decline in deterrence of high-harm acts may cause a greater social loss than the savings in enforcement costs from lowering p. To express this point differently, p must be sufficiently high to avoid significant underdeterrence of high-harm acts (for which fines are maximal).
180These assumptions correspond to different law enforcement technologies. Investigation (the police following
leads after the commission of a particular crime) tends to be specific, whereas auditing and monitoring tend to be general.
But since this p also applies to less harmful acts, the fines for them do not need to be maximal in order to deter them appropriately. 6.3.5. Marginal deterrence. Sometimes a person may consider which of several harmful acts to commit, for example, whether to release only a small amount of a pollutant into a river or a large amount, or whether to kidnap a person or also to kill the kidnap victim. In such contexts, the threat of sanctions influences not only whether individuals are deterred from committing harmful acts but also, for those who are not deterred, which harmful acts they will choose to commit. Notably, undeterred individuals will have a reason to commit less harmful rather than more harmful acts if expected sanctions rise with harm — a phenomenon that is sometimes referred to as marginal deterrence, named by Stigler (1970). The benefits of achieving marginal deterrence were noted long ago by Beccaria (1770, p. 32) and Bentham (1789, p. 171). There are, however, costs of accomplishing marginal deterrence: for sanctions to rise with the magnitude of harm it may be necessary to apply lower sanctions to less harmful acts, which will reduce the deterrence of such acts. Two additional observations should be made about marginal deterrence. First, marginal deterrence can be promoted by adjusting the probability of detection as well as the magnitude of sanctions. (Thus, rather than achieving marginal deterrence by lowering the sanction for the less harmful act, the state can lower the probability of detection for that act; this accomplishes the same result with regard to deterrence and saves enforcement resources.)181 Second, marginal deterrence is naturally accomplished if the expected sanction equals harm for all levels of harm; for instance, if a polluter’s expected fine would rise from $100 to $500 if he dumps five gallons instead of one gallon of waste into a lake, where each gallon causes $100 of harm, his marginal incentives to pollute will be correct. For formal analyses of marginal deterrence, see Friedman and Sjostrom (1993), Mookherjee and Png (1994), Shavell (1992), and Wilde (1992).
6.3.6. Repeat offenders. In practice, the law often sanctions repeat offenders more severely than first-time offenders. We discuss here when such a policy might be socially desirable. Note first that sanctioning repeat offenders more severely cannot be socially advantageous if deterrence always induces first-best behavior.182 Thus, it is only when there is underdeterrence (which is often optimal even when acts are always undesirable, as with many crimes) that it might be optimal to punish repeat offenders more severely.
181When enforcement is general, the same probability will be applicable to a range of offenses, in which case adjusting sanctions may be the only way to achieve marginal deterrence.
182If the sanction for polluting and causing a $1,000 harm is $1,000, then any person who pollutes and pays $1,000 is a person whose gain from polluting (say the savings from not installing pollution control equipment) must have exceeded $1,000. Social welfare therefore is higher as a result of his polluting. If such an individual polluted and was sanctioned in the past, that only means that it was socially desirable for him to have polluted previously. Raising the current sanction because of his having a record of sanctions would overdeter him now.
The main justification for a greater sanction for repeat offenders is that repeat offenders may reveal themselves to be different in some manner that bears on the optimal sanction.183 Another reason to raise sanctions is if additional imprisonment has less deterrent effect per unit, as discussed in section 6.2.2. We also note that sanctions for repeat offenders not only deter repeat offenses but also initial offenses.
6.3.7. Self-reporting. We have thus far assumed that individuals are subject to sanctions only if they are detected by an enforcement agent, but in fact parties sometimes disclose their own violations to enforcement authorities. For example, firms often are required to, and do, report violations of environmental and safety regulations, individuals sometimes notify police of their involvement in traffic accidents, and even criminals occasionally turn themselves in. We explain here why it is generally socially desirable for the structure of enforcement to be such as to encourage self-reporting. See Kaplow and Shavell (1994b) and Malik (1993). Self-reporting can be induced by lowering the sanction for individuals who disclose their own infractions. Moreover, the reward for self-reporting can be made small enough that deterrence is only negligibly reduced. To amplify, assume for simplicity that the sanction is a fine f, that the probability of detection is p, and that individuals are risk neutral. If an individual commits a violation and does not self report, his expected fine is pf. Suppose the fine if an individual selfreports is set just below pf, say at pf – e, where e > 0 is arbitrarily small. Given that self-reporting can be induced, essentially without compromising deterrence, why is self-reporting socially advantageous? One reason is that self-reporting reduces enforcement costs: when a party self-reports, the enforcement authority does not have to identify and prove who the violator was; if a polluter or a burglar turns himself in, investigatory resources are saved.184 Second, self-reporting reduces risk, and thus is advantageous if injurers are risk averse. Drivers bear less risk because they know that if they cause an accident, they can (and will be led to) report this to the police and suffer a lower and certain sanction, rather than face a substantially higher sanction (for hit and run driving) imposed only with some probability. Third, the magnitude of harm sometimes will be mitigated as a consequence of self-reporting; for example, when firms are induced to report leaks of toxic substances when they occur, prompt remediation is more likely to take place.185
183Observe that the mere fact that the sanction for the first offense was inadequate to deter repeat offenders is not enough to justify a higher sanction, for this fact was known at the time they were sanctioned for the first offense.
184When enforcement is accomplished by means of auditing or monitoring, self-reporting results in only modest savings. For self-reporting then only reduces, perhaps slightly (if most individuals comply with the law), the population of individuals to be audited or monitored.
185See Innes (1999).
6.3.8. Plea bargaining. Plea bargaining refers to settlement negotiations between a public prosecutor and a criminal defendant. We examined this subject in section 5.4.9. Plea bargaining in particular has received some attention in the economics literature. See, for example, Froeb (1993), Grossman and Katz (1983), Reinganum (1988), Kobayashi and Lott (1996), and Miceli (1996). 6.3.9. Corruption of law enforcement agents. An enforcement agent and a potentiall violator might well find it mutually profitable to make an agreement under which the violator pays the agent to keep silent. This problem of corruption would seem to be worse the larger is the sanction faced by a violator. To combat corruption and the undermining of deterrence that it brings about, two general approaches can be employed. One is to raise the overall level of
sanctions, so that bargained-for payments will also rise. This, however, is a gross strategy, and also suffers from the limit on the magnitude of sanctions that can actually be imposed. The second approach is to attempt to control corruption by use of sanctions against those who participate in it. This, however, is expensive and involves the issues of enforcement that we have discussed generally here.186 6.3.10. Principal-agent relationship. Although we have assumed that an injurer is a single actor, the injurer is often an agent of some principal. For example, the agent could be an employee of a firm, or the agent could be a subcontractor working for a contractor.
When harm is caused by the behavior of principals and their agents, many of the
conclusions of our prior analysis carry over to the sanctioning of principals. Notably, if a riskneutral principal faces an expected fine equal to harm done, he will in effect be in the same position vis-à-vis his agent as society is vis-à-vis a single potential violator of law. See Newman and Wright (1990). Consequently, the principal will behave socially optimally in controlling his agents and, in particular, will contract with them and monitor them in ways that will give the agents socially appropriate incentives to reduce harm.187 A question about enforcement that arises when there are principals and agents is the allocation of financial sanctions between the two parties.188 It is apparent, however, that the particular allocation of sanctions does not matter when, as would be the natural presumption, the parties can reallocate the sanctions through their own contract.
186For fairly general, mainly informal discussions of corruption, see Becker and Stigler (1974), Klitgaard (1988), Rose-Ackerman (1978), and Shleifer and Vishny (1993); and for models of various aspects of corruption, see, for example, Bowles and Garoupa (1997), Cadot (1987), Mookherjee and Png (1995), and Polinsky and Shavell (1999b). In the principal-agent context, analogous problems are examined in Kofman and Lawarree (1993) and Tirole (1986).
187But, as Arlen (1994) indicates, firms’ internal monitoring of agents might be discouraged if such monitoring makes firms’ exposure to external sanctions more likely.
188See Kraakman (1984).
For example, if the agent finds that he faces the risk of a large fine but is more risk averse than the principal, the principal can assume the risk; conversely, if the risk of the fine is imposed on the principal, he will retain it. Thus, the post-contract sanctions that the agent bears are not affected by the particular division of sanctions initially selected by the enforcement authority. The allocation of monetary sanctions between principals and agents does matter if some allocations allow the pair to reduce their total burden.
An important example is when a fine is imposed only on the agent and he is unable to pay it because his assets are less than the fine; see Kornhauser (1982) and Sykes (1981). Then, he and the principal (who often would have higher assets) would jointly escape part of the fine, diluting deterrence. Imposing the fine on the principal rather than on the agent avoids this problem.189 A closely related point is that the imposition of imprisonment sanctions on agents may be desirable when their assets are less than the harm that they can cause, even if the principal’s assets are sufficient to pay the optimal fine. See Polinsky and Shavell (1993). The fact that an agent’s assets are limited means that the principal may be unable to control him adequately through use of contractually-determined penalties. For example, a firm may not be able, despite Pitchford .
189The converse problem, when the principal has insufficient assets, may also arise. Then, it may be optimal to hold agents or other contracting parties, such as lawyers or lenders, liable as well. See, for example, Kraakman (1986) and 1995).
6.3.11. Incapacitation. Our discussion of public enforcement has focused on the
deterrent effect of sanctions. However, a different way for society to reduce harm is by imposing sanctions that remove parties from positions in which they are able to cause harm — that is, by incapacitating them. Imprisonment is the primary incapacitative sanction, although there are other examples: individuals can lose their drivers’ licenses; businesses can lose their right to operate in certain domains, and the like. Here, we consider imprisonment, but what we say applies to incapacitative sanctions generally. On the economic theory of incapacitation, see Shavell (1987c). To better understand the role of public enforcement when sanctions are incapacitative, suppose that the sole function of sanctions is to incapacitate; that is, sanctions do not deter. In this case, continued imprisonment will be desirable as long as the reduction in crime from incapacitation exceeds the total costs of imprisonment. Observe that this condition could hold for a long period, even for offenses that are not the most serious.
Several comments may be made on the relationship between optimal enforcement when incapacitation is the goal versus when deterrence is the goal. First, when incapacitation is the goal, the optimal magnitude of the sanction is independent of the probability of apprehension, which contrasts with the case when deterrence is the goal. Second, when deterrence is the goal, the probability and magnitude of sanctions depend on the ability to deter, and if this ability is limited (as, for instance, with the insane), a low expected sanction may be optimal, whereas a high sanction still might be called for to incapacitate.
6.3.12. Empirical evidence on law enforcement.
There has been a great deal of empirical work on deterrence of crime and incapacitation of criminals. See, for example, Blumstein et al. (1978), DiIulio and Piehl (1991), Ehrlich (1973, 1975), Eide (1994, 1998), Grogger (1991), Kessler and Levitt (1998), Levitt (1996, 1997a, 1998a, 1998b), Nagin (1978),
Pyle (1983), Tauchen, et al. (1994), Viscusi (1986b), and Witte (1980). Much of this literature, however, does not distinguish between deterrence and incapacitation as the source of any reduction in crime following from greater law enforcement. Another issue in the literature is the simultaneity problem. Notably, when greater law enforcement is not associated with a significant reduction in crime, the explanation could be either that deterrence and incapacitation are unimportant or else that their importance is masked because enforcement effort and sanctions are increased in response to higher crime rates.
6.4. Criminal Law
The subject of criminal law may be viewed in the light of the theory of public law enforcement. First, that the acts in the core area of crime — robbery, murder, rape, and so forth — are punished by the sanction of imprisonment makes basic sense. Were society to rely on monetary penalties alone, deterrence Posner (1985) and Shavell (1985a). Notably, the probability of sanctions for many of these acts is small, making the money sanction necessary for deterrence large, but the assets of many individuals who might commit these acts is quite low; hence, the threat of prison is needed for deterrence. Moreover, the incapacitative aspect of imprisonment is valuable because of difficulties in deterring many of the individuals who are prone to commit criminal acts. Second, many of the doctrines of criminal law appear to enhance social welfare. The basic feature of criminal law that punishment is not generally imposed on harmful acts but instead is usually confined to undesirable acts (for example, murder, not all accidental killing; see note 167), is socially advantageous. As we have stressed, when the socially costly sanction of imprisonment s employed, the fault system is desirable because it results in less frequent imposition of
punishment than strict liability. The focus on intent in criminal law, another of its defining features, may be relevant to deterrence because those who intend to do harm are more likely actually to cause harm, may be more likely to hide their acts, and may be harder to discourage because of the benefits they anticipate. That unsuccessful attempts to do harm are punished in criminal law is an implicit way of raising the likelihood of sanctions for undesirable acts.190 Study of specific doctrines of criminal law seems to afford a rich opportunity for economic analysis.191 Third, the level of sanctions commonly employed is in some respects in accord with the theory on optimal enforcement; notably, offenses that are relatively more serious or more difficult to detect tend to be punished more severely than others. Sanctions, however, do not always seem
to be as high as the theory suggests would be optimal. To be sure, the theory surveyed above provides many reasons that optimal sanctions may be less than maximal. Yet some sanctions (fines for traffic violations) appear to be much lower than can readily be explained by the theory. An important reason for this may be that the public would view as unfair the imposition of punishment that was disproportionate to the magnitude of an offense, although the precise nature of this constraint on the use of sanctions is difficult to ascertain.192
190See Shavell (1990).
191See Posner (1985) and Shavell (1985a) on the various doctrines of criminal law, and Kaplow (1990b) and Murphy and O’Hara (1997) on whether ignorance of the law should excuse criminal liability. See also Fischel and Sykes(1996) and Khanna (1996) on corporate criminal liability.
192An implication is that jurors may not always be willing to convict if sanctions are viewed as excessive. For a
model of optimal enforcement in this case, see Andreoni (1991).
- Criticism of Economic Analysis of Law
Many observers, and particularly noneconomists, view economic analysis of law with skepticism. In this section, we briefly note some of the most common criticisms.
7.1. Positive Analysis
It is often claimed that individuals and firms do not respond to legal rules as rational maximizers of their well-being. Sometimes this criticism of the conventional economic approach verges on an outright rejection of the use of models.. Ultimately, such criticisms raise questions that can only be answered by empirical investigation.. In addition, laws may frame individuals’ understanding of problems, which could affect their probability assessments or willingness to pay. See, for example, Kahneman et al. (1990) on the assignment of entitlements and the Coase theorem. The emerging field of behavioral economics and work in various disciplines that address social norms is beginning to examine these sorts of issues.193
7.2. Normative Analysis
7.2.1. Distribution of income. A frequent criticism of economic analysis of law concerns its focus on efficiency, to the exclusion of the distribution of income. The claim is that legal rules — such as the choice between strict liability and negligence to govern automobile-pedestrian accidents — should be selected in significant part based upon which type of party is typically richer or poorer.
There is not a good reason, however, to employ legal rules to accomplish redistributive objectives given the general alternative of achieving sought-after redistribution through the income tax and transfer programs. Such direct methods of redistribution tend to be superior to redistribution through choice of legal rules: selecting legal rules other than those that are most efficient in order to effect redistribution is itself costly, and it also will distort individuals’ laborleisure decision in the same manner as the income tax. See Shavell (1981) and Kaplow and Shavell (1994c).194 Moreover, it is difficult to redistribute income systematically through the choice of legal rules. In the first place, many individuals are never involved in litigation. Also, for those who are, there is substantial income heterogeneity, both among plaintiffs and among defendants.
7.2.2. Victim compensation. Another major criticism of economic analysis of law is that it usually emphasizes the effects of legal rules on behavior, but not the compensation of victims .
193See, for example, Baron (1994), Jolls et al. (1998), Kahneman et al. (1982), and Rabin (1998).
194There are subtle qualifications to this claim of the sort identified in the optimal income tax literature (for example, that activities that make leisure relatively more attractive than work should be disfavored), but these qualifications are largely independent of the main criticism at issue, holding that legal rules should be designed to favor whichever party has lower income. which, some believe, is the main purpose of private law. Economic analysis does not, however, ignore victim compensation per se; victim compensation is relevant to social welfare if victims are risk averse. However, as we have discussed, if first-party insurance is available, as it often is, then the legal system need not be relied on to provide compensation. Moreover, providing compensation through legal rules tends to be significantly more expensive than insurance.195
7.2.3. Concerns for fairness. An additional source of criticism is that the welfare economic approach slights important concerns about fairness, justice, and rights. Some of these notions refer implicitly to the appropriateness of the distribution of income and, accordingly, are encompassed by our preceding remarks. Also, to some degree, the notions are motivated by instrumental concerns. For example, the appeal of punishment must inhere in part in its deterrent effect, and the appeal of obeying contracts must rest in part on the beneficial effect this has on production and exchange. To this extent, critics’ concerns are already taken into account in standard welfare economic analysis.
However, many who advance ideas of fairness and cognate notions do not regard them merely as some sort of proxy for attaining instrumental objectives. Instead, they believe that satisfying the notions is intrinsically valuable. This view too can be partially reconciled with economists’ conception of social welfare: if individuals have a taste for a legal rule or institution because they regard it as fair, that should be credited in the determination of social welfare, just as any taste should. (Note that, in this case, the importance of fairness is converted from a philosophical issue to an empirical question about individuals’ tastes.)
But many uphold the view that notions of fairness are important as ethical principles in themselves, without regard to any possible relationship to individual welfare. This opinion is, of course, the subject of longstanding debate among moral philosophers.196
195The reader will also recall from section 2.2 that there may be additional reasons that providing compensation is undesirable, including adverse effects of victims’ incentives and imposing risk on injurers.
196Kaplow and Shavell (1999) provides an extensive investigation of the issues discussed in this section. See generally Sen and Williams (1982) for representative views of leading economists and philosophers on normative analysis.
Although some readersn(along with us) may be skeptical of normative views that are not grounded in individuals’ wellbeing, it is clear that such views will be reflected in assessments of economic analysis of law for the foreseeable future.
7.3. Purported Efficiency of Judge-Made Law
Also criticized is the contention of some economically-oriented academics — notably, Posner (1975) and Landes and Posner (1987a) — that judge-made law tends to be efficient (in contrast to legislation, which is said to reflect the influence of special interest groups).197 Instead, critics believe that the judge-made law is guided by notions of fairness, justice, and rights and thus will not necessarily be efficient. Several observations about these competing views may be made.
197The argument is advanced by examining particular common law rules and presenting arguments that they are efficient. In addition, the argument has been examined in the context of models of common law evolution. See Cooter and Kornhauser (1980), Priest (1977), and Rubin (1977).
Having surveyed the basic areas of economic analysis of law, let us comment on possible directions for future research. Although accident liability has been fairly well explored, relatively little formal work has been done on the subject of property law. With regard to contract law, most analysis has concerned remedies for breach, but little attention has been paid to contract formation. In the area of litigation, research effort so far has focused on settlement versus trial, whereas other aspects of litigation, including its adversarial character and its optimal design, merit study. With regard to law enforcement, an issue worthy of further consideration is the incentives of enforcers (including the problem of corruption); also, many of the doctrines of criminal law deserve investigation.