William DAUBERT, Joyce Daubert, individually and as
Guardians Ad Litem for Jason Daubert, a minor;
Anita De Young, individually, and as
Guardian Ad Litem for Eric
Schuller,
Plaintiffs-Appellants,
v.
MERRELL DOW PHARMACEUTICALS, INC., a Delaware corporation,
Defendant-Appellee.
No. 90-55397.
United States Court of Appeals,
Ninth Circuit.
Argued and Submitted March 22, 1994.
Decided Jan. 4, 1995.
Michael H. Gottesman, Georgetown Univ. Law Center, Washington, DC, Kenneth J. Chesebro, Cambridge, MA, Mary F. Gillick, Luce, Forward, Hamilton & Scripps, San Diego, CA, and Barry J. Nace, Paulson, Nace, Norwin & Sellinger, Washington, DC, for the plaintiffs-appellants.
Charles Fried, Cambridge, MA, Hall R. Marston, George E. Berry, Dickson, Carlson & Campillo, Santa Monica, CA, and Joel I. Klein, and Richard G. Taranto, Washington, DC, for the defendant-appellee.
On Remand from the United States Supreme Court.
Before: KOZINSKI and O'SCANNLAIN, Circuit Judges, and McNAMEE,* District Judge.
KOZINSKI, Circuit Judge.
On remand from the United States Supreme Court, we undertake "the task of ensuring that an expert's testimony both rests on a reliable foundation and is relevant to the task at hand." Daubert v. Merrell Dow Pharmaceuticals, Inc., --- U.S. ----, ----,
* A. Background
Two minors brought suit against Merrell Dow Pharmaceuticals, claiming they suffered limb reduction birth defects1 because their mothers had taken Bendectin, a drug prescribed for morning sickness to about 17.5 million pregnant women in the United States between 1957 and 1982. See Resp't's Br. on Writ of Cert. at 2; Turpin v. Merrell Dow Pharmaceuticals, Inc.,
For the most part, we don't know how birth defects come about. We do know they occur in 2-3% of births, whether or not the expectant mother has taken Bendectin. See Jose F. Cordero & Godfrey P. Oakley, Jr., Drug Exposure During Pregnancy: Some Epidemiologic Considerations, 26 Clinical Obstetrics & Gynecology 418, 424-25 (June 1983). Limb defects are even rarer, occurring in fewer than one birth out of every 1000. Turpin,
Not knowing the mechanism whereby a particular agent causes a particular effect is not always fatal to a plaintiff's claim. Causation can be proved even when we don't know precisely how the damage occurred, if there is sufficiently compelling proof that the agent must have caused the damage somehow. One method of proving causation in these circumstances is to use statistical evidence. If 50 people who eat at a restaurant one evening come down with food poisoning during the night, we can infer that the restaurant's food probably contained something unwholesome, even if none of the dishes is available for analysis. This inference is based on the fact that, in our health-conscious society, it is highly unlikely that 50 people who have nothing in common except that they ate at the same restaurant would get food poisoning from independent sources.
It is by such means that plaintiffs here seek to establish that Bendectin is responsible for their injuries. They rely on the testimony of three groups of scientific experts. One group proposes to testify that there is a statistical link between the ingestion of Bendectin during pregnancy and limb reduction defects. These experts have not themselves conducted epidemiological (human statistical) studies on the effects of Bendectin; rather, they have reanalyzed studies published by other scientists, none of whom reported a statistical association between Bendectin and birth defects. Other experts proffered by plaintiffs propose to testify that Bendectin causes limb reduction defects in humans because it causes such defects in laboratory animals. A third group of experts sees a link between Bendectin and birth defects because Bendectin has a chemical structure that is similar to other drugs suspected of causing birth defects.
The opinions proffered by plaintiffs' experts do not, to understate the point, reflect the consensus within the scientific community. The FDA--an agency not known for its promiscuity in approving drugs--continues to approve Bendectin for use by pregnant women because "available data do not demonstrate an association between birth defects and Bendectin." U.S. Department of Health and Human Services News, No. P80-45 (Oct. 7, 1980). Every published study here and abroad--and there have been many--concludes that Bendectin is not a teratogen. Turpin,
It is largely because the opinions proffered by plaintiffs' experts run counter to the substantial consensus in the scientific community that we affirmed the district court's grant of summary judgment the last time the case appeared before us. Daubert v. Merrell Dow Pharmaceuticals, Inc.,
B. Procedural Issues
First, however, we address plaintiffs' argument that we should simply remand the case so the district court can make the initial determination of admissibility under the new standard announced by the Supreme Court. There is certainly something to be said for this position, as the district court is charged with making the initial determination whether to admit evidence. In the peculiar circumstances of this case, however, we have determined that the interests of justice and judicial economy will best be served by deciding those issues that are properly before us and, in the process, offering guidance on the application of the Daubert standard in this circuit.
The district court already made a determination as to admissibility, albeit under a different standard than we apply on remand, and granted summary judgment based on its exclusion of plaintiffs' expert testimony. Daubert v. Merrell Dow Pharmaceuticals, Inc.,
One other procedural matter detains us. According to plaintiffs, they weren't required to come forward with any evidence to survive summary judgment because the affidavit of Merrell's expert was itself inadmissible under Daubert; the burden thus never shifted to plaintiffs to demonstrate a genuine issue as to causation. Plaintiffs not only fail to mention the many other exhibits offered by Merrell, they also misunderstand the moving party's burden on summary judgment. Because plaintiffs bear the ultimate burden of proof on causation, Merrell had only to point to the absence of a genuine issue of material fact; it wasn't required to produce any evidence at all. See Maffei v. Northern Insulation of New York,
II
A. Brave New World
Federal judges ruling on the admissibility of expert scientific testimony face a far more complex and daunting task in a post-Daubert world than before. The judge's task under Frye is relatively simple: to determine whether the method employed by the experts is generally accepted in the scientific community. Solomon,
The first prong of Daubert puts federal judges in an uncomfortable position. The question of admissibility only arises if it is first established that the individuals whose testimony is being proffered are experts in a particular scientific field; here, for example, the Supreme Court waxed eloquent on the impressive qualifications of plaintiffs' experts. Id. at ---- n. 2,
The task before us is more daunting still when the dispute concerns matters at the very cutting edge of scientific research, where fact meets theory and certainty dissolves into probability. As the record in this case illustrates, scientists often have vigorous and sincere disagreements as to what research methodology is proper, what should be accepted as sufficient proof for the existence of a "fact," and whether information derived by a particular method can tell us anything useful about the subject under study.
Our responsibility, then, unless we badly misread the Supreme Court's opinion, is to resolve disputes among respected, well-credentialed scientists about matters squarely within their expertise, in areas where there is no scientific consensus as to what is and what is not "good science," and occasionally to reject such expert testimony because it was not "derived by the scientific method." Mindful of our position in the hierarchy of the federal judiciary, we take a deep breath and proceed with this heady task.
B. Deus ex Machina
The Supreme Court's opinion in Daubert focuses closely on the language of Fed.R.Evid. 702, which permits opinion testimony by experts as to matters amounting to "scientific ... knowledge." The Court recognized, however, that knowledge in this context does not mean absolute certainty. --- U.S. at ----,
Which raises the question: How do we figure out whether scientists have derived their findings through the scientific method or whether their testimony is based on scientifically valid principles? Each expert proffered by the plaintiffs assures us that he has "utiliz[ed] the type of data that is generally and reasonably relied upon by scientists" in the relevant field, see, e.g., Newman Aff. at 5, and that he has "utilized the methods and methodology that would generally and reasonably be accepted" by people who deal in these matters, see, e.g., Gross Aff. at 5. The Court held, however, that federal judges perform a "gatekeeping role," Daubert, --- U.S. at ----,
While declining to set forth a "definitive checklist or test," id. at ----,
One very significant fact to be considered is whether the experts are proposing to testify about matters growing naturally and directly out of research they have conducted independent of the litigation, or whether they have developed their opinions expressly for purposes of testifying. That an expert testifies for money does not necessarily cast doubt on the reliability of his testimony, as few experts appear in court merely as an eleemosynary gesture. But in determining whether proposed expert testimony amounts to good science, we may not ignore the fact that a scientist's normal workplace is the lab or the field, not the courtroom or the lawyer's office.5
That an expert testifies based on research he has conducted independent of the litigation provides important, objective proof that the research comports with the dictates of good science. See Peter W. Huber, Galileo's Revenge: Junk Science in the Courtroom 206-09 (1991) (describing how the prevalent practice of expert-shopping leads to bad science). For one thing, experts whose findings flow from existing research are less likely to have been biased toward a particular conclusion by the promise of remuneration; when an expert prepares reports and findings before being hired as a witness, that record will limit the degree to which he can tailor his testimony to serve a party's interests. Then, too, independent research carries its own indicia of reliability, as it is conducted, so to speak, in the usual course of business and must normally satisfy a variety of standards to attract funding and institutional support. Finally, there is usually a limited number of scientists actively conducting research on the very subject that is germane to a particular case, which provides a natural constraint on parties' ability to shop for experts who will come to the desired conclusion. That the testimony proffered by an expert is based directly on legitimate, preexisting research unrelated to the litigation provides the most persuasive basis for concluding that the opinions he expresses were "derived by the scientific method."
We have examined carefully the affidavits proffered by plaintiffs' experts, as well as the testimony from prior trials that plaintiffs have introduced in support of that testimony, and find that none of the experts based his testimony on preexisting or independent research. While plaintiffs' scientists are all experts in their respective fields, none claims to have studied the effect of Bendectin on limb reduction defects before being hired to testify in this or related cases.
If the proffered expert testimony is not based on independent research, the party proffering it must come forward with other objective, verifiable evidence that the testimony is based on "scientifically valid principles." One means of showing this is by proof that the research and analysis supporting the proffered conclusions have been subjected to normal scientific scrutiny through peer review and publication.6 Huber, Galileo's Revenge at 209 (suggesting that "[t]he ultimate test of [a scientific expert's] integrity is her readiness to publish and be damned").
Peer review and publication do not, of course, guarantee that the conclusions reached are correct; much published scientific research is greeted with intense skepticism and is not borne out by further research. But the test under Daubert is not the correctness of the expert's conclusions but the soundness of his methodology. See n. 11 infra. That the research is accepted for publication in a reputable scientific journal after being subjected to the usual rigors of peer review is a significant indication that it is taken seriously by other scientists, i.e., that it meets at least the minimal criteria of good science. Daubert, --- U.S. at ----,
Bendectin litigation has been pending in the courts for over a decade, yet the only review the plaintiffs' experts' work has received has been by judges and juries, and the only place their theories and studies have been published is in the pages of federal and state reporters.8 None of the plaintiffs' experts has published his work on Bendectin in a scientific journal or solicited formal review by his colleagues. Despite the many years the controversy has been brewing, no one in the scientific community--except defendant's experts--has deemed these studies worthy of verification, refutation or even comment. It's as if there were a tacit understanding within the scientific community that what's going on here is not science at all, but litigation.9
Establishing that an expert's proffered testimony grows out of pre-litigation research or that the expert's research has been subjected to peer review are the two principal ways the proponent of expert testimony can show that the evidence satisfies the first prong of Rule 702.10 Where such evidence is unavailable, the proponent of expert scientific testimony may attempt to satisfy its burden through the testimony of its own experts. For such a showing to be sufficient, the experts must explain precisely how they went about reaching their conclusions and point to some objective source--a learned treatise, the policy statement of a professional association, a published article in a reputable scientific journal or the like--to show that they have followed the scientific method, as it is practiced by (at least) a recognized minority of scientists in their field. See United States v. Rincon,
Plaintiffs have made no such showing. As noted above, plaintiffs rely entirely on the experts' unadorned assertions that the methodology they employed comports with standard scientific procedures. In support of these assertions, plaintiffs offer only the trial and deposition testimony of these experts in other cases. While these materials indicate that plaintiffs' experts have relied on animal studies, chemical structure analyses and epidemiological data, they neither explain the methodology the experts followed to reach their conclusions nor point to any external source to validate that methodology. We've been presented with only the experts' qualifications, their conclusions and their assurances of reliability. Under Daubert, that's not enough.
This is especially true of Dr. Palmer--the only expert willing to testify "that Bendectin did cause the limb defects in each of the children." Palmer Aff. at 8. In support of this conclusion, Dr. Palmer asserts only that Bendectin is a teratogen and that he has examined the plaintiffs' medical records, which apparently reveal the timing of their mothers' ingestion of the drug. Dr. Palmer offers no tested or testable theory to explain how, from this limited information, he was able to eliminate all other potential causes of birth defects, nor does he explain how he alone can state as a fact that Bendectin caused plaintiffs' injuries. We therefore agree with the Sixth Circuit's observation that "Dr. Palmer does not testify on the basis of the collective view of his scientific discipline, nor does he take issue with his peers and explain the grounds for his differences. Indeed, no understandable scientific basis is stated. Personal opinion, not science, is testifying here." Turpin,
The failure to make any objective showing as to admissibility under the first prong of Rule 702 would also fatally undermine the testimony of plaintiffs' other experts, but for the peculiar posture of this case. Plaintiffs submitted their experts' affidavits while Frye was the law of the circuit and, although they've not requested an opportunity to augment their experts' affidavits in light of Daubert, the interests of justice would be disserved by precluding plaintiffs from doing so. Given the opportunity to augment their original showing of admissibility, plaintiffs might be able to show that the methodology adopted by some of their experts is based on sound scientific principles. For instance, plaintiffs' epidemiologists might validate their reanalyses by explaining why they chose only certain of the data that was available, or the experts relying on animal studies might point to some authority for extrapolating human causation from teratogenicity in animals.12
Were this the only question before us, we would be inclined to remand to give plaintiffs an opportunity to submit additional proof that the scientific testimony they proffer was "derived by the scientific method." Daubert, however, establishes two prongs to the Rule 702 admissibility inquiry. See pp. 1315-16 supra. We therefore consider whether the testimony satisfies the second prong of Rule 702: Would plaintiffs' proffered scientific evidence "assist the trier of fact to ... determine a fact in issue"? Fed.R.Evid. 702.
C. No Visible Means of Support
In elucidating the second requirement of Rule 702, Daubert stressed the importance of the "fit" between the testimony and an issue in the case: "Rule 702's 'helpfulness' standard requires a valid scientific connection to the pertinent inquiry as a precondition to admissibility." --- U.S. at ----,
Plaintiffs do not attempt to show causation directly; instead, they rely on experts who present circumstantial proof of causation. Plaintiffs' experts testify that Bendectin is a teratogen because it causes birth defects when it is tested on animals, because it is similar in chemical structure to other suspected teratogens, and because statistical studies show that Bendectin use increases the risk of birth defects. Modern tort law permits such proof, but plaintiffs must nevertheless carry their traditional burden; they must prove that their injuries were the result of the accused cause and not some independent factor. In the case of birth defects, carrying this burden is made more difficult because we know that some defects--including limb reduction defects--occur even when expectant mothers do not take Bendectin, and that most birth defects occur for no known reason.
California tort law requires plaintiffs to show not merely that Bendectin increased the likelihood of injury, but that it more likely than not caused their injuries. See Jones v. Ortho Pharmaceutical Corp.,
None of plaintiffs' epidemiological experts claims that ingestion of Bendectin during pregnancy more than doubles the risk of birth defects.14 To evaluate the relationship between Bendectin and limb reduction defects, an epidemiologist would take a sample of the population and compare the frequency of birth defects in children whose mothers took Bendectin with the frequency of defects in children whose mothers did not. See DeLuca,
With the exception of Dr. Palmer, whose testimony is inadmissible under the first prong of the Rule 702 analysis, see p. 1319 supra,18 the remaining experts proffered by plaintiffs were equally unprepared to testify that Bendectin caused plaintiffs' injuries; they were willing to testify only that Bendectin is "capable of causing" birth defects. Crescitelli Aff. at 3, 8; Glasser Aff. at 6, 8; Gross Aff. at 9; Newman Aff. at 5, 9; Swan Aff. at 7. Plaintiffs argue "these scientists use the words 'capable of causing' meaning that it does cause. This is an ambiguity of language.... If something is capable of causing damage in humans, it does." Tape of Oral Arg. Mar. 22, 1994. But what plaintiffs must prove is not that Bendectin causes some birth defects, but that it caused their birth defects. To show this, plaintiffs' experts would have had to testify either that Bendectin actually caused plaintiffs' injuries (which they could not say) or that Bendectin more than doubled the likelihood of limb reduction birth defects (which they did not say).
As the district court properly found below, "the strongest inference to be drawn for plaintiffs based on the epidemiological evidence is that Bendectin could possibly have caused plaintiffs' injuries."
Conclusion
The district court's grant of summary judgment is AFFIRMED.
Notes
The Honorable Stephen M. McNamee, United States District Judge for the District of Arizona, sitting by designation
Limb reduction defects involve incomplete development of arms, legs, fingers and toes, such as the defects associated with the Thalidomide disaster of the 1960s
We had adopted Frye as the law of the circuit in United States v. Solomon,
These factors raise many questions, such as how do we determine whether the rate of error is acceptable, and by what standard? Or, what should we infer from the fact that the methodology has been tested, but only by the party's own expert or experts? Do we ask whether the methodology they employ to test their methodology is itself methodologically sound? Such questions only underscore the basic problem, which is that we must devise standards for acceptability where respected scientists disagree on what's acceptable
Two of the four factors mentioned by the Supreme Court would be difficult or impossible to apply to the expert testimony in this case. Only one of plaintiffs' experts has done original research. Dr. Crescitelli mentions that he "specifically performed studies" on Bendectin and its antihistamine component, Aff. at 3, but does not explain the nature of those studies or the methodology employed. The others have examined the available literature and studies within their respective fields and drawn different conclusions than the scientists who performed the original work. As to such derivative analytical work, it makes little sense to ask whether the technique employed "can be (and has been) tested," Daubert, --- U.S. at ----,
There are, of course, exceptions. Fingerprint analysis, voice recognition, DNA fingerprinting and a variety of other scientific endeavors closely tied to law enforcement may indeed have the courtroom as a principal theatre of operations. See, e.g., United States v. Chischilly,
We refer, of course, to publication in a generally-recognized scientific journal that conditions publication on a bona fide process of peer review. See Daubert, --- U.S. at ----,
For instance, peer review might well have brought to light the more glaring arithmetical errors in the testimony presented by plaintiffs' experts in other Bendectin cases. See DeLuca v. Merrell Dow Pharmaceuticals, Inc.,
As Judge Frank Johnson has succinctly noted, "the examination of a scientific study by a cadre of lawyers is not the same as its examination by others trained in the field of science or medicine." Perry v. United States,
There may well be good reasons why a scientific study has not been published. For example, it may be too recent or of insufficiently broad interest. Daubert, --- U.S. at ----,
This showing would not, of course, be conclusive. Proffering scientific testimony and making an initial showing that it was derived by the scientific method enables a party to establish a prima facie case as to admissibility under Rule 702. The opposing party would then be entitled to challenge that showing. This it could do by presenting evidence (including expert testimony) that the proposing party's expert employed unsound methodology or failed to assiduously follow an otherwise sound protocol. Where the opposing party thus raises a material dispute as to the admissibility of expert scientific evidence, the district court must hold an in limine hearing (a so-called Daubert hearing) to consider the conflicting evidence and make findings about the soundness and reliability of the methodology employed by the scientific experts. See Fed.R.Evid. 104(a) ("In making its determination [the court] is not bound by the rules of evidence."); Fed.R.Evid. 706 (on the use of court-appointed experts)
This underscores the difference between Daubert and Frye. Under Frye, the party proffering scientific evidence had to show it was based on the method generally accepted in the scientific community. The focus under Daubert is on the reliability of the methodology, and in addressing that question the court and the parties are not limited to what is generally accepted; methods accepted by a minority in the scientific community may well be sufficient. However, the party proffering the evidence must explain the expert's methodology and demonstrate in some objectively verifiable way that the expert has both chosen a reliable scientific method and followed it faithfully. Of course, the fact that one party's experts use a methodology accepted by only a minority of scientists would be a proper basis for impeachment at trial
Dr. Palmer could not similarly bolster his testimony. Unlike the other experts, who speak in terms of probabilities, Dr. Palmer goes so far as to conclude that plaintiffs' injuries were in fact caused by Bendectin rather than another cause. The record in this case categorically refutes the notion that anyone can tell what caused the birth defects in any given case. See p. 1313 supra
No doubt, there will be unjust results under this substantive standard. If a drug increases the likelihood of birth defects, but doesn't more than double it, some plaintiffs whose injuries are attributable to the drug will be unable to recover. There is a converse unfairness under a regime that allows recovery to everyone that may have been affected by the drug. Under this regime, all potential plaintiffs are entitled to recover, even though most will not have suffered an injury that can be attributed to the drug. One can conclude from this that unfairness is inevitable when our tools for detecting causation are imperfect and we must rely on probabilities rather than more direct proof. In any event, this is a matter to be sorted out by the states, whose substantive legal standards we are bound to apply. See O'Melveny & Myers v. FDIC, --- U.S. ----, ----,
The only exception is Dr. Done, who in another case presented metaanalysis studies purporting to show a relative risk greater than two. But his conclusion in that case rested on a demonstrably faulty methodology, see DeLuca,
For a more complete explanation of the relationship between the burden of proof and relative risk, see Robert P. Charrow & David E. Bernstein, Scientific Evidence in the Courtroom: Admissibility and Statistical Significance after Daubert 28-33 (Wash.Legal Found., 1994)
A statistical study showing a relative risk of less than two could be combined with other evidence to show it is more likely than not that the accused cause is responsible for a particular plaintiff's injury. For example, a statistical study may show that a particular type of birth defect is associated with some unknown causes, as well as two known potential causes--e.g., smoking and drinking. If a study shows that the relative risk of injury for those who smoke is 1.5 as compared to the general population, while it is 1.8 for those who drink, a plaintiff who does not drink might be able to reanalyze the data to show that the study of smoking did not account for the effect of drinking on the incidence of birth defects in the general population. By making the appropriate comparison--between non-drinkers who smoke and non-drinkers who do not smoke--the teetotaller plaintiff might be able to show that the relative risk of smoking for her is greater than two. Here, however, plaintiffs' experts did not seek to differentiate these plaintiffs from the subjects of the statistical studies. The studies must therefore stand or fall on their own
The Supreme Court recognized that the "fit" requirement "goes primarily to relevance," Daubert, --- U.S. at ----,
Dr. Palmer's testimony would easily meet Rule 702's fit requirement, were it not rendered inadmissible by the total lack of scientific basis for his conclusions. See pp. 1319-20 & n. 12 supra. Dr. Palmer's testimony thus illustrates how the two prongs of Rule 702 work in tandem to ensure that junk science is kept out of the federal courtroom
Several circuits have conducted a similar analysis in finding plaintiffs' expert testimony insufficient to prove causation as a matter of law. See Elkins,
