Lead Opinion
¶ 1. Claimant appeals from the superior court’s order granting summary judgment to insurer in this workers’ compensation case.
¶ 2. The trial court found the following facts to be undisputed. Claimant worked for the City of Burlington Fire Department for thirty-six years, first as a firefighter and later as assistant chief. It is not clear if claimant was actively fighting fires throughout his career or if, at some point, he engaged in a combination of administrative work and active duty.
¶4. Claimant appealed this decision to the superior court, and the Commissioner certified the following question for determination: was claimant’s NHL causally related to his work as a firefighter? In August 2007, insurer moved for summary judgment on this question. It asserted that the opinions of claimant’s experts should be excluded under Vermont Rule of Evidence 702 as both irrelevant and scientifically unreliable, and that without any admissible evidence of causation, claimant was not entitled to workers’ compensation benefits.
¶ 5. Claimant responded by moving to strike the motion for summary judgment. He argued that the standard for the admissibility of expert testimony under Rule 702, delineated in Daubert v. Merrell Dow Pharmaceuticals, Inc.,
¶ 6. The trial court denied claimant’s motion to strike in a written order. It explained that the rules of evidence applied “to all actions and proceedings in the courts of this state,” V.R.E. 1101(a), and that it had no discretion to create a new standard for the admission of expert testimony in workers’ compensation cases. It noted, moreover, that there was nothing in the workers’ compensation statutes that would alter the application of the rules of evidence in such cases. As to claimant’s remaining arguments, the court indicated that its review of the Commissioner’s decision was de novo, and thus, insurer’s failure to raise the Daubert issue below was irrelevant. The court did not directly address the jurisdictional argument.
¶ 7. The court allowed claimant additional time to respond to the merits of the summary judgment motion, and following a hearing, it granted summary judgment to insurer. As discussed in additional detail below, the court found that the expert testimony proffered by claimant did not meet the requirements of Rule 702, and that it was therefore inadmissible. This appeal followed.
¶ 8. We begin with claimant’s procedural arguments. Claimant asserts that insurer could not challenge the admissibility of his expert evidence in the trial court proceedings because (1) it failed to raise this argument before the Commissioner; (2) collateral estoppel precluded the relitigation of this issue; and (3) there was no certified question concerning the admissibility of this evidence. Claimant reiterates his position that the trial court should not have applied the rules of evidence, but rather, that it was obligated to apply the same standard as that employed by the Commissioner. Finally, claimant maintains that the trial court lacked jurisdiction to grant insurer’s motion for summary judgment because its jurisdiction is limited to answering questions of fact or mixed questions of fact and law.
¶ 9. Claimant appears to misunderstand the nature of the review conducted by the trial court. The court’s review of the Commissioner’s decision “involves a retrial de novo.” Farris v. Bryant Grinder Corp./Wausau Ins. Co.,
¶ 10. It is true, as claimant asserts, that the trial court’s jurisdiction in workers’ compensation proceedings is limited to a review of questions of fact or questions of fact and law certified to it by the commissioner. 21 V.S.A. § 671; Roethke v. Jake’s Original Bar & Grill,
¶ 11. In evaluating this case, the trial court was obligated to apply the rules of evidence and to determine if the expert testimony proffered by claimant was relevant and admissible. See V.R.E. 1101(a) (rules of evidence apply “to all actions and proceedings in the courts of this state”); V.R.E. 104(a) (preliminary questions concerning admissibility of evidence shall be determined by trial court); V.R.E. 402 (evidence that is not relevant is not admissible); 985 Assocs., Ltd. v. Daewoo Elecs. Am., Inc.,
¶ 12. We reject claimant’s suggestion that these rules are somehow inapplicable in workers’ compensation cases heard by the superior court. Claimant cites Crosby v. City of Burlington,
¶ 13. The fact that insurer was granted judgment as a matter of law in this case does not transform the certified question into one of pure law that must be decided only by this Court. The trial court’s inquiry under Rule 702 was plainly fact-based. And the court merely recognized, following its evidentiary rulings, that claimant failed to identify a triable issue
¶ 14. We thus turn to the heart of claimant’s appeal — whether the trial court erred in concluding that the expert testimony he offered was inadmissible under Rule 702, and that insurer was therefore entitled to summary judgment in its favor. Rule 702 provides that:
If scientific, technical, or other specialized knowledge will assist the trier of fact to understand the evidence or to determine a fact in issue, a witness qualified as an expert . . . may testify thereto in the form of an opinion or otherwise, if (1) the testimony is based upon sufficient facts or data, (2) the testimony is the product of reliable principles and methods, and (3) the witness has applied the principles and methods reliably to the facts of the case.
This rule is intended to create “a flexible standard requiring only that expert testimony be both relevant and reliable to be admissible.” Daewoo,
¶ 15. In reaching our conclusion, we emphasize several fundamental principles. As the United States Supreme Court has recognized, an expert’s “conclusions and methodology are not entirely distinct from one another.” Gen. Elec. Co. v. Joiner,
¶ 16. We have held that “the trial court’s inquiry into expert testimony should primarily focus on excluding ‘junk science’ — because of its potential to confuse or mislead the trier of fact — rather than serving as a preliminary inquiry into the merits of the case.” Daewoo,
¶ 17. In this case, claimant relied on the testimony of three experts — Dr. Tee Guidotti, Dr. James Lockey, and Dr. Grace LeMasters — to prove that his NHL was causally related to his employment.
[e]pidemiological evidence identifies agents that are associated with an increased risk of disease in groups of individuals, quantifies the amount of excess disease that is associated with an agent, and provides a profile of the type of individual who is likely to contract a disease after being exposed to an agent. The focus of epidemiology is on general causation (ie., is the agent in question capable of causing disease?) and not specific causation (ie., did the agent cause a disease in a particular individual?).
Id. (quotation omitted). As the trial court recognized, and as noted above, epidemiological studies can assist in demonstrating a general association between a substance and a disease or condition, but they cannot prove that a substance actually caused a disease or condition in a particular individual. Merrell Dow Pharms., Inc. v. Havner,
¶ 19. Notwithstanding this limitation, numerous courts have considered the role that epidemiological studies can play in estabhshing specific causation. Green, supra, at 382. The existing case law focuses both on the admissibility of epidemiological evidence, as well as whether, and under what circumstances, such evidence could be considered sufficient to meet a plaintiffs burden of production. Id. According to Green, “[a]n epidemiologic study that is sufficiently rigorous to justify a conclusion that is scientifically valid should be admissible, as it tends to make an issue in dispute more or less likely.” Id. As to sufficiency, Green explains that
*242 [t]he civil burden of proof is described most often as requiring the fact finder to believe that what is sought to be proved is more likely true than not true. The relative risk from epidemiologic studies can be adapted to this 50% plus standard to yield a probability or likelihood that an agent caused an individual’s disease.
Id. at 383 (quotation omitted).
¶ 20. With this background in mind, we turn to the nature of epidemiological studies, generally. Epidemiological studies quantify the degree of association between a given substance and disease by assigning a “relative risk” factor to the association.
¶ 21. The trial court found that only two of the eight epidemiological studies relied upon by the experts in this case reflected a relative risk greater than 2.0 — Figgs and Sama — while the remaining six showed “little or no association” between firefighting and lymphomas. Notwithstanding the results of these studies, Dr. Guidotti opined that firefighting was in fact what caused claimant’s lymphoma. Other than an undefined reference to “weight-of-the-evidence methodology,” however, the court could not discern the scientific method that Dr. Guidotti used to reach his conclusion. The court also noted that the studies that Dr. Guidotti relied upon may have been overinclusive, reflecting associations between other types of lymphomas and generic cancers in firefighters.
¶ 22. The court was equally unpersuaded by the “meta-analysis” conducted by Dr. Lockey and Dr. LeMasters. See Knight v. Kirby Inland Marine, Inc.,
¶ 23. The court also found that of the thirty-two studies included in the meta-analysis, only eight dealt with lymphoma. The court could not discern if Dr. Lockey and Dr. LeMasters’ opinions as to claimant were based upon the meta-analysis of the thirty-two studies, including all types of cancers, or only on the eight dealing specifically with lymphoma. The court noted, moreover, that Dr. Lockey and Dr. LeMasters asserted in their study only that there was a general association between firefighting and lymphoma, but not that claimant’s work as a firefighter actually caused him to develop NHL. Thus, the court found that the meta-analysis failed the third prong of Rule 702, which requires that the experts apply the scientific method reliably to the facts of the case. Indeed, the court found, claimant’s brief stated as much, at least with respect to Dr. LeMasters, indicating that she “felt it was beyond her expertise to apply epidemiological evidence to an individual’s medical condition to arrive at an informed expert opinion on ‘specific causation.’ ”
¶ 24. Finally, the court reasoned that even if the meta-analysis conducted by Dr. Lockey and Dr. LeMasters was a reliable scientific method, which it may have been, the court had no way to know if it was based upon sufficient facts or data as required by Rule 702(1). As stated above, these experts were not persuasive in presenting their findings, and the court could not determine which studies of which cancers were included in the meta-analysis.
¶ 25. Claimant challenges these findings and conclusions on appeal. We begin with the court’s evaluation of Dr. Guidotti’s testimony. Claimant raises two related arguments. First, he asserts that the court should not have used a relative risk of 2.0 as a benchmark in evaluating whether the experts’ testimony was based on sufficient facts or data. He also maintains that the court erred in stating that six of the epidemiological studies he offered showed “little or no association” between NHL and firefighting. In a related vein, claimant argues that, contrary to the trial court’s
¶ 26. We find these arguments without merit. Claimant was required to show by a preponderance of the evidence that his NHL was causally related to his employment. To meet his burden of proof, claimant relied on epidemiological studies, studies that focus on general causation rather than specific causation. Dr. Guidotti acknowledged that the demonstration of “more likely than not” in the epidemiological literature corresponded to a relative risk, or an odds ratio, of 2.0, although he argued that this standard should not be followed here.
¶ 28. We also recognize that the very use of epidemiological evidence to show specific causation reflects a compromise, given that epidemiological studies “cannot indicate the actual cause of a given individual’s disease or condition.” Havner,
*247 the law must balance the need to compensate those who have been injured by the wrongful actions of another with the concept deeply imbedded in our jurisprudence that a defendant cannot be found liable for an injury unless the preponderance of the evidence supports cause in fact. The use of scientifically reliable epidemiological studies and the requirement of more than a doubling of the risk strikes a balance between the needs of our legal system and the limits of science.
Id, Mindful of this balance, we conclude that the trial court did not abuse its discretion in considering a relative risk greater than 2.0 as a reasonable and helpful benchmark under the circumstances presented here.
¶29. We view the court’s finding of “little or no association” between lymphomas and firefighting in six of the eight epidemiological studies in the context of its discussion of statistically significant risks. While “association” may have been the wrong word, we reject claimant’s assertion that the court’s use of this term reflects a fundamental misunderstanding of the scientific data.
¶ 30. We turn next to the specific methodology employed by Dr. Guidotti in reaching his conclusion that firefighting caused claimant’s NHL. Dr. Guidotti stated that his opinion was based on his “interpretation of the literature.” He maintained that his interpretation was supported by “available evidence in the extant literature,” and he later reiterated his conclusion that “the weight of evidence favors the interpretation that [claimant’s] lymphoma arose from work as a firefighter.” As noted above, claimant argues that Dr. Guidotti’s methodology was sound. The court concluded otherwise, finding that Dr. Guidotti’s opinion was not based on sufficient facts or data, and that Dr. Guidotti had not applied scientific principles and methods reliably to the facts of this case. The court did not abuse its discretion in reaching its conclusion. See USGen New Eng., Inc. v. Town of Rockingham,
¶ 31. It is true, as claimant asserts, that other courts have discussed a “weight-of-the-evidence” approach. Yet those courts have also required far more information in support of such an approach than Dr. Guidotti provided here. As is often repeated, there must be “a scientific method of weighting that is used and explained,” Magistrini,
¶ 32. In Magistrini, for example, an expert witness proposed to testify, based on a weight-of-the-evidence analysis, that the plaintiffs leukemia was caused by her exposure to a particular chemical. Id. at 599-601. The court found the testimony inadmissible, finding “the single most serious flaw [in the expert testimony] is the most basic: [the expert] simply has not set forth the methodology he used to weigh the evidence.” Id. at 606. As the Magistrini court explained, “because the weight-of-the-evidence methodology involves substantial judgment on the part of the expert, it is crucial that the expert supply his method for weighting the studies he has chosen to include in order to prevent a mere listing of studies and jumping to a conclusion.” Id. at 602. In Magistrini, the expert’s “failure to adequately address the relative risks found in the studies that he relied on weighted] heavily in th[e] Court’s ultimate conclusion that his methodology [was] not sufficiently reliable to pass through the ‘gate’ to the jury.” Id. at 606.
¶ 33. The trial court was faced with a similar situation here. Dr. Guidotti did not specify the precise weight he gave to each study or how he reached his conclusion that the studies, taken together, demonstrated a statistically significant result, when seventy-five percent of the studies, individually, failed to reach that conclusion. Dr. Guidotti stated that his analysis was “based on the observation that improving the accuracy of cumulative exposure to combustion products in whatever data set is available results in an increased estimate of risk, which reflects the strength of association.” He opined that “[i]n the key studies available, a career of 40 years clearly places a firefighter at increased risk of NHL and is sufficient to conclude that the risk was in fact elevated to at least an approximate doubling.” How? Why? Dr. Guidotti failed to specifically account for the level of relative risk shown by each of the studies, describe what precise weight was given to each study, particularly in light of the different types of studies involved, or account specifically for showings such as that found in the Baris study that the level of excess risk of NHL was not associated with an increased number of lifetime runs, and that, in fact, the standardized mortality ratio was highest in those individuals who made the lowest number of firefighting runs.
¶ 35. Claimant suggests for the first time on appeal that specific causation is established by the Figgs and Sama studies alone. Claimant’s experts made no such assertion below, however, and we will not address this argument for the first time on appeal. See Lane v. Town of Grafton,
¶ 36. Of course, we are not blind to fundamental misapplications of Daubert, but claimant fails to show that is what occurred here. See USGen,
¶ 37. We next consider the court’s evaluation of the meta-analysis conducted by Dr. Lockey and Dr. LeMasters. Claimant maintains that the court miseharacterized the methodology and scientific reliability of this study. Claimant also asserts that, contrary to the court’s statement that it could not discern what studies were included, it was clear from the “summary risk estimate” that eight studies were used in the analysis of the association between NHL and firefighting.
¶ 38. While it is not entirely clear from the introduction to the meta-analysis, it does appear, as claimant argues, that the meta-analysis with respect to NHL was based on eight studies, apparently the same eight studies used by Dr. Guidotti in reaching his conclusion. The meta-analysis found the summary risk estimate for NHL to be 1.51, again a value less than 2.0. The study concluded that the findings of an association between firefighting and significant increased risk for specific types of cancer raised red flags and should encourage further development of innovative comfortable protective equipment, allowing firefighters to do their jobs without compromising their health. A conclusion that NHL is considered a “probable cancer risk” for firefighters is not sufficient to establish that claimant’s NHL was caused by firefighting, particularly given that this conclusion rests on a finding of relative risk of less than 2.0. See Sanderson,
¶ 39. We reject claimant’s assertion that the trial court misapprehended the foundation of Dr. Lockey’s testimony. According to claimant, Dr. Lockey did not rely solely on meta-analysis in reaching his conclusion that claimant’s NHL was caused by firefighting. Rather, claimant argues, Dr. Lockey stated that he examined the potential exposures experienced by firefighters; he considered whether there were any nonoccupational factors as applied to claimant that were known to be associated with a risk for NHL; and he examined all of the published and peer-reviewed medical literature as it applied to firefighters and the occurrence of NHL. According to claimant, Dr. Lockey found that the majority of studies demonstrated a significantly elevated risk from a statistical perspective, but not at a statistically significant level. Claimant asserts that, in Dr. Lockey’s expert opinion, the scientific data demonstrated a consistent cause-effect relationship between firefighting and NHL.
¶ 40. Dr. Lockey may have testified as to other things, but his conclusion as to causation plainly rested, as it must, on his evaluation of the scientific evidence, which here, was his interpretation of epidemiological studies. Our law requires claimant to show, not merely that firefighting increased the likelihood of injury, but that it more likely than not caused his disease. See id. (expressing similar sentiment). Claimant failed to establish good grounds for such a conclusion here. We need not consider Dr. LeMasters’ testimony separately because she had no opinion whether claimant’s NHL was caused by firefighting, and thus, her opinion alone would be insufficient to meet claimant’s burden of proof. The fact that she was not a physician, and thus, incapable of opining as to causation, is irrelevant.
¶ 41. As set forth above, the trial court was obligated to evaluate the reliability and relevance of the proposed expert
[o]ur legal system requires that claimants prove their cases by a preponderance of the evidence. In keeping with this sound proposition at the heart of our jurisprudence, the law should not be hasty to impose liability when scientifically reliable evidence is unavailable. As Judge Posner has said, “[l]aw lags science; it does not lead it.”
Havner,
Affirmed.
Notes
The appellant in this case is the Estate of Albert George, but for simplicity’s sake, we refer to appellant here as “claimant.”
The Department of Labor apparently admits scientific evidence “if it is ‘plausible,’ leaving the ultimate weighing of that evidence to the finder of fact — first a hearing officer, then the Commissioner.”
Claimant appears to have recognized this basic principle below. In response to insurer’s statement of undisputed material facts, claimant asserted that “[f]or purposes of this de novo trial, the findings and conclusions of the Department of Labor are wholly irrelevant.” Indeed, he asked that any reference to the Department’s conclusions “be stricken from the record.”
We note that Vermont law now specifically provides that a firefighter who suffers disability or death from certain types of cancer is “presumed to have suffered the cancer as a result of exposure to conditions in the line of duty, unless it is shown by a preponderance of the evidence that the cancer was caused by non-service-connected risk factors or non-service-connected exposure,” with certain limitations set forth by statute. 21 V.S.A. § 601(11)(E).
These studies and others are cited and collected within G. LeMasters et al., Cancer Risk Among Firefighters: A Review and Meta-analysis of 32 Studies, 48 J. Em. Med. 1189 (2006), available at http://www.iaff.org/HS/PDF/ Cancer%20Risk%20Among%20Firefighters%20-%20UC%20Study.pdf.
Green cautions, however, that “before an association or relative risk is used to make a statement about the probability of individual causation, the inferential judgment. . . that the association is truly causal rather than spurious is required: An agent cannot be considered to cause the illness of a specific person unless it is recognized as a cause of that disease in general.” Id. at 383-84 (quotation omitted).
The Magistrini court explained this process in greater detail as follows:
Epidemiological studies attempt to identify agents that are associated with an increased risk of disease. Thus, the first question an epidemiologist asks is whether an association exists between exposure to an agent and a particular disease. An association between exposure to an agent and a disease exists when the two occur together more frequently than they would by mere chance. Once an association is observed, the scientist undertaking the study must assess the strength of the association as well as whether the reason for the observed association is due to bias, chance or real effect.
Relative risk is commonly calculated by dividing the risk of developing a disease observed in an exposed group by the risk observed in an unexposed, but otherwise similar group. If the risks of the unexposed and exposed are the same, then the relative risk estimate (which mathematically is simply the former divided by the latter) is 1.0. This value is also called the null value, and indicates that exposure is not associated with the disease in that study. Thus, a relative risk of 1.0 means that the agent has no effect on the incidence of disease. Similarly, if the relative risk estimate is 1.3, then risk appears to be 30% higher among the exposed compared to the non-exposed. When the relative risk reaches 2.0, the risk has doubled, indicating that the risk is twice as high among the exposed group as compared to the non-exposed group. Thus, the threshold for concluding that an agent was more likely than not the cause of an individual’s disease is a relative risk greater than 2.0.
As Dr. Guidotti noted, NHL “is a collection of widely disparate diseases that are not commonly separated in epidemiological studies.” He stated that NHLs consist of at least thirty recognized types, and he opined that new types will be identified as immunological and genomic methods become more sophisticated.
The dissent concludes sua sponte that the court abused its discretion by failing to hold a Daubert hearing. No party raises this issue on appeal. In fact, the court did hold a hearing on the motion for summary judgment, and heard argument from both parties as to the admissibility of this evidence under Daubert and Rule 702. The parties also provided extensive written argument and evidence concerning the admissibility of claimant’s expert testimony in their motions for summary judgment. The trial court had discretion in deciding whether an evidentiary hearing was necessary, and it acted within its discretion here. See, e.g., Nelson v. Tenn. Gas Pipeline Co.,
This appears to be a slight misstatement, as it appears that a relative risk of greater than 2.0 corresponds to a legal standard of “more likely than not.” See, e.g., Magistrini,
As one court explains:
Association is a term of art in epidemiology. It is defined as “[t]he degree of statistical dependence between two or more events or variables. Events are said to be associated when they occur more or less frequently together than one would expect by chance. Association does not necessarily imply a causal relationship. Events are said not to have an association when the agent (or independent variable) has no apparent effect on the incidence of a disease (the dependent variable).”
In re TMI Litig.,
A “confidence interval” is:
“[a] range of values within which the results of a study sample would be likely to fall if the study were repeated numerous times. . . . The width of the confidence interval provides an indication of the precision of the point estimate or relative risk found in the study; the narrower the confidence interval, the greater the confidence in the relative risk estimate found in the study. Where the confidence interval contains a relative risk of 1.0, the results of the study are not statistically significant.”
In re TMI Litig.,
Concurrence Opinion
¶ 42. concurring. I agree with the majority opinion that decides this case more as an adequacy-of-proof case than an admissibility-of-evidence case. This is exactly how the Commissioner of Labor decided the case, because no issue of evidence admissibility was raised there.
¶43. I write in concurrence only to point out that this is another in the line of cases that attempts to resolve appellate jurisdiction in workers’ compensation cases based on unworkable and inconsistent distinctions. See Stoll v. Burlington Elec. Dep’t,
¶44. The real inquiry here should be into the nature of the Commissioner’s decision on appeal. As I noted above, the Commissioner looked at all the evidence and ruled that “it is merely possible, not probable” that claimant’s firefighter work caused his lymphoma. This ruling was the functional equivalent of granting
¶ 45. I am concurring because, whatever the rationale for the majority’s decision, we should allow cases like this to go to the superior court and not require that they go exclusively to this Court. This ease represents some loosening of the rigid rule of Stoll; how much and in what way remains to be seen.
There is no indication that the employer made a Dcmbert objection before the hearing officer that took the evidence for the Commissioner. Nor is there any indication whether the Commissioner would have recognized such an exception. In workers’ compensation proceedings, “process and procedure . . . shall be as summary and simple as reasonably may be.” 21 VS.A. § 602. The Administrative Procedure Act statute on rules of evidence in administrative proceedings does not apply to workers’ compensation proceedings. See 3 V.S.A. §§ 810, 816(a)(3). As
Dissenting Opinion
¶ 46. dissenting. I cannot agree with the majority’s decision to affirm the trial court’s conclusion that summary judgment was appropriate after it improperly excluded claimant’s expert opinions. In addition, where the trial court appeared to be confused about the meta-analysis, it should have held a Daubert hearing or at least engaged in further review of the submitted materials. I would reverse and remand this case.
¶ 47. It is telling that the concurring opinion summarizes the majority as deciding this appeal “more as an adequacy-of-proof case than an admissibility-of-evidenee case.” Ante, ¶ 42. That statement is unfortunately true: both the trial court and the majority have exceeded their proper roles in this case and evaluated the evidence put forward by claimant to determine whether claimant should ultimately prevail on the merits. As the concurrence states, the trial court and the majority have concluded that “the evidence was inadequate.” Id. The problem is that this is a merits determination that should have been put to the jury. If the concurring opinion is correct that this is about the adequacy — not the admissibility — of the evidence, then on summary judgment we must view all of that evidence “in a light most favorable to” claimant. In re Carroll,
¶ 48. The only way that insurer could prevail on summary judgment is if the expert opinions of both of claimant’s medical doctors are held to be inadmissible. Perhaps it is the foundation for the medical doctor’s opinions that the majority and the concurrence find “inadequate.” Ante, ¶ 42. Regarding that question — a question of admissibility — the only way to dismiss the medical doctors’ opinions here would be if there were “too great an analytical gap between the data and the opinion[s] proffered.” Gen. Elec. Co. v. Joiner,
¶ 49. The standard for admissibility is whether there is too great a gap between the studies offered and the medical doctor’s opinions based in part on those studies. Id. But here, there is no gap at all: two of the studies relied upon by the doctors — the Figgs study and the Sama study — show statistically significant results that meet even the trial court’s strict 2.0 admissibility standard.
¶ 50. The trial court’s summary judgment decision is premised on its erroneous exclusion of claimant’s expert testimony linking claimant’s firefighting service to non-Hodgkin’s lymphoma. It is undisputed that claimant’s three experts here — one epidemiologist and two medical doctors — are well qualified. Indeed, they are arguably some of the most qualified experts in their respective fields.
¶ 51. The opinions offered by claimant’s experts were based on numerous statistically significant scientific studies with confidence intervals for relative risk entirely above 1.0. Those studies,
¶ 52. There are several defects in the trial court’s decision, but the main problem is that it ignores Vermont’s limitation on the gatekeeping role of trial courts in evaluating expert testimony. By failing to limit itself to adopting a legal standard for statistical significance, and instead adopting the requirement that each study meet the 2.0 standard — meaning a doubling of the risk, which is the same standard for showing at the merits stage that causation is more likely than not — the trial court improperly thrust itself into a merits determination. This Court has squarely stated that trial courts should not engage in “a preliminary inquiry into the merits of the case.” Daewoo,
¶ 53. Specifically, the trial court abused its discretion in the following ways: (1) by conducting a preliminary inquiry into the merits of the case and adopting a standard requiring that each piece of evidence be sufficient to make claimant’s entire case; (2) by ignoring the fact that two doctors looked at a number of health-related factors that were peculiar to claimant and based their opinions on these factors; (3) by failing to explain why the standard the court adopted was not met here, at least as to two
I.
¶ 54. The majority is correct that we review trial court decisions excluding evidence for abuse of discretion. USGen New Eng., Inc. v. Town of Rockingham,
¶ 55. We have previously stated that “the trial court’s inquiry into expert testimony should primarily focus on excluding ‘junk
¶ 56. The trial court’s role here is limited because evaluating an expert’s “credibility and [the] weight of the evidence [is] the ageless role of the jury.” McCullock v. H.B. Fuller Co.,
¶ 57. The trial court’s adoption of the 2.0 relative risk standard as the threshold for admitting evidence of epidemiological studies, with no consideration of a study’s statistical significance, goes far enough in passing judgment on the evidence to amount to an evaluation of the merits of the case, rather than a proper inquiry into the methodology and reliability of the studies used by the experts. Whether it is more likely than not that claimant’s firefighting caused his non-Hodgkin’s lymphoma is the exact fact question that must be resolved on the merits.
¶ 58. The 2.0 standard for admissibility is also problematic because it sets a threshold that requires each study to prove that claimant should win on the merits. By definition, the 2.0 standard only admits each study if that study independently meets the more-likely-than-not standard for proving causation. But we have said: “The admitted evidence does not alone have to meet the proponent’s burden of proof on a particular issue.” USGen,
¶ 59. The trial court’s analysis appears to stem in part from a mistaken belief that an epidemiological study that fails to meet the 2.0 relative risk standard is not statistically significant. That is simply not true. Statistical significance and relative risk are two different concepts, and a doubling of the risk is not required for
¶ 60. In summarizing its holding, the Joint Eastern court noted that the trial court “erred ... in rendering independent assessments of the epidemiological evidence far beyond the role authorized by Daubert) in rejecting all epidemiological studies that yielded [a relative risk] below the unexplained floor of 1.50; . . . and in generally encroaching upon the factfinding role of the jury.”
¶ 61. Scientists usually determine statistical significance by looking at a study’s confidence interval, rather than the exact relative risk arrived at in a particular study. The confidence interval is generally set at a 95% confidence level and is rendered as a range with endpoints on both sides. The lower endpoint represents the lowest possible relative risk (RR) or odds ratio (OR) that appeared (or would be expected to appear) in repeated trials:
[T]he RR or OR has a “confidence interval” around it that expresses how “stable” the estimate is in repeated trials. A 95% confidence interval is the range of numbers that would include the “real” risk 95 times out of 100 if the same study were done over and over again, allowing for random fluctuations of the data inherent in the selection of subjects. Thus, a RR of 1.8 with a confidence*263 interval [between] 1.3 [and] 2.9 could very likely represent a true RR of greater than 2.0, and as high as 2.9 in 95 out of 100 repeated trials.
R. Clapp & D. Ozonoff, Environment and Health: Vital Intersection or Contested Territory?, 30 Am. J.L. & Med. 189, 210 (2004).
¶ 62. The confidence interval is important because it speaks to whether a study is statistically significant. Rather than focusing on the exact relative risk that a study produces, courts engaged in a gatekeeper analysis need to look at a study’s confidence interval. Here, all of claimant’s experts — and the epidemiological studies they relied upon — discussed confidence intervals in great detail in numerous documents submitted to the trial court. This was good science. The Fifth Circuit Court of Appeals has noted that “a study with a relative risk of greater than 1.0 must always be considered in light of its confidence interval before one can draw conclusions from it.” Brock v. Merrell Dow Pharms., Inc.,
¶ 63. It is important to look at confidence intervals because doing so is the best way to determine whether a study’s results are statistically significant. “If the confidence interval is so [wide] that it includes the number 1.0, then the study will be said to show no statistically significant association between the factor and the disease.” Id. The Brock court specifically defined a “statistically significant” epidemiological study as “one whose confidence interval [is entirely above and does] not include 1.0.” Id.
¶ 65. The Brock standard is “generally accepted” as the proper way to evaluate whether a study is statistically significant. In re Viagra Prods. Liab. Litig.,
¶ 66. The experts here — all indisputably well qualified in their respective fields — used this standard for determining statistical significance. Claimant’s expert Dr. Guidotti explicitly stated that when the lowest number in the confidence interval “does not include 1.0, [it] means that [the study] is statistically significant.” Similarly, claimant’s expert Dr. Lockey stated that “[f]our of the seven [studies] were statistically significant (Burnett, Ma, Figgs and Sama) as the confidence intervals around the risk estimates do not include 1.0.” Claimant’s epidemiologist Dr. LeMasters applied the same standard and also stated that the four studies with confidence intervals above 1.0 were statistically significant.
¶ 67. If the trial court wanted to impose a minimum threshold for admissibility, that threshold should have been to require studies to show a statistically significant relationship. See Brock,
¶ 68. If a study has a confidence interval in a range that is entirely above 1.0, it is statistically significant, and any questions about the strength of the relationship shown by the study go to the study’s weight, not its admissibility. If the trial court applied this standard here, the experts would be allowed to rely on the Burnett, Ma, Figgs, and Sama studies — all of which had a confidence interval entirely above 1.0. The trial court abused its discretion in excluding those studies.'
¶ 70. While “acknowledging] that courts disagree on the appropriate relative risk threshold that a study must satisfy to support a general causation theory,” id. at 45, the King court held that those courts that have adopted the 2.0 standard often “failed to distinguish between general causation and its brother, specific causation,” id. at 46. The trial court here made this exact mistake when it excluded expert testimony based upon peer-reviewed studies simply because — in the trial court’s opinion — those studies did not show a sufficiently strong association between firefighting and non-Hodgkin’s lymphoma.
¶ 71. The King court noted that “general causation addresses whether a substance is capable of causing a particular injury or condition in a population, while specific causation addresses whether a substance caused a particular individual’s injury.” Id. at 34 (emphasis added). Because “a plaintiff must show both general and specific causation,” id., evidence that survives the Daubert test is admissible if it speaks to either general or specific causation. See V.R.E. 401 (“ ‘Relevant evidence’ means evidence having any tendency to make the existence of any fact that is of consequence to the determination of the action more probable or less probable than it would be without the evidence.”); cf., e.g., In re Neurontin Mktg., Sales Practices, & Prods. Liab. Litig.,
¶ 72. Further, although “epidemiology focuses on general causation rather than specific causation,” King,
¶ 73. The trial court abused its discretion by adopting a standard for admissibility that requires each study to make claimant’s entire case. See, e.g., USGen,
II.
¶ 74. The trial court’s error in adopting the 2.0 standard stems in part from a misunderstanding of the proffered testimony in this
¶ 75. Although the trial court is correct that some courts have adopted the 2.0 standard when determining whether to admit epidemiological studies, see, e.g., Merrell Dow Pharms., Inc. v. Havner,
For an epidemiological study to show causation under a preponderance standard, the relative risk of limb reduction defects arising from the epidemiological data will, at a minimum, have to exceed ‘2’. That is, the study must show that children whose mothers took Bendectin are more than twice as likely to develop limb reduction birth defects as children whose mothers did not. While plaintiffs’ epidemiologists make vague assertions that there is a statistically significant relationship between Bendectin and birth defects, none states that the relative risk is greater than two. These studies thus would not be helpful, and indeed would only serve to confuse the jury, if offered to prove rather than refute causation. A relative risk of less than two may [be] suggestive] . . . , but it actually tends to disprove legal causation, as it shows that Bendectin does not double the likelihood of birth defects.
Id. at 1321 (quotation omitted). Indeed, the trial court here cited part of this very passage as a rationale for adopting the 2.0
¶ 76. First, both doctors considered claimant’s extraordinarily long forty years of service as a firefighter. Dr. Lockey specifically looked at the fact that claimant “worked as a fireman for forty years.” Similarly, Dr. Guidotti noted that claimant’s forty years of exposure “places him in a high-risk category,” specifically for non-Hodgkin’s lymphoma “among other things.” This deposition testimony in itself is sufficient to allow claimant to argue that the epidemiological studies underestimate the real risk that claimant faced through his firefighting and that even studies showing a relative risk of less than 2.0 can therefore support his claim that firefighting more than doubled his risk of getting non-Hodgkin’s lymphoma. The trial court completely failed to address the fact that the experts in this case rendered opinions that this particular claimant was a firefighter for a much longer period of time than the average firefighter discussed in the studies.
¶ 77. Second, Dr. Lockey and Dr. Guidotti looked at the fact that claimant was likely exposed to more toxins than the average firefighter, since claimant’s firefighting career covered a time when protective equipment was often not used. Dr. Lockey noted that claimant was a firefighter “during a timeframe back in the ‘60s and ‘70s when control measures more likely than not were not as good as they are currently.” Dr. Guidotti similarly noted that it was not until the 1970s that a self-contained protective breathing apparatus was widely introduced and that even then “relatively few” firefighters actually used such an apparatus. According to Dr.
¶ 78. Third, Dr. Guidotti also looked at the particular type of non-Hodgkin’s lymphoma that claimant contracted. Non-Hodgkin’s lymphoma does not refer to just one disease; rather, it is a large category that includes at least thirty recognized types of lymphoma. Dr. Guidotti noted that only some of those types “are known to be associated with environmental exposures and occupations.” Claimant had small cell lymphoma, which Dr. Guidotti noted is associated with environmental exposures. In particular, it is associated with exposure to solvents, including some of the same chemicals that are “released during firefighting.” Thus, Dr. Guidotti concluded that “the chemicals that are known to be associated with small cell lymphocytic lymphoma seem to be more than likely the kinds of things that one would encounter on the job.” This information was a significant factor leading Dr. Guidotti to state that, although “[scientific certainty in the matter is unattainable,” it was his opinion that the evidence favored the conclusion that claimant’s “lymphoma arose from work as a firefighter.”
¶ 79. Finally, both doctors also ruled out other possible causes of claimant’s disease before reaching their ultimate conclusions. Dr. Lockey examined claimant’s medical records and looked at whether there were “any other potential factors as it applies to [claimant] that would be known to be associated with a risk for the occurrence of non-Hodgkin’s lymphoma.” Dr. Lockey concluded that he “could not identify any other known risk factors based on the information that was available to me.”
[A] statistical study may show that a particular type of birth defect is associated with some unknown causes, as well as two known potential causes — e.g., smoking and drinking. If a study shows that the relative risk of injury for those who smoke is 1.5 as compared to the general population, while it is 1.8 for those who drink, a plaintiff who does not drink might be able to reanalyze the data to show that the study of smoking did not account for the effect of drinking on the incidence of birth defects in the general population. By making the appropriate comparison — between non-drinkers who smoke and nondrinkers who do not smoke — the teetotaller plaintiff might be able to show that the relative risk of smoking for her is greater than two.
¶ 81. Courts have previously held that it is an “accepted methodology” to engage in an “analysis of medical literature and case study comparison with the individual characteristics of the patient’s case to determine” the cause of a disease. Celia v. United States,
¶ 82. The majority notes that the Baris study found that the level of excess risk of non-Hodgkin’s lymphoma “was not associated with an increased number of lifetime runs, and that, in fact, the standardized mortality ratio was highest in those individuals who made the lowest number of firefighting runs.” Ante, ¶ 33. There are three problems with the majority’s approach here: (1) it does not address any of the other factors particular to this claimant that the doctors relied upon in making their conclusions, such as claimant’s lack of protective safety equipment, lack of other known risk factors, and contraction of a type of non-Hodgkin’s lymphoma that is linked to solvents released during fires; (2) the majority’s questioning of the experts’ opinions is precisely the type of issue that goes to the weight of those
¶ 83. The Baris study itself noted that “[s]mall numbers of observed deaths in the subcategories of the . . . cumulative runs analyses resulted in imprecise risk estimates.” Dr. Guidotti noted that “Baris is very clear . . . that they don’t consider that the runs analysis was particularly useful.” Dr. Lockey, working with Dr. LeMasters, takes the same position and lists numerous possible alternative explanations, including “gross miselassifieation,” “a chance finding,” and “a healthy survivor effect.”
¶ 84. Dr. Guidotti and Dr. Lockey unsurprisingly found these alternative explanations more convincing than insurer’s counter-intuitive claim (adopted by the majority today) that increased exposure to fires can decrease the likelihood of getting cancer. It is undisputed that firefighting exposes firefighters to known carcinogens: most of the studies presented to the trial court below state that as a given. For instance, the first sentence of the Baris study notes that “[f]irefighters are exposed under uncontrolled conditions to a wide variety of toxic chemicals including known and suspected carcinogens, such as benzene and formaldehyde in wood smoke, polycyclic aromatic hydrocarbons (PAHs) in soot and tars, arsenic in wood preservatives, asbestos in building insulation, diesel engine exhaust, and dioxins.” A carcinogen is defined as “a substance or agent producing or inciting cancer.” Webster’s New Collegiate Dictionary 165 (1981). Thus, it is difficult to understand why the majority puts any stock in the claim that increased exposure to carcinogens decreases one’s chance of cancer — a claim that is inherently self-contradictory, is called into question by the Baris study itself, and is resoundingly rejected by all three of claimant’s experts below.
¶ 86. The trial court recognized that “[t]he court in Daubert II even acknowledged that a study showing a ‘relative risk of less than two’ could be admissible, if ‘combined with other evidence to show it is more likely than not that the accused cause is responsible for a particular plaintiff’s injury.’ ” (quoting Daubert II,
¶ 87. These facts could easily lead a reasonable jury to conclude that because claimant fought fires for forty years, he was exposed to more carcinogens — and was at greater risk for contracting non-Hodgkin’s lymphoma — than the average firefighter discussed in the epidemiological studies. Again, this comes back to this Court’s well-reasoned statement that “[t]he admitted evidence does not alone have to meet the proponent’s burden of proof on a particular issue.” USGen,
¶ 88. Although it is my view that adopting the 2.0 standard was a clear error of law here, even if that standard were acceptable the trial court abused its discretion by failing to provide any explanation as to why it excluded evidence based upon the Aronson, Figgs, and Sama studies — all three of which exceeded the 2.0 standard. Granted, the Aronson study could properly be excluded because its 2.04 relative risk finding was not statistically significant, as it had a confidence interval that included the number 1.0. But the trial court never explains that as a reason for excluding the Aronson study. More importantly, the Figgs and Sama studies could not be excluded as statistically insignificant, because both of these studies had confidence intervals entirely above 1.0. The Figgs study found a relative risk of 5.6, and the Sama found a relative risk of 3.27. Both of these studies were statistically significant and met the trial court’s strict 2.0 standard. Therefore, the court’s failure to explain why these studies were not themselves sufficient support for the opinion evidence constitutes an abuse of discretion that requires reversal. Joint Eastern,
¶ 89. In this situation, the majority is wrong to conclude that there is “too great an analytical gap between the data and the opinion proffered.” Joiner,
¶ 90. The trial court itself recognized that “two [of the studies] show a relative risk greater than 2.0 — Figgs and Sama.” But the trial court appears to require some unspecified percentage (the majority? all?) of surveyed epidemiological studies to meet the 2.0 standard before the jury can even hear about any of the studies. Even Daubert IPs hard-line adoption of the 2.0 standard noted that in that case “[n]one of plaintiffs’ epidemiological experts claims that ingestion of Bendectin during pregnancy more than
¶ 91. Although the trial court found that the epidemiological studies “reflect widely varying degrees of relative risk,” that is not a reason to exclude all of the studies. Just because the studies had different results does not mean that they are all wrong, and claimant should be allowed to argue to the jury why the Figgs and Sama studies are the studies that arrived at the correct relative risk for claimant. That is particularly true here, where claimant’s experts found that although the relative risks were different, they for the most part all pointed in the same direction. As Dr. Lockey stated, there was “consistency across the medical literature based on epidemiology studies of, in fact, a cause-effect relationship between this profession and the occurrence of non-Hodgkin’s lymphoma.”
¶ 92. The trial court’s unexplained dismissal of the Figgs and Sama studies is also problematic because it implies that the trial court improperly weighed these studies against other studies. Rule 702 “is not intended to authorize a trial court to exclude an expert’s testimony on the ground that the court believes one version of the facts and not the other.” Advisory Committee Notes, 2000 Amendments, F.R.E. 702. Numerous courts have come to the same conclusion. See, e.g., Heller v. Shaw Indus., Inc.,
IV.
¶ 93. In the final part of the trial court’s opinion, it addressed the methodology of claimant’s experts in producing a meta-analysis, and the court concluded that this methodology was not
¶ 94. In the final two pages of the opinion below, the trial court openly disclosed the following instances of confusion regarding the proposed expert testimony: “it is unclear from Plaintiff’s brief how [Dr. Guidotti] reached his opinion”; “[w]e do not know what scientific method he used”; “[t]his [study] apparently includes other types of cancers”; “Lockey and Masters apparently used a ‘weight-of-the-evidence’ approach”; “[it] is unclear whether Lockey and LeMasters’ opinions in this case are based upon the meta-analysis.” (Emphases added.) Finally, the trial court concluded that although the meta-analysis conducted by Dr. Lockey and Dr. LeMasters “may be” a reliable scientific method, “we do not know which studies of which cancers were included in the meta-analysis,” and “we have no way of knowing whether it is based upon sufficient facts or data.” (Emphases added.)
¶ 95. But the trial court did have a way of knowing whether these expert studies were relevant and reliable: holding a Daubert hearing. The trial court never held a Daubert hearing.
¶ 96. At the end of claimant’s response to insurer’s motion for summary judgment, claimant specifically “requested] a hearing pursuant to [Vermont Rule of Evidence] 104” if the trial court found one to be necessary. Although the trial court has a great deal of discretion in determining whether a hearing is necessary, here the court’s own opinion recognizes its confusion on numerous critical issues, and it was therefore an abuse of discretion for the trial court to exclude the experts’ testimony without holding a Daubert hearing or at least engaging in further analysis of the submitted materials.
¶ 97. Courts have noted that a trial court cannot exclude expert testimony without considering all of the data that the experts put forward in support of their conclusions:
*278 Although the district court properly may exclude expert testimony if the court concludes too great an analytical gap exists between the existing data and the expert’s conclusion, here the gap was of the district court’s making. The court did not consider all of the data relied upon by Dr. Spindler, namely, studies by the defendant and others finding that Zyderm can induce autoimmune reactions. Consequently, the court abused its discretion in concluding that Dr. Spindler’s testimony failed to meet Daubert’s scientific knowledge requirement.
Kennedy,
¶ 98. Similarly, in USGen, we upheld the exclusion of evidence in part because the trial judge “heard from all three experts” before excluding evidence,
¶ 99. In summary, the trial court abused its discretion in numerous ways by summarily excluding all of claimant’s evidence and granting insurer’s motion for summary judgment. Although some of claimant’s studies can be excluded because they are statistically insignificant, and although a remand is needed to determine whether the meta-analysis is inadmissible, this does not justify excluding all of the proposed expert testimony.
¶ 100. In the end, this case is indistinguishable from Daewoo, where this Court found an abuse of discretion when a trial court “excluded] expert testimony that met the standards articulated in Daubert and adopted by this Court.” Daewoo,
¶ 101. For these reasons, I would reverse and remand to the trial court to apply the proper legal standard for the admission of evidence. I therefore dissent.
¶ 102. I am authorized to state that Justice Johnson joins this dissent.
As noted by the majority, ante, ¶ 17 n.5, these studies and others are cited and collected within G. LeMasters et al., Cancer Bisk Among Firefighters: A Review and Meta-analysis of 32 Studies, 48 J. Em. Med. 1189 (2006), available at http://www.iaff.org/HS/PDF/Cancer%20Risk%20Among%20Fireflghters%20%20UC%20Study. pdf.
Dr. Lockey, for instance, is an occupational pulmonary physician and has “obtained thousands of occupational histories” in his practice and research. He is especially well qualified to combine his special knowledge of epidemiological patterns with a review of claimant’s particular occupational and medical history to form an opinion on whether firefighting caused claimant’s disease.
The Vermont Legislature recently stated that when a firefighter dies from certain cancers — including lymphoma, which is the type of cancer that killed claimant — “the firefighter shall be presumed to have suffered the cancer as a result of exposure to conditions in the line of duty.” 2007, No. 42, § 2 (emphasis added). In adopting this presumption, the Legislature noted that around “28 states and the provinces of Canada have adopted legislation creating a presumption that certain cancers suffered by eligible firefighters are caused by exposure during their employment as firefighters.” Id. § 1(4). Although here claimant died before the passage of this legislative presumption, it is notable that many others agree with the views of claimant’s experts. Cf. Ambrosini v. Labarraque,
Indeed, the court even excluded studies that exceeded the 2.0 standard.
Thus, the majority’s decision today does not preclude future trial courts from admitting evidence based on a more lenient standard than the 2.0 standard used by the trial court below.
"When the Brock court adopted its standard (requiring studies to have a confidence interval that does not include 1.0), the court noted that it viewed its decision as “encouraging district judges faced with medical and epidemiologic proof in subsequent toxic tort cases to be especially vigilant in scrutinizing the basis, reasoning, and statistical significance of studies presented by both sides.” Id. at 315, as modified on reh’g,
Turpin, like Brock, was also cited with approval by the United States Supreme Court in Daubert. See Daubert,
Indeed, several courts and commentators have stated that even the Brock standard requiring statistical significance is too stringent a test for admissibility. See, e.g., Joint Eastern,
Several courts have held that epidemiological studies that meet the 2.0 standard are in themselves “sufficient to support an inference that an agent caused the particular plaintiffs disease.” King,
Although neither of the doctors ever had a chance to personally examine claimant, this Court has previously stated that an expert need not have “firsthand” knowledge of something to make “conclusions [that] were not speculative but instead were ‘based on what is known.’ ” JAM Golf,
The Baris study itself recognized that a “healthy survivor effect” can underestimate the true risk of exposure: “if there is a survivor effect in which the healthiest workers continued to be employed for a long term, using duration [of employment] as a proxy for exposure may mask a true relationship over the range of duration of employment.” For the same reason, a healthy survivor effect could underestimate the true risk of exposure when only the healthiest workers (those who are naturally less susceptible to contracting non-Hodgkin’s lymphoma) are able to make large numbers of lifetime runs.
Although the majority claims that “[n]o party raises this issue on appeal,” ante, ¶ 25 n.9, the ultimate question on appeal is whether the trial court abused its discretion in dismissing evidence. This Court has previously noted that how “rigorously” a trial court reviewed proposed expert testimony is relevant to determining whether the trial court abused its discretion in dismissing that testimony. USGen,
This does not necessarily mean that the trial court would need to hold a Daubert hearing on remand. Further analysis of the documents that have already been submitted could make it clear that the meta-analysis is either admissible or inadmissible. For instance, although the trial court noted that the meta-analysis is suspicious because it looks at studies involving related cancers in addition to non-Hodgkin’s lymphoma, the court ignored the experts’ explanation that such studies are in fact more reliable than narrower studies —• an explanation that could tip the balance in favor of admissibility. Nevertheless, this should be left for the trial court to decide in the first instance after a more thorough analysis.
For instance, as claimant told the trial court at oral argument, the meta-analysis “is not the only piece of the puzzle,” but is rather “just one study” among many.
