MEMORANDUM OPINION AND ORDER
This suit concerns allegations that Defendant Fola . Coal Company, LLC,. has violated the narrative water quality standards of three separate permits for discharges from three mines into tributaries of-Leatherwood Creek. On June 1-4, 20Í5, the Court held a bench trial regard
As explained below, the Court FINDS that Plaintiffs have established, by a preponderance of the evidence, that Defendant has committed at least one violation of its permits governing Fola Mine No. 2 and Fola Mine No. 6 by discharging into Road Fork and Cogar Hollow high levels of ionic pollution, which have caused, or materially contributed to a significant adverse impact to the chemical and biological components of the applicable streams’ aquatic ecosystem, in violation of the narrative water quality standards that are incorporated into those permits.- However, the Court further FINDS that Plaintiffs have not met their burden in establishing liability for alleged violations with respect to discharges from Fola Mine No. 4A into Right Fork, under NPDES Permit No. WV1013815.
I. Background
Plaintiffs Ohio Valley Environmental Coalition (“OVEC”), West Virginia Highlands Conservancy, and Sierra Club filed this case pursuant’ to the citizen suit provisions of the Federal Water Pollution Control Act (“Clean Water Act” or “CWA”), 33 U.S.C. § 1251 et seq., and the Surface Mining Control and Reclamation Act (“SMCRA”), 30 U.S.C. § 1201 et seq. CompL, ECF No. 1. Before proceeding to the parties’ evidence and arguments, the Court will first discuss the relevant regulatory framework and the factual background of this case.
A. Regulatory Framework
The primary goal of the CWA is “to restore and maintain the chemical, physical, and biological integrity of the Nation’s waters.” 33 U.S.C. § 1251(a). To further this goal, the Act prohibits the “discharge of .any pollutant by any person” unless a statutory exception applies; the primary exception is the procurement of a National Pollutant - Discharge Elimination System (“NPDES”) permit. 33 U.S.C. §§ 1311(a), 1342. ■ Under, the ,NPDES? the U.S. Environmental Protection Agency (“EPA”) or an authorized state agency can issue a permit for thé discharge of any pollutant, provided that the discharge complies' with the ■ conditions of the CWA. 33 ■ U.S.C. § 1342. A state may receive approval to administer a state-run NPDES program under the authority-of 33 U.S.C. § 1342(b). West Virginia received such approval, and its NPDES program is administered through the West Virginia Department of Environmental Protection (“WVDEP”). 47 Fed.Reg. 22363-01 (May 24, 1982). All West Virginia NPDES permits incorporate by reference West Virginia Code of State Rules § 47-30-5.1.f, which states that “discharges coyered by a WV/NPDES permit are to be .of such quality so as not to cause violation of applicable water quality standards promulgated by [West Virginia Code of State Rules ■§ 47-2].”
Coal mines are also subject to regulation under the SMCRA, which prohibits any person from engaging in or carrying out surface coal mining operations without first obtaining a permit from the Office of Surface Mining Reclamation and Enforcement (“OSMRE”) or an authorized state agency. 30 U.S.C. §§ 1211,1256,1257. A state may receive approval to administer a state-run surface mining permit program under the authority of 30 U.S.C. § 1253. In 1981, West Virginia received conditional approval of its state-run program, which is administered through the WVDEP pursuant to the West Virginia Surface Coal Mining and Reclamation Act (“WVSCMRA”). W. Va.Code §§ 22-3-1 to -33; .46 Fed. Reg. 5915-01 (Jan. 21, 1981). Regulations passed pursuant to the WVSCMRA require permittees to comply with the terms and conditions of their permits and all applicable performance standards. W. Va. Code R. § 38-2-3.33.C. One of these performance standards requires that mining discharges “shall not violate effluent limitations or cause a violation of applicable water quality standards.” Id. § 38-2-14.5.b. Another performance standard mandates that “[a]dequate facilities shall be installed, operated and maintained using the best technology currently available ... to treat any water discharged from the permit area so that it complies with the requirements of subdivision 14.5.b of this subsection.” Id. § 38-2-14.5.C.
B. Factual Background
This controversy concerns discharges from three surface mines along the southern portion of the Leatherwood Creek watershed: ,(1) Fola Surface Mine No. 2 in Clay and Nicholas Counties, West Virginia; (2) Fola Surface Mine No. 4A in Clay County, West Virginia; and (3) Fola Surface Mine No. 6 in Nicholas County, West Virginia.' Stipulátion ¶ 6, ECF No. 53.
Defendant’s mining activities at Surface Mine No. 2 are regulated under WV/ NDPES Permit WV1013840 and West Virginia Surface Mining Permit S201293, both originally issued in 1994. Id. at ¶¶ 6-7. WVDEP reissued WV/NPDES Permit No. WV1013840 in 2001, 2004, 2008, and 2014. At the time this complaint, was filed, the 2008 reissuance was in effect. Outfall 001 of Surface Mine No. 2 discharges into Road Fork and Leatherwood Creek. Id.
Defendant’s mining activities at Surface Mine No. 4A' are regulated under WV/ NPDES Permit WV1013815 and West Virginia Surface Mining Permit S200502. Id. at ¶¶ 21, 23. WV/NPDES Permit WW1013815 was originally issued in 1993, and was reissued in 1999, 2006, 2008, and 2014. At the time this complaint was filed, the 2008 reissuance was in effect.- Outfalls 22, 23, and 027 of Surface Mine No. 4A discharge into Right Fork of Leatherwood Creek and Cannal Coal Hollow.
Finally, Defendant’s mining activities at Surface Mine No. 6 are regulated under WV/NPDES Permit WV1018001 and West Virginia Surface Mining Permit S2011999, both' originally issued in 2000. Id. at ¶¶ 42-44. WVDEP reissued WV/NPDES
In recent years, water quality measurements from the above listed discharges have routinely shown discharges of high conductivity. Stipulation ¶14, ECF No. 53 (showing discharges from Outlet 001 at Mine No. 2 with conductivity measurements consistently around 3000 ■ pS/cm); id. at ¶ 32 (showing discharges from Outlets 22, 23, 27 at Mine No. 4A consistently ranging from approximately 1500 pS/cm to above 3000 pS/cm); id. at ¶47 (showing discharges from Outlets 013, 015, 017 with conductivity measurements consistently ranging from approximately 2500 pS/cm to 4000 pS/cm). Water quality measurements have also revealed elevated conductivity in Leatherwood Creek and its tributaries. Id at ¶ 13 (showing conductivity levels ranging from 3000 pS/cm to 4000 pS/cm in Road Fork); id. at ¶ 33 (showing conductivity levels consistently above 1000 pS/em below Mine No. 4A); id. at ¶ 46 (showing conductivity levels ranging from 3000 pS/cm to 5000 pS/cm in Cogar hollow).
On June 1-4, 2015, the Court conducted a bench trial on liability issues. At the close of the evidence, the Court entered an oral finding on general causation, but reserved judgment on issues of specific causation. Tr. 4 at 259-60, June 4, 2015, ECF No. 107. Since that time, the parties have provided timely post-trial briefing. In Section II, the Court will review the evidence and arguments concerning general causation and elaborate on its general causation finding. ■ In Section III, the- Court will move on to review the evidence and arguments concerning specific causation for each of the ■ three- mine permits at issue.
Generally speaking, Plaintiffs are faced with the dual burden of establishing both general and specific causation.
A. Introducing the EPA’s Benchmark
Yet again, the Court begins its analysis of general causation with arguments concerning the import and reliability of the EPA’s Benchmark. See Elk Run,
In the nearly three hundred page Benchmark, the EPA reached the conclusion that “salts, as measured by conductivity, are a common cause of impairment of aquatic macroinvertebrates” in central Appalachian streams only after considering and' then ruling out the potential confounding effects of habitat, organic enrichment, nutrients, deposited sediments, pH, selenium, temperature, lack of headwaters, catchment areas, settling ponds, dissolved oxygen, and metals. EPA’s Benchmark at. A-l, B — 1; see also id. at A-40 (“This causal assessment presents clear evidence that the deleterious effects to benthic invertebrates are caused, by, not just associated with, the ionic strength [, i.e., conductivity,] of the water.-... When [other potential] eauses are absent or removed, a relationship between conductivity and ephemeropteran [, i.e. mayfly,] richness is still evident.” (emphasis added)); id. at A-37 (“As conductivity increases, the occurrence and capture probability decreases for many genera in West Virginia ... at the conductivity levels predicted to cause effects. The loss of these genera is a severe and clear effect.”). The Benchmark also found that “of the [nine] land uses ... analyzed, only mining especially associated with valley fills[, i.e,, mountaintop mining with valley fills,] is a substantial source' of the salts that are measured as conductivity.” Id. at A-18.
The EPA ultimately concluded that the “chronic aquatic life benchmark value for conductivity” in West Virginia streams is 30Ó |xS/cm. Id. at xv. To derive this recommended high-end threshold value, the EPA used the 5th percentile of a species sensitivity distribution, based on the standard methodology for deriving water-quality-criteria, meaning that this 300 gS/cm benchmark value for conductivity is “expected to avoid the local extirpation [due to the salts measured as conductivity] of 95% of native species.” Id. at xiv.
In support of' both the specific 300 gS/cm benchmark value and the general causal linkage between conductivity and impairment to aquatic macroinvertebrates, the Benchmark contains a graph which charts, for 163 genera, the level of ionic exposure above which a genus is effectively, absent from water bodies in a region, with conductivity readings on the x axis and proportion of genera extirpated on the y axis. Id. at xiv, 18 fig. 8. A fairly consistent line is formed as conductivity and extirpation both increase, illustrating the causal connection between conductivity and significant biological impairment which Plaintiffs seek to prove. See id. at 18 fig. 8. Relatedly, the EPA reported its finding that “the probability of impairment’ at 500 jxS/cm is 0.72 and at 300 gS/cm, is 0.59.” Tr. 2 at 111, June 2, 2015, EOF No; 100;
Upon reviewing the EPA’s findings, the Scientific Advisory Board (“SAB”) made the follow comments:
Mountaintop mining and valley fills are important sources of stress to aquatic systems in the Central Appalachian region, both from the perspective of localized and cumulative regional impacts. In a companion report, the Panel provides a review .of EPA’s assessment of the impacts associated with mountaintop .mining and valley fills. There is clear ..evidence that valley fills are associated with increased levels of dissolved ions (measured as conductivity) in downstream .waters, and that these increased levels of conductivity are associated with changés in the composition of stream biological communities.
Pis.’ Ex. 128 at PE1418, Tr. 2 at 112-13, ECF No. 100. The SAB further concluded that the EPA had presented a “convincing case” for establishing the causal relátionship between conductivity and loss of genera. Pis.’ Ex. 128 at PE 1431, Tr. 2 at 115, ECF No. 100.
Plaintiffs rely . on EPA’s Benchmark here as they have elsewhere: as a scientific study which, among others, supports Plaintiffs’ general causation theory that high conductivity levels in streams impacted by alkaline mine ■ drainage causes or contributes to biological impairment. Defendant’s. many critiques of the EPA Benchmark will be considered below. Before doing so‘, however, it is necessary to briefly revisit general principles regarding the degree of deference owed EPA’s Benchmark in the analysis to follow.
“Particularly with environmental statutes such as the Clean Water Act,.the regulatory framework ... requires sophisticated evaluation of complicated data____ [A court] therefore do[es] not sit as a scientific body in such cases, meticulously reviewing all data.under a laboratory microscope.” - Crutchfield v. Cnty. of Hanover, Virginia,
“[technological and scientific issues ... are by their very nature difficult
In light of these precedents, and as previously analyzed by this Court, EPA’s Benchmark must be afforded deference. See Elk Run,
B. Critiques of the EPA Benchmark
Turning to consider newly presented evidence and argument, two recurring questions underlie the Court’s instant analysis of general causation as related to EPA’s Benchmark: (1) whether specific expertise in epidemiology is required for the development or review of EPA’s Benchmark; and, in a similar vein, (2) whether specific expertise in ecology is required for the development or review of EPA’s Benchmark.
As argued by Defendant, because the EPA incorporated principles of epidemiology into its causal analysis, assessing the reliability of the EPA’s findings requires review by an epidemiologist.
Taking a particularly illustrative example, Dr. Garabrant’s critique of Table B-7 of the EPA Benchmark suggests an inability to correctly apply common statistical tools presumably well within his epidemiological expertise and, in doing so, to adequately interpret available ecological data. Table B-7 offers a relatively straightforward presentation of data purporting to represent two regression lines. Joint Ex. 17 at JE0773. ‘ Dr. Garabrant demonstrated the perceived failings of the table by attempting to recreate the graph. See Tr.
Contrary to Dr. Garabrant’s opinion, Plaintiffs’ expert found the same Table B-7 to be perfectly sensible with .the addition of a single interpretive move. Though explained elsewhere in the Benchmark (see, e.g., Figures 13a, 13b, 13c, 13d, and 13e), the authors made no explanation of whether a logarithmic scale shquld.be used to interpret the data shown at Table B-7.
Thus, what appeared to one epidemiological expert to be an incorrect and nonsensical table, was in fact a perfectly sensible table that the authors merely neglected to. adequately label for non-expert reviewers.' While there is likely no across-the-board answer to what respective degrees of epidemiological and ecological expertise are 'necessary to evaluate the EPA’s Benchmark, this - example serves as a ready reminder in-the analysis-to follow that something beyond wholly non-contextual data analysis may be needed.
1. Assessment of Effect Modification
Turning to Dr. Garabrant’s broader critiques of the EPA Benchmark, we begin with the suggestion ¡that the Benchmark is fundamentally flawed insofar as the EPA failed to account for effect modification. As explained by Dr. Garabrant, “[e]ffect modification occurs when the association between two factors is different depending on' the presence or absence of a third factor. If, for example, the association
Returning to Dr. Garabrant’s critique, indeed, the words “effect modification” cannot be found in the text of the Benchmark, suggesting to Dr. Garabrant that the EPA did no analysis of effect modification. In an effort to assess the presence or absence of effect modification, Dr. Gara-brant turned to the underlying data and produced a series of figures purportedly showing the presence of effect modification. See Def.’s Exs. 31-36. In each table, the percent of sites with ephemeropt-era present is represented along the y-axis, conductivity is represented along the x-axis, and blue, red, and green lines run across the graph as representations of low-, mid-, and upper-range values for a given potential effect modifier, respectively. Taking the example of 'pH as a potential effect modifier (Def.’s Ex. 31), Dr. Garabrant explained that the effect of pH can be gleaned from looking at “the vertical distance between the green line and the blue. line holding conductivity constant,” with greater vertical distance suggesting greater likelihood of effect modification. Tr. 1 at 36-37, ECF No. 105.
Using these tables to thusly visualize the data, Dr. Garabrant reached the conclusion that effect modification was present with respect to pH (Tr. 1 at 37, ECF No. 105 (discussing Def.’s Ex. 31: “[i]f pH is neutral to high, there is no relationship between conductivity and Ephemera. If pH is low, the insects are adversely affected. That’s what effect modification looks like”)), stream size (Tr. 1 at 39, ECF No. 105 (discussing Def.’s Ex. 32: “I think you have evidence here of effect modification ..You are getting a different answer according to stream size. ' That’s effect modification”)), dissolved oxygen (Tr. 1 at 41, ECF No. 105 (discussing Def.’s Ex. 33)), iron (Tr. 1 at 43, ECF No. 105 (discussing Defi’s Ex. 34)), and manganese (Tr. 1 at 44-45, ECF No. 105 (discussing Defi’s Ex. 36)). Thus, the Court has one trained epidemiologist, with no formal background or experience in ecology, claiming that the EPA neglected to consider effect modification, and in so doing, missed the presence of several effect modifiers, thereby undermining the entirety of its causal analysis.
A second trained epidemiologist offered testimony on the same issue, but reached starkly different conclusions. Responding to Dr. Garabrant’s analysis of effect modification and EPA’s Benchmark, Plaintiffs’ expert, Dr. Wing, cautioned that, “in order to make á. decision about interaction or effect modification, it’s first necessary to have some idea about the topic one is investigating because without that, one can make egregious mistakes about an analysis which can be done by someone who doesn’t know anything about the topic but could result in essentially meaningless conclusions or actually conclusions that are misleading.” Tr. 2 at 20, ECF No. 100. He
Consistent with that fundamental reservation and despite his considerable epidemiological expertise, Dr. Wing was unable to agree with Dr. Garabrant’s conclusion that effect modification is present in the dataset and yet left unaddressed by EPA. Looking, for example, at Dr. Garabrant’s figure assessing dissolved oxygen as a potential effect modifier (Def.’s Ex. 33), Dr. Wing observed similar trends across low-, mid-, and upper-range dissolved oxygen levels. Tr. 2 at 45, ECF No. 100. According to Dr. Wing, the similarity of the trend, or slope, suggests an absence of effect modification. Id. Moreover, solely based on the graph relied upon by Dr. Garabrant, Dr. Wing explained that it was impossible to definitively assess effect modification because Dr. Garabrant neglected to include any information on sample size or precision (e.g., no slope estimates or standard error estimates are provided). Tr. 2 at 46, ECF No. 100. And so the opinion of a second epidemiologist without ecological training reaches not only a contrary conclusion about effect modification, but further identifies an analytical barrier to reliably interpreting the graphs relied upon by the first.
To that uncertain mix, Dr. Baker contributes his opinion on effect modification as an ecologist without formal epidemiological training. Like Dr. Wing, Dr. Baker similarly critiqued the absence of information on sample size or precision.
Even standing independently, the rebuttal arguments offered by Dr. Wing and Dr. Baker arguably do enough to dispose of Dr. Garabrant’s critique of the EPA Benchmark related to analysis of effect modification.' Dr. Wing’s testimony effectively draws the adequacy of Dr. Gara-brant’s expertise into question, and Dr. Baker’s testimony demonstrates the importance of ecological expertise in reading and evaluating the EPA’s work. Furthermore, Plaintiffs’ expert testimony does not stand alone; it is acéompanied by and consistent with the expertise and analysis of the EPA — an expert federal agency acting in its area of expertise.
2. Analysis of Confounding
Dr. Garabrant further argued that the EPA performed an unreliable analysis of confounding, thereby rendering EPA’s causal conclusions invalid. The Dictionary of Epidemiology defines “confounding” as
[T]he ■ distortion of a ‘-measure of the effect of an exposure on an outcome due to the association of the exposure with other; factors that influence the occurs rence of the outcome.- Confounding occurs when all or part of the apparent association between the exposure and the outcome is in fact accounted for by other variables that affect the outcome and are not themselves affected by exposure,”17
As argued by Dr. Garabrant; the EPA failed to adequately and reliably assess
Asked to comment on the validity of EPA’s approach to analyzing confounding, Dr. Garabrant hesitated to say whether the approach was valid or not. Tr. 1 at 16, ECF No. 105 (“It’s hard to say whether it is valid. I have never seen it used. I haven’t seen any-validation of it. I-have never seen any test of this method to show that it works. So I would say it’s not known whether it’s reliable or not. EPA created it.”). Beginning broadly, Dr. Gar-abrant called attention to the following paragraph from the Benchmark explaining the EPA’s approach to confounding in its causal analysis:
Weighing evidence for confounding factors differs from weighing evidence for causation. The causal assessment in Appendix A determines whether dissolved salts are an important cause of biological impairment ■ in the region. This assessment of confounding accepts the result of the causal assessment and attempts to determine whether any of the known potential confounders interfere with estimating effects of conductivity to a significant degree.
EPA Benchmark at B-3. As explained by Dr. Garabrant, this paragraph reflects an analytical error on the part of the EPA akin to “putting the cart before the horse.” Tr. 1 at 17,‘ ECF No. 105 (“[T]he idea that you accept the result of causal assessment and then look at confounding is simply putting the cart before the horse. It’s backwards.”). In addition to this analytical error, Dr. Garabrant suggested that EPA’s approach was relatively arbitrary and subjective. Tr. 1 at 16, ECF No. 105.
Dr. Garabrant then went on to explain that epidemiologists commonly rely on a relatively straightforward way to identify the presence of confounding effects: compare the results of a crude analysis testing the association between conductivity and extirpation against the results of an adjusted analysis that introduces a potential confounding factor.
[[Image here]]
[[Image here]]
Referring to this table, Dr. Garabrant offered testimony that any change in parameter estimate after adjusting for a given variable greater than 10% signals the presence of confounding. Tr. at 71-72. On cross-examination, however, Dr. Garabrant readily acknowledged that while “there is widespread agreement -that more than fifty percent change is important,” in the range of ten to twenty percent, judgments about confounding would depend on the analyst’s background knowledge in the subject matter at issue. Tr. 1 at 72, ECF No. 105. As already mentioned, Dr. Garabrant is without precisely that background knowledge. Accordingly, the Court is left to conclude that Dr. Garabrant’s'ten percent threshold is itself arbitrary and unreliable.
Beyond analytical differences of opinion, the Court is further unmoved by Dr. Gara-brant’s analysis of confounding given the quality of underlying data and the nature of certain variables. First, as will be discussed at length in the next section, some of the variables analyzed by Dr. Garabrant for confounding are known to ecologists to have little to no relevance in the context of West Virginia streams impacted by alkaline mine drainage. (e.g., orthophosphates, see infra Section II.B.3.a). Second, and also discussed below, the database lacks a significant number of data points for some of these variables unless total and dissolved values are considered together. (e.g., magnesium, calcium, selenium, and manganese, see infra Section IÍ.B.3.a).
Like Dr. Garabrant, this Court would be unable to set anything but an arbitrary threshold for recognizing potential confounding variables. Instead, -the Court continues to. rely on the expertise of ecologists. and testimony assuring the Court that the EPA engaged a reasonable and verified analysis of confounding.
3. Adequacy of the underlying data
Dr. Garabrant offered two critiques of the data relied upon by EPA in developing
a. “Missing ” Data
With respect to allegations of fatally missing data, Dr. Garabrant prepared a table reporting the number and percent of missing data points for each variable missing greater than 50% of the possible data points. Def. Ex. 38 (reproduced below).
[[Image here]]
According to the table, greater than 98% of the data points are missing for dissolved magnesium, dissolved calcium, dissolved manganese, dissolved orthophosphates, and total orthophosphates. Def.’s Ex. 38. Additionally, between 67% and 78% of the data was missing for various land cover variables, 85% of the data was missing for dissolved selenium and 77% missing for total selenium. According to Dr. Gara-brant, these deficiencies in the dataset prevented the EPA from meaningfully analyzing potential effects of these variables.
However, as an ecologist, Dr. Baker was not similarly troubled by the missing data. First, he explained that deficiencies found in dissolved magnesium, were well compensated for by data on total magnesium.
With respect to alleged missing data on land cover categories, Dr. Baker was similarly untroubled. Dr. Baker testified that, as an ecologist, one would not be likely to consider any of these variables as potential confounding variables and would therefore likely ignore these categories as immaterial. Id. Finally, with respect to ortho-phosphates, Dr. Baker explained again that the lack of data would not trouble an ecologist in this context, because ortho-phosphates in high concentrations are associated with agricultural landscapes, which are generally not found near — much less coextensive with — mining areas in West Virginia. Id. at 201. Stated differently, the absence of land cover data or orthophosphate measurements would only trouble a reviewer to the extent that he did not have the necessary background familiarity with ecology and land use patterns to independently recognize the insignificance of the variables.
b. “Hidden” Data
In addition to purportedly fatally missing data, Dr. Garabrant also criticized the EPA Benchmark for “hiding data.” As explained by Dr. Garabrant, a series of tables provided in Appendix B (Tables B-8 (habitat), B-13 .(Embeddedness), B-15 (pH), B-23 (stream size), B-25 (dissolved oxygen), B-28 (iron), B-29 (Aluminum), B-30 (Manganese)) all share a common flaw: failure to include significant chunks of data for conductivity levels between 200 ¡xS/cm and 1500 pS/cm, instead, only showing data at extreme conductivity conditions (i.e., < 200 fxS/cm and > 1,500 pS/cm). In some instances, these tables fail to include significant mid-range data for not only conductivity, but also for the variable of interest (e.g., mid-range iron data was not included in Table B-28). Dr. Garabrant recognized and critiqued the absence of mid-range data, and was unable to provide any methodological or analytical justification for its absence.
Making good use of database access, Dr. Garabrant recreated the suspect tables to include mid-range data (see Def.’s Exs. 39, 43-49) and then continued to plot the data represented in each table (see Def.’s Exs. 30-36). Based on this information, Dr. Garabrant reached two conclusions: (1) effect modification is present with respect to each co-variate represented; and (2) there is. a consistent absence of conductivity effects regardless of .co-variate levels until conductivity reaches 1200 pS/cm to 1500 pS/cm. These conclusions suggested to Dr. Garabrant that there are serious flaws in the EPA’s analysis. Tr. 1 at 33-46, ECF No. 105. In turn, Dr. Garabrant’s only explanation for how the Benchmark reached publication despite hidden data became the suggestion that the SAB and peer reviewers would not have had access to the dataset and the ability to perform the analysis he did. Tr. 1 at 48-49, ECF No. 105.
Through the testimony of Dr. Baker and Dr. Wing, Plaintiffs supplied convincing rebuttals to each criticism related to “hidden data” raised by Dr. Garabrant. Not only are these rebuttal arguments convincing, but to some extent, they also highlight the drawbacks of data analysis performed with relatively limited understanding of the subject matter being analyzed. First, Dr. Baker explained a methodological reason EPA did not include mid-range data in
Given Dr. Baker’s more nuanced explanation of the import of the data shown and the data not shown, Dr, Garabrant’s criticism of hidden data does little to impugn the work of EPA. scientists, instead illustrating Dr. Wing’s point that data analysis is not the same as the interpretation of data. While an epidemiologist may be- qualified to run data analyses on any dataset, it should not be assumed that an epidemiologist is necessarily otherwise qualified to interpret the results of that analysis.
Second, Dr. Baker tailored his interpretation of the data presented according to the function. of the tables in the overall analysis. Importantly,, these tables were not relied upon by the EPA .to identify threshold effects; these tables were introduced to explain EPA’s confounding analysis. Presence or absence of some mayfly genera without any data on abundance or variety explains very little that would help to identify a conductivity threshold at which the most sensitive macroinverteb-rates suffer extirpation.
4. Inter-state differences in species sensitivity
Dr.- Garabrant’s observations about the differences between XC95 values in West Virginia and- in Kentucky are among his most immediately compelling observations. The Benchmark’s XG95 values report genera-specific response thresholds at which you can expect 95% of freshwater macroin-veterates to tolerate conductivity levels. While -it is not surprising to expect different genera to have different response thresholds, Dr. Garabrant made the troubling observation that there are variances in the response thresholds within genera based on whether the data was sourced from West Virginia or Kentucky. For example, cross comparison of tables reveals the following differences between response
[[Image here]]
It is easy to share Dr. Garabrant’s, shock upon noticing that different genera apparently have different response thresholds depending on whether they are observed in one state or another.
Though Dr. Garabrant could only imagine such discrepancies suggested fatal methodological and analytical flaws, Dr. Baker readily offered sound explanations rooted in the nature of data analysis and data collection. Tr. 4 at 211-14, ECF No. 107. First, whatever the differences in XC95 values of a few genera, across the entire datasets, the XC95 are very well correlated.
Given these differences in collection methods and database size, one would expect that the EPA would identify two different and loosely associated benchmarks. Yet the methodology used by-the EPA identified remarkably similar benchmark values notwithstanding species-specific differences.
C. Scholarly Publications and Expert Opinions
In addition to EPA’s Benchmark, Plaintiffs further relied on a seemingly ever-growing colléction of published, -peer-reviewed journal articles addressing the connection between conductivity and impairment in ' Appalachian streams. Through the testimony of experts, ■ the Court was introduced to myriad peer-reviewed articles. In revisiting that collection of articles below, note the complete absence of peer-reviewed scientific articles to the contrary.
The scientific literature concerning the relationship between conductivity and impairment likely began in earnest in 2003 with publication of an Environmental Impact Study (EIS) of mountaintop mining valley fills. Tr. 2 at 99, ECF No. 100. Authors of the EIS examined changes in water chemistry and biological assemblages, finding increased concentrations of sulfates and dissolved solids, increased specific conductance, and a coincident decrease in sensitive taxa in impacted streams. Tr. 2 at 99-100, ECF No. 100. Other early publications included a paper by Kennedy et al. the following year, relaying the finding that exposure to elevated conductivity levels resulted in loss of organisms (Joint Ex. 9; Tr. 2 at 100, ECF No. 100), and a 2005 publication by Hartman examining the relationship between conductivity and mayfly richness. Tr. 2 at 100, ECF No. 100.
In 2008, Gregory Pond et al. published a paper in the Journal of North American Benthological Society, titled “Downstream
Furthermore, Pond et al. calculated correlation coefficients of “[GLIMPSS] and [WVSCI] and genus- and family-level non-metric multidimensional scaling (NMS) axis scores [verses] a truncated list of environmental variables,” including conductivity, embeddedness scores, sediment deposition scores, and total rapid biological protocol (“RBP”) habitat scores.
Though relatively insignificant, Pond 2008 did find some positive correlation between habitat quality and aquatic life; however, subsequent studies have made further efforts to parse the difference between habitat influenced effects and the effects of water chemistry. See e.g. Pis.’ Ex. 173 at PE Í537 (“This suggests that degradation of water quality and the resultant increases in specific conductivity, component ions, and trace metals limit aquatic life regardless of habitat quality.”); Virginia Tech, Pis.’ Ex. 173 at PE 1703 (“Nonetheless, the extensive effort undertaken to locate test. sites with abiotic conditions comparable to those of reference sites was successful in minimizing biotic influence from non-TDS [total dissolved solids] stressors, including poor habitat quality. This was an important-step toward defining TDS sensitivity ... ”)
The following year, Pond published a second article, “Patterns of Ephemeropt-era taxa loss in Appalachian headwater streams,” in Hydrobiologia. Pls.’ Ex. 131. As was the case with his earlier work, Pond again relied on experimental fieldwork, but he conducted unique fieldwork in a different area. Id. In this second article, Pond' compared mayfly assemblages at some ninety-two sites' in Kentucky, focusing on taxa richness (i.e., “the number of different groups of mayflies,” Tr. 2 at 105, ECF No. 100) and relative abundance. In so doing, Pond discovered that both mayfly richness and relative abundance were significantly higher at reference sites and both were significantly lower at mined sites. Tr. 2 at 105-06, ECF No. 100. Furthermore, consistent with earlier analyses, Pond reported that “[Relative mayfly abundance was most strongly correlated to specific conductance (r = 0.72) compared to total habitat score (r = 0.59).” Pis.’ Ex. 173 at PE1526; id. at PE1536 (“Analyses from WV mining areas (Hartman et al., 2005; Merricks et al., 2007; Pond et al. 2008) indicated that the decline in mayflies from mountaintop mining correlates most strongly to specific conductance.”).
In the same year thát Pond 2010 was published, Science Policy Forum published an article co-authored by Dr. Palmer and titled “Mountaintop Mining Consequences.”
The following year, the Journal of the North American Benthological Society published a paper by Eric Merriam et al., titled “Additive effects of mining and residential development on stream conditions in a central Appalachian watershed.” Joint Ex. 11. As explained by Dr. Palmer, Merriam et al., examined the combined effects of streams impacted by mining as compared to streams otherwise impacted by development, finding that mining impacts do contribute to changes in macroin-vertebrate community structure. Tr. 2 at 108-09, ECF No. 100. These changes in community structure would appear to have been more closely related to changes in water chemistry as compared to changes in habitat. Pis.’ Ex. 173 at JE0173-74 (“We found significant effects of mining on in-stream conditions. Increased levels of mining resulting in poorer water quality, primarily through increases in specific conductance and associated dissolved chemical constituents_Mining had no measurable effect on habitat complexity or quality.”). Thus, relying on unique data and methodology the authors were able to conclude as follows:
Our results are similar to those of recent studies that have identified changes in water quality to be the dominant stres-sor in mined systems (Fulk et al. 2003, Freund and Petty 2007, Pond et al. 2008, Petty et al. 2010, Pond 2010). Increased specific conductance is consistently the dominant stressor in streams affected by mountaintop removal mining in southern West Virginia (Hartman et al. 2005, Merricks et al. 2007, Pond et al. 2008).... Furthermore, increased specific conductance is a consistently -imporr tant predictor of ecological condition in these systems .,. Our results corroborate those of numerous studies in which Ephemeroptera was identified as one of the most sensitive taxa to increases in ionic strength associated with large-scale surface mining in the Central Appalachian region.
Pis.’ Ex. 173 at JE0174.
This brings us to publication of EPA’s Benchmark. As evident by the foregoing discussion, by the time the EPA published the Benchmark, scientific literature on the subject was already well developed, and according to Dr. Palmer, had already established a likely relationship between conductivity and impairment in Appalachian streams impacted by alkaline mine drainage. Tr. 2 at 110, ECF No. 100. Nevertheless, studies examining the probable relationship between mining, high conductivity, and impairment continued to reach publication in peer-reviewed scientific journals.
In 2011, Dr. Bernhardt and Palmer published an article titled “The environmental costs of mountaintop mining valley fill operations for aquatic ecosystems of the Central Appalachians” in the Annals of the New York Academy of Sciences. Joint Ex. 1; Tr. 2 at 117, ECF No. 100. There, the authors concluded that there was a significant relationship between mining, activities and changes in the chemical composition of streams below mining.. Tr. 2 at 117,. ECF No,. 100. Such changes were strongly associated with biological impairment of those streams. Id. at 117. Particularly, the authors explained that “[a]ll available data show that it becomes increasingly unlikely to find an unimpaired aquatic benthic community as conductivity increases.” Pis.’ Ex. 173 at JE0010. Elaborating on the same point, the article goes on to say that:
Whether or not individual component ions within mining-derived runoff reach streamwater concentrations that are individually lethal or toxic to aquatic life, the cumulative effect of elevated concentrations of multiple contaminants is clearly associated with a substantial reduction in water quality and biological integrity in streams and rivers below mine sites. All research to date indicates that conductivity is a robust measure of the cumulative or additive impacts of the elevated concentrations of multiple chemical stressors from mine sites that lead to biological impairment of streams.
Pis.’ Ex. 173 at JE0014 (emphasis added).
Dr. Lindberg next joined Dr. Bernhardt on a paper published in the Proceedings of the National Academy of Sciences titled “Cumulative impacts of mountaintop mining on an Appalachian watershed.”
Still in 2011, another study authored by Dr. Pond reached publication in Hydro-biologia: “Biodiversity loss in Appalachian headwater streams (Kentucky, USA): Ple-coptera and Trichoptera communities.” Pis.’ Ex. 137. Here, Dr. Pond again documented the effects of mining and residential land use disturbances on macroinver-tebrates, particularly stonefly (Plecoptera) and caddisfly (Trichoptera) assemblages. Tr. 2 at 119, ECF No. 100. Dr. Pond found not only extirpation of these genera associated with mining disturbances, but further remarked that habitat factors could not explain the observed impacts. Tr. 2 at 119-20, ECF No. 100; Pis.’ Ex. 173 at PE1775-76 (“no habitat factors were significantly correlated with relative abundancé metrics”).
In 2013, Pond published yet another coauthored paper on the subject; this time in Environmental Monitoring and Assessment and titled “Calibration and validation of a regionally and seasonally stratified macroinvertebrate index for West Virginia wadeable streams.” Pis.’ Ex. 138. The article describes “the development, validation, and application of a geographically- and seasonally partitioned genus-level index of most probable stream status (GLIMPSS) for West Virginia wadeable streams.” Pis.’ Ex. 173 at PE1786. Importantly, the genus-level index developed therein proved to. be a more reliable predictor of stream quality than its family-level counterpart, WVSCI. Tr. 2 at 121, ECF No. 100 (“pointing out that a genus-level index is much more appropriate to use because the family-level index is not adequately sensitive [] because it lumps genera that have very different tolerance levels.”). These findings would suggest that WVDEP reporting of streams impaired according to WVSCI score is under-inclusive. Pis.’ Ex. 173 at PE1803 (“Overall, GLIMPSS rate more than twice the number of sites as ‘severely degraded’ compared with WVSCI across all strata.”).
Adding to the variety of methods used to address the question, Bernhardt et al., later published “How Many Mountains” in Environmental Science and Technology. Joint Ex. 2. In an attempt to further assess the extent of pollution resulting from mining, the authors “mapped surface mining from 1976 to 2005 for a 19,581 km2 ai'ea of southern West Virginia and linked these maps with water quality and biological data for 223 streams.” Pis.’ Ex. 173 at JE0020. In so doing, they observed that the amount of mining in an area was highly correlated to increased conductivity in area streams. Pis.’ Ex. 173 at JE0020 (“The extent of surface mining within catchments is highly correlated with the ionic strength and sulfate concentrations of receiving streams.”). Then, relying on generalized additive models, the authors identified the following thresholds at which amount of watershed mining, stream ionic strength, or sulfate concentrations render impairment likely: “We find this threshold is reached once surface coal mine occupy >5.4% of their contributing watershed area, ionic strength exceeds 308 |xS/cm or sulfate concentrations exceed 50 mg/ L -1.” Id. Though the authors relied on different data and a unique methodology, they nonetheless arrived at a conductivity threshold remarkably similar to the
In 2013, Dr. James' Kunz introduced yet another novel methodological approach and reached consistent results. Kunz et al., Use of Reconstituted Waters to Evaluate Effects of Elevated Major Ions Associated with Mountaintop Coal Mining on Freshwater Invertebrates, 32 Envtl. Toxicology & Chemistry 2826 (2013), Joint Ex. 10. The authors of this paper exposed selected freshwater organisms to mixtures with different ionic strengths in a laboratory in order to perform toxicity testing. Particularly of note, the ionic composition of some the reconstituted mixtures was representative of the ionic' composition of central Appalachian waters impacted by alkaline mine drainage from mountaintop removal and valley fills:
Two of the reconstituted waters had ionic compositions representative of alkaline mine drainage associated with mountaintop removal and valley fill-impacted streams (Winding Shoals and Bo-ardtree, with elevated Mg, Ca, K, SO4, HCOs), and a third reconstituted water had an ionic composition representative of neutralized mine drainage (Upper Dempsey, with elevated Na, K, SO4, and HCO3). The waters with similar conductivities but, with different ionic compositions had different effects on the test organisms. The Winding Shoals and Boardtree reconstituted waters were consistently toxic to‘the mussel, the amphipod, and the mayfly.
Pis.’ Ex. 173 at JE0152; Tr. 2 at 137, ECF No. 100. Furthermore, through laboratory analysis, Kunz et al., identified toxic impacts to Centroptilum, a mayfly, between 800 and 1,300 p,S/em; remarkably consistent with the Benchmark value of 1,092 |i,S/cm derived from statistical analysis of the WVDEP database. Joint Ex. 17 at D-3.
Next, the scientific literature returns to the authors of EPA’s Benchmark, Drs. Susan Cormier and Glenn Suter, and the subsequent publication of several component sections of the Benchmark in the peer-reviewed scientific journal Environmental Toxicology .and. Chemistry.
Through this assessment, the authors found that a 'mixture containing the ions [calcium, magnesium, bicarbonate, and sulfate], as measured by conductivity, is a common cause of extirpation of aquatic macroinvertebrates in Appalachia where surface coal mining is prevalent. The mixture of ions is implicated as the cause rather than any individual constituent of the mixture. The authors also expect that ionic concentrations sufficient to cause extirpations' would occur with a similar salt mixture containing predominantly [bicarbonate, sulfate, calcium, and magnesium] in other regions with naturally low conductivity.
Pis.’ Ex. 173 at JE0080.
Cormier and Suter independently published their confounding factor analysis as well.: Cormier & , Suter, A Method for Assessing the Potential for Confounding Applied to Ionic Strength in Central Appalachian Streams, 32(2) Envtl. Toxicology and Chemistry 288 (2013), Pis.’ Ex. 139. Using a weight-of-evidence approach,- that analysis considers twelve potential con-founders: habitat, organic enrichment, nu-ments, deposited sediments, pH, selenium, temperature, lack of headwaters, catchment area, settling ponds, dissolved oxygen, and metals. Pis.’ Ex. 173 at. JE0091. By,.adapting principles of epidemiology to the applied study of multivariate ecological field data, Cormier and Suter examine and methodically eliminate each potential con-founder. Particularly of. note, Cormier and Suter considered and rejected the idea that embeddedness or the presence of upstream ponds confounds the relationship between conductivity and impairment. Pis.’ Ex. 173 at JE0094 (“No evidence supported embeddedness as a factor”); Id. at JE0096 (“The weight of evidence for confounding from ponds is uniformly negative, so we conclude that the presence of ponds has little or .no effect on invertebrate response to conductivity.”).
More recently, in 2014, Drs. Pond, Margaret Passmore, Kelly Krock, and Jennifer Pulton — all with the EPA — along with Nancy Pointon, John Felbinger, Craig Walker,, and Whitney Nash — colleagues from the OSMRE — published a peer-reviewed scientific article in Environmental Management finding, among other conclusions, that the vast majority of streams adjacent to reclaimed mine sites with valley fills were still impaired eleven to thirty-three years after reclamation. Pond et al., Long-Term Impacts on Macroinver-tebrates Downstream of Reclaimed Mountaintop Mining Valley Fills in Central Appalachia, 54(4) Envtl. Mgmt. 919 (October 2014), Pis.’ Ex. 141, Tr. 2 at 129-33 (“Although these [valley fills] were constructed pursuant to permits and regulatory programs that have as their stated goals that (1) mined land be reclaimed and restored to its original use or a use of higher value, and (2) mining does not cause or contribute to violations of water quality standards, we found sustained eeo-
Furthermore, it is also of note that in selecting reference streams for this most recent study, Dr. Pond and his collaborators selected sites with comparable temperature and habitat regimes to the mined sites. Pis.’ Ex. 173 at PE1827 (“Local reference streams were sighted in close proximity (range .75 to 10.5 km) to paired [valley fills] ... and had similar catchment areas, forest types, and base geology”). That methodological approach 'had the effect of eliminating temperature and habitat scores as potential confounding factors. Pis.’ Ex. 173 at PE1886 (“Habitat can be a limiting factor, but by design, we removed significant habitat degradation factors by selecting sample reaches with relatively good habitat and intact riparian vegetation at reference and [valley fill] sites”). The experiment also relied on reference sites that “were not pristine, as their catchments frequently had poorly maintained roads and culverts, utility right-of-ways, gas wells, or underground mining that did not discharge to the watershed,” thereby further eliminating potentially confounding factors. Pis.’ Ex. 173 at PE1827.
On the basis of this outstanding collection of peer-reviewed ■ studies, the Court finds that the link between surface mining and biological impairment of downstream waters has been sufficiently — if not definir tively — established in the scientific literature. “There’s field, data. There’s lab data. There’s observational data. There’s field experimental data. There’s toxicity testing.” Tr. 2 at 141, EOF No. 100. Through -myriad lines of evidence, researchers have reached the same general causation conclusion, without a single peer-reviewed publication reporting contrary findings. In Dr. Palmer’s expert opinion, there is no remaining doubt on the question of general causation, leaving only surprise that researchers are continuing to study the question. Id. at 141 (“I would say there’s no doubt. What surprised me is that the studies continue to go on....' because it’s been so well-established.”).
III. Specific Causation
Having met the burden of establishing the general principle that high conductivity levels in streams, caused by alkaline mine drainage, lead to biological impairment, Plaintiffs’ must next establish by a preponderance of the evidence that high conductivity levels, caused by Defendant’s mine discharges, are causing or materially contributing to biological impairment in this particular instance.
A. Stream Conditions
As introduced above, this case concerns discharges from three surface mines operated by Defendant: (1) Fola Surface Mine
1. Fola Surface Mine No. 2 — Road Fork
Fola Surface Mine No. 2 is regulated under WV/NPDES Permit No. WV1013840 and West Virginia Surface Mining Permit S201293. Stipulation ¶¶ 6-7, ECF No. 53. Three valley fills at Mine No. 2 drain into Outlet 001, which discharges into Road Fork. Id. at ¶¶ 3-5, 10.
Prior to Defendant’s mining activities, Road Fork was without notable water quality issues. Collected in 1992 and 1993, pre-mining samples taken downstream from Outlet 001 and upstream from the confluence of Road Fork and Leatherwood Creek, showed conductivity levels ranging from 40 jiS/cm to 73 p,S/cm and sulfates ranging from 0.01 mg/1 to 30 mg/1. Stipulation, ECF No. 53, Pis.’ Ex. 73. In its 1994 Cumulative Hydrologic Impact Assessment (CHIA), WVDEP observed that “Road Fork does not appear heavily impacted by extensive past mining which has occurred in this area. This is indicated by low metals and sulfates that are less than 30 milligrams per liter.” Pis.’ Ex. 118 at PE1209-10.
Since Defendant’s mining activities began, Road Fork water quality has notably suffered. Monitoring in 2010 through 2012 at that same discharge point revealed conductivity levels ranging from 1,803 ixS/cm to 5,700 p,S/cm and sulfates ranging from 886-mg/L to 3,304 mg/L. In May 2014, the conductivity level of water discharged from Outlet 001 was 2,920 p,S/cm and the sulfate level was 1,900 mg/L. Pis.’ Exs. 2, 3. The following table provided by Dr. Palmer compiles the data collected at Fola Mine No. 4A discharges, including Broadtree Branch measurements as a comparison point for water chemistry characteristic of alkaline mine drainage. See Pis.’ Ex. 38.
[[Image here]]
Consistent with the observed decline in water quality, the aquatic community at Road Fork is biologically impaired. Between 2011 and 2014, Defendant reported
In addition to conducting water sampling, multiple experts also provided assessments of habitat at Road Fork. Dr. Swan reported a 163 RBP habitat score in Road Fork. Pis.’ Ex. 31. In contrast to Dr. Swan’s .optimal-range RBP score, Dr. Menzie also conducted or oversaw RBP assessments, The average RBP score taken by two Exponent staff members in October 2014 was 137.5, or suboptimal.
2, Fola Surface Mine No.4A — Right Fork
Fola Surface Miné No. 4A is regulated under WV/NPDES Permit No. WV1013815 and West Virginia Surface Mining Permit S200502. Stipulation ¶¶ 23-24,26, ECF No. .53. Plaintiffs' specified three outlets in their complaint — Outlets 22, 23, and 27 — each of which discharges into tributaries of Leatherwood Creek. M at ¶¶ 21-22,
Though there was some pre-Fola mining in the area, Right Fork was unimpaired and in fair shape prior to Defendant’s mining operations. In its 2003 CHIA, the WVDEP observed that though some sub-watersheds of Right Fork had elevated Manganese and sulfates related.to pre-Fola mining, the upper reaches of the watershed maintained low sulfates.- Pis.’ Ex. 89 at PE0574-75. Indicating high water quality, the report further noted that “all [monitoring] stations provide adequate habitat and contain populations of benthic macroinvertebrates. All the stations have high EPT indices.” Pis.’ Ex. 89 at PE 0577-78. The majority of pre-Fola water
Not surprisingly, the relatively good water quality at Right Fork contributed to unimpaired conditions. In 1997, WVDEP reported an excellent WVSCI score of 84 for Right Fork. Joint Ex. 23 at 68, Tr. 2 at 173, ECF No. 100. In 2000 and 2001, Fola’s consultant collected a number of biological surveys from seventeen different sampling locations. Among thirty-three samples from those seventeen sites, only six returned WVSCI scores below 68. Stipulation ¶¶22, 29-31, Tr. 2 at 177, ECF No. 100.
Since Defendant’s mining activities began, Right Fork water quality has notably suffered. Though jumps in conductivity had previously been rare, since 2001, conductivity levels in Right Fork have been almost entirely above 1,500 pS/cm, now with jumps up to and exceeding 2,500 pS/cm. Stipulation ¶ 33. . Similarly, since 2001, sulfate levels are consistently above 600 mg/1;. and sometimes as high as 1,200 mg/1. Stipulation ¶ 33.
Consistent with conditions in Right Fork itself, in 2011 and 2012, discharges from Outlets 022, 023, and 027 consistently ranged from 1,500 pS/cm to more than 3,000 pS/cm. Stipulation ¶ 32. In May and September 2014, conductivity from the three discharges ranged from 1820 to 2,958 pS/cm, with sulfate levels between 920 and 1,800 mgd. Pis.’ Ex. 2-5. The following table provided by Dr. Palmer compiles the data collected at Fola Mine No. 4A discharges, including Broadtree Branch measurements as a comparison point for water chemistry characteristic of alkaline mine drainage. See Pis.’ Ex. 49.
[[Image here]]
The noted decline in water quality has been accompanied by a decline in WVSCI scores. On May 9, 2014, Dr. Christopher Swan conducted field sampling downstream of Outlets 022, 023, and 027.
With respect to habitat, Dr. Swan calcu7 lated an RBP score of 172 at Right Fork, which is in the optimal range. Pis.’ Ex. 32; Tr. 2 at 185, ECF No. 100. The average RBP score taken by two Exponent staff members in October 2014 was 128, or suboptimal. Def.’s Ex. 138.
3. Fola Surface Mine No. 6 — Cogar Hollow
Fola Surface Mine No. 6 is regulated under WV/NPDES Permit ■ No. WV1018001 and West Virginia Surface Mining Permit S2011999. Stipulation ¶¶ 42-44, ECF No. 53. Three valley fills at Mine No. 6 drain into Outlets .013, 015, and 017, to be discharged to Cogar Hollow, a tributary of Leatherwood Creek. Stipulation ¶¶ 40-41; Tr. 2 at 187-88, ECF No. 100.
Before Defendant’s mining activities began, Cogar Hollow enjoyed healthy water quality. Measured at monitoring point S3-1A, conductivity levels were usually well below 300 pS/cm. Stipulation ¶'45. Sulfate levels were similarly low. Id. As it did with other Leatherwood Tributaries, here WVDEP again noted that “all [monitoring] stations provide adequate habitat and contain populations of benthic macro-invertebrates. All the stations have high EPT indices.” Pis.’ Ex. 89 at PE 0577-78.
Since Defendant’s mining activities be-gaq, Cogar Hollow water quality has diminished considerably, including extremely elevated conductivity and sulfate levels. Measurements taken since July 2012 consistently reveal conductivity levels ranging from 3,000 pS/cm to 5,000 pS/cm. Stipulation ¶ 47; Tr. 2 at 191-92, ECF No. 100. In May 2014, conductivity from the three discharges ranged from 2,910 pS/cm to 3,202 pS/cm and the sulfate level was 1,900 mg/1 to. 2,400 mg/1. Pis.’ Ex. 2-3, 5. The following table provided by Dr. Palmer compiles the. data, collected- at Fola Mine No. 4A discharges, including Broadtree Branch measurements- as a comparison point for water chemistry characteristic of alkaline mine drainage. Pis.’ Ex. 57.
[[Image here]]
On May 9, 2014, Dr. Christopher Swan conducted field sampling immediately downstream of Outlets 018, 015 and 017. Based on that sampling effort, Dr. Swan identified a WVSCI score of 41.81 and a GLIMPSS score of 20.03. Pis.’ Ex. 25; Tr. 2 at 194,' EOF No. 100. Both numbers clearly indicate impairment. Dr. Swan further conducted a RBP habitat assessment, scoring the stream at 145, or suboptimal. Pis.’ Ex. 30; Tr. 3 at 44, EOF No. 106. Similarly, the average RBP score taken by two Exponent staff members in October 2014 was 138.5, again, suboptimal. Defi’s Ex. 138.
On the basis of the foregoing site-specific evidence, the Court FINDS by a preponderance of the evidence that Road Fork, Right Fork, and Cogar Hollow are biologically impaired.
B. Legal Standard
The Court must find a violation here if Defendant’s discharges cause or materially contribute to a significant adverse impact to the chemical or biological components of aquatic ecosystems. W. Va.Code R. § 47-2-3.2.Í. Through requiring that a discharge “cause or materially contribute” to biological impairment, West Virginia law imposes something less stringent than traditional but-for causation. Unfortunately, available state law does not elaborate on what precisely that standard means as applied.
This Court has previously ruled that “[i]t is readily conceivable that multiple pollutants or stream characteristics might simultaneously materially contribute to impairment; and [a plaintiff] need only provide evidence showing it is more probable than not that ionic pollution as measured by conductivity is among some collection of material contributors.” Fola (Stillhouse),
This Court’s interpretation of the material contribution standard flows from the observation that, “as a matter of plain meaning and common sense, it is possible to identify a factor that is materially contributing to a given condition without conclusively eliminating contributions by additional factors in a dynamic system.” Fola (Stillhouse),
While something less than traditional but-for causation is required, Defendant argues that, if the word “material” is to have any import, “material contribution” must amount to something more than a contributing factor test. Def.’s Post-Trial Brief, ECF No. 116 at 27 (citing Artz v. Chicago, R.I. & P.R. Co.,
Thus, the Court continues to require that Plaintiffs demonstrate that it "is more probable than not that ionic pollution, measured as conductivity, is among some collection of material contributors to biological impairment. Conductivity levels in the streams at-issue need not be the sole cause of observed biological impairment, but must be a substantial contributor. This standard does not require scientific certainty,-but rather legal probability. Fola (Stillhouse),
C. Specific Causation Analysis
1. Defendant’s Motion for a Directed Verdict regarding Fola Surface Mine No, 4A
At the close of Plaintiffs’ ease in chief, Defendant moved for a. directed verdict as to claims involving Fola Surface Mine No. 4A, covered by WV/NPDES Permit No. WV1013815. Tr. 3 at 202, ECF No. 106. Defendant argues that it is entitled to a directed verdict regarding Fola Surface Mine No. 4A because Plaintiffs’ compliance evidence does not isolate the effects of the’ discharges from Outlets 022, 023, and 027.’ Id. Outlets 022 and 023 at Mine No. 4A discharge into Right Fork, a tributary of Leatherwood Creek. Outlet 027 at Mine No. 4A discharges into Cannel Coal Hollow, a small tributary of Leather-wood Creek. Plaintiffs’ consultant conducted biological sampling to determine a WVSCI score for Right Fork at a stream reach below the confluence of Cannel Coal Hollow. In, addition to Outlets 022, 023, and 027, some twelve additional outlets discharge into Right Fork or Cannel Coal Hollow upstream of Plaintiffs’ compliance sampling location. As a result, Plaintiffs’ sampling does not isolate the effect of discharges only from Outlets 022, 023, and 027. Tr. 2 at 199, ECF No. 100 (testimony of Dr. Palmer, agreeing that sampling did not isolate the effects of discharges only from Outlets 022, 023, and 027).
In answer to Defendant’s oral motion, Plaintiffs maintained that, under a material contribution standard, they presented sufficient evidence to show that high conductivity discharges from the three outlets are materially contributing to the impairment observed downstream in Right Fork. Tr. 3 at 203,- ECF No. 106. Plaintiffs report that the combined flow from Outlets
The Court disagrees. While “material contribution” does not require evidence that a single stressor independently causes impairment, it must require something more than what Plaintiffs have demonstrated here. First, the flow evidence offered by Plaintiffs is thin at best. It does not account for seasonal variations or for the variability of inputs from rainfall. Moreover, the Court heard no expert testimony to suggest that the some 600,000 gallons per day coming from the three outlets at issue materially contributes to impairment observed downstream after the introduction of nearly four million additional gallons from sources unknown, but including at least twelve other discharge outlets — each of which is a potential source of ionic pollution. Without supporting scientific testimony, the Court cannot connect the discharges at issue to the area where Plaintiffs performed compliance sampling. Doing so would be little more than nonscientific speculation. Thus, the Court cannot conclude that Plaintiffs have demonstrated that Outlets 022, 023, and 027 are materially contributing to downstream impairment.
Accordingly, the Court FINDS that Plaintiffs have not met their burden of proving that discharges from Outlets 022, 023, and 027 cause or materially contribute to biological impairment of Right Fork, and Defendant is therefore entitled to judgment as a matter of law regarding alleged violations of WV/NPDES Permit No. WV1013815, governing Fola Surface Mine No. 4A.
2. Liability with respect to Fola Surface Mine No. 2 and No. 6.
Having' found that both Road Fork and Cogar Hollow are biologically impaired, the Court finally turns to the question of whether Plaintiffs’ have met their burden in establishing that high conductivity discharges from Defendant’s Mine No. 2 and Mine No. 6 cause or materially contribute to impairment.
- Plaintiffs have sufficiently demonstrated that the discharges in question share the characteristic ionie composition identified and analyzed, by the EPA in its Benchmark. Dr. Palmer testified that the ionic composition of the discharges matches the characteristic ionic composition associated with alkaline mine drainage in the region (e.g., sulfates, bicarbonates, calcium, and magnesium). Stipulation ¶¶ 18, 36, 38, 48; Pis.’ Exs. 38, 49, 57; Tr. 2 at 93, 156-57, 181-84, and 192-93, ECF N6; 100. To demonstrate the similarity, Dr. Palmer compared water quality measurements at each discharge outlet to the composition of reconstituted water from Boardtree Branch — recognized, as representing the particular ionic composition of regional alkaline mine drainage. See supra Tables 4, 5, and 6; Kunz et al., Use of Reconstituted Waters to Evaluate Effect of Elevated Major Ions Associated with Mountaintop Coal Mining on Freshwater Invertebrates, 32 Envtl. Toxicology & Chemistry 2826 (2013), Joint Ex. 10. Testing the characteristically composed water,- Kunz et al., reported adverse effects to the mayfly Iso-nychia at conductivity levels of 1090
Additionally, Plaintiffs have unequivocally demonstrated that the conductivity levels at issue here are sufficiently high to cause the observed impairment. Discharges from Fola Surface Mine No. 2 and No. 6 consistently and grossly exceed the threshold identified by EPA at which it becomes more likely ..than not that a stream will suffer biological impairment. See Mancuso v. Consolidated Edison Co. of New York, Inc.,
Sampling at Eoad Fork and Cogar Hollow revealed changes' in the macroinver-tebrate community that are consistent with impairment caused by high conductivity levels. Multiple peer-reviewed scientific articles report finding that high conductivity associated with alkaline mine drainage leads to the extirpation of mayflies. Pis.’ Ex. 173 at PE1536, JE0010, PE1832; Tr. 2 at 105-07, 137, ECF No. 100. Here, consistent with those reported findings, sampling revealed the complete extirpation of mayflies. Tr. 3 at 162-64,.ECF No. 106 (noting that the complete extirpation of mayflies — organisms known to be particularly sensitive to conductivity — is strong evidence that conductivity is driving impairment); Joint Ex. 1 (“The clear patterns linking high conductivity to a loss of mayfly taxa has ecosystem-scale importance since mayflies often account for 25-50 percent of total macroinvertebrate, abundance in the least-disturbed Central Appalachian streams.”).
On the basis of the evidence presented, the Court FINDS that Plaintiffs have demonstrated the following by a preponderance of the'evidence: (1) Eoad Fork and Cogar Hollow are biologically impaired, as measured by WVSCI scores below the federally approved threshold score of 68; (2) the. discharges complained of share the same characteristic composition presently known in the'scientific community to cause or materially contribute to impairment in central Appalachian streams; (3) the conductivity levels observed at Defendant’s discharges are far in excess of the thresholds identified by the EPA and the available scientific literature known to cause stress to aquatic communities; (4) Defendant’s mining operations are the only land use that could have caused impairment; and (5) changes in the biological community particularly show the loss of conductivity-intolerant organisms. Thus, the Court FINDS that high conductivity discharges from Fola Surface Mine No. 2 and No. 6 are causing or materially contributing to the biological impairment of Eoad Fork and Cogar Hollow in violation, of Defendant’s current WV/NPDES permits.
These findings are consistent with WVDEP’s determination that ionic toxicity
Moreover, these findings are- consistent with the testimony of Dr. Menzie, Defendant’s only expert testifying on specific causation. Dr. Menzie offered consistent and repeated testimony that conductivity is one among a small collection of substantial causes of impairment. Tr. 3 at 240, ECF No, 106 (“[Conductivity is] going to play some role.”); Tr. 4 at 104, ECF No. 107 (“I include, conductivity as a stressor along with that”); Tr. 4 at 76, ECF No. 107 (listing precipitates, temperature, and conductivity as the drivers of impairment at the Leatherwood tributaries); Tr. 4 at 98-100, ECF No. 107 (explaining that conductivity is one of the factors causing impairment, but adding that he did not think it possible to allocate contribution more precisely). Though inconsistent with all other scientific evidence addressing thresholds, Dr. Menzie further maintained the opinion that conductivity levels in the range of 1,000 |xS/cm to 3,000 [xS/cm are capable of independently influencing WVSCI scores. Tr. 4 at 100,' ECF No. 107. Even assuming Dr. Menzie’s idiomatic threshold is correct despite all evidence to the contrary, this case concerns discharges with conductivity levels in that range and well above. Thus, WVDEP and all testifying experts in this case share the •opinion that conductivity levels are causing or materially contributing to biological impairment.
3. Defendant’s Rebuttal Arguments
Defendant advances a variety of arguments designed to defeat Plaintiffs’ theory of specific causation. These arguments question the reliability and credibility of Plaintiffs’ experts, the adequacy of Plaintiffs causation evidence, and the possibility of alternative drivers of impairment. Notwithstanding the fact that experts for both parties agree that, conductivity levels are matérially contributing to the observed biological impairment of Road Fork'and Co-gar Hollow, counsel continues to piecemeal attack Plaintiffs’ evidence on specific causation.
As stated by Defendant, with regard to specific causation, “[d]etermining which factors are material is the stuff of scientific investigation,' not assumption.” ECF No. 116 at 8. Here, Plaintiffs’ experts, Defendant’s expert, and the WVDEP all agree: high conductivity is materially contributing to biological impairment at Road Fork and Cogar Hollow. The Court will not displace these expert opinions based on independent scientific investigation in favor of lay assumptions. Accordingly, the Court declines to delve further into counsel’s evidence-based arguments; leaving only Defendant’s legal argument regarding the adequacy of notice of potential liability.
Contrary to Defendant’s assertion, in 2008, when Defendant’s permits were last renewed, Defendant had notice of potential liability should it cause or contribute to biological impairment. Defendant’s permits incorporate § 47-30-5.1.f., an unambiguous permit condition imposing liability should a permittee violate West Virginia’s narrative water quality standards. See OVEC v. Elk Run,
Not only did Defendant therefore have notice of potential liability provided in the plain language of its permits and each subsequent reissuance, but at the time of reissuance, Defendant further had the ben
IV. CONCLUSION
In sum, the Court FINDS that Plaintiffs have established, by a preponderance of the evidence, that the Defendant has committed at least one violation of its permits for Mine No. 2 and Mine No. 6 by discharging into Road Fork and Cogar Hollow high levels of ionic pollution, which have caused or materially contributed to a significant adverse impact to the' chemical and biological components of the applicable streams’ aquatic ecosystem, in violation of the narrative water quality standards that are incorporated into those permits.
The Court DIRECTS the Clerk to send a copy of this written Opinion and Order to counsel of record and any unrepresented parties.
Notes
, At the time of filing, the mining operations at issue here were each regulated under WV/ NPDES permits reissued in 2008. At that time, W.Va.Code- R." § 47-30-5.l.f read as quoted here. See Mem. Op. & Order 12-21, May 29, 2015, ECF No. 94. .
. The Court pauses here to reject- Defendant’s recurring and spurious mischaracterizations of prior holdings. It is the express language of Defendant’s federally approved WV/ NPDES permits as issued by the WVDEP that requires compliance with state narrative water quality standards as an enforceable permit condition. It is the. WVDEP as the-issuing .agency — not Congress and not,this Court— that incorporated narrative water quality standards as enforceable WV/NPDES permit conditions.
. Plaintiffs’ Amended Complaint also complains of discharges from Outlet 025 at Surface Mine No. 4A. ECF No. 39. However, Plaintiffs did not present any testimony regarding Outlet 025 during trial. Upon Defendant's oral motion for a directed verdict regarding Outlet 025, Plaintiffs’ counsel advised , the Court and Defendant that they had intended to drop any claims related to Outlet 025. Tr. 3 at 204, ECF No. 106, Accordingly, any . claims relating to-Outlet 025.were dismissed.
. Defendant expresses concern regarding potential bias related to one member of the Court’s staff in an early and lengthy footnote in its post-trial briefing. ECF No. 116 at 3 n. 3. Specifically, Defendant provides a survey of publically available electronic media detailing the employment and advocacy history of one of the Court’s term law clerks, Ms. McCrae. Id. Defendant refers to a denied recusal motion in a 2008 case, but does not expressly request anything from the Court on this occasion. Absent some motion, Defendant's concerns are not properly before the Court. The Court nevertheless offers the following comments.
"There is always some risk of bias; to constitute grounds for disqualification, the probability that a judge will decide a case on a basis other than the merits must be more than 'trivial.”’ United States v. DeTemple,
In 1948, Congress authorized district courts to employ law clerks, though it did so without elaborating on the specific duties of law clerks. 28 U.S.C. § 752. Law clerks are considered part of a judge’s personal staff. Code of Conduct for Judicial Employees § 310.30(a), "The proper role of the judicial adjunct, who in the federal setting may be defined as anyone who helps with the work of Article III courts but whose conditions of employment are not as prescribed in Article III, is to advise and assist the real judge. It is not to be the real judge, only called something else.” Geras v. Lafayette Display Fixtures, Inc.,
Here, Defendant appears to offer no complaint of bias on behalf of the Court, instead ■questioning potential biases of a subordinate staff-member, a term law clerk. Such concerns are trivial at best. At the risk of stating the obvious, a term law clerk is not a judge. A term law clerk performs tasks as delegated to
Even ignoring that obvious point, the Court further notes that whatever the specific content of individual statements, the surveyed conduct occurred over four years ago. See DeTemple,
Moreover, this Court has decided cases similar to the case at bar for nearly a decade. The decision here is consistent with existing case law. That consistent, existing case law predates riot only Ms. McCrae's service as a term clerk with this Court, but also predates - much of the conduct causing Defendant’s concern. All told, given the nature of term clerk employment, the remoteness of the conduct complained of, and the existence of earlier, consistent precedents, it is difficult to characterize Defendant’s concerns as anything.more than a trivial and tangential waste of judicial resources.
. See Dittrich-Bigley v. Gen-Probe, Inc., No. 11-1762,
. The West Virginia Streairi Condition Index, or WVSCI, is a multimetric index used to conduct biological assessments of stream conditions. The WVDEP relied on WVSCI.as a means to determine whether a violation of. the biological standard in subsection 3.2.i was occurring," such that á stream needed to be ’ listed as "impaired” under a Section 303(d) of the Clean Water Act. ”[T]he EPA — the final authority regarding whether a state’s narrative water quality are being violated for the purposes of Section 303(d) listing — recently '•made the specific finding that WVSCI scores below 68 ‘indicate that [the] waters [at and in which such scores were assessed] do not achieve the West Virginia narrative criteria as applied to the aquatic life uses' — defined by the EPA to mean the biological standards embodied in § 47-2-3.23 and -3.21.” Elk Run,
. The Court further notes that sub-parts of the Benchmark were later published in an esteemed peer-reviewed scientific journal. See Susan M. Cormier, Glenn W. Suter II & Lei Zheng, Derivation of a Benchmark for Freshwater Ionic Strength, 32 Envtl. Toxicology & Chemistry 263 (2013), Joint Ex. 3; Susan M. Cormier & Glenn W. Suter II, A Method for Assessing Causation of Field Exposure-Response Relationships, 32 Envtl. Toxicology & Chemistry 272 (2013), Pis.’ Ex. 140; Susan M. Cormier et ah, Assessing Causation of the Extirpation of Stream Macroinvertebrates by a Mixture of Ions, 32 Envtl. Toxicology & Chemistry 277 (2013), Joint Ex. 4; Susan M. Cormier et al., Relationship of Land Use and Elevated Ionic Strength in Appalachia Watersheds, 32 Envtl. Toxicology & Chemistry 296 (2013), Joint Ex. 6; Susan M. Cormier & Glenn W, Suter II, A Method for Deriving Water-Quality Benchmarks Using Field Data, 32 Envtl. Toxicology & Chemistry 255 (2013), Pis.’ Ex. 139; Glenn W. Suter II & Susan M. Cormier; A Method for Assessing the Potential for Confounding Applied to Ionic Strength in Central Appalachian Streams, 32 Envtl. Toxicology & Chemistry 288 (2013), Joint Ex. 5.
. Compare Dr. Garabrant, Tr. 1 at 49, ECF No. 105 (explaining his qualifications as a reviewer of the EPA Benchmark despite having no formal ecological training or experience as follows: “Well, I understand epidemiology, and I understand how to analyze data. I’ve spent my career doing that. I've spent my career doing divisional research, analyzing complex datasets, publishing papers in the peer-reviewed literature, and I know how to analyze data. And I do understand the principles of epidemiology. The EPA said they used epidemiology. They didn’t use it properly. They did not do what scientists agree they have to do to analyze data properly. So while I am not an ecologist, I do understand complex data, I do understand biostatistics, and I do understand
. Meaning that whatever their qualifications, no testifying expert held a graduate-level degree in both the natural sciences and epidemiology, nor did any testifying expert claim expertise in both fields. That said, testifying ecological experts, e.g,, Dr. Menzie, Dr. Palmer, and Dr. Baker, each have considerable expertise in statistical analysis of datasets ■within their respective disciplines. In contrast, Dr. Garabrant has no formal training related to aquatic ecology and testified to only reviewing literature on aquatic ecology and freshwater macroinvertebrates for purposes of this litigation.- Tr. 1 at 53-55, ECF No. 105.
. However, also note that "epidemiological studies are not necessarily required to prove causation, as long as the methodology employed by the expert in reaching his or her conclusion is sound.” Benedi v. McNeil-P.P.C., Inc.,
. For instance, Plaintiffs' expert, Dr. Wing opined that, in his assessment, "[Dr. Gara-brant’s] statistical analysis and his opinions about the Benchmark -are uninformed for the most part by a nuanced and thorough understanding of the topic of stream ecology, which would be necessary for a proper evaluation of that topic.” Tr. 2 at 41, ECF No. 100.
. Dr. Baker recalled a general explanation specifying use of a logarithmic scale offered elsewhere in the Benchmark, but he was not able to quickly find such a reference while on the stand. Even with the benefit of greater time and digital copy, the Court was similarly unable to find such a general reference.
. As explained by Dr. Garabrant, "[ijssues such as confounding and effect modification are universal issues in complex, multivariate datasets. Epidemiology has worked out approaches to correctly recognize and, when possible, adjust for and deal with those issues ____[T]hose issues are not unique to ecologic datasets. They run through ecology, psychology, economics, human health studies. Those are just characteristics of large multi-variable datasets.” Tr. 1 at 50, ECF No. 105. While these principles and concepts may be universally applicable, Dr. Garabrant nevertheless managed to handily demonstrate how readily and unabashedly ah analyst can misstep when applying them if he is unfamiliar with the subject matter under analysis.
Dr. Garabrant’s misstep gives pause precisely because it was the result of a simple mathematical exercise; one that required mastery of little more than most are taught in high-school. Presumably, it is not Dr. Gara-brant’s competency at graphing that failed him, but his ability to sensibly interpret and analyze ecological data. As cautioned by Dr. Wing, "the further one goes from one’s substantive knowledge, the more difficultly one would have even if the methods used, for example, in statistics were, the same." Tr. 2 . at 21, ECF No. 100.
. On cross-examination, Dr. Baker was asked why he did not recreate plots, just as Dr. Garabrant had done, and calculate measures of significance necessary to interpret the plots created by Dr. Garabrant. Tr. 4 at 236-38, ECF No. 107. Like Dr. Garabrant, Dr. Baker did have access to the dataset and the expertise necessary to repeat the analysis. Since he could have repeated Dr. Garabrant’s analysis to reveal error bars and since we find ourselves in an adversarial context, it is not unreasonable to ask why Dr. Baker did not do so.
That said, even in an adversarial context, it is arguably reasonable to expect that experts will proffer opinions in such a way that enables others to evaluate the significance and reliability of their work, As reminded by the current and past presidents of the International Epidemiological Association in the Forward to the Sixth Edition of the Dictionary of Epidemiology, “[t]he nature of science is not to reach consensus but to advance our knowledge by bringing conflicting ideas to critical examinations.” Consistent with this interest in enabling critical examinations, in the context of science conducted purely for sake of advancing knowledge, experts are expected to provide sufficient information to enable peer review. This expectation of transparency ensures integrity and allows for meaningful discussions of what is known or unknown, and to what degree. It is not unreasonable for a court to have similar minimum expectations in the service of reaching reliable final judgments.
Thus, the better question might have been why did Dr. Garabrant not include indicators of significance in the analysis he conducted and presented to this Court. One can only imagine what dimensions of bias may be introduced to expert findings when moved from the theoretically unbiased universe of Science to the purposefully adversarial universe of courts. Without making judgments as to motive, reasonable possible inferences could be as troubling as thinking that Dr. Garabrant did not do so because he did not understand the interpretive significance of such indicators — calling the competency of his analysis more generally into question — or that he did not do so precisely because he did understand the inteipretive significance — calling the credibility of his analysis more generally into question. Such musings could and should be rendered unnecessary merely by experts taking care to present findings to the Court with a degree of transparency that enables critical examination of conflicting ideas.
. Whatever conclusion is to be reached in light of the alleged obstacles to .interpreting the significance of the plots, Dr. Baker added that conclusions regarding pH would be immaterial because EPA controlled for any effect modification occurring at pH<6 by truncating the data accordingly.
Though criticized, truncating data and sample selection appear to be accepted methodological approaches to control for confounding. See e.g. Virginia Tech, Pis.’ Ex. 173 at PE 1692 ("This was accomplished by seeking study streams with attributes such as habitat quality that were as similar as possible to minimally-disturbed reference streams of the region. The design was intended to ensure that TDS, including its component ions, was the primary factor associated with biotic stress in these streams”); Dr. Wing, Tr. 2 at 32-33, ECF No. 100 (explaining the methodological validity of truncating data as a means of controlling for potential confounding effects):
. As previously noted by the Court, in developing the Benchmark, the EPA’s findings were subjected to review and comment by the Science Advisory Board. Fola (Stillhouse),
.In addition to this textbook definition, we might also keep in mind that, • "confounding, like many concepts-in science, is a basic concept, but it’s not a settled concept, meaning that there are disagreements between epidemiologists, even in textbooks, about exactly how to define ‘confounding.’ ” Dr. Wing, Tr. . 2 at 25, ECF No.; 100; see also Sharon Schwartz et al., Toward a Clarification of the Taxonomy of "Bias” in Epidemiology Textbooks, Epidemiology 26:2 (March 2015), Pis.’ Ex. 170 (“We identified and reviewed 28 textbooks, that met the inclusion criteria. Our review verified the norm of categorizing bias into confounding, selection bias, and information bias. All textbooks that included an or
. In contrast, Dr. Wing offered testimony that "there is not a set of rules for saying whether or not there's confounding.” Tr. 2 at 27, ECF No. 100. Moreover, "different epidemiologists have somewhat different definitions of ‘confounding’ ” such that "even if there were rules, they would have different rules.” Dr. Wing, Tr. 2 at 27-28, ECF No. 100. Ultimately, according to Dr. Wing, "[y]ou have to understand the substance of the topic” when conducting a causal analysis precisely because "[tjhere’s no generic analytic method which produces reliable, science.” Tr. 2 at 28, ECF No. 100. Here, it is perfectly reasonable to expect that the EPA possesses such expertise,
. Furthermore, the Court notes that EPA’s findings are corroborated by considerable peer-reviewed scientific literature. See infra Section II.C. Such studies, which will be reviewed below, reach corroborating conclusions regarding the causal relationship between ionic pollution and loss of freshwater macroinvertebrates despite reliance on unique methodologies and distinct datasets. Dr. Baker estimated that, across the published literature, roughly six to ten different statistical techniques have been relied upon to test that causal relationship. Tr. 3 at 123, ECF No. 106. Whatever technique is used, experts have consistently identified conductivity as the most likely cause of biological impairment. Similarly, ■ Dr. Baker estimated that at least five different methods had been used across the published' literature to analyze potential confounding factors. Id, at 124.' Again, whatever the method used, experts' have consistently ruled out potential confounders and identified conductivity as the most likely cause of biological impairment.
. Surprisingly, Dr. Garabrant did not similarly conclude that these perceived data deficiencies prevent meaningful analysis of potential effect modification. Nearly half of Dr. Garabrant's likely effect modifiers also appear in his table listing variables with insufficient data points. Compare Def.'s Ex. 38 and Def.’s Ex. 40. For example, we see that there is only one data entry for dissolved magnesium among a total 2,210 observations, yet Dr. Garabrant concluded that dissolved magnesium is a likely effect modifier. Similarly, Dr. Garabrant identified dissolved manganese as the most likely effect modifier, despite there being only twenty data points for dissolved manganese among a total 2,210 observations.
. Dr. Baker represented that the dataset includes over 1,000 data points for Total Magnesium, Total Calcium, and Total Manganese. No testimony was offered to suggest different effects based on whether substances are present in ionized or non-ionized forms.
. Notwithstanding observations about the amount of selenium data available, all experts agreed that the EPA Benchmark recognized that -given the limited available data, occurrence and effects of selenium should be investigated further.
. As explained directly by Dr. Baker, ''[t]he criteria the EPA was using here was to find any site with a mayfly. Now because the mayfly is an order, there are many families with different characteristics and also genera within those families, and we know that at least a few of them can tolerate exceptionally high conductivities. Therefore, it would not be surprising at all to find mayflies, a mayfly, at nearly all of the sites between 200 and 1,500 nS/cm." Tr. 4 at 203-04, ECF No. 107.
. It is worth noting that Plaintiffs are not complaining of .conductivity levels that mildly exceed EPA’s Benchmark of 300 p-S/cm, With few exceptions, the conditions at issue here exceed nearly all the XC95 threshold values from either state.
. Dr. Baker further noted that the particular taxa discussed by Dr. Garabrant "represent the most extreme -outliers, so they are an example of just taking the [worst] cases to make a comparison." Tr. 4 at 212, ECF No. 107.
.Dr. Baker offered testimony explaining that the XC95 values in both states were closely correlated even in the aggregate and in an analog scale. Tr. 4 at 227, ECF No. 107. ’
. In collaboration with colleagues, Dr. Men-zie developed a paper titled “A Cautionary Note About Deriving Causal Relationships in Water Quality Benchmarks from Field Observations Data:- A Case Study in West Virginia Headwaters.” That paper includes some of the same material, tables, and conclusions offered as testimony before this Court, on this and previous occasions. Tr. 4 at 89, ECF No. 107. Development for publication was supported and funded by Fola, Alpha, and Rio Tinto. Id. at 90-91. Dr, Menzie submitted the paper for publication to Environmental Science and Technology, a journal which has previously published Dr. Menzie’s work and the journal that published a series of articles by Cormier and Suter ostensibly comprising the EPA’s Benchmark. Id. at 89-90. "A Cautionary Note” was not accepted for publication.
. As relayed by Dr. Palmer, "[o]ne of the things we’re taught very early on as a scientist, that the strongest form of inference you can make is if you have multiple ways to reach the same conclusion; if yoú use multiple methods, different kinds of experiments, observations, and particularly if different people do this work so you can eliminate potential methodological differences, that individuals might impart." Tr. 2 at 96, ECF No. 100.
. Authors include Gregory J. Pond, Margaret E. Passmore, Frank A. Borsuk, Lou Reynolds, and Carole J. Rose.
. The EPA’s rapid bioassessment protocol is a multimetric tool for scoring habitat quality based on the aggregation of scores assigned to ten different variables. Total RBP scores are grouped into four different categories, or levels: optimal, suboptimal, marginal, and poor.
.“TDS stressors” would include the sort of ionic pollution alleged in this case. Similarly, an assessment of “TDS sensitivity” would include analysis of ionic pollution sensitivities.
. Additional authors include E.S. Bernhardt, W.H. Schlesinger, K.N, Eshleman, E, Foufou-la-Georgiou, M.S. Hendryx, A.D. Lemly, G.E. Likens, O.L. Loucks, M.E. PoWer, and P.R. Wilcock.
, The Proceedings of the National Academy of Science is regarded as a rigorously peer-reviewed scientific journal. Tr. 2 at 119.
. See Cormier et al., Derivation of a Benchmark for Freshwater Ionic Strength, supra note 5; Cormier et al., A Method for Assessing Causation, supra note 5; Cormier et al., Assessing Causation of the Extirpation of Stream Macroinvertebrates by a Mixture of Ions, supra ■ note 5; Cormier et al., Relationship of Land Use and Elevated Ionic Strength in Appalachia Watersheds, supra note 5; Cormier et al, A Method for Deriving Water-Quality Benchmarks Using Field Data, supra note 5; Suter et al., A Method for Assessing the Potential for Confounding Applied to Ionic Strength in Central Appalachian Streams, supra note 5.
. Of note, time order was the only factor that did not strongly support the -causal relationship. Instead, Cormier and Suter found that they lacked the necessary evidence to assess whether time order supported the causal relationship, and therefore scored time order as providing “no evidence” either way. Pis. Ex. 173 (explaining that the authors-"could not obtain conductivity and biological survey data collected before and after construction of a valley fill or release of ion-rich effluents from other sources. Hence, this : characteristic of causation is scored as no evidence.”). Though Cormier and Suter were
. See Sanne H. Knudsen, The Long-Term Tort: In Search of a New Causation Framework for Natural Resource Damages, 109 Nw. U.L.Rev. 475, 532 (Winter 2004) ("In the oil spill context,, proving general- causation would require showing that exposure to oil is capable of causing the alleged injury — e.g., disrupting the reproductive capacity of sea otters. Proving specific causation would require showing that sea otters were exposed to oil in doses capable of causing injury and that the oil came from the defendant’s release.”)
. Whatever critiques one may have of RBP habitat assessment protocol, presumably all can agree that the experience and qualifications of the person performing the habitat assessment are extremely relevant to determining the reliability of that assessment. Though he deliberately deviated from the WVDEP protocol for conducting an RBP habitat assessment, the Court has no doubt that Dr. Menzie possesses the necessary experience and qualifications to render his assessment reliable. However, the Court . heard nothing on the identity, experience, or qualifications of Exponent staff members that performed RBP habitat assessments at Dr. Menzie’s request. Accordingly! the Court hesitates to rely on such assessments,
. As will be discussed in" greater detail below, Dr. Swan’s sampling area was not imme
. Though not required for a finding of violation, the Court notes that the evidence presented is arguably sufficient to establish that though conductivity may not be the sole driver of impairment here, conductivity levels such as these áre likely capable of causing impairment even standing alone.
. Commenters on the' draft EIS included several representatives for the West Virginia Coal Association, as well as representatives on behalf of the Kentucky Coal Association and Ohio Coal Association. EPA, Public Comment Compendiumr Mountaintop Mining/Valley Fills in Appalachia Final Programmatic Environmental Impact Statement, Vol. 1 (Oct. 2005) (available at: http://www.epa, gov/region3/mtiitop/documents. htm# cd).
Given the prevalence of such commenters, it is difficult to imaginé that Fola could have remained uninformed-of the EPA’s-draft EIS. To the, contrary, it more likely suggests regional, industry-wide awareness.
