Opinion for the Court filed by Circuit Judge RANDOLPH.
Edison Electric Institute and organizations representing corporate and municipal dischargers brought these consolidated petitions for review, claiming that certain of EPA’s “whole effluent toxicity” or ‘WET” test methods were invalid. The tests are set forth in rules promulgated pursuant to the Clean Water Act, 33 U.S.C. § 1251 et seq. (the “Act”). The Act prohibits the discharge of pollutants except in compliance with individual permits issued by EPA or the states. States prescribe their own water quality criteria, which EPA reviews for conformity with the Act. Water quality standards typically consist of two complementary parts: numerical limits on the allowable concentration of particular pollutants in ambient water (e.g., “no more mercury than 5 parts per billion”), and a descriptive, “narrative” criterion regarding the entire effluent (e.g., “no toxic pollutants in toxic amounts”). See 33 U.S.C. § 1251(a)(3). WET tests are used to measure compliance with standards of the latter type.
While the numerical restrictions comprise the backbone of the permitting system, EPA has found that, standing alone, these limits are not sufficient Effluents may contain many different pollutants. Even if no single pollutant were present in a harmful amount, the mix of different pollutants still might have negative effects upon aquatic organisms. In light of the myriad potential interactions among various pollutants, traditional instrumental tests are ill-suited to making the determination. Instead, laboratories expose aquatic organisms to samples of the ef
This approach has an appealing simplicity, but the use of living specimens introduces a significant potential for variability between and within tests. In designing and refining the WET test methods, EPA sought to minimize the effect of organic idiosyncracy by taking experimental and statistical precautions. The crux of petitioners’ complaint is that EPA has not gone far enough. We disagree, and therefore deny the petitions for review.
I.
These WET test methods were first implemented in 1995. 60 Fed. Reg. 53,529 (Oct. 16, 1995). Petitioners brought an action challenging them, as a result of which the WET tests were modified pursuant to a settlement of the action, after which EPA repromulgated them in 2002. 67 Fed. Reg. 69,952 (Nov. 19, 2002) (“Final Rule”). 1 It is this most recent version of the tests that we now review.
A.
Petitioners’ primary concern is that EPA did not adhere to its usual criteria and procedures for ensuring the scientific validity of the test methods. 2 These criteria include accuracy, precision, practical applicability, establishment of detection limits, and the minimization of external interference. See EPA, Availability, Adequacy, and Comparability of Testing Procedures for the Analysis of Pollutants Established Under Section S0k(h) of the Federal Water Pollution Control Act 3-2 to 3-5 (Sept. 1988) (“Report to Congress”). 3 While EPA concedes that its WET tests do not incorporate every one of these factors, the real question is whether EPA adequately accounted for any departures. We find that it did.
EPA explained at length, both in its response to public comments and in the Final Rule, that there are two major distinctions between WET tests and most
EPA admits that accuracy, in its technical rather than colloquial sense, is inapplicable to WET testing, but it does not follow that the tests are therefore “inaccurate.” Accuracy is a composite of two distinct characteristics: “precision” and “bias.” The former measures the variation among the results of multiple tests of the same sample; the latter describes any systemic and persistent deviation of the average value of a test method from an accepted “true value.” Final Rule,
While conceding the inapplicability of bias, EPA stated in the rulemaking that its WET test methods satisfy precision
Petitioners’ analysis of this data does not convince us that EPA’s action was “arbitrary, capricious, an abuse of discretion, or otherwise not in accordance with law.” 5 U.S.C. § 706(2)(A). And this is not just because of the deference we give to EPA when it evaluates “scientific data within its technical expertise.”
City of Waukesha v. EPA
Another of petitioners’ central contentions is that the WET test methods produce an unacceptably high number of false positives. EPA’s test design had contemplated a positive error rate of no more than five percent, and as low as one percent in certain instances; this understanding was reflected in the 1998 Settlement Agreement. Petitioners allege false positive rates between 12.5% and 56%, Reply Brief at 27, while EPA, again analyzing the same data, finds an overall false positive rate of 1.8%, with no individual test’s rate exceeding 5%.
See
Final Rule,
This is certainly a problem for which EPA’s system must account. It is not, however, a problem of false- positives. What petitioners describe relates to precision, which we already have discussed. Multiple measurements will exhibit some degree of variation, yielding an error band that extends above and below some intermediate value. This is the case with chemical-specific instrumental tests and, indeed, with virtually every water quality test EPA uses.
See
40 C.F.R. pt. 136. Furthermore, petitioners neglect to mention that just as some permittees who “should be” in compliance may be deemed
It is worth pausing here before we examine petitioners’ other attacks on the WET test methods. There is an important distinction between the validity of a test method and the validity of a particular result from the test when it is used to determine compliance with permit conditions. Even by EPA’s calculations, WET tests will be wrong some of the time, which is why EPA warned against using a single test result to institute an action for a civil penalty.
See
Petitioners’ next objection is to EPA’s failure to establish detection limits for WET test methods. The public commenters raised this point and EPA explicitly addressed it in promulgating its Final Rule.
Petitioners also assert that EPA failed to demonstrate the availability and applicability of WET testing — that is, the ability of laboratories across the nation to conduct WET testing properly and consistently. One of the main purposes of the Interlaboratory Study was to ensure that a wide range of laboratories could implement the prescribed test methods without introducing an undue degree of variability or error. More than 90% of laboratories were able to complete the ratified tests in accordance with all mandatory procedures, with success rates reaching 100% for several tests. Final Rule,
Another important test characteristic is “representativeness,” that is, the ability of test results to predict instream effects accurately. Petitioners claim that EPA failed to establish the presence of such correlations for several of the WET tests, particularly with regard to Western state waters, which differ chemically from their Eastern counterparts. EPA responds by pointing to the results of numerous studies on this subject conducted throughout the 1990s. These studies support the representativeness of the WET test methods in general, and several demonstrate representativeness with regard to particular Western waters.
See, e.g.,
EPA,
A Review of Single Species Toxicity Tests: Are the Tests Reliable Predictors of Aquatic Ecosystem Community Responses?
47-50 (July 1999). It is unrealistic in the extreme to require correlation studies on every stream in the nation. EPA took the sensible approach of relying on sampling techniques to draw general conclusions, while leaving some implementation details to local entities.
See Am. Iron & Steel Inst. v. EPA
The role of state permitting authorities also should allay the concern, which petitioners express, that the correlation between laboratory toxicity and instream impacts grows weaker at lower levels of toxicity. Before implementing a test method, EPA must establish that the measured characteristic bears a rational relationship to real-world conditions; the available studies reasonably support such a conclusion with regard to chronic toxicity. EPA, Technical Support Document for Water Quality-Based Toxics Control 8 (Mar. 1991) (finding likelihood that data may be explained by randomness, rather than actual correlation, to be 0.1%). Petitioners are worried that they might be subject to excessive restrictions; such limits, however, would be imposed by local authorities, and are not part of the rule-making under review in this case. The WET test methods offer only a means of measuring compliance with those limits— individual dischargers remain free to challenge their permits, on a case-by-ease basis, if they believe that local authorities are regulating at a level that poses only a minimal risk to aquatic life. See 40 C.F.R. §§ 124.19, 124.21.
The ratified WET tests are not without their flaws. But perfection is not the standard against which we judge agency action.
WorldCom, Inc. v. FCC,
II.
American Petroleum Institute (“API”) seeks to intervene for the purpose of challenging EPA’s
failure
to ratify for use in the Pacific Ocean three WET test methods that measure acute toxicity. “An intervening party may join issue only on a matter that has been brought before the court by another party.”
III. Bell Tel. Co. v. FCC,
III.
For the reasons set forth above, having considered and rejected petitioners’ other
Notes
. Petitioners object to four of the ten test procedures described in the 2002 Final Rule: the Fathead Minnow Larval Growth Test Method 1000.0, the Fathead Minnow Embryo-larval Teratogenicity Test Method 1001.0,
Ceriodaphnia dubia
(water flea) Reproduction Test Method 1002.0, and Green Alga Growth Test Method 1003.0.
See
. Petitioners suggest, without supporting authority, that because the test results will be used as evidence in enforcement proceedings, EPA's rulemaking had to comply with the standard for scientific evidence articulated in FED. R. EVID. 702, as interpreted in
Daubert v. Merrell Dow Pharmaceuticals,
.This Report was an internal study on various testing methods, undertaken at Congress's express behest. Pub.L. No. 100-4, § 518(a), 101 Stat. 7, 86-87 (1987). The Report itself nowhere contemplates being anything but a "study.” It is not strictly binding upon EPA and any deviation from the Report is not per se arbitrary and capricious. Cf. Report to Congress at 3-2 ("In most cases, no single [test] method will contain all of the desirable characteristics.”).
. The preferred metric for assessing precision is the coefficient of variation (CV), which measures the extent to which multiple measurements tend to depart from their average value. The greater the CV, the less precise the measurement. By computing the CV using toxicity- units (TUcs) rather than the percentages originally recorded by EPA, petitioners arrive at a grossly inflated result. For example, analyzing reference toxicant data, Interlaboratory Study at 81-82 tbl. 9.8, EPA's approach yields a CV of approximately 0.43— well within the range of EPA’s other approved tests, Memorandum from Marion Kelly, EPA Engineering and Analysis Division 1 (Oct. 16, 2002) (CVs of approved chemical methods range from 0.03 to 0.64, and CVs of organic methods from 0.12 to 1.04). Petitioners’ approach, however, using the distorting TUC scale, results in a CV of 1.47 — -more than triple the correct value.
. One page of petitioners’ opening brief contains what purports to be a constitutional argument — that if a particular WET test indicates toxicity, this will constitute an irrebutta-ble presumption of petitioners' guilt in violation of the Due Process Clause. As we stated in the text, we are concerned here only with test methodology, not results of particular tests in the field. Our decision does not endorse the validity of any test result in the future, nor does it foreclose a defense that the result is wrong. Those issues are simply not presented in this judicial review of rulemak-ing. Furthermore, when the Supreme Court has recognized the constitutional dimensions of presumptions, it has done so solely with regard to statutory classifications, which tended to have strong equal protection components as well.
See, e.g., Weinberger v. Salfi,
