This case is centrally about the responsibilities of a trial court to ensure that its determinations based on expert testimony are founded upon reliable, scientifically trustworthy procedures. The issue arises here in the context of our review of a trial court’s determination concerning the weight rather than the admissibility of evidence. Libas, Ltd. (Libas), a fabric importer, challenges the determination of a trial court that the United States Customs Service (Customs) properly classified certain fabric imported by Libas as power-loomed rather than as hand-loomed and therefore subject to a higher duty rate and an entry quota. Libas argues that the trial court erred in holding, first, that Customs has the legal authority to make the determination by testing the fabric and
I.
Libas imports fabric from India through the Port of Los Angeles. In August 1994, it imported 32 bales of rolled cotton fabric which had been certified by the Indian Government as hand-loomed, and entered this fabric as “certified hand-loomed” under Subheading 5208.42.1000, Harmonized Tariff Schedules of the United States (HTSUS). In September 1994, Customs demanded redelivery of the fabric and subjected it to a new test developed by the Los Angeles Customs Laboratory, namely, the “Methodology for the [A]nalysis of Woven Fabric to Determine whether Fabric had been Power-loomed or Hand-loomed” (the Customs test). In November 1994, on the basis of this test, Customs reclassified the fabric as power-loomed, Subheading 5208.42.4000, HTSUS.
Under the Customs test, fabrics are classified as hand-loomed or power-loomed based on characteristics which are supposed to result from different means of manufacture. Woven fabric of any kind is made by running horizontal “weft” or “woof” yarns through a set of vertical “warp” yarns with a shuttle; patterns in the fabric are crеated by lifting or lowering selected warp yarns at each pass or “pick” of the shuttle. Use of a special kind of shuttle called a “fly-shuttle” can increase the speed at which a pick is “thrown” or completed. Fly-shuttles are in common use in India and can be either hand-thrown or machine-powered. The same kinds of yarns can be used with both processes.
The Customs test is premised on the idea that, because weavers cannot regulate their movements with the precision of a machine, hand-loоmed fabrics exhibit less uniformity, evenness and consistency than machine-loomed fabrics. The fabric at issue exhibited two particular characteristics upon testing which were central in Customs’ classification of them as power-loomed. First, Customs found a weft tension defect of only one-and-one half inches, associated, according to Customs, with the small variations found in machine weaving as against the greater weft tension defects in hand weaving, where weft tension is unregulated. Second, Customs found an area where the thread ran out, which Customs concluded was characteristic of machine weaving because it assumed that hand weavers, being close to their work, would notice that the thread had ended and would therefore reweave the fabric to remove the defect.
Classifying the fabric as power-loomed was a matter of some import to Libas. Power-loomed fabrics are subject under HTSUS to nearly double the duty rate for hand-loomed fabrics and, unlike hand-loomed fabrics, are subject to a quotа and require a visa for entry. Libas protested in a timely way, but in December 1994, Customs denied the protest on the ground that the laboratory report of the test showed that the fabric “exhibitfed] characteristics of machine-made fabric.” Fourteen additional shipments of similar fabric by Libas were held up by this determination at the time of the filing of this lawsuit in January 1996. At a bench trial held in May 1996, the Court of International Trade heard evidence regarding the production process of the fabric both from experts who testified about vаrious aspects of the Customs test and from a witness who claimed personal knowledge of the process of manufacture. The trial court found against Libas in an opinion issued in October 1996. Libas then filed this appeal.
II.
Libas denies that Customs has the statutory authority to unilaterally reclassify the fabric based on the test because
Libas, however, argues that this determination was erroneous because statutes must be interpreted as consistent with subsequent international agreements. At the least, Libas argues, later international agreements control over prior statutes if there is a direct conflict. HTSUS became effective January 1,1989.
See
Pub.L. 100-418. Libas maintains that Customs has no authority to reclassify fabric which has been certified as hand-loomed by the Indian Government in view of the Agreement Relating to Trade in Textiles and Textile Products, Feb. 6, 1987, U.S.-India (the Bilateral Agreement), and the later Amendment to that Agreement, Dec. 21, 1989,
To win this argument, Libas would have to show that Note 4, read together with the Bilateral Agreement and its Amendment, clearly and unambiguously mandates that if Customs disagrees with the Indian Gоvernment’s certification of some fabric, the only recourse Customs has is consultation with the Indian authorities under Paragraph 21 of the Bilateral Agreement. But the most that Libas in fact argues is that the language will bear this reading. Libas says that Paragraph 21 of the Bilateral Agreement “can be interpreted” as a blueprint for Customs to follow. Customs, Libas urges, “could have” sought consultation with the Indian Government instead of unilaterally reclassifying the fabric as power-loomed. The language of Paragraph 21 of the Bilaterаl Agreement is concededly permissive. It does not, even if read together with Note 4, clearly and unambiguously mandate that the United States must consult with the Indian authorities if it disagrees with their certification of imported fabric instead of unilaterally reclassifying the fabric. There is no requirement of consultation, only a right to consult. 1 Because the Bilateral Agreement, as amended, can be read consistently with Note 4 to grant Customs the authority it claims, we have no occasion to consider whether the subsequently amended Agreement trumps the statute. The language of the statute and of the Agreement clearly states that Customs has the required authority. The trial court, therefore, did not err in determining that Customs has the required authority.
We now address the Court of International Trade’s determination that the fabric was power-loomed rather than hand-loomed. Legal determinations made by that court are reviewed
de novo,
while its factual findings are reviewed for clear error.
See United States v. Hitachi America, Ltd.,
Libas argues, first, that the trial court wrongly ignored the direct testimony from personal knowledge of one of Libas’ main witnesses, Mr. S. Ponnuswamy of J.L.C. International, thе Indian exporter of the fabric. He testified at trial that the fabric came from Kovur, a village in India about 200 miles from Madras, which he visits once a month and where he deals with three master weavers. He stated that there are no power looms in or near Ko-vur. He also provided videotapes of the Kovur operation. The United States attempted to impeach his testimony by cross-examination, but did not present rebuttal witnesses.
Libas is aware that we accord the credibility determinations of a trial court “great deference,”
Refac Int’l, Ltd. v. Lotus Dev. Corp.,
Libas also argues that the trial court clearly erred in relying on expert testimony about the Customs test while ignoring the only direct testimony from personal knowledge presented. Libas cites
Trans-Orient Marine Corp. v. Star Trading and Marine, Inc.,
rv.
Libas argues that the trial court erred in holding that the fabric was power-loomed because the methodology involved in the Customs test was not shown to be reliable or properly validated by scientific or other appropriate technical means. And, very importantly, the results of this test provided the sole basis for the trial court’s conclusion about the source of the fabric. The Supreme Court has held that
Daubert and Kumho were decided in the context of determining standards for the admissibility of expert testimony under the Federal Rules of Evidence, which are not at issue here. 2 We agree with Libas, however, that the proposition for which they stand, that expert testimony must be reliable, goes to the weight that evidеnce is to be accorded as well as to its admissibility. Neither the plain language of the relevant Supreme Court opinions nor the underlying principles requiring reliability for expert testimony are narrowly confined in application to questions of admissibility. The difference between weight and admissibility, moreover, is in many instances a close question. See Heidi Li Feldman, Science and Uncertainty in Mass Exposure Litigation, 74 Texas L.Rev. 1, 3 n. 10 (1995) [hereinafter Feldman, Science and Uncertainty ] (criticizing a focus on admissibility in treatments of Daubert).
In any event, if a trial court relies upon expert testimony, it should determine that the expert testimony is reliable. It would make littlе sense to say that a trial court in its fact-finding role should accord much if any weight to expert testimony, the reliability of which is not established.
See Perreira v. Dep’t of Health and Human Serv.,
When the issue of reliability is raised, it is a key consideration in determining the weight to be given to expert testimony.
See United States v. Velasquez,
As Professor Feldman has argued, attention to these considerations brings the law into line with “scientific standards for respectable science.” Feldman,
Science and Uncertainty,
There is no iron law that the Daubert factors be applied in Customs classification cases. The Court of International Trade obviously need not use them in every case, or even in most such cases. These factors are primarily applicable when the question involves a technical process where the reliability of a scientific or technical methodology has been raised as an issue. There may, of course, be other factors which may in a given case be more relevant than the Daubert considerations. The trial court has broad discretion in these matters, but, before turning from Daubert altogether, the court should assure itself that it has effectively addressed the important issue of reliability when it has been raised in an appropriate case.
In this case, the Court of International Trade did not ascertain whether, or explain why, the Customs test was reliable according to appropriate standards when the plaintiff made a timely challenge to its reliability. Although Libas did not raise a Daubert attack as such against the Customs test, since admissibility was not at issue, it did clearly present the question of reliability to the trial court. Libas introduced expert testimony that skilled hand weavers can produce results similar to machine weaving and that washing reduces any discrepancies between the results of the two production processes, raising questions about whether the Customs test would accurately distinguish between them. As noted below, Libas also raised grave questions about the basis of Customs’ confidence in the test. Here, where Libas effectively challenged the reliability of the Customs procedure, the trial court should have examined the Customs test either with a Daubert-style analysis or in some other equally searching way.
The trial court, however, reasoned that because two witnesses had testified that the tests Customs used were “widely accepted in the textile industry and ha[d] been used for years,” therеfore, “Customs’ methodology appear[ed] to be reasonably calculated to determine whether the fabric is loomed by machine or by hand.” The trial court cited two witnesses to show that the test was “generally accepted.” Professor Emeritus Mary Jane Leland of California State University, Long Beach, stated that the physical description of the fabric reported in the Customs analysis was generally accurate, but did not say or imply that the Customs test was widely accepted. Mary Carillo, Textile Analyst at the Customs Laboratory, one of the people who devised the test and applied it to the fabric here, did say that several elements of the test are “standard textile tests,” but she did not clearly state that they were standard ways of distinguishing whether fabric is power-loomed as opposed to determining some other feature of the fabric. For example, a test might show whether fabric was cotton rather than some other material, since the composition of the fabric in dispute was one of the things that Carillo found by testing. To assess a test’s reliability, it is necessary to know what it tests. Further, even if the test was a test for the source of the fabric rather than for something else, a test or methodology might consist of standard elements without itself being standard. 3
Granting for the sake of argument, however, that there was some testimony that the Customs test was standard in a relevant sense, the fact that a test is widely accepted is not merely for that reason “reasonably calculated” to determine definitively whether fabric was hand-loomed or power-loomed. While. “[widespread acceptance can. be an important factor” in an assessment of reliability,
Daubert,
Here, particular reason for doubt does arise. The only evidence in the record directly relating to the validation of the Customs test was Carillo’s testimony on cross examination that the reliability of the test had not been established by the obvious and natural method of double-blind testing. That would involve running the Customs test on fabric, the source of which was known in some other way, perhaps by direct observation, and determining whether testers who themselves had no knowledge of whether test samples were hand-loomed or power-loomed could reliably distinguish power-loomed from hand-loomed fabric within a respectable rate of error. Testing a methodology in this manner would satisfy two of the Daubert factors, verification and known error rate, and for this reason would enhance confidence in the reliability of the test.
Double-blind testing is not required for reliability although it is indicated, especially where it is possible, not terribly burdensome, and reasonably inexpensive. Still, there may be other indicia оf reliability, and the Court of International Trade may rely on methodologies which have not been double-blind tested for making Customs tariff classifications as long as there is some other good reason to have confidence in their results or if double-blind testing is impracticable. Customs’ confidence in the test here, Carillo testified, was based on: (1) some non-double-blind testing against reference samples where the “right” result was known to the tester, (2) the textile literature from which elements of the test were derived and (3) the persоnal experience of five analysts, one of whom is a weaver. As Libas insists, non-double-
The Daubert test is certainly not a rigid formula. But it is significant that the Customs test fails to satisfy any of its factors except-possibly-general acceptance, and it offers no other assurances of reliability based on factors not mentioned in Daubert to make up for those defects. Satisfaction of a Daubert analysis is not, as we have indicated, demanded in every classification case. Here, however, the Customs test was the only basis for the trial court’s determination. This test was the sort of evidence to which a Daubert-type analysis would apply. And the importer raised a serious challenge to the reliability of the test at trial. Without a more persuasive showing of reliability, therefore, it wаs clearly erroneous for the trial court to credit the testimony of Customs’ expert witnesses that the test could distinguish between power-loomed and hand-loomed fabric.
We are in no position to determine the proper classification of the fabric at issue here. Whether the fabric was power-loomed or hand-loomed is a factual determination for the trial court. However, the determination must be made on the basis of reliable expert testimony, if expert testimony of a scientific or tеchnical sort is the appropriate way to decide the matter. Although the Customs test may very well in fact be reliable, its reliability has not been established and the record is insufficient as it stands to allow a court to make a determination of its reliability with any confidence. While we might not normally allow Customs a second evidentiary opportunity, it is appropriate to vacate and remand for further findings because the trial court was not on notice that in circumstances like those presented here, it was obliged to make an assessment of reliability based on the sort of analysis we have described. Further evidentiary hearings are probably called for. Therefore, “[t]he original classification having been tested and found wanting on our review, the case can now be remanded to the Court of International Trade to find a correct answer, whether previously claimed or not claimed by the importer. If need be, a further remand to the Customs Service for consideration by it is also permissible.... ”
Simod America Corp. v. United States,
Thus, we AFFIRM-IN-PART, VACATE-IN-PART, and REMAND.
COSTS
Each party shall bear its own costs.
Notes
. Contrary to what Libas suggests, the Indian Government still has a right to consultation about this matter. The record does not show whether it has availed itself of this right.
. Libas did not challenge the admissibility of the Customs test, but the reliability issue was not waived. Under the Tariff Act of 1930, an importer objects to a Customs tariff classification by filing a protest with Customs, see 19 U.S.C. § 1514; if this is denied, jurisdiction lies with the Court of International Trade, see 28 U.S.C. § 1581. According to the procedure of that court, Customs "shall file ... as part of the official record, any document, paper, information or data relating to the entry of merchandise and the administrative determination that is the subject of the protest or petition.” Id. § 2635(a) (emphasis added). The test related to Customs' determination at issue here and so was filed as part of the record. Libas could not, therefore, attack its admissibility and its only recourse was to argue against the weight accorded to the test. Libas’ argument at trial against the reliability of the test was sufficient to rebut the statutory presumption of correctness accorded to Customs classifications.
. The same defects vitiate the testimony of Désirée Koslin, an expert witness for the government whose testimony was not cited by the trial court, but who testified that her own methodology and the Customs test "overlap[ped]” and were "standard.”
