51 Fair Empl.Prac.Cas. 330,
Roberta OTTAVIANI, Individually and on Behalf of all other
persons similarly situated, Plaintiff-Appellant,
Carolee Schneemann, Joan Marie de la Cova, Dorothy Jessup,
Individually and on behalf of all other persons
similarly situated,
Plaintiffs-Intervenors-Appellants,
v.
STATE UNIVERSITY OF NEW YORK AT NEW PALTZ, and Clifton R.
Wharton, Jr., in his capacity as Chancellor of the
State University of New York,
Defendants-Appellees.
Harriet KLAPPER, Plaintiff-Intervenor-Appellee,
v.
STATE UNIVERSITY OF NEW YORK AT NEW PALTZ, Clifton R.
Wharton, Jr., Alice Chandler, Peter Vukasin, and
the Trustees of the State University of
New York, Defendants-Appellants.
No. 49, Docket 88-7159.
United States Court of Appeals,
Second Circuit.
Argued Oct. 17, 1988.
Decided May 9, 1989.
Eleanor Jackson Piel, New York City, for plaintiffs-appellants.
Judith T. Kramer, Asst. Atty. Gen., New York City (Robert Abrams, Atty. Gen. State of N.Y., Jan P. Ryan, Marilyn T. Trautfield, Asst. Attys. Gen., of counsel), for defendants-appellees.
Before VAN GRAAFEILAND, CARDAMONE and PIERCE, Circuit Judges.
PIERCE, Circuit Judge:
This is an appeal from a judgment of the United States District Court for the Southern District of New York, Kram, J., in which the court found in favor of defendants on all of the Title VII claims asserted by individual faculty members and a class of similarly situated plaintiffs, following a lengthy bench trial.1 The decision of the district court is published in a thorough and lengthy opinion at
BACKGROUND
This complicated Title VII suit was commenced by and on behalf of full-time, academic rank female faculty members at the State University of New York ("SUNY") at New Paltz ("the University") who were employed in the University's Division of Liberal Arts and Sciences at any time between academic years 1973 and 1984. The plaintiffs alleged that between 1973 and 1984, the University discriminated against female members of its faculty on the basis of gender in three separate categories: (1) placement in initial faculty rank at the University, (2) promotion into higher rank, and (3) salary. Judge Kram conducted a bench trial which extended over nine months on all of the plaintiffs' claims, and both parties presented extensive evidence to the court. For the sake of brevity, we will discuss only so much of the proceedings below as is relevant to our discussion of the key issues raised on appeal.
During the trial, the district court basically considered two types of evidence--objective statistical evidence and extensive "anecdotal" evidence. The statistical evidence presented by both sides consisted primarily of data produced by means of various "multiple regression analyses." Depending upon the party presenting the statistical evidence, the data was intended to either demonstrate or rebut the plaintiffs' claim of a pattern of ongoing discrimination against women within the University in all three of the contested categories.
A. The Statistical Evidence
Multiple regression analysis is a statistical tool commonly used by social scientists to determine the influence that various independent, predetermined factors (so-called "independent variables") have on an observed phenomenon (the so-called "dependent variable"). See Eastland v. Tennessee Valley Auth.
The first step in such a regression analysis is to specify all of the possible "legitimate" (i.e., nondiscriminatory) factors that are likely to significantly affect the dependent variable and which could account for disparities in the treatment of male and female employees. See Sobel,
In this case, the parties' statistical experts each determined what factors they thought were relevant to the setting of salaries and rank at the University, and used those factors as independent variables in their multiple regression analyses. By accounting for all of the "legitimate" factors that could affect salary and rank in general, the plaintiffs hoped to prove that there was a net "residual" difference or disparity between the predicted and actual salaries and rank of female faculty members that could only be attributed to ongoing gender discrimination within the University. Conversely, the defendants sought to attribute observed disparities in the pay and rank of male versus female faculty members to "legitimate" factors such as unequal job qualifications.
1. Plaintiffs' Proof of Salary Discrimination
a. Plaintiffs' Main Salary Study
The plaintiffs' main salary study was contained in Trial Exhibit 882 and purported to demonstrate the difference in salaries between male and female faculty members at New Paltz. According to the plaintiffs' statistical expert, Dr. Mary Gray, women actually earned from $1,036 to $2,277 less than their predicted salaries in each year of the class period. The defendants challenged these findings on several grounds, but principally attacked the plaintiffs' study for its failure to include certain independent variables which the defendants claimed were influential in the setting of faculty salaries at the University.
The plaintiffs' main salary study incorporated the following independent variables: (1) number of years of full time teaching experience prior to hire at New Paltz; (2) number of years' teaching experience in academic rank at New Paltz; (3) possession of a doctorate degree; (4) number of years since obtaining the doctorate degree; (5) number of publications; (6) other experience prior to hire at New Paltz; and (7) years of full-time high school teaching experience. The plaintiffs' statistical expert, however, did not include academic rank variables in her main salary study such as prior rank, current rank, and years in current rank. Although Dr. Gray conceded that these three factors may influence salary decisions, she maintained that academic rank itself was subject to discrimination at New Paltz, and that the use of rank variables would therefore be inappropriate.
In connection with this assertion, the plaintiffs attempted to demonstrate that female faculty members were placed in lower academic ranks at New Paltz than their male counterparts, and promoted more slowly into higher academic ranks than their male counterparts, solely because of their gender.3 The defendants' statistical expert, Dr. Judith Stoikov, responded by attempting to prove that rank at New Paltz was not discriminatory. After considering all the evidence as to rank, the district court rejected plaintiffs' proof as "unpersuasive," and concluded that plaintiffs had "failed to prove that rank at New Paltz was discriminatory." Ottaviani,
The district court's rejection of plaintiffs' claims as to discrimination in rank at New Paltz had two important consequences for the plaintiffs' case. First, the court's ruling eliminated two of the contested categories of discrimination at New Paltz, and left the salary discrimination claim as plaintiffs' only remaining Title VII claim. Second, and equally important from the plaintiffs' perspective, the court's ruling "validated" academic rank as one of the legitimate factors to consider in accounting for salary disparities between male and female faculty members. Since the court considered the academic rank of faculty members to be a legitimate influencing factor on faculty salaries at New Paltz, and since the plaintiffs' main salary study failed to include academic rank variables, the court found the plaintiffs' principal study to be fundamentally flawed and less probative of discrimination than it otherwise might have been.
b. Plaintiffs' Other Salary Studies
Apart from their main salary study, the plaintiffs had also performed salary regressions which did include rank variables. Since these other studies did include what the court considered to be most of the relevant legitimate factors which could influence salary at New Paltz, the court accordingly looked primarily to these studies to determine whether the plaintiffs had made out a prima facie case of gender discrimination.
After considering and weighing all the evidence presented, the district court reached certain conclusions with respect to both the plaintiffs' and the defendants' statistical evidence. While the district judge found some of the plaintiffs' statistical evidence "persuasive," she thought that it was insufficient to establish a prima facie case of gender discrimination. On the other hand, the district judge did not believe that defendants' statistical evidence was sufficient to rebut the plaintiffs' discrimination claims altogether. Since the judge found the statistical evidence to be inconclusive one way or the other, she ruled that whether or not the plaintiffs could prevail on their discrimination claims would depend upon whether the totality of the evidence adduced at trial supported a finding of discrimination. Accordingly, the district judge next considered whether the extensive anecdotal evidence proffered by plaintiffs supported their claims of discrimination.
B. The Anecdotal Evidence
The anecdotal evidence at trial consisted of various narrative descriptions of events at the University which the plaintiffs contended illustrated or proved that the University had discriminated against its female faculty members. Specifically, the plaintiff class members sought to establish that: (1) the University did not have a viable affirmative action program; (2) New Paltz's methods for identifying and correcting existing salary inequities from 1973 to 1984 were either flawed or non-existent; (3) the University either retrenched or eliminated faculty positions to the detriment of its female faculty members; and (4) the University demonstrated a disdain for women's issues through its handling of the Women's Studies Program at New Paltz. Eleven witnesses also testified about individual instances of alleged salary discrimination at New Paltz, which the plaintiffs contended were illustrative of the administration's policies toward women as a whole.4
On rebuttal, the defendants sought to negate the plaintiffs' claims through the specific testimony of University administrators and faculty members, and other types of anecdotal evidence. The defendants contended that such evidence demonstrated that there were nondiscriminatory reasons for all of the actions taken by the University during the period in question which negatively affected its female faculty members, and that none of the employment practices at issue were motivated by discriminatory animus.
After reviewing the anecdotal evidence, the district judge held that the plaintiffs had not proven their Title VII claims against the University. Although she found that the anecdotal evidence supported an inference of prima facie discrimination in a few of the individual class members' cases, in each of those cases she either accepted the defendants' explanations for the pay disparities, or found that the isolated incidents of discrimination were insufficient to support the class' claim of a pattern or practice of gender discrimination. Accordingly, the district court entered judgment in favor of defendants on all of the Title VII claims.
On appeal, the appellants contend that the district court erred in its treatment and analysis of the evidence in several key respects, and that as a result, the court's finding of no discrimination was erroneous. First, appellants challenge the district court's determination that the statistical evidence was inconclusive. Appellants contend that the statistical evidence adduced at trial was more than sufficient to establish a prima facie case of gender discrimination as a matter of law. Moreover, they also contend that the district judge's decision to allow allegedly "tainted" variables such as "rank" to be used in the multiple regression analyses minimized the overall impact of defendants' alleged discriminatory treatment of female faculty members, and resulted in weaker statistical proof. Appellants also take issue with the district court's rejection of the proffered anecdotal evidence of discrimination. Finally, appellants contend that the district court erroneously excluded or ignored evidence of pre-Title VII discrimination, in contravention of the Supreme Court's decision in Bazemore v. Friday,
For the reasons that follow, we hold that Judge Kram did not clearly err in finding in favor of the defendants, and we affirm the decision of the district court.
DISCUSSION
We begin by noting that the district court correctly stated the familiar legal standards to be applied in Title VII cases. Since the plaintiff class herein had raised a "disparate treatment" claim under Title VII, the claimants bore the burden of not only establishing discriminatory intent on the part of SUNY administrators, but that "unlawful discrimination [was] a regular procedure or policy followed by [the University]." International Bhd. of Teamsters v. United States,
The Supreme Court has established a three-step process for evaluating disparate treatment claims brought pursuant to Title VII. McDonnell Douglas Corp. v. Green,
Even though the burden of production shifts to the defendants during the second stage of the process, the ultimate burden of persuasion rests always with the plaintiffs to prove their claims of discrimination. See United States Postal Serv. Bd. of Governors v. Aikens,
A. Significance of Plaintiffs' Statistical Evidence
At trial, the plaintiffs herein contended that the statistical evidence alone was sufficient to establish a prima facie case of discrimination. According to plaintiffs, female faculty members were clearly treated less favorably than their male counterparts, and that unfavorable, disparate treatment was due solely to gender bias. The district court, however, found that the plaintiffs' statistical evidence was not "statistically significant" enough to establish a prima facie case of discrimination. For the reasons that follow, we conclude that the district court did not clearly err in ruling that the plaintiffs' proffered statistical evidence was not dispositive of their Title VII claims.
As discussed earlier, plaintiffs in a disparate treatment case frequently rely on statistical evidence to establish that there is a disparity between the predicted and actual treatment of employees who are members of a disadvantaged group, and to argue that such disparities exist because of an unlawful bias directed against those employees. Not all disparities, however, are probative of discrimination. Before a deviation from a predicted outcome can be considered probative, the deviation must be "statistically significant."
Statistical significance is a measure of the probability that a disparity is simply due to chance, rather than any other identifiable factor. See Segar,
One unit of measurement used to express the probability that an observed result is merely a random deviation from a predicted result is the "standard deviation." D. Baldus & J. Cole, supra, Sec. 2.224, at 63; id. Sec. 9.03, at 294-95; see also Berger,
Cognizant of the important role that statistics play in disparate treatment cases, the Supreme Court has held that "[w]here gross statistical disparities can be shown, they alone may in a proper case constitute prima facie proof of a pattern or practice of discrimination." Hazelwood School Dist. v. United States,
In the present case, the three salary studies which the district court considered most probative of a pattern or practice of discrimination produced a range of standard deviations between approximately one and five, and of the total thirty-three standard deviation measures cited, twenty-four exceeded two standard deviations.7 Significantly, however, nine of the measures cited fell below two standard deviations. Also, the negative residuals associated with being female were not significant in every year of the liability period.
Given the range of standard deviations associated with their salary regressions, the plaintiffs contended that the statistical evidence clearly gave rise to a presumption of discrimination. As discussed earlier herein, however, although the district judge found the studies to be "persuasive," she nevertheless held that these levels of "statistical significance" alone were "not sufficiently high to support a prima facie claim of salary discrimination." Ottaviani,
On appeal, appellants argue inter alia that, as a matter of law, a finding of two standard deviations should be equated with a prima facie case of discrimination. According to appellants, the district court therefore erred in finding that they had not met their burden of establishing a prima facie case. In support of this argument, appellants point out that several courts have accepted two standard deviations as prima facie proof of discrimination. See, e.g., Berger,
It is certainly true that a finding of two to three standard deviations can be highly probative of discriminatory treatment. See Segar,
Moreover, as a practical matter, the issue of whether or not two or more standard deviations establish a prima facie case of discrimination is not pivotal in this case. It is well-established that "once a Title VII case has been 'fully tried on the merits,' the question whether the plaintiff has established a prima facie case 'is no longer relevant.' " Mitchell v. Baldrige,
At the close of the plaintiffs' case-in-chief, the defendants moved to dismiss the action specifically on the grounds that plaintiffs had not met their burden of establishing a prima facie case. The court denied the motion without explanation, however, and directed the defendants to proceed with their case. Later, at the close of defendants' rebuttal case, the court also allowed the plaintiffs to respond to the defendants' rebuttal evidence. Accordingly, the question remaining for the district court at the close of the entire trial was whether the plaintiffs should ultimately prevail on their Title VII claims. See Bazemore,
The net import of Judge Kram's rulings regarding the significance of plaintiffs' statistical evidence is that she found the evidence to be "persuasive" but not dispositive. Contrary to appellants' assertions, it is clear from the district judge's rulings that she did not simply ignore the statistical evidence of discrimination presented by plaintiffs. The court found this evidence sufficient to cause her to deny the defendants' motion to dismiss at the end of plaintiffs' case, and to accept rebuttal evidence from the defendants. On rebuttal, however, the defendants were able to successfully undermine the plaintiffs' case by attacking the validity of the plaintiffs' statistical evidence, and by introducing statistical evidence of their own to negate the inference of discrimination that had been raised. Cf. Berger,
Specifically, the defendants criticized the plaintiffs' most probative studies for excluding one factor which they claimed exerted a "highly significant positive influence[ ] on current salary," namely, whether a faculty member had held a prior, full-time administrative position at SUNY New Paltz before returning to full-time teaching. The defendants also criticized these studies because the salary regressions were "fitted" only to male faculty members, i.e., they used independent variables that were derived only from the male population. The district court noted in its opinion that a "males only regression" based exclusively on values existing only in the male population might have tended to overestimate the predicted salaries of certain female faculty members, because it might not have taken into account legitimate factors existing solely in the female population which could have affected the rate of pay for women teachers at the University. See Ottaviani,
Recent Supreme Court precedent has made it clear that this court can reverse such a factual determination "only if it is clearly erroneous in light of all the evidence in the record or if it rests on legal error." Palmer,
B. Use of Rank Variables
In conjunction with their attack on the district court's assessment of the sufficiency of plaintiffs' statistical evidence, appellants also challenge the district court's determination that "rank" was an appropriate factor to consider in assessing pay disparities between male and female faculty members. According to appellants, if the court had rejected the rank variables and considered only those salary studies which excluded rank, then the number of standard deviations associated with their findings of discrimination would have been much greater, and their statistical proof would have been even more probative.
Although we recognize that the use of rank variables in testing for salary discrimination against women faculty members is not universally accepted, see Finkelstein, The Judicial Reception of Multiple Regression Studies in Race and Sex Discrimination Cases, 80 Colum.L.Rev. 737, 741-42 (1980); D. Baldus & J. Cole, supra, Sec. 8.23, at 113-14 (Supp.1987), in Sobel v. Yeshiva University, this court specifically upheld the use of rank variables in a multiple regression analysis, stating that rank could be used as a legitimate factor in explaining pay disparities so long as rank itself was clearly not tainted by discrimination,
At trial, the plaintiffs failed to adduce any significant statistical evidence of discrimination as to rank. As the district court stated in its opinion, the plaintiffs' studies of rank, rank at hire, and waiting time for promotion "were mere compilations of data" which neither accounted for important factors relevant to assignment of rank and promotion, "nor demonstrated that observed differences were statistically significant." Ottaviani,
C. Anecdotal Evidence
Appellants also contend on appeal that the district court did not give sufficient weight to the anecdotal evidence adduced at trial, and that the court should have rejected the explanations proffered by University administrators to explain pay and rank inequities as "pretextual." Our review of the anecdotal evidence, however, is limited to ascertaining whether the district judge committed clear error in making her findings. See Anderson v. City of Bessemer,
In this case, the district court found that the defendants had successfully rebutted the plaintiffs' anecdotal proof, and that, in any event, the anecdotal evidence on its face was too limited to prove class-wide discrimination. After reviewing the entire record, we do not think that the court's decision to credit the testimony of the defendants rather than that of the plaintiffs was clearly erroneous. Since the district court's "account of the evidence is plausible in light of the record viewed in its entirety," we may not overturn the findings of the court even if we might "have weighed the evidence differently," had we been sitting as the trier of fact. Anderson,
D. Bazemore Claim
In Bazemore v. Friday,
On appeal, appellants contend that the district court erroneously excluded evidence of pre-Act discrimination in violation of the Supreme Court's dictates in Bazemore. In particular, appellants claim that the district judge improperly excluded Exhibit 990, which purported to document statistically significant evidence of discrimination as to initial faculty rank. This claim is without merit, however. At trial, the defendants objected to the admission of Exhibit 990 not because it was offered to prove pre-Act discrimination, but because it was unreliable and incomplete. While the weakness of statistical evidence should not ordinarily preclude its admission, see Bazemore,
Moreover, we note that appellant's reliance on this court's decision in Sobel v. Yeshiva University as support for their more generalized, Bazemore -type claims is misplaced. In Sobel, the plaintiffs introduced evidence specifically designed to prove that women were discriminated against prior to the effective date of Title VII, and argued that "Yeshiva had a legal obligation to equalize women's salaries immediately upon application of Title VII to universities."
CONCLUSION
In sum, the burden of persuasion was on the plaintiffs to prove by a preponderance of the evidence that there was a pattern or practice of discrimination at SUNY New Paltz, and they failed to meet that burden. We have considered all of the arguments presented on appeal, and find them to be without merit. For the reasons stated above, the judgment of the district court is affirmed.
Notes
The district court ruled in favor of plaintiff-intervenor Harriet Klapper on her Equal Pay Act claim. That decision is not challenged on appeal by defendants, and appellants rely on the favorable ruling only insofar as it supports their class-wide claims of discrimination
Another way in which statisticians can measure the influence of gender on a particular employment decision is by using gender as one of the independent variables in a regression analysis. For each independent variable in a multiple regression analysis, the statistician calculates a coefficient, which is a measure of the effect that the variable has on the dependent variable being examined. If the regression coefficient for gender is sufficiently large, then it is probative of the impact that gender has on the employment decision at issue. D. Baldus & J. Cole, supra, Secs. 8.01 to 8.02, at 240-45; see, e.g., Segar,
There are four types of "academic rank" at New Paltz: (1) professor, (2) associate professor, (3) assistant professor, and (4) instructor. Faculty members in one of these academic ranks either hold tenure or are on a "tenure track."
The plaintiffs presented testimonial evidence of discrimination against: three New Paltz faculty members who were neither class members nor individual plaintiffs (i.e., Nancy Schniedewind, Susan Puretz, and Sheila Schwartz); four individual members of the class who had not brought individual Title VII claims (i.e., Susan Lehrer, Barbara Scott, Johanna Sayre, and Samantha Joe Mullen), and four members of the class who had asserted individual Title VII claims as well (i.e., Roberta Ottaviani, Dorothy Jessup, Joan Marie de la Cova, and Carolee Schneemann)
The Supreme Court's recent decision in Price Waterhouse v. Hopkins, --- U.S. ----,
The commentators are careful to point out, however, that no matter how great the number of standard deviations is, statistical tests can never entirely rule out the possibility that chance caused the disparity. See Palmer,
As discussed supra, two standard deviations corresponds roughly to a 1 in 20 chance that the outcome is a random fluctuation. Three standard deviations corresponds to approximately a 1 in 384 chance of randomness. Finally, a range of four to five standard deviations corresponds to a probability range of 1 chance in 15,786 to 1 chance in 1,742,160. M. Abramowitz & I. Steigan, Handbook of Mathematical Functions, National Bureau of Standards, U.S. Government Printing Office, Applied Mathematics Series No. 55 (1966) (Tables 26.1, 26.2)
