In 1993, Myrtle Thomas, the only black Customer Service Representative in Eastman Kodak's Wellesley, Massachusetts office, was laid off. Thomas responded with a race discrimination suit against Kodak under Title VII, 42 U.S.C. §~ 2000e to e-17, arguing that Kodak's layoff decision was discriminatory because it resulted from a ranking process that relied on racially biased performance appraisals prepared in 1990, 1991, and 1992. Kodak made two arguments in its motion for summary judgment: first, that Thomas's claim was time-barred because the performance appraisals were conducted outside of Title VII's statutory limitation period, and second, that Thomas failed in any event to present enough evidence of racial animus to support a disparate treatment claim. The district court disagreed with the first point but agreed with Kodak's second argument and granted summary judgment. Both issues are before us on appeal.
We hold that the date of the notice of the layoff is the date from which the limitations period ran. Because the evaluations caused no concrete harm until the layoff, we reject the employer's argument that, instead of the layoff date, the date of the evaluations should be used as the start of the limitations period. Because we also find that she has presented enough evidence to support her claim that the performance appraisals were racially biased, we reverse the district court's grant of summary judgment and remand for further proceedings.
After thirty-five years of litigation under Title VII, cases can still present new wrinkles. This is one such case. Because it raises a number of important issues-some new and some familiar but difficult-we preview the key holdings.
First, when an employee claims that a layoff is discriminatory and the employer utilizes scores from past performance appraisals in an objective formula to determine who will be laid off, the limitations period runs from the date of the notice of layoff where the laid off employee has suffered no concrete earlier harm from the appraisals.
Second, once there is sufficient evidence to create a material issue of fact that the employer's articulated reason for an adverse employment action is a pretext, there is no requirement that a plaintiff always produce direct evidence to demonstrate that the real reason was discriminatory.
Third, Title Vii's prohibition against "disparate treatment because of race" extends both to employer acts based on conscious racial animus and to employer decisions that are based on stereotyped thinking or other forms of less conscious bias.
Fourth, under the McDonnell Douglas/Burdine framework, a court may not enter summary judgment for an employer based upon a non-discriminatory reason not articulated by the employer but identified sua sponte by the district court.
I
In reviewing a grant of summary judgment, we consider the facts in the light most favorable to the nonmoving party, drawing all reasonable inferences in that party's favor. See Aponte Matos v. Toledo Davila,
Thomas was a long-term Kodak employee. She first began working for the company in 1974. In 1980, after working for six years in clerical and administrative positions in Kodak’s Rochester, New York facility, she was promoted to Customer Support Representative (“CSR”) within the Office Imaging Division and transferred to Kodak’s offiсe in Wellesley, Massachusetts.
Along with five other CSRs working out of the Wellesley office, Thomas supported customers in an assigned territory who owned Kodak copiers and other Kodak equipment. She helped salespeople perform installations, trained customers in the use and maintenance of Kodak equipment, facilitated communication between customers and sales and service personnel, and provided other forms of marketing support.
Thomas generally performed her job well. Kodak managers who supervised Thomas during her first ten years as a CSR in the Wellesley office reported variously that they were never dissatisfied with her performance, that they were “delighted” with Thomas, that her work was “excellent” and “far superior” to that of some of the other CSRs, that she was “very much on top of things,” and that she was “the perfect support person.”
Co-workers and customers expressed similar sentiments. A sales representative who worked with Thomas sent a memorandum to Thomas’s supervisor praising her “continuous professionalism,” “very high level of commitment,” and “total dedication.” The sales representative later noted that he was particularly impressed with the way a certain customer “really went out of his way” to emphasize his satisfaction with Thomas’s support. Another customer who contacted Kodak after Thomas’s layoff described Thomas as “an irreplaceable part of the Kodak team” and explained that Thomas was the primary reason for his selection of Kodak copiers over copiers from other companies. The customer concluded: “In my many contacts with company representatives, I have not met anyone of the class and caliber of Myrtle Thomas.”
Because of her high level of performance, Thomas received awards and bonuses from Kodak. In 1989, the company also changed Thomas’s grade from K4 to K6, which resulted in a salary increase. According to Kodak’s job description, the K6 grade was limited to CSRs who were “making an outstanding contribution to the support activities,” defined as “servicing the largest and/or most sensitive accounts, developing and giving individualized presentations and/or demonstrations, and training new CSRs.” Thomas received at least eight other salary increases during her years as a CSR. At the time of the 1993 layoff, Thomas was the fourth most senior CSR in the Wellesley office and earned the third highest salary.
The Kodak compensation plan stated that “[t]he company’s goal for [its] pay program is to reward each individual’s job performance appropriately” and to ensure that “[p]eople with higher performance will, over time, be paid more than average performers.” The company made use of annual performance appraisals in order to reach this goal. According to the compensation plan, performance appraisals were also used for a number of other purposes, including:
A. Evaluating and documenting the performance of each individual in comparison with performance expectations for the job.
B. Providing individuals with constructive feedback.
C. Identifying the guidance and training that can help individuals be as successful as their ability permits.
[and]
D. Determining who should be promoted, transferred, demoted, terminated, laid off, and re-employed.
*44 To conduct an appraisal, the supervisor who directed an employee’s day-to-day activities filled out an appraisal form. The form contained a section pеrtaining to “basic performance measures,” which included categories for quality of results, quantity of results, job skills, and teamwork. Another section pertained to “additional performance measures,” including dependability, versatility, communications, and leadership. The form required the supervisor to give the employee a rating from 1 to 7 for each applicable category, as well as an overall rating. 1 A separate section of the appraisal form contained space for the supervisor’s comments on the evaluated categories, a description of any developmental opportunities, a description of matters discussed during the post-appraisal interview with the employee, and both supervisor and employee signatures. Each appraisal was also reviewed and signed by the appraiser’s supervisor. The appraiser and the appraiser’s supervisor were together responsible for the appraisal’s accuracy, consistency, and conformance to company policy, including a policy favoring fair and objective evaluations, conducted “without regard to non-job-related criteria such as[ ] race.”
Kodak intended the appraisal scores to be on a curve, company-wide. The compensation plan suggested that “performance appraisal ratings for large groups of people (approximately 100 or more) [should] average around the middle of the rating scale,” but acknowledged that factors such as the amount of turnover and the percentage of long-term employees could influence the distribution of ratings in any given group, particularly groups with a small number of employees. The compensation plan suggested that appraisers attempt to validate their distribution of appraisal ratings by rank-ordering employees in the same grade and job category. After the rank ordering was complete, supervisors would finalize the preliminary ratings to agree with the rank-order results. However, these rank orderings were not discussed with employees. Supervisors were told that “[a]p-praisals verified through this process [should be] communicated by referencing performance relative to job expectations with no reference to the rank-order process.”
Thomas’s appraisals for 1988 and 1989 show that she was performing at a high level. In 1988, when she was responsible for more than 400 machines, Thomas received seven 5 ratings and one 6 rating, for an overall rating of 5, which was a rating “appropriate for individuals who not only achieve results and meet all expectations on a regular basis, but who go beyond these requirements from time to time.” Her supervisor’s comments were uniformly positive. In 1989, when she was responsible for more than 500 machines, Thomas received even higher ratings — five 6s and three 5s, for an overall rating of 6, which meant that her “[pjerformance consistently exceeded] the requirements of the position and [was] characterized by unusual initiative, resourcefulness, and creativity.” Once again, the supervisor’s comments were uniformly positive.
In 1989, Kodak created the position of Customer Support Manager throughout its Office Imaging organization. Thomas asked to be considered for the Wellesley position, but was told that she was not qualified. Instead, the position went to Claire Flannery, a former CSR who had been working as a division secretary. As Customer Support Manager, Flannery was responsible for supervising the six Wellеs-ley CSRs. She remained in this position until 1993, when both she and Thomas *45 were laid off as part of a company-wide reduction in force.
The appointment of Flannery as Customer Support Manager marked a significant downturn in Thomas’s fortunes at Kodak. Although Thomas states that she and Flannery “were on a professional basis,” and Flannery denies having any problems- with Thomas’s job performance, it appears that their working relationship was strained. Thomas alleges that Flan-nery treated her differently from the other five CSRs, all of whom were white 2 (as were, in fact, all of the other CSRs during Thomas’s thirteen years in the Wellesley office).
She describes a number of occasions on which she claims Flannery unnecessarily damaged her professional standing with customers. For example, on the only occasion on which Flannery accompanied Thomas to a customer training session in order to observe Thomas’s work, Flannery instead took over the session and conducted the entire training herself. On another occasion, Flannery told Thomas the wrong time for a training session Flannery had scheduled on site, and then refused to write to the customer, who was upset, to explain why Thomas had been several hours late. (After Thomas proved to Flan-nery what had occurred by playing back Flannery’s voice mail message to her, Flannery did agree to call — but not to write — the customer.) On a third occasion, Flannery became quite angry and attempted physically to block Thomas from leaving a CSR meeting which had been scheduled at the same time as an important training session for one of Thomas’s customers.
Thomas also provides evidence from which the inference can be drawn that Flannery did not evaluate her skills fairly or give her appropriate opportunities for growth and success. For example, Flan-nеry did not travel with Thomas as she did with the other CSRs in order to observe Thomas’s interactions with customers. Flannery did not give Thomas the same type of developmental opportunities available to other CSRs. She criticized Thomas for lack of computer skills, but then failed to train Thomas when computer equipment became available. After Thomas was asked to prepare a presentation for a Kodak meeting, Flannery told her that there would not be time for her presentation, although time was found for a presentation by a white employee on a less pressing topic. Thomas was denied the opportunity to apply for sales jobs, despite her good track record in sales support activities. Flannery expressly discouraged her from applying for a management position in another division, telling her she was not qualified because she did not yet have a master’s degree (Thomas was studying for one at the time), even though none of the other managers in that position had a master’s degree or even a bachelor’s degree. Finally, Flannery went to great lengths to prevent Thomas from meeting with Bill Cassidy, the Regional Vice President for Office Imaging, to discuss advancement opportunities, and became angry when Thomas nevertheless managed to schedule an appointment.
Thomas’s most significant and concrete allegation is that Flannery gave her inaccurately low scores on her annual performance appraisals. For example, after receiving only 5s and 6s in 1988 and 1989, Thomas received a 2, four 3s, and a 4 from Flannery in 1990, for an overall score of 3. This was a below-average rating, appropriate for employees who had “a need for further improvement to aсhieve a middle rating [of 4]” or for employees “whose overall performance has slipped from a higher level.” Thomas’s performance appraisal scores in 1991 and 1992, while higher than her 1990 scores, were also inappropriately low, in Thomas’s estimation — especially when compared to the higher scores that Flannery gave to other *46 CSRs. Thomas presents specific comparisons, discussed further below.
Thomas was distressed by the 1990 performance appraisal and refused to sign it. She also refused to sign the 1992 appraisal, and signed her 1991 appraisal only “ ‘out of a joke,’ because it was a joke.” It is clear that Thomas disagreed with Flannery’s evaluations; more than that — she found them “insulting” and “shameful.” To the extent that it was possible for her to compare salary and raises with other CSRs, however, she would not have noted any obvious effects of the negative appraisals, since neither her salary nor the raises she was given during Flannery’s tenure as Customer Support Manager differed significantly from those of other employees. She did complain about the appraisals, both to Flannery herself and to others within Kodak, including Bill Cassidy, the Regional Vice President for Office Imaging, and Patti Weissinger, a Human Resources Representative. However, fearing retaliation from her new boss, she did not file a formal charge against Flannery with the Human Resources Department, and Kodak did not take any action in response to her informal complaints.
In January 1993, Kodak decided to reduce the number of employees in its Office Imaging Division. It selected employees for layoff using a “Performance Appraisal Ranking Process” (“PAR process”), which produced a numerical score for each emрloyee by adding together the employee’s overall performance appraisal score for each of the three preceding years, after weighting the most recent score by a factor of 25 and the second most recent score by a factor of 5. 3 .Thomas’s PAR ranking, which was derived from the three appraisals conducted by Flannery, was the second lowest of the Wellesley CSRs. Since Kodak had decided to cut two Wellesley CSR positions, as well as the Customer Support Manager position, Flannery, Thomas, and the lowest ranked CSR (Eileen Lavallee) were laid off in March 1993.
II
In July 1993, four months after her layoff, Thomas filed a charge of race discrimination with the Equal Employment Opportunity Commission. The EEOC issued a “right to sue” letter in February 1996, and Thomas brought suit in May 1996.
Although her complaint asserts a variety of race discrimination claims, Thomas has focused primarily on Kodak’s 1993 decision to terminate her, arguing that this decision was discriminatory because it was based on discriminatory performance appraisals conducted by Flannery from 1990 through 1992.
4
It is the layoff which is the subject
*47
of this appeal. Kodak moved for summary judgment, contending that Thomas’s claim was time-barred, and in the alternative, that she had failed to present enough evidence of racial animus to reach a jury under the First Circuit’s standard for showing disparate treatment once pretext has been shown.
See Udo v. Tomes,
Ill
We review the district court’s grant of summary judgment de novo,
see Lennon v. Rubin,
A
We first address Kodak’s argument that the district court erred in not dismissing the action as untimely. Title VII requires aggrieved individuals to file a charge with the EEOC “within one hundred and eighty days after the alleged unlawful employment practice occurred.” 42 U.S.C. § 2000e-5(e)(1). In a “deferral jurisdiction,” such as Massachusetts, this period is extended to three hundred days.
5
See
42 U.S.C. § 2000e—5(e)(1);
Mohasco Corp. v. Silver,
However, Title VII’s statute of limitations is not self-executing. The three hundred day (or one hundred and eighty day) rule cannot be mechanically applied, because it is not possible to pinpoint when the limitations period begins — i.e., when the plaintiffs claim “accrues” — without first determining the date, or the temporal boundary, of the “alleged unlawful employment practice” referred to in § 2000e-5(e)(1).
The crux of Thomas’s claim is that her layoff is tainted because the 1990-92 performance appraisals were tainted by Flan-nery’s alleged racial bias. Thomas does not argue that Kodak’s decision to utilize the PAR process to determine who would be laid off was racially biased.
See Thomas,
According to Kodak, this argument can only mean that the performance appraisals themselves constitute the “unlawful employment practice” at issue under § 2000e-5(e)(l), and thus that any claim regarding the allegedly biased performance appraisals must have accrued at the time the appraisals were conducted. Since even the most recent appraisal (presented to Thomas in May 1992) was conducted more than three hundred days before Thomas filed her complaint with the EEOC in July 1993, Kodak asserts that Thomas is precluded from questioning the racial neutrality of any of the three performance appraisals. And without the ability to challenge the appraisals upon which the PAR process was based, Kodak notes, Thomas’s discriminatory termination claim collapses.
According to Thomas, the fact that the performance appraisals were conducted more than three hundred days before she filed her EEOC charge is irrelevant, because the adverse employment decision that she is challenging — the “unlawful employment practice” under § 2000e-5(e)(1) — is Kodak’s decision to lay her off, and not the performance appraisals independently, and this decision indisputably fell within the statutory period.
Kodak’s general policy stance makes sense: given that the central purpose of the statute of limitations is to protect employers from stale claims, it cannot be true that plaintiffs have an unfettered ability to reach back and litigate biased evaluations. Yet Thomas’s position is also reasonable, since it seems unlikely that Title VII permits employers to take a new employment action such as a layoff which first causes an employee harm, as long as the allegedly discriminatory evaluations on which the layoff is based were conducted more than three hundred days earlier.
Counsel did not call the court’s attention to any First Circuit caselaw on this point. The district court, led to believe that the First Circuit had not addressed this question, “look[ed] for guidance elsewhere.”
Thomas,
We think many aspects of Colgan’s reasoning are correct. However, we find more direct guidance in a First Circuit case decided several years before
Colgan.
In
Johnson v. General Electric,
The decision was motivated, in part, by concerns about ripeness. The court noted that “it is far from clear whether claims under Title VII and analogous statutes would be ripe for adjudication until discriminatory systems [or evaluations] were actually applied to plaintiffs in particular employment decisions,” and observed that “[i]t is unwise to encourage lawsuits before the injuries resulting from the violations are delineated, or before it is even certain that injuries will occur at all.” Id. at 136.
Johnson’s notice standard serves multiple functions. By starting the limitations clock as soon as harm is noticed, or should be noticed, the standard protects employers from stale claims while also lessening the risk that employees will bring unripe claims. The notice standard protects employees as well, because it ensures that claims will not be foreclosed by events that ocurred outside the limitations period that did not put the employees on notice.
The notice standard in Johnson also resolves the apparent contradiction between the two reasonable policy stances pressed by the litigants here. Thomas is correct that Title VII extends to a neutral employer decision-making process that relies on discriminatory evaluations. But Kodak is' also right that employees do not have an unfettered right to reach back to challenge previous evaluations. The key is whether those evaluations had tangible, concrete effects at the time they were conducted. If the evaluation did cause tangible, concrete harm, the notice standard requires the injured employee to promptly bring suit to recover for those harms. Failure to do so will render any later claim regarding those particular harms time-barred. 8
This means that an employer could be exposed to Title VII liability for harms stemming from discriminatory evaluations *51 some years after the evaluations were conducted, if the evaluations first cause tangible harm to the employee at that later point. Kodak argues that this puts employers in an untenable position. We disagree, for three reasons.
First, as a matter of practicality, employers are unlikely to rely on “stale” evaluаtions. The older the evaluation, the less likely an employer would be to use that evaluation, as the basis of an employment decision. Employers’ preference for more recent evaluations is demonstrated dramatically by Kodak’s own PAR formula, which used only the latest three evaluations and then weighted the most recent by a factor of 25.
Second, the passage of time would affect employees as well as employers. An employee who sought to challenge an old evaluation would bear the burden of proving that the evaluation was discriminatory — and the older the evaluation, the more difficult that task would be.
Third, the standard gives concomitant advantages to employers. It avoids forcing employees to “run to the EEOC” each time they disagree with a performance evaluation. As the district court aptly observed:
If we apply the time bar to plaintiffs [performance appraisals], then we require a given plaintiff to file EEOC charges successively for each performance evaluation, informal feedback from a supervisor, or office rumor, so long as these events — even if nonharmful in themselves — might be informed by racial animus and could someday contribute to a later, harmful result. This requirement would surely disrupt the American workplace....
Thomas,
In arguing against the application of the Colgan rule (which, as explained above, resembles the First Circuit standard announced in Johnson), Kodak has presented a two-stranded argument. The first strand is based on notions of fair notice to the employer. The second asserts that Thomas’s timeliness argument relies on an impermissible combination of the equitable tolling and continuing violation doctrines.
Kodak would substitute for the accrual rule in Johnson a rule that focuses on notice to the employer. According to Kodak, the Colgan accrual rule makes “bad law because it allows an employee to introduce a time-barred incident where the employee has never previously contended, even in an informal process afforded by the employer, that the employee was subjected to unlawful discrimination.” Appel-lee’s Brief at 17. According to Kodak, this violates “the policy underlying Title VII [entitling] an employer ... to some kind of prior notice.” Id.
This is not an irrational position, but Kodak cites no authority for it. The argument calls to mind the exhaustion rule imposed by courts in the ERISA context,
see
Employee Retirement Income Security Act, 29 U.S.C. §§ 1001 et seq., which requires employees to take full advantage of employer-internal appeals processes before bringing suit to recover denied benefits,
see McMahon v. Digital Equipment Corp.,
In Title VII, by contrast, Congress chose not to impose a particular employer-internal appeals procedure. Furthermore, the statute of limitations for a Title VII claim is not tolled while an employee exhausts any internal remedy the employer has made available.
See Ricks,
Kodak’s position bears an even closer resemblance to one aspect of the approach first explicated in
Stoller v. Marsh,
But the Stoller court added a second part to this rule:
[A]fter preparation of employee evaluations, the employing organization may protect itself from Title VII liability by establishing procedures to allow employees to screen their personnel files and to remove damaging, discriminatory information .... If established procedures have gven an employee a reasonable opportunity to inspect the supervisory evaluations in his or her file, to challenge allegedly inaccurate materials, and to have such materials corrected or removed, and if the organization gives its employees adequate notice that these rights may be exercised, then it may rely in good faith on such evaluations in making subsequent employment decisions without violating Title VII.
Stoller,
We can imagine advantages of the
Stol-ler
rule for both employers and employees. Employers, of course, would be able to limit their liability. Employees could con
*53
ceivably benefit as well, if the rule increased the likelihood that employers would implement effective review procedures. But we decline to adopt this rule— which was, in any event, implicitly rejected in
Johnson
— because we think any possible advantages would be outweighed by two serious risks. One is the obvious risk that employers would have carte blanche to rely on discriminatory appraisals as long as they offered their employees an “Open Door.” Avoiding the first risk would entail a second: courts would need to interpret strictly the concept of “[adequate] procedures to allow employees to screen their personnel files and remove damaging, discriminatory information.” This would in turn entail a degree of judicial scrutiny and interference with day-to-day employer operations that employers would be unlikely to welcome and which courts interpreting Title VII have tried to avoid.
See Hidalgo v. Overseas Condado Ins. Agencies, Inc.,
The second strand of Kodak’s timeliness argument is the claim that both Thomas and the district court have relied on an illegitimate “hybrid” of the equitable tolling and continuing violation doctrines. Thomas disavows any such reliance. We make several observations in response to Kodak’s argument.
First, the First Circuit doctrine of equitable tolling is simply inapplicable to these facts. Some courts permit tolling of the statute of limitations if the plaintiff knew of a harm but not of its discriminatory basis.
See
2 Lindemann & Grossman,
supra,
at 1350. But our approach to equitable tolling is narrower; First Circuit law permits equitable tolling only where the employer has actively misled the employee.
See Mack,
Second, there is no need to apply the continuing violation doctrine. Commentators have labeled this doctrine “the most muddled area in all of employment discrimination law,” 2 Lindemann & Gross-man,
supra,
at 1351, and some courts have gone so far as to conclude that the entire doctrine is misguided and unnecessary,
see, e.g., Moskowitz v. Trustees of Purdue Univ.,
The First Circuit has recognized two different types of continuing violations: systemic violations, which “ha[ve] [their] roots in a discriminatory policy or practice ... [that] itself continues into the limitation period,”
DeNovellis v. Shalala,
In the Title VII context, the continuing violation doctrine applies where “a number of discriminatory acts emanat[e] from the same discriminatory animus, [with] each act constituting a separate wrong actionable under Title VII.”
Jensen,
Thomas’s claim presents a different question: not whether she is permitted to bring suit after many repeated harms, but rather whether harm cognizable under Title VII existed at the time she first received her performance appraisals sufficient to mean that the notice of layoff is not the date on which the limitations period began for her layoff claim. This question is more appropriately addressed under the Johnson notice standard.
We consider whether the date on which the limitations period begins to run should be earlier than the notice of layoff because there was earlier concrete harm to Thomas resulting from the performance appraisals. According to Kodak’s compensation plan, appraisal scores were intended to affect salary levels and determine “who should be promoted, transferred, demoted, terminated, laid off, and re-employed.” It is not clear whether Kodak informed employees of this intention. But even if employees were familiar with the intended possible uses of the scores, it appears that the effects listed in the compensation plan remained abstract-mere possibilities, not certainties.
This was particularly true in Thomas’s case. Flannery’s appraisals evaluated Thomas’s performance as only average or below average. But the appraisals did not specify that Thomas was to suffer an immediate consequence for her alleged performance failures,
9
much
*55
less that they would mechanically lead to her being laid off. Because Thomas seeks to recover for an allegedly discriminatory layoff, we consider whether that particular consequence was apparent at the time Thomas received the appraisals.
Cf. Colgan,
Kodak emphasizes the fact that Thomas refused to sign two of the three performance appraisals. According to Kodak, her refusal to sign “demonstrates that she was dissatisfied with [the appraisals] for some reason” and makes it “evident that she then believed she had been treated unfairly.” Appellee’s Brief at 18 n. 6. A trier of fact could, but need not necessarily, conclude that Thomas had notice of the appraisals’ possible racial bias as soon as they were presented to her. However, this fact does not go to the relevant question under Johnson’s notice rule: whether Thomas had notice of immediate, tangible consequences of her poor scores. Kodak’s argument pertains to notice of bias rather than the notice of harm required under Johnson. But notice of bias alone, absent harm, is clearly not sufficient under Johnson. 11
We hold that the performance appraisals Thomas received in 1990, 1991, and 1992 did not trigger the statute of limitations in § 2000e-5(e)(l) at the time they were presented to Thomas, because they did not initially have any crystallized implications or apparent tangible effects. The notice of the layoff is the date on which the limitations period began to run because Thomas’s low appraisal scores first resulted in concrete injury in 1993 when they led to her layoff. Since Thomas filed a charge with the EEOC within one hundred and eighty days of her layoff (the shortest possible period imposed under § 2000e-5(e)(1)), her сlaim is timely.
*56 B
The second question on appeal is whether a reasonable jury could find, based on Thomas’s evidence, that Flannery discriminated against Thomas because of her race when she assigned Thomas’s performance appraisal scores.
See
42 U.S.C. § 2000e-2(a)(1). To reach an answer, we follow the three stages of the “familiar burden-shifting framework,”
Mulero-Rodriguez v. Ponte, Inc.,
First, the plaintiff! ] must establish a prima facie case that [plaintiff] (1) was within a protected class; (2) met [the employer’s] legitimate performance expectations; (3) was adversely affected; and (4) was replaced by another with similar skills and qualifications. Once [plaintiff] do[es] so, the burden shifts to [the employer] to produce a valid and nondiseriminatory reason for the dismissal. In the final stage, the burden shifts back to the plaintiff[ ] to show that [the employer’s] stated reason for [plaintiffs] dismissal was false and but a pretext for discrimination.
Mulero-Rodriguez,
We focus particularly on the final stage of the
McDonnell Douglas/Burdine
framework, because we agree with the district court’s careful analysis of the first two stages.
See Thomas,
At the third stage of the
McDonnell Douglas/Burdine
framework, the ultimate burden is on the plaintiff to persuade the trier of fact that she has been treated differently because of her race.
See Hidalgo,
Applying this standard, the district court found that Thomas had met the first part of her burden. It determined that a jury could find that the appraisal scores were objectively unfair or in some sense skewed against Thomas. Accordingly, the court found that “[biased only on the evidence presented so far, a reasonable fact finder could determine that Flannery’s reasons for lowering plaintiffs [performance appraisal] ratings were pretextual.”
Thomas,
But the court also found that Thomas did not meet the second, “plus” part of her burden: she failed to present evidence to create a genuine issue of fact about whether the allegedly unfair scores were due to her race and not some other factor. See id. at 137-38. The court compared the evidence presented by Thomas, which in the court’s view amountеd only to evidence of “an unwelcoming office environment,” “conspicuously unfair treatment” by Flan-nery, and “a personality conflict between Flannery and plaintiff,” id., with the evidence presented by plaintiffs in other First Circuit cases in which plaintiffs had survived summary judgment:
In all of these cases, the plaintiff alleged at least one piece of evidence that explicitly referred to plaintiffs membership in a protected class, and stated or implied that this membership was or would soon adversely affect plaintiffs employment prospects. By contrast, in cases where plaintiff fails to make any connection between adverse employment actions and membership in a protected class, summary judgment is granted to the employer, and judgment is affirmed.
Id. at 138. The court granted summary judgment to Kodak because it found that Thomas had failed to “make any connection” between Flannery’s actions and her own membership in a protected class. Id. In particular, the court found that Thomas failed to present any evidence that Flan-nery had expressly linked the low scores to Thomas’s race and that Thomas’s evidence of pretext was not “’flagrant ” enough to support a finding of discrimination on its own. Id. (emphasis added).
The district court’s analysis mischarac-terizes the plaintiffs burden at the third stage of the McDonnell Douglas/Burdine framework. To clarify the actual nature of that burden, we make a number of points.
First, the labels “pretext” and “plus” must be used with great care. Although it uses the label “plus,” the First Circuit’s “pretext-plus” standard “does not necessarily require the introduction of additional evidence” beyond that required to show “pretext,” i.e., evidence showing that the employer’s articulated reason is false.
Dichner,
A corollary is that there can be no mechanical formula at the third stage of the
McDonnell Douglas/Burdine
framework. “The strength of the prima facie case and the significance of the disbelieved pretext will vary from case to case depending on the circumstances. In short, everything depends on the individual facts.”
Woods v. Friction Materials, Inc.,
Our second point is related. That “everything depends on the individual facts” means we need to look carefully at the particular type of claim that Thomas is bringing. As we understand Thomas’s argument, she alleges a more subtle type of disparate treatment than the type often used to exemplify the operation of the McDonnell Douglas/Burdine framework. She does not argue that Kodak has articulated a false reason for her layoff (for example, excessive tardiness) in order to disguise the actual, unrelated reason (her race) — what one might describe as a “truth versus lies” claim — rather, she challenges the racial neutrality of the proffered reason itself. The latter type of challenge is also cognizable as a form of disparate treatment: if an employer evaluates employees of one race less favorably than employees of another race who have performed equivalently, and if race, rather than some other factor, is the basis for the difference in evaluations, then the disfavored employees have been subjected to “discriminat[ion] ... because of ... race.” § 2000e-2(a)(1).
The “pretext-plus” label may seem inapposite for this type of claim, since the word “pretext” could suggest a conscious lie. But issues of pretext should not be confused with the issue of whether there has been discrimination “because of race.” As the Supreme Court has used the term “pretext,” the term provides “no justification for assuming ... that those employers whose evidence is disbelieved are perjurers and liars.”
Hicks,
*59
The Supreme Court has long recognized that unlawful discrimination can stem from stereotypes and other types of cognitive biases, as well as from conscious animus. Indeed, discussing age discrimination in
Hazen Paper Co. v. Biggins,
The role of such stereotyping has been discussed most thoroughly in that branch of disparate treatment law developed apart from the
McDonnell Douglas/Burdine
framework and known as the
Price Waterhouse
framework.
See Price Waterhouse v. Hopkins,
In keeping with [Title VII’s remedial] purpose, the Supreme Court has never applied the concept of intent so as to excuse an artificial, gender-based employment barrier simply because the employer involved did not harbor the requisite degree of ill-will towards the person in question. As the evidentiary framework established in McDonnell Douglas makes clear, the requirement ] of discriminatory motive in disparate treatment cases does not function as a “state of mind” element, but as a method of ensuring that only those arbitrary or artificial employment barriers that are related to an employee or applicant’s *60 race, sex, religion, or national origin are eliminated.
Id.
at 468-69 (footnotes omitted);
see also Lynn v. Regents of the Univ. of Cal.,
The court of appeals found it unsurprising that the disparate treatment doctrine focuses on causality rather than conscious motivations, since “unwitting or ingrained bias is no less injurious or worthy of eradication than blatant or calculated discrimination.”
Hopkins,
The Supreme Court upheld this portion of the D.C. Circuit’s decision.
See Price Waterhouse,
*61
The concept of “stereotyping” includes not only simple beliefs such as “women are not aggressive” but also a host of more subtle cognitive phenomena which can skew perceptions and judgments.
Price Waterhouse
highlighted one such phenomenon: the tendency of “unique” employees (that is, single employees belonging to a protected class, such as a single female or a single minority in the pool of employees) to be evaluated more harshly in a subjective evaluation process. See
id.
at 235-36,
We make one final general point about the third stage of the
McDonnell Douglas /Burdine
framework: in the context of a motion for summary judgment, courts must be very cautious about sua sponte finding non-discriminatory reasons for apparently disparate treatment. Although the presumption of discrimination has dropped out of the case by the third stage of the
McDonnell Douglas /Burdine
framework, its rationale remains relevant: discrimination, rarely explicit and thus rarely the subject of direct evidence, may be proven through the elimination of other plausible non-discriminatory reasons until the most plausible reason remaining is discrimination.
See Burdine,
Having clarified the actual nature of the plaintiffs burden under
McDonnell Douglas/Burdine,
we now consider whether Thomas has met that burden. Because the employer here has articulated a nondiscriminatory reason for its action, the initial presumption of discrimination creat
*62
ed under
McDonnell Douglas/Burdine
has dropped from the case.
See Hicks,
The first portion of Thomas’s third-stage burden is to produce sufficient evidence to show that she was not evaluated on the same terms as her colleagues, but rather was evaluated more harshly than the other, non-minority CSRs. The district court found that the plaintiff met this portion of her burden: she produced enough evidence to allow a reasonable fact-finder to determine that Flannery’s reasons for assigning her scores were “pretextual.”
Thomas,
One strong piece of evidence is the sharp drop in Thomas’s overall score (from 6 in 1989 to 3 in 1990) after Flannery became her supervisor. While the drop in score would not demonstrate uneven treatment if Flannery were simply a “tough grader,” this does not appear to have been the case. In 1990, when Thomas received a 3, all of the other CSRs whom Flannery graded received eithеr a 5 or a 6. 15 Furthermore, over her three years as supervisor Flannery gave CSRs other than Thomas a number of 7s in individual categories, a score that earlier supervisors characterized as extremely rare or even unheard of.
The drop in Thomas’s score from 1989 to 1990 would also fail to demonstrate uneven treatment if it merely reflected a higher standard imposed upon Thomas due to her grade change in 1989. According to Kodak, “the higher the wage grade and salary, the more Kodak expected from the employee.” Appellee’s Brief at 34. But Thomas presents deposition testimony from other supervisors suggesting that there was no such rule, and the scores that Flannery assigned to other Wellesley CSRs seem to bear this out: one CSR who received a grade change in 1991 received a 5 in 1992 after receiving a 6 in 1991 (a drop of only one point rather than three after the grade change), and another CSR who received a grade change in 1991 received a higher score the following year (from a 4 in 1991 to a 5 in 1992).
Thomas makes other specific comparisons as well. The strongest of these concern her scores for “quantity of results,” which was the most objective of the categories appraised, since it was based directly on the number of machines and installations for which a CSR was responsible. Thomas’s scores appear low when compared to the scores for quantity given to other CSRs. In 1990, for instance, Thomas received only a 3 for quantity of results — a below average score — for managing 730 machines and 110 installations, while another CSR, also in the K6 grade, received
*63
a 5 for quantity in 1991 for managing far fewer machines (504) and fewer installations (81). Kodak argues that the comparison is “unfair and analytically unsound” because the other employee had been at the K6 grade for many years, and “Sustained performance at a higher grade counts for more than a relatively shorter period of performance at the higher grade.” Appellee’s Brief at 35. This argument might persuade a jury to discount the value of this comparison, but it does not render the comparison meaningless at summary judgment, where Thomas need only produce evidence sufficient to support her contention that her scores were lower than those given to similarly performing non-minority CSRs.
See Molloy v. Blanchard,
In addition to the performance appraisals themselves, Thomas presents other evidence to show that Flannery treated her differently than she treated the other CSRs. According to Thomas, Flannery used flimsy grounds to prevent her from delivering an important presentation at a Kodak meeting, refused to provide her with computer training, failed to allow her appropriate developmental opportunities, and failed to evaluate her accurately on the basis of her interaction with customers (since she never accompanied Thomas on site in order to observe that interaction, as she did with the other CSRs).
On the whole Thomas has presented enough evidence to survive summary judgment on the question of disparate treatment. Some of Thomas’s scores, and particularly the 1990 scores, appear lower than one would expect given her apparent performance level. A reasonable jury, examining the evidence Thomas has submitted, could conclude that Thomas was evaluated more harshly than other, non-minority CSRs supervised by Flannery.
Kodak argues that, even if true, this would not matter, because Thomas would have been laid off anyway. Even if Thomas had received the highest possible score (7) in 1990, Kodak says, her PAR ranking would not have changed: because the PAR formula so heavily weighted the two more recent scores, Thomas would still have had the second lowest PAR score among the CSRs and would still have been selected for layoff. There are two flaws with this argument. First, it ignores the natural implications of a finding that the 1990 scores were unfairly low. To the extent that Thomas’s 1990 scores were inappropriately low, a factfinder could infer that her later scores' — although higher than her 1990 scores — were inappropriately low as well 16 (and indeed, they were consistently lower than the scores given to the other CSRs, as demonstrated by the performance appraisals that Kodak has produced for 1991 and 1992). Second, Kodak’s argument ignores the fact that performance appraisal scores were expressly on a curve. According to Kodak’s compensation plan, scores were to be “validated” by reference to an informal supervisor ranking of employees in the group. Thus, if Thomas’s scores had gone up, other CSRs’ scores might have gone down. Whether accurate scoring of Thomas in relation to the other CSRs would have saved her from layoff is something Thomas must prove at trial. But she has created a genuine issue with respect to this question. Not only has she produced evidence which suggests that Flannery treated her differently and graded her more harshly than the other CSRs, she has also produced evidence to support the claim that her earlier evaluations were a more accurаte *64 appraisal of her performance: (1) her promotion in 1989 to the K6 grade, which was reserved for “outstanding” CSRs “servicing the largest and/or most sensitive accounts,” (2) her salary, which was higher than her seniority would predict, presumably reflecting the fact that Kodak had, over the years, found her performance level to be comparatively high, and (3) the positive comments from customers and from other supervisors, which suggest that she was in fact one of the better CSRs.
Our inquiry does not end with this finding, since even the most blatant unfairness cannot, on its own, support a Title VII claim. “Title VII does not grant relief to a plaintiff who has been discharged unfairly, even by the most irrational of managers, unless facts and circumstances indicate that discriminatory animus was the reason for the decision.”
Smith,
As we emphasized above, Thomas can meet her burden with respect to this question using the same evidence used to show unequal treatment. The district court’s opinion can be read to say that at this stage the plaintiff must produce what has been called “direct” evidence, such as racially explicit statements. That is not so. There is no requirement that a plaintiff, having shown differential treatment and pretext, must present direct, “smoking gun” evidence of racially biased decision-making in order to prevail. Where the disparity in treatment is striking enough, a jury may infer that race was the cause, especially if no explanation is offered other than the reason rejected as pretextual. That is the case here, since Kodak argues that Thomas’s scores were objectively fаir, not that they were unfair due to a personality conflict, as the district court surmised.
In fact, Thomas does present additional evidence on this point. She describes incidents and situations which suggest that Flannery had a general disregard for her professional abilities and status. It also appears, from the picture that Thomas paints (and which Kodak does not dispute) that Flannery was at times inappropriately upset or angry with Thomas, to the point of behaving unprofessionally. This, in turn, suggests that she did not respond neutrally to Thomas. A jury might reasonably infer from Thomas’s description of these incidents that Thomas’s race was an issue for Flannery and that Flannery’s evaluations of Thomas were affected by some form of conscious animus or less conscious bias. 17
Our assessment of the evidence would be quite different if Thomas had been one of several black employees supervised by Flannery, some of whose scores were not low relative to those of the non-minority CSRs. But that was not the case. Thomas was the only black CSR, and one can infer from the evidence that she was also the only CSR who was evaluated unfairly.
*65
Given this, it is reasonable to infer that race played a determinative role in the evaluation process — especially since there is also other evidence that Flannery treated Thomas poorly. Indeed, the very fact that Thomas was the only black CSR at the Wellesley office may have increased the likelihood that she would be evaluated more harshly.
See Price Waterhouse,
In the end, Thomas is in a different position than a plaintiff who merely “disagree[s] with [the employer’s] assessment of her relative performance.”
Compton v. GTE Gov’t Sys., Corp.,
No. 93-11383,
IV
Thomas’s claim is timely, and it is based on sufficient evidence to permit a reasonable jury to find that Flannery discriminated against Thomas because of her race when she assigned the performance appraisal scores that led to Thomas’s layoff. Accordingly, we reverse the district court’s grant of summary judgment and remand for further proceedings consistent with this opinion.
Costs are awarded to appellant.
Notes
. The lowest rating was 1 (for performance that "does not meet essential job requirements”) and the highest was 7 (for performance that "far exceeds requirements of the position”). The middle rating of 4 was considered appropriate “for individuals who regularly perform all assigned responsibilities with independence and initiative, and achieve expected results on a continual basis.”
. We use the terms “black" and “white,” as Thomas has chosen this terminology.
. This formula was used for employees whose grade remained the same for all three years. If an employee had changed grades during the three year period, the formula gave less weight to the score immediately following the grade change.
. Thomas described a number of other events as evidence of unlawful race discrimination by Kodak. The district court provided a succinct summary:
When plaintiff was transferred to Welles-ley, Kodak did not fly her to Massachusetts, as it had other transferred employees. Instead, Thomas was told to drive a car that had been requested by Richard Austin, her new supervisor. The car was in disrepair, requiring a new muffler. Though other employees had two weeks in a new location before reporting to work, plaintiff was told to report right away.
As part of plaintiff’s move benefits, Kodak provided storage for her belongings. The warehouse burned in 1981, destroying virtually all of Thomas' belongings. Thomas gave Austin an inventory of her losses; in response, he asked, "Are you rich or something?” Before Thomas was reimbursed, another Kodak employee encouraged her to change her inventory listing. Defendant did not reimburse Thomas for five years.
A few months after Thomas began working in Wellesley, she was standing with a group of employees when Austin introduced two visiting sales managers. Austin introduced everyone in the group except Thomas.
In 1983, Thomas passed a co-worker in the hallway outside the department manager's office. The co-worker asked the manager if plaintiff was "here to do the cleaning.”
*47 Thomas lost a gold bracelet at work. A co-worker announced finding it just before a sales meeting, as other employees arrived. Thomas claimed the bracelet, and described it at his request. He refused to return the bracelet until he had confirmed that it did not belong to another, white female employee.
Thomas v. Eastman Kodak Co.,
. The full story is more complicated, in part because § 2000e-5(e) interacts with § 2000e-5(c), which imposes a sixty-day waiting period between the filing of a charge with state or local authorities and the filing of a charge with the EEOC:
[A] complainant in a deferral State ... need only file his charge within 240 days of the alleged discriminatoiy employment practice in order to insure that his federal rights will be preserved. If a complainant files later than that (but not more than 300 days after the practice complained of), his right to seek relief under Title VII will nonetheless be preserved if the State happens to complete its consideration of the charge prior to the 300-day period.
Mohasco Corp. v. Silver,
. Section 112 of the 1991 Civil Rights Act later overruled Lorance, allowing employees to challenge a seniority system "when the system is adopted, when an individual becomes subject to the seniority system, or when a person aggrieved is injured by the application of the seniority system.” 1991 Civil Rights Act, Pub.L. No. 102-166, § 112, 105 Slat. 1071, 1079 (1991) (codified at 42 U.S.C. § 2000e-5(e)(2)). This may mean that seniority systems should be treated as an exception to the "notice rule” discussed in Col-gan; alternatively, it may simply mean that Congress thought that the Court had misapplied the notice rule in the seniority system context.
.
Johnson,
which was decided shortly before the Supreme Court’s decision in
Lorance,
suggested that this rule would apply to discriminatory seniority systems as well.
See Johnson,
. If, for example, a poor job evaluation resulted in a denial of a salary increase, notice of the denial would mark the accrual point for a pay inequity claim. Under
Rides,
the statute would also begin to run on any other harm that was the “delayed, but inevitable, consequence” of the denial of a salary increase,
Ricks,
. Under
Ricks,
we must also consider whether the appraisals led to any other concrete harm for which the layoffs were the "delayed, but inevitable, consequence.”
Ricks,
*55 As noted above, it is not clear whether a separate, unrelated harm would also trigger the accrual of Thomas's layoff claim. We need not reach this question. Assuming ar-guendo that unrelated harms would trigger accrual of Thomas's layoff claim, accrual was not triggered here, because Thomas's appraisals did not in fact result in any tangible, concrete harms at the time they were presented to her. Although Thomas was denied some promotion opportunities during Flan-nery's tenure, she was not told that she lost those opportunities because of her low performance appraisals; rather, Flannery told her she lacked the proper credentials. Thomas was also not told that her salary would be affected by her low scores, and indeed, it appears that her salary kept pace with the other fivе CSRs throughout Flannery’s tenure as Customer Support Manager.
. The compensation plan instructed appraisers not to discuss appraisals in comparative terms, and appraisal scores were not openly discussed among employees. Thus, Thomas had no way to discover this information on her own. Although she knew that her scores were low in comparison to the scores she had received before Flannery became her supervisor, she could not know that they were low in comparison to the scores of other CSRs whom Flannery was evaluating.
. Our analysis of the notice-of-harm issue might be different in the context of a hostile environment claim. In that context, we might find that Thomas’s low appraisal scores caused some tangible harm as soon as they were presented to her, since Thomas reportedly found the low scores to be "insulting” and "shameful” (and demonstrated this by refusing to sign them). But pinpointing the moment of accrual for a hostile environment claim is a different problem altogether than the one here, since a certain accumulation of discriminatory events or situations is necessary before an employee is expected — or permitted — to bring a hostile environment suit.
See Provencher v. CVS Pharmacy,
. This method of proving a Title VII claim is all the more important now than it was when
McDonnell Douglas
was written, since "smoking gun’’ evidence is "rarely found in today’s sophisticated employment world.”
Hodgens v. General Dynamics Corp.,
. The language in certain other First Circuit cases might suggest that an express and conscious employеr intent to discriminate is critical to the third stage of the
McDonnell Douglas/Burdine
inquiry. Cases have stated, for example, that a plaintiff must "elucidate specific facts which would enable a jury to find that the reason given is not only a sham,
*59
but a sham intended to cover up the employer’s real motive,”
Hidalgo,
.
Price Waterhouse
also held that an employer could avoid liability for disparate treatment if it could demonstrate that it would have taken the same action in the absence of the discriminatory motive.
See Price Waterhouse,
In defining those violations of Title VII for which compensatory damages may be awarded, the 1991 Act uses the phrase "unlawful intentional discrimination.” 42 U.S.C. § 1981a(a)(l). This phrase merely distinguishes the disparate treatment framework, and other frameworks in which plaintiffs are individually affected, from the disparate impact framework.
See id.
(limiting compensatory damages to actions in which respondents "engaged in unlawful intentional discrimination (not an employment practice that is unlawful because of its disparate impact)”). Cases decided after the enactment of the 1991 Civil Rights Act continue to recognize the validity of claims based on employers' biased or stereotypical thinking.
See, e.g., Galdieri-Ambrosini
v.
National Realty & Dev. Corp.,
. This information comes from a summary chart of performance appraisal scores produced by Kodak. For 1990 Kodak apparently retained only Thomas’s appraisal form and the overall scores of the other CSRs; it was unable to produce the other five appraisal forms for 1990. Thomas suggests in a footnote that the summary chart "would be inadmissible pursuant to FRE 1006 since the documents summarized are not so voluminous that a jury could not reasonably review them at trial and because the documents summarized were not produced." Appellant's Brief at 18 n. 3. We do not address this argument, since issues not squarely raised by the parties are considered waived.
See Service Employees Int’l Union v. Local 1199 N.E.,
. In addition to drawing such an inference, a factfinder might also reach this conclusion — that Thomas’s 1991 and 1992 scores were inappropriately low — by relying on the type of direct evidence discussed above with reference to Thomas’s 1990 scores.
. Thomas describes a number of other incidents from her time at Kodak, not involving Flannery, which she believes to have been racially tinged. (These incidents are recounted in note 4 above.) If true, these incidents are unfortunate, but they are not relevant to the issue whether Flannery evaluated Thomas, differently because of her race. As a general matter, "[t]he biases of one who neither makes nor influences the challenged personnel decision are not probative in an employment discrimination case.”
Medina-Munoz v. R.J. Reynolds Tobacco Co.,
