When the Science Was Never There: Junk Forensics, Unethical Investigations, and the Institutions That Kept Using Both

Q: What did the 2009 NAS report find about forensic science?

The 2009 National Academy of Sciences report Strengthening Forensic Science in the United States found that most forensic disciplines outside of nuclear DNA analysis lacked the scientific foundation to support the certainty with which they were presented in court. The report called for mandatory accreditation, independent oversight, and standardized research protocols. Many of its recommendations remain unimplemented.

Williams, Rita

When the Science Was Never There: Junk Forensics, Unethical Investigations, and the Institutions That Kept Using Both

By Rita Williams July 2, 2026

22 Min Read

Direct Answer

Discredited forensic methods have contributed to wrongful convictions in the United States for decades. The problem is not limited to rogue analysts. It is structural: courts that admit testimony without rigorous gatekeeping, labs that operate under prosecutorial supervision with no independent accreditation, and a legal system that treats the appearance of scientific authority as adequate substitute for scientific validity. The techniques have changed. The structural conditions enabling them have not.

Key Points

Bite mark comparison, microscopic hair analysis, and certain blood spatter pattern testimony lack the empirical foundation to support the certainty with which they are routinely presented in court.

The 2009 National Academy of Sciences report found that most forensic disciplines outside of nuclear DNA analysis could not demonstrate the scientific validity assumed by courts and juries.

Lab fraud cases, including Fred Zain in West Virginia and Annie Dookhan in Massachusetts, were not discovered through internal oversight. They were discovered through external pressure, post-conviction DNA testing, or investigative reporting.

Shaken baby syndrome testimony has been contested by significant portions of the medical community, yet courts have been slow to permit challenges to convictions built on that diagnosis.

Reform requires mandatory independent accreditation, published error rates, and a conviction review mechanism that does not require defendants to first prove innocence before being permitted to challenge discredited forensic evidence.

The Problem Is Structural, Not Incidental

The standard account of forensic fraud treats it as aberration. A bad analyst. A corrupt lab. An outlier. That account is wrong, and the evidence that it is wrong has been available for decades.

In 2009, the National Academy of Sciences released a report on the state of forensic science in American courts. The findings were direct: with the exception of nuclear DNA analysis, no forensic discipline had demonstrated, through rigorous empirical research, that its methods could reliably produce accurate results or that practitioners could consistently apply those methods the same way. The report identified bite mark analysis, microscopic hair comparison, blood spatter interpretation, tool mark analysis, and shoe print comparison as disciplines in which the scientific foundation was, at best, contested and, at worst, invented.

The courts had been admitting this testimony for years. They continued admitting it after the report was published.

Structural Finding

The forensic science problem is not a quality control failure. It is an admissibility failure. Courts permitted testimony that overstated certainty, and they did so under legal standards that were never designed to evaluate whether a scientific discipline had actually been validated. The Daubert standard requires that methods be tested, peer-reviewed, have known error rates, and enjoy general acceptance in the relevant scientific community. Bite mark analysis, to take the most documented example, meets none of those criteria. It was admitted anyway.

What Bite Marks Actually Demonstrate

Bite mark analysis is premised on two claims: that human dentition is unique, and that skin reliably preserves the impression of that dentition in a way that allows comparison. Neither claim has been validated by controlled research. The claim of dental uniqueness is an assumption, not a finding. The claim of reliable skin impression is contradicted by research on how human skin deforms, decomposes, and changes shape after death.

The Innocence Project has documented multiple exonerations in which bite mark testimony was a central element of the conviction. Ray Krone spent ten years in an Arizona prison, two of them on death row, before DNA evidence excluded him and identified another person. The bite mark testimony at his trial had been presented by an expert who described his methodology as reliable and his conclusions as certain. Neither characterization held up.

The American Board of Forensic Odontology, the professional body that certifies bite mark analysts, has periodically updated its guidelines. It has not produced the controlled error rate research that would be required to justify the certainty with which analysts testify. A 2016 report from the President’s Council of Advisors on Science and Technology concluded that bite mark analysis should not be admitted in criminal proceedings pending development of a scientific foundation. Courts have largely continued to admit it.

The gap between what the science supports and what analysts say in court is not a matter of interpretation. It is documented, quantified, and has been documented for long enough that continued admission of this testimony is not a gatekeeping failure. It is a choice.

Microscopic Hair Analysis and the FBI’s Correction That Did Not Correct Much

For decades, FBI analysts testified that microscopic hair comparison could positively associate a questioned hair sample with a known individual. The technique was presented as a reliable scientific method. It is not. Hair lacks the individualizing characteristics that would allow a positive identification. Two hairs can be indistinguishable under a microscope and come from different people. The technique can exclude, but it cannot include with scientific validity.

In 2015, the FBI, in cooperation with the Innocence Project and the National Association of Criminal Defense Lawyers, acknowledged that hair analysts had overstated the significance of hair comparison evidence in at least 95 percent of cases reviewed in an initial audit. The affected cases spanned decades. Some defendants had already been executed. Some had died in prison. The audit covered a fraction of the cases in which this testimony had been offered.

Enforcement Gap

The FBI’s acknowledgment did not trigger automatic review of all affected convictions. Defendants whose convictions included this testimony were required to file individual post-conviction motions, and those motions faced substantial procedural barriers. The remedy was theoretical. For most affected defendants, it was not practical.

The problem with hair analysis is not that individual analysts were dishonest, though some were. The problem is that the method was taught as reliable, certified as reliable, and presented in court as reliable for generations before anyone was required to produce the research that would justify that characterization. The scientific community did not validate the technique. Courts validated it by admitting it repeatedly, and repetition became its own form of authority.

Fred Zain, Annie Dookhan, and the Lab Fraud Architecture

Fred Zain worked as a serology analyst for the West Virginia State Police crime lab from 1977 to 1989, and subsequently for the medical examiner’s office in Bexar County, Texas. During that time, he fabricated or altered results in a number of cases that has never been fully determined. The precise scope of his fraud was never established because the documentation necessary to determine it was inadequate. West Virginia did not have a system that would have caught him.

The West Virginia Supreme Court of Appeals, reviewing Zain’s conduct in 1993, described his work as representing a systematic practice of fraud. The court found that supervisors had received complaints and had not acted on them, that external auditors had identified anomalies that were not investigated, and that the lab had operated for years without meaningful quality controls. The fraud was discovered because a defendant whose conviction was built on Zain’s testimony sought post-conviction DNA testing, not because the system identified a problem.

Annie Dookhan worked as a chemist at the William A. Hinton State Laboratory Institute in Massachusetts from 2003 to 2012. She confirmed the presence of controlled substances in samples without testing them, forged colleague signatures, and manipulated test results. Investigations following her arrest estimated that approximately 24,000 cases were potentially affected by her work. The lab had given her performance awards for processing a volume of samples that should have been physically impossible to test accurately. No one investigated why her numbers were so much higher than her colleagues’.

~24,000 Cases potentially affected by Annie Dookhan’s conduct at the Hinton Lab

9+ Years Fred Zain fabricated serological evidence before external review surfaced the fraud

257+ FBI cases with overstated hair analysis testimony identified in the initial 2015 audit alone

These are not stories about exceptional wrongdoers. They are stories about systems that created conditions in which wrongdoing was easy, difficult to detect, and, once detected, difficult to remediate. Zain operated in a lab where no one verified his work. Dookhan operated in a lab where throughput was treated as a metric of quality. In both cases, the lab existed within a prosecutorial ecosystem in which the expectation was that the analyst’s job was to confirm what investigators already believed.

Pattern

Forensic labs that operate under law enforcement supervision, without independent accreditation or external audit, have a structural incentive problem. The analyst’s institutional relationship is with the people who need convictions. That relationship does not have to produce deliberate fraud to produce distorted results. It produces distorted results because the institutional context shapes what gets questioned, what gets reported, and what gets buried.

Shaken Baby Syndrome and the Medical Diagnosis as Legal Proof

Shaken baby syndrome, now more commonly referred to by practitioners as abusive head trauma, is a medical diagnosis based on a triad of findings: subdural hematoma, retinal hemorrhage, and brain swelling. For decades, the presence of that triad was treated in courts as sufficient evidence to establish that an infant had been violently shaken, and the person responsible for the child at the time of the medical emergency was treated as the responsible party.

The scientific basis for that inference has been contested by a significant portion of the biomechanical and pediatric medical community. Research published in peer-reviewed journals has raised questions about whether shaking alone, without impact, can produce the force necessary to generate the triad, whether the triad can result from accidental falls or short-distance drops, whether medical conditions can mimic the triad, and whether the retinal hemorrhages previously treated as a specific indicator of abuse occur in other circumstances. The medical community has not reached consensus on these questions. Courts have frequently proceeded as if it had.

Kathleen Folbigg in Australia and a number of defendants in the United States and United Kingdom have had convictions examined or overturned in proceedings that turned on the contested science underlying this diagnosis. In each instance, the original conviction was built on expert testimony that presented the diagnosis as more settled than the underlying research supported.

Gatekeeping Failure

When a medical diagnosis functions as legal proof of criminal intent, the standards for admissibility must account for the contested nature of the underlying science. Courts have generally not required that the triad-based inference be validated through the same evidentiary standards applied to other forensic disciplines. The result is that defendants are convicted on the basis of a medical framework whose reliability is actively debated among the specialists who developed it.

Why Courts Keep Admitting What Scientists Have Rejected

The persistence of discredited forensic methods in courts is sometimes explained as a lag problem. Science advances, courts catch up slowly. That explanation is too generous. The 2009 NAS report was not the first time the scientific community raised concerns about these disciplines. Challenges to bite mark analysis appeared in the research literature in the 1970s. Challenges to microscopic hair analysis predate the FBI’s 2015 acknowledgment by decades. The lag is not a matter of information. It is a matter of institutional incentive.

Courts rely on forensic evidence. Prosecutors build cases around it. Judges who were once prosecutors learned to trust it. Defense attorneys, particularly public defenders operating under severe resource constraints, frequently cannot afford the expert witnesses necessary to challenge it. The infrastructure of the criminal legal system is built around the assumption that forensic evidence is reliable, and that assumption is not examined until a specific defendant, in a specific case, raises a specific challenge with sufficient resources to make it.

The Daubert standard, properly applied, should have filtered out methods without validated error rates and peer-reviewed scientific support. It has not, in part because the standard gives trial judges substantial discretion, and trial judges are not scientists. Judges evaluate whether a method has been generally accepted in the relevant scientific community, but the relevant scientific community for bite mark analysis is forensic odontologists, many of whom practice bite mark analysis. The community that accepts the method and the community that could validate it are not the same community.

The admissibility problem and the lab integrity problem compound each other. A method that overstates its certainty in court is most dangerous when the analyst producing the results operates without independent verification. Those two conditions, unvalidated methods and unaudited labs, have coexisted in the American forensic system for long enough that the combination cannot be described as an accident.

What Reform Would Actually Require

The 2009 NAS report made specific recommendations. A significant number remain unimplemented. The report called for the creation of a National Institute of Forensic Science, independent of the Department of Justice, that would set standards, fund research, and oversee accreditation. Congress has not created that institution. The report called for mandatory accreditation of all forensic labs. Many labs remain unaccredited. The report called for development and publication of error rates for every forensic discipline used in criminal proceedings. That research has not been systematically conducted.

The gap between what the report recommended in 2009 and what exists in 2026 is itself a finding. It documents who benefits from the current system and who absorbs the cost of its failures.

Reform Framework

Minimum structural reform requires four elements: mandatory independent accreditation of all forensic laboratories with published, binding standards; defined and publicly available error rates for every method offered in criminal testimony; mandatory disclosure of methodological limitations to defendants before trial; and a conviction review mechanism that does not require defendants to prove actual innocence before they are permitted to challenge the forensic basis of their conviction. None of these reforms require legislation that is technically difficult to draft. They require political will that the current system does not generate because the current system’s beneficiaries are better positioned than its victims.

The history of forensic accountability in the United States is a history of external pressure producing internal acknowledgment that does not produce systemic change. The FBI acknowledged the hair analysis problem after advocates forced disclosure. Massachusetts addressed the Dookhan fallout after defendants organized and litigated in volume. West Virginia reviewed Zain’s work because a specific defendant pursued a specific remedy. In each case, the system did not self-correct. It was corrected from outside, at substantial cost, by people with inadequate resources and considerable determination.

That is not a sustainable model for ensuring that the forensic evidence used to deprive people of their liberty is accurate. It is, however, the model that currently exists.

Interactive Tool

Forensic Reliability Risk Sorter

Tap each method to see the core reliability problem the article identifies.

Individualization problem

Bite mark comparison

The method has been presented as if skin injuries can reliably identify one person’s teeth. The article’s point is that the scientific foundation does not support that certainty.

Overstatement problem

Microscopic hair analysis

Hair comparison can sometimes exclude. It cannot validly identify a person to the level analysts claimed for decades. The FBI’s later review showed how widespread that overstatement became.

Interpretation problem

Blood spatter testimony

Pattern interpretation can become dangerously subjective when analysts present reconstruction opinions with more certainty than the underlying method can support.

Diagnosis as proof

Shaken baby syndrome / abusive head trauma

The danger is not that medical expertise is irrelevant. The danger is when contested medical inference becomes courtroom proof of criminal intent without enough room for scientific uncertainty.

Interactive Tool

Gatekeeping Failure Map

This map shows where unreliable forensic evidence should be stopped, and where the article says the system keeps letting it through.

Lab level Missing accreditation, poor verification, and performance incentives can let weak or fraudulent work become official evidence.

Prosecution level Prosecutors may rely on forensic conclusions as case architecture instead of treating them as claims requiring independent scrutiny.

Court level Judges applying Daubert or similar standards can admit testimony from communities that accept methods they have not scientifically validated.

Interactive Tool

Conviction Review Trigger Checklist

If a conviction contains any of these forensic warning signs, the case deserves a serious integrity review.

Method later repudiated Trigger A forensic discipline was later rejected, limited, or formally criticized by scientific review bodies.

Certainty overstated Trigger The analyst used language stronger than the method’s known limits can support.

No independent audit Trigger The lab or analyst operated without meaningful outside verification or accreditation.

External scandal Trigger The same lab, analyst, or method later appeared in an audit, misconduct finding, or innocence review.

Quick FAQs

What is junk science in criminal courts?

Junk science in criminal courts refers to forensic methods presented as scientifically valid that lack sufficient empirical support, have never been rigorously tested for accuracy, or have been formally repudiated by the scientific community. Examples include bite mark comparison, microscopic hair analysis, and certain blood spatter interpretation practices.

What is the Daubert standard and why does it matter?

Daubert v. Merrell Dow Pharmaceuticals (1993) established the federal standard for admissibility of expert scientific testimony, requiring that methods be tested, peer-reviewed, have known error rates, and be generally accepted in the relevant scientific community. Critics argue that courts apply the standard inconsistently, allowing forensic disciplines with contested foundations to continue producing convictions.

Who was Fred Zain and why does he matter?

Fred Zain was a West Virginia State Police serologist whose fabricated lab results contributed to numerous wrongful convictions across two states over more than a decade. His misconduct was not discovered through internal oversight but through post-conviction DNA testing. His case is a structural example of how lab fraud persists when there are no meaningful quality controls or independent verification mechanisms.

What did the 2009 NAS report find about forensic science?

The National Academy of Sciences found that most forensic disciplines outside of nuclear DNA analysis lacked the scientific foundation to support the certainty with which they were presented in court. The report called for mandatory accreditation, independent oversight, and standardized research protocols. Many of its recommendations remain unimplemented.

What reforms would actually address the junk science problem?

Meaningful reform would require mandatory accreditation of all forensic labs independent of law enforcement, defined and published error rates for every method used in testimony, mandatory disclosure of method limitations to defendants, and a conviction review mechanism that does not require defendants to prove actual innocence before being allowed to challenge the forensic basis of their conviction.

Sources and Further Reading

Primary Report
National Academy of Sciences. Strengthening Forensic Science in the United States: A Path Forward. National Academies Press, 2009.

Federal Review
President’s Council of Advisors on Science and Technology. Forensic Science in Criminal Courts: Ensuring Scientific Validity of Feature-Comparison Methods. Executive Office of the President, 2016.

Hair Analysis
Federal Bureau of Investigation. “FBI Testimony on Microscopic Hair Analysis Contained Errors in at Least 90 Percent of Cases in Ongoing Review.” Press Release, April 20, 2015. fbi.gov.

Lab Fraud
In re Investigation of the West Virginia State Police Crime Laboratory, Serology Division, 438 S.E.2d 501 (W. Va. 1993). (Zain findings.)

Lab Fraud
Commonwealth v. Dookhan, Mass. Superior Court. See also Office of the Inspector General, Commonwealth of Massachusetts. Investigation of the Drug Laboratory at the William A. Hinton State Laboratory Institute. March 4, 2013.

Bite Mark
Innocence Project. “Bite Mark Evidence.” innocenceproject.org. (Case documentation including Ray Krone exoneration.)

Shaken Baby
Findley, Keith A., et al. “Shaken Baby Syndrome, Abusive Head Trauma, and Actual Innocence.” Houston Law Review, vol. 52, no. 2, 2014, pp. 1–82.

Admissibility Standard
Daubert v. Merrell Dow Pharmaceuticals, Inc., 509 U.S. 579 (1993).

Cite This Article

Bluebook: Williams, Rita. When the Science Was Never There: Junk Forensics, Unethical Investigations, and the Institutions That Kept Using Both, Clutch Justice (July 2, 2026), https://clutchjustice.com/2026/07/02/when-the-science-was-never-there/.

APA 7: Williams, R. (2026, July 2). When the science was never there: Junk forensics, unethical investigations, and the institutions that kept using both. Clutch Justice. https://clutchjustice.com/2026/07/02/when-the-science-was-never-there/

MLA 9: Williams, Rita. “When the Science Was Never There: Junk Forensics, Unethical Investigations, and the Institutions That Kept Using Both.” Clutch Justice, 2 July 2026, clutchjustice.com/2026/07/02/when-the-science-was-never-there/.

Chicago: Williams, Rita. “When the Science Was Never There: Junk Forensics, Unethical Investigations, and the Institutions That Kept Using Both.” Clutch Justice, July 2, 2026. https://clutchjustice.com/2026/07/02/when-the-science-was-never-there/.

Continue Your Investigation

If this reporting raised more questions, use the Clutch Justice ecosystem to keep going.

Use The Lab Free tools, databases, explainers, and interactive resources for understanding Michigan courts and accountability systems. Explore the Field Kit Paid guides, templates, systems, and professional resources built from Clutch Justice investigative methodology. Get Future Investigations Download the free evidence checklist and join the Clutch Justice newsletter for accountability reporting and case updates. Support the Work Help fund independent investigative journalism and public accountability reporting.

Categorized in:

Blog,

Last Update: June 30, 2026

When the Science Was Never There: Junk Forensics, Unethical Investigations, and the Institutions That Kept Using Both

The Problem Is Structural, Not Incidental

What Bite Marks Actually Demonstrate

Microscopic Hair Analysis and the FBI’s Correction That Did Not Correct Much

Fred Zain, Annie Dookhan, and the Lab Fraud Architecture

Shaken Baby Syndrome and the Medical Diagnosis as Legal Proof

Why Courts Keep Admitting What Scientists Have Rejected

What Reform Would Actually Require