What errors do peer reviewers detect, and does training improve their ability to detect them?

J R Soc Med. 2008 Oct;101(10):507-14. doi: 10.1258/jrsm.2008.080062.

Abstract

Objective: To analyse data from a trial and report the frequencies with which major and minor errors are detected at a general medical journal, the types of errors missed and the impact of training on error detection.

Design: 607 peer reviewers at the BMJ were randomized to one of two intervention groups receiving different types of training (face-to-face training or a self-taught package) or to a control group. Each reviewer was sent the same three test papers over the study period, each paper containing nine major and five minor methodological errors deliberately inserted.

Setting: BMJ peer reviewers.

Main outcome measures: The quality of review, assessed using a validated instrument, and the number and type of errors detected before and after training.

Results: The number of major errors detected varied over the three papers, and the interventions had only small effects. At baseline (Paper 1) reviewers found an average of 2.58 of the nine major errors, with no notable difference between the groups. The mean number of errors reported was similar for the second and third papers: 2.71 and 3.0, respectively. Biased randomization was the error detected most frequently in all three papers; over 60% of the reviewers who rejected the papers identified this error. Reviewers who did not reject the papers found fewer errors, and fewer than 40% of them identified the biased randomization in each paper.

Conclusions: Editors should not assume that reviewers will detect most major errors, particularly those concerned with the context of the study. Short training packages have only a slight impact on error detection.

Publication types

  • Randomized Controlled Trial
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Education, Professional / methods*
  • Education, Professional / standards
  • Humans
  • Peer Review, Research / standards*
  • Professional Competence / standards*
  • Quality Control
  • Reproducibility of Results