[Skip to Navigation]
Sign In
Figure.  Study Flowchart Identifying Chinese-Sponsored Randomized Clinical Trials With Duplicates
Study Flowchart Identifying Chinese-Sponsored Randomized Clinical Trials With Duplicates

ChiCTR indicates Chinese Clinical Trial Registry; CT.gov, ClinicalTrials.gov; DCTRP, Drug Clinical Trial Registry Platform.

Table 1.  Types of Duplicatesa
Types of Duplicatesa
Table 2.  Factors Associated With Duplicate Publication Bias
Factors Associated With Duplicate Publication Bias
1.
Lundberg  GD.  Statement by the International Committee of Medical Journal Editors on duplicate or redundant publication.   JAMA 1993; 270(20): 2495. doi:10.1001/jama.1993.03510200101040Google ScholarCrossref
2.
Huston  P, Moher  D.  Redundancy, disaggregation, and the integrity of medical research.   Lancet. 1996;347(9007):1024-1026. doi:10.1016/S0140-6736(96)90153-1 PubMedGoogle ScholarCrossref
3.
Tramèr  MR, Reynolds  DJM, Moore  RA, McQuay  HJ.  Impact of covert duplicate publication on meta-analysis: a case study.   BMJ. 1997;315(7109):635-640. doi:10.1136/bmj.315.7109.635 PubMedGoogle ScholarCrossref
4.
Fairfield  CJ, Harrison  EM, Wigmore  SJ.  Duplicate publication bias weakens the validity of meta-analysis of immunosuppression after transplantation.   World J Gastroenterol. 2017;23(39):7198-7200. doi:10.3748/wjg.v23.i39.7198 PubMedGoogle ScholarCrossref
5.
Gwilym  SE, Swan  MC, Giele  H.  One in 13 ‘original’ articles in the Journal of Bone and Joint Surgery are duplicate or fragmented publications.   J Bone Joint Surg Br. 2004;86(5):743-745. doi:10.1302/0301-620X.86B5.14725 PubMedGoogle ScholarCrossref
6.
Rosenthal  EL, Masdon  JL, Buckman  C, Hawn  M.  Duplicate publications in the otolaryngology literature.   Laryngoscope. 2003;113(5):772-774. doi:10.1097/00005537-200305000-00002 PubMedGoogle ScholarCrossref
7.
Bailey  BJ.  Duplicate publication in the field of otolaryngology-head and neck surgery.   Otolaryngol Head Neck Surg. 2002;126(3):211-216. doi:10.1067/mhn.2002.122698 PubMedGoogle ScholarCrossref
8.
Cheung  VWF, Lam  GO, Wang  YF, Chadha  NK.  Current incidence of duplicate publication in otolaryngology.   Laryngoscope. 2014;124(3):655-658. doi:10.1002/lary.24294 PubMedGoogle ScholarCrossref
9.
Schein  M, Paladugu  R.  Redundant surgical publications: tip of the iceberg?   Surgery. 2001;129(6):655-661. doi:10.1067/msy.2001.114549 PubMedGoogle ScholarCrossref
10.
Durani  P.  Duplicate publications: redundancy in plastic surgery literature.   J Plast Reconstr Aesthet Surg. 2006;59(9):975-977. doi:10.1016/j.bjps.2005.11.039 PubMedGoogle ScholarCrossref
11.
von Elm  E, Poglia  G, Walder  B, Tramèr  MR.  Different patterns of duplicate publication: an analysis of articles used in systematic reviews.   JAMA. 2004;291(8):974-980. doi:10.1001/jama.291.8.974 PubMedGoogle ScholarCrossref
12.
Viergever  RF, Li  K.  Trends in global clinical trial registration: an analysis of numbers of registered clinical trials in different parts of the world from 2004 to 2013.   BMJ Open. 2015;5(9):e008932. doi:10.1136/bmjopen-2015-008932 PubMedGoogle Scholar
13.
National Science Board. Science and engineering indicators 2018. NSB-2018-1. National Science Foundation. Published 2018. Accessed November 2, 2020. https://www.nsf.gov/statistics/indicators/
14.
Jia  Y, Liang  J, Wen  J,  et al.  Drug Randomized Controlled Trials in China, 1999-2018: A Scoping Review. Onging; 2020.
15.
Galandi  D, Schwarzer  G, Antes  G.  The demise of the randomised controlled trial: bibliometric study of the German-language health care literature, 1948 to 2004.   BMC Med Res Methodol. 2006;6(1):30. doi:10.1186/1471-2288-6-30 PubMedGoogle ScholarCrossref
16.
Elsevier. List of journal titles in Embase. Published 2020. Accessed May 21, 2020. https://www.elsevier.com/solutions/embase-biomedical-research/embase-coverage-and-content
17.
National Library of Medicine. List of all journals cited in PubMed. 2020. Accessed May 23, 2020. https://www.nlm.nih.gov/bsd/serfile_addedinfo.html
18.
Higgins  JP, Thomas  J, Chandler  J,  et al, eds.  Cochrane Handbook for Systematic Reviews of Interventions. John Wiley & Sons; 2019.
19.
von Elm  E, Altman  DG, Egger  M, Pocock  SJ, Gøtzsche  PC, Vandenbroucke  JP; STROBE Initiative.  The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies.   Ann Intern Med. 2007;147(8):573-577. doi:10.7326/0003-4819-147-8-200710160-00010 PubMedGoogle ScholarCrossref
20.
Jia  Y, Huang  D, Wen  J,  et al.  Assessment of language and indexing biases among Chinese-sponsored randomized clinical trials.   JAMA Netw Open. 2020;3(5):e205894. doi:10.1001/jamanetworkopen.2020.5894 PubMedGoogle Scholar
21.
United States Food and Drug Administration. Drugs@FDA: FDA approved drug products. Accessed April 21, 2020. https://www.accessdata.fda.gov/scripts/cder/daf/
22.
European Medicine Agency. Search medicines. Accessed April 21, 2020. https://www.ema.europa.eu/en/medicines
23.
Hartung  DM, Zarin  DA, Guise  J-M, McDonagh  M, Paynter  R, Helfand  M.  Reporting discrepancies between the ClinicalTrials.gov results database and peer-reviewed publications.   Ann Intern Med. 2014;160(7):477-483. doi:10.7326/M13-0480 PubMedGoogle ScholarCrossref
24.
Ross  JS, Tse  T, Zarin  DA, Xu  H, Zhou  L, Krumholz  HM.  Publication of NIH funded trials registered in ClinicalTrials.gov: cross sectional analysis.   BMJ. 2012;344:d7292. doi:10.1136/bmj.d7292 PubMedGoogle ScholarCrossref
25.
Koffel  JB.  Use of recommended search strategies in systematic reviews and the impact of librarian involvement: a cross-sectional survey of recent authors.   PLoS One. 2015;10(5):e0125931. doi:10.1371/journal.pone.0125931 PubMedGoogle Scholar
26.
Xia  J, Wright  J, Adams  CE.  Five large Chinese biomedical bibliographic databases: accessibility and coverage.   Health Info Libr J. 2008;25(1):55-61. doi:10.1111/j.1471-1842.2007.00734.x PubMedGoogle ScholarCrossref
27.
Saldanha  IJ, Dickersin  K, Wang  X, Li  T.  Outcomes in Cochrane systematic reviews addressing four common eye conditions: an evaluation of completeness and comparability.   PLoS One. 2014;9(10):e109400. doi:10.1371/journal.pone.0109400 PubMedGoogle Scholar
28.
Saldanha  IJ, Scherer  RW, Rodriguez-Barraquer  I, Jampel  HD, Dickersin  K.  Dependability of results in conference abstracts of randomized controlled trials in ophthalmology and author financial conflicts of interest as a factor associated with full publication.   Trials. 2016;17(1):213. doi:10.1186/s13063-016-1343-z PubMedGoogle ScholarCrossref
29.
Easterbrook  PJ, Berlin  JA, Gopalan  R, Matthews  DR.  Publication bias in clinical research.   Lancet. 1991;337(8746):867-872. doi:10.1016/0140-6736(91)90201-Y PubMedGoogle ScholarCrossref
30.
Dickersin  K, Min  Y-I, Meinert  CL.  Factors influencing publication of research results: follow-up of applications submitted to two institutional review boards.   JAMA. 1992;267(3):374-378. doi:10.1001/jama.1992.03480030052036 PubMedGoogle ScholarCrossref
31.
Decullier  E, Lhéritier  V, Chapuis  F.  Fate of biomedical research protocols and publication bias in France: retrospective cohort study.   BMJ. 2005;331(7507):19. doi:10.1136/bmj.38488.385995.8F PubMedGoogle ScholarCrossref
32.
Page  MJ, Shamseer  L, Altman  DG,  et al.  Epidemiology and reporting characteristics of systematic reviews of biomedical research: a cross-sectional study.   PLoS Med. 2016;13(5):e1002028. doi:10.1371/journal.pmed.1002028 PubMedGoogle Scholar
33.
Rosenthal  R, Dwan  K.  Comparison of randomized controlled trial registry entries and content of reports in surgery journals.   Ann Surg. 2013;257(6):1007-1015. doi:10.1097/SLA.0b013e318283cf7f PubMedGoogle ScholarCrossref
34.
Viergever  RF, Karam  G, Reis  A, Ghersi  D.  The quality of registration of clinical trials: still a problem.   PLoS One. 2014;9(1):e84727. doi:10.1371/journal.pone.0084727 PubMedGoogle Scholar
35.
Dwan  K, Altman  DG, Cresswell  L, Blundell  M, Gamble  CL, Williamson  PR.  Comparison of protocols and registry entries to published reports for randomised controlled trials.   Cochrane Database Syst Rev. 2011;(1):MR000031. doi:10.1002/14651858.MR000031.pub2PubMedGoogle Scholar
36.
Mao  L, Zhang  P, Ding  J,  et al.  Quality assessment of randomized controlled trial report in the field of breast cancer published in Chinese.   Chinese J Gen Pract. 2016;19(05):615-620. doi:10.3969/j.issn.1007-9572.2016.05.029 Google Scholar
Original Investigation
Medical Journals and Publishing
December 3, 2020

Assessment of Duplicate Publication of Chinese-Sponsored Randomized Clinical Trials

Author Affiliations
  • 1Bloomberg School of Public Health, The Johns Hopkins University, Baltimore, Maryland
  • 2Welch Medical Library, The Johns Hopkins University, Baltimore, Maryland
  • 3Institute of Medical Information and Medical Library, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
  • 4School of Medicine, The Johns Hopkins University, Baltimore, Maryland
  • 5Michigan Medicine, School of Public Health, University of Michigan, Ann Arbor
JAMA Netw Open. 2020;3(12):e2027104. doi:10.1001/jamanetworkopen.2020.27104
Key Points

Question  Are publication language (English and Chinese) and trial results associated with duplicate publication of randomized clinical trials conducted in mainland China?

Findings  This cohort study included 470 Chinese-sponsored randomized clinical trials identified from clinical trial registries and published between 2008 and 2019. Approximately 12% of those randomized clinical trials have at least 1 duplicate publication; most are published in a different language than the main article. When main articles are published in Chinese, those with positive findings are more likely to have subsequent duplicate publication than those with negative findings.

Meaning  Results of this study suggest that publishing randomized clinical trials in different languages may impede identification of duplicates and be associated with duplicate publication bias.

Abstract

Importance  Duplicate publications of randomized clinical trials are prevalent in the health-related literature. To date, few studies have assessed the interaction between duplicate publication and the language of the original publication.

Objective  To assess the existence of duplicate publication and the extent to which duplicate publication is associated with the language of the original publication.

Design, Setting, and Participants  In this retrospective cohort study, eligible randomized clinical trials were retrieved from trial registries, and bibliographic databases were searched to determine their publication status. Eligible randomized clinical trials were for drug interventions from January 1, 2008, to December 31, 2014. The search and analysis were conducted from March 1 to August 31, 2019. The trial registries were either primary registries recognized by the World Health Organization or the Drug Clinical Trial Registry Platform sponsored by the China Food and Drug Administration.

Exposures  Individual randomized clinical trials with positive vs negative results.

Main Outcomes and Measures  Journal articles were classified as main articles (determined by largest sample size and longest follow-up among all journal articles derived from that randomized clinical trial) and duplicates. The duplicates were classified into 4 types: (1) unreferenced subgroup analysis (article did not disclose itself as a subgroup analysis or reference its main article); (2) unreferenced republication (article did not disclose itself as a replicate of the main article or reference it); (3) unreferenced interim analysis (article did not disclose itself as an interim analysis or reference its main article); and (4) partial duplicate (article did not disclose its sharing a subset of participants with other articles or reference them).

Results  Among 470 randomized clinical trials published by August 2019 as journal articles, 55 (11.7%) had 75 duplicates, of which 53 (70.7%) were cross-language duplicates. Of the 75 duplicates, 33 (44.0%) were unreferenced republications, 25 (33.3%) unreferenced subgroup analyses, 15 (20.0%) unreferenced interim analyses, and 2 (2.7%) partial duplicates. When the main article of a randomized clinical trial was published in Chinese, those with positive findings were 2.48 (95% CI, 1.08-5.71) times more likely to have subsequent duplicate publication than those with negative findings.

Conclusions and Relevance  In this study, most duplicates were cross-language duplicates and the most common type was unreferenced republication of the main article. Duplicate publication bias exists when the main articles of randomized clinical trials were published in Chinese, potentially misleading readers and compromising journals and evidence synthesis.

Introduction

A duplicate publication (referred to as a duplicate) “overlaps substantially with one already published, without clear, visible reference to the previous publication.”1 Duplicates waste resources, breach copyright, undermine the integrity of research, and distort evidence if included in meta-analyses and systematic reviews.2-4

When disseminating substantial information, secondary publications across languages can be justified or even recommended to maximize audience and potential reach.1 Such secondary publications should appropriately disclose the relationship to the main publication to avoid being treated as an independent study.1

Duplicates are prevalent in the health-related literature.5-10 The challenge remains, however, for readers, journal editors, and meta-analysts to detect duplicates, especially when the authorship, design, or results of duplicates deviate from those of the main publication.11 Moreover, duplicate status may be concealed by language barriers that increase the challenge of detection. For example, English-speaking peer reviewers and editors may be unlikely to detect a duplicate of a non-English original publication.

To our knowledge, few studies have assessed the interaction between duplicates and the language of publication. How often cross-language duplicates occur or the role of language in the production of duplicates and subsequent biasing of evidence syntheses is unclear. We selected randomized clinical trials (RCTs) from mainland China as an example for our study for several reasons: first, the biomedical research community is witnessing a proliferation of RCTs from mainland China12,13; second, most Chinese literature is not indexed in English bibliographic databases and a shift toward publishing RCTs from mainland China in English has not been observed14,15; third, the coverage of the Chinese literature by English bibliographic databases has been gradually increasing16,17; and fourth, the Chinese literature and bibliographic databases have been increasingly recognized by the evidence synthesis community.18 The aims of this study were to estimate prevalence and detect types of duplicates among RCTs from mainland China, and to evaluate the existence of duplicate publication bias with a focus on the interaction between duplication and language of publication.

Methods

In this cohort study, we retrieved eligible RCTs from trial registries, identified the corresponding journal articles (including duplicates) from bibliographic databases, and evaluated duplicate types and duplicate publication bias. This study followed the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guideline for cohort studies.19 The study was not subject to institutional review board approval because all the data were open source and no participants were involved (per US Department of Health and Human Services regulation under the 45 CFR 46 Common Rule).

Identifying RCTs From Trial Registries and Databases

Our study sample comprised RCTs sponsored by organizations located in mainland China. The process to retrieve registry records from trial registries and the corresponding journal articles from bibliographic databases has been described previously.20 Briefly, we considered RCTs to be eligible if they evaluated the efficacy and/or safety of drugs and were conducted between January 1, 2008, and December 31, 2014. We considered a substance to be a drug if recognized and regulated by the US Food and Drug Administration and/or the European Medicine Agency.21,22 We excluded bioequivalence studies, pharmacokinetics studies, and RCTs with missing information on eligibility of participants, study period, experimental drugs, principal investigator, or sponsor.

We included only journal articles originating from eligible RCTs. The search strategy and terms updated from previous studies,23-25 list of bibliographic databases,26 and criteria to match journal articles with registry records are presented in the eMethods, eTable 1, eTable 2, and eTable 3 in the Supplement. An example of literature search can be found in the eAppendix in the Supplement. Two of us (Y.J. and D.H.) independently searched bibliographic databases and identified the corresponding journal articles of eligible RCTs. Discrepancies were discussed and resolved with a third author (J.W.).

Identifying the Main Article and Duplicates

We defined the main article for an RCT as the study with the largest sample size and the longest follow-up among all journal articles derived from that RCT.11 If multiple articles existed with the same sample size and follow-up period, we considered whichever was submitted earliest to an academic journal as the main article. We operationally defined a duplicate as an article that had at least 1 identical outcome but did not reference its relationship to the main article.

In this study, we focused on duplicates that might distort evidence; therefore, although article elements such as background, methods, or discussion were used in the identification of duplicates, we only considered duplicate outcomes for analyses. We categorized outcomes into 5 elements: domain (eg, systolic blood pressure), specific measurement (eg, a device at sitting position), specific metric (eg, change from baseline), the method of aggregation (eg, mean/median), and time point.27 An outcome was considered duplicated if it had identical domain, specific measurement, and time point sets with an outcome in the main article, regardless of the specific measure or the method of aggregation. For example, we considered an outcome measuring systolic blood pressure a duplicate if it reported the mean systolic blood pressure while the main article reported the median, as long as the method of measurement and time point were consistent.

Duplicate Types

We classified duplicates into 4 types based on the progress of recruitment and follow-up of the RCTs: (1) unreferenced subgroup analyses; (2) unreferenced republications; (3) unreferenced interim analyses; and (4) partial duplicates. Unreferenced subgroup analysis was defined as an article that did not disclose itself as a subgroup analysis or reference its main article. Unreferenced subgroup analyses are published following the main articles. We defined 4 subtypes: (1) subgroup of recruiting centers, in which only participants from a subset of recruiting centers are included; (2) subgroup of treatment groups, in which only participants from a subset of treatment groups are included; (3) subgroup of participants' characteristics, in which only participants with specific characteristics are included, for instance, the main article was on heart failure while an unreferenced subgroup analysis was on heart failure and diabetes; and (4) subgroup of recruiting centers and participants' characteristics.

Unreferenced republication was defined as an article that did not disclose itself as a replicate of the main article or reference the main article. A replicate means the duplicate reports identical participants and follow-up period as the main article. Unreferenced republications are published following the main articles.

Unreferenced interim analysis was defined as an article that did not disclose itself as an interim analysis or was not referenced by the subsequently published main article. We defined 3 subtypes: (1) interim report of recruitment, which reports an interim analysis on a subset of participants before recruitment is complete, usually the participants recruited in the early stage; (2) interim report of follow-up, which reports an interim analysis on all the participants but before the follow-up is complete, usually on short-term outcomes; and (3) interim report of both recruitment and follow-up, ie, an interim analysis on a subset of participants before recruitment and follow-up are complete.

A partial duplicate was an article that contained a portion of unique participants while sharing a portion of participants, eg, a subset of recruiting centers, with a main article that is not referenced. Partial duplicates were published after the main articles.

We also classified duplicates into cross-language duplicates and same-language duplicates. Cross-language duplicates referred to those published in a different language than the main articles, ie, the main article was published in Chinese and the duplicate in English, or vice versa. Same-language duplicates referred to those published in the same language as the main articles, ie, the main article and the duplicate were both published in Chinese or English. We hypothesized that the main articles with positive findings were more likely to have subsequent duplicates than those with negative findings.

Statistical Analysis
Exposure

The exposure was the finding of a main article classified as positive or negative according to its primary outcome. If multiple primary outcomes were reported, the first one reported in the results section was selected. If no primary outcome was defined, we selected the primary outcome based on the following hierarchical order: (1) the first outcome used in the sample size calculation, (2) the first outcome defined in the study objectives, or (3) the first outcome reported in the results section.28 When the primary outcome was measured at multiple time points, we considered the last time point in the analysis.

We defined positive findings as those favoring the study hypothesis with statistical significance. For example, a superiority trial was positive when the experimental drug was significantly superior to the comparator. We defined negative findings as those not statistically significant or that did not support the study hypothesis.

Outcome and Effect Modifier

A main article had duplicates when at least 1 unreferenced subgroup analysis, 1 unreferenced republication, or 1 partial duplicate was identified from the same RCT. We hypothesized that the language of the main article was a possible effect modifier, ie, the association between the study findings and having subsequent duplicates might vary in the language of the main article.

Measurements of Associations

Duplicate publication bias was measured by relative risk (RR). An RR larger than 1.00 indicated that the main articles with positive findings were more likely to have subsequent duplicates than those with negative findings. The RRs were estimated using log binomial models with 4 covariates: language of the main article (Chinese vs English), sample size (<100 vs ≥100 participants), funding source (industry vs nonindustry), and number of recruiting centers (single vs multiple).29-31 An RCT was defined as funded by industry if at least 1 funder was from industry. An interaction term was included in the model to account for the language of the main article as an effect modifier. Statistical significance was defined as a 2-sided P < .05 for the main effect and P = .10 for the interaction. Data management and analysis were performed with SAS version 9.4 (SAS Institute Inc).

Results

The search through trial registries and bibliographic databases was conducted from March 1 to August 31, 2019. A total of 470 of 891 (52.8%) eligible RCTs identified from trial registries had been published as journal articles. Fifty-five (11.7%) RCTs were found to have 75 duplicates, of which 48 (64.0%) were published in Chinese (Figure). None of the 27 English duplicates were labeled as duplicate publications by PubMed. Ten (18.2%) RCTs had multiple duplicates (range, 2 duplicates to 6 duplicates).

Fifty-three (70.7%) duplicates crossed languages, including 20 (26.7%) published in English (with the main article in Chinese) and 33 (44.0%) published in Chinese (with the main article in English). Twenty-two (29.3%) duplicates were published in the same language as their corresponding main articles; of these, 15 (20.0%) were published in Chinese and 7 (9.3%) in English (Table 1).

The characteristics of RCTs having duplicates were generally similar to RCTs without duplicates. Among 55 RCTs having duplicates, 10 (18.8%) were sponsored by industry (as opposed to nonindustry), 25 (47.2%) recruited fewer than 100 participants, 13 (24.5%) were conducted at 1 recruiting center only, and 38 (71.7%) reported positive findings. Among 415 RCTs without duplicates, 83 (19.9%) were sponsored by industry, 162 (38.9%) recruited fewer than 100 participants, and 135 (32.4%) were conducted at 1 recruiting center. Two RCTs were excluded from the analysis because their exposure status could not be determined (no between-group comparison was reported). In the remaining 413 RCTs without duplicates, 292 (70.4%) reported positive findings.

Duplicate Types

The most prevalent duplicate type was unreferenced republication of the main article (Table 1). Thirty-three (44.0%) of 75 duplicates fit this type, including all 4044 participants reported in the main articles. Of these, 25 (75.8%) duplicates crossed languages, including 15 (45.5%) published in English (with the main article in Chinese) and 10 (30.3%) in Chinese (with the main article in English).

Twenty-five (33.3%) duplicates were unreferenced subgroup analyses, of which 7 (28.0%) were subgroups of centers, 4 (16.0%) were subgroups of treatment groups, 4 (16.0%) were subgroups of participants' characteristics, 2 (8.0%) were subgroups of both centers and participants' characteristics, and 8 (32%) were unclear owing to lack of sufficient information. The 25 unreferenced subgroup analyses included 2193 (66.6%) of 3295 participants reported in the main articles. Fifteen (60.0%) duplicates crossed languages, including 11 (44.0%) published in Chinese (with the main article in English) and 4 (16.0%) in English (with the main article in Chinese).

Fifteen (20.0%) of 75 duplicates were unreferenced interim analyses, of which 12 (80.0%) reported early-stage recruitment, 2 (13.3%) reported early-stage follow-up, and 1 (6.7%) reported a mixture of early-stage recruitment and follow-up. The 15 unreferenced interim analyses included 1125 of 2468 participants (45.6%) reported in the main articles. Twelve duplicates (80.0%) crossed languages, all of which were in Chinese (with the main articles in English).

There were 2 partial duplicates (2.7%), of which 1 crossed languages (the main article in Chinese) and the other was the same language as the main article (in Chinese).

Duplicate Publication Bias

We did not find evidence supporting the language of the main article as an effect modifier (χ2 = 1.60, P = .21). After adjusting for covariates, when published in Chinese, the main articles with positive findings were 2.48 (95% CI, 1.08-5.71) times more likely to have subsequent duplicates than RCTs with negative findings. There was no evidence supporting a similar bias when the main articles were published in English (RR, 0.99; 95% CI, 0.31-3.13). The main articles published in Chinese were 8.03 (95% CI, 3.91-16.46) times more likely to have subsequent duplicates than those published in English. None of other covariates reached statistical significance (Table 2).

Discussion

Findings of this cohort study suggest that duplicates continue to populate the medical literature, especially studies published in more than 1 language that are challenging to detect. Whether the occurrence of duplicates owes to a lack of awareness on behalf of trial investigators (some investigators may produce duplicates unaware of the inappropriateness) or to deliberate production is unclear.

Aside from research integrity and legal considerations, the presence of duplicates carries the potential to distort evidence when an article may be counted more than once in systematic reviews.3,4 Currently there is a gap between English and Chinese literature: most Chinese RCTs are published in Chinese and only indexed in Chinese bibliographic databases, and very few Chinese journals are covered by major English bibliographic databases.14,16 At the same time, English-language systematic reviews, which should bridge the language gap, rarely search Chinese bibliographic databases and subsequently do not include RCTs published in Chinese.32 Under these circumstances, the cross-language duplicates raise serious concerns, especially when the main articles are published in Chinese and the duplicates are in English, because English duplicates may selectively report subsets of participants from the main articles with positive findings or larger effect sizes. Although this type was only present in 5% of all duplicates in our sample, reviewers should be vigilant considering the sheer quantity of RCTs being conducted in China.

In contrast, simply excluding the Chinese literature is inappropriate as it may lead to other negative consequences such as missing relevant RCTs.14 Moreover, language bias has been found among RCTs from mainland China; excluding the Chinese literature may increase the risk of overestimating treatment effects, as RCTs with positive findings are more likely to be published in English than those with negative findings.20

The coverage of Chinese journals by major English bibliographic databases has been increasing gradually while increasing numbers of systematic reviews have recently begun to include Chinese bibliographic databases to reduce bias.18 In theory, expansion of the overlap between English and Chinese literature may help increase the possibility of unintentional inclusion of duplicates in systematic reviews. While an unreferenced republication may be detected by a meticulous reviewer or editor based on identical study participants and results as the main article, unreferenced subgroup analyses and unreferenced interim analyses can be much more challenging to detect owing to different participants or results from the main articles.

We found that the RCTs in which the main articles were published in Chinese were more likely to have subsequent duplicates than those in which the main articles were published in English, a phenomenon that might be explained by the desire to publish research in English journals with higher impact factors and reputation. Although the statistical test on the effect modification by the language of the main article was not statistically significant, duplicate publication bias only existed when the main articles were published in Chinese. This finding suggests that the language of publication may have the potential to modify the duplicate publication bias; our limited sample size is likely a factor associated with the negative result of the statistical test.

In our study, we could not determine the duplication status of some articles owing to a lack of key information or inconsistencies between trial registries and published journal articles.33-35 Improving the data quality of trial registries and the reporting quality of journal articles could enable reviewers to identify duplicates more efficiently.

Limitations

This study has limitations. First, our sample size was limited and a larger sample of RCTs would enable us to better quantify the association between language and bias. Second, we applied strict rules with high specificity to identify duplicates at the cost of sensitivity. Consequently, we may have missed some duplicates, especially those that are substantially inconsistent with the main articles or lacking information for classification. Therefore, the actual prevalence of duplicates may be higher than our estimate and there may be duplicate types that we did not identify. Third, because less than 15% of RCTs from mainland China were registered,36 RCTs in trial registries may not be representative of all RCTs from China. Our conclusions may not be generalizable to other settings. Fourth, although we found evidence supporting the existence of duplicate publication bias, further study is warranted to determine the extent to which duplicates, especially cross-language duplicates, may affect the findings of individual systematic reviews. In addition, we selected only RCTs from mainland China as our study sample. The prevalence, types, and biases regarding duplicates may vary in other languages. Nonetheless, we believe our study may help inform future research on RCTs in other languages.

Conclusions

Most duplicate publications of RCTs conducted in mainland China in this study were cross-language duplicates and unreferenced republications of the main articles. Duplicate publication bias exists when the main articles were published in Chinese, potentially misleading readers and compromising journals and evidence synthesis.

Back to top
Article Information

Accepted for Publication: September 30, 2020.

Published: December 3, 2020. doi:10.1001/jamanetworkopen.2020.27104

Open Access: This is an open access article distributed under the terms of the CC-BY License. © 2020 Jia Y et al. JAMA Network Open.

Corresponding Author: David D. Celentano, MHS, ScD, Bloomberg School of Public Health, The Johns Hopkins University, 615 N Wolfe St, Room W6041, Baltimore, MD 21205 (dcelent1@jhu.edu).

Author Contributions: Dr Celentano had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis. Drs Ehrhardt and Celentano contributed equally to this study.

Concept and design: Jia, Wen, Wang, Rosman, Robinson, Gagnier, Ehrhardt, Celentano.

Acquisition, analysis, or interpretation of data: Jia, Huang, Wen, Qureshi, Wang, Chen, Robinson, Gagnier, Ehrhardt, Celentano.

Drafting of the manuscript: Jia, Wen, Gagnier, Ehrhardt.

Critical revision of the manuscript for important intellectual content: All authors.

Statistical analysis: Jia, Wen, Chen, Gagnier, Ehrhardt.

Obtained funding: Jia.

Administrative, technical, or material support: Jia, Wang, Rosman, Celentano.

Supervision: Gagnier, Ehrhardt, Celentano.

Conflict of Interest Disclosures: None reported.

Funding/Support: This study was supported by the Doctoral Thesis Research Fund at the Department of Epidemiology, Bloomberg School of Public Health, The Johns Hopkins University (Jia) and the China Scholarship Council (Jia).

Role of the Funder/Sponsor: The funding organizations had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.

Additional Contributions: Kevin Fain, JD, DrPH, at the United States Food and Drug Administration provided comments that greatly improved the structure and content of the manuscript. Dr Fain did not receive compensation for his contribution.

References
1.
Lundberg  GD.  Statement by the International Committee of Medical Journal Editors on duplicate or redundant publication.   JAMA 1993; 270(20): 2495. doi:10.1001/jama.1993.03510200101040Google ScholarCrossref
2.
Huston  P, Moher  D.  Redundancy, disaggregation, and the integrity of medical research.   Lancet. 1996;347(9007):1024-1026. doi:10.1016/S0140-6736(96)90153-1 PubMedGoogle ScholarCrossref
3.
Tramèr  MR, Reynolds  DJM, Moore  RA, McQuay  HJ.  Impact of covert duplicate publication on meta-analysis: a case study.   BMJ. 1997;315(7109):635-640. doi:10.1136/bmj.315.7109.635 PubMedGoogle ScholarCrossref
4.
Fairfield  CJ, Harrison  EM, Wigmore  SJ.  Duplicate publication bias weakens the validity of meta-analysis of immunosuppression after transplantation.   World J Gastroenterol. 2017;23(39):7198-7200. doi:10.3748/wjg.v23.i39.7198 PubMedGoogle ScholarCrossref
5.
Gwilym  SE, Swan  MC, Giele  H.  One in 13 ‘original’ articles in the Journal of Bone and Joint Surgery are duplicate or fragmented publications.   J Bone Joint Surg Br. 2004;86(5):743-745. doi:10.1302/0301-620X.86B5.14725 PubMedGoogle ScholarCrossref
6.
Rosenthal  EL, Masdon  JL, Buckman  C, Hawn  M.  Duplicate publications in the otolaryngology literature.   Laryngoscope. 2003;113(5):772-774. doi:10.1097/00005537-200305000-00002 PubMedGoogle ScholarCrossref
7.
Bailey  BJ.  Duplicate publication in the field of otolaryngology-head and neck surgery.   Otolaryngol Head Neck Surg. 2002;126(3):211-216. doi:10.1067/mhn.2002.122698 PubMedGoogle ScholarCrossref
8.
Cheung  VWF, Lam  GO, Wang  YF, Chadha  NK.  Current incidence of duplicate publication in otolaryngology.   Laryngoscope. 2014;124(3):655-658. doi:10.1002/lary.24294 PubMedGoogle ScholarCrossref
9.
Schein  M, Paladugu  R.  Redundant surgical publications: tip of the iceberg?   Surgery. 2001;129(6):655-661. doi:10.1067/msy.2001.114549 PubMedGoogle ScholarCrossref
10.
Durani  P.  Duplicate publications: redundancy in plastic surgery literature.   J Plast Reconstr Aesthet Surg. 2006;59(9):975-977. doi:10.1016/j.bjps.2005.11.039 PubMedGoogle ScholarCrossref
11.
von Elm  E, Poglia  G, Walder  B, Tramèr  MR.  Different patterns of duplicate publication: an analysis of articles used in systematic reviews.   JAMA. 2004;291(8):974-980. doi:10.1001/jama.291.8.974 PubMedGoogle ScholarCrossref
12.
Viergever  RF, Li  K.  Trends in global clinical trial registration: an analysis of numbers of registered clinical trials in different parts of the world from 2004 to 2013.   BMJ Open. 2015;5(9):e008932. doi:10.1136/bmjopen-2015-008932 PubMedGoogle Scholar
13.
National Science Board. Science and engineering indicators 2018. NSB-2018-1. National Science Foundation. Published 2018. Accessed November 2, 2020. https://www.nsf.gov/statistics/indicators/
14.
Jia  Y, Liang  J, Wen  J,  et al.  Drug Randomized Controlled Trials in China, 1999-2018: A Scoping Review. Onging; 2020.
15.
Galandi  D, Schwarzer  G, Antes  G.  The demise of the randomised controlled trial: bibliometric study of the German-language health care literature, 1948 to 2004.   BMC Med Res Methodol. 2006;6(1):30. doi:10.1186/1471-2288-6-30 PubMedGoogle ScholarCrossref
16.
Elsevier. List of journal titles in Embase. Published 2020. Accessed May 21, 2020. https://www.elsevier.com/solutions/embase-biomedical-research/embase-coverage-and-content
17.
National Library of Medicine. List of all journals cited in PubMed. 2020. Accessed May 23, 2020. https://www.nlm.nih.gov/bsd/serfile_addedinfo.html
18.
Higgins  JP, Thomas  J, Chandler  J,  et al, eds.  Cochrane Handbook for Systematic Reviews of Interventions. John Wiley & Sons; 2019.
19.
von Elm  E, Altman  DG, Egger  M, Pocock  SJ, Gøtzsche  PC, Vandenbroucke  JP; STROBE Initiative.  The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies.   Ann Intern Med. 2007;147(8):573-577. doi:10.7326/0003-4819-147-8-200710160-00010 PubMedGoogle ScholarCrossref
20.
Jia  Y, Huang  D, Wen  J,  et al.  Assessment of language and indexing biases among Chinese-sponsored randomized clinical trials.   JAMA Netw Open. 2020;3(5):e205894. doi:10.1001/jamanetworkopen.2020.5894 PubMedGoogle Scholar
21.
United States Food and Drug Administration. Drugs@FDA: FDA approved drug products. Accessed April 21, 2020. https://www.accessdata.fda.gov/scripts/cder/daf/
22.
European Medicine Agency. Search medicines. Accessed April 21, 2020. https://www.ema.europa.eu/en/medicines
23.
Hartung  DM, Zarin  DA, Guise  J-M, McDonagh  M, Paynter  R, Helfand  M.  Reporting discrepancies between the ClinicalTrials.gov results database and peer-reviewed publications.   Ann Intern Med. 2014;160(7):477-483. doi:10.7326/M13-0480 PubMedGoogle ScholarCrossref
24.
Ross  JS, Tse  T, Zarin  DA, Xu  H, Zhou  L, Krumholz  HM.  Publication of NIH funded trials registered in ClinicalTrials.gov: cross sectional analysis.   BMJ. 2012;344:d7292. doi:10.1136/bmj.d7292 PubMedGoogle ScholarCrossref
25.
Koffel  JB.  Use of recommended search strategies in systematic reviews and the impact of librarian involvement: a cross-sectional survey of recent authors.   PLoS One. 2015;10(5):e0125931. doi:10.1371/journal.pone.0125931 PubMedGoogle Scholar
26.
Xia  J, Wright  J, Adams  CE.  Five large Chinese biomedical bibliographic databases: accessibility and coverage.   Health Info Libr J. 2008;25(1):55-61. doi:10.1111/j.1471-1842.2007.00734.x PubMedGoogle ScholarCrossref
27.
Saldanha  IJ, Dickersin  K, Wang  X, Li  T.  Outcomes in Cochrane systematic reviews addressing four common eye conditions: an evaluation of completeness and comparability.   PLoS One. 2014;9(10):e109400. doi:10.1371/journal.pone.0109400 PubMedGoogle Scholar
28.
Saldanha  IJ, Scherer  RW, Rodriguez-Barraquer  I, Jampel  HD, Dickersin  K.  Dependability of results in conference abstracts of randomized controlled trials in ophthalmology and author financial conflicts of interest as a factor associated with full publication.   Trials. 2016;17(1):213. doi:10.1186/s13063-016-1343-z PubMedGoogle ScholarCrossref
29.
Easterbrook  PJ, Berlin  JA, Gopalan  R, Matthews  DR.  Publication bias in clinical research.   Lancet. 1991;337(8746):867-872. doi:10.1016/0140-6736(91)90201-Y PubMedGoogle ScholarCrossref
30.
Dickersin  K, Min  Y-I, Meinert  CL.  Factors influencing publication of research results: follow-up of applications submitted to two institutional review boards.   JAMA. 1992;267(3):374-378. doi:10.1001/jama.1992.03480030052036 PubMedGoogle ScholarCrossref
31.
Decullier  E, Lhéritier  V, Chapuis  F.  Fate of biomedical research protocols and publication bias in France: retrospective cohort study.   BMJ. 2005;331(7507):19. doi:10.1136/bmj.38488.385995.8F PubMedGoogle ScholarCrossref
32.
Page  MJ, Shamseer  L, Altman  DG,  et al.  Epidemiology and reporting characteristics of systematic reviews of biomedical research: a cross-sectional study.   PLoS Med. 2016;13(5):e1002028. doi:10.1371/journal.pmed.1002028 PubMedGoogle Scholar
33.
Rosenthal  R, Dwan  K.  Comparison of randomized controlled trial registry entries and content of reports in surgery journals.   Ann Surg. 2013;257(6):1007-1015. doi:10.1097/SLA.0b013e318283cf7f PubMedGoogle ScholarCrossref
34.
Viergever  RF, Karam  G, Reis  A, Ghersi  D.  The quality of registration of clinical trials: still a problem.   PLoS One. 2014;9(1):e84727. doi:10.1371/journal.pone.0084727 PubMedGoogle Scholar
35.
Dwan  K, Altman  DG, Cresswell  L, Blundell  M, Gamble  CL, Williamson  PR.  Comparison of protocols and registry entries to published reports for randomised controlled trials.   Cochrane Database Syst Rev. 2011;(1):MR000031. doi:10.1002/14651858.MR000031.pub2PubMedGoogle Scholar
36.
Mao  L, Zhang  P, Ding  J,  et al.  Quality assessment of randomized controlled trial report in the field of breast cancer published in Chinese.   Chinese J Gen Pract. 2016;19(05):615-620. doi:10.3969/j.issn.1007-9572.2016.05.029 Google Scholar
×