Incomplete indicates terminated, suspended, or withdrawn, and active indicates recruiting, not yet recruiting, active, or enrolling by invitation.
eTable 1. Selected Terms and Definitions Used on ClinicalTrials.gov
eTable 2. Number of Trials Started by Year Between 2000 and 2019
eTable 3. Predictors of Sample Size for Completed Trials in Multivariable Regression by Phase
eTable 4. Anticipated and Actual Sample Sizes for Trials Started and Completed Between 2010 Through 2019 by Lead Sponsor and Phase
eFigure 1. Time to Study Completion by Lead Sponsor
eFigure 2. Median Time (Years) to Trial Completion by Lead Sponsor and Phase
eFigure 3. Anticipated vs Actual Sample Size of Trials Started and Completed Between 2010 Through 2019 by Lead Sponsor and Year
eAppendix. Example postgreSQL Code to Generate CT.gov Registration Dataset Used for Analysis
Customize your JAMA Network experience by selecting one or more topics from the list below.
Identify all potential conflicts of interest that might be relevant to your comment.
Conflicts of interest comprise financial interests, activities, and relationships within the past 3 years including but not limited to employment, affiliation, grants or funding, consultancies, honoraria or payment, speaker's bureaus, stock ownership or options, expert testimony, royalties, donation of medical equipment, or patents planned, pending, or issued.
Err on the side of full disclosure.
If you have no conflicts of interest, check "No potential conflicts of interest" in the box below. The information will be posted with your response.
Not all submitted comments are published. Please see our commenting policy for details.
Gresham G, Meinert JL, Gresham AG, Meinert CL. Assessment of Trends in the Design, Accrual, and Completion of Trials Registered in ClinicalTrials.gov by Sponsor Type, 2000-2019. JAMA Netw Open. 2020;3(8):e2014682. doi:10.1001/jamanetworkopen.2020.14682
What are the characteristics and trends of clinical trials registered in ClinicalTrials.gov over time, and how do they differ by sponsor type?
In this cross-sectional study of ClinicalTrials.gov registration data on 245 999 interventional studies started between 2000 and 2019 that were sponsored by the National Institutes of Health or other US government agencies, industry, or other sources (foundations, universities, hospitals, clinics, and others), most trials were small, single-site studies that did not have US Food and Drug Administration–defined phases and were sponsored by other sources. Median sample sizes and years to trial completion decreased over time.
The findings suggest that the composition and design of trials changed between 2000 and 2019 and differed substantially by sponsor type; increased funding toward larger randomized clinical trials may be warranted to inform clinical decision-making and guide future research.
ClinicalTrials.gov is a valuable resource that can be used to trace the state and nature of trials. Since its launch in 2000, more than 345 000 trials have been registered. Little is known about the characteristics and trends in clinical trials over time and how they differ by sponsor type.
To assess trends in clinical trials registered in ClinicalTrials.gov over time and by sponsor type.
Design, Setting, and Participants
This cross-sectional study included clinical trials (interventional studies) registered in ClinicalTrials.gov from January 1, 2000, through December 31, 2019. The trials were grouped by lead sponsor: National Institutes of Health (NIH) and other US government agencies, industry, and other sources (foundations, universities, hospitals, clinics, and others). A static version of the Clinical Trials Transformation Initiative Aggregate Analysis of ClinicalTrials.gov database was downloaded on January 1, 2020, for analysis.
Main Outcomes and Measures
ClinicalTrials.gov registration fields, including overall status, phase, intervention, number of sites, use of masking and randomization, sample size, and time to study completion by start year and lead sponsor (organization that provided funding or support for a clinical study).
A total of 245 999 clinical trials (interventional studies) were started between 2000 and 2019, of which 135 144 (54.9%) were completed. Among completed trials, 5113 (3.8%) were sponsored by the NIH or a US government agency, 48 668 (36.0%) by industry, and 81 363 (60.2%) by other sources. Most trials were single center (61.3%), randomized (65.6%), and phase 1 to 2 (35.5%) or did not have a US Food and Drug Administration–defined phase (38.4%), with fewer drug trials being conducted over time. Sample sizes were small (median, 60; interquartile range [IQR], 30-160) and diminished over time. Trial median completion times varied by lead sponsor: 3.4 years (IQR, 1.9-5.0 years) for NIH- and US government–sponsored trials, 1.2 years (IQR, 0.5-2.4 years) for industry trials, and 2.1 years (IQR, 1.1-3.7) for trials sponsored by other sources.
Conclusions and Relevance
The findings suggest that the composition and design of trials changed from 2000 to 2019 and differed substantially by sponsor type. Increased funding toward larger randomized clinical trials may be warranted to inform clinical decision-making and guide future research.
Since ClinicalTrials.gov was launched in 2000, more than 345 000 interventional and observational studies have been registered.1-3 ClinicalTrials.gov is managed by the National Library of Medicine and is an online resource for health care professionals, researchers, patients, and the general public. It is an important resource that can be used to view and access clinical trials registration data. Analyzing clinical trials metadata can illuminate important trends over time, such as the composition, size, design, and types of trials being funded.
There have been updates to the clinical trials registration and reporting requirements since implementation of the US Food and Drug Administration (FDA) Modernization Act of 1997, which mandated clinical trials registration and led to the establishment of ClinicalTrials.gov.4,5 In 2005, the International Committee of Medical Journal Editors (ICMJE) required registration of clinical trials as a prerequisite for publication.6 Subsequently, the FDA Act (FDAAA 801) of 2007 expanded requirements to the types of trials being registered, key data elements being entered, and basic results being reported.7 The Final Rule became effective in January 2017, further clarifying and expanding on the registration and requirements of FDAAA 801.8 Some changes include the types of trials subject to the requirements, the information that must be submitted and data elements that are required to be entered on registration, and additional results information reporting requirements for trials.8 Simultaneously, a policy was issued by the National Institutes of Health (NIH) to require registration and results reporting for all trials funded by the NIH regardless of whether the trials are covered by the FDAAA 801 requirements of the Final Rule.8
Availability of the Clinical Trials Transformation Initiative Aggregate Analysis of ClinicalTrials.gov (CTTI AACT) database has facilitated and improved the ability to analyze ClinicalTrials.gov registration data.9 In 2017, the CTTI AACT database was upgraded to a cloud-based platform that allows for open access to the complete set of trials registered in ClinicalTrials.gov for download and analysis. Its restructured and relational format facilitates analysis and provides access to additional fields that are not readily available in direct exports from ClinicalTrials.gov.
Previous reports of ClinicalTrials.gov registration data have focused analyses on specific funders, such as the NIH; on a single condition; or on a particular registration element within ClinicalTrials.gov.10-12 To our knowledge, no studies have characterized trials by sponsor type during this 20-year time span. Thus, our objective was to assess the characteristics and trends of clinical trials started from January 1, 2000, through December 31, 2019, and to compare trends by sponsor type.
This cross-sectional study included clinical trials (interventional studies) with start dates between January 1, 2000, and December 31, 2019, that were registered in ClinicalTrials.gov and accessed using the CTTI AACT database.13 Observational studies and studies with expanded access were excluded from the analysis. CTTI AACT is a relational cloud-based database that includes aggregated and restructured data from ClinicalTrials.gov. Content is updated daily and can be publicly accessed using pgAdmin (pgAdmin Development Team), R (R Foundation for Statistical Computing), SAS (SAS Institute Inc), or pSQL (PostgreSQL Global Development Group). Characterization of the CTTI AACT content and navigation through the CTTI AACT database followed definitions from the publicly available CTTI AACT comprehensive data dictionary14 and definitions available in ClinicalTrials.gov. A static version of the CTTI AACT database was downloaded for analysis on January 1, 2020. This is an analysis of publicly available aggregate trial data; thus, institutional review board approval was not required. This study followed the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guideline.
ClinicalTrials.gov registration fields, as coded in the CTTI AACT database, included the following: trial start and completion dates; study type (interventional or observational); overall status (completed, withdrawn, terminated, suspended, open to enrollment, recruiting, not yet recruiting, or status unknown); enrollment number; study phase (early phase 1, phase 1, phase 1 to 2, phase 2, phase 3, phase 4, and trials that do not have an FDA-defined phase [phase not applicable (NA)]); treatment assignment (randomized or not randomized); masking (open label or masked); facilities (single center or multicenter); posted results; and lead sponsor (NIH or other US government agency, industry, and all other sponsors). Lead sponsor is defined in ClinicalTrials.gov as the “organization or person who initiates the study and who has authority and control over the study.”1 This variable is not the same as funder type, which is derived from multiple data elements in ClinicalTrials.gov and is not available as a discrete field in the database download. Additional calculated variables included time to completion (calculated as the difference between actual completion date and start date for completed trials) and times to posted results. Anticipated and actual enrollment counts were also assessed by comparing target sample size provided at trial registration with the sample size provided on trial completion. A description of each variable as defined in ClinicalTrials.gov and used for the purpose of this article is available in eTable 1 in the Supplement.
Results were grouped by lead sponsor and start date in 5-year periods: 2000 to 2004, 2005 to 2009, 2010 to 2014, and 2015 to 2019. These year groupings align with changes in the registration and reporting regulations, including the launch of ClinicalTrials.gov, the ICMJE edict, and implementation of FDAAA 801. Trial start dates were used to classify periods because registration dates can be entered retrospectively and thus are more likely to be inaccurate or lead to time misclassification.
Multivariable regression models were fitted to evaluate the association between sample size and sponsor type and were adjusted for start year and other trial design characteristics. An interaction term between start year and lead sponsor was included in the model in which a significant result would indicate an interactive effect. Anticipated and actual sample sizes were compared across sponsor types for trials started and completed between 2010 and 2019 using the available CTTI AACT archived databases for each year. Median times to completion were calculated from start date to actual completion date for completed trials. A 2-sided P < .05 was considered to be statistically significant. All tabulations and analyses were duplicated (A.G.G. and J.L.M.) using postgreSQL and SAS. The postgreSQL codes used to generate tables are available in the eAppendix in the Supplement. Additional analyses were performed using Stata, version 15 (StataCorp LLC).
There were 325 860 registrations on ClinicalTrials.gov as of January 1, 2020, of which 245 999 were clinical trials (interventional studies) started between 2000 and 2019; 135 144 trials (54.9%) were completed (Figure). Overall, there were 8023 NIH- or US government–sponsored trials (3.3%), 70 329 industry-sponsored trials (28.5%), and 167 647 trials sponsored by other funding sources (68.1%). Among the NIH- and US government–sponsored trials, 63.7% were completed, 11.4% were incomplete, 20.2% were active, and 4.6% had unknown status (Table 1). Industry-sponsored trials had the highest percentage of completed trials (69.2%) and the lowest percentage of active trials (16.8%), whereas trials sponsored by other sources had the lowest completion rates (48.5%) and the highest percentage of active trials (29.8%), including trials that were not yet recruiting, were recruiting, were enrolling by invitation, or were active and not recruiting. The number of NIH- and US government–sponsored trials started each year decreased over time, in contrast to the number of trials started per year that were sponsored by industry and other funding sources, which increased over time (eTable 2 in the Supplement).
Design characteristics of completed trials ordered by lead sponsors and start year are given in Table 2. Most trials were single center (61.3%), randomized (65.6%), open label (55.7%), phase 1 to 2 (35.5%), or lacking an FDA-defined phase (38.4%). Percentages of completed trials that were double-masked, multisite, and randomized were 31.4% for industry-sponsored trials, 12.3% for NIH- and US government–sponsored trials, and 11.0% for other trials and remained stable over time. The overall percentage of drug trials completed decreased from 2000 to 2019 (70.5% in 2000-2004, 61.8% in 2005-2009, 48.9% in 2010-2014, and 40.0% in 2015-2019). This finding is in contrast to a doubling of trials that involved nondrug interventions from 2000 through 2004 (29.6%) to 2015 through 2019 (60.0%). These trends are reflected in the decreasing number of phase 1 to 2 and phase 3 to 4 trials being completed and the increasing number of trials lacking an FDA-defined phase (phase NA). The NIH and US government agencies were the lead sponsors for a larger percentage of completed phase 1 to 2 trials, whereas industry was the lead sponsor for more phase 3 to 4 trials completed over time. Trials sponsored by other sources involved mostly trials lacking an FDA-defined phase. More industry-sponsored trials were multicenter (65.6%) compared with NIH- and US government–sponsored trials (34.2%) and trials sponsored by other sources (27.7%).
Median sample sizes for completed trials by sponsor and phase over time are given in Table 3. The overall median sample size for trials between 2000 and 2019 was 60 individuals (interquartile range [IQR], 30-160 individuals) and decreased between 2000 and 2019 for all sponsors. Sample sizes were largest for industry-sponsored trials, with a median of 75 individuals (IQR, 32-236 individuals) compared with NIH- and US government–sponsored trials (median, 55 individuals; IQR, 28-140 individuals) and trials funded by other sources (median, 58 individuals; IQR, 28-128 individuals) (Table 3). Trial sample sizes were less than 100 individuals in 56.1% of industry-sponsored trials compared with 66.8% of NIH- and US government–sponsored trials and 67.2% of trials sponsored by other sources overall.
In multivariable regression of completed trials (eTable 3 in the Supplement), sample sizes decreased by 8.2 persons every 5 years. Phase 1 to 2 trials decreased by 2.2 persons (95% CI, 1.5-2.9 persons), phase 3 to 4 trials by 8.8 persons (95% CI, 5.3-12.3 persons), and trials lacking an FDA-defined phase by 4.2 persons (95% CI, 0.9-7.5 persons) every 5 years. Comparing sample sizes by sponsor, NIH- and US government–sponsored trials were smaller than industry-sponsored trials for phase 1 to 2 trials (−2.5; 95% CI, −4.0 to 1.0; P < .001), phase 3 to 4 trials (−82.7; 95% CI, −96.4 to −69.0; P < .001), and overall (−12.7; 95% CI, −14.9 to −10.6; P < .001). Interaction terms for start year and sponsor were statistically significant (eTable 3 in the Supplement). Planned sample sizes at the beginning of the trial were larger than actual sample sizes when trials were completed across all phases and sponsor types (eTable 4 and eFigure 3 in the Supplement).
Median times to trial completion by lead sponsor were 3.4 years (IQR, 1.9-5.0 years) for NIH- and US government–sponsored trials, 1.2 years (IQR, 0.5-2.4 years) for industry trials, and 2.1 years (IQR, 1.1-3.7 years) for trials sponsored by other sources between 2000 and 2019 (eFigure 1 in the Supplement). Table 4 shows median times to completion and IQRs for completed trials by sponsor type and start year, which decreased over time (eFigure 2 in the Supplement). For example, median years to completion for phase 3 to 4 trials sponsored by the NIH and other US government agencies were 5.4 (IQR, 3.7-7.7) in 2000, 3.8 (IQR, 2.3-5.9) in 2005, 3.7 (IQR, 2.9-4.9) in 2010, and 3.2 (IQR, 1.9-3.7) in 2015. Times to completion for industry-sponsored phase 3 to 4 trials remained relatively steady over time: 3.2 years (IQR, 1.9-5.2 years) in 2000, 1.7 years (IQR, 0.9-2.7 years) in 2005, 1.7 years (IQR, 0.9-3.1 years) in 2010, and 1.6 years (IQR, 0.9-2.5 years) Times to completion for phase 3 to 4 trials sponsored by other sources were 6.0 years (IQR, 3.9-9.1 years) in 2000, 3.1 years (IQR, 1.8-5.0 years) in 2005, 3.0 years (IQR, 1.6-4.4 years) in 2010, and 1.6 years (IQR, 0.9-2.5 years) in 2015.
From 2007, when posting results became required, to 2019, the percentages of completed trials posting results by agency were 47.7% for NIH- and US government-sponsored trials, 37.8% for industry-sponsored trials, and 16.0% for trials sponsored by other sources. The median times to posting were 1.4 years (IQR, 0.9-3.0 years) for all trials, 1.3 years (IQR, 0.9-2.9 years) for industry-sponsored trials, 1.6 years (IQR, 1.0-3.3 years) for NIH- and US government-sponsored trials, and 1.6 years (IQR, 0.9-3.1 years) for trials sponsored by other sources.
ClinicalTrials.gov is an important resource that can be used to characterize the state and nature of trials. We describe trends and characteristics of 245 999 trials that were registered in ClinicalTrials.gov and started between 2000 and 2019. We found that trials had smaller sample sizes and were being completed in less time and that most trials were sponsored by other sources (foundations, universities, hospitals, clinics, and others) from 2000 to 2019.
The number of trials started per year increased between 2000 and 2019, with the largest increase observed in the number of trials started each year by other sources. A similar trend was observed for industry-sponsored trials started per year, whereas the number of NIH- and US government-sponsored trials started per year decreased. Part of the decrease may have been associated with differential uptake of registration across sponsor types, which was faster for NIH- and US government-sponsored trials, within the first 5 years of its launch, accounting for most new registrations.
Differences were observed in clinical trial design characteristics over time, including different distributions across trial phases, intervention types, use of randomization or masking, and number of centers involved over time. There was a decrease in the percentages of phase 3 to 4 trials and drug trials being conducted over time compared with an increase in the percentages of nondrug trials and trials without an FDA-defined phase. The rate in difference between early-phase trials (phase 1-2) and phase 3 trials decreased by almost half by the end of 2019. This shift may be explained by the increased uptake in registration for these trials, expansion of the clinical trial definition, or increasing interest in other intervention types (eg, behavioral interventions, imaging, biologic, and devices) in recent years.15-17 The decreasing percentages of trials that involved drugs may also be associated with increasing costs and complexity of conducting phase 3 to 4 drug trials.
Overall, trial sample sizes decreased over time and took less time to complete. Median times to trial completion varied by sponsor type and phase. Industry completed trials at faster rates compared with the NIH and US government and other funders, possibly in association with more efficient trial startup processes and higher recruitment rates. Reasons for this trend may include changes in the types of outcomes being used (eg, surrogate outcomes and biomarkers as well as patient-reported outcomes), increasing trial-associated costs, and greater budget constraints. With an overall median sample size of 60 persons per trial, the ability to generate meaningful, reproducible differences with such a sample size remains questionable.10,18 Reports from almost 10 years ago had similar conclusions, without any evidence of change or improvement.10,12,19 The original planned sample sizes were not met and were often smaller compared with the actual sample size when the trial was completed.20 Reasons for not achieving the planned sample size, other than meeting the scientific goals of the trial, include recruitment and retention difficulties, business decisions, and unavailability or discontinuation of funding.21-23 At time of analysis, there were 21 455 trials started in 2019 and registered in ClinicalTrials.gov. If we assume registrations in ClinicalTrials.gov account for 70% of all trials registered and that the median cost (direct and indirect) per trial from start to finish is $1 000 000 at a minimum, the total cost for trials in 2019 would be approximately $31 billion. The median cost for comparative efficacy trials (phase 3-4) is closer to $19 million per trial, with larger trials ranging up to $53 million per trial.24,25 Thus, increased funding for larger randomized clinical trials may be warranted to inform clinical decision-making and answer important clinical and health policy questions.
The findings suggest that registration and reporting systems could be further improved. There appears to be a need for ClinicalTrials.gov to modify its registration system to accommodate the broader range of trials being conducted and the collaborative arrangements involved. For instance, there is currently no explicit data element for funding source in ClinicalTrials.gov. Thus, the lead sponsor variable was used to estimate trends by the different agencies and organizations as classified in ClinicalTrials.gov. The term lead sponsor refers to the primary organization that oversees study implementation and is responsible for conducting data analysis and is further used to determine the primary funding source for the study.1 The impetus for ClincalTrials.gov was the FDA. That history is reflected in the emphasis on drug trials, US funding, and intermixing sponsor as funder and holder of Investigational New Drug applications. Because trials are largely collaborative and can involve several funders, there is potential for misclassification, underreporting, or overreporting of estimates of funding sources. Although methods have been described to estimate the probable funding source from ClinicalTrials.gov, this information cannot be easily exported or analyzed using the publicly available data set or derived without extensive data manipulation and assumptions.10 Thus, an explicit data element for the primary source or sources of funding with potential linkage to the study project number (eg, NIH Report Expenditures and Results Tool [RePORTER] for NIH-funded trials) and the total amount and duration of funding, if available, may improve the ability to compare trials across funding sources, reducing the risk of misclassification of trials in the other funder categories.
Additional updates that may be beneficial to the analysis of ClinicalTrials.gov registration data include further subdivisions for trial phases beyond the FDA-defined phases to reduce the number of trials lacking an FDA-defined phase, expansion of the trial typography as listed in ClinicalTrials.gov to better capture the different types of trial designs, discrete elements for the outcome types (eg, time to event, surrogate, composite, and patient reported) and for specific time points for the primary outcomes for analysis purposes, and reduction of free-text renders to prevent formatting issues when analyzing the database. This approach would allow for future comparisons of specific outcome types and adjust analyses by outcome duration. Improvements to the system for tracking publications related to the trial are needed, including a separate field for the publication associated with the primary outcome, to determine the fraction of trials registered that are published. Publication is the sine qua non of trials, but only a fraction of completed trials are published. Thus, continued efforts to enforce the timely and complete reporting of results are important to reduce the reporting biases associated with delayed publication or failure to publish.26-30
This study has limitations. The analysis was limited to the available data as registered in ClinicalTrials.gov and thus may not provide a complete or accurate assessment of the clinical trials research enterprise. The ClinicalTrials.gov database structure and the individual study records have evolved since ClinicalTrials.gov was first launched in 2000, affected by the different registration and reporting requirements over time. Changes in required registration fields, data formats, and key definitions and terms (eg, clarification of applicable clinical trial and changes in phase categories) made it difficult to compare characteristics registered over time and resulted in missing data encountered for certain registration fields, such as enrollment, number of facilities, or overall status.12,32 ClinicalTrials.gov is only 1 of multiple registration sites where trials can be registered. The World Health Organization International Clinical Trials Registry Platform has 16 other places where trials can be registered with varying analytic capabilities. Although trials are required to meet ICMJE standards, it is difficult to obtain a single comprehensive evaluation of all registered trials. It also remains unclear how many duplicate registrations may exist, especially for non-US studies that may be registered in both ClinicalTrials.gov and a second or third registration registry.31 Thus, whether trials registered in ClinicalTrials.gov are representative of trials registered elsewhere may remain unknown until there is a system for merging registries into 1 or for establishing a universal and standardized data system for harvesting trial information from all registries through the World Health Organization International Clinical Trials Registry Platform.
Even with its limitations, ClinicalTrials.gov registration provides valuable insights into the massive clinical trials research enterprise. The findings suggest that the composition and design of trials changed over time and differed substantially by sponsor type. Increased funding toward larger randomized clinical trials may be warranted to inform clinical decision-making and guide future research.
Accepted for Publication: June 7, 2020.
Published: August 26, 2020. doi:10.1001/jamanetworkopen.2020.14682
Open Access: This is an open access article distributed under the terms of the CC-BY License. © 2020 Gresham G et al. JAMA Network Open.
Corresponding Author: Gillian Gresham, PhD, Department of Medicine, Cedars-Sinai Medical Center, 700 N San Vincente Blvd, Los Angeles, CA, 90048 (firstname.lastname@example.org).
Author Contributions: Drs G. Gresham and C.L. Meinert had full access to all the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis.
Concept and design: G. Gresham, C.L. Meinert.
Acquisition, analysis, or interpretation of data: All authors.
Drafting of the manuscript: G. Gresham, A.G. Gresham, C.L. Meinert.
Critical revision of the manuscript for important intellectual content: G. Gresham, J.L. Meinert, C.L. Meinert.
Statistical analysis: G. Gresham, J.L. Meinert, C.L. Meinert.
Administrative, technical, or material support: A.G. Gresham, C.L. Meinert.
Supervision: C.L. Meinert.
Conflict of Interest Disclosures: None reported.
Meeting Presentation: This study was presented in part at the Society for Clinical Trials Conference; May 20, 2019; New Orleans, LA.
Create a personal account or sign in to: