[Skip to Navigation]
Sign In
Figure.  Pulmonary Embolism Prevalence by 4-Level Pulmonary Embolism Clinical Probability Score (4PEPS) in the Derivation and Validation Cohorts
Pulmonary Embolism Prevalence by 4-Level Pulmonary Embolism Clinical Probability Score (4PEPS) in the Derivation and Validation Cohorts
Table 1.  Diagnostic Strategy Aiming to Reduce Imaging Testing
Diagnostic Strategy Aiming to Reduce Imaging Testing
Table 2.  Baseline Characteristics of the Patients in the Different Cohorts
Baseline Characteristics of the Patients in the Different Cohorts
Table 3.  4-Level Pulmonary Embolism Clinical Probability Score (4PEPS)
4-Level Pulmonary Embolism Clinical Probability Score (4PEPS)
Table 4.  Diagnostic Tests and False-Negative Testing According to the Strategy Retrospectively Applied
Diagnostic Tests and False-Negative Testing According to the Strategy Retrospectively Applied
1.
Konstantinides  SV, Meyer  G, Becattini  C,  et al; ESC Scientific Document Group.  2019 ESC guidelines for the diagnosis and management of acute pulmonary embolism developed in collaboration with the European Respiratory Society (ERS).   Eur Heart J. 2020;41(4):543-603. doi:10.1093/eurheartj/ehz405PubMedGoogle ScholarCrossref
2.
Kline  JA, Garrett  JS, Sarmiento  EJ, Strachan  CC, Courtney  DM.  Over-testing for suspected pulmonary embolism in American emergency departments: the continuing epidemic.   Circ Cardiovasc Qual Outcomes. 2020;13(1):e005753. doi:10.1161/CIRCOUTCOMES.119.005753PubMedGoogle Scholar
3.
Wang  RC, Miglioretti  DL, Marlow  EC,  et al.  Trends in imaging for suspected pulmonary embolism across US health care systems, 2004 to 2016.   JAMA Netw Open. 2020;3(11):e2026930. doi:10.1001/jamanetworkopen.2020.26930PubMedGoogle Scholar
4.
Dobler  CC.  Overdiagnosis of pulmonary embolism: definition, causes and implications.   Breathe (Sheff). 2019;15(1):46-53. doi:10.1183/20734735.0339-2018PubMedGoogle ScholarCrossref
5.
Wiener  RS, Schwartz  LM, Woloshin  S.  Time trends in pulmonary embolism in the United States: evidence of overdiagnosis.   Arch Intern Med. 2011;171(9):831-837. doi:10.1001/archinternmed.2011.178PubMedGoogle ScholarCrossref
6.
Mitchell  AM, Jones  AE, Tumlin  JA, Kline  JA.  Prospective study of the incidence of contrast-induced nephropathy among patients evaluated for pulmonary embolism by contrast-enhanced computed tomography.   Acad Emerg Med. 2012;19(6):618-625. doi:10.1111/j.1553-2712.2012.01374.xPubMedGoogle ScholarCrossref
7.
Niemann  T, Zbinden  I, Roser  HW, Bremerich  J, Remy-Jardin  M, Bongartz  G.  Computed tomography for pulmonary embolism: assessment of a 1-year cohort and estimated cancer risk associated with diagnostic irradiation.   Acta Radiol. 2013;54(7):778-784. doi:10.1177/0284185113485069PubMedGoogle ScholarCrossref
8.
Le Gal  G, Righini  M, Roy  P-M,  et al.  Prediction of pulmonary embolism in the emergency department: the revised Geneva score.   Ann Intern Med. 2006;144(3):165-171. doi:10.7326/0003-4819-144-3-200602070-00004PubMedGoogle ScholarCrossref
9.
Kearon  C, Ginsberg  JS, Douketis  J,  et al; Canadian Pulmonary Embolism Diagnosis Study (CANPEDS) Group.  An evaluation of D-dimer in the diagnosis of pulmonary embolism: a randomized trial.   Ann Intern Med. 2006;144(11):812-821. doi:10.7326/0003-4819-144-11-200606060-00007PubMedGoogle ScholarCrossref
10.
Kline  JA, Mitchell  AM, Kabrhel  C, Richman  PB, Courtney  DM.  Clinical criteria to prevent unnecessary diagnostic testing in emergency department patients with suspected pulmonary embolism.   J Thromb Haemost. 2004;2(8):1247-1255. doi:10.1111/j.1538-7836.2004.00790.xPubMedGoogle ScholarCrossref
11.
Righini  M, Van Es  J, Den Exter  PL,  et al.  Age-adjusted D-dimer cutoff levels to rule out pulmonary embolism: the ADJUST-PE study.   JAMA. 2014;311(11):1117-1124. doi:10.1001/jama.2014.2135PubMedGoogle ScholarCrossref
12.
van der Hulle  T, Cheung  WY, Kooij  S,  et al; YEARS study group.  Simplified diagnostic management of suspected pulmonary embolism (the YEARS study): a prospective, multicentre, cohort study.   Lancet. 2017;390(10091):289-297. doi:10.1016/S0140-6736(17)30885-1PubMedGoogle ScholarCrossref
13.
Kearon  C, de Wit  K, Parpia  S,  et al; PEGeD Study Investigators.  Diagnosis of pulmonary embolism with D-dimer adjusted to clinical probability.   N Engl J Med. 2019;381(22):2125-2134. doi:10.1056/NEJMoa1909159PubMedGoogle ScholarCrossref
14.
Singh  B, Mommer  SK, Erwin  PJ, Mascarenhas  SS, Parsaik  AK.  Pulmonary Embolism Rule-out Criteria (PERC) in pulmonary embolism—revisited: a systematic review and meta-analysis.   Emerg Med J. 2013;30(9):701-706. doi:10.1136/emermed-2012-201730PubMedGoogle ScholarCrossref
15.
Kline  JA, Courtney  DM, Kabrhel  C,  et al.  Prospective multicenter evaluation of the Pulmonary Embolism Rule-out Criteria.   J Thromb Haemost. 2008;6(5):772-780. doi:10.1111/j.1538-7836.2008.02944.xPubMedGoogle ScholarCrossref
16.
Dronkers  CEA, van der Hulle  T, Le Gal  G,  et al; Subcommittee on Predictive and Diagnostic Variables in Thrombotic Disease.  Towards a tailored diagnostic standard for future diagnostic studies in pulmonary embolism: communication from the SSC of the ISTH.   J Thromb Haemost. 2017;15(5):1040-1043. doi:10.1111/jth.13654PubMedGoogle ScholarCrossref
17.
Roy  P-M, Meyer  G, Vielle  B,  et al; EMDEPU Study Group.  Appropriateness of diagnostic management and outcomes of suspected pulmonary embolism.   Ann Intern Med. 2006;144(3):157-164. doi:10.7326/0003-4819-144-3-200602070-00003PubMedGoogle ScholarCrossref
18.
Roy  PM, Durieux  P, Gillaizeau  F,  et al.  A computerized handheld decision-support system to improve pulmonary embolism diagnosis: a randomized trial.   Ann Intern Med. 2009;151(10):677-686. doi:10.7326/0003-4819-151-10-200911170-00003PubMedGoogle ScholarCrossref
19.
Righini  M, Le Gal  G, Aujesky  D,  et al.  Diagnosis of pulmonary embolism by multidetector CT alone or combined with venous ultrasonography of the leg: a randomised non-inferiority trial.   Lancet. 2008;371(9621):1343-1352. doi:10.1016/S0140-6736(08)60594-2PubMedGoogle ScholarCrossref
20.
Penaloza  A, Soulié  C, Moumneh  T,  et al.  Pulmonary Embolism Rule-out Criteria (PERC) rule in European patients with low implicit clinical probability (PERCEPIC): a multicentre, prospective, observational study.   Lancet Haematol. 2017;4(12):e615-e621. doi:10.1016/S2352-3026(17)30210-7PubMedGoogle ScholarCrossref
21.
West  J, Goodacre  S, Sampson  F.  The value of clinical features in the diagnosis of acute pulmonary embolism: systematic review and meta-analysis.   QJM. 2007;100(12):763-769. doi:10.1093/qjmed/hcm113PubMedGoogle ScholarCrossref
22.
Wasson  JH, Sox  HC, Neff  RK, Goldman  L.  Clinical prediction rules. applications and methodological standards.   N Engl J Med. 1985;313(13):793-799. doi:10.1056/NEJM198509263131306PubMedGoogle ScholarCrossref
23.
Hosmer  DW, Lemeshow  S.  Applied Logistic Regression. Wiley; 1989.
24.
Harrell  FE  Jr, Lee  KL, Mark  DB.  Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors.   Stat Med. 1996;15(4):361-387. doi:10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4PubMedGoogle ScholarCrossref
25.
Fagan  TJ.  Letter: nomogram for Bayes theorem.   N Engl J Med. 1975;293(5):257. doi:10.1056/NEJM197507312930513PubMedGoogle Scholar
26.
DeLong  ER, DeLong  DM, Clarke-Pearson  DL.  Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach.   Biometrics. 1988;44(3):837-845. doi:10.2307/2531595PubMedGoogle ScholarCrossref
27.
Hutchinson  BD, Navin  P, Marom  EM, Truong  MT, Bruzzi  JF.  Overdiagnosis of pulmonary embolism by pulmonary CT angiography.   AJR Am J Roentgenol. 2015;205(2):271-277. doi:10.2214/AJR.14.13938PubMedGoogle ScholarCrossref
28.
Penaloza  A, Verschuren  F, Dambrine  S, Zech  F, Thys  F, Roy  PM.  Performance of the Pulmonary Embolism Rule-out Criteria (the PERC rule) combined with low clinical probability in high prevalence population.   Thromb Res. 2012;129(5):e189-e193. doi:10.1016/j.thromres.2012.02.016PubMedGoogle ScholarCrossref
29.
Hugli  O, Righini  M, Le Gal  G,  et al.  The Pulmonary Embolism Rule-out Criteria (PERC) rule does not safely exclude pulmonary embolism.   J Thromb Haemost. 2011;9(2):300-304. doi:10.1111/j.1538-7836.2010.04147.xPubMedGoogle ScholarCrossref
30.
Kline  JA, Hogg  MM, Courtney  DM, Miller  CD, Jones  AE, Smithline  HA.  D-dimer threshold increase with pretest probability unlikely for pulmonary embolism to decrease unnecessary computerized tomographic pulmonary angiography.   J Thromb Haemost. 2012;10(4):572-581. doi:10.1111/j.1538-7836.2012.04647.xPubMedGoogle ScholarCrossref
31.
Eddy  M, Robert-Ebadi  H, Richardson  L,  et al.  External validation of the YEARS diagnostic algorithm for suspected pulmonary embolism.   J Thromb Haemost. 2020. doi:10.1111/jth.15083PubMedGoogle Scholar
32.
Kline  JA, Richardson  DM, Than  MP, Penaloza  A, Roy  PM.  Systematic review and meta-analysis of pregnant patients investigated for suspected pulmonary embolism in the emergency department.   Acad Emerg Med. 2014;21(9):949-959. doi:10.1111/acem.12471PubMedGoogle ScholarCrossref
33.
Penaloza  A, Verschuren  F, Meyer  G,  et al.  Comparison of the unstructured clinician gestalt, the Wells score, and the revised Geneva score to estimate pretest probability for suspected pulmonary embolism.   Ann Emerg Med. 2013;62(2):117-124.e2.PubMedGoogle ScholarCrossref
34.
Carpenter  CR, Raja  AS.  Arming the bayesian physician to rule out pulmonary embolism: using evidence-based diagnostics to combat overtesting.   Acad Emerg Med. 2014;21(9):1036-1038. doi:10.1111/acem.12450PubMedGoogle ScholarCrossref
Original Investigation
March 3, 2021

Derivation and Validation of a 4-Level Clinical Pretest Probability Score for Suspected Pulmonary Embolism to Safely Decrease Imaging Testing

Author Affiliations
  • 1Emergency Department, CHU Angers, Institut Mitovasc UMR (CNRS 6015—INSERM 1083), UNIV Angers, F-CRIN INNOVTE, Angers, France
  • 2Emergency Department, CHU Angers, Angers, France
  • 3Emergency Department, Cliniques Universitaires Saint Luc, Université Catholique de Louvain, Brussels, Belgium
  • 4Department of Emergency Medicine, Indiana University School of Medicine, Indianapolis
  • 5Division of Angiology and Hemostasis, Department of Internal Medicine, Faculty of Medicine, Geneva University Hospital, Geneva, Switzerland
  • 6Ottawa Hospital Research Institute, The Ottawa Hospital, Division of Hematology, Department of Medicine, University of Ottawa, Ottawa, Ontario, Canada
  • 7Emergency Department, Cliniques Universitaires Saint Luc, Université Catholique de Louvain, F-CRIN INNOVTE, Brussels, Belgium
JAMA Cardiol. 2021;6(6):669-677. doi:10.1001/jamacardio.2021.0064
Key Points

Question  Can a pretest probability score make it possible to rule out pulmonary embolism solely on clinical criteria and optimized D-dimer measurement to safely decrease imaging testing?

Findings  In this study, the 4-Level Pulmonary Embolism Clinical Probability Score (4PEPS) was derived and validated using databases from 3 merged management studies. The safety and the efficacy of the 4PEPS strategy was confirmed in 2 external validation cohorts (false-negative rates: 0.71% and 0.89%; absolute reductions in imaging testing: −19% and −22%, respectively).

Meaning  The 4PEPS strategy may lead to a substantial and safe reduction in imaging testing for patients with suspected pulmonary embolism.

Abstract

Importance  In patients with suspected pulmonary embolism (PE), overuse of diagnostic imaging is an important point of concern.

Objective  To derive and validate a 4-level pretest probability rule (4-Level Pulmonary Embolism Clinical Probability Score [4PEPS]) that makes it possible to rule out PE solely on clinical criteria and optimized D-dimer measurement to safely decrease imaging testing for suspected PE.

Design, Setting, and Participants  This study included consecutive outpatients suspected of having PE from US and European emergency departments. Individual data from 3 merged management studies (n = 11 114; overall prevalence of PE, 11%) were used for the derivation cohort and internal validation cohort. The external validation cohorts were taken from 2 independent studies, the first with a high PE prevalence (n = 1548; prevalence, 21.5%) and the second with a moderate PE prevalence (n = 1669; prevalence, 11.7%). A prior definition of pretest probability target values to achieve a posttest probability less than 2% was used on the basis of the negative likelihood ratios of D-dimer. Data were collected from January 2003 to April 2016, and data were analyzed from June 2018 to August 2019.

Main Outcomes and Measures  The rate of PE diagnosed during the initial workup or during follow-up and the rate of imaging testing.

Results  Of the 5588 patients in the derivation cohort, 3441 (61.8%) were female, and the mean (SD) age was 52 (18.5) years. The 4PEPS comprises 13 clinical variables scored from −2 to 5. It results in the following strategy: (1) very low probability of PE if 4PEPS is less than 0: PE ruled out without testing; (2) low probability of PE if 4PEPS is 0 to 5: PE ruled out if D-dimer level is less than 1.0 μg/mL; (3) moderate probability of PE if 4PEPS is 6 to 12: PE ruled out if D-dimer level is less than the age-adjusted cutoff value; (4) high probability of PE if 4PEPS is greater than 12: PE ruled out by imaging without preceding D-dimer test. In the first and the second external validation cohorts, the area under the receiver operator characteristic curves were 0.79 (95% CI, 0.76 to 0.82) and 0.78 (95% CI, 0.74 to 0.81), respectively. The false-negative testing rates if the 4PEPS strategy had been applied were 0.71% (95% CI, 0.37 to 1.23) and 0.89% (95% CI, 0.53 to 1.49), respectively. The absolute reductions in imaging testing were −22% (95% CI, −26 to −19) and −19% (95% CI, −22 to −16) in the first and second external validation cohorts, respectively. The 4PEPS strategy compared favorably with all recent strategies in terms of imaging testing.

Conclusions and Relevance  The 4PEPS strategy may lead to a substantial and safe reduction in imaging testing for patients with suspected PE. It should now be tested in a formal outcome study.

Introduction

Despite the significant progress of the last decades, diagnosing pulmonary embolism (PE) remains a clinical challenge. The standard diagnostic strategy, based on clinical probability assessment, D-dimer testing, and computed tomography pulmonary angiography (CTPA), is proven to have a very low rate of diagnostic failure.1 However, there has been a large increase in CTPA for suspected PE.2,3 The exact reasons are likely multifactorial. The signs and symptoms of PE are very common and unspecific. As such, clinicians fear they might be missing a life-threatening condition and are prone to initiate a diagnostic process. Due to the lack of specificity of D-dimer testing, a large proportion of patients have a false-positive result and require imaging to rule out PE. Finally, CTPA is readily available, fast, minimally invasive, and more sensitive than ventilation/perfusion (V/Q) scans. A slight increase in PE diagnosis has been observed as a result but with no clear benefits in terms of outcome, especially PE-related mortality.4,5 One explanation is that because more CTPAs are being performed, there is a greater risk of false-positive results or non–clinically relevant diagnoses. Moreover, CTPA exposes patients to risks of allergies, kidney failure, and cumulative radiation-induced cancer.6,7 Several strategies have therefore been proposed to reduce PE overtesting and overdiagnosis (Table 1).4,8-14 These have proved satisfactory in terms of safety and efficacy, but they are based on different methods of assessing clinical pretest probability (CPP; eg, Wells9 or revised Geneva8 scores for PE, Pulmonary Embolism Rule-out Criteria [PERC] strategy,10 or YEARS strategy12), thus making it difficult to combine them and increasing the risk of misuse in clinical practice.

Our primary aim was to develop and validate a pretest probability score to safely reduce imaging testing by integrating all the previously proposed strategies: the 4-Level Pulmonary Embolism Clinical Probability Score (4PEPS). Our secondary goal was to retrospectively assess the safety of a diagnostic strategy based on this new score and its efficacy in reducing imaging testing.

Methods
Study Design

Four levels of CPP for 4PEPS were defined a priori:

  • Very low CPP, allowing exclusion of PE on clinical criteria only.

  • Low CPP, allowing exclusion of PE with a high-sensitivity D-dimer level less than 1.0 μg/mL (to convert to nanomoles per liter, multiply by 5.476).

  • Moderate CPP, allowing exclusion of PE with a D-dimer level less than 0.5 μg/mL or less than the age-adjusted cutoff value (calculated as age × 0.01 μg/mL for patients older than 50 years).

  • High CPP, not allowing a safe exclusion of PE with D-dimer testing and requiring imaging testing (CTPA or V/Q scan).

To derive the score, we predefined the upper limit for PE prevalence in each CPP category using the bayesian approach and considered 2% as the safety threshold for PE.13,15,16 The negative likelihood ratios of a D-dimer test using 1.0 μg/mL as the cutoff value and using an age-adjusted cutoff value were established using the results of the YEARS study12 and the Age-Adjusted D-Dimer Cutoff Levels to Rule Out Pulmonary Embolism (ADJUST-PE) study.11 They were found to be 0.08 and 0.01, respectively. Accordingly, to achieve a posttest probability less than 2%, the upper limit of PE prevalence was set at 20% for low CPP and at 65% for moderate CPP. The present study was a retrospective analysis of data prospectively collected in 5 studies that were all approved by an ethical committee and performed with the informed consent of the participating patients. According to the current European legislation, an approval of an ethical committee was not required for the present study.

Source of Data

For the derivation and internal validation, we merged 3 prospectively collected databases from patients with suspected PE (n = 11 114). The first study was performed in 117 emergency departments (EDs) in France and Belgium (n = 1529; enrolled in 2003)17; the second study was performed in 20 French EDs (n = 1645; enrolled in 2005 to 2006)18; and the third study was performed in 12 EDs in the US (n = 7940; enrolled in 2003 to 2006).15 Each database was randomly split into 2 groups, including 60% for the derivation cohort and 40% for the internal validation cohort.

Two other databases were used for external validation. The first study was performed in 6 EDs in France, Belgium, and Switzerland (n = 1819; enrolled in 2005 to 2006)19 and the second in 12 EDs in France and Belgium (n = 1757; enrolled in 2015 to 2016).20

Outcome

The outcome was a PE diagnosed on CTPA or high-probability V/Q scan during the initial diagnostic workup or a venous thromboembolism (VTE) occurring during follow-up (3 months for the 4 European studies and 45 days for the US study) in a patient in whom PE was initially ruled out. In all studies, the following were considered as VTE: symptomatic PE objectively confirmed with CTPA or high-probability V/Q scan and/or deep vein thrombosis on compression ultrasonography and/or sudden unexpected death potentially related to PE according to an independent adjudication committee.

4PEPS Derivation

We evaluated all of the clinical variables known to be potentially associated with PE and available in the database.21 As patients were suspected of PE because of dyspnea or chest pain, these variables were not included. However, we took the variable of dyspnea and chest pain into account when both were present in a given patient. Variables with more than 2% of missing data were excluded, except those included in other prediction rules (PERC strategy,10 revised Geneva score,8 and Wells score9). Namely, the following variables were excluded: history of hypertension, diabetes, dyslipidemia, coronary disease, long travel, chronic kidney failure, smoking, family history of VTE, body weight, respiratory rate, and antiplatelet treatment. We categorized the continuous variables according to the cutoff values previously chosen in other scoring systems and according to their clinical relevance. There were 4 categories for age (younger than 50 years, aged 50 to 64 years, aged 65 to 80 years, and older than 80 years), 3 categories for heart rate (less than 80 beats per minute, 80 to 100 beats per minute, and more than 100 beats per minute) and temperature (less than 38 °C, 38 to 39 °C, and greater than 39 °C), and 2 categories for systolic blood pressure (less than 90 mm Hg and 90 mm Hg or greater) and pulse oximetry (Spo2; less than 95% and 95% or greater).

To select the predictor variables associated with PE, we performed a univariate analysis by using the χ2 test.22 All variables with a 2-tailed P value less than .20 as well as the nonsignificant variables included in other prediction rules were included in a multivariate logistic regression model. We performed a stepwise backward analysis including 1 variable for every 10 VTE events.23,24 We then removed the nonsignificant variables, considering a 2-tailed P value less than .05 as significant. Only significant variables were left in the final score. We assigned points for the score according to the regression coefficients. Finally, we chose the cutoff values to achieve the predefined levels of PE prevalence in each CPP category.25

4PEPS Validation

The accuracy of the score was assessed by calculating the receiver operating characteristic curve and analyzing the area under the receiver operating characteristic curve (AUC). The AUC confidence interval was computed with the DeLong-DeLong method.26 Calibration was assessed with the Hosmer-Lemeshow goodness-of-fit statistic.23 A Brier score was also reported, summarizing the magnitude of error in the probability forecasts as between 0.0 and 1.0, where a perfectly calibrated model would score 0.0.

4PEPS Strategy Safety and Efficacy Assessment

The safety of the 4PEPS strategy was retrospectively assessed using the false-negative rate if the strategy had been applied in the 2 external validation cohorts. This is the rate of PE diagnoses during the initial diagnostic process or VTEs found during the 3-month follow-up among patients with a very low CPP, a low CPP and D-dimer level less than 1.0 μg/mL, a moderate CPP and D-dimer level less than the age-adjusted cutoff value, or a negative CTPA or V/Q scan.

We defined the safety threshold of the 4PEPS strategy as a function of PE prevalence applying the recommendations of the International Society of Thrombosis and Hemostasis (1.82 + [0.00528 × prevalence]).16 The respective PE prevalences in the first and second external validation cohorts were 21.4% and 11.7%, respectively. Thus, the acceptable upper limits of the 95% CI of false-negative rates were predefined at 1.93% and 1.88%, respectively.16

Finally, the efficacy of the 4PEPS strategy was assessed by the rate of D-dimer and imaging testing, mainly CTPA, that could have been avoided if the 4PEPS strategy had been applied compared with the standard strategy, the PERC strategy,10 the ADJUST-PE strategy,11 the YEARS strategy,12 and the Pulmonary Embolism Graduated D-Dimer (PEGeD) strategy13 (Table 1).

Missing Data

Analyses were performed including all analyzable patients. Patients with missing data were excluded and no imputation was performed. However, a sensitivity analysis was carried out for the 2 external validation cohorts considering the missing variables of 4PEPS as negative, ie, resulting in the lowest score and so representing the highest risk of a false-negative finding using the 4PEPS strategy.

Statistical Analysis

We calculated the 95% CIs by using the Mid-P exact value performed using OpenEpi version 2, an open-source calculator. All other statistical analyses were performed using SPSS version 25.0 (SPSS Inc).

Results

After exclusion of patients with missing data, 5588 patients were included in the derivation cohort (PE prevalence, 11.0%), 3726 in the internal validation cohort (PE prevalence, 11.7%), 1548 in the first external validation cohort (PE prevalence, 21.5), and 1669 in the second external validation cohort (PE prevalence, 11.7%). Of the 5588 patients in the derivation cohort, 3441 (61.8%) were female, and the mean (SD) age was 52 (18.5) years. In the 3 validation cohorts, 2265 of 3726 (60.7%), 842 of 1548 (54.4%), and 970 of 1669 (58.1%) were female, and the mean (SD) age was 52 (18.5), 59 (18.7), and 53 (19.8) years, respectively. Characteristics of the study samples are presented in Table 2.

4PEPS Derivation

A univariate analysis found a statistical association with PE diagnosis for 21 variables. All of these were included in the multivariate regression. In addition, we included the variable of estrogenic treatment since this criterion is present in the PERC strategy.10 In the multivariate model, age of 65 to 80 years or older than 80 years, pulse rate of 80 to 100 beats per minute, systolic arterial pressure, hemoptysis, cancer, chronic cardiac failure, and pregnancy or post partum were not independently associated with PE. The remaining 13 variables were included in the final model, and we assigned points for each of them according to their regression coefficient. Table 3 represents the final model (4PEPS).

The PE prevalence by 4PEPS and the distribution of 4PEPS in the derivation cohort are presented in the Figure and the eTable in the Supplement. According to the predefined cutoff values, a 4PEPS less than 0 corresponds to a very low CPP (less than 2%), a 4PEPS of 0 to 5 corresponds to a low CPP (less than 20%), a 4PEPS of 6 to 12 corresponds to a moderate CPP (less than 65%), and a 4PEPS greater than 12 corresponds to a high CPP (65% or greater) (Table 3). PE prevalence in the very low category was 1.1% (95% CI, 0.6-1.6); low category, 6.2% (95% CI, 5.3-7.1); intermediate category, 31.3% (95% CI, 28.6-34.1); and high category, 73.6% (95% CI, 65.2-82.0).

4PEPS Validation

For the 3 validation cohorts, the PE prevalence by 4PEPS and the distribution of the 4PEPS are presented in the Figure and the eTable in the Supplement. In the internal validation cohort, the AUC was 0.83 (95% CI, 0.81-0.85). In the first and second external validation cohort, the AUCs were 0.79 (95% CI, 0.76-0.82) and 0.78 (95% CI, 0.74-0.81), respectively. The AUCs and the degree of concordance between the observed and predicted prevalence are presented in the eFigure in the Supplement.

4PEPS Strategy Validation

When the 4PEPS strategy was retrospectively applied in the first and second external validation cohorts, the false-negative rates were 11 of 1548 (0.71%; 95% CI, 0.37-1.23) and 14 of 1570 (0.89%; 95% CI, 0.53-1.49), respectively. No fatal PE or high-risk hemodynamically unstable PE were observed, and 3 of 11 false-negative VTEs in the high-prevalence cohort and 3 of 14 false-negative VTEs in the moderate-prevalence cohort were subsegmental PE. The upper limit of the 95% CI of the false-negative rate was less than the predefined cutoff value to consider the 4PEPS strategy as safe in the first (1.93%) and second (1.88%) external validation cohorts. Similar results were observed in the sensitivity analyses considering missing variables of 4PEPS as negative (high-prevalence cohort: 11 of 1687; false-negative rate, 0.65%; 95% CI, 0.34-1.13; moderate-prevalence cohort: 14 of 1655; false-negative rate, 0.85%; 95% CI, 0.50-1.61).

Compared with the standard strategy (CPP assessed using the revised Geneva score, D-dimer measurement with 0.5 μg/mL as the cutoff value), applying 4PEPS would have decreased the CTPA rate (external validation cohort 1: 46% vs 68%; difference, −22%; 95% CI, −26 to −19; external validation cohort 2: 32% vs 51%; difference, −19%; 95% CI, −22 to −16). Table 410-13 compares the different strategies proposed to reduce diagnostic testing.

Discussion

Using 5 multicenter cohorts regrouping more than 12 000 patients suspected of PE, we were able to derive and validate a new clinical probability score to help physicians diagnose PE and safely decrease diagnostic imaging. Applying the 4PEPS diagnostic strategy retrospectively to 2 external validation cohorts, the rate of false-negative tests was below 1%, and the 4PEPS strategy performed better than all previously proposed strategies in terms of reducing imaging testing.

Overuse of CTPA for suspected PE is an important concern.3 There is increasing evidence that CTPA is frequently used inappropriately in patients for whom the benefits (probability of PE diagnosis and avoiding a PE complication) are outweighed by the risks (probability of a false-positive result, complication of anticoagulation, short-term or long-term adverse effect of CTPA).2,3,27 The first strategy developed to deal with overtesting was the PERC strategy.10,15 This can be used for patients for whom the clinician has already established a low clinical probability of PE based on an implicit gestalt impression. A negative PERC strategy finding defines a subgroup of these patients with a very low PE prevalence (less than 2%) allowing PE to be ruled out without any testing.15 However, applied alone or in association with the revised Geneva score, the PERC strategy appears to be insufficiently reliable.28,29 The 4PEPS strategy may not have such a limitation.

Another means to limit CTPA overuse is to optimize D-dimer testing. The ADJUST-PE study11 prospectively confirmed the safety and utility of an age-adjusted cutoff value for patients 50 years or older (Table 1). However, the effect of the ADJUST-PE strategy on imaging testing rates remains limited (−10.8% or −5.2% in our high-prevalence and moderate-prevalence external validation cohorts, respectively), particularly in young patients. A further proposal, based on the Bayes theorem, is to adjust the D-dimer cutoff value to the pretest probability.30 This principle was assessed in 2 recent studies, the YEARS study12 and PEGeD study.13 Both studies used 1.0 μg/mL as the D-dimer cutoff value for patients with a low CPP, and both achieved a very low overall rate of false-negative testing. Of note, the PEGeD study was the most recent study and has the lowest PE prevalence (7.4%), with 87% of patients having a low CPP.13 It should be used with caution in a population of patients with a higher PE prevalence. Indeed, recent external validation data of the PEGeD and YEARS strategies in cohorts of European patients suggest a higher failure rate.31 Moreover, since the methods of CPP assessment are different in the PERC strategy from the other strategies aiming to reduce overtesting, it is difficult to combine them.10-13 For example, to combine the PERC and PEGeD strategies, the physician may have to first assess implicit clinical probability (gestalt); second, if low, the PERC strategy; and third, if positive, the revised Wells score.10,13 The risk of misuse in clinical practice appears to be major and may have an important impact on safety. For example, although combining clinical gestalt and the PERC strategy has proven to be safe, the rate of failure when combining a low revised Geneva score and a negative PERC strategy finding is higher than 5%.28 Here lies the main benefit of 4PEPS: a single rule to guide diagnostic strategy resulting in a substantial reduction in testing, especially imaging testing.

Most of the 4PEPS criteria are included in other rules or scores for CPP assessment. Nevertheless, in our study, some potentially relevant criteria were not statistically associated with a PE diagnosis (pregnancy, history of cancer, chronic respiratory disease, hemoptysis). As the derivation database was large (n = 5588), we do not think that this is caused by a lack of power. More probably, we suppose that this result reflects the fact that physicians suspect PE at a very low threshold in patients with these characteristics.32 The first stage of the diagnostic process is deciding whether to investigate PE or not. This is why the PERC strategy needs to be combined with gestalt and why 4PEPS integrates the item PE is the most likely diagnosis. This criterion is sometimes criticized for a lack of objectivity and reproducibility. Nevertheless, it is included in the Wells score9 and YEARS strategy,12 is well-known by the ED physicians, and is easier to explain and to use than gestalt. The inclusion of factors decreasing the probability of PE diagnosis as well as factors increasing it allowed us to derive a 4-level score that rules out PE when negative. The 4PEPS calibration and accuracy of the 4PEPS appear to be at least similar to previous CPP scores for PE (Table 4).8,33

To facilitate 4PEPS implementation in clinical practice, an internet-application for smartphone and computer has been developed (https://peps.shinyapps.io/PEPS/). 4PEPS will be also incorporated in the new version of the decision-support software SPEED (Suspected Pulmonary Embolism in Emergency Departments; http://www.thrombus.fr/). We have previously shown that, compared with posters and pocket cards, such decision-support systems available on smartphones improves diagnostic decision-making and reduces the number of tests to reach a validated diagnostic decision.18 4PEPS could also be integrated in the electronic medical record for automated calculation. Using such setups, we believe that 4PEPS will be embraced by ED physicians and will lead to a substantial and safe decrease in imaging testing.

Strengths and Limitations

Our study has several strengths. We used a bayesian evidence-based medicine approach to define the prevalence limit in each CPP category, based on the predefined safety threshold and on the negative likelihood ratio of D-dimer.34 We followed a well-validated method to derive and validate the score and the recent recommendations of the International Society of Thrombosis and Hemostasis to assess the safety of the 4PEPS strategy in ruling out PE.16,22 The 5 databases of prospective multicenter international studies made it possible to define a large derivation cohort, an internal validation cohort, and 2 external validation cohorts. The results in terms of calibration and accuracy were very similar to each other, with an AUC around 80%. Finally, the safety of the 4PEPS strategy was confirmed in an external validation cohort with a moderate PE prevalence (11.7%) as well as in an external validation cohort with a high PE prevalence (21.5%). This reinforces the generalizability of our results.

Nevertheless, our study has some limitations. The studies used to derive and validate 4PEPS were all performed in ED settings and so 4PEPS may be not suitable for inpatients. Some variables were not systematically collected in these studies. They could not be included in our analyses. We also did not include patients with missing variables. However, the population for each cohort remains large, and similar results were obtained in the sensitivity analyses considering missing 4PEPS variables as negative. The score comprises 13 criteria that may be difficult to memorize, reinforcing the usefulness of an application for computer or handheld devices. Additionally, although we used clinical data from several prospective studies, we calculated this new score retrospectively. The 4PEPS strategy needs to be formally validated in a prospective implementation study.

Conclusions

In conclusion, using a bayesian approach, we derived a new 4-level clinical probability score (4PEPS) to help ED physicians make decisions regarding patients suspected of PE. The accuracy, safety, and efficacy of the 4PEPS strategy were confirmed in 2 independent external validation cohorts, one with a moderate PE prevalence and the other with a high PE prevalence. For both cohorts, applying 4PEPS resulted in a very low rate of diagnostic failure and a substantial reduction in imaging testing. It should now be tested in a formal outcome study.

Back to top
Article Information

Accepted for Publication: December 29, 2020.

Published Online: March 3, 2021. doi:10.1001/jamacardio.2021.0064

Open Access: This is an open access article distributed under the terms of the CC-BY License. © 2021 Roy P-M et al. JAMA Cardiology.

Corresponding Author: Pierre-Marie Roy, MD, PhD, Emergency Department, CHU Angers, Institut Mitovasc UMR (CNRS 6015—INSERM 1083), UNIV Angers, F-CRIN INNOVTE, 4 Rue Larrey, 49933 Angers, France (pmroy@chu-angers.fr).

Author Contributions: Dr Roy had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.

Study concept and design: Roy, Friou, Germeau, Moumneh, Penaloza.

Acquisition, analysis, or interpretation of data: Roy, Friou, Germeau, Douillet, Kline, Righini, Le Gal, Moumneh.

Drafting of the manuscript: Roy, Penaloza.

Critical revision of the manuscript for important intellectual content: All authors.

Statistical analysis: Friou, Germeau, Righini, Moumneh, Penaloza.

Administrative, technical, or material support: Germeau, Kline, Moumneh, Penaloza.

Study supervision: Roy, Penaloza.

Conflict of Interest Disclosures: Dr Roy has received grants from the French Health Ministry as well as personal fees and nonfinancial support from Bayer Health Care, Boehringer Ingelheim France, Bristol Myers Squibb, Pfizer, Aspen, Daiichi Sankyo, and Sanofi Aventis France outside the submitted work. Dr Kline has received grants from the National Institutes of Health, Stago Diagnostica, Janssen/Johnson & Johnson, and Pfizer/Bristol Myers Squibb outside the submitted work. Dr Righini has received grants from the Swiss National Research Foundation. Dr Le Gal has received grants from the French Health Ministry, Portola Pharmaceuticals, Boehringer Ingelheim, Pfizer, Bristol Myers Squibb, LEO Pharma, Daiichi Sankyo, and Bayer as well as personal fees from Bayer, Pfizer, LEO Pharma, Sanofi, and bioMérieux. Dr Moumneh has received grants from the French Health Ministry, Société Française de Médecine d’Urgence, and CanVECTOR Network outside the submitted work. Dr Penaloza has received grants from Fondation Saint Luc and Bristol Myers Squibb; grants and personal fees from Bayer; and personal fees and nonfinancial support from Boehringer Ingelheim, Daiichi Sankyo, Sanofi, Stago Diagnostica, Bristol Myers Squibb, Aspen, and Roche outside the submitted work. No other disclosures were reported.

Funding/Support: This study was conducted without specific funding. The studies that we used were supported by grants from the National Institutes of Health (grants R41HL074415 and R42HL074415, K23HL077404, and R01HL074384), the French Ministry of Health, and the Swiss National Research Foundation (grant 3200B0-105988).

Role of the Funder/Sponsor: The funders had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.

Meeting Presentation: Preliminary results were presented at the ESC Congress 2019; September 3, 2019; Paris, France.

Previous Posting: This manuscript was posted as a preprint on SSRN.

Additional Contributions: We are grateful to Céline Bugli, PhD (Université Catholique de Louvain, Brussels, Belgium), for her help in statistical analysis and Jérémie Riou, PhD (CHU Angers, UNIV Angers, Angers, France), for developing the 4PEPS internet application. None of the contributors were compensated for their work.

References
1.
Konstantinides  SV, Meyer  G, Becattini  C,  et al; ESC Scientific Document Group.  2019 ESC guidelines for the diagnosis and management of acute pulmonary embolism developed in collaboration with the European Respiratory Society (ERS).   Eur Heart J. 2020;41(4):543-603. doi:10.1093/eurheartj/ehz405PubMedGoogle ScholarCrossref
2.
Kline  JA, Garrett  JS, Sarmiento  EJ, Strachan  CC, Courtney  DM.  Over-testing for suspected pulmonary embolism in American emergency departments: the continuing epidemic.   Circ Cardiovasc Qual Outcomes. 2020;13(1):e005753. doi:10.1161/CIRCOUTCOMES.119.005753PubMedGoogle Scholar
3.
Wang  RC, Miglioretti  DL, Marlow  EC,  et al.  Trends in imaging for suspected pulmonary embolism across US health care systems, 2004 to 2016.   JAMA Netw Open. 2020;3(11):e2026930. doi:10.1001/jamanetworkopen.2020.26930PubMedGoogle Scholar
4.
Dobler  CC.  Overdiagnosis of pulmonary embolism: definition, causes and implications.   Breathe (Sheff). 2019;15(1):46-53. doi:10.1183/20734735.0339-2018PubMedGoogle ScholarCrossref
5.
Wiener  RS, Schwartz  LM, Woloshin  S.  Time trends in pulmonary embolism in the United States: evidence of overdiagnosis.   Arch Intern Med. 2011;171(9):831-837. doi:10.1001/archinternmed.2011.178PubMedGoogle ScholarCrossref
6.
Mitchell  AM, Jones  AE, Tumlin  JA, Kline  JA.  Prospective study of the incidence of contrast-induced nephropathy among patients evaluated for pulmonary embolism by contrast-enhanced computed tomography.   Acad Emerg Med. 2012;19(6):618-625. doi:10.1111/j.1553-2712.2012.01374.xPubMedGoogle ScholarCrossref
7.
Niemann  T, Zbinden  I, Roser  HW, Bremerich  J, Remy-Jardin  M, Bongartz  G.  Computed tomography for pulmonary embolism: assessment of a 1-year cohort and estimated cancer risk associated with diagnostic irradiation.   Acta Radiol. 2013;54(7):778-784. doi:10.1177/0284185113485069PubMedGoogle ScholarCrossref
8.
Le Gal  G, Righini  M, Roy  P-M,  et al.  Prediction of pulmonary embolism in the emergency department: the revised Geneva score.   Ann Intern Med. 2006;144(3):165-171. doi:10.7326/0003-4819-144-3-200602070-00004PubMedGoogle ScholarCrossref
9.
Kearon  C, Ginsberg  JS, Douketis  J,  et al; Canadian Pulmonary Embolism Diagnosis Study (CANPEDS) Group.  An evaluation of D-dimer in the diagnosis of pulmonary embolism: a randomized trial.   Ann Intern Med. 2006;144(11):812-821. doi:10.7326/0003-4819-144-11-200606060-00007PubMedGoogle ScholarCrossref
10.
Kline  JA, Mitchell  AM, Kabrhel  C, Richman  PB, Courtney  DM.  Clinical criteria to prevent unnecessary diagnostic testing in emergency department patients with suspected pulmonary embolism.   J Thromb Haemost. 2004;2(8):1247-1255. doi:10.1111/j.1538-7836.2004.00790.xPubMedGoogle ScholarCrossref
11.
Righini  M, Van Es  J, Den Exter  PL,  et al.  Age-adjusted D-dimer cutoff levels to rule out pulmonary embolism: the ADJUST-PE study.   JAMA. 2014;311(11):1117-1124. doi:10.1001/jama.2014.2135PubMedGoogle ScholarCrossref
12.
van der Hulle  T, Cheung  WY, Kooij  S,  et al; YEARS study group.  Simplified diagnostic management of suspected pulmonary embolism (the YEARS study): a prospective, multicentre, cohort study.   Lancet. 2017;390(10091):289-297. doi:10.1016/S0140-6736(17)30885-1PubMedGoogle ScholarCrossref
13.
Kearon  C, de Wit  K, Parpia  S,  et al; PEGeD Study Investigators.  Diagnosis of pulmonary embolism with D-dimer adjusted to clinical probability.   N Engl J Med. 2019;381(22):2125-2134. doi:10.1056/NEJMoa1909159PubMedGoogle ScholarCrossref
14.
Singh  B, Mommer  SK, Erwin  PJ, Mascarenhas  SS, Parsaik  AK.  Pulmonary Embolism Rule-out Criteria (PERC) in pulmonary embolism—revisited: a systematic review and meta-analysis.   Emerg Med J. 2013;30(9):701-706. doi:10.1136/emermed-2012-201730PubMedGoogle ScholarCrossref
15.
Kline  JA, Courtney  DM, Kabrhel  C,  et al.  Prospective multicenter evaluation of the Pulmonary Embolism Rule-out Criteria.   J Thromb Haemost. 2008;6(5):772-780. doi:10.1111/j.1538-7836.2008.02944.xPubMedGoogle ScholarCrossref
16.
Dronkers  CEA, van der Hulle  T, Le Gal  G,  et al; Subcommittee on Predictive and Diagnostic Variables in Thrombotic Disease.  Towards a tailored diagnostic standard for future diagnostic studies in pulmonary embolism: communication from the SSC of the ISTH.   J Thromb Haemost. 2017;15(5):1040-1043. doi:10.1111/jth.13654PubMedGoogle ScholarCrossref
17.
Roy  P-M, Meyer  G, Vielle  B,  et al; EMDEPU Study Group.  Appropriateness of diagnostic management and outcomes of suspected pulmonary embolism.   Ann Intern Med. 2006;144(3):157-164. doi:10.7326/0003-4819-144-3-200602070-00003PubMedGoogle ScholarCrossref
18.
Roy  PM, Durieux  P, Gillaizeau  F,  et al.  A computerized handheld decision-support system to improve pulmonary embolism diagnosis: a randomized trial.   Ann Intern Med. 2009;151(10):677-686. doi:10.7326/0003-4819-151-10-200911170-00003PubMedGoogle ScholarCrossref
19.
Righini  M, Le Gal  G, Aujesky  D,  et al.  Diagnosis of pulmonary embolism by multidetector CT alone or combined with venous ultrasonography of the leg: a randomised non-inferiority trial.   Lancet. 2008;371(9621):1343-1352. doi:10.1016/S0140-6736(08)60594-2PubMedGoogle ScholarCrossref
20.
Penaloza  A, Soulié  C, Moumneh  T,  et al.  Pulmonary Embolism Rule-out Criteria (PERC) rule in European patients with low implicit clinical probability (PERCEPIC): a multicentre, prospective, observational study.   Lancet Haematol. 2017;4(12):e615-e621. doi:10.1016/S2352-3026(17)30210-7PubMedGoogle ScholarCrossref
21.
West  J, Goodacre  S, Sampson  F.  The value of clinical features in the diagnosis of acute pulmonary embolism: systematic review and meta-analysis.   QJM. 2007;100(12):763-769. doi:10.1093/qjmed/hcm113PubMedGoogle ScholarCrossref
22.
Wasson  JH, Sox  HC, Neff  RK, Goldman  L.  Clinical prediction rules. applications and methodological standards.   N Engl J Med. 1985;313(13):793-799. doi:10.1056/NEJM198509263131306PubMedGoogle ScholarCrossref
23.
Hosmer  DW, Lemeshow  S.  Applied Logistic Regression. Wiley; 1989.
24.
Harrell  FE  Jr, Lee  KL, Mark  DB.  Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors.   Stat Med. 1996;15(4):361-387. doi:10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4PubMedGoogle ScholarCrossref
25.
Fagan  TJ.  Letter: nomogram for Bayes theorem.   N Engl J Med. 1975;293(5):257. doi:10.1056/NEJM197507312930513PubMedGoogle Scholar
26.
DeLong  ER, DeLong  DM, Clarke-Pearson  DL.  Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach.   Biometrics. 1988;44(3):837-845. doi:10.2307/2531595PubMedGoogle ScholarCrossref
27.
Hutchinson  BD, Navin  P, Marom  EM, Truong  MT, Bruzzi  JF.  Overdiagnosis of pulmonary embolism by pulmonary CT angiography.   AJR Am J Roentgenol. 2015;205(2):271-277. doi:10.2214/AJR.14.13938PubMedGoogle ScholarCrossref
28.
Penaloza  A, Verschuren  F, Dambrine  S, Zech  F, Thys  F, Roy  PM.  Performance of the Pulmonary Embolism Rule-out Criteria (the PERC rule) combined with low clinical probability in high prevalence population.   Thromb Res. 2012;129(5):e189-e193. doi:10.1016/j.thromres.2012.02.016PubMedGoogle ScholarCrossref
29.
Hugli  O, Righini  M, Le Gal  G,  et al.  The Pulmonary Embolism Rule-out Criteria (PERC) rule does not safely exclude pulmonary embolism.   J Thromb Haemost. 2011;9(2):300-304. doi:10.1111/j.1538-7836.2010.04147.xPubMedGoogle ScholarCrossref
30.
Kline  JA, Hogg  MM, Courtney  DM, Miller  CD, Jones  AE, Smithline  HA.  D-dimer threshold increase with pretest probability unlikely for pulmonary embolism to decrease unnecessary computerized tomographic pulmonary angiography.   J Thromb Haemost. 2012;10(4):572-581. doi:10.1111/j.1538-7836.2012.04647.xPubMedGoogle ScholarCrossref
31.
Eddy  M, Robert-Ebadi  H, Richardson  L,  et al.  External validation of the YEARS diagnostic algorithm for suspected pulmonary embolism.   J Thromb Haemost. 2020. doi:10.1111/jth.15083PubMedGoogle Scholar
32.
Kline  JA, Richardson  DM, Than  MP, Penaloza  A, Roy  PM.  Systematic review and meta-analysis of pregnant patients investigated for suspected pulmonary embolism in the emergency department.   Acad Emerg Med. 2014;21(9):949-959. doi:10.1111/acem.12471PubMedGoogle ScholarCrossref
33.
Penaloza  A, Verschuren  F, Meyer  G,  et al.  Comparison of the unstructured clinician gestalt, the Wells score, and the revised Geneva score to estimate pretest probability for suspected pulmonary embolism.   Ann Emerg Med. 2013;62(2):117-124.e2.PubMedGoogle ScholarCrossref
34.
Carpenter  CR, Raja  AS.  Arming the bayesian physician to rule out pulmonary embolism: using evidence-based diagnostics to combat overtesting.   Acad Emerg Med. 2014;21(9):1036-1038. doi:10.1111/acem.12450PubMedGoogle ScholarCrossref
×