Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
here is to obtain more true signals that are ranked higher. Our treatments, conditions, and potential risk factors. They are also
work is motivated by the common belief that signals replicated not restricted to patients who experience ADRs. Consequently,
in multiple data sources or in independent studies represent they offer several advantages that may be used to complement
increased confidence in the findings, and thus are more likely to SRS-based surveillance, from the perspective of both unbiased
be true. Figure 1 provides a pictorial illustration of the claim reporting and signal strengthening or confirmation. Importantly,
evaluated and the strategy being proposed. they also offer the potential for active surveillance,20 22 and
several retrospective studies have already demonstrated that the
BACKGROUND Vioxx case could have been identified earlier using this type of
The current mainstay in post-marketing drug safety surveillance data.23–26 That said, research into the use of electronic health-
is spontaneous reporting systems (SRSs) such as the FDA’s care data in drug safety is in its early stages and has not yet been
AERS, which is a database of over 4 million reports of suspected integrated into routine pharmacovigilance.7 20 22 To date, elec-
ADRs submitted to the FDA by healthcare professionals, consu- tronic healthcare data have been used in pharmacovigilance pre-
mers, and pharmaceutical companies. The AERS reports are dominantly to confirm signals originating from SRSs on an ad
reviewed by FDA drug safety evaluators and analyzed using hoc basis.27 28
quantitative signal-detection algorithms to identify signals of Traditionally, ADR signal-detection methodologies have
potentially new ADRs that warrant further investigation.10 13 focused on data from a single source. An emerging belief in
AERS communicates genuine health concerns, covers large pharmacovigilance research is that combining information across
populations, and is accessible for analysis. Since its inception, data sources can lead to more effective and accurate ADR dis-
AERS has supported regulatory decisions for ∼6000 marketed covery. Depending on the data sources used, the manner in
drugs.14 Notwithstanding, AERS suffers from a set of limitations which they are combined, and the scientific function they are
that may hinder its effective use15 16: notably, under-reporting, designed to perform, it is generally believed that the resulting
where only ∼10% of serious ADRs are reported,17 and over- system would either lead to increased evidence or statistical
reporting, where drugs with known, and publicized, ADRs are power of findings, or would facilitate new discoveries not pos-
more likely to be reported than other drugs. Other limiting sible with either source separately. In recent studies, Cami
factors include misattributed drug–event combinations, missing et al29 combined information about known ADRs with chemical
and incomplete data, duplicated reporting, unspecified causal and ontological information to create a predictive system of
links, and lack of exposure information.15 16 novel ADRs. Matthews et al30 combined AERS and literature
Recent drug safety events, such as the Vioxx case causing an findings to build quantitative structure–activity relationship
estimated 88 000 episodes of myocardial infarction,18 have (QSAR) models for several serious ADRs. Vilar et al11 proposed
highlighted the need to identify new data sources and improved an approach to prioritize/enrich ADR signals generated from
analytic methods to create a more effective pharmacovigilance AERS based on chemical similarity, whereas Shetty and Dalal12
system.7 19–21 Some of these new developments rely on the proposed an approach to achieve the same objective by relating
expanded secondary use of electronic healthcare data such as AERS signals to ADR signals generated by mining the literature.
EHRs and administrative claims. Unlike spontaneous reports, To our knowledge, the study presented herein is the first that
electronic healthcare data represent routine clinical care aimed to leverage EHRs and explicitly combine them with spon-
recorded over long periods of time. As such, they contain a taneous reports to facilitate uninformed, hypothesis-free signal
more complete record of the patient’s medical history, detection. Our study was based on over 4 million AERS reports
Figure 1 A comparison of two signal-detection strategies: the proposed system, AERS∩HER, which includes signals common to both AERS and
EHRs, and its comparator, AERS. The claim being made is that the top set of K ranked signals retrieved from the AERS∩EHR system contain more
true positives (true ADRs) than the set of top K signals retrieved from AERS when used by itself (currently the standard approach in signal
detection). During evaluation, the two sets of K signals will be compared with respect to a gold standard. The pattern of circle overlaps displayed
reflects typical outcomes, where, for most values of K, the signals produced by AERS∩EHR will be richer with true ADRs. ADR, adverse drug
reaction; AERS, adverse event reporting system; EHR, electronic health record.
and 1.2 million EHR narratives processed using natural lan- review board approval. The clinical narratives—corresponding
guage processing (NLP) and linked with structured EHR data. to discharge summaries, admission notes, and outpatient office
Well-established ADR signal-detection methodologies were used visits—were processed using the NLP system, MedLEE,31 to
to generate hundreds of signals from each data source, related extract medications, diseases, and signs and symptoms, and were
to three serious adverse reactions: rhabdomyolysis, acute pan- mapped to UMLS (2011AA) concept unique identifiers (CUIs).
creatitis, and QT prolongation. A reference standard consisting Temporal information corresponding to admission, discharge,
of over 600 well-established and plausible ADRs was created to and visit dates was also extracted. Laboratory test data directly
evaluate the proposed approach against a comparator. Figure 2 available in structured form and based on internal NYPH codes
provides an outline of the process. were linked to each of the narratives, and, together with the
data dimensions extracted from the narratives, formed the set of
MATERIALS AND METHODS clinical variables used for signal detection. The full dataset avail-
AERS able for analysis consisted of 7 years (2004–2010) of data, ∼1.2
The full set of AERS reports was made available by Oracle’s million narratives, and 178 000 patients.
Health Sciences division, a supplier of database and statistical
software for ADR analysis. The data were preprocessed to Events investigated
remove duplicate reports and correct terminological errors, and We concentrated on signals corresponding to the adverse reac-
consisted of over 4 million reports submitted to the FDA in the tions, rhabdomyolysis, acute pancreatitis, and QT prolongation,
years 1968–2010Q3. which are part of a set of serious adverse reactions determined
to be important and currently under active surveillance.26 32–34
EHRs In the AERS portion of the study, the adverse events studied
Both unstructured EHR data—clinical narratives processed (and report populations) were identified on the basis of
using NLP and structured data - laboratory tests from New York MedDRA (V.13.1) terms at the ‘preferred term’ level.35 In the
Presbyterian Hospital (NYPH), were used after institutional EHR portion of the investigation, outcomes (and patient
cohorts) for the events rhabdomyolysis and acute pancreatitis Prior to the application of DPA to the EHR data, we excluded
were identified using diagnostic standards based on laboratory all drug–outcome pairs mentioned in the EHRs for which the
test values.36 37 QT prolongation outcomes were identified on outcome under study was recorded prior to drug administration
the basis of terms appearing in the EHR narratives, and mapped and pairs where the drug is a known indication for the outcome
to any of a predetermined set of UMLS CUIs related to QT pro- (filtering step in figure 2). This was done to remove drug–
longation (descendants of these CUIs were not considered). outcome pairs that cannot be causally related as ADRs.
Table 1 specifies the exact event/outcome definitions used. Although theoretically possible, the application of MGPS to the
EHR data was not feasible, as it would have entailed identifying
Drug names and analyzing a substantially larger amount of adverse events to
To reduce naming variability and facilitate comparability, drug calibrate MGPS’ model (shrinkage) parameters. Identification
names were standardized according to their generic names at and definition of adverse events based on EHR data is not as
the ‘main ingredient’ level of specificity. In the EHR case, trade simple as with AERS (events are directly specified).43
names were resolved using the ‘tradename_of ’ relation defined Consequently, for the EHR portion of this work, we adopted a
in RxNorm. The main ingredient was resolved on the basis of a different DPA approach based on computing OR scores (see
manually created ‘main component/ingredient’ table. In the supplementary online table S1). To qualify signals in this frame-
AERS case, the required naming format was made available by work, an association had to meet two criteria: (1) OR equal to
Oracle as part of the data. or greater than the prespecified cut-off; (2) using a one-sided
Fisher exact test with a Bonferroni correction, the null hypoth-
esis of no association had to be rejected at the 5% significance
Signal generation
level. The Bonferroni correction was used to reduce the prob-
A family of methods commonly referred to as disproportionality
ability of identifying significant associations by chance while
analysis (DPA)10 38 are currently the most widely used approach
performing tests for multiple associations based on the same
for automated ADR signal detection in pharmacovigi-
data.
lance.5 6 10 13 39 DPA is based on frequency analysis of 2×2
contingency tables to estimate surrogate measures of statistical
Reference standard
association between specific drug–event (outcome) combina-
A well-accepted database of all currently known ADRs does not
tions.15 DPA methodologies differ by the exact measures that
exist at this time. As a result, we compiled a ‘reference standard’
are used and the statistical adjustments they may apply to
of drugs associated with each of the targeted ADRs. The refer-
account for low counts. They are all based on the entries of
ence standard was manually curated by a pharmacological
online supplementary table S1 to derive association/dispropor-
expert and reviewed by three physicians. It was divided into
tionality measures, which is computed for each drug–event pair
two classes, with the following inclusion criteria:
under consideration.
1. Established—drugs confirmed to be causally related to the
Conforming to standard practice in AERS-based analysis,5 40
ADR.
we used the MGPS41 DPA algorithm (via Oracle’s Empirica
Criteria: drug label warning, a Micromedex44 listing (a
Signal V.7.342) to generate ADR signals in the AERS portion of
trusted medical ADR reference), literature reviews of well-
this study. MGPS is based on a Bayesian framework which
established ADRs, and other published reports having conclusive
accounts for the uncertainty in a disproportionality measure
evidence such as laboratory data, clinical data, or a rechallenge/
associated with small counts by ‘shrinking’ the measure towards
dechallenge study.
the baseline case of no association, through an amount that is
2. Plausible—drugs that have a high likelihood of being
proportional to the variability of the disproportionality statistic.
causative.
In MGPS, a statistical association is quantified by a measure
Criteria: analysis of one or more case reports mentioning the
called EBGM (empirical Bayes geometric mean), corresponding
drug as a potential cause for a certain adverse event.
to a Bayesian version of the relative reporting ratio (RR)10
Although not confirmed, the second class represents poten-
measure (see online supplementary table S1). Typically, the
tially emerging or novel ADRs, which is the more interesting
EB05 measure, which corresponds to the lower 5th percentile
class for real-world pharmacovigilance.13 Online supplementary
of the posterior RR distribution, is used instead for extra conser-
table S2 provides a representative sample of drugs and totals
vatism. To qualify a signal, EBGM or EB05 must be larger than
included in the reference standard.
a prespecified cut-off.
Evaluation
Given that association scores have been computed for all drug–
Table 1 AERS and EHR—event and outcome definitions event pairs under study for both AERS and EHR data, the pro-
AERS EHR posed system was evaluated using performance metrics based on
Event MedDRA PT event definitions Outcome definitions ‘precision at K’, which is commonly used in the area of informa-
tion retrieval for systems that return ranked sequences of results,
Pancreatitis Pancreatitis acute Lab tests: amylase
with preference given to relevant results that are ranked
>300 U/l or lipase >120 U/
l higher.45 46
Rhabdomyolysis Rhabdomyolysis, blood creatine Lab test: 5× normal levels 1. An initial set of signals for analysis was created from
phosphokinase MM increased of creatine kinase AERS and the EHRs using EB05 and OR cut-offs, respect-
QT Long QT syndrome, ECG QT UMLS codes: C0023976, ively. The requirement that the evaluation starts with an
prolonged, Torsade de pointes, C0151878, C0743431, initial set of meaningful signals is essential. Low associa-
ECG QT interval abnormal, C0855333, C1560305 tions scores are typically meaningless and discarded in
ventricular tachycardia
signal detection. Associations whose score is <1 capture
AERS, adverse event reporting system; EHR, electronic health record; PT, preferred negative correlations ( potentially beneficial drug effects)
term.
and thus should also be discarded in ADR analysis.
Table 2 Averaged performance statistics comparison of the AERS and AERS∩EHR systems across reference standard classes
Precision at K Recall at K F-measure
Reference standard class AERS∩EHR (%) AERS (%) AERS∩EHR (%) AERS (%) AERS∩EHR (%) AERS (%) Relative improvement (%)
Established + plausible 85 57 20 15 30 22 70
Established 58 50 24 23 31 29 31
Plausible 27 7 15 5 17 8 267
The performance statistics were averaged across the full range of K (signals selected) using the arithmetic average. The relative improvement is defined as the performance difference
between the AERS∩EHR and AERS systems divided by the AERS performance for each performance statistic, and was averaged in the same manner as the other performance statistics.
AERS, adverse event reporting system; EHR, electronic health record.
combine them with spontaneous reports in a systematic manner 22 Madigan D, Ryan P. What can we really learn from observational studies?: the need
for drug safety purposes. Our findings provide initial promising for empirical assessment of methodology for active drug safety surveillance and
comparative effectiveness research. Epidemiology 2011;22:629–31.
evidence that replicated signaling in AERS and EHRs can 23 Simpson SE. Self-controlled methods for postmarketing drug safety surveillance in
enhance signal detection under certain operating scenarios and large-scale longitudinal data. Ph.D. dissertation, Columbia University, 2012;3(Suppl
objectives. Such a system is not intended to replace, but rather 1):S5.
augment, the portfolio of existing approaches. Its practical value 24 LePendu P, Iyer SV, Fairon C, et al. Annotation analysis for testing drug safety
signals using unstructured clinical notes. J Biomed Semantics 2012;
has been demonstrated by the discovery of a promising new
3(Suppl 1):S5.
ADR candidate. Additional EHR data would be required to 25 Brownstein JS, Sordo M, Kohane IS, et al. The tell-tale heart: population-based
further evaluate the predictive capacity and limits of this system. surveillance reveals an association of rofecoxib and celecoxib with myocardial
infarction. PLoS One 2007;2:e840.
Acknowledgements This research was supported in part by grants 26 Brown JS, Kulldorff M, Chan KA, et al. Early detection of adverse drug events
1R01LM010016, 3R01LM010016-01S1, 3R01LM010016-02S1, and within population-based health networks: application of sequential testing methods.
5T15-LM007079-19(HS) from the National Library of Medicine. We thank Lyudmila Pharmacoepidemiol Drug Saf 2007;16:1275–84.
Ena for assistance in extracting and preprocessing the EHR data, and Paea LePendu 27 Platt R, Carnahan RM, Brown JS, et al. The U.S. Food and Drug Administration’s
and Anna Bauer-Mehren from Stanford Center for Biomedical Informatics Research Mini-Sentinel program: status and direction. Pharmacoepidemiology and Drug Safety
for valuable comments, and extend our gratitude to Oracle’s Health Sciences 2012;21:1–8.
Division for supplying us with the AERS data and analysis software. 28 Tatonetti NP, Denny JC, Murphy SN, et al. Detecting drug interactions from
Funding National Library of Medicine. adverse-event reports: interaction between paroxetine and pravastatin increases
blood glucose levels. Clin Pharmacol Ther 2011;90:133–42.
Competing interests None. 29 Cami A, Arnold A, Manzi S, et al. Predicting adverse drug events using
Patient consent Obtained. pharmacological network models. Sci Transl Med 2011;3:114ra127.
30 Matthews EJ, Ursem CJ, Kruhlak NL, et al. Identification of structure-activity
Provenance and peer review Not commissioned; externally peer reviewed. relationships for adverse effects of pharmaceuticals in humans: part B. Use of (Q)
SAR systems for early detection of drug-induced hepatobiliary and urinary tract
REFERENCES toxicities. Regul Toxicol Pharmacol 2009;54:23–42.
1 Schneeweiss S, Hasford J, Gottler M, et al. Admissions caused by adverse drug 31 Friedman C, Shagina L, Lussier Y, et al. Automated encoding of clinical documents
events to internal medicine and emergency departments in hospitals: a longitudinal based on natural language processing. J Am Med Inform Assoc 2004;11:
population-based study. Eur J Clin Pharmacol 2002;58:285–91. 392–402.
2 Classen DC, Pestotnik SL, Evans RS, et al. Adverse drug events in hospitalized 32 Trifiro G, Pariente A, Coloma PM, et al. Data mining on electronic health record
patients. Excess length of stay, extra costs, and attributable mortality. JAMA databases for signal detection in pharmacovigilance: which events to monitor?
1997;277:301–6. Pharmacoepidemiol Drug Saf 2009;18:1176–84.
3 Bates DW, Spell N, Cullen DJ, et al. The costs of adverse drug events in hospitalized 33 Ramirez E, Carcas AJ, Borobia AM, et al. A pharmacovigilance program from
patients. Adverse Drug Events Prevention Study Group. JAMA 1997;277:307–11. laboratory signals for the detection and reporting of serious adverse drug reactions
4 Adverse Event Reporting System. http://www.fda.gov/cder/aers/default.htm (accessed in hospitalized patients. Clin Pharmacol Ther 2010;87:74–86.
Jul 2007). 34 Olivier P, Montastruc JL. The nature of the scientific evidence leading to drug
5 Szarfman A, Machado SG, O’Neill RT. Use of screening algorithms and computer withdrawals for pharmacovigilance reasons in France. Pharmacoepidemiol Drug Saf
systems to efficiently signal higher-than-expected combinations of drugs and events 2006;15:808–12.
in the US FDA’s spontaneous reports database. Drug Saf 2002;25:381–92. 35 Medical Dictionary for Regulatory Activities (MedDRA). http://www.meddramsso.
6 Almenoff JS, Pattishall EN, Gibbs TG, et al. Novel statistical tools for monitoring com/ (accessed Jul 2012)
the safety of marketed drugs. Clin Pharmacol Ther 2007;82:157–66. 36 Knochel JP. Rhabdomyolysis and Myoglobinuria. Ann Rev Med 1982;33:435–43.
7 Platt R, Wilson M, Chan KA, et al. The new sentinel network—improving the 37 Munoz A, Katerndahl DA. Diagnosis and management of acute pancreatitis. Am
evidence of medical-product safety. N Engl J Med 2009;361:645–7. Fam Physician 2000;62:164–74.
8 Schneeweiss S. A basic study design for expedited safety signal evaluation based on 38 Zorych I, Madigan D, Ryan P, et al. Disproportionality methods for
electronic healthcare data. Pharmacoepidemiol Drug Safety 2010;19:858–68. pharmacovigilance in longitudinal observational databases. Stat Methods Med Res
9 Hauben M, Bate A. Data mining in drug safety: side effects of drugs essay. In: 2011. doi:10.1177/0962280211403602
Aronson JK, ed. Side effects of drugs annual. 2007, Vol. 29:xxxiii–xlvi. 39 Lehman HP, Chen J, Gould AL, et al. An evaluation of computer-aided
10 Hauben M, Madigan D, Gerrits CM, et al. The role of data mining in disproportionality analysis for post-marketing signal detection. Clin Pharmacol Ther
pharmacovigilance. Expert Opin Drug Saf 2005;4:929–48. 2007;82:173–80.
11 Vilar S, Harpaz R, Chase HS, et al. Facilitating adverse drug event detection in 40 Szarfman A. Safety Data Mining. FDA Advisory Committee Meeting Briefing
pharmacovigilance databases using molecular structure similarity: application to Document. 2006. http://www.fda.gov/ohrms/dockets/as/06/briefing/2006–4266b1–
rhabdomyolysis. J Am Med Inform Assoc 2011;18:i73–80. 02-06-FDA-appendic-f.pdf (accessed Dec 2011).
12 Shetty KD, Dalal SR. Using information mining of the medical literature to improve 41 DuMouchel W. Bayesian data mining in large frequency tables, with an application
drug safety. J Am Med Inform Assoc 2011;18:668–74. to the FDA Spontaneous Reporting System. Am Stat 1999;53:177–90.
13 Hauben M, Bate A. Decision support methods for the detection of adverse events in 42 http://www.oracle.com/us/industries/life-sciences/
post-marketing data. Drug Discov Today 2009;14:343–57. health-sciences-empirica-signal-364243.html (accessed Jul 2012).
14 Wysowski DK, Swartz L. Adverse drug event surveillance and drug withdrawals in 43 Stang PE, Ryan PB, Dusetzina SB, et al. Health outcomes of interest in
the United States, 1969–2002—The importance of reporting suspected reactions. observational data: issues in identifying definitions in the literature. Health
Arch Intern Med 2005;165:1363–9. Outcomes Res Med 2012;3:e37–44.
15 Bate A, Evans SJ. Quantitative signal detection using spontaneous ADR reporting. 44 Micromedex® Healthcare Series (Internet database). Greenwood Village, Colo:
Pharmacoepidemiol Drug Saf 2009;18:427–36. Thomson Reuters (Healthcare) Inc. Updated periodically. 2010.
16 Stephenson W, Hauben M. Data mining for signals in spontaneous reporting 45 Baeza-Yates R, Ribeiro-Neto B. Modern information retrieval. Addison Wesley,
databases: proceed with caution. Pharmacoepidemiol Drug Saf 2007;16:359–65. 1999.
17 Wadman M. News feature: strong medicine. Nat Med 2005;11:465–6. 46 Liu L, Ozsu MT. Encyclopedia of database systems. Springer, 2009.
18 Graham DJ, Campen D, Hui R, et al. Risk of acute myocardial infarction and sudden 47 Deshpande G, Gogolak V, Sheila WS. Data mining in drug safety: review of
cardiac death in patients treated with cyclo-oxygenase 2 selective and non-selective published threshold criteria for defining signals of disproportionate reporting. Pharm
non-steroidal anti-inflammatory drugs: nested case-control study. Lancet Med 2010;24:37–43.
2005;365:475–81. 48 Bauters T, Mondelaers V, Robays H, et al. Methemoglobinemia and hemolytic
19 Avorn J, Schneeweiss S. Managing drug-risk information—what to do with all those anemia after rasburicase administration in a child with leukemia. Int J Clin Pharm
new numbers. N Engl J Med 2009;361:647–9. 2011;33:58–60.
20 Stang PE, Ryan PB, Racoosin JA, et al. Advancing the science for active surveillance: 49 Ryan P. Review of Observational Analysis Methods. White Paper. 2009. http://
rationale and design for the observational medical outcomes partnership. Ann omop.fnih.org/sites/default/files/OMOP%20Methods%20points-to-consider%
Intern Med 2010;153:600–W206. 2012feb2009_post_0.pdf (accessed Jul 2012).
21 McClellan M. Drug safety reform at the FDA—pendulum swing or systematic 50 Observational Medical Outcomes Partnership (OMOP). http://omop.fnih.org/
improvement. N Engl J Med 2007;356:1700–2. (accessed Jul 2012).