Biomedical Informatics
Biomedical Informatics
Informatics
Computer Applications in
Health Care and Biomedicine
Edward H. Shortliffe
James J. Cimino
Editors
Michael F. Chiang
Co-Editor
Fifth Edition
123
Biomedical Informatics
Edward H. Shortliffe • James J. Cimino
Editors
Michael F. Chiang
Co-Editor
Biomedical
Informatics
Computer Applications in Health Care
and Biomedicine
5th Edition
Editors
Edward H. Shortliffe James J. Cimino
Biomedical Informatics Informatics Institute
Columbia University University of Alabama at
New York, NY, USA Birmingham
Birmingham, AL, USA
Co-Editor
Michael F. Chiang
National Eye Institute
National Institutes of Health
Bethesda, MD, USA
Foreword
You have taken the first step in exploring these frontiers by picking up
this book. Enjoy!
References
1. IDC White Paper, The Digitization of the World from Edge to Core, #US44413318
November 2018 7 https://www.seagate.com/files/www-content/our-story/trends/
files/idc-seagate-dataage-whitepaper.pdf
2. Vamathevan, J. V., Apweiler, R., & Birney, W. (2019). Biomedical data resources:
Bioinformatics infrastructure for biomedical data science. Annual Review of
Biomedical Data Science, 2:199–222.
3. County Health Rankings Program and Roadmap 7 https://www.
countyhealthrankings.org/explore-health-rankings/measures-data-sources/county-
health-rankings-model
4. Ledley, R., & Lusted, L. (1959). Reasoning foundations of medical diagnosis.
Science, 130, 9–21.
5. Collen, M. F. (1966). Periodic health examinations using an automated multitest
laboratory. JAMA, 195, 830–3.
6. Bleich, H. L. (1971). The computer as a consultant. New England Journal of
Medicine, 284, 141–7.
The world of biomedical research and health care has changed remark-
ably in the 30 years since the first edition of this book was published. So
too has the world of computing and communications and thus the
underlying scientific issues that sit at the intersections among biomedi-
cal science, patient care, public health, and information technology. It is
no longer necessary to argue that it has become impossible to practice
modern medicine, or to conduct modern biological research, without
information technologies. Since the initiation of the Human Genome
Project three decades ago, life scientists have been generating data at a
rate that defies traditional methods for information management and
data analysis.
Health professionals also are constantly reminded that a large per-
centage of their activities relates to information management—for
example, obtaining and recording information about patients, consult-
ing colleagues, reading and assessing the scientific literature, planning
diagnostic procedures, devising strategies for patient care, interpreting
results of laboratory and radiologic studies, or conducting case-based
and population-based research. Artificial intelligence, “big data,” and
data science are having unprecedented impact on the world, with the
biomedical field a particularly active and visible component of such
activity.
It is complexity and uncertainty, plus society’s overriding concern for
patient well-being, and the resulting need for optimal decision making,
that set medicine and health apart from many other information-
intensive fields. Our desire to provide the best possible health and health
care for our society gives a special significance to the effective organiza-
tion and management of the huge bodies of data with which health
professionals and biomedical researchers must deal. It also suggests the
need for specialized approaches and for skilled scientists who are knowl-
edgeable about human biology, clinical care, information technologies,
and the scientific issues that drive the effective use of such technologies
in the biomedical context.
In many respects, the fourth edition was very different from its prede-
cessors, however. Most importantly, it reflected the remarkable changes
in computing and communications that continued to occur, most nota-
bly in communications, networking, and health information technology
policy, and the exploding interest in the role that information technol-
ogy must play in systems integration and the melding of genomics with
innovations in clinical practice and treatment. Several new chapters
were introduced and most of the remaining ones underwent extensive
revision.
In this fifth edition, we have found that two previous single-chapter
topics have expanded to warrant two complementary chapters, specifi-
cally Cognitive Science (split into Cognitive Informatics and Human-
Computer Interaction, Usability, and Workflow) and Consumer Health
Informatics and Personal Health Records (split into Personal Health
Informatics and mHealth and Applications). There is a new chapter on
precision medicine, which has emerged in the past 6 years as a unique
area of special interest. Those readers who are familiar with the first
four editions will find that the organization and philosophy are essen-
tially unchanged (although bioinformatics, as a set of methodologies, is
now considered a “recurrent theme” rather than an “application”), but
the content is either new or extensively updated.1
This book differs from other introductions to the field in its broad
coverage and in its emphasis on the field’s conceptual underpinnings
rather than on technical details. Our book presumes no health- or
computer-science background, but it does assume that you are inter-
ested in a comprehensive domain summary that stresses the underlying
concepts and that introduces technical details only to the extent that
they are necessary to meet the principal goal. Recent specialized texts
are available to cover the technical underpinnings of many topics in this
book; many are cited as suggested readings throughout the book, or are
cited in the text for those who wish to pursue a more technical exposure
to a topic.
This book is written as a text so that it can be used in formal courses, but
we have adopted a broad view of the population for whom it is intended.
Thus, it may be used not only by students of medicine and of the other
health professions but also as an introductory text by future biomedical
informatics professionals, as well as for self-study and for reference by
practitioners, including those who are pursuing formal board certifica-
tion in clinical informatics (as is discussed in more detail later in this
1 As with the first four editions, this book has tended to draw both its examples and
its contributors from North America. There is excellent work in other parts of the
world as well, although variations in health care systems, and especially financing,
do tend to change the way in which systems evolve from one country to the next.
The basic concepts are identical, however, so the book is intended to be useful in
educational programs in other parts of the world as well.
XIV Preface to the Fifth Edition
many of the concepts and analytical tools that underlie modern compu-
tational approaches to the management of human biological data, espe-
cially in areas such as genomics and proteomics. Applications of
bioinformatics related to human health and disease later appear in a
chapter on “Translational Bioinformatics” (7 Chap. 26).
7 Chapter 10 is a comprehensive introduction to the conceptual under-
pinnings of biomedical and clinical image capture, analysis, interpretation,
and use. This overview of the basic issues and imaging modalities serves as
background for 7 Chap. 22, which deals with imaging applications issues,
highlighted in the world of radiological imaging and image management
(e.g., in picture archiving and communication systems).
7 Chapter 11 considers personal health informatics not as a set of
applications (which are covered in 7 Chap. 19), but as introductory
concepts that relate to this topic, such as notions of the digital self and
the digital divide, patient-generated health data, and how a focus on the
patient (or on healthy individuals) affects both the person and the field
of biomedical informatics.
7 Chapter 12 addresses the key legal and ethical issues that have
arisen when health information systems are considered. Then, in
7 Chap. 13, the challenges associated with technology assessment and
with the evaluation of clinical information systems are introduced.
7 Chapters 14–28 (which include two new chapters in this edition,
including one on mHealth and another on precision medicine) survey
many of the key biomedical areas in which informatics methods are
being used. Each chapter explains the conceptual and organizational
issues in building that type of system, reviews the pertinent history, and
examines the barriers to successful implementations.
7 Chapter 29 reprises and updates a chapter that was new in the
fourth edition, providing a summary of the rapidly evolving policy
issues related to health information technology. Although the emphasis
is on US government policy, there is some discussion of issues that
clearly generalize both to states (in the USA) and to other countries.
The book concludes in 7 Chap. 30 with a look to the future—a
vision of how informatics concepts, computers, and advanced commu-
nication devices one day may pervade every aspect of biomedical
research and clinical practice. Rather than offering a single point of
view developed by a group of forward thinkers, as was offered in the
fourth edition, we have invited seven prominent and innovative thinkers
to contribute their own views. We integrate these seven future perspec-
tives (representing clinical medicine, nursing, health policy, translational
bioinformatics, academic informatics, the information technology
industry, and the federal government) into a chapter where the editors
have synthesized the seven perspectives after building on how an analy-
sis of the past helps to inform the future of this dynamic field.
The actual and potential uses of computers in health care and biomedi-
cine form a remarkably broad and complex topic. However, just as you
do not need to understand how a telephone or an ATM machine works
XVI Preface to the Fifth Edition
However, the conceptual basis for study changes much more slowly than
do the detailed technological issues. Thus, the lessons you learn from
this volume will provide you with a foundation on which you can
continue to build in the years ahead.
A suggestion that new courses are needed in the curricula for students
of the health professions is generally not met with enthusiasm. If any-
thing, educators and students have been clamoring for reduced lecture
time, for more emphasis on small group sessions, and for more free time
for problem solving and reflection. Yet, in recent decades, many studies
and reports have specifically identified biomedical informatics, includ-
ing computer applications, as an area in which new educational oppor-
tunities need to be developed so that physicians and other health
professionals will be better prepared for clinical practice. As early as
1984, the Association of American Medical Colleges (AAMC) recom-
mended the formation of new academic units in biomedical informatics
in our medical schools, and subsequent studies and reports have contin-
ued to stress the importance of the field and the need for its inclusion in
the educational environments of health professionals.
The reason for this strong recommendation is clear: The practice of
medicine is inextricably entwined with the management of information. In
the past, practitioners handled medical information through resources
such as the nearest hospital or medical-school library; personal collec-
tions of books, journals, and reprints; files of patient records; consulta-
tion with colleagues; manual office bookkeeping; and (all-too-often
flawed) memorization. Although these techniques continue to be vari-
ably valuable, information technology is offering new methods for find-
ing, filing, and sorting information: online bibliographic retrieval
systems, including full-text publications; personal computers, laptops,
tablets, and smart phones, with database software to maintain personal
information and commonly used references; office-practice and clinical
information systems and EHRs to capture, communicate, and preserve
key elements of the health record; information retrieval and consulta-
tion systems to provide assistance when an answer to a question is
needed rapidly; practice-management systems to integrate billing and
receivable functions with other aspects of office or clinic organization;
and other online information resources that help to reduce the pressure
to memorize in a field that defies total mastery of all but its narrowest
aspects. With such a pervasive and inevitable role for computers in clin-
ical practice, and with a growing failure of traditional techniques to deal
with the rapidly increasing information-management needs of practitio-
ners, it has become obvious to many people that an essential topic has
emerged for study in schools and clinical training programs (such as
residencies) that train medical and other health professionals.
What is less clear is how the subject should be taught in medical
schools or other health professional degree programs, and to what
extent it should be left for postgraduate education. We believe that top-
XVIII Preface to the Fifth Edition
ics in biomedical informatics are best taught and learned in the context
of health-science training, which allows concepts from both the health
sciences and informatics science to be integrated. Biomedical-computing
novices are likely to have only limited opportunities for intensive study
of the material once their health-professional training has been com-
pleted, although elective opportunities for informatics rotations are now
offered to residents in many academic medical centers.
The format of biomedical informatics education has evolved as fac-
ulty members have been hired to carry out informatics research and to
develop courses at more health-science schools, and as the emphasis on
lectures as the primary teaching method continues to diminish. Com-
puters will be used increasingly as teaching tools and as devices for com-
munication, problem solving, and data sharing among students and
faculty. Indeed, the recent COVID-19 pandemic has moved many tradi-
tional medical teaching experiences from the classroom to online teach-
ing environments using video conferencing and on-demand access to
course materials. Such experiences do not teach informatics (unless that
is the topic of the course), but they have rapidly engaged both faculty
and students in technology-intensive teaching and learning experiences.
The acceptance of computing, and dependence upon it, has already
influenced faculty, trainees, and curriculum committees. This book is
designed to be used in a traditional introductory course, whether taught
online or in a classroom, although the “Questions for Discussion” also
could be used to focus conversation in small seminars and working
groups. Integration of biomedical informatics topics into clinical expe-
riences has also become more common. The goal is increasingly to pro-
vide instruction in biomedical informatics whenever this field is most
relevant to the topic the student is studying. This aim requires educa-
tional opportunities throughout the years of formal training, supple-
mented by continuing-education programs after graduation.
The goal of integrating biomedicine and biomedical informatics is to
provide a mechanism for increasing the sophistication of health profes-
sionals, so that they know and understand the available resources. They
also should be familiar with biomedical computing’s successes and fail-
ures, its research frontiers, and its limitations, so that they can avoid
repeating the mistakes of the past. Study of biomedical informatics also
should improve their skills in information management and problem
solving. With a suitable integration of hands-on computer experience,
computer-mediated learning, courses in clinical problem solving, and
study of the material in this volume, health-science students will be well
prepared to make effective use of computational tools and information
management in health care delivery.
trained faculty to teach the courses will be obvious. Some people might
argue, however, that a course on this subject could be taught by a com-
puter scientist who had an interest in biomedical computing, or by a
physician or biologist who had taken a few computing courses. Indeed,
in the past, most teaching—and research—has been undertaken by fac-
ulty trained primarily in one of the fields and later drawn to the other.
Today, however, schools have come to realize the need for professionals
trained specifically at the interfaces among biomedicine, biomedical
informatics, and related disciplines such as computer science, statistics,
cognitive science, health economics, and medical ethics.
This book outlines a first course for students training for careers in
the biomedical informatics field. We specifically address the need for an
educational experience in which computing and information-science
concepts are synthesized with biomedical issues regarding research,
training, and clinical practice. It is the integration of the related disci-
plines that originally was lacking in the educational opportunities avail-
able to students with career interests in biomedical informatics. Schools
are establishing such courses and training programs in growing num-
bers, but their efforts have been constrained by a lack of faculty who
have a broad familiarity with the field and who can develop curricula for
students of the health professions as well as of informatics itself.
The increasing introduction of computing techniques into biomedi-
cal environments requires that well-trained individuals be available not
only to teach students but also to design, develop, select, and manage
the biomedical-computing systems of tomorrow. There is a wide range
of context-dependent computing issues that people can appreciate only
by working on problems defined by the health care setting and its con-
straints. The field’s development has been hampered because there are
relatively few trained personnel to design research programs, to carry
out the experimental and developmental activities, and to provide aca-
demic leadership in biomedical informatics. A frequently cited problem
is the difficulty a health professional (or a biologist) and a technically
trained computer scientist experience when they try to communicate
with one another. The vocabularies of the two fields are complex and
have little overlap, and there is a process of acculturation to biomedicine
that is difficult for computer scientists to appreciate through distant
observation. Thus, interdisciplinary research and development projects
are more likely to be successful when they are led by people who can
effectively bridge the biomedical and computing fields. Such profession-
als often can facilitate sensitive communication among program person-
nel whose backgrounds and training differ substantially.
Hospitals and health systems have begun to learn that they need such
individuals, especially with the increasing implementation of, and
dependence upon, EHRs and related clinical systems. The creation of a
Chief Medical Information Officer (CMIO) has now become a common
innovation. As the concept became popular, however, questions arose
about how to identify and evaluate candidates for such key institutional
roles. The need for some kind of suitable certification process became
clear—one that would require individuals to demonstrate both formal
training and the broad skills and knowledge that were required. Thus,
XX Preface to the Fifth Edition
Edward H. Shortliffe
New York, NY, USA
James J. Cimino
Birmingham, AL, USA
Michael F. Chiang
Bethesda, MD, USA
June 2020
Acknowledgments
Edward H. Shortliffe
New York, NY, USA
December 2020
XXV
Contents
1
Biomedical Informatics: The Science and the
Pragmatics ����������������������������������������������������������������������������������������������������������������������� 3
Edward H. Shortliffe and Michael F. Chiang
12
Ethics in Biomedical and Health Informatics:
Users, Standards, and Outcomes ������������������������������������������������������������� 391
Kenneth W. Goodman and Randolph A. Miller
13
Evaluation of Biomedical and Health
Information Resources ����������������������������������������������������������������������������������� 425
Charles P. Friedman and Jeremy C. Wyatt
Supplementary Information
Glossary������������������������������������������������������������������������������������������ 1018
Name Index������������������������������������������������������������������������������������ 1091
Subject Index���������������������������������������������������������������������������������� 1131
XXIX
Editors
Associate Editor
Michael F. Chiang, MD, MA, FACMI National Eye Institute, National Institutes
of Health, Bethesda, MD, USA
michael.chiang@nih.gov
Contributors
David W. Bates, MD, MSc, FACMI Division of General Internal, Medicine and
Primary Care, Department of Medicine, Brigham and Women’s Hospital, Bos-
ton, MA, USA
dbates@partners.org
David Blumenthal, MD, MPP Commonwealth Fund, New York, NY, USA
db@cmwf.org
Todd Cooper Trusted Solutions Foundry, Inc., San Diego, CA, USA
Joshua C. Denny, MD, MS, FACMI All of Us Research Program, National Insti-
tutes of Health, Bethesda, MD, USA
josh.denny@vumc.org
Parvati Dev, PhD, FACMI SimTabs, Los Altos Hills, CA, USA
parvati@parvatidev.org
Patricia C. Dykes, PhD, MA, RN, FAAN, FACMI Center for Patient Safety
Research & Practice, Brigham and Women’s Hospital, Boston, MA, USA
pdykes@bwh.harvard.edu
Peter J. Embi, MD, MS, FACMI Regenstrief Institute and Indiana University
School of Medicine, Indianapolis, IN, USA
Mark Frisse, MD, MS, MBA, FACMI Biomedical Informatics, Vanderbilt Univer-
sity, Nashville, TN, USA
mark.frisse@vanderbilt.edu
Kenneth W. Goodman, PhD, FACMI Institute for Bioethics and Health Policy,
University of Miami Miller School of Medicine, Miami, FL, USA
kgoodman@med.miami.edu
W. Edward Hammond, PhD, FACMI Duke Center for Health Informatics, Duke
University Medical Center, Durham, NC, USA
Charles Jaffe, MD, PhD, FACMI Health Level Seven International, Ann Arbor,
MI, USA
cjaffe@hl7.org
XXXII Editors
Holly Jimison, PhD, FACMI Khoury College of Computer Sciences & Bouve
College of Health Sciences, Northeastern University, Boston, MA, USA
h.jimison@northeastern.edu
Kensaku Kawamoto, MD, PhD, MHS, FACMI University of Utah, Salt Lake
City, UT, USA
Wayne R. Kubick, BA, MBA Health Level 7 International, Ann Arbor, MI,
USA
Martin LaVenture, MPH, PhD, FACMI Informatics Savvy Advisors, Edina, MN,
USA
Institute for Health Informatics (IHI), University of Minnesota, Minneapolis,
MN, USA
marty@umn.edu
Clement J. McDonald, MD, MS, FACMI Lister Hill National Center for Bio-
medical Communications, National Library of Medicine, National Institutes of
Health, Bethesda, MD, USA
clemmcdonald@mail.nih.gov; CJMcDona@Regenstrief.org
Matt Might, PhD UAB Hugh Kaul Precision Medicine Institute, University of
Alabama at Birmingham, Birmingham, AL, USA
might@uab.edu
Judy Murphy, RN, FACMI, LFHIMSS, FAAN IBM Global Healthcare, Armonk,
NY, USA
judy.murphy29@gmail.com
Mark A. Musen, MD, PhD, FACMI Stanford Center for Biomedical Informatics
Research, Stanford University, Stanford, CA, USA
musen@stanford.edu
Douglas K. Owens, MD, MS Center for Primary Care and Outcomes Research/
Center for Health Policy, Stanford University, Stanford, CA, USA
owens@stanford.edu
Vimla L. Patel, PhD, DSc, FACMI Center for Cognitive Studies in Medicine and
Public Health, The New York Academy of Medicine, New York, NY, USA
vpatel@nyam.org
William C. Reed, MS, PhD Huntzinger Management Group, Inc, Moosic, PA,
USA
wcreed@huntzingergroup.com
David A. Ross, DSc Task Force for Global Health, Decatur, GA, USA
dross@taskforce.org
Catherine Staes, PhD, MPH, RN, FACMI Nursing Informatics Program, College
of Nursing, University of Utah, Lake City, UT, USA
Catherine.Staes@hsc.utah.edu
Paul C. Tang, MD, MS, FACMI Clinical Excellence Research Center, Stanford
University, Stanford, CA, USA
paultang@stanford.edu
Lynn Harold Vogel, AB, AM, PhD LH Vogel Consulting, LLC, Ridgewood, NJ,
USA
lynn@lhvogelconsulting.com
Eric Horvitz
is a technical fellow at Microsoft, where he serves
as the company’s first Chief Scientific Officer. He
previously served as director of Microsoft
Research Labs. He has pursued principles and
applications of AI with contributions in machine
learning, perception, natural language under-
standing, and decision making. His research cen-
ters on challenges with uses of AI amidst the
complexities of the open world, including uses of
probabilistic and decision-theoretic representa-
tions for reasoning and action, models of
bounded rationality, and human-AI complemen-
tarity and coordination. His efforts and collabo-
rations have led to fielded systems in healthcare,
transportation, ecommerce, operating systems,
and aerospace. He received the Feigenbaum Prize
and the Allen Newell Prize for contributions to
AI. He was inducted into the CHI Academy for
contributions at the intersection of AI and
human-computer interaction. He has been
elected fellow of the American College of Medi-
cal Informatics (ACMI), National Academy of
Engineering (NAE), Association of Computing
Machinery (ACM), Association for the Advance-
ment of AI (AAAI), the American Association
for the Advancement of Science (AAAS), the
American Academy of Arts and Sciences, and the
American Philosophical Society. He has served as
president of the AAAI, and on advisory commit-
XXXIX
About the Future Perspective Authors (Chapter 30)
Peter Tarczy-Hornoch, MD
is Professor and Chair of the Department of Bio-
medical Informatics and Medical Education at
the University of Washington (UW). His back-
ground includes computer science, bioengineer-
ing, biomedical informatics, medicine (Pediatrics
and Neonatology) with undergraduate and medi-
cal degrees from Stanford, Residency at the Uni-
versity of Minnesota, Fellowship at the University
of Washington. He has played a key role in estab-
lishing and growing the biomedical informatics
research, teaching and practice activities at UW
XL About the Future Perspective Authors (Chapter 30)
Robert M. Wachter, MD
is Professor and Chair of the Department of
Medicine at the University of California, San
Francisco. He is author of 300 articles and 6
books and is a frequent contributor to the New
York Times and Wall Street Journal. He coined
the term “hospitalist” in 1996 and is often consid-
ered the “father” of the hospitalist field, the fast-
est growing specialty in the history of modern
medicine. He is past president of the Society of
Hospital Medicine and past chair of the Ameri-
can Board of Internal Medicine. In the safety and
quality arenas, his book, Understanding Patient
Safety, is the world’s top selling safety primer. In
2004, he received the John M. Eisenberg Award,
the nation’s top honor in patient safety. Twelve
times, Modern Healthcare magazine has ranked
him as one of the 50 most influential physician-
executives in the U.S.; he was #1 on the list in
2015. His 2015 book, The Digital Doctor: Hope,
Hype and Harm at the Dawn of Medicine’s Com-
puter Age, was a New York Times science best-
seller. In 2016, he chaired a blue-ribbon
commission advising England’s National Health
Service on its digital strategy. He is an elected
member of the National Academy of Medicine.
XLI
Edward H. Shortliffe
is Chair Emeritus and Adjunct Professor in the
Department of Biomedical Informatics at Colum-
bia University’s Vagelos College of Physicians and
Surgeons. Previously he served as President and
CEO of the American Medical Informatics Asso-
ciation. He was Professor of Biomedical Infor-
matics at the University of Texas Health Science
Center in Houston and at Arizona State Univer-
sity. A board-certified internist, he was Founding
Dean of the University of Arizona College of
Medicine – Phoenix and served as Professor of
Biomedical Informatics and of Medicine at
Columbia University. Before that he was Profes-
sor of Medicine and of Computer Science at Stan-
ford University. Honors include his election to
membership in the National Academy of Medi-
cine (where he served on the executive council for
6 years and has chaired the membership commit-
tee) and in the American Society for Clinical
Investigation. He has also been elected to fellow-
ship in the American College of Medical Infor-
matics and the American Association for Artificial
Intelligence. A Master of the American College of
Physicians (ACP), he held a position for 6 years on
that organization’s Board of Regents. He is Editor
Emeritus of the Journal of Biomedical Informatics
and has served on the editorial boards for several
other biomedical informatics publications. In the
early 1980s, he was recipient of a research career
development award from the National Library of
Medicine. In addition, he received the Grace Mur-
ray Hopper Award of the Association for Com-
puting Machinery in 1976, the Morris F. Collen
Award of the American College of Medical Infor-
matics in 2006, and was a Henry J. Kaiser Family
Foundation Faculty Scholar in General Internal
Medicine. He has served on the oversight commit-
tee for the Division on Engineering and Physical
Sciences (National Academy of Sciences), the
National Committee on Vital and Health Statis-
tics (NCVHS), and on the President’s Information
Technology Advisory Committee (PITAC). Dr.
Shortliffe has authored over 350 articles and
books in the fields of biomedical computing and
artificial intelligence.
XLII About the Editors
James J. Cimino
is a board-certified internist who completed a
National Library of Medicine informatics fellow-
ship at the Massachusetts General Hospital and
Harvard University and then went on to an aca-
demic position at Columbia University College of
Physicians and Surgeons and the Presbyterian Hos-
pital in New York. He spent 20 years at Columbia,
carrying out clinical informatics research, building
clinical information systems, teaching medical
informatics and medicine, and caring for patients,
rising to the rank of full professor in both Biomedi-
cal Informatics and Medicine. His principal
research areas there included desiderata for con-
trolled terminologies, mobile and Web-based clini-
cal information systems for clinicians and patients,
and a context-aware form of clinical decision sup-
port called “infobuttons.” In 2008, he moved to the
National Institutes of Health, where he was the
Chief of the Laboratory for Informatics Develop-
ment and a Tenured Investigator at the NIH Clini-
cal Center and the National Library of Medicine.
His principal project involved the development of
the Biomedical Translational Research Information
System (BTRIS), an NIH-wide clinical research
data resource. In 2015, he left the NIH to be the
inaugural Director of the Informatics Institute at
the University of Alabama at Birmingham. The
Institute is charged with improving informatics
research, education, and service across the Univer-
sity, supporting the Personalized Medicine Insti-
tute, the Center for Genomic Medicine, and the
University Health System Foundation, including
improvement of and access to electronic health
records. He holds the rank of Tenured Professor in
Medicine and is the Chief for the Informatics Sec-
tion in the Division of General Internal Medicine.
He continues to conduct research in clinical infor-
matics and clinical research informatics, he was
Director of the NLM’s weeklong Biomedical Infor-
matics course for 16 years, and teaches at Columbia
University and Georgetown University as an
Adjunct Professor. He is an Associate Editor of the
Journal of Biomedical Informatics. His honors
include Fellowships of the American College of
Physicians, the New York Academy of Medicine
and the American College of Medical Informatics
(Past President), the Priscilla Mayden Award from
the University of Utah, the Donald A.B. Lindberg
Award for Innovation in Informatics and the Presi-
dent’s Award, both from the American Medical
XLIII
About the Editors
Michael F. Chiang
is Director of the National Eye Institute, at the
National Institutes of Health in Bethesda, Mary-
land. His clinical practice focuses on pediatric
ophthalmology, and he is board-certified in clini-
cal informatics. His research develops and applies
biomedical informatics methods to clinical oph-
thalmology in areas such as retinopathy of prema-
turity (ROP), telehealth, artificial intelligence,
clinical information systems, genotype-phenotype
correlation, and data analytics. His group has pub-
lished over 200 peer-reviewed papers, and has
developed an assistive artificial intelligence system
for ROP that received breakthrough status from
the US Food and Drug Administration. He
received a BS in Electrical Engineering and Biol-
ogy from Stanford University, an MD from the
Harvard-MIT Division of Health Sciences and
Technology, and an MA in Biomedical Informat-
ics from Columbia University. He completed clini-
cal training at the Johns Hopkins Wilmer Eye
Institute. Between 2001 and 2010, he worked at
Columbia University, where he was Anne S. Cohen
Associate Professor of Ophthalmology & Biomed-
ical Informatics, director of medical student edu-
cation in ophthalmology, and director of the
introductory graduate student course in biomedi-
cal informatics. From 2010 to 2020, he was
Knowles Professor of Ophthalmology & Medical
Informatics and Clinical Epidemiology, and Asso-
ciate Director of the Casey Eye Institute, at the
Oregon Health & Science University (OHSU)
Casey Eye Institute. He has served as a member of
the American Academy of Ophthalmology (AAO)
Board of Trustees, Chair of the AAO IRIS Regis-
try Data Analytics Committee, Chair of the AAO
Task Force on Artificial Intelligence, Chair of the
AAO Medical Information Technology Commit-
tee, and on numerous other national and local
committees. He currently serves as an Associate
Editor for JAMIA, and is on the Editorial Board
for Ophthalmology and the Asia-Pacific Journal of
Ophthalmology.
1 I
Recurrent Themes in
Biomedical
Informatics
Contents
Biomedical Informatics:
The Science and the
Pragmatics
Edward H. Shortliffe and Michael F. Chiang
Contents
References – 44
scholarly information (e.g., accessing digital comprehensive health record evolve in the
1 libraries, supporting bibliographic search, and future, as technology creates unprecedented
providing access to drug information data- opportunities for innovation?”
bases), and even with office automation (e.g., The complexity associated with automat-
providing access to spreadsheets and ing clinical-care records is best appreciated if
document- management software). The key one analyzes the processes associated with the
idea, however, is that at the heart of the evolv- creation and use of such records rather than
ing integrated environments lies an electronic thinking of the record as a physical object
health record that is intended to be accessible, (such as the traditional paper chart) that can
confidential, secure, acceptable to clinicians be moved around as needed within the institu-
and patients, and integrated with other types tion. For example, on the input side
of useful information to assist in planning (. Fig. 1.1), an electronic version of the
and problem solving. paper chart requires the integration of pro-
cesses for data capture and for merging infor-
mation from diverse sources.
1.1.2 oday’s Electronic Health
T The contents of the paper record were tra-
Record (EHR) Environment ditionally organized chronologically—often a
severe limitation when a clinician sought to
The traditional paper-based medical record is find a specific piece of information that could
now recognized as being woefully inadequate occur almost anywhere within the chart. To be
for meeting the needs of modern medicine. It useful, the electronic record system has to
arose in the nineteenth century as a highly make it easy to access and display needed
personalized “lab notebook” that clinicians data, to analyze them, and to share them
could use to record their observations and among colleagues and with secondary users
plans so that they could be reminded of perti- of the record who are not involved in direct
nent details when they next saw the same patient care (. Fig. 1.2). Thus, the EHR, as
patient. There were no regulatory require- an adaptation of the paper record, is best
ments, no assumptions that the record would viewed not as an object, or a product, but
be used to support communication among rather as a set of processes that an organiza-
varied providers of care, and few data or test tion puts into place, supported by technology
results to fill up the record’s pages. The record (. Fig. 1.3).
that met the needs of clinicians a century or so Implementing electronic records is inher-
ago struggled mightily to adjust over the ently a systems-integration task. It accordingly
decades and to accommodate to new require- requires a custom-tailored implementation at
ments as health care and medicine changed. each institution, given the differences in exist-
Today the inability of paper charts to serve ing systems and practices that must be suit-
the best interests of the patient, the clinician, ably integrated. Joint development and local
and the health system is no longer questioned adaptation are crucial, which implies that the
(see 7 Chaps. 14 and 16). institutions that purchase such systems must
Most organizations have found it challeng- have local expertise that can oversee and facil-
ing (and expensive) to move to a paperless, itate an effective implementation process,
electronic clinical record. This observation including elements of process re-engineering
forces us to ask the following questions: and cultural change that are inevitably
“What is a health record in the modern world? involved.
Are the available products and systems well Experience has shown that clinicians are
matched with the modern notions of a com- “horizontal” users of information technology
prehensive health record? Do they meet the (Greenes and Shortliffe 1990). Rather than
needs of individual users as well as the health becoming “power users” of a narrowly defined
systems themselves? Are they efficient, easy to software package, they tend to seek broad
use, and smoothly integrated into clinical functionality across a wide variety of systems
workflow? How should our concept of the and resources. Thus, routine use of comput-
Biomedical Informatics: The Science and the Pragmatics
7 1
.. Fig. 1.1 Inputs to the clinical-care record. The tradi- logic results, reports of telephone calls or prescriptions,
tional paper record was created by a variety of organiza- and data obtained directly from patients). The paper
tional processes that captured varying types of record thus was a merged collection of such data, gener-
information (notes regarding direct encounters between ally organized in chronological order
health professionals and patients, laboratory or radio-
ers, and of EHRs, is most easily achieved and analyzed in order to learn about the safety
when the computing environment offers a and efficacy of new treatments or tests and to
critical mass of functionality that makes the gain insight into disease processes that are not
system both smoothly integrated with work- otherwise well understood. Medical research-
flow and useful for essentially every patient ers were constrained in the past by clumsy
encounter. methods for identifying patients who met
The arguments for automating clinical- inclusion criteria for clinical trials as well as
care records are summarized in 7 Chaps. 2 acquiring the data needed for the trials, gener-
and 14 and in the now classic Institute of ally relying on manual capture of information
Medicine’s report on computer-based patient onto datasheets that were later transcribed
records (CPRs) (Dick and Steen 1991 (Revised into computer databases for statistical analy-
1997).1 One argument that warrants emphasis sis (. Fig. 1.4). The approach was labor-
is the importance of the EHR in supporting intensive, fraught with opportunities for error,
clinical trials—experiments in which data and added to the high costs associated with
from specific patient interactions are pooled randomized prospective research protocols.
The use of EHRs has offered many advan-
tages to those carrying out clinical research
1 The Institute of Medicine, part of the National
(see 7 Chap. 27). Most obviously, it helps to
Academy of Sciences, is now known as the National eliminate the manual task of extracting data
Academy of Medicine. from charts or filling out specialized data-
8 E. H. Shortliffe and M. F. Chiang
.. Fig. 1.2 Outputs from the clinical-care record. Once direct patient care. Numerous providers are typically
information was collected in the traditional paper chart, involved in a patient’s care, so the chart also served as a
it needed to be provided to a wide variety of potential means for communicating among them. The traditional
users of the information that it contained. These users mechanisms for displaying, analyzing, and sharing
included health professionals and the patients them- information from such records resulted from a set of
selves, as well as “secondary users” (represented here by processes that often varied substantially across several
the individuals in business suits) who had valid reasons patient-care settings and institutions
for accessing the record but who were not involved with
sheets. The data needed for a study can often Note that . Fig. 1.5 represents a study at a
be derived directly from the EHR, thus mak- single institution and often for a limited subset
ing much of what is required for research data of the patients who receive care there. Yet
collection simply a by-product of routine clin- much research is carried out with very large
ical record keeping (. Fig. 1.5). Other advan- numbers of patients, such as within a regional
tages accrue as well. For example, the record health care system, statewide, or nationally.
environment can help to ensure compliance Accordingly, the size of research datasets can
with a research protocol, pointing out to a cli- get very large, but analyzing across them intro-
nician when a patient is eligible for a study or duces challenges related to data exchange and
when the protocol for a study calls for a spe- the standardization of the ways in which indi-
cific management plan given the currently vidual data elements are defined, identified, or
available data about that patient. We are also stored (see 7 Chap 8). Retrospective studies
seeing the development of novel authoring on data collected in the past typically cannot
environments for clinical trial protocols that assume a prior standardization of the elements
can help to ensure that the data elements that will be needed, thereby requiring analyses
needed for the trial are compatible with the that infer relationships among specific descrip-
local EHR’s conventions for representing tors in different institutions represented in dif-
patient descriptors. ferent ways. When the number of data elements
Biomedical Informatics: The Science and the Pragmatics
9 1
.. Fig. 1.3 Complex processes demanded of the record. valid reasons for accessing it. Yet paper-based documents
As shown in . Figs. 1.1 and 1.2, the paper chart evolved to were severely limited in meeting the diverse requirements for
become the incarnation of a complex set of organizational data collection and information access that are implied by
processes, which both gathered information to be shared this diagram. These deficiencies accounted in large part for
and then distributed that information to those who had the effort to create today’s electronic health records
is large, and the population being studied is that appear in monographs or journal articles
also vast, the challenges are often described as tend to sit on shelves, unavailable when the
“big data” analytics (James et al. 2013). knowledge they contain would be most valu-
Another theme in the changing world of able to practitioners. Computer-based tools for
health care is the increasing investment in the implementing such guidelines, and integrating
creation of standard order sets, clinical guide- them with the EHR, present a means for mak-
lines, and clinical pathways (see 7 Chap. 24), ing high-quality advice available in the routine
generally in an effort to reduce practice vari- clinical setting. Many organizations are
ability and to develop consensus approaches to accordingly integrating decision-support tools
recurring management problems. Several gov- with their EHR systems (see 7 Chaps. 14 and
ernment and professional organizations, as 24), and there are highly visible commercial
well as individual provider groups, have efforts underway to provide computer-based
invested heavily in guideline development, diagnostic decision support to practitioners.2
often putting an emphasis on using clear evi- There are at least five major issues that
dence from the literature, rather than expert have consistently constrained our efforts to
opinion alone, as the basis for the advice. build effective EHRs: (1) the need for stan-
Despite the success in creating such evidence-
2 7 https://ehrintelligence.com/news/top-clinical-
based guidelines, there is a growing recognition
decision-support-system-cdss-companies-by-ambu-
that we need better methods for delivering the latory-inpatient; 7 https://www.ibm.com/watson/
decision logic to the point of care. Guidelines health/. (Accessed 5/29/19/).
10 E. H. Shortliffe and M. F. Chiang
1 Medical
record
Computer database
Results
.. Fig. 1.4 Traditional data collection for clinical tri- data managers were often hired to abstract the relevant
als. Until the introduction of EHRs and similar systems, data from the paper chart. The trials were generally
the gathering of research data for clinical studies was designed to define data elements that were required and
typically a manual task. Physicians who cared for the methods for analysis, but it was common for the pro-
patients enrolled in trials, or their research assistants, cess of collecting those data in a structured format to be
would be asked to fill out special datasheets for later left to manual processes at the point of patient care
transcription into computer databases. Alternatively,
.. Fig. 1.5 Role of electronic health records (EHRs) in . Fig. 1.4 are thereby largely eliminated. In addition, the
supporting clinical trials. With the introduction of EHR interaction of the physician with the EHR permits two-way
systems, the collection of much of the research data for communication, which can greatly improve the quality and
clinical trials can become a by-product of the routine care efficiency of the clinical trial. Physicians can be reminded
of the patients. Research data may be analyzed directly when their patients are eligible for an experimental proto-
from the clinical data repository, or a secondary research col, and the computer system can also remind the clinicians
database may be created by downloading information from of the rules that are defined by the research protocol,
the online patient records. The manual processes in thereby increasing compliance with the experimental plan
Biomedical Informatics: The Science and the Pragmatics
11 1
dards in the area of clinical terminology; (2) document-management software of today
concerns regarding data privacy, confidential- bears little resemblance to a typewriter.
ity, and security; (3) challenges in data entry Consider all the powerful desktop-publishing
by physicians; (4) difficulties associated with facilities, integration of figures, spelling cor-
the integration of record systems with other rection, grammar aids, “publishing” online,
information resources in the health care set- collaboration on individual documents by
ting, and (5) designing and delivering systems multiple users, etc. Similarly, today’s spread-
that are efficient, acceptable to clinicians, and sheet programs bear little resemblance to the
intuitive to use. The first of these issues is dis- tables of numbers that we once created on
cussed in detail in 7 Chap. 7, and privacy is graph paper. To take an example from the
one of the central topics in 7 Chap. 12. Issues financial world, consider automatic teller
of direct data entry by clinicians are discussed machines (ATMs) and their facilitation of
in 7 Chaps. 2 and 14 and throughout many today’s worldwide banking in ways that were
other chapters as well. 7 Chapter 15 exam- never contemplated when the industry
ines the fourth topic, focusing on recent trends depended on human bank tellers.
in networked data integration, and offers solu- It is accordingly logical to ask what the
tions for the ways in which the EHR can be health record will become after it has been
better joined with other relevant information effectively implemented on computer systems
resources and clinical processes, especially and new opportunities for its enhancement
within communities where patients may have become increasingly clear to us. It is clear that
records with multiple providers and health EHRs a decade from now will be remarkably
care systems (Yasnoff et al. 2013). Finally, different from the antiquated paper folders
issues of the interface between computers and that used to dominate our health care envi-
clinicians (or other users), with a cognitive ronments. We might similarly predict that the
emphasis, are the subject of 7 Chap. 5. state of today’s EHR is roughly comparable
to the status of commercial aviation in the
1930s. By that time air travel had progressed
1.1.3 Anticipating the Future substantially from the days of the Wright
of Electronic Health Records Brothers, and air travel was becoming com-
mon. But 1930s air travel seems archaic by
One of the first instincts of software develop- modern standards, and it is logical to assume
ers is to create an electronic version of an that today’s EHRs, albeit much better than
object or process from the physical world. both paper records and the early computer-
Some familiar notion provides the inspiration based systems of the 1960s and 1970s, will be
for a new software product. Once the software greatly improved and further modernized in
version has been developed, however, human the decades ahead.
ingenuity and creativity often lead to an evo- If people had failed to use the early air-
lution that extends the software version far planes for travel, the quality and efficiency of
beyond what was initially contemplated. The airplanes and air travel would not have
computer can thus facilitate paradigm shifts improved as they have. A similar point can be
in how we think about such familiar concepts. made about the importance of committing
Consider, for example, the remarkable dif- to the use of EHRs today, even though we
ference between today’s office automation know that they need to be much better in the
software and the typewriter, which was the future. We must also commit to assuring that
original inspiration for the development of those improvements are made, which sug-
“word processors”. Although the early word gests a dynamic interaction and interdepen-
processors were designed largely to allow dency among the researchers who address
users to avoid retyping papers each time a limitations in EHRs and their underlying
minor change was made to a document, the methods and philosophy, the EHR compa-
12 E. H. Shortliffe and M. F. Chiang
nies that currently exist or will arise in the mercialized operation, no longer depending
1 future, and the users who identify require- on the U.S. government to support even the
ments and areas for improvement. These major backbone connections. Today, the
companies must look to creative researchers, Internet is ubiquitous, worldwide, accessible
both within their own companies and in aca- through mobile wireless devices, and has
demia, who will forge the changes that will provided the invisible but mandatory infra-
encourage EHR users to embrace and appre- structure for social, political, financial, sci-
ciate the technology much more than they entific, corporate, and entertainment
often do today. ventures. Many people point to the Internet
as a superb example of the facilitating role
of federal investment in promoting innova-
1.2 Communications Technology tive technologies. The Internet is a major
and Health Data Integration societal force that arguably would never
have been created if the research and devel-
An obvious opportunity for changing the role opment, plus the coordinating activities, had
and functionality of clinical-care records in been left to the private sector.
the digital age is the power and ubiquity of The explosive growth of the Internet did
the Internet. The Internet began in 1968 as a not occur until the late 1990s, when the World
U.S. research activity funded by the Advanced Wide Web (which had been conceived initially
Research Projects Agency (ARPA) of the by the physics community as a way of using
Department of Defense. Initially known as the Internet to share preprints with photo-
the ARPANET, the network began as a novel graphs and diagrams among researchers) was
mechanism for allowing a handful of defense- introduced and popularized. Navigating the
related mainframe computers, located mostly Web is highly intuitive, requires no special
at academic institutions or in the research training, and provides a mechanism for access
facilities of military contractors, to share data to multimedia information that accounts for
files with each other and to provide remote its remarkable growth as a worldwide phe-
access to computing power at other locations. nomenon. It is also accessible by essentially all
The notion of electronic mail arose soon digital devices—computers, tablets, smart
thereafter, and machine-to-machine electronic phones, and a plethora of personal monitors
mail exchanges quickly became a major com- and “smart home” tools—which is a tribute to
ponent of the network’s traffic. As the tech- its design and its compatibility with newer
nology matured, its value for nonmilitary networking technologies, such as Bluetooth
research activities was recognized, and by and Wi-Fi.
1973 the first medically related research com- The societal impact of this communica-
puter had been added to the network tions phenomenon cannot be overstated,
(Shortliffe 1998a, 2000). especially given the international connectivity
During the 1980s, the technology began that has grown phenomenally in the past two
to be developed in other parts of the world, decades. Countries that once were isolated
and the National Science Foundation took from information that was important to citi-
over the task of running the principal high- zens, ranging from consumers to scientists to
speed backbone network in the United those interested in political issues, are now
States. Hospitals, mostly academic centers, finding new options for bringing timely infor-
began to be connected to what had by then mation to the desktop machines and mobile
become known as the Internet, and in a devices of individuals with an Internet con-
major policy move it was decided to allow nection.
commercial organizations to join the net- There has in turn been a major upheaval
work as well. By April 1995, the Internet in in the telecommunications industry, with
the United States had become a fully com- companies that used to be in different busi-
Biomedical Informatics: The Science and the Pragmatics
13 1
nesses (e.g., cable television, Internet services, 1.2.1 Model of Integrated
A
and telephone) now finding that their activi- Disease Surveillance3
ties and technologies have merged. In the
United States, legislation was passed in 1996 To emphasize the role that the nation’s net-
to allow new competition to develop and new working infrastructure is playing in integrat-
industries to emerge. We have subsequently ing clinical data and enhancing care delivery,
seen the merging of technologies such as consider one example of how disease surveil-
cable television, telephone, networking, and lance, prevention, and care are increasingly
satellite communications. High-speed lines being influenced by information and commu-
into homes and offices are widely available, nications technology. The goal is to create an
wireless networking is ubiquitous, and inex- information-management infrastructure that
pensive mechanisms for connecting to the will allow all clinicians, regardless of practice
Internet without using conventional comput- setting (hospitals, emergency rooms, small
ers (e.g., using cell phones or set-top boxes) offices, community clinics, military bases,
have also emerged. The impact on everyone multispecialty groups, etc.) to use EHRs in
has been great and hence it is affecting the their practices both to assist in patient care
way that individuals seek health-related infor- and to provide patients with counsel on illness
mation while also enhancing how patients prevention. The full impact of this use of elec-
can gain access to their health care providers tronic resources will occur when data from all
and to their clinical data. such records are pooled in regional and
The Internet also has exhibited unin- national registries or surveillance databases
tended consequences, especially in the world (. Fig. 1.6), mediated through secure con-
of social media, which has created opportuni- nectivity with the Internet. The challenge, of
ties for promoting political unrest, social course, is to find a way to integrate data from
shaming, and dissemination of falsehoods. In such diverse practice settings, especially since
the world of health care, the Internet has cre- there are multiple vendors and system devel-
ated opportunities for attacks on personal opers active in the marketplace, competing to
privacy, even while facilitating socially valu- provide value-added capabilities that will
able exchanges of data among institutions excite and attract the practitioners for whom
and individuals. Many of these practical, their EHR product is intended.
legal, and ethical challenges are the subject of The practical need to pool and integrate
7 Chap. 12. clinical data from such diverse resources and
Just as individual hospitals and health care systems emphasizes the practical issues that
systems have come to appreciate the impor- need to be addressed in achieving such func-
tance of integrating information from multi- tionality and resources. Interestingly, most of
ple clinical and administrative systems within the barriers are logistical, political, and finan-
their organizations (see 7 Chap. 16), health cial rather than technical in nature:
planners and governments now appreciate the 55 Encryption of data: Concerns regarding
need to develop integrated information privacy and data protection require that
resources that combine clinical and health Internet transmission of clinical
data from multiple institutions within regions, information occur only if those data are
and ultimately nationally (see 7 Chaps. 15 encrypted, with an established mechanism
and 18). As you will see, the Internet and the for identifying and authenticating
role of digital communications has therefore individuals before they are allowed to
become a major part of modern medicine and decrypt the information for surveillance or
health. Although this topic recurs in essen- research use.
tially every chapter in this book, we introduce
it in the following sections because of its
importance to modern technical issues and 3 This section is adapted from a discussion that origi-
policy directions. nally appeared in (Shortliffe and Sondik 2004).
14 E. H. Shortliffe and M. F. Chiang
1 Internet
Provider EHR
Provider EHR
Regional and national registries
and surveillance databases
Provider EHR
Provider EHR
Different vendors
.. Fig. 1.6 A future vision of surveillance databases, in When information is effectively gathered, pooled, and
which clinical data are pooled in regional and national analyzed, there are significant opportunities for feeding
registries or repositories through a process of data sub- back the results of derived insights to practitioners at
mission that occurs over the Internet (with attention to the point of care. Thus the arrows indicate a bi-
privacy and security concerns as discussed in the text). directional process. See also 7 Chap. 15
1 Biomedical
and clinical
research
Electronic Pooled clinical data
health Regional
records and
national
providers
caring for public
patients health and
Standards
disease
for
registries
prevention
Creation of
Information, and
protocols,
decision-support, treatment
guidelines,
and order-entry
and A ‘’Learning
systems
educational health system’’
materials
.. Fig. 1.8 The ultimate goal is to create a cycle of knowledge then can feed back to practitioners at the
information flow, whereby data from local distributed point of care, using a variety of computer-supported
electronic health records (EHRs) and their associated decision-support delivery mechanisms. This cycle of
clinical datasets are routinely and effortlessly submitted new knowledge, driven by experience, and fed back to
to registries and research databases. The resulting new clinicians, has been dubbed a “learning health system”
requirements that the US government speci- for treatment in turn can be translated into
fied after 2009 when determining eligibility protocols, guidelines, and educational materi-
for payment of incentives to clinicians or als. This new knowledge and decision-support
hospitals who implemented EHRs (see the functionality can then be delivered over the
discussion of the government HITECH pro- network back to the clinicians so that the
gram in 7 Sect. 1.3). information informs patient care, where it is
Consider, instead, an expanded view of integrated seamlessly with EHRs and order-
the health surveillance model introduced in entry systems.
7 Sect. 1.2.1 (. Fig. 1.8). Beginning at the This notion of a system that allows us to
left of the diagram, clinicians caring for learn from what we do, unlocking the experi-
patients use electronic health records, both to ence that has traditionally been stored in
record their observations and to gain access to unusable form in paper charts, is gaining wide
information about the patient. Information attention now that we can envision an inter-
from these records is then stored in local connected community of clinicians and insti-
patient-care clinical databases and forwarded tutions, building digital data resources using
automatically to regional and national regis- EHRs. The concept has been dubbed a learn-
tries as well as to research databases that can ing health system and is an ongoing subject
support retrospective studies (see 7 Chap. 15) of study by the National Academy of
or formal institutional or community-based Medicine (Daley 2013), which has published
clinical trials (see 7 Chap. 27). The analyzed a series of reports on the topic.5 It is also the
information from institutional datasets, regis- organizing conceptual framework for a
tries and research studies can in turn be used
to develop standards for prevention and treat-
ment, with major guidance from biomedical
research. Researchers can draw information 5 7 https://nam.edu/programs/value-science-driven-
either directly from the health records or from health-care/learning-health-system-series/
the pooled data in registries. The standards (Accessed 05/29/19)
Biomedical Informatics: The Science and the Pragmatics
17 1
Biomedical
and clinical
research
.. Fig. 1.9 Today the learning health system is increas- derived from population activities that reflect individu-
ingly embracing new forms of massive health-related als’ health, activities, and attitudes
data, often from outside the clinical care setting and
recently created department at the University monitor the behavior of individuals as they
of Michigan Medical School6 and for a new use online information resources, searching
scientific journal.7 for health-related information. Social media
Although the learning health system con- exchanges (e.g., Twitter, Facebook) have also
cept of . Fig. 1.8 may at first seem expansive been used to extract health-related informa-
and all-inclusive, in recent years we have tion, such as complaints that suggest early
learned that there are other important inputs stages of communicable diseases or expressed
to the health care environment and these can attitudes towards diseases and treatment. The
have important implications for what we learn explosive adoption of health monitoring
by analyzing what both patients and healthy devices (e.g., step counters, exercise analyzers,
individuals do. Some of these data sources are cardiac or sleep monitors) has also offered a
immense and are in line with the recent inter- useful source of large-scale information that
est in “big data” analytics (. Fig. 1.9). is only beginning to be merged with other
Consider, for example, the analysis of huge data in our learning health system.
datasets associated with full human genome
specifications for individuals and populations.
Another approach for gathering massive 1.2.3 Implications of the Internet
amounts of relevant health-related data is to for Patients
With the penetration of the Internet, patients,
6 7 https://medicine.umich.edu/dept/learning-health-
as well as healthy individuals, have turned to
sciences (Accessed 05/03/2020)
7 7 https://onlinelibrary.wiley.com/journal/23796146 the Internet for health information. It is a rare
(Accessed 05/03/2020) North American physician who has not
18 E. H. Shortliffe and M. F. Chiang
encountered a patient who comes to an idly, and there are settings in which it is already
1 appointment armed with a question, or a proving to be successful and cost-effective
stack of printouts, that arose due to medically (e.g., rural care, international medicine, tele-
related searches on the net. The companies radiology, and video-based care of patients in
that provide search engines for the Internet prisons). Similarly, there are now a large num-
report that health-related sites are among the ber of apps (designed for smart phones, tab-
most popular ones being explored by consum- lets, or desktop machines) that offer
ers. As a result, physicians and other care pro- specialized medical care or advice or assist
viders have learned that they must be prepared with health data management and communi-
to deal with information that patients discover cation with providers and support groups (see
on the net and bring with them when they 7 Chaps. 11 and 20).
seek care from clinicians. Some of the infor-
mation is timely and excellent; in this sense,
physicians can often learn about innovations
1.2.4 Requirements for Achieving
from their patients and need to be open to the
kinds of questions that this enhanced access the Vision
to information will generate from patients in
their practices. Efforts that continue to push the state of the
On the other hand, much of the health art in Internet technology all have significant
information on the Web lacks peer review or is implications for the future of health care
purely anecdotal. People who lack medical delivery in general and of EHRs and their
training can be misled by such information, integration in particular (Shortliffe 1998b,
just as they have been poorly served in the 2000). But in addition to increasing speed,
past by printed information in books and reliability, security, and availability of the
magazines dealing with fad treatments from Internet, there are many other areas that need
anecdotal sources. This also creates challenges attention if the vision of a learning health sys-
for health care providers, who often feel pres- tem is to be achieved.
sured to handle more issues in less time due to
economic pressures. In addition, some sites 1.2.4.1 Education and Training
provide personalized advice, sometimes for a There is a difference between computer liter-
fee, with all the attendant concerns about the acy (familiarity with computers and their
quality of the suggestions and the ability to routine uses in our society) and knowledge
give valid advice based on an electronic-mail of the role that computing and communica-
or Web-based interaction. tions technology can and should play in our
In a positive light, communications tech- health system. We need to do a better job of
nologies offer clinicians creative ways to inter- training future clinicians in the latter area.
act with their patients and to provide higher Otherwise we will leave them poorly equipped
quality care. Years ago, medicine adopted the for the challenges and opportunities they will
telephone as a standard vehicle for facilitating face in the rapidly changing practice environ-
patient care, and we now take this kind of ments that surround them (Shortliffe 2010).
interaction with patients for granted. If we Not only do they need to feel comfortable
extend the audio channel to include our visual with the technology itself, but they need to
sense as well, typically relying on the Internet understand the profound effect that it has
as our communication mechanism, the notion had on the practice of medicine—with many
of telemedicine emerges (see 7 Chap. 20). more changes to come. Medicine, and other
This notion of “medicine at a distance” arose health professions, are being asked to adapt
early in the twentieth century (see . Fig. 1.10), in ways that were not envisioned even a
but the technology was too limited for much decade or two ago. Not all individuals
penetration of the idea beyond telephone con- embrace such change, but younger clinicians,
versations until the last 30–40 years. The use who have grown up with technology in
of telemedicine has subsequently grown rap- almost all aspects of their lives, have high
Biomedical Informatics: The Science and the Pragmatics
19 1
.. Fig. 1.10 “The Radio Doctor”: long before televi- advanced technologies. This 1924 example is from the
sion was invented, creative observers were suggesting cover of a popular magazine and envisions video
how doctors and patients could communicate using enhancements to radio. (Source: “Radio News” 1924)
expectations for how digital systems and Furthermore, in addition to the implica-
tools should enhance their professional expe- tions for education of health professionals
rience. What is even more challenging, per- about computer-related topics, much of the
haps, is that assumptions that they have future vision we have proposed here can be
made about the field they have entered may achieved only if educational institutions pro-
no longer be valid in the coming years, as duce a cadre of talented individuals who are
some skills are no longer required and new highly skilled in computing and communica-
requirements are viewed as dramatically dif- tions technology but also have a deep under-
ferent from what health professionals have standing of the biomedical milieu and of the
had to know in the past. needs of practitioners and other health work-
20 E. H. Shortliffe and M. F. Chiang
ers. Computer science training alone is not failure to understand the requirements for
1 adequate. Fortunately, there are increasing process reengineering as part of software
numbers of formal training programs in what implementation, as well as problems with
has become known as biomedical informatics technical leadership and planning, account
(see 7 Sect. 1.4) that provide custom-tailored for many of the frustrating experiences that
educational opportunities. Many of the train- health care organizations report in their
ees are life science researchers, physicians, efforts to use computers more effectively in
nurses, pharmacists, and other health profes- support of patient care and provider produc-
sionals who see the career opportunities and tivity.
challenges at the intersections of biomedicine, The notion of a learning health system
information science, computer science, deci- described previously is meant to motivate
sion science, data science, cognitive science, your enthusiasm for what lies ahead and to
and communications technologies. As has suggest the topics that need to be addressed in
been clear for three decades (Greenes and a book such as this one. Essentially all of the
Shortliffe 1990), however, the demand for following chapters touch on some aspect of
such individuals far outstrips the supply, both the vision of integrated systems that extend
for academic and industrial career pathways.8, 9 beyond single institutions. Before embarking
We need more training programs,10 expansion on these topics, however, we must emphasize
of those that already exist, plus support for two points. First, the cyclical creation of new
junior faculty in health science schools who knowledge in a learning health care system
may wish to pursue additional training in this will become reality only if individual hospi-
area. tals, academic medical centers, and national
coordinating bodies work together to provide
1.2.4.2 Organizational the standards, infrastructure, and resources
and Management Change that are necessary. No individual system
Second, as implied above, there needs to be a developer, vendor, or administrator can man-
greater understanding among health care date the standards for connectivity, data pool-
leaders regarding the role of specialized multi- ing, and data sharing implied by a learning
disciplinary expertise in successful clinical health care system. A national initiative of
systems implementation. The health care sys- cooperative planning and implementation for
tem provides some of the most complex orga- computing and communications resources
nizational structures in society (Begun et al. within and among institutions and clinics is
2003), and it is simplistic to assume that off- required before practitioners will have routine
shelf products will be smoothly intro- access to the information that they need (see
the-
duced into a new institution without major 7 Chap. 15). A major federal incentive pro-
analysis, redesign, and cooperative joint- gram for EHR implementation was a first step
development efforts. Underinvestment and a in this direction (see 7 Sect. 1.3). The criteria
that are required for successful EHR imple-
mentation are sensitive to the need for data
integration, public-health support, and a
8 7 https://www.hcinnovationgroup.com/policy- learning health system.
value-based-care/staffing-professional-development/ Second, although our presentation of the
news/13024360/report-health-informatics-labor-
learning health system notion has focused on
market-lags-behind-demand-for-workers (Accessed
5/30/2019); 7 https://www.bestvalueschools.com/ the clinician’s view of integrated information
faq/job-outlook-health-informatics-graduates/ access, other workers in the field have similar
(Accessed 5/30/2019). needs that can be addressed in similar ways.
9 7 https://www.burning-glass.com/wp-content/ The academic research community has
u p l o a d s / B G - H e a l t h _ I n fo r m at i c s _ 2 0 1 4 . p d f
already developed and made use of much of
(Accessed 5/30/2019).
10 A directory of some existing training programs is the technology that needs to be coalesced if
available at 7 http://www.amia.org/education/pro- the clinical user is to have similar access to
grams-and-courses (Accessed 5/30/19). data and information. There is also the
Biomedical Informatics: The Science and the Pragmatics
21 1
patient’s view, which must be considered in payments were made available, however, only
the notion of patient-centered health care that when eligible organizations or individual prac-
is now broadly accepted and encouraged titioners implemented EHRs that were “certi-
(Ozkaynak et al. 2013). fied” as meeting minimal standards and when
they could document that they were making
“meaningful use” of those systems. You will
1.3 The US Government Steps In see references to such certification and mean-
ingful use criteria in other chapters in this
During the early decades of the evolution of volume.
clinical information systems for use in hospi- This volume also offers a discussion of
tals, patient care, and public health, the major HIT policy and the federal government in
role of government was in supporting the 7 Chap. 29. Although the process of EHR
research enterprise as new methods were implementation is approaching completion in
developed, tested, and formally evaluated. the US, both in health systems and practices,
The topic was seldom mentioned by the the current status is largely due to this legisla-
nation’s leaders, however, even during the tive program: because of the federal stimulus
1990s when the White House was viewed as package, large numbers of hospitals, systems,
being especially tech savvy. It was accordingly and practitioners invested in EHRs and incor-
remarkable when, in the President’s State of porated them into their practices. Furthermore,
the Union address in 2004 (and in each of the the demand for workers skilled in health infor-
following years of his administration), mation technology grew much more rapidly
President Bush called for universal implemen- than did the general job market, even within
tation of electronic health records within health care (. Fig. 1.11). It is a remarkable
10 years. The Secretary of Health and Human example of how government policy and invest-
Services, Tommy Thompson, was similarly ment can stimulate major transitions in sys-
supportive and, in May 2004, created an entity tems such as health care, where many observers
intended to support the expansion of the use had previously felt that progress had been
of EHRs—the Office of the National unacceptably slow (Shortliffe 2005).
Coordinator for Health Information
Technology (initially referred to by the full
acronym ONCHIT, but later shortened sim-
1.4 Defining Biomedical
ply to ONC). Informatics and Related
There was initially limited budget for ONC, Disciplines
although the organization served as a conven-
ing body for EHR-related planning efforts and With the previous sections of this chapter as
the National Health Information Infrastruc- background, let us now consider the scientific
ture (see 7 Chaps. 14, 15 and 29). The topic of discipline that is the subject of this volume
EHRs subsequently became a talking point for and has led to the development of many of
both major candidates during the Presidential the functionalities that need to be brought
election in 2008, with strong bipartisan sup- together in the integrated biomedical-
port. Then, in early 2009, Congress enacted computing environment of the future. The
the American Recovery and Reinvestment Act remainder of this chapter deals with biomedi-
(ARRA), also known as the economic “Stimu- cal informatics as a field and with biomedical
lus Bill”. One portion of that legislation was and health information as a subject of study.
known as the Health Information Technology It provides additional background needed to
for Economic and Clinical Health (HITECH) understand many of the subsequent chapters
Act. It was this portion of the bill that pro- in this book.
vided significant fiscal incentives for health Reference to the use of computers in bio-
systems, hospitals, and providers to implement medicine evokes different images depending
EHRs in their practices and eventual financial on the nature of one’s involvement in the field.
penalties for lack of implementation. Such To a hospital administrator, it might suggest
22 E. H. Shortliffe and M. F. Chiang
199
200 HITECH Act
February 2009
150
100
57
50
52
–50
Nov-07
Nov-08
Nov-09
Nov-10
Nov-11
Mar-07
Mar-08
Mar-09
Mar-10
Mar-11
May-07
May-08
May-09
May-10
May-11
Jan-10
Jan-11
Jan-12
Jul-07
Jul-08
Jul-09
Jul-10
Jul-11
Sep-07
Sep-08
Sep-09
Sep-10
Sep-11
Jan-07
Jan-08
Jan-09
.. Fig. 1.11 Impact of the HITECH Act on health ONC analysis of data from O’Reilly Job Data Mart,
information technology (IT) employment. Percent ONC Data Brief, No. 2, May 2012 (7 https://www.
change in online health IT job postings per month for healthit.g ov/sites/default/files/pdf/0512_ONCData-
first 3 years, relative to health care jobs and all jobs: nor- Brief2_JobPostings.pdf (Accessed 5/6/2019)))
malized to February 2009 when ARRA passed. (Source:
the maintenance of clinical-care records using difficult to understand how biomedical com-
computers; to a decision scientist, it might puting can help us to tie together the diverse
mean the assistance by computers in disease aspects of health care and its delivery.
diagnosis; to a basic scientist, it might mean To achieve a unified perspective, we might
the use of computers for maintaining, retriev- consider four related topics: (1) the concept of
ing, and analyzing gene-sequencing informa- biomedical information (why it is important
tion. Many physicians immediately think of in biological research and clinical practice and
office-practice tools for tasks such as patient why we might want to use computers to pro-
billing or appointment scheduling, and of cess it); (2) the structural features of medicine,
electronic health record systems for clinical including all those subtopics to which com-
documentation. Nurses often think of puters might be applied; (3) the importance of
computer- based tools for charting the care evidence-based knowledge of biomedical and
that they deliver, or decision-support tools health topics, including its derivation and
that assist in applying the most current proper management and use; and (4) the
patient-care guidelines. The field includes applications of computers and communica-
study of all these activities and a great many tion methods in biomedicine and the scientific
others too. More importantly, it includes the issues that underlie such efforts. We mention
consideration of various external factors that the first two topics briefly in this and the next
affect the biomedical setting. Unless you keep chapter, and we provide references in the
in mind these surrounding factors, it may be Suggested Readings section for readers who
Biomedical Informatics: The Science and the Pragmatics
23 1
wish to learn more. The third topic, knowl- only vaguely defined. Even today, the term
edge to support effective decision making in computer science is used more as a matter of
support of human health, is intrinsic to this convention than as an explanation of the
book and occurs in various forms in essen- field’s scientific content.
tially every chapter. The fourth topic, how- In the 1970s we began to use the phrase
ever, is the principal subject of this book. medical computer science to refer to the sub-
Computers have captured the imagination division of computer science that applies the
(and attention) of our society. Today’s younger methods of the larger field to medical topics.
individuals have grown up in a world in which As you will see, however, medicine has pro-
computers are ubiquitous and useful. Because vided a rich area for computer science
the computer as a machine is exciting, people research, and several basic computing insights
may pay a disproportionate amount of atten- and methodologies have been derived from
tion to it as such—at the expense of consider- applied medical-computing research.
ing what the computer can do given the The term information science, which is
numbers, concepts, ideas, and cognitive under- occasionally used in conjunction with com-
pinnings of fields such as medicine, health, and puter science, originated in the field of library
biomedical research. Computer scientists, phi- science and is used to refer, somewhat gener-
losophers, psychologists, and other scholars ally, to the broad range of issues related to the
increasingly consider such matters as the management of both paper-based and elec-
nature of information and knowledge and how tronically stored information. Much of what
human beings process such concepts. These information science originally set out to be is
investigations have been given a sense of time- now drawing evolving interest under the name
liness (if not urgency) by the simple existence cognitive science.
of the computer. The cognitive activities of cli- Information theory, in contrast, was first
nicians in practice probably have received more developed by scientists concerned about the
attention over the past three or four decades physics of communication; it has evolved into
than in all previous history (see 7 Chap. 4). what may be viewed as a branch of mathemat-
Again, the existence of the computer and the ics. The results scientists have obtained with
possibilities of its extending a clinician’s cogni- information theory have illuminated many
tive powers have motivated many of these processes in communications technology, but
studies. To develop computer-based tools to they have had little effect on our understand-
assist with decisions, we must understand ing of human information processing.
more clearly such human processes as diagno- The terms biomedical computing or bio-
sis, therapy planning, decision making, and computation have been used for a number of
problem solving in medicine. We must also years. They are non-descriptive and neutral,
understand how personal and cultural beliefs implying only that computers are employed
affect the way in which information is inter- for some purpose in biology or medicine.
preted and decisions are ultimately made. They are often associated with bioengineering
applications of computers, however, in which
the devices are viewed more as tools for a bio-
1.4.1 Terminology engineering application than as a primary
focus of research.
Although, starting in the 1960s, a growing In the 1970s, inspired by the French term
number of individuals conducting serious for computer science (informatique), the
biomedical research or undertaking clinical English-speaking community began to use the
practice had access to a computer system, term medical informatics. Those in the field
there was initial uncertainty about what name were attracted by the word’s emphasis on
should be used for the biomedical application information, which they saw as more central to
of computer science concepts. The name com- the field than the computer itself, and it gained
puter science was itself new in 1960 and was momentum as a term for the discipline, espe-
24 E. H. Shortliffe and M. F. Chiang
cially in Europe, during the 1980s. The term is called the Working Group on Biomedical
1 broader than medical computing (it includes Computing. In June 1999, the group provided
such topics as medical statistics, record keep- a report12 recommending that the NIH under-
ing, and the study of the nature of medical take an initiative called the Biomedical
information itself) and deemphasizes the Information Science and Technology Initiative
computer while focusing instead on the nature (BISTI). With the subsequent creation of
of the field to which computations are applied. another NIH organization called the
Because the term informatics became widely Bioinformatics Working Group, the visibility
accepted in the United States only in the late of informatics applications in biology was
1980s, medical information science was also greatly enhanced. Today bioinformatics is a
used earlier in North America; this term, major area of activity at the NIH13 and in
however, may be confused with library sci- many universities and biotechnology compa-
ence, and it does not capture the broader nies around the world. The explosive growth
implications of the European term. As a of this field, however, has added to the confu-
result, the name medical informatics appeared sion regarding the naming conventions we
by the late 1980s to have become the preferred have been discussing. In addition, the rela-
term, even in the United States. Indeed, this is tionship between medical informatics and bio-
the name of the field that we used in the first informatics became unclear. As a result, in an
two editions of this textbook (published in effort to be more inclusive and to embrace the
1990 and 2000), and it is still sometimes used biological applications with which many med-
in professional, industrial, and academic set- ical informatics groups had already been
tings. However, many observers expressed involved, the name medical informatics gradu-
concern that the adjective “medical” is too ally gave way to biomedical informatics
focused on physicians and disease, failing to (BMI). Several academic groups have changed
appreciate the relevance of this discipline to their names, and a major medical informatics
other health and life-science professionals and journal (Computers and Biomedical Research,
to health promotion and disease prevention. first published in 1967) was reborn in 2001 as
Thus, the term health informatics, or health The Journal of Biomedical Informatics.14
care informatics, gained some popularity, Despite this convoluted naming history,
even though it has the disadvantage of tend- we believe that the broad range of issues in
ing to exclude applications to biomedical biomedical information management does
research (7 Chaps. 9 and 26) and, as we shall require an appropriate name and, beginning
argue shortly, it tends to focus the field’s name with the third edition of this book (2006), we
on application domains (clinical care, public used the term biomedical informatics for this
health, and prevention) rather than the basic purpose. It has become the most widely
discipline and its broad range of applicability. accepted term for the core discipline and
Applications of informatics methods in should be viewed as encompassing broadly all
biology and genetics exploded during the areas of application in health, clinical prac-
1990s due to the human genome project11 and tice, and biomedical research. When we speak
the growing recognition that modern life- specifically about computers and their use
science research was no longer possible with- within biomedical informatics activities, we
out computational support and analysis (see use the terms biomedical computer science
7 Chaps. 9 and 26). By the late 1990s, the use (for the methodologic issues) or biomedical
of informatics methods in such work had computing (to describe the activity itself).
become widely known as bioinformatics and
the director of the National Institutes of
Health (NIH) appointed an advisory group 12 Available at 7 https://acd.od.nih.gov/documents/
reports/060399_Biomed_Computing_WG_RPT.
htm (Accessed 5/31/2019).
13 See 7 http://www.bisti.nih.gov/. (Accessed 5/31/2019).
11 7 https://www.ornl.gov/sci/techresources/Human_ 14 7 http://www.journals.elsevier.com/journal-of-bio-
Genome/home.shtml (Accessed 5/31/2019). medical-informatics (Accessed 5/30/19).
Biomedical Informatics: The Science and the Pragmatics
25 1
Note, however, that biomedical informatics discipline, has recognized the confusion
has many other component sciences in addi- regarding the field and its definition.15 They
tion to computer science. These include the accordingly appointed a working group to
decision sciences, statistics, cognitive science, develop a formal definition of the field and to
data science, information science, and even specify the core competencies that need to be
management sciences. We return to this point acquired by students seeking graduate training
shortly when we discuss the basic versus in the discipline. The resulting definition, pub-
applied nature of the field when it is viewed as lished in AMIA’s journal and approved by the
a basic research discipline. full board of the organization, identifies the
Although labels such as these are arbitrary, focus of the field in a simple sentence and then
they are by no means insignificant. In the case adds four clarifying corollaries that refine the
of new fields of endeavor or branches of sci- definition and the field’s scope and content
ence, they are important both in designating (7 Box 1.1). We adopt this definition, which is
the field and in defining or restricting its con- very similar to the one we offered in previous
tents. The most distinctive feature of the mod- editions of this text. It acknowledges that the
ern computer is the generality of its application. emergence of biomedical informatics as a new
The nearly unlimited range of computer uses discipline is due in large part to rapid advances
complicates the business of naming the field. in computing and communications technol-
As a result, the nature of computer science is ogy, to an increasing awareness that the knowl-
perhaps better illustrated by examples than by edge base of biomedicine is essentially
attempts at formal definition. Much of this unmanageable by traditional paper- based
book presents examples that do just this for methods, and to a growing conviction that the
biomedical informatics as well. process of informed decision making is as
The American Medical Informatics important to modern biomedicine as is the col-
Association (AMIA), which was founded in lection of facts on which clinical decisions or
the late 1980s under the former name for the research plans are made.
1010
IBM z13
109 POWER8
AMD K-10
108 ItaniumTM
Pentium® 4
107
Pentium® III
Transistors per chip
Pentium® II
106 Pentium®
i486TM
105 i386TM
80286
104
8086
103 8080
4004
102
101
100
1970 1980 1990 2000 2010 2020
.. Fig. 1.17 Moore’s Law. Former Intel chairman Gor- graph shows the exponential growth in the number of
don Moore is credited with popularizing the “law” that transistors that can be integrated on a single microproces-
the size and cost of microprocessor chips will half every sor chip. The trend continues to this day. (Source: Wiki-
18 months while they double in computing power. This pedia: 7 https://en.wikipedia.org/wiki/Transistor_count)
Biomedical Informatics: The Science and the Pragmatics
29 1
Box 1.2: The Nature of Medical Information consult Blois’ book on this subject (see Sug-
This material is adapted from a small portion gested Readings).
of a classic book on this topic. It was written by Let us examine an instance of what we will
Dr. Scott Blois, who coauthored the introduc- call a low-level (or readily formalized) science.
tory chapter to this textbook in its 1st edition, Physics is a natural starting point; in any dis-
which was published shortly after his death. Dr. cussion of the hierarchical relationships among
Blois was a scholar who directed the informatics the sciences (from the fourth-century BC Greek
program at the University of California San philosopher Aristotle to the twentieth-century
Francisco and served as the first president of the U.S. librarian Melvil Dewey), physics will be
American College of Medical Informatics placed near the bottom. Physics characteristi-
(ACMI). [Blois, M. S. (1984). Information and cally has a certain kind of simplicity, or gener-
medicine: The nature of medical descriptions. ality. The concepts and descriptions of the
Berkeley: University of California Press]. objects and processes of physics, however, are
From the material in this chapter, you necessarily used in all applied fields, including
might conclude that biomedical applications medicine. The laws of physics and the descrip-
do not raise any unique problems or concerns. tions of certain kinds of physical processes are
On the contrary, the biomedical environment essential in representing or explaining func-
raises several issues that, in interesting ways, tions that we regard as medical in nature. We
are quite distinct from those encountered in need to know something about molecular phys-
most other domains of applied computing. ics, for example, to understand why water is
Clinical information seems to be systemati- such a good solvent; to explain how nutrient
cally different from the information used in molecules are metabolized, we talk about the
physics, engineering, or even clinical chemis- role of electron-transfer reactions.
try (which more closely resembles chemical Applying a computer (or any formal com-
applications generally than it does medical putation) to a physical problem in a medical
ones). Aspects of biomedical information context is no different from doing so in a phys-
include an essence of uncertainty—we can ics laboratory or for an engineering applica-
never know all about a physiological pro- tion. The use of computers in various low-level
cess—and this results in inevitable variability processes (such as those of physics or chemis-
among individuals. These differences raise try) is similar and is independent of the appli-
special problems and some investigators sug- cation. If we are talking about the solvent
gest that biomedical computer science differs properties of water, it makes no difference
from conventional computer science in funda- whether we happen to be working in geology,
mental ways. We shall explore these differ- engineering, or medicine. Such low-level pro-
ences only briefly here; for details, you can cesses of physics are particularly receptive to
Biomedical Informatics: The Science and the Pragmatics
31 1
mathematical treatment, so using computers to encode and process this information using
for these applications requires only conven- the tools of mathematics and computer sci-
tional numerical programming. ence that work so well at low levels. In light of
In biomedicine, however, there are other these remarks, the general enterprise known
higher-level processes carried out in more com- as artificial intelligence (AI) can be aptly
plex objects such as organisms (one type of described as the application of computer sci-
which is patients). Many of the important ence to high-level, real-world problems.
informational processes are of this kind. Biomedical informatics thus includes com-
When we discuss, describe, or record the prop- puter applications that range from processing
erties or behavior of human beings, we are of very low-level descriptions, which are little
using the descriptions of very high-level different from their counterparts in physics,
objects, the behavior of whom has no counter- chemistry, or engineering, to processing of
part in physics or in engineering. The person extremely high-level ones, which are com-
using computers to analyze the descriptions pletely and systematically different. When we
of these high-level objects and processes study human beings in their entirety (includ-
encounters serious difficulties (Blois 1984). ing such aspects as human cognition, self-con-
One might object to this line of argument sciousness, intentionality, and behavior), we
by remarking that, after all, computers are must use these high-level descriptions. We will
used routinely in commercial applications in find that they raise complex issues to which
which human beings and situations concern- conventional logic and mathematics are less
ing them are involved and that relevant com- readily applicable. In general, the attributes of
putations are carried out successfully. The low-level objects appear sharp, crisp, and
explanation is that, in these commercial unambiguous (e.g., “length,” “mass”), whereas
applications, the descriptions of human those of high-level ones tend to be soft, fuzzy,
beings and their activities have been so highly and inexact (e.g., “unpleasant scent,” “good”).
abstracted that the events or processes have Just as we need to develop different meth-
been reduced to low-level objects. In biomed- ods to describe high-level objects, the infer-
icine, abstractions carried to this degree ence methods we use with such objects may
would be worthless from either a clinical or differ from those we use with low-level ones.
research perspective. In formal logic, we begin with the assumption
For example, one instance of a human that a given proposition must be either true or
being in the banking business is the customer, false. This feature is essential because logic is
who may deposit, borrow, withdraw, or invest concerned with the preservation of truth value
money. To describe commercial activities such under various formal transformations. It is
as these, we need only a few properties; the difficult or impossible, however, to assume
customer can remain an abstract entity. In that all propositions have truth values when
clinical medicine, however, we could not begin we deal with the many high-level descriptions
to deal with a patient represented with such in medicine or, indeed, in everyday situations.
skimpy abstractions. We must be prepared to Such questions as “Was Woodrow Wilson a
analyze most of the complex behaviors that good president?” cannot be answered with a
human beings display and to describe patients “yes” or “no” (unless we limit the question to
as completely as possible. We must deal with specific criteria for determining the goodness
the rich descriptions occurring at high levels of presidents). Many common questions in
in the hierarchy, and we may be hard pressed biomedicine have the same property.
32 E. H. Shortliffe and M. F. Chiang
.. Fig. 1.20 Biomedical informatics as basic science. set of concepts and techniques from the field of bio-
We view the term biomedical informatics as referring to medical informatics. Note that work in biomedical
the basic science discipline in which the development informatics is motivated totally by the application
and evaluation of new methods and theories are a pri- domains that the field is intended to serve (thus the two-
mary focus of activity. These core concepts and meth- headed arrows in the diagram). Therefore the basic
ods in turn have broad applicability in the health and research activities in the field generally result from the
biomedical sciences. The informatics subfields indicated identification of a problem in the real world of health
by the terms across the bottom of this figure are accord- or biomedicine for which an informatics solution is
ingly best viewed as application domains for a common sought (see text)
Biomedical informatics is perhaps best cially tight coupling between the application
viewed as a basic biomedical science, with a areas, broad categories of which are indicated
wide variety of potential areas of application at the bottom of . Fig. 1.20, and the identifi-
(. Fig. 1.20). The analogy with other basic sci- cation of basic research tasks that character-
ences is that biomedical informatics uses the ize the scientific underpinnings of the field.
results of past experience to understand, struc- Research, however, has shown that there can
ture, and encode objective and subjective bio- be a very long period of time between the
medical findings and thus to make them development of new concepts and methods in
suitable for processing. This approach supports basic research and their eventual application
the integration of the findings and their analy- in the biomedical world (Balas and Boren
ses. In turn, the selective distribution of newly 2000). Furthermore (see . Fig. 1.21), many
created knowledge can aid patient care, health discoveries are discarded along the way, leav-
planning, and basic biomedical research. ing only a small percentage of basic research
Biomedical informatics is, by its nature, an discoveries that have a practical influence on
experimental science, characterized by posing the health and care of patients.
questions, designing experiments, performing Work in biomedical informatics (BMI) is
analyses, and using the information gained to inherently motivated by problems encoun-
design new experiments. One goal is simply to tered in a set of applied domains in biomedi-
search for new knowledge, called basic cine. The first of these historically has been
research. A second goal is to use this knowl- clinical care (including medicine, nursing,
edge for practical ends, called applications dentistry, and veterinary care), an area of
(applied) research. There is a continuity activity that demands patient-oriented infor-
between these two endeavors (see . Fig. 1.20). matics applications. We refer to this area as
In biomedical informatics, there is an espe- clinical informatics.18 It includes several sub-
Biomedical Informatics: The Science and the Pragmatics
33 1
Original research (100%)
Negative results :18%
Variable
5.8 years
Inplementation
(14%)
.. Fig. 1.21 Phases in the transfer of research into National utilization rates of specific, well-substantiated
clinical practice. A synthesis of studies focusing on vari- procedures also suggest a delay of two decades in reach-
ous phases of this transfer has indicated that it takes an ing the majority of eligible patients. For a well-docu-
average of 17 years to make innovation part of routine mented study of such delays and their impact in an
care (Balas and Boren 2000). Pioneering institutions important area of clinical medicine, see (Krumholz
often apply innovations much sooner, sometimes within et al. 1998). (Figure courtesy of Dr. Andrew Balas, used
a few weeks, but nationwide introduction is usually slow. with permission)
topics and areas of specialized expertise, lar methods are generalized for application to
including patient-care foci such as nursing populations of patients rather than to single
informatics, dental informatics, and even vet- individuals (see 7 Chap. 18). Thus clinical
erinary informatics. Furthermore, the former informatics and public health informatics
name of the discipline, medical informatics, is share many of the same methods and tech-
now reserved for those applied research and niques. The closeness of their relationship was
practice topics that focus on disease and the amply demonstrated by the explosion in infor-
role of physicians. As was previously dis- matics research and applications that occurred
cussed, the term “medical informatics” is no in response to the COVID-19 pandemic.19 By
longer used to refer to the discipline as a mid-2020, several articles had appeared to
whole. demonstrate the tight relationship between
Closely tied to clinical informatics is public EHRs and public health informatics for man-
health informatics (. Fig. 1.20), where simi- agement of the outbreak (Reeves et al. 2020).
Two other large areas of application overlap
in some ways with clinical informatics and pub-
lic health informatics. These include imaging
18 Clinical informatics was approved in 2013 by the
informatics (and the set of issues developed
American Board of Medical Specialties as a for-
mal subspecialty of medicine (Finnell and Dixon, around both radiology and other image man-
2015), with board certification examinations offered agement and image analysis domains such as
for eligible candidates by the American Board of pathology, dermatology, and molecular visual-
Preventive Medicine (7 https://www.theabpm.org/ ization—see 7 Chaps. 10 and 22). Finally, there
become-certified/subspecialties/clinical-informatics/
(Accessed 6/1/19)). AMIA is formulating a similar
certification program, AMIA Health Informatics
Certification (AHIC) for non-physicians who are
working in the clinical informatics area (7 https:// 19 7 https://www.amia.org/COVID19 (Accessed
www.amia.org/ahic, Accessed 1/5/2020). 05/03/2020)
34 E. H. Shortliffe and M. F. Chiang
Health
informatics
Applied research Imaging Clinical Public health
and practice Bioinformatics
informatics informatics informatics
Molecular and
Tissues and Individuals Populations
cellular
organs (patients) and society
processes
.. Fig. 1.22 Building on the concepts of . Fig. 1.20, others not illustrated (see text). Note that “health infor-
this diagram demonstrates the breadth of the biomedi- matics” is the term used to refer to applied research and
cal informatics field. The relationship between biomedi- practice in clinical and public health informatics. It is
cal informatics as a core scientific discipline and its not a synonym for the underlying discipline, which is
diverse array of application domains that span biologi- “biomedical informatics”
cal science, imaging, clinical practice, public health, and
is the burgeoning area of bioinformatics, which and concepts. Similarly, the term health infor-
at the molecular and cellular levels is offering matics, which refers to applied research and
challenges that draw on many of the same infor- practice in clinical and public-health informat-
matics methods as well (see 7 Chaps. 9 and 26). ics, is also not an appropriate name for the core
As is shown in . Fig. 1.22, there is a spec- discipline, since BMI is applicable to basic
trum as one moves from left to right across human biology as well as to health.
these BMI application domains. In bioinfor- We acknowledge that the four major
matics, workers deal with molecular and cel- areas of application shown in . Fig. 1.19
lular processes in the application of have “fuzzy” boundaries, and many areas of
informatics methods. At the next level, work- applied informatics research involve more
ers focus on tissues and organs, which tend to than one of the categories. For example, bio-
be the emphasis of imaging informatics work molecular imaging involves both bioinfor-
(also called structural informatics by some matics and imaging informatics concepts.
investigators). Progressing to clinical infor- Similarly, personal or consumer health infor-
matics, the focus is on individual patients, and matics (see 7 Chap. 11) includes elements of
finally to public health, where researchers both clinical informatics and public-health
address problems of populations and of soci- informatics. Another important area of BMI
ety, including prevention. The core science of research activities is pharmacogenomics (see
biomedical informatics has important contri- 7 Chap. 27), which is the effort to infer genetic
butions to make across that entire spectrum, determinants of human drug response. Such
and many informatics methods are broadly work requires the analysis of linked genotypic
applicable across the full range of domains. and phenotypic databases, and therefore lies at
Note from . Fig. 1.20 that biomedical the intersection of bioinformatics and clinical
informatics and bioinformatics are not syn- informatics. Similarly, 7 Chap. 28 presents
onyms and it is incorrect to refer to the scientific the role of informatics in precision medicine,
discipline as bioinformatics, which is, rather, an which relies heavily on both bioinformatics
important area of application of BMI methods and clinical informatics concepts and systems.
Biomedical Informatics: The Science and the Pragmatics
35 1
Translational research
.. Fig. 1.23 A Venn diagram that depicts the relation- relies on Translational Bioinformatics and Clinical
ships among the three major disciplines: biological Research Informatics, constitutes the area of common
research, clinical medicine / public health, and biomedi- overlap among all three Venn circles. (Adapted with per-
cal informatics. Bioinformatics, Health Informatics, and mission from a diagram developed by the Department
Translational Research lie at the intersections among of Biomedical Informatics at the Vanderbilt Medical
pairs of these fields as shown. Precision Medicine, which Center, Nashville, TN)
interest are of course important complements time. Although a diagnosis may be one of the
1 to the core exposures that students should first things physicians think about when they
receive (Kulikowski et al. 2012), but, given the see a new patient, patient assessment (diagno-
need for teamwork and understanding in the sis, management, analysis of treatment results,
field, separating trainees based on the applica- monitoring of disease progression, etc.) is a
tion areas that may interest them would be process that never really terminates. A physi-
counterproductive and wasteful.23 cian must be flexible and open-minded. It is
The scientific contributions of BMI also generally appropriate to alter the original
can be appreciated through their potential for diagnosis if it turns out that treatment based
benefiting the education of health profession- on it is unsuccessful or if new information
als (Shortliffe 2010). For example, in the edu- weakens the evidence supporting the diagno-
cation of medical students, the various sis or suggests a second and concurrent disor-
cognitive activities of physicians traditionally der. 7 Chapter 4 discusses these issues in
have tended to be considered separately and in greater detail.
isolation—they have been largely treated as When we speak of making a diagnosis,
though they are independent and distinct choosing a treatment, managing therapy,
modules of performance. One activity fre- making decisions, monitoring a patient, or
quently emphasized is formal education preventing disease, we are using labels for dif-
regarding medical decision making (see ferent aspects of medical care, an entity that
7 Chap. 3). The specific content of this area has overall unity. The fabric of medical care is
continues to evolve, but the discipline’s depen- a continuum in which these elements are
dence on formal methods regarding the use of tightly interwoven. Regardless of whether we
knowledge and information reveal that it is view computer and information science as a
one aspect of biomedical informatics. profession, a technology, or a science, there is
A particular topic in the study of medical no doubt about its importance to biomedi-
decision making is diagnosis, which is often cine. We can assume computers are here to
conceived and taught as though it were a free- stay as fundamental tools to be used in clini-
standing and independent activity. Medical cal practice, biomedical research, and health
students may thus be led to view diagnosis as science education.
a process that physicians carry out in isolation
before choosing therapy for a patient or pro-
ceeding to other modular tasks. A number of
studies have shown that this model is oversim-
1.4.4 Relationship to Computer
plified and that such a decomposition of cog- Science
nitive tasks may be quite misleading (Elstein
et al. 1978a; Patel and Groen 1986). Physicians During its evolution as an academic entity in
seem to deal with several tasks at the same universities, computer science followed an
unsettled course as involved faculty attempted
to identify key topics in the field and to find
the discipline’s organizational place. Many
computer science programs were located in
23 Many current biomedical informatics training pro- departments of electrical engineering, because
grams were designed with this perspective in mind. major concerns of their researchers were com-
Students with interests in clinical, imaging, public
health, and biologic applications are often trained
puter architecture and design and the devel-
together and are required to learn something about opment of practical hardware components.
each of the other application areas, even while spe- At the same time, computer scientists were
cializing in one subarea for their own research. Sev- interested in programming languages and
eral such programs were described in a series of software, undertakings not particularly char-
articles in the Journal of Biomedical Informatics in
2007 (Tarczy-Hornoch et al. 2007) and many more
acteristic of engineering. Furthermore, their
have been added since that time. work with algorithm design, computability
Biomedical Informatics: The Science and the Pragmatics
37 1
theory,24 and other theoretical topics seemed simply the study of computer science with a
more related to mathematics. “biomedical flavor”? If you return to the defi-
Biomedical informatics draws from all of nition of biomedical informatics that we pro-
these activities—development of hardware, vided in 7 Box 1.1, and then refer to
software, and computer science theory. . Fig. 1.20, you will begin to see why bio-
Biomedical computing generally has not had medical informatics is more than simply the
a large enough market to influence the course biomedical application of computer science.25
of major hardware developments; i.e., com- The issues that it addresses not only have
puters serve general purposes and have not broad relevance to health, medicine, and biol-
been developed specifically for biomedical ogy, but the underlying sciences on which
applications. Not since the early 1960s (when BMI professionals draw are inherently inter-
health-computing experts occasionally talked disciplinary as well (and are not limited to
about and, in a few instances, developed spe- computer science topics). Thus, for example,
cial medical terminals) have people assumed successful BMI research will often draw on,
that biomedical applications would use hard- and contribute to, computer science, but it
ware other than that designed for general use. may also be closely related to the decision sci-
The question of whether biomedical appli- ences (probability theory, decision analysis, or
cations would require specialized program- the psychology of human problem solving),
ming languages might have been answered cognitive science, information sciences, or the
affirmatively in the 1970s by anyone examin- management sciences (. Fig. 1.24).
ing the MGH Utility Multi-Programming Furthermore, a biomedical informatics
System, known as the MUMPS language researcher will be tightly linked to some
(Greenes et al. 1970; Bowie and Barnett 1976), underlying problem from the real world of
which was specially developed for use in med- health or biomedicine. As . Fig. 1.24 illus-
ical applications. For several years, MUMPS trates, for example, a biomedical informatics
was the most widely used language for medi- basic researcher or doctoral student will typi-
cal record processing. Under its subsequent cally be motivated by one of the application
name, M, it is still in widespread use and has areas, such as those shown at the bottom of
been used to develop commercial electronic . Fig. 1.22, but a dissertation worthy of a
health record systems. New implementations PhD in the field will usually be identified by a
have been developed for each generation of generalizable scientific result that also con-
computers. M, however, like any program- tributes to one of the component disciplines
ming language, is not equally useful for all (. Fig. 1.20) and on which other scientists
computing tasks. In addition, the software can build in the future.
requirements of medicine are better under-
stood and no longer appear to be unique;
rather, they are specific to the kind of task. A
program for scientific computation looks
pretty much the same whether it is designed 25 In fact, the multidisciplinary nature of biomedical
for chemical engineering or for pharmacoki- informatics has led the informatics term to be bor-
netic calculations. rowed in other disciplines, including computer sci-
How, then, does BMI differ from biomedi- ence organizations, even though the English name
for the field was first adopted in the biomedical con-
cal computer science? Is the new discipline
text. Today we even have generic full departments of
informatics in the US (e.g., see 7 https://informat-
ics.njit.edu, Accessed 11/28/2020) and in other parts
of the world as well (e.g., 7 http://www.sussex.ac.
24 Many interesting problems cannot be computed in a uk/informatics/. Accessed 1/5/2020). In the US,
reasonable time and require heuristics. Computabil- there are full schools with informatics in their title
ity theory is the foundation for assessing the feasi- (e.g., 7 https://luddy.indiana.edu/index.html.
bility and cost of computation to provide the Accessed 1/5/2020) and even a School of Biomedical
complete and correct results to a formally stated Informatics (7 https://sbmi.uth.edu/. Accessed
problem. 1/2/2020).
38 E. H. Shortliffe and M. F. Chiang
Computer
science
decision Draw upon....
science
statistics Contributes to....
cognitive
science
information
sciences
management Clinical or
sciences biomedical
other domain of
component Applied interest
sciences information
Draws upon....
.. Fig. 1.24 Component sciences in biomedical infor- cal domain is ultimately to benefit. At the methodologic
matics. An informatics application area is motivated by level, biomedical informatics draws on, and contributes
the needs of its associated biomedical domain, to which to, a wide variety of component disciplines, of which
it attempts to contribute solutions to problems. Thus computer science is only one. As . Figs. 1.20 and 1.22
any applied informatics work draws upon a biomedical show explicitly, biomedical informatics is inherently
domain for its inspiration, and in turn often leads to the multidisciplinary, both in its areas of application and in
delineation of basic research challenges in biomedical the component sciences on which it draws
informatics that must be tackled if the applied biomedi-
information storage devices,27 including the considered adequate for making diagnoses
1 movement of files to the “cloud”, are facilitat- and planning treatments. In fact, medical stu-
ing the inexpensive storage of large amounts dents who are taught by more experienced
of data, thus improving the feasibility of data- physicians to find subtle diagnostic signs by
intensive applications, such as drawing infer- examining various parts of the body nonethe-
ences from human genome datasets (see less often choose to bypass or deemphasize
7 Chaps. 9, 26, and 28) and the all-digital physical examinations in favor of ordering
radiology department (7 Chap. 22). one test after another. Sometimes, they do so
Standardization of hardware and advances in without paying sufficient attention to the
network technology are making it easier to ensuing cost. Some new technologies replace
share data and to integrate related less expensive, but technologically inferior,
information-management functions within a tests. In such cases, the use of the more expen-
hospital or other health care organization, sive approach is generally justified.
although inadequacies in standards for encod- Occasionally, computer-related technologies
ing and sharing data continue to be challeng- have allowed us to perform tasks that previ-
ing (7 Chaps. 7, 14, 15, and 16). ously were not possible. For example, the
The second factor is the frustratingly slow scans produced with computed tomography
increase in the number of professionals who or magnetic resonance imaging (see 7 Chaps.
are being trained to understand the biomedical 10 and 22) have allowed physicians to visual-
issues as well as the technical and engineering ize cross- sectional slices of the body, and
ones. Computer scientists who understand bio- medical instruments in intensive care units
medicine are better able to design systems perform continuous monitoring of patients’
responsive to actual needs and sensitive to body functions that previously could be
workflow and the clinical culture. Health pro- checked only episodically (see 7 Chap. 21).
fessionals who receive formal training in BMI The development of expensive new tech-
are likely to build systems using well-estab- nologies, and the belief that more technology
lished techniques while avoiding the past mis- is better, have helped to fuel rapidly escalating
takes of other developers. As more professionals health care costs. In the 1970s and 1980s, such
are trained in the special aspects of both fields, rising costs led to the introduction of man-
and as the programs they develop are intro- aged care and capitation—changes in financ-
duced, health care professionals are more likely ing and delivery that were designed to curb
to have useful and usable systems available spending. Today we are seeing a trend toward
when they turn to the computer for help with value-based reimbursement, which is predi-
information management tasks. cated on the notion that payment for care of
The third factor affecting the integration patients should be based on the demonstrated
of computing technologies into health care value received (as defined by high quality at
settings is our evolving health care system and low cost) rather than simply the existence of
the increasing pressure to control medical an encounter or procedure. Integrated com-
spending. The escalating tendency to apply puter systems can provide the means to cap-
technology to all patient-care tasks is a fre- ture data to help assess such value, while they
quently cited phenomenon in modern medical also support detailed cost accounting, the
practice. Mere physical findings no longer are analysis of the relationship of costs of care to
the benefits of that care, evaluation of the
27 Technological progress in this area is occurring at a
quality of care provided, and identification of
dizzying rate. Consider, for example, the announce- areas of inefficiency. Systems that improve the
ment that scientists are advancing the notion of quality of care while reducing the cost of pro-
using DNA for data storage and can store as much viding that care clearly will be favored. The
as 704 terabytes of information in a gram of DNA. effect of cost containment pressures on tech-
7 http://www.engadget.com/2012/08/19/harvard-
stores-704tb-in-a-gram-of-dna; 7 https://homes.
nologies that increase the cost of care while
cs.washington.edu/~bornholt/dnastorage-asplos16/ improving the quality are less clear. Medical
(Accessed 5/30/19). technologies, including computers, will be
Biomedical Informatics: The Science and the Pragmatics
41 1
embraced only if they improve the delivery of Coiera, E. (2015). Guide to health informatics (3rd
clinical care while either reducing costs or pro- ed.). Boca Raton, FL: CRC Press. This intro-
viding benefits that clearly exceed their costs. ductory text is a readable summary of clinical
Designers of medical systems must address and public health informatics, aimed at mak-
satisfactorily many logistical and engineering ing the domain accessible and understandable
questions before innovative solutions are inte- to the non-specialist.
grated optimally into medical practice. For Collen, M. F., & Ball, M. J. (Eds.). (2015). A his-
example, are the machines conveniently tory of medical informatics in the United States
located? Should mobile devices further replace (2nd ed.). London: Springer. This comprehen-
tethered workstations? Can users complete sive book traces the history of the field of
their tasks without excessive delays? Is the sys- medical informatics, and identifies the origins
tem reliable enough to avoid loss of data? Can of the discipline’s name (which first appeared
users interact easily and intuitively with the in the English-language literature in 1974).
computer? Does it facilitate rather than dis- The original (1995) edition was being updated
rupt workflow? Are patient data secure and by Dr. Collen when he passed away shortly
appropriately protected from prying eyes? In after his 100th birthday. Dr. Ball organized an
addition, cost-control pressures produce a effort to complete the 2nd edition, enlisting
growing reluctance to embrace expensive tech- participation by many leaders in the field.
nologies that add to the high cost of health Elstein, A. S., Shulman, L. S., & Sprafka, S. A.
care. The net effect of these opposing trends is (1978b). Medical problem solving: An analysis
in large part determining the degree to which of clinical reasoning. Cambridge, MA: Harvard
specific systems are embraced and effectively University Press. This classic collection of
implemented in the health care environment. papers describes detailed studies that have illu-
In summary, rapid advances in communi- minated several aspects of the ways in which
cations, computer hardware, and software, expert and novice physicians solve medical
coupled with an increasing computer literacy problems. The seminal work described remains
of health care professionals and researchers, highly relevant to today’s work on problem
favor the implementation of effective com- solving and clinical decision support systems.
puter applications in clinical practice, public Friedman, C. P., Altman, R. B., Kohane, I. S.,
health, and life sciences research. Furthermore, McCormick, K. A., Miller, P. L., Ozbolt,
in the increasingly competitive health care J. G., Shortliffe, E. H., Stormo, G. D.,
industry, providers have a greater need for the Szczepaniak, M. C., Tuck, D., & Williamson,
information management capabilities sup- J. (2004). Training the next generation of
plied by computer systems. The challenge is to informaticians: The impact of BISTI and bio-
demonstrate in persuasive and rigorous ways informatics. Journal of American Medical
the financial and clinical advantages of these Informatics Association, 11, 167–172. This
systems (see 7 Chap. 13). important analysis addresses the changing
nature of biomedical informatics due to the
nnSuggested Readings revolution in bioinformatics and computa-
Blois, M. S. (1984b). Information and medicine: tional biology. Implications for training, as
The nature of medical descriptions. Berkeley: well as organization of academic groups and
University of California Press. In this classic curriculum development, are discussed.
volume, the author analyzes the structure of Hoyt, R. E., & Hersh. W. R. (2018). Health infor-
medical knowledge in terms of a hierarchical matics: Practical guide (7th ed). Raleigh: Lulu.
model of information. He explores the ideas com. This introductory volume provides a
of high- and low-level sciences and suggests broad view of informatics and is aimed espe-
that the nature of medical descriptions cially at health professionals in management
accounts for difficulties in applying comput- roles or IT professionals who are entering the
ing technology to medicine. A brief summary clinical world.
of key elements in this book is included as Box Institute of Medicine25. (1991 [revised 1997]). The
1.2 in this chapter. computer- based patient record: An essential
42 E. H. Shortliffe and M. F. Chiang
technology for health care. Washington, DC: cussed in greater detail in Chapter 8), and
1 National Academy Press. National Research artificial intelligence (which is discussed in in
Council (1997). For The Record: Protecting many chapters throughout this volume).
Electronic Health Information. Washington, National Academy of Medicine. (2019). Taking
DC: National Academy Press. National action against clinician burnout: A systems
Research Council (2000). Networking Health: approach to professional well-being. Washington,
Prescriptions for the Internet. Washington, DC: National Academy Press. This consensus
DC: National Academy Press. This set of three study from the National Academy of Medicine
reports from branches of the US National discusses the problem of clinician burnout in
Academies of Science has had a major influ- the United States, including areas where health
ence on health information technology educa- care information technology may contribute or
tion and policy over the last 25 years. reduce these problems.
Institute of Medicine25. (2000). To err is human: Shortliffe, E. (1993b). Doctors, patients, and com-
Building a safer health system. Washington, puters: Will information technology dehuman-
DC: National Academy Press. Institute of ize health care delivery? Proceedings of the
Medicine (2001). Crossing the Quality Chasm: American Philosophical Society, 137(3), 390–
A New Health Systems for the 21st Century. 398 In this paper, the author examines the fre-
Washington, DC: National Academy Press. quently expressed concern that the introduction
Institute of Medicine (2004). Patient Safety: of computing technology into health care set-
Achieving a New Standard for Care. tings will disrupt the development of rapport
Washington, DC: National Academy Press. between clinicians and patients and thereby
This series of three reports from the Institute dehumanize the therapeutic process. He argues,
of Medicine has outlined the crucial link rather, that computers may eventually have pre-
between heightened use of information tech- cisely the opposite effect on the relationship
nology and the enhancement of quality and between clinicians and their patients.
reduction in errors in clinical practice. Major
programs in patient safety have resulted from ??Questions for Discussion
these reports, and they have provided motiva- 1. How do you interpret the phrase
tion for a heightened interest in health care “logical behavior”? Do computers
information technology among policy makers, behave logically? Do people behave
provider organizations, and even patients. logically? Explain your answers.
Kalet, I. J. (2013). Principles of biomedical infor- 2. What do you think it means to say that
matics (2nd ed.). New York: Academic. This a computer program is “effective”?
volume provides a technical introduction to Make a list of a dozen computer appli-
the core methods in BMI, dealing with stor- cations with which you are familiar. List
age, retrieval, display, and use of biomedical the applications in decreasing order of
data for biological problem solving and medi- effectiveness, as you have explained this
cal decision making. Application examples are concept. Then, for each application,
drawn from bioinformatics, clinical informat- indicate your estimate of how well
ics, and public health informatics. human beings perform the same tasks
National Academy of Medicine. (2018). Procuring (this will require that you determine
interoperability: Achieving high-quality, con- what it means for a human being to be
nected, and person-centered care. Washington, effective). Do you discern any pattern?
DC: National Academy Press. National If so, how do you interpret it?
Academy of Medicine (2019). Artificial 3. Discuss three society-wide factors that
Intelligence in Health Care: The Hope, the will determine the extent to which
Hype, the Promise, the Peril. Washington, computers are assimilated into clinical
DC: National Academy Press. This series of practice.
two reports from the National Academy of 4. Reread the future vision presented in
Medicine outlines emerging issues in biomedi- 7 Sect. 1.1. Describe the characteris-
cal informatics: interoperability (which is dis- tics of an integrated environment for
Biomedical Informatics: The Science and the Pragmatics
43 1
managing clinical information. Discuss systems should mitigate this
two ways (either positive or negative) problem. What are three specific
in which such a system could change ways in which they could be
clinical practice. reducing the number of adverse
5. Do you believe that improving the tech- events in hospitals?
nical quality of health care entails the (b) Are there ways in which computer-
risk of dehumanization? If so, is it based systems could increase the
worth the risk? Explain your reasoning. incidence of medical errors?
6. Consider . Fig. 1.20, which shows Explain.
that bioinformatics, imaging informat- (c) Describe a practical experiment that
ics, clinical informatics, and public could be used to examine the impact
health informatics are all application of an EHR system on patient safety.
domains of the biomedical informatics In other words, propose a study
discipline because they share the same design that would address whether
core methods and theories: the computer-based system increases
(a) Briefly describe two examples of or decreases the incidence of pre-
core biomedical informatics ventable adverse events in hospi-
methods or theories that can be tals – and by how much.
applied both to bioinformatics (d) What are the limitations of the
and clinical informatics. experimental design you proposed
(b) Imagine that you describe in (c)?
. Fig. 1.20 to a mathematics fac-
8. It has been argued that the ability to
ulty member, who responds that
capture “nuance” in the description of
“in that case, I’d also argue that
what a clinician has seen when
statistics, computer science, and
examining or interviewing a patient
physics are all application domains
may not be as crucial as some people
of math because they share the
think. The desire to be able to express
same core mathematical methods
one’s thoughts in an unfettered way
and theories.” In your opinion, is
(free text) is often used to argue against
this a legitimate argument? In
the use of structured data-entry
what ways is this situation similar
methods using a controlled vocabulary
to, and different from, the case of
and picking descriptors from lists
biomedical informatics?
when recording information in an
(c) Why is biomedical informatics not
EHR.
simply computer science applied to
(a) What is your own view of this
biomedicine, or to the practice of
argument? Do you believe that it is
medicine, using computers?
important to the quality and/or
(d) How would you describe the
efficiency of care for clinicians to
relevance of psychology and
be able to record their observations,
cognitive science to the field of
at least part of the time, using free
biomedical informatics? (Hint: See
text/natural language?
. Fig. 1.24)
(b) Many clinicians have been
7. In 2000, a major report by the Institute
unwilling to use an EHR system
of Medicine28 entitled “To Err is
requiring structured data entry
Human: Building a Safer Health
System” (see Suggested Readings) stated
that up to 98,000 patient deaths were 28 The Institute of Medicine (IOM), part of the former
being caused by preventable medical National Academy of Sciences (NAS) was reorga-
nized in 2015 to become the National Academy of
errors in American hospitals each year.
Medicine (NAM). The NAS is now known as the
(a) It has been suggested that effective National Academies of Science, Engineering, and
electronic health record (EHR) Medicine (NASEM).
44 E. H. Shortliffe and M. F. Chiang
because of the increased time and effectiveness of beta-blockers for the treatment
1 required for documentation at the of elderly patients after acute myocardial infarc-
tion: National Cooperative Cardiovascular Project.
point of care and constraints on
JAMA, 19; 280(7), 623–9.
what can be expressed. What are Kulikowski, C. A., Shortliffe, E. H., et al. (2012). AMIA
two strategies that could be used Board white paper: Definition of biomedical informat-
to address this problem (other ics and specification of core competencies for graduate
than “designing a better user education in the discipline. Journal of the American
Medical Informatics Association, 19(6), 931–938.
interface for the system”)?
Ozkaynak, M., Brennan, P. F., Hanauer, D. A., Johnson,
S., Aarts, J., Zheng, K., & Haque, S. N. (2013).
Patient-centered care requires a patient oriented
References workflow model. Journal of the American Medical
Informatics Association: JAMIA, 20(e1), e14–e16.
Balas, E. A., & Boren, S. A. (2000). Managing clini- Patel, V. L., & Groen, G. J. (1986). Knowledge based
cal knowledge for health care improvement. In solution strategies in medical reasoning. Cognitive
Yearbook of medical informatics 2000: Patient- Science, 10(1), 91–116.
centered systems (pp. 65–70). Stuttgart: Schattauer. Reeves, J. J., Hollandsworth, H. M., Torriani, F. J.,
Begun, J. W., Zimmerman, B., & Dooley, K. (2003). et al. (2020). Rapid response to COVID-19: Health
Health care organizations as complex adaptive informatics support for outbreak management in
systems. In S. M. Mick & M. Wyttenbach (Eds.), an academic health system. Journal of the American
Advances in health care organization theory (pp. 253– Medical Informatics Association: JAMIA, ocaa037.
288). San Francisco: Jossey-Bass. https://doi.org/10.1093/jamia/ocaa037.
Blois, M. S. (1984). Information and medicine: The Schwartz, W. B. (1970). Medicine and the computer: The
nature of medical descriptions. Berkeley: University promise and problems of change. The New England
of California Press. Journal of Medicine, 283(23), 1257–1264.
Bowie, J., & Barnett, G. O. (1976). MUMPS: An eco- Shortliffe, E. H. (1993a). Doctors, patients, and com-
nomical and efficient time-sharing system for puters: Will information technology dehumanize
information management. Computer Programs in health-care delivery? Proceedings of the American
Biomedicine, 6, 11–22. Philosophical Society, 137(3), 390–398.
Collen, M. F. (1995). A history of medical informatics in Shortliffe, E. H. (1998a). The next generation internet
the United States, 1950 to 1990. Bethesda: American and health care: A civics lesson for the informatics
Medical Informatics Association. community. In Proceedings of the AMIA annual fall
Daley, K. A. (2013). A continuously learning health care symposium (pp. 8–14). Orlando.
system in the United States. In NAM perspectives. Shortliffe, E. H. (1998b). The evolution of health-care
Washington, DC: Commentary, National Academy records in the era of the internet, proceedings of
of Medicine. https://doi.org/10.31478/201307b. Medinfo 98. Seoul/Amsterdam: IOS Press.
Dick, R., & Steen, E. (Eds.). (1991). The computer – Shortliffe, E. H. (2000). Networking health: Learning from
Based patient record : An essential technology others, taking the lead. Health Affairs, 19(6), 9–22.
for HealthCare. (Rev. 1997). Washington, D.C.: Shortliffe, E. H. (2005). Strategic action in health infor-
Institute of Medicine, National Academy Press. mation technology: Why the obvious has taken so
Elstein, K. A., Shulman, L. S., & Sprafka, S. A. (1978a). long. Health Affairs, 24, 1222–1233.
Medical problem solving: An analysis of clinical rea- Shortliffe, E. H. (2010). Biomedical informatics in the
soning. Cambridge, MA: Harvard University Press. education of physicians. Journal of the American
Finnell, J. T., & Dixon, B. E. (2015). Clinical informatics Medical Association, 304(11), 1227–1228.
study guide: Text and review. London: Springer. Shortliffe, E. H., & Sondik, E. (2004). The informatics
Greenes, R. A., & Shortliffe, E. H. (1990). Medical infor- infrastructure: Anticipating its role in cancer surveil-
matics: An emerging academic discipline and insti- lance. Proceedings of the C-change summit on can-
tutional priority. Journal of the American Medical cer surveillance and information: The next decade,
Association, 263(8), 1114–1120. Phoenix, AZ.
Greenes, R. A., Barnett, G. O., Klein, S. W., Robbins, Simborg, D. W., Chadwick, M., Whiting-O’Keefe, Q. E.,
A., & Prior, R. E. (1970). Recording, retrieval, Tolchin, S. G., Kahn, S. A., & Bergan, E. S. (1983).
and review of medical data by physician-computer Local area networks and the hospital. Computers
interaction. The New England Journal of Medicine, and Biomedical Research, 16(3), 247–259.
282(6), 307–315. Tarczy-Hornoch, P., Markey, M. K., Smith, J. A., &
James, G., Witten, D., Hastie, T., & Tibshirani, R. Hiruki, T. (2007). Biomedical informatics and
(2013). An introduction to statistical learning: With genomic medicine: Research and training. Journal
applications in R, (Springer texts in statistics). of Biomedical Informatics, 40, 1–4.
New York: Springer. Yasnoff, W. A., Sweeney, L., & Shortliffe, E. H. (2013).
Krumholz, H. M., Radford, M. J., Wang, Y., Chen, J., Putting health IT on the path to success. Journal of
Heiat, A. & Marciniak, T. A. (1998). National use the American Medical Association, 309(10), 989–990.
45 2
Contents
References – 74
Biomedical Data: Their Acquisition, Storage, and Use
47 2
nnLearning Objectives ple reflection will reveal that all health care
After reading this chapter, you should know activities involve gathering, analyzing, or using
the answers to these questions: data. Data provide the basis for categorizing
55 What are clinical data? the problems a patient may be having or for
55 How are clinical data used? identifying subgroups within a population of
55 What are the advantages and disadvan- patients. They also help a physician to decide
tages of traditional paper medical records what additional information is needed and
vs. electronic health records? what actions should be taken to gain a greater
55 What is the role of the computer in data understanding of a patient’s problem or most
storage, retrieval, and interpretation? effectively to treat the problem that has been
55 What distinguishes a database from a diagnosed.
knowledge base? It is overly simplistic to view data as the
55 How are data collection and hypothesis columns of numbers or the monitored wave-
generation intimately linked in clinical forms that are a product of our technological
diagnosis? health care environment. Although laboratory
55 What are the meanings of the terms test results and other numeric data are often
prevalence, predictive value, sensitivity, invaluable, a variety of more subtle types of
and specificity? data may be just as important to the delivery
55 How are the terms related? of optimal care: the awkward glance by a
55 What are the alternatives for entry of patient who seems to be avoiding a question
data into a clinical database? during the medical interview, information
about the details of a patient’s symptoms or
about his family or economic setting, or the
2.1 What Are Clinical Data? subjective sense of disease severity that an
experienced clinician will often have within a
From earliest times, the ideas of ill health and its few moments of entering a patient’s room. No
treatment have been wedded to those of the clinician disputes the importance of such
observation and interpretation of data. Whether observations in decision making during patient
we consider the disease descriptions and guide- assessment and management, yet the precise
lines for management in early Greek literature role of these data and the corresponding deci-
or the modern physician’s use of complex labo- sion criteria are so poorly understood that it is
ratory and X-ray studies, it is clear that gather- difficult to record them in ways that convey
ing data and interpreting their meaning are their full meaning, even from one clinician to
central to the health care process. With the move another. Despite these limitations, clinicians
toward the use of clinical and genomic informa- need to share descriptive information with
tion in assessing individual patients (their risks, others. When they cannot interact directly
prognosis, and likely responses to therapy), the with one another, they often turn to the chart
sheer amounts of data that may be used in or electronic health record for communication
patient care have become huge. A textbook on purposes.
biomedical informatics will accordingly refer We consider a clinical datum to be any sin-
time and again to issues in data collection, stor- gle observation of a patient—e.g., a tempera-
age, and use. This chapter lays the foundation ture reading, a red blood cell count, a past
for this recurring set of issues that is pertinent to history of rubella, or a blood pressure reading.
all aspects of the use of information, knowl- As the blood pressure example shows, it is a
edge, and computers in biomedicine, both in the matter of perspective whether a single observa-
clinical world and in applications related to pub- tion is in fact more than one datum. A blood
lic health, biology and human genetics. pressure of 120/80 might well be recorded as a
If data are central to all health care, it is single element in a setting where knowledge
because they are crucial to the process of deci- that a patient’s blood pressure is normal is all
sion making (as described in detail in 7 Chaps. that matters. If the difference between diastolic
3 and 4 and again in 7 Chap. 26). In fact, sim- (while the heart cavities are beginning to fill)
48 E. H. Shortliffe and M. F. Chiang
and systolic (while they are contracting) blood to-minute variations may be important—e.g.,
pressures is important for decision making or the frequent blood sugar readings obtained
for analysis, however, the blood pressure read- for a patient in diabetic ketoacidosis (acid
2 ing is best viewed as two pieces of information production due to poorly controlled blood
(systolic pressure = 120 mmHg, diastolic pres- sugar levels) or the continuous measurements
sure = 80 mmHg). Human beings can glance at of mean arterial blood pressure for a patient
a written blood pressure value and easily make in cardiogenic shock (dangerously low blood
the transition between its unitary view as a sin- pressure due to failure of the heart muscle).
gle data point and the decomposed informa- It may also be important to keep a record
tion about systolic and diastolic pressures. of the circumstances under which a data point
Such dual views can be much more difficult for was obtained. For example, was the blood
computers, however, unless they are specifically pressure taken in the arm or leg? Was the
allowed for in the design of the method for patient lying or standing? Was the pressure
data storage and analysis. The idea of a data obtained just after exercise? During sleep?
model for computer-stored medical data What kind of recording device was used? Was
accordingly becomes an important issue in the the observer reliable? Such additional infor-
design of medical data systems. mation, sometimes called contexts, methods,
Clinical data may involve several different or modifiers, can be of crucial importance in
observations made concurrently, the observa- the proper interpretation of data. Two patients
tion of the same patient parameter made at with the same basic problem or symptom
several points in time, or both. Thus, a single often have markedly different explanations for
datum generally can be viewed as defined by their problem, revealed by careful assessment
five elements: of the modifiers of that problem.
1. The patient in question A related issue is the uncertainty in the val-
2. The parameter being observed (e.g., liver ues of data. It is rare that an observation—
size, urine sugar value, history of rheumat- even one by a skilled clinician—can be
ic fever, heart size on chest X-ray film) accepted with absolute certainty. Consider the
3. The value of the parameter in question (e.g., following examples:
weight is 70 kg, temperature is 98.6 °F, pro- 55 An adult patient reports a childhood ill-
fession is steel worker) ness with fevers and a red rash in addition
4. The time of the observation (e.g., 2:30 to joint swelling. Could he or she have had
A.M. on 14FEB20191) scarlet fever? The patient does not know
5. The method by which the observation was what his or her pediatrician called the dis-
made (e.g., patient report, thermometer, ease nor whether anyone thought that he
urine dipstick, laboratory instrument). or she had scarlet fever.
55 A physician listens to the heart of an asth-
Time can particularly complicate the assess- matic child and thinks that she hears a
ment and computer-based management of heart murmur—but is not certain because
data. In some settings, the date of the obser- of the patient’s loud wheezing.
vation is adequate—e.g., in outpatient clinics 55 A radiologist looking at a shadow on a
or private offices where a patient generally is chest X-ray film is not sure whether it rep-
seen infrequently and the data collected need resents overlapping blood vessels or a lung
to be identified in time with no greater accu- tumor.
racy than a calendar date. In others, minute- 55 A confused patient is able to respond to
simple questions about his or her illness,
but under the circumstances the physician
1 Note that it was the tendency to record such dates is uncertain how much of the history being
in computers as “14FEB12” that led to the reported is reliable.
end-of-century complexities that were called the
Year 2K problem. It was shortsighted to think that
it was adequate to encode the year of an event with As described in 7 Chaps. 3 and 4, there are a
only two digits. variety of possible responses to deal with
Biomedical Data: Their Acquisition, Storage, and Use
49 2
incomplete data, the uncertainty in them, and tions such as reports of specialty consultations,
in their interpretation. One technique is to col- surgical procedures, pathologic examinations
lect additional data that will either confirm or of tissues, and hospitalization summaries when
eliminate the concern raised by the initial a patient is discharged.
observation. This solution is not always appro- Some narrative data are loosely coded with
priate, however, because the costs of data col- shorthand conventions known to health per-
lection must be considered. The additional sonnel, particularly data collected during the
observation might be expensive, risky for the physical examination, in which recorded obser-
patient, or wasteful of time during which treat- vations reflect the stereotypic examination pro-
ment could have been instituted. The idea of cess taught to all practitioners. It is common,
trade-offs in data collection thus becomes for example, to find the notation “PERRLA”
extremely important in guiding health care under the eye examination in a patient’s medi-
decision making. cal record. This encoded form indicates that
the patient’s “Pupils are Equal (in size), Round,
and Reactive to Light and Accommodation
2.1.1 hat Are the Types of Clinical
W (the process of focusing on near objects).”
Data? Note that there are significant problems
associated with the use of such abbreviations.
The examples in the previous section suggest Many are not standard and can have different
that there is a broad range of data types in the meanings depending on the context in which
practice of medicine and the allied health sci- they are used. For example, “MI” can mean
ences. They range from narrative, textual data “mitral insufficiency” (leakage in one of the
to numerical measurements, genetic informa- heart’s valves) or “myocardial infarction” (the
tion, recorded signals, drawings, and photo- medical term for what is commonly called a
graphs or other images. heart attack). Many hospitals try to establish a
Narrative data account for a large compo- set of “acceptable” abbreviations with mean-
nent of the information that is gathered in the ings, but the enforcement of such standardiza-
care of patients. For example, the patient’s tion is often unsuccessful. Other hospitals
description of his or her present illness, includ- approach this challenge by not permitting use of
ing responses to focused questions from the abbreviations in the medical record, and instead
physician, generally is gathered verbally and is require use of full-length narrative descriptions.
recorded as text in the medical record. The Standard narrative expressions have often
same is true of the patient’s social and family become loose standards of communication
history, the general review of systems that is among medical personnel. Examples include
part of most evaluations of new patients, and “mild dyspnea (shortness of breath) on exer-
the clinician’s report of physical examination tion,” “pain relieved by antacids or milk,” and
findings. Such narrative data were traditionally “failure to thrive.” Such standardized expres-
handwritten by clinicians and then placed in sions are attempts to use conventional text
the patient’s medical record (. Fig. 2.1a). notation as a form of summarization for oth-
Increasingly, however, the narrative summaries erwise heterogeneous conditions that together
were dictated and then transcribed by typists characterize a simple concept about a patient.
who produced printed summaries or electronic Many data used in medicine take on discrete
copies for inclusion in paper or electronic med- numeric values. These include such parameters
ical records. Now, physicians and staff largely as laboratory tests, vital signs (such as tempera-
enter narrative text directly into electronic ture and pulse rate), and certain measurements
health records (EHRs), usually through key- taken during the physical examination. When
board, mouse-driven, or voice-driven interfaces such numerical data are interpreted, however,
(. Fig. 2.1b). Electronic narrative data often the issue of precision becomes important. Can
include not only patient histories and physical physicians distinguish reliably between a 9-cm
examinations, but also other narrative descrip- and a 10-cm liver span when they examine a
50 E. H. Shortliffe and M. F. Chiang
.. Fig. 2.1 Much of the information gathered during a physician–patient encounter is written in the medical
record. This was traditionally done using a paper notes, and now increasingly using b electronic health records
patient’s abdomen? Does it make sense to report ment instruments rather than changes in the
a serum sodium level to two-decimal-place accu- patient)?
racy? Is a 1-kg fluctuation in weight from 1 week In some fields of medicine, analog data in
to the next significant? Was the patient weighed the form of continuous signals are particular-
on the same scale both times (i.e., could the dif- ly important (see 7 Chap. 23). Perhaps the
ferent values reflect variation between measure- best-known example is an electrocardiogram
Biomedical Data: Their Acquisition, Storage, and Use
51 2
b
(ECG), a tracing of the electrical activity from taken and the rationales for those actions, for
a patient’s heart. When such data are stored in later communication to themselves and other
medical records, a graphical tracing frequent- people. A glance at a medical record will quick-
ly is included, with a written interpretation of ly reveal the wide variety of data-recording
its meaning. There are clear challenges in techniques that have evolved. The range goes
determining how such data are best managed from narrative text to commonly understood
in computer-based storage systems. shorthand notation to cryptic symbols that only
Visual images—acquired from machines specialists can understand; for example, few
or sketched by the physician—are another physicians without specialized training know
important category of data. Radiologic imag- how to interpret the data-recording conventions
es or photographs of skin lesions are obvious of an ophthalmologist (. Fig. 2.3). The nota-
examples. It has traditionally been common tions may be highly structured records with
for physicians to draw simple pictures to rep- brief text or numerical information, machine-
resent abnormalities that they have observed; generated tracings of analog signals, photo-
such drawings may serve as a basis for com- graphic images (of the patient or of radiologic
parison when they or another physician next or other studies), or drawings. This range of
see the patient. For example, a sketch is a con- data-recording conventions presents significant
cise way of conveying the location and size of challenges to the person implementing electron-
a nodule in the prostate gland (. Fig. 2.2). In ic health record systems.
electronic health record systems, these hand
drawings are increasingly being replaced in
the medical record by text-based descriptions 2.1.2 Who Collects the Data?
or photographs (Sanders et al. 2013).
As should be clear from these examples, the Health data on patients and populations are
idea of data is inextricably bound to the idea of gathered by a variety of health professionals.
data recording. Physicians and other health care Although conventional ideas of the healthcare
personnel are taught from the outset that it is team evoke images of coworkers treating ill
crucial that they do not trust their memory patients, the team has much broader responsi-
when caring for patients. They must record their bilities than treatment per se; data collection
observations, as well as the actions they have and recording are a central part of its task.
52 E. H. Shortliffe and M. F. Chiang
.. Fig. 2.2 A physician’s hand-drawn sketch of a pros- but are less common in electronic health records com-
tate nodule. Drawings may convey precise information pared to paper charts
more easily and compactly than a textual description,
.. Fig. 2.3 An ophthalmologist’s report of an eye the ophthalmologist has used. (Image courtesy of Nita
examination. Most physicians trained in other special- Valikodath, MD, with permission)
ties would have difficulty deciphering the symbols that
Physicians are key players in the process of tions and recording them for future reference.
data collection and interpretation. They con- The data that they gather contribute to nursing
verse with a patient to gather narrative descrip- care plans as well as to the assessment of
tive data on the chief complaint, past illnesses, patients by physicians and by other health care
family and social information, and the system staff. Thus, nurses’ training includes instruc-
review. They examine the patient, collecting tion in careful and accurate observation, his-
pertinent data and recording them during or at tory taking, and examination of the patient.
the end of the visit. In addition, they generally Because nurses typically spend more time with
decide what additional data to collect by order- patients than physicians do, especially in the
ing laboratory or radiologic studies and by hospital setting, nurses often build relation-
observing the patient’s response to therapeutic ships with patients that uncover information
interventions (yet another form of data that and insights that contribute to proper diagno-
contributes to patient assessment). sis, to understanding of pertinent psychosocial
In both outpatient and hospital settings, issues, or to proper planning of therapy or dis-
nurses play a central role in making observa- charge management (. Fig. 2.4). The role of
Biomedical Data: Their Acquisition, Storage, and Use
53 2
responsibilities. As these examples suggest,
many different individuals employed in health
care settings gather, record, and make use of
patient data in their work.
Finally, there are the technological devices
that generate data—laboratory instruments,
imaging machines, monitoring equipment in
intensive care units, and measurement devices
that take a single reading (such as thermome-
ters, ECG machines, sphygmomanometers for
taking blood pressure, and spirometers for
testing lung function). Sometimes such a device
produces a paper report suitable for inclusion
in a traditional medical record. Sometimes the
device indicates a result on a gauge or traces a
result that must be read by an operator and
then recorded in the patient’s chart. Sometimes
a trained specialist must interpret the output.
Increasingly, however, the devices feed their
results directly into computer equipment so
that the data can be analyzed or formatted for
electronic storage in the electronic health
.. Fig. 2.4 Nurses often develop close relationships record (see 7 Chap. 16), thereby allowing
with patients. These relationships may allow the nurse to access to information is through computer
make observations that are missed by other staff. This workstations, hand-held tablets, or even mobile
ability is just one of the ways in which nurses play a key devices.
role in data collection and recording. (Photograph cour-
tesy of Susan Ostmo, with permission)
ing the health care delivery process that were As is true for all experiments, one purpose is to
not feasible a generation ago. We will discuss learn from experience through careful observa-
these topics in more detail later in this chapter tion and recording of data. The lessons learned
2 and in 7 Chaps. 16 and 20. in a given encounter may be highly individual-
ized (e.g., the physician may learn how a spe-
cific patient tends to respond to pain or how
2.2.1 Create the Basis family interactions tend to affect the patient’s
for the Historical Record response to disease). On the other hand, the
value of some experiments may be derived only
Any student of science learns the importance of by pooling of data from many patients who
collecting and recording data meticulously when have similar problems and through the analysis
carrying out an experiment. Just as a scientific of the results of various treatment options to
laboratory notebook provides a record of pre- determine efficacy.
cisely what an investigator has done, the experi- Although laboratory research has contrib-
mental data observed, and the rationale for uted dramatically to our knowledge of human
intermediate decision points, medical records disease and treatment, it is careful observation
are intended to provide a detailed compilation and recording by skilled health care personnel
of information about individual patients: that has always been fundamental to the effec-
55 What is the patient’s history (development tive generation of new knowledge about patient
of a current illness; other diseases that coex- care. We learn from the aggregation of infor-
ist or have resolved; pertinent family, social, mation from large numbers of patients; thus,
and demographic information)? the historical record for individual patients is
55 What symptoms has the patient reported? of inestimable importance to clinical research.
When did they begin, what has seemed to
aggravate them, and what has provided
relief ? 2.2.2 Support Communication
55 What physical signs have been noted on Among Providers
examination?
55 How have signs and symptoms changed A central function of structured data collec-
over time? tion and recording in health care settings is to
55 What laboratory results have been, or are assist personnel in providing coordinated care
now, available? to a patient over time. Most patients who have
55 What radiologic and other special studies significant medical conditions are seen over
have been performed? months or years on several occasions for one or
55 What medications are being taken and are more problems that require ongoing evaluation
there any allergies? and treatment. Given the increasing numbers
55 What other interventions have been under- of elderly patients in many cultures and health
taken? care settings, the care given to a patient is less
55 What is the reasoning behind the manage- oriented to diagnosis and treatment of a single
ment decisions? disease episode and increasingly focused on
management of one or more chronic disor-
Each new patient problem and its manage- ders—possibly over many years.
ment can be viewed as a therapeutic experi- It was once common for patients to receive
ment, inherently confounded by uncertainty, essentially all their care from a single provider:
with the goal of answering three questions the family doctor who tended both children and
when the experiment is over: adults, often seeing the patient over many or all
1. What was the nature of the disease or the years of that person’s life. We tend to picture
symptom? such physicians as having especially close rela-
2. What was the treatment decision? tionships with their patients—knowing the fam-
3. What was the outcome of that treatment? ily and sharing in many of the patient’s life
events, especially in smaller communities. Such
Biomedical Data: Their Acquisition, Storage, and Use
55 2
doctors nonetheless kept records of all encoun-
ters so that they could refer to data about past
illnesses and treatments as a guide to evaluating
future care issues.
In the world of modern medicine, the
emergence of subspecialization and the
increasing provision of care by teams of health
professionals have placed new emphasis on the
central role of the medical record. Over the
past several decades, shared access to a paper
chart (. Fig. 2.5) has largely been replaced by
clinicians accessing electronic records, some-
times conferring as they look at the same com-
puter screen (. Fig. 2.6). Now the record not
only contains observations by a physician for
reference on the next visit but also serves as a
communication mechanism among physicians
and other medical personnel, such as physical
or respiratory therapists, nursing staff, radiol-
ogy technicians, social workers, or discharge
planners. In many outpatient settings, patients
receive care over time from a variety of physi- .. Fig. 2.6 Today similar communication sessions occur
around a computer screen rather than a paper chart (see
cians—colleagues covering for the primary
. Fig. 2.5). (Photograph courtesy of Susan Ostmo with
physician, or specialists to whom the patient permission)
has been referred, or a managed care organiza-
tion’s case manager. It is not uncommon to and therefore recognize the importance of the
hear complaints from patients who remember medical record in ensuring quality and conti-
the days when it was possible to receive essen- nuity of care through adequate recording of
tially all their care from a single physician the details and logic of past interventions and
whom they had come to trust and who knew ongoing treatment plans. This idea is of par-
them well. Physicians are sensitive to this issue ticular importance in a health care system in
which chronic diseases rather than care for
trauma or acute infections increasingly domi-
nate the basis for interactions between patients
and their doctors.
data therefore are important in screening for children for normal growth and development
risk factors, following patients’ risk profiles by pediatricians (. Fig. 2.7). Single data
over time, and providing a basis for specific points regarding height and weight may have
2 patient education or preventive interventions, limited use by themselves; it is the trend in
such as diet, medication, or exercise. Perhaps such data points observed over months or
the most common examples of such ongoing years that may provide the first clue to a med-
risk assessment in our society are routine mon- ical problem. It is accordingly common for
itoring for excess weight, high blood pressure, such parameters to be recorded on special
and elevated serum cholesterol levels. In these charts or forms that make the trends easy to
cases, abnormal data may be predictive of later discern at a glance. Women who want to have
symptomatic disease; optimal care requires a child often keep similar records of body
early intervention before the complications temperature. By measuring temperature daily
have an opportunity to develop fully. and recording the values on special charts,
women can identify the slight increase in tem-
perature that accompanies ovulation and thus
2.2.4 ecord Standard Preventive
R may discern the days of maximum fertility.
Measures Many physicians will ask a patient to keep
such graphical records so that they can later
The medical record also serves as a source of discuss the data with the patient and include
data on interventions that have been performed the scanned or photographed graph in the
to prevent common or serious disorders. Some- electronic record for ongoing reference. Such
times the interventions involve counseling or graphs are increasingly captured and dis-
educational programs (for example, regarding played for viewing by clinicians as a feature of
smoking cessation, measures for stopping drug a patient’s medical record.
abuse, safe sex practices, or dietary changes).
Other important preventive interventions
include immunizations: the vaccinations that 2.2.6 Provide a Legal Record
begin in early childhood and continue through-
out life, including special treatments adminis- Another use of health data, once they are
tered when a person will be at particularly high charted and analyzed, is as the foundation for
risk (e.g., injections to protect people from cer- a legal record to which the courts can refer if
tain highly communicable diseases, administered necessary. The medical record is a legal docu-
before travel to areas where such diseases are ment; the responsible individual must certify
endemic). When a patient comes to his local hos- or sign most of the clinical information that is
pital emergency room with a laceration, the phy- recorded. In addition, the chart generally
sicians routinely check for an indication of when should describe and justify both the presumed
he most recently had a tetanus immunization. diagnosis for a patient and the choice of man-
When easily accessible in the record (or from the agement.
patient), such data can prevent unnecessary We emphasized earlier the importance of
treatments (in this case, a repeat injection) that recording data; in fact, data do not exist in a
may be associated with risk or significant cost. generally useful form unless they are record-
ed. The legal system stresses this point as well.
Providers’ unsubstantiated memories of what
2.2.5 Identify Deviations they observed or why they took some action
are of little value in the courtroom. The medi-
from Expected Trends cal record is the foundation for determining
whether proper care was delivered. Thus, a
Data often are useful in medical care only
well-maintained record is a source of protec-
when viewed as part of a continuum over
tion for both patients and their physicians.
time. An example is the routine monitoring of
Biomedical Data: Their Acquisition, Storage, and Use
57 2
.. Fig. 2.7 A pediatric growth chart. Single data points tion with the National Center for Chronic Disease Preven-
would not be useful; it is the changes in values over time tion and Health Promotion (2000). 7 http://www.cdc.gov/
that indicate whether development is progressing normally. growthcharts)
(Source: National Center for Health Statistics in collabora-
58 E. H. Shortliffe and M. F. Chiang
is often focused on billing and compliance physical growth of the document and, accord-
over clinical care (7 Chaps. 16 and 31). ingly, complicated the chart’s logistical man-
agement. Furthermore, it became increasingly
2 difficult to locate specific patient data as the
2.3.2 Redundancy and Inefficiency chart succumbed to “obesity”. The predictable
result was that someone would write yet anoth-
To be able to find data quickly in the medical er redundant entry, summarizing information
record, health professionals developed a vari- that it took hours to track down – and creating
ety of techniques in paper documentation that potential sources for transcription error.
provided redundant recording to match alter- A similar inefficiency occured because of a
nate modes of access. For example, the result tension between opposing goals in the design
of a radiologic study typically was entered on a of reporting forms used by many laboratories.
standard radiology reporting form, which was Most health personnel preferred a consistent,
filed in the portion of the chart labeled “X-ray.” familiar form, often with color-coding, because
For complicated procedures, the same data it helped them to find information more quick-
often were summarized in brief notes by radi- ly (. Fig. 2.10). For example, a physician might
ologists in the narrative part of the chart, know that a urinalysis report form is printed on
which they entered at the time of studies yellow paper and records the bacteria count
because they knew that the formal report halfway down the middle column of the form.
would not make it back to the chart for 1 or This knowledge allowed the physician to work
2 days. In addition, the study results often were backward quickly in the laboratory section of
mentioned in notes written by the patient’s the chart to find the most recent urinalysis sheet
admitting and consulting physicians and by and to check at a glance the bacterial count.
the nursing staff. Although there may have The problem is that such forms typically stored
been good reasons for recording such informa- only sparse information. It was clearly subopti-
tion multiple times in different ways and in dif- mal if a rapidly growing physical chart was
ferent locations within the paper chart, the filled with sheets of paper that reported only a
combined bulk of these notes accelerated the single data element.
.. Fig. 2.10 Laboratory reporting forms present medical data in a consistent, familiar format (in this case a com-
plete blood count (CBC)). (Photograph courtesy of Jimy Chen, with permission)
Biomedical Data: Their Acquisition, Storage, and Use
61 2
2.3.3 Influence on Clinical Research not a subject of study at the time the data were
collected and prospective studies in which the
Anyone involved in a clinical research project clinical hypothesis is known in advance and the
based on retrospective data review from paper research protocol is designed specifically to col-
records can attest to the tediousness of flipping lect future data that are relevant to the question
through myriad medical records. For all the rea- under consideration (see also 7 Chaps. 15 and
sons described in 7 Chap. 1, it is arduous to sit 29). Subjects are assigned randomly to different
with stacks of patient records, extracting data study groups to help prevent researchers—who
and formatting them for structured statistical are bound to be biased, having developed the
analysis, and the process is vulnerable to tran- hypothesis—from unintentionally skewing the
scription errors. Observers often wonder how results by assigning a specific class of patients
much medical knowledge is sitting untapped in all to one group. For the same reason, to the
old paper medical records because there is no extent possible, the studies are double blind; i.e.,
easy way to analyze experience across large pop- neither the researchers nor the subjects know
ulations of patients from the past without first which treatment is being administered. Such
extracting pertinent data from those charts. blinding is of course impractical when it is
Let’s contrast such retrospective review with obvious to patients or physicians what therapy
paper and electronic medical records. Suppose, is being given (such as surgical procedures ver-
for example, that physicians on a medical con- sus drug therapy). Prospective, randomized,
sultation service notice that patients receiving a double-blind studies are considered the best
certain common oral medication for diabetes method for determining optimal management
(call it drug X) seem to be more likely to have of disease, but it is often impractical to carry
significant postoperative hypotension (low out such studies, and then methods such as ret-
blood pressure) than do surgical patients receiv- rospective chart review may be used.
ing other medications for diabetes. The doctors Returning to our example, consider the
have based this hypothesis—that drug X influ- problems in paper chart review that the research-
ences postoperative blood pressure—on only a ers used to encounter in addressing the postop-
few recent observations, however, so they decide erative hypotension question retrospectively.
to look into existing hospital records to see First, they would have to identify the charts of
whether this correlation has occurred with suf- interest: the subset of medical records dealing
ficient frequency to warrant a formal investiga- with surgical patients who are also diabetic. In a
tion. One efficient way to follow up on their hospital record room filled with thousands of
theory from existing medical data would be to charts, the task of chart selection was often
examine the hospital records of all patients who overwhelming. Medical records departments
have diabetes and also have been admitted for generally did keep indexes of diagnostic and
surgery. The task would then be to examine procedure codes cross-referenced to specific
those records (difficult and arduous with paper patients (see 7 Sect. 2.5.1). Thus, it sometimes
charts as will be discussed shortly, but subject to was possible to use such an index to find all
automated analysis in the case of EHRs) and to charts in which the discharge diagnoses includ-
note for all patients (1) whether they were taking ed diabetes and the procedure codes included
drug X when admitted and (2) whether they major surgical procedures. The researcher might
had postoperative hypotension. If the statistics then have compiled a list of patient identifica-
showed that patients receiving drug X were tion numbers and have the individual charts
more likely to have low blood pressure after sur- pulled from the file room for review.
gery than were similar diabetic patients receiving The researchers’ next task was to examine
alternate treatments, a controlled trial (prospec- each paper chart serially to find out what treat-
tive observation and data gathering) might well ment each patient was receiving for diabetes at
be appropriate. the time of the surgery and to determine
Note the distinction between retrospective whether the patient had postoperative hypo-
chart review to investigate a question that was tension. Finding such information tended to be
62 E. H. Shortliffe and M. F. Chiang
extremely time-consuming. Where should the such as legibility, accuracy, or implications for
researcher look for it? The admission drug patient management. They cannot take an
orders might have shown what the patient active role in responding appropriately to those
2 received for diabetes control, but it would also implications.
have been wise to check the medication sheets EHR systems have changed our perspec-
to see whether the therapy was also adminis- tive on what health professionals can expect
tered (as well as ordered) and the admission from the medical chart. Automated record sys-
history to see whether a routine treatment for tems introduce new opportunities for dynamic
diabetes, taken right up until the patient entered responses to the data that are recorded in them.
the hospital, was not administered during the As described in many of the chapters to follow,
inpatient stay. Information about hypotensive computational techniques for data storage,
episodes might be similarly difficult to locate. retrieval, and analysis make it feasible to devel-
The researchers might start with nursing notes op record systems that (1) monitor their con-
from the recovery room or with the anesthesi- tents and generate warnings or advice for
ologist’s datasheets from the operating room, providers based on single observations or on
but the patient might not have been hypoten- logical combinations of data; (2) provide auto-
sive until after leaving the recovery room and mated quality control, including the flagging of
returning to the ward. So the nursing notes potentially erroneous data; or (3) provide feed-
from the ward would need to be checked too, as back on patient-specific or population-based
well as vital signs sheets, physicians’ progress deviations from desirable standards.
notes, and the discharge summary.
It should be clear from this example that
retrospective paper chart review was a labori- 2.4 ew Kinds of Data
N
ous and tedious process and that people per- and the Resulting Challenges
forming it were prone to make transcription
errors and to overlook key data. EHRs offer The revolution in human genetics that emerged
an enormous opportunity (7 Chap. 16) to with the Human Genome Project in the 1990s
facilitate the chart review and clinical research already has had a profound effect on the diag-
process. They have obviated the need to nosis, prognosis, and treatment of disease
retrieve hard copy charts; instead, researchers (Vamathevan and Birney 2017). The vast
are increasingly using computer-based data amounts of data that are generated in biomed-
retrieval and analysis techniques to do most ical research (see 7 Chaps. 11 and 28), and
of the work (finding relevant patients, locat- that can be pooled from patient datasets to
ing pertinent data, and formatting the infor- support clinical research (7 Chap. 29) and
mation for statistical analyses). Researchers public health (7 Chap. 20), have created new
can use similar techniques to harness comput- opportunities as well as challenges. Researchers
er assistance with data management in pro- are finding that the amount of data that they
spective clinical trials (7 Chap. 29). must manage and assess has become so large
that they often find that they lack either the
capabilities or expertise to handle the analytics
2.3.4 he Passive Nature of Paper
T that are required. This problem, sometimes
Records dubbed the “big data” problem, has gathered
the attention of government agencies as well.2
The traditional manual system has another
limitation that would have been meaningless
until the emergence of the computer age. A 2 Big Data Senior Steering Group. The Federal Big
manual archival system is inherently passive; Data Research and Development Strategic Plan.
the charts sit waiting for something to be done Available at: 7 https://obamawhitehouse.archives.
gov/sites/default/files/microsites/ostp/NSTC/
with them. They are insensitive to the charac- bigdatardstrategicplan-nitrd_final-051916.pdf
teristics of the data recorded within their pages, (Accessed 6/28/2019).
Biomedical Data: Their Acquisition, Storage, and Use
63 2
Some suggest that the genetic material itself ences; these people question whether it is
will become our next-generation method for possible to introduce too much standardization
storing large amounts of data (Erlich and into a field that prides itself in humanism.
Zielinski 2017). Data analytics, and the man- The debate has been accentuated by the
agement of large amounts of genomic/pro- introduction of computers for data manage-
teomic or clinical/public-health data, have ment, because such machines tend to demand
accordingly become major research topics and conformity to data standards and definitions.
key opportunities for new methodology devel- Otherwise, issues of data retrieval and analysis
opment by biomedical informatics and data are confounded by discrepancies between the
scientists (Adler-Milstein and Jha 2013; meanings intended by the observers or record-
Brennan et al. 2018; Bycroft et al. 2018). ers and those intended by the individuals
The issues that arise are practical as well as retrieving information or doing data analysis.
scientifically interesting. For example, develop- What is an “upper respiratory infection”? Does
ers of EHRs have begun to grapple with ques- it include infections of the trachea or of the
tions regarding how they might store an main stem bronchi? How large does the heart
individual’s personal genome within the elec- have to be before we can refer to “cardiomega-
tronic health record. New standards will be ly”? How should we deal with the plethora of
required, and tactical questions need answer- disease names based on eponyms (e.g., Alzheim-
ing regarding, for example, whether to store an er’s disease, Hodgkin’s disease) that are not
entire genome or only those components (e.g., descriptive of the illness and may not be famil-
genetic markers) that are already reasonably iar to all practitioners? What do we mean by an
well understood (Masys et al. 2012; Haendel “acute abdomen”? Are the boundaries of the
et al. 2018). In cancer, for example, where abdomen well agreed on? What are the time
mutations in cell lines can occur, an individual constraints that correspond to “acuteness” of
may actually have many genomes represented abdominal pain? Is an “ache” a pain? What
among his or her cells. These issues will about “occasional” cramping?
undoubtedly influence the evolution of data Imprecision and the lack of a standardized
systems and EHRs, as well as the growth of vocabulary are particularly problematic when
precision medicine (see 7 Chap. 30), in the we wish to aggregate data recorded by multiple
years ahead (Relling and Evans 2015). health professionals or to analyze trends over
time. Without a controlled, predefined vocabu-
lary, data interpretation is inherently compli-
2.5 The Structure of Clinical Data cated, and the automatic summarization of
data may be impossible. For example, one phy-
Scientific disciplines generally develop a precise sician might note that a patient has “shortness
terminology or notation that is standardized of breath.” Later, another physician might note
and accepted by all workers in the field. Con- that she has “dyspnea.” Unless these terms are
sider, for example, the universal language of designated as synonyms, an automated pro-
chemistry embodied in chemical formulae, the gram will fail to indicate that the patient had
precise definitions and mathematical equations the same problem on both occasions.
used by physicists, the predicate calculus used Regardless of arguments regarding the
by logicians, or the conventions for describing “artistic” elements in medicine, the need for
circuits used by electrical engineers. Medicine is health personnel to communicate effectively is
remarkable for its failure to develop a widely clear both in acute care settings and when
accepted standardized vocabulary and nomen- patients are seen over long periods. Both high-
clature, and many observers believe that a true quality care and scientific progress depend on
“scientific” basis for the field will be impossible some standardization in terminology. Other-
until this problem is addressed (see 7 Chap. 8). wise, differences in intended meaning or in
Other people argue that common references to defining criteria will lead to miscommunication,
the “art” of medicine reflect an important dis- improper interpretation, and potentially nega-
tinction between medicine and the “hard” sci- tive consequences for the patients involved.
64 E. H. Shortliffe and M. F. Chiang
Given the lack of formal definitions for hospitalized population and the average length
many medical terms, it is remarkable that med- of stay for each disease category), for quality
ical workers communicate as well as they do. improvement, and for research. For such data
2 Only occasionally is the care for a patient clear- to be useful, the codes must be well defined as
ly compromised by miscommunication. If well as uniformly applied and accepted.
EHRs are to become dynamic and responsive The World Health Organization publishes
manipulators of patient data, however, their a diagnostic coding scheme called the Interna-
encoded logic must be able to presume a spe- tional Classification of Disease (ICD). The
cific meaning for the terms and data elements 10th revision of this standard, ICD-10-CM
entered by the observers. This point is discussed (clinical modification),3 is currently in use in
in greater detail in 7 Chap. 8, which deals in much of the world (see 7 Chap. 8). ICD-10-
part with the multiple efforts to develop health- CM is used by all nonmilitary hospitals in the
care computing standards, including a shared, United States for discharge coding, and must be
controlled terminology for biomedicine. reported on the bills submitted to most insur-
ance companies (. Fig. 2.11). Pathologists
have developed another widely used diagnostic
2.5.1 Coding Systems coding scheme; originally known as System-
atized Nomenclature of Pathology (SNOP), it
We are used to seeing figures regarding the was expanded to the Systematized Nomencla-
growing incidences of certain types of tumors, ture of Medicine (SNOMED) and then merged
deaths from influenza during the winter with the Read Clinical Terms from Great Brit-
months, and similar health statistics that we ain to become SNOMED-CT (Stearns et al.
tend to take for granted. How are such data 2001; Lee et al. 2014). In recent years, support
accumulated? Their role in health planning and for SNOMED-CT was assumed by the Interna-
health care financing is clear, but electronic tional Health Terminology Standards Develop-
health records provide the infrastructure for ment Organization, based in Copenhagen, now
aggregating individual patient data to learn renamed SNOMED International and relocat-
more about the health status of the populations ed to London.4 Another coding scheme, devel-
in various communities (see 7 Chap. 20). oped by the American Medical Association, is
Because of the needs to know about health the Current Procedural Terminology (CPT)
trends for populations and to recognize epidem- (Hirsch et al. 2015). It is similarly widely used in
ics in their early stages, there are various health- producing bills for services rendered to patients.
reporting requirements for hospitals (as well as More details on such schemes are provided in
other public organizations) and practitioners. 7 Chap. 8. What warrants emphasis here, how-
For example, cases of gonorrhea, syphilis, and ever, is the motivation for the codes’ develop-
tuberculosis generally must be reported to local ment: health care personnel need standardized
public-health organizations, which code the terms that can support pooling of data for anal-
data to allow trend analyses over time. The Cen- ysis and can provide criteria for determining
ters for Disease Control and Prevention in charges for individual patients.
Atlanta (CDC) then pool regional data and The historical roots of a coding system
report national as well as local trends in disease reveal themselves as limitations or idiosyncra-
incidence, bacterial-resistance patterns, etc. sies when the system is applied in more general
Another kind of reporting involves the cod- clinical settings. For example, ICD-10-CM was
ing of all discharge diagnoses for hospitalized derived from a classification scheme developed
patients, plus coding of certain procedures (e.g., for epidemiologic reporting. Consequently, it
type of surgery) that were performed during the has over 60 separate codes for describing tuber-
hospital stay. Such codes are reported to state
and federal health-planning and analysis agen-
cies and also are used internally at the institu- 3 7 http://www.icd10data.com/ (Accessed
tion for case-mix analysis (determining the 11/1/2019).
relative frequencies of various disorders in the 4 7 http://snomed.org/ (Accessed 5/6/2019).
Biomedical Data: Their Acquisition, Storage, and Use
65 2
J45 Asthma
Includes: allergic (predominantly) asthma, allergic bronchitis NOS, allergic rhinitis with asthma, atopic asthma,
extrinsic allergic asthma, hay fever with asthma, idiosyncratic asthma, intrinsic nonallergic asthma, nonallergic
asthma
Use additional code to identify: exposure to environmental tobacco smoke (Z77.22), exposure to tobacco smoke
in the perinatal period (P96.81), history of tobacco use (Z87.891), occupational exposure to environmental
tobacco smoke (Z57.31), tobacco dependence (F17.-), tobacco use (Z72.0)
Excludes: detergent asthma (J69.8), eosinophilic asthma (J82), lung diseases due to external agents (J60-J70),
miner's asthma (J60), wheezing NOS (R06.2), wood asthma (J67.8), asthma with chronic obstructive pulmonary
disease (J44.9), chronic asthmatic (obstructive) bronchitis (J44.9), chronic obstructive asthma (J44.9)
.. Fig. 2.11 The subset of disease categories for asthma Human Services, 7 https://www.cms.gov/Medicare/Cod-
taken from ICD-10-CM. (Source: Centers for Medicare ing/ICD10/2018-ICD-10-CM-and-GEMs.html, accessed
and Medicaid Services, US Department of Health and June 28, 2019)
culosis infections. SNOMED versions have (person who studies blood diseases) may want
long permitted coding of pathologic findings in to distinguish among a variety of hemoglobin-
exquisite detail but only in later years began to opathies (disorders of the structure and func-
introduce codes for expressing the dimensions tion of hemoglobin) lumped under a single
of a patient’s functional status. In a particular code in ICD-10-CM. On the other hand,
clinical setting, none of the common coding another practitioner may prefer to aggregate
schemes is likely to be completely satisfactory. many individual codes—e.g., those for active
In some cases, the granularity of the code will tuberculosis—into a single category to simplify
be too coarse; on the one hand, a hematologist the coding and retrieval of data.
66 E. H. Shortliffe and M. F. Chiang
Such schemes cannot be effective unless which may reflect the experience or biases of
health care providers accept them. There is an people who interpret the primary data and the
inherent tension between the need for a coding resulting information.
2 system that is general enough to cover many dif- The observation that patient Brown has a
ferent patients and the need for precise and blood pressure of 180/110 is a datum, as is the
unique terms that accurately apply to a spe- report that the patient has had a myocardial
cific patient and do not unduly constrain physi- infarction (heart attack). When researchers
cians’ attempts to describe what they observe. pool such data, creating information, subse-
Yet if physicians view the EHR as a blank sheet quent analysis may determine that patients
of paper on which any unstructured informa- with high blood pressure are more likely to
tion can be written, the data they record will have heart attacks than are patients with nor-
be unsuitable for dynamic processing, clinical mal or low blood pressure. This analysis of
research, and health planning. The challenge is organized data (information) has produced a
to learn how to meet all these needs. Researchers piece of knowledge about the world. A physi-
at many institutions worked for over two decades cian’s belief that prescribing dietary restriction
to develop a unified medical language system of salt is unlikely to be effective in controlling
(UMLS), a common structure that ties together high blood pressure in patients of low econom-
the various vocabularies that have been created. ic standing (because the latter are less likely to
At the same time, the developers of specific ter- be able to afford special low-salt foods) is an
minologies are continually working to refine additional personal piece of knowledge—a heu-
and expand their independent coding schemes ristic that guides physicians in their decision
(Humphreys et al. 1998) (see 7 Chap. 8). making. Note that the appropriate interpreta-
tion of these definitions depends on the con-
text. Knowledge at one level of abstraction
2.5.2 The Data-to-Knowledge may be considered data at higher levels. A
Spectrum blood pressure of 180/110 mmHg is a raw piece
of data; the statement that the patient has
A central focus in biomedical informatics is the hypertension is an interpretation of several
information base that constitutes the “substance such data and thus represents a higher level of
of medicine.” Workers in the field have tried to information. As input to a diagnostic decision
clarify the distinctions among three terms fre- aid, however, the presence or absence of hyper-
quently used to describe the content of com- tension may be requested, in which case the
puter-based systems: data, information, and presence of hypertension is treated as a data
knowledge (Blum 1986; Bernstam et al. 2010). item.
These terms are often used interchangeably. In A database is a collection of individual
this volume, we shall refer to a datum as a single observations without any summarizing analy-
observational point that characterizes a rela- sis. An EHR system is thus primarily viewed
tionship. It generally can be regarded as the as a database—the place where patient data
value of a specific parameter for a particular are stored. When properly collated and pooled
object (e.g., a patient) at a given point in time. with other data, these elements in the EHR
The term information refers to analyzed data provide information about the patient. A
that have been suitably curated and organized so knowledge base, on the other hand, is a collec-
that they have meaning. Data do not constitute tion of facts, heuristics, and models that can
information until they have been organized in be used for problem solving and analysis of
some way, e.g., for analysis or display. Knowledge, organized data (information). If the knowl-
then, is derived through the formal or informal edge base provides sufficient structure, includ-
analysis (or interpretation) of information that ing semantic links among knowledge items,
was in turn derived from data. Thus, knowledge the computer itself may be able to apply that
includes the results of formal studies and also knowledge as an aid to case-based problem
common sense facts, assumptions, heuristics solving. Many decision-support systems have
(strategic rules of thumb), and models—any of been called knowledge-based systems, reflect-
Biomedical Data: Their Acquisition, Storage, and Use
67 2
ing this distinction between knowledge bases What do we mean by selectivity in data col-
and databases (see 7 Chap. 26). lection and recording? It is precisely this pro-
cess that often is viewed as a central part of the
“art” of medicine, an element that accounts for
2.6 trategies of Clinical Data
S individual styles and the sometimes marked
Selection and Use distinctions among clinicians. As is discussed
with numerous clinical examples in 7 Chaps. 3
It is illusory to conceive of a “complete clinical and 4, the idea of selectivity implies an ongoing
data set.” All medical databases, and medical decision-making process that guides data col-
records, are necessarily incomplete because lection and interpretation. Attempts to under-
they reflect the selective collection and record- stand how expert clinicians internalize this
ing of data by the health care personnel respon- process, and to formalize the ideas so that they
sible for the patient. There can be marked can better be taught and explained, are central
interpersonal differences in both style and in biomedical informatics research. Improved
problem solving that account for variations in guidelines for such decision making, derived
the way practitioners collect and record data from research activities in biomedical infor-
for the same patient under the same circum- matics, not only are enhancing the teaching
stances. Such variations do not necessarily and practice of medicine (Shortliffe 2010) but
reflect good practices, however, and much of also are providing insights that suggest meth-
medical education is directed at helping physi- ods for developing computer-based decision-
cians and other health professionals to learn support tools.
what observations to make, how to make them
(generally an issue of technique), how to inter-
pret them, and how to decide whether they 2.6.1 The Hypothetico-Deductive
warrant formal recording. Approach
An example of this phenomenon is the dif-
ference between the first medical history, Studies of clinical decision makers have shown
physical examination, and summarizing report that strategies for data collection and interpre-
developed by a medical student and the similar tation may be imbedded in an iterative process
process undertaken by a seasoned clinician known as the hypothetico-deductive approach
examining the same patient. Medical students (Elstein et al. 1978; Kassirer and Gorry 1978).
tend to work from comprehensive mental out- As medical students learn this process, their
lines of questions to ask, physical tests to per- data collection becomes more focused and
form, and additional data to collect. Because efficient, and their medical records become
they have not developed skills of selectivity, the more compact. The central idea is one of
process of taking a medical history and per- sequential, staged data collection, followed by
forming a physical examination may take more data interpretation and the generation of
than 1 h, after which students develop extensive hypotheses, leading to hypothesis-directed
reports of what they observed and how they selection of the next most appropriate data to
have interpreted their observations. It clearly be collected. As data are collected at each
would be impractical, inefficient, and inappro- stage, they are added to the growing database
priate for physicians in practice to spend this of observations and are used to reformulate or
amount of time assessing every new patient. refine the active hypotheses. This process is
Thus, part of the challenge for the neophyte is to iterated until one hypothesis reaches a thresh-
learn how to ask only the questions that are nec- old level of certainty (e.g., it is proved to be
essary, to perform only the examination compo- true, or at least the uncertainty is reduced to a
nents that are required, and to record only those satisfactory level). At that point, a manage-
data that will be pertinent in justifying the ongo- ment, disposition, or therapeutic decision can
ing diagnostic approach and in guiding the be made.
future management of the patient. The diagram in . Fig. 2.12 clarifies this
process. As is shown, data collection begins
68 E. H. Shortliffe and M. F. Chiang
Patient presents
with a problem Initial hypotheses
ID, CC, HPI PE
2 Ask questions
More questions
Examine
patient
HPI, PMH, FH,
Patient is better;
Social, ROS
no further care
required Patient
dies Refine hypotheses Laboratory
tests
Observe
results
Chronic
disease Radiologic
Treat patient ECG etc. studies
Select most
accordingly likely diagnosis
.. Fig. 2.12 A schematic view of the hypothetico- for full discussion. ID patient identification, CC chief
deductive approach. The process of medical data collec- complaint, HPI history of present illness, PMH past
tion and treatment is intimately tied to an ongoing medical history, FH family history, Social social history,
process of hypothesis generation and refinement. See text ROS review of systems, PE physical examination
when the patient presents to the physician tinent. Human beings use heuristics all the
with some issue (a symptom or disease, or time in their decision making because it often
perhaps the need for routine care). The physi- is impractical or impossible to use an exhaus-
cian generally responds with a few questions tive problem-solving approach. A common
that allow one to focus rapidly on the nature example of heuristic problem solving is the
of the problem. In the written report, the data playing of a complex game such as chess.
collected with these initial questions typically Because it would require an enormous amount
are recorded as the patient identification, of time to define all the possible moves and
chief complaint, and initial portion of the his- countermoves that could ensue from a given
tory of the present illness. Studies have shown board position, expert chess players develop
that an experienced physician will have an ini- personal heuristics for assessing the game at
tial set of hypotheses (theories) in mind after any point and then selecting a strategy for
hearing the patient’s response to the first six how best to proceed. Differences among such
or seven questions (Elstein et al. 1978). These heuristics account in part for variations in
hypotheses then serve as the basis for selecting observed expertise.
additional questions. As shown in . Fig. 2.12, Physicians have developed safety mea-
answers to these additional questions allow sures, however, to help them to avoid missing
the physician to refine hypotheses about the important issues that they might not discov-
source of the patient’s problem. Physicians er when collecting data in a hypothesis-
refer to the set of active hypotheses as the dif- directed fashion when taking the history of
ferential diagnosis for a patient; the differen- a patient’s present illness (Pauker et al.
tial diagnosis comprises the set of possible 1976). These measures tend to be focused in
diagnoses among which the physician must four general categories of questions that fol-
distinguish to determine how best to adminis- low the collection of information about the
ter treatment. chief complaint: past medical history, family
Note that the question selection process is history, social history, and a brief review of
inherently heuristic; e.g., it is personalized and systems in which the physician asks some
efficient, but it is not guaranteed to collect general questions about the state of health
every piece of information that might be per- of each of the major organ systems in the
Biomedical Data: Their Acquisition, Storage, and Use
69 2
body. Occasionally, the physician discovers is observed. Note data collected to measure
entirely new problems or finds important response to treatment may themselves be used to
information that modifies the hypothesis list synthesize information that affects the hypothe-
or modulates the treatment options available ses about a patient’s illness. If patients do not
(e.g., if the patient reports a serious past respond to treatment, it may mean that their dis-
drug reaction or allergy). ease is resistant to that therapy and that their
When physicians have finished asking ques- physicians should try an alternate approach, or
tions, the refined hypothesis list (which may it may mean that the initial diagnosis was incor-
already be narrowed to a single diagnosis) then rect and that physicians should consider alter-
serves as the basis for a focused physical exam- nate explanations for the patient’s problem.
ination. By this time, physicians may well have The patient may remain in a cycle of treat-
expectations of what they will find on examina- ment and observation for a long time, as
tion or may have specific tests in mind that will shown in . Fig. 2.12. This long cycle reflects
help them to distinguish among still active the nature of chronic-disease management—
hypotheses about diseases based on the ques- an aspect of medical care that is accounting
tions that they have asked. Once again, as in for an increasing proportion of the health
the question-asking process, focused hypothe- care community’s work (and an increasing
sis-directed examination is augmented with proportion of health care cost). Alternatively,
general tests that occasionally turn up new the patient may recover and no longer need
abnormalities and generate hypotheses that the therapy, or he or she may die. Although the
physician did not expect on the basis of the process outlined in . Fig. 2.12 is oversimpli-
medical history alone. In addition, unexplained fied in many regards, it is generally applicable
findings on examination may raise issues that to the process of data collection, diagnosis,
require additional history taking. Thus, the and treatment in most areas of medicine.
asking of questions generally is partially inte- Note that the hypothesis-directed process
grated with the examination process. of data collection, diagnosis, and treatment is
When physicians have completed the physi- inherently knowledge-based. It is dependent
cal examination, their refined hypothesis list not only on a significant fact base that permits
may be narrowed sufficiently for them to under- proper interpretation of data and selection of
take specific treatment. Additional data gather- appropriate follow-up questions and tests but
ing may still be necessary, however. Such testing also on the effective use of heuristic tech-
is once again guided by the current hypotheses. niques that characterize individual expertise.
The options available include laboratory tests Another important issue, addressed in
(of blood, urine, other body fluids, or biopsy 7 Chap. 3, is the need for physicians to balance
specimens), radiologic studies (X-ray examina- financial costs and health risks of data collec-
tions, nuclear-imaging scans, computed tomog- tion against the perceived benefits to be gained
raphy (CT) studies, magnetic resonance scans, when those data become available. It costs
sonograms, or any of a number of other imag- nothing but time to examine the patient at the
ing modalities), and other specialized tests bedside or to ask an additional question, but if
(electrocardiograms (ECGs), electroencephalo- the data being considered require, for example,
grams, nerve conduction studies, and many oth- X-ray exposure, coronary angiography, or a
ers), as well as returning to the patient to ask CT scan of the head (all of which have associ-
further questions or perform additional physi- ated risks and costs), then it may be preferable
cal examination. As the results of such studies to proceed with treatment in the absence of full
become available, physicians constantly revise information. Differences in the assessment of
and refine their hypothesis list. cost-benefit trade-offs in data collection, and
Ultimately, physicians are sufficiently certain variations among individuals in their willing-
about the source of a patient’s problem to be ness to make decisions under uncertainty, often
able to develop a specific management plan. account for differences of opinion among col-
Treatments are administered, and the patient laborating physicians.
70 E. H. Shortliffe and M. F. Chiang
( sensitivity )( prevalence )
PV + =
( sensitivity )( prevalence ) + (1 - specificity ) (1 - prevalence )
There is a similar formula for defining PV− in a rule for combining probabilistic data that is
terms of sensitivity, specificity, and prevalence. generally attributed to the work of Reverend
Both formulae can be derived from simple prob- Thomas Bayes in the 1700s. Bayes’ theorem is
ability theory. Note that positive tests with high discussed in greater detail in 7 Chap. 3.
sensitivity and specificity may still lead to a low
post-test probability of the disease (PV+) if the
prevalence of that disease is low. You should 2.6.3 Methods for Selecting
substitute values in the PV+ formula to convince Questions and Comparing
yourself that this assertion is true. It is this rela- Tests
tionship that tends to be poorly understood by
practitioners and that often is viewed as counter- We have described the process of hypothesis-
intuitive (which shows that your intuition can directed sequential data collection and have
misguide you!). Note also (by substitution into asked how an observation might evoke or
the formula) that test sensitivity and disease refine the physician’s hypotheses about what
prevalence can be ignored only when a test is abnormalities account for the patient’s illness.
pathognomonic (i.e., when its specificity is 100%, The complementary question is: Given a set of
which mandates that PV+ be 100%). The PV+ current hypotheses, how does the physician
formula is one of many forms of Bayes’ theorem, decide what additional data should be collect-
72 E. H. Shortliffe and M. F. Chiang
ed? This question also has been analyzed at data into the electronic health record) to reduce
length (Elstein et al. 1978; Arocha et al. 2005) the data entry burden on physicians while they
and is pertinent for computer programs that interact with patients.
2 gather data efficiently to assist clinicians with In some applications, data are entered auto-
diagnosis or with therapeutic decision making matically into the computer by the device that
(see 7 Chap. 26). Because understanding issues measures or collects them. For example, moni-
of test selection and data interpretation is cru- tors in intensive care or coronary care units,
cial to understanding medical data and their pulmonary function or ECG machines, and
uses, we devote 7 Chap. 3 to these and related measurement equipment in the clinical chemis-
issues of medical decision making. In 7 Sect. try laboratory can interface directly with a
3.6, for example, we discuss the use of decision- computer in which a database is stored. Certain
analytic techniques in deciding whether to treat data can be entered directly by patients; there
a patient on the basis of available information are systems, for example, that take the patient’s
or to perform additional diagnostic tests. history by presenting on a computer screen or
tablet multiple-choice questions that follow a
branching logic. The patient’s responses to the
2.7 he Computer and Collection
T questions are used to generate electronic or
hard copy reports for physicians and also may
of Medical Data
be stored directly in a computer database for
Although this chapter has not directly discussed subsequent use in other settings.
computer systems, the role of the computer in When physicians or other health personnel
medical data storage, retrieval, and interpreta- do use the machine themselves, specialized devic-
tion should be clear. Much of the rest of this es often allow rapid and intuitive operator–
book deals with specific applications in which machine interaction. Most of these devices use a
the computer’s primary role is data manage- variant of the “point-and-select” approach—e.g.,
ment. One question is pertinent to all such appli- touch-sensitive computer screens, mouse-point-
cations: What are the best approaches for getting ing devices, and increasingly the clinician’s finger
data into the computer in the first place? on a mobile tablet or smart phone (see 7 Chaps.
5 and 6). When conventional computer worksta-
The need for data entry by physicians has
tions are used, specialized keypads can be help-
posed a problem for medical-computing sys-
ful. Designers frequently permit logical selection
tems since the earliest days of the field. Awkward
of items from menus displayed on the screen so
or nonintuitive interactions at computing devic-
that the user does not need to learn a set of spe-
es—particularly ones requiring keyboard typing
cialized commands to enter or review data. There
or confusing movement through multiple dis-
were clear improvements when handheld tablets
play screens by the physician—have perhaps
done more to frustrate clinicians than have any using pen-based or finger-based mechanisms for
other factor. data entry were introduced. With ubiquitous
wireless data services, such devices are allowing
A variety of approaches have been used to
clinicians to maintain normal mobility (in and
try to finesse this problem. One is to design sys-
out of examining rooms or inpatient rooms)
tems such that clerical staff can do essentially all
the data entry and much of the data retrieval as while accessing and entering data that are perti-
well. Many clinical research systems (see nent to a patient’s care.
7 Chap. 29) have taken this approach. Physicians These issues arise in essentially all applica-
tion areas, and, because they can be crucial to
may be asked to fill out structured paper data-
the successful implementation and use of a
sheets, or such sheets may be filled out by data
system, they warrant particular attention in
abstractors who review patient charts, but the
system design. As more physicians are com-
actual entry of data into the database is done by
fortable with computers in daily life, they will
paid transcriptionists. Other physicians have
likely find the use of computers in their prac-
adopted “scribes” (staff whose role is to follow
tice less of a hindrance. We encourage you to
physicians in examination rooms and to enter
consider human–computer interaction, and
Biomedical Data: Their Acquisition, Storage, and Use
73 2
the cognitive issues that arise in dealing with Klasnja, P., & Pratt, W. (2012). Healthcare in the
computer systems (see 7 Chap. 4), as you pocket: Mapping the space of mobile-phone
learn about the application areas and the spe- health interventions. Journal of Biomedical
cific systems described in later chapters. Informatics, 45(1), 184–198. This review arti-
cle describes the multiple ways in which both
nnSuggested Readings patients and providers are being empowered
Adler-Milstein, J., Zhao, W., Willard-Grace, through the introduction of affordable mobile
R., Knox, M., & Grumbach, K. (2020). technologies that manage data and apply
Electronic health records and burnout: Time knowledge to generate advice.
spent on the electronic health record after Steinhubl, S. R., Muse, E. D., & Topol, E. J.
hours and message volume associated with (2015). The emerging field of mobile health.
exhaustion but not with cynicism among pri- Science Translational Medicine, 7(283),
mary care clinicians. Journal of the American 283rv3. The authors discuss the potential for
Medical Informatics Association. https://doi. mobile health (mHealth) to impact the deliv-
org/10.1093/jamia/ocz220. This paper exam- ery and quality of health care delivery and
ines the correlation between electronic health clinical research on a large scale. This paper
record use and clinician burnout, and con- includes a discuss of challenges to the field, as
cludes that two specific EHR usage measures well as efforts to address those challenges.
(EHR time after hours and message volume) Vamathevan, J., & Birney, E. (2017). A review of
were associated with exhaustion. recent advances in translational bioinformat-
Arocha, J. F., Wang, D., & Patel, V. L. (2005). ics: Bridges from biology to medicine.
Identifying reasoning strategies in medical Yearbook of Medical Informatics, 26(1), 178–
decision making: A methodological guide. 187. This articles reviews the latest trends and
Journal of Biomedical Informatics, 38(2), major developments in translational bioinfor-
154–171. This paper illustrates the role of the- matics. This includes work applying findings
ory-driven psychological research and cogni- from national genome sequencing initiatives
tive evaluation as they relate to medical to health care delivery. There is a discussion of
decision making and the interpretation of current challenges and emerging technologies
clinical data. See also Chap. 4. that bridge research with clinical care. See also
Bernstam, E. V., Smith, J. W., & Johnson, T. R. Chap. 28.
(2010). What is biomedical informatics?
Journal of Biomedical Informatics, 43(1), ??Questions for Discussion
104–110. The authors discuss the transforma- 1. You check your pulse and discover that
tion of data into information and knowledge, your heart rate is 100 beats per minute. Is
delineating the ways in which this focus lies at this rate normal or abnormal? What
the heart of the field of biomedical informat- additional information would you use in
ics. making this judgment? How does the
Brennan, P. F., Chiang, M. F., & Ohno-Machado, context in which data are collected influ-
L. (2018). Biomedical informatics and data ence the interpretation of those data?
science: Evolving fields with significant over- 2. Given the imprecision of many medical
lap. Journal of the American Medical terms, why do you think that serious
Informatics Association, 25(1), 2–3. This edi- instances of miscommunication among
torial introduces a special issue of the Journal health care professionals are not more
of the American Medical Informatics common? Why is greater standardization
Association, in which the rapidly evolving of terminology necessary if computers
field of data science is the focus. There are 8 rather than humans are to manipulate
papers in this issue that involve applications patient data?
such as secondary use of EHR data, reposito- 3. Based on the discussion of coding schemes
ries of data, and standardization of data rep- for representing clinical information, dis-
resentation. cuss three challenges you foresee in
74 E. H. Shortliffe and M. F. Chiang
Contents
References – 119
Biomedical Decision Making: Probabilistic Clinical Reasoning
79 3
nnLearning Objectives tween information needs and system design
After reading this chapter, you should know and implementation.
the answers to these questions: The material in this chapter is presented in
55 How is the concept of probability useful the context of the decisions made by an indi-
for understanding test results and for vidual clinician. The concepts, however, are
making medical decisions that involve more broadly applicable. Sensitivity and spec-
uncertainty? ificity are important parameters of laboratory
55 How can we characterize the ability of a systems that flag abnormal test results, of pa-
test to discriminate between disease and tient monitoring systems (7 Chap. 21), and of
health? information-retrieval systems (7 Chap. 23).
55 What information do we need to inter- An understanding of what probability is and
pret test results accurately? of how to adjust probabilities after the acqui-
55 What is expected-value decision making? sition of new information is a foundation for
How can this methodology help us to un- our study of clinical decision-support systems
derstand particular medical problems? (7 Chap. 24). The importance of probability
55 What are utilities, and how can we use in medical decision making was noted as long
them to represent patients’ preferences? ago as 1922:
55 What is a sensitivity analysis? How can
we use it to examine the robustness of a
»» [G]ood medicine does not consist in the indis-
criminate application of laboratory examina-
decision and to identify the important tions to a patient, but rather in having so clear
variables in a decision? a comprehension of the probabilities and pos-
sibilities of a case as to know what tests may be
55 What are influence diagrams? How do
expected to give information of value (Peabody
they differ from decision trees? 1922).
►►Example 3.1
3.1 he Nature of Clinical
T
Decisions: Uncertainty and the You are the director of a blood bank. All poten-
tial blood donors are tested to ensure that they
Process of Diagnosis
are not infected with the human immunodeficien-
cy virus (HIV), the causative agent of acquired
Because clinical data are imperfect and out-
immunodeficiency syndrome (AIDS). You ask
comes of treatment are uncertain, health profes-
whether use of the polymerase chain reaction
sionals often are faced with difficult choices. In
(PCR), a gene-amplification technique that can
this chapter, we introduce probabilistic medical
diagnose HIV, would be useful to identify people
reasoning, an approach that can help health care
who have HIV. The PCR test is positive 98% of
providers to deal with the uncertainty inherent
the time when antibody is present, and negative
in many medical decisions. Medical decisions
99% of the time antibody is absent.1 ◄
are made by a variety of methods; our approach
is neither necessary nor appropriate for all deci-
If the test is positive, what is the likelihood that
sions. Throughout the chapter, we provide sim-
a donor actually has HIV? If the test is negative,
ple clinical examples that illustrate a broad range
how sure can you be that the person does not
of problems for which probabilistic medical rea-
have HIV? On an intuitive level, these questions
soning does provide valuable insight.
As discussed in 7 Chap. 2, medical practice
is medical decision making. In this chapter, we
look at the process of medical decision mak-
ing. Together, 7 Chaps. 2 and 3 lay the ground- 1 The test sensitivity and specificity used in
work for the rest of the book. In the remaining 7 Example 3.1 are consistent with the reported
values of the sensitivity and specificity of the PCR
chapters, we discuss ways that computers can test for diagnosis of HIV early in its development
help clinicians with the decision-making pro- (Owens et al. 1996b); the test now has higher
cess, and we emphasize the relationship be- sensitivity and specificity.
80 D. K. Owens et al.
do not seem particularly difficult to answer. The HIV—a mistake with profound emotional and
test appears accurate, and we would expect that, social consequences. In 7 Example 3.2, the de-
if the test is positive, the donated blood speci- cision-making skill of the clinician will affect a
men is likely to contain the HIV. Thus, we are patient’s quality and length of life. Similar situ-
surprised to find that, if only one in 1000 donors ations are commonplace in medicine. Our goal
actually is infected, the test is more often mis- in this chapter is to show how the use of prob-
3 taken than it is correct. In fact, of 100 donors ability and decision analysis can help to make
with a positive test, fewer than 10 would be in- clear the best course of action.
fected. There would be ten wrong answers for Decision making is one of the quintessen-
each correct result. How are we to understand tial activities of the healthcare professional.
this result? Before we try to find an answer, let us Some decisions are made on the basis of de-
consider a related example. ductive reasoning or of physiological princi-
ples. Many decisions, however, are made on the
►►Example 3.2 basis of knowledge that has been gained
Ms. Kamala is a 66-year-old woman with coro- through collective experience: the clinician of-
nary artery disease (narrowing or blockage of the ten must rely on empirical knowledge of asso-
blood vessels that supply the heart tissue). When ciations between symptoms and disease to
the heart muscle does not receive enough oxygen evaluate a problem. A decision that is based on
(hypoxia) because blood cannot reach it, the pa- these usually imperfect associations will be, to
tient often experiences chest pain (angina). Ms. some degree, uncertain. In 7 Sects. 3.1.1, 3.1.2
Kamala has twice undergone coronary artery by- and 3.1.3, we examine decisions made under
pass graft (CABG) surgery, a procedure in which uncertainty and present an overview of the di-
new vessels, often taken from the leg, are grafted agnostic process. As Smith (1985, p. 3) said:
onto the old ones such that blood is shunted “Medical decisions based on probabilities are
past the blocked region. Unfortunately, she has necessary but also perilous. Even the most as-
again begun to have chest pain, which becomes tute physician will occasionally be wrong.”
progressively more severe, despite medication. If
the heart muscle is deprived of oxygen, the result
can be a heart attack (myocardial infarction), in 3.1.1 ecision Making Under
D
which a section of the muscle dies. ◄ Uncertainty
requires more oxygen) during exercise, many explore methods used to estimate pretest proba-
heart conditions are evident only when the pa- bility accurately in 7 Sect. 3.2.
tient is physically stressed. Mr. Smith’s results After the pretest probability of disease has
show abnormal changes in the ECG during been estimated, the second stage of the diagnos-
exercise—a sign of heart disease. ◄ tic process involves gathering more information,
often by performing a diagnostic test. The clini-
3 How would the clinician evaluate this patient? cian in 7 Example 3.4 ordered a test to reduce
The clinician would first talk to the patient about the uncertainty about the diagnosis of heart dis-
the quality, duration, and severity of his or her ease. The positive test result supports the diag-
pain. Traditionally, the clinician would then de- nosis of heart disease, and this reduction in
cide what to do next based on his or her intuition uncertainty is shown in . Fig. 3.1a. Although
about the etiology (cause) of the chest pain. Our the clinician in 7 Example 3.4 chose the exercise
approach is to ask the clinician to make his or stress test, there are many tests available to diag-
her initial intuition explicit by estimating the nose heart disease, and the clinician would like
pretest probability of disease. The clinician in to know which test he or she should order next.
this example, based on what he or she knows Some tests reduce uncertainty more than do
from talking with the patient, might assess the others (see . Fig. 3.1b), but may cost more. The
pretest or prior probability of heart disease as more a test reduces uncertainty, the more useful
0.5 (50% chance or 1:1 odds; see 7 Sect. 3.2). We it is. In 7 Sect. 3.3, we explore ways to measure
a
Pretest Post-test
probability probability
Perform test
Probability of disease
b
Pretest Post-test probability
probability after test 2
Perform test 2
Post-test probability
after test 1
Perform test 1
.. Fig. 3.1 The effect of test results on the probability ence of disease (increases the probability of disease)
of disease. a A positive test result increases the probabil- more than test 1 does
ity of disease. b Test 2 reduces uncertainty about pres-
Biomedical Decision Making: Probabilistic Clinical Reasoning
83 3
how well a test reduces uncertainty, expanding The probability of event A and event B occur-
the concepts of test sensitivity and specificity ring together is denoted by p[A&B] or by p[A,B].
first introduced in 7 Chap. 2. Events A and B are considered independent
Given new information provided by a test, if the occurrence of one does not influence the
the third step is to update the initial probability probability of the occurrence of the other.
estimate. The clinician in 7 Example 3.4 must The probability of two independent events A
ask: “What is the probability of disease given and B both occurring is given by the product
the abnormal stress test?” The clinician wants to of the individual probabilities:
know the posterior probability, or post-test prob-
p [ A,B] = p [ A ] ´ p [ B].
ability, of disease (see . Fig. 3.1a). In 7 Sect.
3.4, we reexamine Bayes’ theorem, introduced Thus, the probability of heads on two consecu-
in 7 Chap. 2, and we discuss its use for calculat- tive coin tosses is 0.5 × 0.5 = 0.25. (Regardless
ing the post-test probability of disease. As we of the outcome of the first toss, the probability
noted, to calculate post-test probability, we of heads on the second toss is 0.5).
must know the pretest probability, as well as the The probability that event A will occur
sensitivity and specificity, of the test.3 given that event B is known to occur is called
the conditional probability of event A given
event B, denoted by p[A|B] and read as “the
3.2 Probability Assessment: probability of A given B.” Thus a post-test
Methods to Assess Pretest probability is a conditional probability predi-
Probability cated on the test or finding. For example, if
30% of patients who have a swollen leg have a
In this section, we explore the methods that blood clot, we say the probability of a blood
clinicians can use to make judgments about clot given a swollen leg is 0.3, denoted:
the probability of disease before they order
p [ blood clot|swollen leg ] = 0.3.
tests. Probability is our preferred means of
expressing uncertainty. In this framework, Before the swollen leg is noted, the pretest
probability (p) expresses a clinician’s opin- probability is simply the prevalence of blood
ion about the likelihood of an event as a clots in the leg in the population from which
number between 0 and 1. An event that is the patient was selected—a number likely to
certain to occur has a probability of 1; an be much smaller than 0.3.
event that is certain not to occur has a prob- Now that we have decided to use probabili-
ability of 0.4 ty to express uncertainty, how can we estimate
The probability of event A is written p[A]. probability? We can do so by either subjective
The sum of the probabilities of all possible, or objective methods; each approach has ad-
collectively exhaustive outcomes of a chance vantages and limitations.
event must be equal to 1. Thus, in a coin flip,
p [ heads ] + p [ tails ] = 1.0.
3.2.1 Subjective Probability
Assessment
Most assessments that clinicians make about
3 Note that pretest and post-test probabilities corre-
spond to the concepts of prevalence and predictive probability are based on personal experience.
value. The latter terms were used in 7 Chap. 2 The clinician may compare the current prob-
because the discussion was about the use of tests for lem to similar problems encountered previous-
screening populations of patients; in a population, ly and then ask: “What was the frequency of
the pretest probability of disease is simply that dis-
disease in similar patients whom I have seen?”
ease’s prevalence in that population.
4 We assume a Bayesian interpretation of probability; To make these subjective assessments of
there are other statistical interpretations of proba- probability, people rely on several discrete, often
bility. unconscious mental processes that have been de-
84 D. K. Owens et al.
scribed and studied by cognitive psychologists heuristic, and it is often misleading. We re-
(Tversky and Kahneman 1974). These processes member dramatic, atypical, or emotion-
are termed cognitive heuristics. laden events more easily and therefore are
More specifically, a cognitive heuristic is a likely to overestimate their probability. A
mental process by which we learn, recall, or pro- clinician who had cared for a patient who
cess information; we can think of heuristics as had a swollen leg and who then died from
3 rules of thumb. Knowledge of heuristics is im- a blood clot would vividly remember
portant because it helps us to understand the thrombosis as a cause of a swollen leg. The
underpinnings of our intuitive probability as- clinician would remember other causes of
sessment. Both naive and sophisticated decision swollen legs less easily, and he or she would
makers (including clinicians and statisticians) tend to overestimate the probability of a
misuse heuristics and therefore make system- blood clot in patients with a swollen leg.
atic—often serious—errors when estimating 3. Anchoring and adjustment. Another com-
probability. So, just as we may underestimate mon heuristic used to judge probability is
distances on a particularly clear day (Tversky anchoring and adjustment. A clinician
and Kahneman 1974), we may make mistakes in makes an initial probability estimate (the
estimating probability in deceptive clinical situa- anchor) and then adjusts the estimate based
tions. Three heuristics have been identified as on further information. For instance, the
important in estimation of probability: clinician in 7 Example 3.4 makes an initial
1. Representativeness. One way that people es- estimate of the probability of heart disease
timate probability is to ask themselves: What as 0.5. If he or she then learns that all the
is the probability that object A belongs to patient’s brothers had died of heart disease,
class B? For instance, what is the probability the clinician should raise the estimate be-
that this patient who has a swollen leg be- cause the patient’s strong family history of
longs to the class of patients who have blood heart disease increases the probability that
clots? To answer, we often rely on the repre- he or she has heart disease, a fact the clini-
sentativeness heuristic in which probabilities cian could ascertain from the literature. The
are judged by the degree to which A is repre- usual mistake is to adjust the initial esti-
sentative of, or similar to, B. The clinician mate (the anchor) insufficiently in light of
will judge the probability of the development the new information. Instead of raising his
of a blood clot (thrombosis) by the degree to or her estimate of prior probability to, say,
which the patient with a swollen leg resem- 0.8, the clinician might adjust it to only 0.6.
bles the clinician’s mental image of patients
with a blood clot. If the patient has all the Heuristics often introduce error into our judg-
classic findings (signs and symptoms) associ- ments about prior probability. Errors in our
ated with a blood clot, the clinician judges initial estimates of probabilities will be reflect-
that the patient is highly likely to have a ed in the posterior probabilities even if we use
blood clot. Difficulties occur with the use of quantitative methods to derive those posterior
this heuristic when the disease is rare (very probabilities. An understanding of heuristics
low prior probability, or prevalence); when is thus important for medical decision mak-
the clinician’s previous experience with the ing. The clinician can avoid some of these dif-
disease is atypical, thus giving an incorrect ficulties by using published research results to
mental representation; when the patient’s estimate probabilities.
clinical profile is atypical; and when the
probability of certain findings depends on
whether other findings are present. 3.2.2 Objective Probability
2. Availability. Our estimate of the probabil- Estimates
ity of an event is influenced by the ease
with which we remember similar events. Published research results can serve as a guide
Events more easily remembered are judged for more objective estimates of probabilities. We
more probable; this rule is the availability can use the prevalence of disease in the popula-
Biomedical Decision Making: Probabilistic Clinical Reasoning
85 3
tion or in a subgroup of the population, or clin- What is the probability that Ms. Troy will
ical prediction rules, to estimate the probability suffer a cardiac complication? Clinical pre-
of disease. diction rules have been developed to help cli-
As we discussed in 7 Chap. 2, the preva- nicians to assess this risk (Palda and Detsky
lence is the frequency of an event in a popula- 1997). . Table 3.1 lists clinical findings and
tion; it is a useful starting point for estimating their corresponding diagnostic weights. We
probability. For example, if you wanted to es- add the diagnostic weights for each of the
timate the probability of prostate cancer in a patient’s clinical findings to obtain the total
50-year-old man, the prevalence of prostate score. The total score places the patient in a
cancer in men of that age (5–14%) would be a group with a defined probability of cardiac
useful anchor point from which you could in- complications, as shown in . Table 3.2. Ms.
crease or decrease the probability depending Troy receives a score of 20; thus, the clinician
on your findings. Estimates of disease preva- can estimate that the patient has a 27%
lence in a defined population often are avail- chance of developing a severe cardiac com-
able in the medical literature. plication.
Symptoms, such as difficulty with urina- Objective estimates of pretest probability
tion, or signs, such as a palpable prostate nod- are subject to error because of bias in the
ule, can be used to place patients into a clinical studies on which the estimates are based. For
subgroup in which the probability of disease is
known. For patients referred to a urologist for
evaluation of a prostate nodule, the preva- .. Table 3.1 Diagnostic weights for assessing
lence of cancer is about 50%. This approach risk of cardiac complications from noncardiac
may be limited by difficulty in placing a pa- surgery
tient in the correct clinically defined subgroup,
especially if the criteria for classifying patients Clinical finding Diagnostic
weight
are ill-defined. A trend has been to develop
guidelines, known as clinical prediction rules, Age greater than 70 years 5
to help clinicians assign patients to well-de-
Recent documented heart
fined subgroups in which the probability of attack
disease is known.
Clinical prediction rules are developed from >6 months previously 5
systematic study of patients who have a particu- <6 months previously 10
lar diagnostic problem; they define how clini- Severe angina 20
cians can use combinations of clinical findings
to estimate probability. The symptoms or signs Pulmonary edemaa
that make an independent contribution to the Within 1 week 10
probability that a patient has a disease are iden- Ever 5
tified and assigned numerical weights based on
statistical analysis of the finding’s contribution. Arrhythmia on most recent
ECG 5
The result is a list of symptoms and signs for an
individual patient, each with a corresponding >5 PVCs 5
numerical contribution to a total score. The to- Critical aortic stenosis 20
tal score places a patient in a subgroup with a
Poor medical condition 5
known probability of disease.
Emergency surgery 10
►►Example 3.5
ECG electrocardiogram, PVCs premature ven-
Ms. Troy, a 65-year-old woman who had a heart tricular contractions on preoperative electrocar-
attack 4 months ago, has abnormal heart rhythm diogram
(arrhythmia), is in poor medical condition, and aFluid in the lungs due to reduced heart function
Normal Abnormal
Healthy
population
Cutoff
Number of Diseased
value
individuals population
Test result
.. Fig. 3.2 Distribution of test results in healthy and possible values changes the relative proportions of false
diseased individuals. Varying the cutoff between “nor- positives (FPs) and false negatives (FNs) for the two
mal” and “abnormal” across the continuous range of populations
abnormal by the statistical criterion, 2.5% of A perfect test would have no FN or FP re-
healthy individuals will have an abnormal sults. Erroneous test results do occur, howev-
test. If there is an overlap in the distribution er, and you can use a 2 × 2 contingency table
of test results in healthy and diseased individ- to define the measures of test performance
uals, some diseased patients will have a nor- that reflect these errors.
mal test (see . Fig. 3.2). You should be
3 familiar with the terms used to denote these
groups: 3.3.2 Measures of Test Performance
55 A true positive (TP) is a positive test result
obtained for a patient in whom the disease Measures of test performance are of two
is present (the test result correctly classifies types: measures of agreement between tests or
the patient as having the disease). measures of concordance, and measures of
55 A true negative (TN) is a negative test re- disagreement or measures of discordance.
sult obtained for a patient in whom the Two types of concordant test results occur in
disease is absent (the test result correctly the 2 × 2 table in . Table 3.3: TPs and TNs.
classifies the patient as not having the dis- The relative frequencies of these results form
ease). the basis of the measures of concordance.
55 A false positive (FP) is a positive test result These measures correspond to the ideas of the
obtained for a patient in whom the disease sensitivity and specificity of a test, which we
is absent (the test result incorrectly classi- introduced in 7 Chap. 2. We define each mea-
fies the patient as having the disease). sure in terms of the 2 × 2 table and in terms of
55 A false negative (FN) is a negative test re- conditional probabilities.
sult obtained for a patient in whom the The true-positive rate (TPR), or sensitivi-
disease is present (the test result incorrect- ty, is the likelihood that a diseased patient has
ly classifies the patient as not having the a positive test. In conditional-probability no-
disease). tation, sensitivity is expressed as the probabil-
ity of a positive test given that disease is
. Figure 3.2 shows that varying the cutoff present:
point (moving the vertical line in the figure)
p [ positive test|disease ].
for an abnormal test will change the relative
proportions of these groups. As the cutoff is Another way to think of the TPR is as a ratio.
moved further up from the mean of the nor- The likelihood that a diseased patient has a
mal values, the number of FNs increases and positive test is given by the ratio of diseased
the number of FPs decreases. Once we have
chosen a cutoff point, we can conveniently
summarize test performance—the ability to .. Table 3.3 A 2 × 2 contingency table for test
discriminate disease from nondisease—in a 2 results
× 2 contingency table, as shown in . Table
Results of Disease Disease Total
3.3. The table summarizes the number of pa- test present absent
tients in each group: TP, FP, TN, and FN.
Note that the sum of the first column is the Positive TP FP TP + FP
total number of diseased patients, TP + FN. result
The sum of the second column is the total Negative FN TN FN + TN
number of nondiseased patients, FP + TN. result
The sum of the first row, TP + FP, is the total TP + FN FP + TN
number of patients with a positive test result.
Likewise, FN + TN gives the total number of TP true positive, TN true negative, FP false posi-
patients with a negative test result. tive, FN false negative
Biomedical Decision Making: Probabilistic Clinical Reasoning
89 3
patients with a positive test to all diseased pa-
.. Table 3.4 A 2 × 2 contingency table for HIV
tients: antibody EIA
0.98. If the test were performed on 100 pa- and the therapy is dangerous, we should set
tients who truly had the antibody, we would the cutoff value to minimize FP results.
expect the test to be positive in 98 of the pa- We stress the point that sensitivity and spec-
tients. Conversely, we would expect two of the ificity are characteristics not of a test per se but
patients to receive incorrect, negative results, rather of the test and a criterion for when to
for an FNR of 2%. (You should convince call that test abnormal. Varying the cutoff in .
3 yourself that the sum of TPR and FNR by Fig. 3.2 has no effect on the test itself (the way
definition must be 1: TPR + FNR = 1). it is performed, or the specific values for any
And the TNR is: particular patient); instead, it trades off speci-
ficity for sensitivity. Thus, the best way to char-
TN 297 acterize a test is by the range of values of
= = 0.99
TN + FP 297 + 3 sensitivity and specificity that it can take on
over a range of possible cutoffs. The typical
The likelihood that a patient who has no HIV way to show this relationship is to plot the test’s
antibody will have a negative test is 0.99. sensitivity against 1 minus specificity (i.e., the
Therefore, if the EIA test were performed on TPR against the FPR), as the cutoff is varied
100 individuals who had not been infected and the two test characteristics are traded off
with HIV, it would be negative in 99 and in- against each other (. Fig. 3.3). The resulting
correctly positive in 1. (Convince yourself that curve, known as a receiver-operating character-
the sum of TNR and FPR also must be 1: istic (ROC) curve, was originally described by
TNR + FPR = 1). researchers investigating methods of electro-
magnetic-signal detection and was later ap-
plied to the field of psychology (Peterson and
3.3.3 I mplications of Sensitivity Birdsall 1953; Swets 1973). Any given point
and Specificity: How to along a ROC curve for a test corresponds to
Choose Among Tests the test sensitivity and specificity for a given
threshold of “abnormality.” Similar curves can
It may be clear to you already that the calcu- be drawn for any test used to associate ob-
lated values of sensitivity and specificity for a
continuous-valued test depend on the particu-
lar cutoff value chosen to distinguish normal 1.0 Test B
and abnormal results. In . Fig. 3.2, note that
True-positive rate (sensitivity)
The experience with CEA has been repeated standard test than are patients with positive
with numerous tests. Early measures of test tests. In other words, the study population,
discrimination are overly optimistic, and sub- comprising individuals with positive index–test
sequent test performance is disappointing. results, has a higher percentage of patients with
Problems arise when the TPR and TNR, as disease than does the clinically relevant popula-
measured in the study population, do not ap- tion. Therefore, both TN and FN tests will be
3 ply to the clinically relevant population. These underrepresented in the study population. The
problems usually are the result of bias in the de- result is overestimation of the TPR and under-
sign of the initial studies—notably spectrum estimation of the TNR in the study population.
bias, test referral bias, or test interpretation bias. Test-interpretation bias develops when the
Spectrum bias occurs when the study popu- interpretation of the index test affects that of the
lation includes only individuals who have ad- gold standard test or vice versa. This bias causes
vanced disease (“sickest of the sick”) and an artificial concordance between the tests (the
healthy volunteers, as is often the case when a results are more likely to be the same) and spuri-
test is first being developed. Advanced disease ously increases measures of concordance—the
may be easier to detect than early disease. For sensitivity and specificity—in the study popula-
example, cancer is easier to detect when it has tion. (Remember, the relative frequencies of TPs
spread throughout the body (metastasized) and TNs are the basis for measures of concor-
than when it is localized to, say, a small portion dance). To avoid these problems, the person in-
of the colon. In contrast to the study popula- terpreting the index test should be unaware of
tion, the clinically relevant population will con- the results of the gold standard test.
tain more cases of early disease that are more To counter these three biases, you may
likely to be missed by the index test (FNs). need to adjust the TPR and TNR when they
Thus, the study population will have an artifac- are applied to a new population. All the biases
tually low FNR, which produces an artifactu- result in a TPR that is higher in the study pop-
ally high TPR (TPR = 1 − FNR). In addition, ulation than it is in the clinically relevant pop-
healthy volunteers are less likely than are pa- ulation. Thus, if you suspect bias, you should
tients in the clinically relevant population to adjust the TPR (sensitivity) downward when
have other diseases that may cause FP results6; you apply it to a new population.
the study population will have an artificially Adjustment of the TNR (specificity) de-
low FPR, and therefore the specificity will be pends on which type of bias is present.
overestimated (TNR = 1 − FPR). Inaccuracies Spectrum bias and test interpretation bias re-
in early estimates of the TPR and TNR of the sult in a TNR that is higher in the study popu-
CEA were partly due to spectrum bias. lation than it will be in the clinically relevant
Test-referral bias (sometimes referred to as population. Thus, if these biases are present,
referral bias) occurs when a positive index test you should adjust the specificity downward
is a criterion for ordering the gold standard when you apply it to a new population. Test-
test. In clinical practice, patients with negative referral bias, on the other hand, produces a
index tests are less likely to undergo the gold measured specificity in the study population
that is lower than it will be in the clinically rel-
evant population. If you suspect test referral
6 Volunteers are often healthy, whereas patients in the bias, you should adjust the specificity upward
clinically relevant population often have several dis- when you apply it to a new population.
eases in addition to the disease for which a test is
designed. These other diseases may cause FP test
results. For example, patients with benign (rather
than malignant) enlargement of their prostate 3.3.6 eta-Analysis of Diagnostic
M
glands are more likely than are healthy volunteers to Tests
have FP elevations of prostate-specific antigen
(Meigs et al. 1996), a substance in the blood that is
Often, there are many studies that evaluate the
elevated in men who have prostate cancer. Measure-
ment of prostate-specific antigen is often used to sensitivity and specificity of the same diagnostic
detect prostate cancer. test. If the studies come to similar conclusions
Biomedical Decision Making: Probabilistic Clinical Reasoning
93 3
about the sensitivity and specificity of the test, 3.4.1 Bayes’ Theorem
you can have increased confidence in the results
of the studies. But what if the studies disagree? As we noted earlier in this chapter, a clinician can
For example, by 1995, over 100 studies had as- use the disease prevalence in the patient popula-
sessed the sensitivity and specificity of the PCR tion as an initial estimate of the pretest risk of
for diagnosis of HIV (Owens et al. 1996a, b); disease. Once clinicians begin to accumulate in-
these studies estimated the sensitivity of PCR to formation about a patient, however, they revise
be as low as 10% and to be as high as 100%, and their estimate of the probability of disease. The
they assessed the specificity of PCR to be be- revised estimate (rather than the disease preva-
tween 40 and 100%. Which results should you lence in the general population) becomes the pre-
believe? One approach that you can use is to as- test probability for the test that they perform.
sess the quality of the studies and to use the es- After they have gathered more information with
timates from the highest-quality studies. a diagnostic test, they can calculate the post-test
For evaluation of PCR, however, even the probability of disease with Bayes’ theorem.
high-quality studies did not agree. Another ap- Bayes’ theorem is a quantitative method
proach is to perform a meta-analysis: a study for calculating post-test probability using the
that combines quantitatively the estimates pretest probability and the sensitivity and
from individual studies to develop a summary specificity of the test. The theorem is derived
ROC curve (Moses et al. 1993; Owens et al. from the definition of conditional probability
1996a, b; Hellmich et al. 1999; Leeflang et al. and from the properties of probability (see the
2008; Leeflang 2014). Investigators develop a Appendix to this chapter for the derivation).
summary ROC curve by using estimates from Recall that a conditional probability is the
many studies, in contrast to the type of ROC probability that event A will occur given that
curve discussed in 7 Sect. 3.3.3, which is devel- event B is known to occur (see 7 Sect. 3.2). In
oped from the data in a single study. Summary general, we want to know the probability that
ROC curves provide the best available ap- disease is present (event A), given that the test is
proach to synthesizing data from many studies. known to be positive (event B). We denote the
7 Section 3.3 has dealt with the second presence of disease as D, its absence as − D, a
step in the diagnostic process: acquisition of test result as R, and the pretest probability of
further information with diagnostic tests. We disease as p[D]. The probability of disease, given
have learned how to characterize the perfor- a test result, is written p[D|R]. Bayes’ theorem is:
mance of a test with sensitivity (TPR) and spec-
ificity (TNR). These measures reveal the p [ D ] ´ p [ R|D ]
p [ D|R ] =
probability of a test result given the true state of p [ D ] ´ p [ R|D ] + p [ -D ] ´ p [ R| - D ]
the patient. They do not, however, answer the
clinically relevant question posed in the opening We can reformulate this general equation in
example: Given a positive test result, what is the terms of a positive test, (+), by substituting
probability that this patient has the disease? To p[D|+] for p[D|R], p[+|D] for p[R|D], p[+| −
answer this question, we must learn methods to D] for p[R| − D], and 1 − p[D] for p[− D].
calculate the post-test probability of disease. From 7 Sect. 3.3, recall that p[+|D] = TPR
and p[+| − D] = FPR. Substitution provides
Bayes’ theorem for a positive test:
3.4 ost-test Probability: Bayes’
P p [ D ] ´ TPR
p [ D| + ] =
Theorem and Predictive Value p [ D ] ´ TPR + (1 - p [ D ]) ´ FPR
The third stage of the diagnostic process (see We can use a similar derivation to develop
. Fig. 3.1a) is to adjust our probability esti- Bayes’ theorem for a negative test:
mate to take into account the new informa- p [ D ] ´ FNR
tion gained from diagnostic tests by calculating p [ D| - ] =
p [ D ] ´ FNR + (1 - p [ D ]) ´ TNR
the post-test probability.
94 D. K. Owens et al.
Post-test probability
ity give the probability of a particular test result
0.6
in a patient who has a particular disease state.
3 The PV gives the probability of true disease state
once the patient’s test result is known. 0.4
The PV+ calculated from . Table 3.4 is 0.97,
so we expect 97 of 100 patients with a positive 0.2
index test actually to have antibody. Yet, in 7
Example 3.1, we found that fewer than one of 0.0
ten patients with a positive test were expected to
0.0 0.2 0.4 0.6 0.8 1.0
have antibody. What explains the discrepancy in
Pretest probability
these examples? The sensitivity and specificity
(and, therefore, the LRs) in the two examples are b 1.0
identical. The discrepancy is due to an extremely
important and often overlooked characteristic 0.8
of PV: the PV of a test depends on the preva-
Post-test probability
Post-test probability
has little effect on the post-test probability if 0.6 TNR=0.80
the pretest probability is low.
This discussion emphasizes a key idea of
0.4 TNR=0.80
this chapter: the interpretation of a test result
depends on the pretest probability of disease. If TNR=0.90
the pretest probability is low, a positive test re- 0.2
TNR=0.98
sult has a large effect, and a negative test result
has a small effect. If the pretest probability is 0.0
high, a positive test result has a small effect, and 0.0 0.2 0.4 0.6 0.8 1.0
a negative test result has a large effect. In other Pretest probability
words, when the clinician is almost certain of
b 1.0
the diagnosis before testing (pretest probabil-
TPR=0.95
ity nearly 0 or nearly 1), a confirmatory test
has little effect on the posterior probability 0.8 TPR=0.60
(see 7 Example 3.8). If the pretest probability is
Post-test probability
0.
terpretation of a positive test; test sensitivity are inaccurate estimation of pretest probabil-
affects primarily the interpretation of a nega- ity, faulty application of test-performance
tive test. In both parts (a) and (b) of . Fig. 3.5, measures, and violation of the assumptions of
the top family of curves corresponds to positive conditional independence and of mutual ex-
test results and the bottom family to negative clusivity.
test results. . Figure 3.5a shows the post-test Bayes’ theorem provides a means to adjust
3 probabilities for tests with varying specificities an estimate of pretest probability to take into
(TNR). Note that changes in the specificity account new information. The accuracy of
produce large changes in the top family of the calculated post-test probability is limited,
curves (positive test results) but have little effect however, by the accuracy of the estimated pre-
on the lower family of curves (negative test re- test probability. Accuracy of estimated prior
sults). That is, an increase in the specificity of a probability is increased by proper use of pub-
test markedly changes the post-test probability lished prevalence rates, heuristics, and clinical
if the test is positive but has relatively little ef- prediction rules. In a decision analysis, as we
fect on the post-test probability if the test is shall see, a range of prior probability often is
negative. Thus, if you are trying to rule in a di- sufficient. Nonetheless, if the pretest probabil-
agnosis,8 you should choose a test with high ity assessment is unreliable, Bayes’ theorem
specificity or a high LR+. . Figure 3.5b shows will be of little value.
the post-test probabilities for tests with varying A second potential mistake that you can
sensitivities. Note that changes in sensitivity make when using Bayes’ theorem is to apply
produce large changes in the bottom family of published values for the test sensitivity and
curves (negative test results) but have little ef- specificity, or LRs, without paying attention
fect on the top family of curves. Thus, if you to the possible effects of bias in the studies in
are trying to exclude a disease, choose a test which the test performance was measured
with a high sensitivity or a high LR−. (see 7 Sect. 3.3.5). With certain tests, the LRs
may differ depending on the pretest odds in
part because differences in pretest odds may
3.4.5 autions in the Application of
C reflect differences in the spectrum of disease
Bayes’ Theorem in the population.
A third potential problem arises when you
Bayes’ theorem provides a powerful method use Bayes’ theorem to interpret a sequence of
for calculating post-test probability. You should tests. If a patient undergoes two tests in se-
be aware, however, of the possible errors you quence, you can use the post-test probability
can make when you use it. Common problems after the first test result, calculated with
Bayes’ theorem, as the pretest probability for
the second test. Then, you use Bayes’ theorem
a second time to calculate the post-test prob-
8 In medicine, to rule in a disease is to confirm that the
patient does have the disease; to rule out a disease is ability after the second test. This approach is
to confirm that the patient does not have the disease. valid, however, only if the two tests are condi-
A doctor who strongly suspects that his or her tionally independent. Tests for the same dis-
patient has a bacterial infection orders a culture to ease are conditionally independent when the
rule in his or her diagnosis. Another doctor is almost
probability of a particular result on the sec-
certain that his or her patient has a simple sore
throat but orders a culture to rule out streptococcal ond test does not depend on the result of the
infection (strep throat). This terminology oversim- first test, given (conditioned on) the disease
plifies a diagnostic process that is probabilistic. state. Expressed in conditional probability
Diagnostic tests rarely, if ever, rule in or rule out a notation for the case in which the disease is
disease; rather, the tests raise or lower the probabil-
present,
ity of disease.
Biomedical Decision Making: Probabilistic Clinical Reasoning
99 3
p [second test positive|first test positive and disease present ]
= p [second test positive|first test negative and disease present ]
= p [second test positive|disease present ].
If the conditional independence assumption use the ideas developed in the preceding sec-
is satisfied, the post-test odds = pretest odds × tions to solve such difficult decision problems.
LR1 × LR2. If you apply Bayes’ theorem se- Here we discuss two methods: the decision
quentially in situations in which conditional tree, a method for representing and comparing
independence is violated, you will obtain inac- the expected outcomes of each decision alter-
curate post-test probabilities. native; and the threshold probability, a meth-
The fourth common problem arises when od for deciding whether new information can
you assume that all test abnormalities result change a management decision. These tech-
from one (and only one) disease process. The niques help you to clarify the decision problem
Bayesian approach, as we have described it, and thus to choose the alternative that is most
generally presumes that the diseases under con- likely to help the patient.
sideration are mutually exclusive. If they are
not, Bayesian updating must be applied with
great care. 3.5.1 omparison of Uncertain
C
We have shown how to calculate post-test Prospects
probability. In 7 Sect. 3.5, we turn to the prob-
lem of decision making when the outcomes of Like those of most biological events, the out-
a clinician’s actions (e.g., of treatments) are un- come of an individual’s illness is unpredictable.
certain. How can a clinician determine which course of
action has the greatest chance of success?
100 100
80 Treatment A 80 Treatment B
Percent of patients
Percent of patients
60 60
40 40
20 20
0 0
0 1 2 3 4 5 0 1 2 3 4 5
Survival (years) Survival (years)
.. Fig. 3.6 Survival after therapy for a fatal disease. Two therapies are available; the results of either are unpredictable
100 D. K. Owens et al.
Treatment A Treatment B
.. Fig. 3.7 A chance-node representation of survival after the two therapies in . Fig. 3.6. The probabilities times
the corresponding years of survival are summed to obtain the total expected survival
chance node is shown as a circle from which eral hundred patients were assigned to receive
several lines emanate. Each line represents one either therapy A or therapy B, the expected
of the possible outcomes. Associated with each survival would be 2.3 years for therapy A and
line is the probability of the outcome occurring. 3.1 years for therapy B.
For a single patient, only one outcome can oc- We have just described the basis of expect-
cur. Some physicians object to using probability ed-value decision making. The term expected
for just this reason: “You cannot rely on popu- value is used to characterize a chance event,
lation data, because each patient is an individu- such as the outcome of a therapy. If the out-
al.” In fact, we often must use the frequency of comes of a therapy are measured in units of
the outcomes of many patients experiencing the duration of survival, units of sense of well-
same event to inform our opinion about what being, or dollars, the therapy is characterized
might happen to an individual. From these fre- by the expected duration of survival, expected
quencies, we can make patient-specific adjust- sense of well-being, or expected monetary
ments and thus estimate the probability of each cost that it will confer on, or incur for, the pa-
outcome at a chance node. tient, respectively.
A chance node can represent more than To use expected-value decision making, we
just an event governed by chance. The out- follow this strategy when there are therapy
come of a chance event, unknowable for the choices with uncertain outcomes: (1) calculate
individual, can be represented by the expected the expected value of each decision alternative
value at the chance node. The concept of ex- and then (2) pick the alternative with the high-
pected value is important and is easy to un- est expected value.
derstand. We can calculate the mean survival
that would be expected based on the probabil-
ities depicted by the chance node in . Fig. 3.5.3 erformance of a Decision
P
3.7. This average length of life is called the Analysis
expected survival or, more generally, the ex-
pected value of the chance node. We calculate We clarify the concepts of expected-value de-
the expected value at a chance node by the cision making by discussing an example.
process just described: we multiply the sur- There are four steps in decision analysis:
vival value associated with each possible out- 1. Create a decision tree; this step is the most
come by the probability that that outcome will difficult, because it requires formulating
occur. We then sum the product of probability the decision problem, assigning probabili-
times survival over all outcomes. Thus, if sev- ties, and measuring outcomes.
102 D. K. Owens et al.
2. Calculate the expected value of each deci- Mr. Danby’s internist is familiar with deci-
sion alternative. sion analysis. She recognizes that this problem
3. Choose the decision alternative with the is filled with uncertainty: Mr. Danby’s ability to
highest expected value. survive the operation is in doubt, and the sur-
4. Use sensitivity analysis to test the conclu- gery sometimes does not restore mobility to the
sions of the analysis. degree required by such a patient. Furthermore,
3 there is a small chance that the prosthesis (the
Some health professionals hesitate when they artificial knee) will become infected, and Mr.
first learn about the technique of decision Danby then would have to undergo a second
analysis, because they recognize the opportu- risky operation to remove it. After removal of
nity for error in assigning values to both the the prosthesis, Mr. Danby would never again be
probabilities and the utilities in a decision able to walk, even with canes. The possible out-
tree. They reason that the technique encour- comes of knee replacement include death from
ages decision making based on small differ- the first procedure and death from a second
ences in expected values that are estimates at mandatory procedure if the prosthesis becomes
best. The defense against this concern, which infected (which we will assume occurs in the
also has been recognized by decision analysts, immediate postoperative period, if it occurs at
is the technique known as sensitivity analysis. all). Possible functional outcomes include
We discuss this important fourth step in deci- recovery of full mobility or continued, and
sion analysis in 7 Sect. 3.5.5. In addition, de- unchanged, poor mobility. Should Mr. Danby
cision analysis helps make the assumptions choose to undergo knee replacement surgery,
underlying a decision explicit, so that the as- or should he accept the status quo? ◄
sumptions can be assessed carefully.
The first step in decision analysis is to create Using the conventions of decision analysis,
a decision tree that represents the decision prob- the internist sketches the decision tree shown
lem. Consider the following clinical problem. in . Fig. 3.8. According to these conventions,
a square box denotes a decision node, and
►►Example 3.12 each line emanating from a decision node rep-
The patient is Mr. Danby, a 66-year-old man resents an action that could be taken.
who has been crippled with arthritis of both According to the methods of expected-
knees so severely that, while he can get about the value decision making, the internist first must
house with the aid of two canes, he must oth- assign a probability to each branch of each
erwise use a wheelchair. His other major health chance node. To accomplish this task, the inter-
problem is emphysema, a disease in which the nist asks several orthopedic surgeons for their
lungs lose their ability to exchange oxygen and estimates of the chance of recovering full func-
carbon dioxide between blood and air, which in tion after surgery (p[full recovery] = 0.60) and
turn causes shortness of breath (dyspnea). He the chance of developing infection in the pros-
is able to breathe comfortably when he is in a thetic joint (p[infection] = 0.05). She uses her
wheelchair, but the effort of walking with canes subjective estimate of the probability that the
makes him breathe heavily and feel uncom- patient will die during or immediately after
fortable. Several years ago, he seriously con- knee surgery (p[operative death] = 0.05).
sidered knee replacement surgery but decided Next, she must assign a value to each out-
against it, largely because his internist told him come. To accomplish this task, she first lists the
that there was a serious risk that he would not outcomes. As you can see from . Table 3.6, the
survive the operation because of his lung dis- outcomes differ in two dimensions: length of life
ease. Recently, however, Mr. Danby’s wife had (survival) and quality of life (functional status).
a stroke and was partially paralyzed; she now To characterize each outcome accurately, the in-
requires a degree of assistance that the patient ternist must develop a measure that takes into
cannot supply given his present state of mobil- account these two dimensions. Simply using du-
ity. He tells his doctor that he is reconsidering ration of survival is inadequate because Mr.
knee replacement surgery. Danby values 5 years of good health more than
Biomedical Decision Making: Probabilistic Clinical Reasoning
103 3
Operative death
Operative death
Infection
Surgery
Survival
Survival
Full mobility
No infection
Poor mobility
No surgery
.. Fig. 3.8 Decision tree for knee replacement surgery. The box represents the decision node (whether to have
surgery); the circles represent chance nodes
Operative death
Death 0
p = 0.05
Operative death
3 p = 0.05
Death 0
Infection
A
Surgery
D p = 0.05 p = 0.95 Wheelchair-bound 3
Survival
p = 0.95
C Full mobility 10
Survival Full mobility
p = 0.6
B
p = 0.95
p = 0.4
No infection Poor mobility 6
Poor mobility
Poor mobility 6
No surgery
.. Fig. 3.9 Decision tree for knee-replacement surgery. (measured in years of perfect mobility) are assigned to
Probabilities have been assigned to each branch of each the tips of each branch of the tree
chance node. The patient’s valuations of outcomes
the tree) to left (the root of the tree). Let us viving knee replacement surgery (Node C),
consider, for example, the expected value at she proceeds as follows:
the chance node representing the outcome of 1. Multiply the expected value of an infected
surgery to remove an infected prosthesis prosthesis (already calculated as 2.85
(Node A in . Fig. 3.9). The calculation re- QALYs) by the probability that the prosthe-
quires three steps: sis will become infected (0.05): 2.85 × 0.05 =
1. Calculate the expected value of operative 0.143 QALYs.
death after surgery to remove an infected 2. Multiply the expected value of never de-
prosthesis. Multiply the probability of op- veloping an infected prosthesis (already
erative death (0.05) by the QALY of the calculated as 8.4 QALYs) by the probabil-
outcome—death (0 years): 0.05 × 0 = 0 ity that the prosthesis will not become in-
QALY. fected (0.95): 8.4 × 0.95 = 7.98 QALYs.
2. Calculate the expected value of surviving 3. Add the expected values calculated in step
surgery to remove an infected knee prosthe- 1 (0.143 QALY) and step 2 (7.98 QALYs)
sis. Multiply the probability of surviving the to get the expected value of surviving knee
operation (0.95) by the number of healthy replacement surgery: 0.143 + 7.98 = 8.123
years equivalent to 10 years of being wheel- QALYs.
chair-bound (3 years): 0.95 × 3 = 2.85 QA-
LYs. The clinician performs this process, called av-
3. Add the expected values calculated in step eraging out at chance nodes, for node D as
1 (0 QALY) and step 2 (2.85 QALYs) to well, working back to the root of the tree, un-
obtain the expected value of developing an til the expected value of surgery has been cal-
infected prosthesis: 0 + 2.85 = 2.85 QALYs. culated. The outcome of the analysis is as
follows. For surgery, Mr. Danby’s average life
Similarly, the expected value at chance node B expectancy, measured in years of normal mo-
is calculated: (0.6 × 10) + (0.4 × 6) =8.4 bility, is 7.7. What does this value mean? It
QALYs. To obtain the expected value of sur- does not mean that, by accepting surgery,
Biomedical Decision Making: Probabilistic Clinical Reasoning
105 3
Mr. Danby is guaranteed 7.7 years of mobile quality of life with the outcome, and on the risk
life. One look at the decision tree will show involved in achieving the outcome (e.g., a cure
that some patients die in surgery, some devel- for cancer might require a risky surgical opera-
op infection, and some do not gain any tion). How can we incorporate these elements
improvement in mobility after surgery. Thus, into a decision analysis? To do so, we can repre-
an individual patient has no guarantees. If the sent patients’ preferences with utilities. The util-
clinician had 100 similar patients who under- ity of a health state is a quantitative measure of
went the surgery, however, the average number the desirability of a health state from the pa-
of mobile years would be 7.7. We can under- tient’s perspective. Utilities are typically ex-
stand what this value means for Mr. Danby pressed on a 0 to 1 scale, where 0 represents
only by examining the alternative: no surgery. death and 1 represents ideal health. For exam-
In the analysis for no surgery, the average ple, a study of patients who had chest pain (an-
length of life, measured in years of normal gina) with exercise rated the utility of mild,
mobility, is 6.0, which Mr. Danby considered moderate, and severe angina as 0.95, 0.92, and
equivalent to 10 years of continued poor mo- 0.82 (Nease et al. 1995), respectively. There are
bility. Not all patients will experience this out- several methods for assessing utilities.
come; some who have poor mobility will live The standard-gamble technique has the
longer than, and some will live less than, 10 strongest theoretical basis of the various ap-
years. The average length of life, however, ex- proaches to utility assessment, as shown by
pressed in years of normal mobility, will be 6. Von Neumann and Morgenstern and described
Because 6.0 is less than 7.7, on average the by Sox et al. (1988). To illustrate use of the
surgery will provide an outcome with higher standard gamble, suppose we seek to assess a
value to the patient. Thus, the internist recom- person’s utility for the health state of asymp-
mends performing the surgery. tomatic HIV infection. To use the standard
The key insight of expected-value decision gamble, we ask our subject to compare the de-
making should be clear from this example: giv- sirability of asymptomatic HIV infection to
en the unpredictable outcome in an individual, those of two other health states whose utility
the best choice for the individual is the alterna- we know or can assign. Often, we use ideal
tive that gives the best result on the average in health (assigned a utility of 1) and immediate
similar patients. Decision analysis can help the death (assigned a utility of 0) for the compari-
clinician to identify the therapy that will give son of health states. We then ask our subject to
the best results when averaged over many simi- choose between asymptomatic HIV infection
lar patients. The decision analysis is tailored to and a gamble with a chance of ideal health or
a specific patient in that both the utility func- immediate death. We vary the probability of
tions and the probability estimates are adjusted ideal health and immediate death systemati-
to the individual. Nonetheless, the results of cally until the subject is indifferent between as-
the analysis represent the outcomes that would ymptomatic HIV infection and the gamble. For
occur on average in a population of patients example, a subject might be indifferent when
who have similar utilities and for whom uncer- the probability of ideal health is 0.8 and the
tain events have similar probabilities. probability of death is 0.2. At this point of in-
difference, the utility of the gamble and that of
asymptomatic HIV infection are equal. We cal-
3.5.4 epresentation of Patients’
R culate the utility of the gamble as the weighted
Preferences with Utilities average of the utilities of each outcome of the
gamble [(1 × 0.8) + (0 × 0.2)] = 0.8. Thus in this
In 7 Sect. 3.5.3, we introduced the concept of example, the utility of asymptomatic HIV in-
QALYs, because length of life is not the only fection is 0.8. Use of the standard gamble en-
outcome about which patients care. Patients’ ables an analyst to assess the utility of outcomes
preferences for a health outcome may depend that differ in length or quality of life. Because
on the length of life with the outcome, on the the standard gamble involves chance events, it
106 D. K. Owens et al.
also assesses a person’s willingness to take (1995); in patient living with HIV, see Joyce et
risks—called the person’s risk attitude. al. (2009) and (2012). Other approaches to
A second common approach to utility as- valuing health outcomes include the Quality of
sessment is the time-trade-off technique (Sox et Well-Being Scale, the Health Utilities Index,
al. 1988; Torrance and Feeny 1989). To assess and the EuroQoL (see Neumann et al. 2017,
the utility of asymptomatic HIV infection us- ch. 7). Each of these instruments assesses how
3 ing the time-trade-off technique, we ask a per- people value health outcomes and therefore
son to determine the length of time in a better may be appropriate for use in decision analyses
state of health (usually ideal health or best at- or cost-effectiveness analyses.
tainable health) that he or she would find In summary, we can use utilities to represent
equivalent to a longer period of time with as- how patients value complicated health out-
ymptomatic HIV infection. For example, if our comes that differ in length and quality of life
subject says that 8 months of life with ideal and in riskiness. Computer-based tools with an
health was equivalent to 12 months of life with interactive format have been developed for as-
asymptomatic HIV infection, then we calculate sessing utilities; they often include text and mul-
the utility of asymptomatic HIV infection as timedia presentations that enhance patients’
8 ÷ 12 = 0.67. The time-trade-off technique understanding of the assessment tasks and of
provides a convenient method for valuing out- the health outcomes (Sumner et al. 1991; Nease
comes that accounts for gains (or losses) in and Owens 1994; Lenert et al. 1995).
both length and quality of life. Because the
time trade-off does not include gambles, how-
ever, it does not assess a person’s risk attitude. 3.5.5 erformance of Sensitivity
P
Perhaps the strongest assumption underlying Analysis
the use of the time trade-off as a measure of
utility is that people are risk neutral. A risk- Sensitivity analysis is a test of the robustness
neutral decision maker is indifferent between of the conclusions of an analysis over a wide
the expected value of a gamble and the gamble range of assumptions about the probabilities
itself. For example, a risk- neutral decision and the values, or utilities. The probability of
maker would be indifferent between the choice an outcome at a chance node may be the best
of living 20 years (for certain) and that of tak- estimate that is available, but there often is a
ing a gamble with a 50% chance of living 40 wide range of reasonable probabilities that a
years and a 50% chance of immediate death clinician could use with nearly equal confi-
(which has an expected value of 20 years). In dence. We use sensitivity analysis to answer
practice, of course, few people are risk-neutral. this question: Do my conclusions regarding
Nonetheless, the time-trade- off technique is the preferred choice change when the proba-
used frequently to value health outcomes be- bility and outcome estimates are assigned val-
cause it is relatively easy to understand. ues that lie within a reasonable range?
Several other approaches are available to The knee-replacement decision in 7 Exam-
value health outcomes. To use the visual analog ple 3.12 illustrates the power of sensitivity
scale, a person simply rates the quality of life analysis. If the conclusions of the analysis
with a health outcome (e.g., asymptomatic (surgery is preferable to no surgery) remain the
HIV infection) on a scale from 0 to 100. same despite a wide range of assumed values
Although the visual analog scale is easy to ex- for the probabilities and outcome measures,
plain and use, it has no theoretical justification the recommendation is trustworthy. . Figures
as a valid measure of utility. Ratings with the 3.10 and 3.11 show the expected survival in
visual analog scale, however, correlate modest- healthy years with surgery and without surgery
ly well with utilities assessed by the standard under varying assumptions of the probability
gamble and time trade-off. For a demonstra- of operative death and the probability of at-
tion of the use of standard gambles, time trade- taining perfect mobility, respectively. Each
offs, and the visual analog scale to assess point (value) on these lines represents one cal-
utilities in patients with angina, see Nease et al. culation of expected survival using the tree in
Biomedical Decision Making: Probabilistic Clinical Reasoning
107 3
10 10
Surgery Surgery
5 5
No surgery No surgery
.. Fig. 3.10 Sensitivity analysis of the effect of operative .. Fig. 3.11 Sensitivity analysis of the effect of a suc-
mortality on length of healthy life (7 Example 3.12). As the cessful operative result on length of healthy life (7 Exam-
probability of operative death increases, the relative values ple 3.12). As the probability of a successful surgical result
of surgery versus no surgery change. The point at which the increases, the relative values of surgery versus no surgery
two lines cross represents the probability of operative death change. The point at which the two lines cross represents
at which no surgery becomes preferable. The solid line repre- the probability of a successful result at which surgery
sents the preferred option at a given probability becomes preferable. The solid line represents the preferred
option at a given probability
.. Table 3.8 Probability of future health states for the Markov model in Fig. 3.12
Health state Probability of health state at end of year
Year 1 Year 2 Year 3 Year 4 Year 5 Year 6 Year 7
Treatment-threshold
probability
Do not
Treat
treat
.. Fig. 3.13 Depiction of the treatment threshold is to withhold therapy. At probabilities of disease that
probability. At probabilities of disease that are less than are greater than the treatment threshold probability, the
the treatment threshold probability, the preferred action preferred action is to treat
old, you should do nothing unless a positive representation for such problems (Nease and
CTA scan result could raise the probability of Owens 1997; Owens et al. 1997).
pulmonary embolus to above 0.10. You have As shown in . Fig. 3.15, influence dia-
completed step 2. grams have certain features that are similar to
To decide whether a test result could affect decision trees, but they also have additional
your decision to treat, you must decide whether graphical elements. Influence diagrams repre-
3 a positive CTA scan result would raise the prob- sent decision nodes as squares and chance
ability of pulmonary embolism to more than nodes as circles. In contrast to decision trees,
0.10, the treatment threshold. You review the however, the influence diagram also has arcs
literature and learn that the LR for a positive
CTA scan is approximately 21 (Stein et al. 2006).
HIV+
A negative CTA scan result will move the 0.8099
10.50
Treat
probability of disease away from the treatment
HIV−
threshold and will be of no help in deciding “HIV+” 0.1901
75.46
HIV−
= 0.053 ´ 21 = 1.11. 75.50
0.9982
HIV+
A post-test odds of 1.1 is equivalent to a prob- Treat 0.08
10.50
.. Fig. 3.16 The influence diagram from . Fig. 3.15, with adjusted life expectancy, PCR polymerase chain reaction,
the probability and value tables associated with the nodes. NA not applicable, TX+ treated, TX− not treated. Test
The information in these tables is the same as that associ- results are shown in quotation marks (“HIV+”), and the true
ated with the branches and endpoints of the decision tree disease state is shown without quotation marks (HIV+).
in . Fig. 3.15. HIV human immunodeficiency virus, HIV+ (Source: Owens et al. (1997). Reproduced with permission)
HIV infected, HIV− not infected with HIV, QALE quality-
114 D. K. Owens et al.
For influence diagrams, the calculation of ex- the problem and the objectives of the analysis.
pected value is more complex (Owens et al. Although how to choose and design such mod-
1997), and generally must be performed with els is beyond our scope, we note other type of
computer software. With the appropriate soft- models that analysts use commonly for medical
ware, we can use influence diagrams to perform decision making. Microsimulation models are
the same analyses that we would perform with a individual-level health state transition models,
3 decision tree. Diagrams that have only chance similar to Markov models, that provide a means
nodes are called belief networks; we use them to to model very complex events flexibly over
perform probabilistic inference. time. They are useful when the clinical history
Why use an influence diagram instead of a of a problem is complex, such as might occur
decision tree? Influence diagrams have both ad- with cancer, heart disease, and other chronic
vantages and limitations relative to decision diseases. They are also useful for modeling indi-
trees. Influence diagrams represent graphically vidual heterogeneity which may depend on
the probabilistic relationships among variables combinations of individual characteristics (e.g.
(Owens et al. 1997). Such representation is ad- heterogeneity of response to treatment based
vantageous for problems in which probabilistic on medical conditions or genetics). Dynamic
conditioning is complex or in which communi- transmission models are particularly well-suited
cation of such conditioning is important (such for assessing the outcomes of infectious diseas-
as may occur in large models). In an influence es. These models divide a population into com-
diagram, probabilistic conditioning is indicated partments (for example, uninfected, infected,
by the arcs, and thus the conditioning is appar- recovered, dead), and transitions between com-
ent immediately by inspection. In a decision partments are governed by differential or dif-
tree, probabilistic conditioning is revealed by the ference equations. The rate of transition
probabilities in the branches of the tree. To de- between compartments depends in part on the
termine whether events are conditionally inde- number of individuals in the compartment, an
pendent in a decision tree requires that the important feature for infectious diseases in
analyst compare probabilities of events between which the transmission may depend on the
branches of the tree. Influence diagrams also are number of infected or susceptible individuals.
particularly useful for discussion with content Discrete event simulation models also are often
experts who can help to structure a problem but used to model interactions between people.
who are not familiar with decision analysis. In These models are composed of entities (a pa-
contrast, problems that have decision alterna- tient) that have attributes (clinical history), and
tives that are structurally different may be easier that experience events (a heart attack). An en-
for people to understand when represented with tity can interact with other entities and use re-
a decision tree, because the tree shows the struc- sources. Discrete event simulation models are
tural differences explicitly, whereas the influence also used when considering scarce resources
diagram does not. The choice of whether to use such as queues for a diagnostic test or an oper-
a decision tree or an influence diagram depends ating room slot. For more information on these
on the problem being analyzed, the experience types of models, we suggest a recent series of
of the analyst, the availability of software, and papers on best modeling practices; the paper by
the purpose of the analysis. For selected prob- Caro and colleagues noted in the suggested
lems, influence diagrams provide a powerful readings at the end of the chapter is an over-
graphical alternative to decision trees. view of this series of papers.
tain published estimates of disease preva- Again, from the definition of conditional
lence and test performance. Computer probability,
programs for performing statistical analyses
p [ R,D ] p [ R, - D ]
can be used on data collected by hospital in- p [ R|D ] = and p [ R| - D ] =
formation systems. Decision analysis soft- p [ D] p [ -D]
ware, available for personal computers, can
3 help clinicians to structure decision trees, to These expressions can be rearranged:
calculate expected values, and to perform p [ R,D ] = p [ D ] ´ p [ R| D ] , (3.3)
sensitivity analyses. Researchers continue to
explore methods for computer-based auto- p [ R, - D ] = p [ -D ] ´ p [ R | - D ]. (3.4)
mated development of practice guidelines
from decision models and use of computer- Substituting Eqs. 3.3 and 3.4 into Eq. 3.2, we
based systems to implement guidelines obtain Bayes’ theorem:
(Musen et al. 1996). With the growing matu-
rity of this field, there are now companies p [ D ] ´ p [ R|D ]
p [ D|R ] =
that offer formal analytical tools to assist p [ D ] ´ p [ R|D ] + p [ -D ] ´ p [ R| - D ]
with clinical outcome assessment and inter-
pretation of population datasets. nnSuggested Readings
Medical decision making often involves Briggs, A., Weinstein, M., Fenwick, E., Karnon,
uncertainty for the clinician and risk for the J., Sculpher, M., & Paltiel, A. (2012). Model
patient. Most health care professionals would parameter estimation and uncertainty analy-
welcome tools that help them make decisions sis: A report of the ISPOR-SMDM modeling
when they are confronted with complex clini- good research practices task force-6. Medical
cal problems with uncertain outcomes. There Decision Making, 32(5), 722–732. This article
are important medical problems for which de- describes best practices for estimating model
cision analysis offers such aid. parameters and for performing sensitivity
analyses, including probabilistic sensitivity
analysis.
3.10 ppendix A: Derivation of
A Caro, J., Briggs, A., Siebert, U., & Kuntz, K.
Bayes’ Theorem (2012). Modeling good research practices –
overview: A report of the ISPOR-SMDM
Bayes’ theorem is derived as follows. We de- modeling good research practices task force-1.
note the conditional probability of disease, D, Value in Health, 15, 796–803. This paper is an
given a test result, R, p[D|R]. The prior (pre- introduction to a series of papers that describe
test) probability of D is p[D]. The definition best modeling practices.
of conditional probability is: Hunink, M., Glasziou, P., Siegel, J., Weeks, J.,
Pliskin, J., Einstein, A., & Weinstein, M.
p [ R,D ]
p [ D|R ] = (3.1) (2001). Decision making in health and medi-
p [R ] cine. Cambridge: Cambridge University Press.
The probability of a test result (p[R]) is the This textbook addresses in detail most of the
sum of its probability in diseased patients and topics introduced in this chapter.
its probability in nondiseased patients: Nease, R. F., Jr., & Owens, D. K. (1997b). Use of
influence diagrams to structure medical deci-
p [ R ] = p [ R,D ] + p [ R, - D ]. sions. Medical Decision Making, 17(13), 263–
Substituting into Eq. 3.1, we obtain: 275. This article provides a comprehensive
introduction to the use of influence diagrams.
p [ R,D ] Neumann, P. J., Sanders, G. D., Russell, L. B.,
p [ D|R ] = (3.2)
p [ R,D ] + p [ R, - D ] Siegel, J. E., & Ganiats, T. G. (Eds.). (2017b).
Cost-effectiveness in health and medicine (2nd
Biomedical Decision Making: Probabilistic Clinical Reasoning
117 3
ed.). New York: Oxford University Press. This (b) Two known complications of heart
book provides authoritative guidelines for the surgery are stroke and heart attack,
conduct of cost-effectiveness analyses. with probabilities of 0.02 and 0.05,
Chapter 7 discusses approaches for valuing respectively. The patient asks what
health outcomes. chance he or she has of having both
Owens, D. K., Schacter, R. D., & Nease, R. F., Jr. complications. Assume that the com-
(1997b). Representation and analysis of medi- plications are conditionally indepen-
cal decision problems with influence diagrams. dent, and calculate your answer.
Medical Decision Making, 17(3), 241–262. (c) The patient wants to know the
This article provides a comprehensive intro- probability that he or she will
duction to the use of influence diagrams. have a stroke given that he or she
Raiffa, H. (1970). Decision analysis: Introductory has a heart attack as a complica-
lectures on choices under uncertainty. Reading: tion of the surgery. Assume that 1
Addison-Wesley. This now classic book pro- in 500 patients has both compli-
vides an advanced, nonmedical introduction cations, that the probability of
to decision analysis, utility theory, and deci- heart attack is 0.05, and that the
sion trees. events are independent. Calculate
Sox, H. C. (1986). Probability theory in the use of your answer.
diagnostic tests. Annals of Internal Medicine,
2. The results of a hypothetical study to
104(1), 60–66. This article is written for clini-
measure test performance of a diag-
cians; it contains a summary of the concepts
nostic test for HIV are shown in the 2
of probability and test interpretation.
× 2 table in . Table 3.9.
Sox, H. C., Higgins, M. C., & Owens, D. K.
(a) Calculate the sensitivity, specific-
(2013). Medical decision making. Chichester:
ity, disease prevalence, PV+, and
Wiley-Blackwell. This introductory textbook
PV–.
covers the subject matter of this chapter in
(b) Use the TPR and TNR calculated
greater detail, as well as discussing many other
in part (a) to fill in the 2 × 2 table
topics.
in . Table 3.10. Calculate the dis-
Tversky, A., & Kahneman, D. (1974b). Judgment
ease prevalence, PV+, and PV–.
under uncertainty: Heuristics and biases.
Science, 185, 1124. This now classic article
provides a clear and interesting discussion of
the experimental evidence for the use and mis- .. Table 3.9 A 2 × 2 contingency table for the
use of heuristics in situations of uncertainty. hypothetical study in problem 2
Owens, D. K., & Nease, R. F., Jr. (1997). A normative Salpeter, S. R., Sanders, G. D., Salpeter, E. E., & Owens,
analytic framework for development of practice D. K. (1997). Monitored isoniazid prophylaxis for
guidelines for specific clinical populations. Medical low-risk tuberculin reactors older than 35 years of
Decision Making, 17(4), 409–426. age: A risk-benefit and cost-effectiveness analysis.
Owens, D., Harris, R., Scott, P., & Nease, R. F., Jr. Annals of Internal Medicine, 127(12), 1051–1061.
(1995). Screening surgeons for HIV infection: A Sanders, G. D., Hagerty, C. G., Sonnenberg, F. A., Hlatky,
cost-effectiveness analysis. Annals of Internal M. A., & Owens, D. K. (1999). Distributed dynamic
3 Medicine, 122(9), 641–652. decision support using a web-based interface for pre-
Owens, D. K., Holodniy, M., Garber, A. M., Scott, J., vention of sudden cardiac death. Medical Decision
Sonnad, S., Moses, L., Kinosian, B., & Schwartz, J. S. Making, 19(2), 157–166.
(1996a). The polymerase chain reaction for the diag- Sanders, G. D., Hlatky, M. A., & Owens, D. K. (2005). Cost
nosis of HIV infection in adults: A meta-analysis with effectiveness of the implantable cardioverter defibrilla-
recommendations for clinical practice and study tor (ICD) in primary prevention of sudden death. The
design. Annals of Internal Medicine, 124(9), 803–815. New England Journal of Medicine, 353, 1471–1478.
Owens, D. K., Holodniy, M., McDonald, T. W., Scott, J., Siebert, U., Alagoz, O., Bayoumi, A. M., Jahn, B., Owens,
& Sonnad, S. (1996b). A meta-analytic evaluation D. K., Cohen, D., et al. (2012). State-transition model-
of the polymerase chain reaction (PCR) for diagno- ing: A report of the ISPOR-SMDM modeling good
sis of human immunodeficiency virus (HIV) infec- research practices task force-3. Medical Decision
tion in infants. Journal of the American Medical Making, 32, 690–700.
Association, 275(17), 1342–1348. Smith, L. (1985). Medicine as an art. In J. Wyngaarden & L.
Owens, D. K., Shachter, R. D., & Nease, R. F., Jr. Smith (Eds.), Cecil textbook of medicine. Philadelphia:
(1997a). Representation and analysis of medical W. B. Saunders.
decision problems with influence diagrams. Medical Sonnenberg, F. A., & Beck, J. R. (1993). Markov models
Decision Making, 17(3), 241–262. in medical decision making: A practical guide.
Owens, D. K., Whitlock, E. P., Henderson, J., Pignone, Medical Decision Making, 13(4), 322–338.
M. P., Krist, A. H., Bibbins-Domingo, K., Curry, S. Sox, H. C. (1987). Probability theory in the use of diagnos-
J., Davidson, K. W., Ebell, M., Gilman, M. W., tic tests: Application to critical study of the literature.
Grossman, D. C., Kemper, A. R., Kurth, A. E., In H. C. Sox (Ed.), Common diagnostic tests: Use and
Maciosek, M., Siu, A. L., LeFevre, M. L., & for the interpretation (pp. 1–17). Philadelphia: American
U.S. Preventive Services Task Force. (2016). Use of College of Physicians.
decision models in the development of evidence- Sox, H. C., Blatt, M. A., Higgins, M. C., & Marton, K. I.
based clinical preventive services recommenda- (1988). Medical decision making. Boston: Butterworth
tions: Methods of the U.S. Preventive Services Task Publisher.
Force. Annals of Internal Medicine, 165, 501–508. Stein, P. D., Fowler, S. E., Goodman, L. R., Gottschalk,
Palda, V. A., & Detsky, A. S. (1997). Perioperative A., Hales, C. A., et al. (2006). Multidetector computed
assessment and management of risk from coronary tomography for acute pulmonary embolism. The New
artery disease. Annals of Internal Medicine, 127(4), England Journal of Medicine, 354, 2317–2327.
313–328. Sumner, W., Nease, R. F., Jr., & Littenberg, B. (1991).
Pauker, S. G., & Kassirer, J. P. (1980). The threshold U-titer: A utility assessment tool. In Proceedings of
approach to clinical decision making. The New the 15th annual symposium on computer applications
England Journal of Medicine, 34(5 Pt 2), 1189–1208. in medical care (pp. 701–705). Washington, DC.
Peabody, G. (1922). The physician and the laboratory. Swets, J. A. (1973). The relative operating characteristic
Boston Medical Surgery Journal, 187, 324. in psychology. Science, 182, 990.
Peterson, W., & Birdsall, T. (1953). The theory of signal Torrance, G. W., & Feeny, D. (1989). Utilities and quality-
detectability. (Technical Report No. 13.): Electronic adjusted life years. International Journal of Technology
Defense Group, University of Michigan, Ann Assessment in Health Care, 5(4), 559–575.
Arbor. Tversky, A., & Kahneman, D. (1974a). Judgment under
Ransohoff, D. F., & Feinstein, A. R. (1978). Problems of uncertainty: Heuristics and biases. Science, 185,
spectrum and bias in evaluating the efficacy of diag- 1124–1131.
nostic tests. The New England Journal of Medicine,
299(17), 926–930.
121 4
Cognitive Informatics
Vimla L. Patel and David R. Kaufman
Contents
References – 148
.. Table 4.1 Correspondences between cognitive science, medical cognition and applied cognitive research
in medical informatics
Knowledge organization and Organization of clinical and Development and use of medical
human memory basic science knowledge knowledge bases
Problem solving, Heuristics/ Medical problem solving and Medical artificial intelligence/decision
4 reasoning strategies decision making support systems/medical errors
Perception/attention Radiologic and dermatologic Medical imaging systems
diagnosis
Text comprehension Understanding medical texts Information retrieval/digital libraries/
Knowledge representation health literacy
Conversational analysis Medical discourse Medical natural language processing
Distributed cognition Collaborative practice and Computer-based provider order entry
research in health care systems
Coordination of theory and Diagnostic and therapeutic Evidence-based clinical guidelines
evidence reasoning
Diagrammatic reasoning Perceptual processing of patient Biomedical information visualization
data displays
in a number of ways including: (1) seed basic coordination and communication patterns of
research findings that can illuminate dimen- distributed clinical teams, to developing sus-
sions of design (e.g., attention and memory, tainable and cognitively plausible interven-
aspects of the visual system), (2) provide an tions for supporting clinician activities.
explanatory vocabulary for characterizing how The social sciences are constituted by mul-
individuals process and communicate health tiple frameworks and approaches. Behaviorism
information (e.g., various studies of medical constitutes a framework for analyzing and
cognition pertaining to doctor-patient inter- modifying behavior. It is an approach that
action), (3) present an analytic framework for has had an enormous influence on the social
identifying problems and modeling certain sciences. Cognitive science partially emerged
kinds of user interactions, (4) characterize the as a response to the limitations of behavior-
relationship between health information tech- ism. The next section of the chapter contains
nology, human factors and patient safety, (5) a brief history of the cognitive and behavioral
provide rich descriptive accounts of clinicians sciences that emphasizes the points of dif-
employing technologies in the context of ference between the two approaches. It also
work, and (6) furnish a generative approach serves to introduce basic concepts in the study
for novel designs and productive applied of cognition.
research programs in informatics (e.g., inter-
vention strategies for supporting low literacy
populations in health information seeking).
Based on a review of articles published 4.2 Cognitive Science:
in the Journal of Biomedical Informatics The Emergence of an
between January 2001 and March 2014, Patel Explanatory Framework
and Kannampallil (2015) identified 57 articles
that focused on topics related to cognitive In this section, we sketch a brief history of the
informatics. The topics ranged from char- emergence of cognitive science in view to dif-
acterizing the limits of clinician problem- ferentiate it with competing theoretical frame-
solving and reasoning behavior, to describing works in the social sciences. The section also
Cognitive Informatics
125 4
serves to introduce core concepts that consti- the advent of the digital computer, aroused
tute an explanatory framework for cognitive substantial interest in “information process-
science. ing” (Gardner 1985).
Behaviorism is the conceptual framework Cognitive scientists placed “thought”
underlying a particular science of behavior and “mental processes” at the center of their
(Zuriff 1985). It is not to be confused with explanatory framework. The “computer met-
the term behavioral science which names a aphor” provided a framework for the study
large body of work across disciplines, but not of human cognition as the manipulation of
a specific theoretical framework. Behaviorism “symbolic structures.” It also provided the
dominated experimental and applied psychol- foundation for a model of memory, which
ogy as well as the social sciences for the better was a prerequisite for an information process-
part of the twentieth century (Bechtel et al. ing theory (Atkinson and Shiffrin 1968). The
1998). Behaviorism represented an attempt implementation of models of human perfor-
to develop an objective, empirically-based mance as computer programs provided a mea-
science of behavior and more specifically, sure of objectivity and a sufficiency test of a
learning. Empiricism is the view that experi- theory and also served to increase the objec-
ence is the only source of knowledge (Hilgard tivity of the study of mental processes (Estes
and Bower 1975). Behaviorism endeavored to 1975).
build a comprehensive framework of scien- Arguably, the landmark publication in the
tific inquiry around the experimental analysis nascent field of cognitive science is Newell and
of observable behavior. Behaviorists eschewed Simon’s “Human Problem Solving” (Newell
the study of thinking as an unacceptable psy- and Simon 1972). This was the culmination
chological method because it was inherently of over 15 years of work on problem solv-
subjective, error-prone, and could not be sub- ing and research in artificial intelligence. It
jected to empirical validation. Similarly, hypo- was a mature thesis that described a theoreti-
thetical constructs (e.g., mental processes as cal framework, extended a language for the
mechanisms in a theory) were discouraged. study of cognition, and introduced protocol-
All constructs had to be specified in terms analytic methods that have become ubiquitous
of operational definitions, so they could be in the study of high-level cognition. It laid the
manipulated, measured, and quantified for foundation for the formal investigation of
empirical investigation (Weinger and Slagle symbolic-information processing (more spe-
2001). cifically, problem solving). The development
For reasons that go beyond the scope of of models of human information processing
this chapter, classical behavioral theories have also provided a foundation for the discipline
been largely discredited as a comprehensive of human-computer interaction and the first
unifying theory of behavior. However, behav- formal methods of analysis (Card et al. 1983).
iorism continues to provide a theoretical and The early investigations of problem solv-
methodological foundation in a wide range ing focused primarily on investigations of
of social science disciplines. For example, experimentally contrived or toy-world tasks
behaviorist tenets continue to play a central such as elementary deductive logic, the Tower
role in public health research. In particular, of Hanoi, illustrated in . Fig. 4.1, and math-
health behavior research emphasizes anteced- ematical word problems (Greeno and Simon
ent variables and environmental contingencies 1988). These tasks required very little back-
that serve to sustain unhealthy behaviors such ground knowledge and were well structured,
as smoking (Sussman 2001). Around 1950, in the sense that all the variables necessary
there was increasing dissatisfaction with the for solving the problem were present in the
limitations and methodological constraints problem statement. These tasks allowed for a
(e.g., the disavowal of the unobserved such complete description of the task environment,
as mental states) of behaviorism. In addition, a step-by-step description of the sequential
developments in logic, information theory, behavior of the subjects’ performance, and
cybernetics, and perhaps most importantly, the modeling of subjects’ cognitive and overt
126 V. L. Patel and D. R. Kaufman
A B C A B C
4 .. Fig. 4.1 Tower of Hanoi task illustrating a start state and a goal state
behavior in the form of a computer simula- egy. Although TOH bears little resemblance
tion. The Tower of Hanoi, in particular, served to the tasks performed by either clinicians or
as an important test bed for the development patients, the example illustrates the process
of an explanatory vocabulary and framework of analyzing task demands and task perfor-
for analyzing problem-solving behavior. mance in human subjects. The TOH helped
The Tower of Hanoi (TOH) is a relatively lay the groundwork for cognitive task analyses
straight-forward task that consists of three that are performed today.
pegs (A, B, and C) and three or more disks Protocol analysis1 is among the most
that vary in size. The goal is to move the three commonly used methods (Newell and Simon
disks from peg A to peg C one at a time with 1972). Protocol analysis refers to a class of
the constraint that a larger disk can never rest techniques for representing verbal think-aloud
on a smaller one. Problem solving can be con- protocols (Greeno and Simon 1988). Think-
strued as search in a problem space. A prob- aloud protocols are the most common source
lem space has an initial state, a goal state, and of data used in studies of problem solving.
a set of operators. Operators are any moves In these studies, subjects are instructed to
that transform a given state to a successor verbalize their thoughts as they perform an
state. For example, the first move could be to experimental task. Ericsson and Simon (1993)
move the small disk to peg B or peg C. In a specify the conditions under which verbal
three-disk TOH, there are a total of 27 pos- reports are acceptable as legitimate data. For
sible states representing the complete problem example, retrospective think-aloud protocols
space. TOH has 3n states where n is the num- are viewed as somewhat suspect because the
ber of disks. The minimum number of moves subject has had the opportunity to recon-
necessary to solve a TOH is 2n−1. Problem struct the information in memory, and the
solvers will typically maintain only a small set verbal reports are inevitably distorted. Think-
of states at a time. aloud protocols recorded in concert with
The search process involves finding a solu- observable behavioral data such as a subject’s
tion strategy that will minimize the number actions provide a rich source of evidence to
of steps. The metaphor of movement through characterize cognitive processes.
a problem space provides a means for under- Cognitive psychologists and linguists have
standing how an individual can sequentially investigated the processes and properties of
address the challenges they confront at each language and memory in adults and children
stage of a problem and the actions that ensue. for many decades. Early research focused on
We can characterize the problem-solving basic laboratory studies of list learning or
behavior of the subject at a local level in terms processing of words and sentences (as in a
of state transitions or at a more global level sentence completion task) (Anderson 1985).
in terms of strategies. For example, means-
ends analysis is a commonly used strategy
for reducing the difference between the start 1 The term protocol refers to that which is produced
state and goal state. For instance, moving all by a subject during testing (e.g., a verbal record). It
but the largest disk from peg A to peg B is differs from the more common use of protocol as
defining a code or set of procedures governing
an interim goal associated with such a strat-
behavior or a situation.
Cognitive Informatics
127 4
van Dijk and Kintsch (1983) developed an of expert systems (Clancey and Shortliffe
influential method of analyzing the process of 1984). The shift to real-world problems in
text comprehension based on the realization cognitive science was spearheaded by research
that text can be described at multiple levels exploring the nature of expertise. Most of
from surface codes (e.g., words and syntax) to the early investigations on expertise involved
a deeper level of semantics. Comprehension laboratory experiments. However, the shift
refers to cognitive processes associated with to knowledge-intensive domains provided a
understanding or deriving meaning from theoretical and methodological foundation
text, conversation, or other informational to conduct both basic and applied research
resources. It involves the processes that people in real-world settings such as the workplace
use when trying to make sense of a piece of (Vicente 1999) and the classroom (Bruer
text, such as a sentence, a book, or a verbal 1993). These areas of application provided a
utterance. It also involves the final product of fertile test bed for assessing and extending the
such processes, which is, the mental represen- cognitive science framework.
tation of the text, essentially what people have In recent years, the conventional
understood. information- processing approach has come
Comprehension may often precede prob- under criticism for its narrow focus on the
lem solving and decision making but is also rational/cognitive processes of the solitary
dependent on perceptual processes that focus individual. One of the most compelling pro-
attention, the availability of relevant knowl- posals has to do with a shift from viewing cog-
edge, and the ability to deploy knowledge in nition as a property of the solitary individual
a given context. Some of the more impor- to viewing cognition as distributed across
tant differences in medical problem solving groups, cultures, and artifacts. This claim has
and decision making arise from differences in significant implications for the study of col-
knowledge and comprehension. Furthermore, laborative endeavors and human-computer
many of the problems associated with deci- interaction. We explore the concepts underly-
sion making are the result of either a lack of ing distributed cognition in greater detail in a
knowledge or failure to understand the infor- subsequent section.
mation appropriately.
The early investigations provided a well-
constrained artificial environment for the
development of the basic methods and prin- 4.3 Human Information
ciples of problem solving. They also provide Processing
a rich explanatory vocabulary (e.g., prob-
lem space), but were not fully adequate in It is well known that product design often fails
accounting for cognition in knowledge-rich to consider cognitive and physiological con-
domains of greater complexity and involv- straints adequately and imposes an unneces-
ing uncertainty. In the mid to late 1970s, there sary burden on task performance (Sharp et al.
was a shift in research to complex “real-life” 2019). Fortunately, advances in theory and
knowledge-based domains of inquiry (Greeno methods provide us with greater insight into
and Simon 1988). Problem-solving research designing systems for the human condition.
was studying performance in domains such as Cognitive science serves as a basic science
physics (Larkin et al. 1980), medical diagno- and provides a framework for the analysis and
ses (Elstein et al. 1978) and architecture (Akin modeling of complex human performance. A
1982). Similarly, the study of text comprehen- computational theory of mind provides the
sion shifted from research on simple stories fundamental underpinning for most contem-
to technical and scientific texts in a range porary theories of cognitive science. The basic
of domains, including medicine. This paral- premise is that much of human cognition can
leled a similar change in artificial intelligence be characterized as a series of operations or
research from “toy programs” to addressing computations on mental representations.
“real-world” problems and the development Mental representations are internal cognitive
128 V. L. Patel and D. R. Kaufman
states that have a certain correspondence with of the nature of medical expertise and skilled
the external world. For example, they may clinical performance.
reflect a clinician’s hypothesis about a patient’s
condition after noticing an abnormal gait as
he entered the clinic. These are likely to elicit 4.3.1 Cognitive Architectures
further inferences about the patient’s underly- and Human Memory Systems
ing condition and may direct the physician’s
information-gathering strategies and contrib- Fundamental research in perception, cogni-
4 ute to an evolving problem representation. tion, and psychomotor skills over the last
Two interdependent dimensions by which 50 years has provided a foundation for design
we can characterize cognitive systems are (1) principles in human factors and human-
architectural theories that endeavor to pro- computer interaction. Although cognitive
vide a unified theory for all aspects of cogni- guidelines have made significant inroads in
tion and (2) the different kinds of knowledge the design community, there remains a signifi-
necessary to attain competency in a given cant gap in applying basic cognitive research
domain. Individuals differ substantially (Gillan and Schvaneveldt 1999). Designers
in terms of their knowledge, experiences, routinely violate basic assumptions about the
and endowed capabilities. The architectural human cognitive system. There are invari-
approach capitalizes on the fact that we can ably challenges in applying basic research and
characterize certain regularities of the human theory to applications. More human-centered
information-processing system. These can design and cognitive research can instrumen-
be either structural regularities—such as the tally contribute to such an endeavor (Zhang
existence of and the relations between percep- et al. 2004).
tual, attentional, and memory systems and Over the last 50 years, there have been
memory capacity limitations—or processing several attempts to develop a unified theory
regularities, such as processing speed, selec- of cognition. The goal of such a theory is
tive attention, or problem-solving strategies. to provide a single set of mechanisms for all
Cognitive systems are characterized function- cognitive behaviors from motor skills, lan-
ally in terms of the capabilities they enable guage, memory, to decision making, prob-
(e.g., focused attention on selective visual lem solving, and comprehension (Newell
features), the way they constrain human 1990). Such a theory provides a means to put
cognitive performance (e.g., limitations on together a voluminous and seemingly dispa-
memory), and their development during the rate body of human experimental data into a
lifespan. In regard to the lifespan issue, there coherent form. Cognitive architecture repre-
is a growing body of literature on cognitive sents unifying theories of cognition that are
aging and how aspects of the cognitive system embodied in large-scale computer simulation
such as attention, memory, vision and motor programs. Although there is much plasticity
skills change as a function of aging (Fisk evidenced in human behavior, cognitive pro-
et al. 2009). This basic science research is of cesses are bound by biological and physical
growing importance to informatics as we seek constraints. Cognitive architectures specify
to develop e-health applications for seniors, functional rather than biological constraints
many of whom suffer from chronic health on human behavior (e.g., limitations on
conditions such as arthritis and diabetes. A working memory). These constraints reflect
graphical user interface or more generally, a the information- processing capacities and
website designed for younger adults may not limitations of the human cognitive system.
be suitable for older adults. Architectural systems embody a relatively
Differences in knowledge organization are fixed permanent structure that is (more or
a central focus of research into the nature of less) characteristic of all humans and doesn’t
expertise. In medicine, the expert-novice para- substantially vary over an individual’s life-
digm has contributed to our understanding time. It represents a scientific hypothesis about
Cognitive Informatics
129 4
those aspects of human cognition that are is infinite, whereas WM is limited to five to
relatively constant over time and independent ten “chunks” of information. A chunk is
of the task (Carroll 2003). Cognitive architec- any stimulus or patterns of stimuli that have
tures also play a role in providing blueprints become familiar from repeated exposure and
for building future intelligent systems that is subsequently stored in memory as a single
embody a broad range of capabilities like unit (Larkin et al. 1980). Problems impose a
those of humans (Duch et al. 2008). There variable cognitive load on working memory.
are several large-scale cognitive architecture This refers to an excess of information that
theories that embody computational models competes for few cognitive resources, creat-
of cognition and have informed a substan- ing a burden on working memory (Chandler
tial body of research in cognitive science and and Sweller 1991). For example, maintaining
allied disciplines. ACT-R (short for “Adaptive a seven-digit phone number in WM is not very
Control of Thought- Rational”) is perhaps, difficult. However, to maintain a phone num-
the most widely known cognitive architec- ber while engaging in conversation is nearly
ture. It was developed by John R. Anderson impossible for most people. Multi-tasking
and is sustained by a large global community is one factor that contributes to cognitive
of researchers centered at Carnegie Mellon load. The structure of the task environment,
University (Anderson 2013). It is a theory for for example, a crowded computer display
simulating and understanding human cogni- is another contributor. High velocity/high
tion. It started more than 40 years ago as an workload clinical environments such as inten-
architecture that could simulate basic tasks sive care units also impose cognitive loads on
related to memory, language and problem clinicians carrying out the task.
solving. It has continued to evolve into a sys-
tem that can perform an enormous range of
human tasks (Ritter et al. 2019). 4.3.2 The Organization
Cognitive architectures include short- of Knowledge
term and long-term memories that store con-
tent about an individual’s beliefs, goals, and Architectural theories specify the structure
knowledge, the representation of elements and mechanisms of memory systems, whereas
that are contained in these memories as well theories of knowledge organization focus on
as their organization into larger-scale struc- the content. There are several ways to char-
tures (Lieto et al. 2018). An extended discus- acterize the kinds of knowledge that reside in
sion of architectural theories and systems is LTM and that support decisions and actions.
beyond the scope of this chapter. However, Cognitive psychology has furnished a range of
we employ the architectural frame of refer- domain-independent constructs that account
ence to introduce some basic distinctions in for the variability of mental representations
memory systems. Human memory is typically needed to engage the external world.
divided into at least two structures: long-term A central tenet of cognitive science is that
memory and short-term/working memory. humans actively construct and interpret infor-
Working memory is an emergent property mation from their environment. Given that
of interaction with the environment. Long- environmental stimuli can take a multitude
term memory (LTM) can be thought of as a of forms (e.g., written text, speech, music,
repository of all knowledge, whereas working images, etc.), the cognitive system needs to
memory (WM) refers to the resources needed be attuned to different representational types
to maintain information active during cogni- to capture the essence of these inputs. For
tive activity (e.g., text comprehension). The example, we process written text differently
information maintained in working memory than we do mathematical equations. The
includes stimuli from the environment (e.g., power of cognition is reflected in the ability to
words on a display) and knowledge activated form abstractions - to represent perceptions,
from long-term memory. In theory, LTM experiences, and thoughts in some medium
130 V. L. Patel and D. R. Kaufman
1. 43-year-old white female who developed diarrhea after a brief period of 2 days
of GI upset
characterized clinicians’ mental models of the blood flow and on various clinical measures
cardiovascular system (specifically, cardiac such as left ventricular ejection fraction.
output). The study characterized the develop- Thus far, we have only considered domain-
ment of an understanding of the system as a general ways of characterizing the organiza-
function of expertise. The research also docu- tion of knowledge. In view to understanding
mented various conceptual flaws in subjects’ the nature of medical cognition, it is necessary
models and how these flaws impacted subjects’ to characterize the domain-specific nature of
predictions and explanations of physiological knowledge organization in medicine. Given
4 manifestations. . Figure 4.3 illustrates the the vastness and complexity of the domain of
four chambers of the heart and blood flow in medicine, this can be a rather daunting task.
the pulmonary and cardiovascular systems. There is no single way to represent all biomed-
The claim is that clinicians and medical stu- ical (or even clinical) knowledge, but it is an
dents have variably robust representations of issue of considerable importance for research
the structure and function of the system. This in biomedical informatics. Much research has
model enables prediction and explanation of been conducted in biomedical artificial intel-
the effects of perturbations in the system on ligence to develop biomedical ontologies for
use in knowledge-based systems (Ramoni
et al. 1992). Patel et al. (1997) address this
issue in the context of using empirical evi-
Lungs
dence from psychological experiments on
medical expertise to test the validity of the
Pulmonary Pulmonary Pulmonary
AI systems. Developers of biomedical taxono-
Arteries Circulation Veins mies, nomenclatures, and vocabulary systems
such as UMLS or SNOMED are engaged in a
Heart
similar pursuit (see 7 Chap. 7).
RV LA We have employed an epistemological
PV
framework developed by Evans and Gadd
TV MV (1989). They proposed a framework that
AV
RA LV serves to characterize the knowledge used for
Collecting Distributing
medical understanding and problem solving,
System System and for differentiating the levels at which bio-
Vena Cavae Aorta and medical knowledge may be organized. This
Systemic framework represents a formalization of bio-
and Systemic Systemic
Circulation
Veins Circulation medical knowledge as realized in textbooks
and journals and can be used to provide us
Exchange System with insight into the organization of clinical
Systemic practitioners’ knowledge (see . Fig. 4.4).
Capillaries The framework consists of a hierarchical
structure of concepts formed by clinical obser-
Chambers Valves vations at the lowest level, followed by findings,
facets, and diagnoses. Clinical observations
RV - Right Ventricle PA - Pulmonic Valve
RA - Right Atrium TV - Tricuspid Valve are units of information that are recognized
LV - Left Ventricle MV - Mitral Valve as potentially relevant in the problem-solving
LA - Left Atrium AV - Aortic Valve context. However, they do not constitute clini-
cally useful facts. Findings are composed of
.. Fig. 4.3 Schematic model of circulatory and cardio- observations that have potential clinical sig-
vascular physiology. The diagram illustrates various nificance. Establishing a finding reflects a
structures of the pulmonary and systemic circulation
decision made by a physician that an array
system and the process of blood flow. The illustration is
used to exemplify the concept of mental model and how of data contains a significant cue or cues that
it could be applied to explaining and predicting physio- need to be considered. Facets consist of clus-
logic behavior ters of findings that indicate an underlying
Cognitive Informatics
133 4
Diagnosis level D1 D2 D3
Finding level f1 f2 f3 f4 f5 f6
Observation level 01 02 03 04 05 06 07 08 09 10 11 12
.. Fig. 4.4 Epistemological frameworks representing the structure of medical knowledge for problem solving
medical problem or class of problems. They A goal of this approach has been to char-
reflect general pathological descriptions such acterize expert performance in terms of the
as left-ventricular failure or thyroid condi- knowledge and cognitive processes used in
tion. Facets resemble the kinds of constructs comprehension, problem solving, and decision
used by researchers in medical artificial intel- making, using carefully developed laboratory
ligence to describe the partitioning of a prob- tasks (Chi and Glaser 1981), where deGroot’s
lem space. They are interim hypotheses that (1965) pioneering research in chess represents
serve to divide the information in the problem one of the earliest characterizations of expert-
into sets of manageable sub-problems and to novice differences. In one of his experiments,
suggest possible solutions. Facets also vary in subjects were allowed to view a chess board for
terms of their levels of abstraction. Diagnosis 5–10 seconds and were then required to repro-
is the level of classification that subsumes and duce the position of the chess pieces from
explains all levels beneath it. Finally, the sys- memory. The grandmaster chess players were
tems level consists of information that serves able to reconstruct the mid-game positions
to contextualize a problem, such as the ethnic with better than 90% accuracy, while novice
background of a patient. chess players could only reproduce approxi-
mately 20% of the correct positions. When
the chess pieces were placed on the board in a
4.4 Medical Cognition random configuration, not encountered in the
course of a normal chess match, expert chess
The study of expertise is one of the princi- masters’ recognition ability fell to that of
pal paradigms in problem-solving research, novices. This result suggests that superior rec-
which has been documented in a number of ognition ability is not a function of superior
volumes in literature (Sternberg and Ericsson memory, but is a result of an enhanced abil-
1996; Ericsson 2009; Ericsson et al. 2018). ity to recognize typical situations (Chase and
Comparing experts to novices provides us Simon 1973). This phenomenon is accounted
with the opportunity to explore the aspects of for by a process known as “chunking.” Patel
performance that undergo change and result and Groen (1991b) showed a similar phenom-
in increased problem-solving skill (Glaser enon in medicine The expert physicians were
2000). It also permits investigators to develop able to reconstruct patient summaries in an
domain-specific models of competence that accurate manner when patient information
can be used for assessment and training was collected out of order (e.g., history, physi-
purposes. cal exam, lab results), as long as the pattern of
information, even out of sequence was famil-
134 V. L. Patel and D. R. Kaufman
iar. When the sentences were placed out of to solving it, while novices tend to spend more
order in a way that the pattern was unfamiliar, time working on the solution itself and little
the expert physicians’ recognition ability was time in problem assessment; (6) individual
no better than the novices. experts may differ substantially in terms of
It is well known that knowledge-based dif- exhibiting these kinds of performance char-
ferences impact the problem representation acteristics (e.g., superior memory for domain
and determine the strategies a subject uses materials).
to solve a problem. Simon and Simon (1978) Usually, someone is designated as an
4 compared a novice subject with an expert sub- expert based on a certain level of performance,
ject in solving textbook physics problems. The as exemplified by Elo ratings in chess; by vir-
results indicated that the expert solved the tue of being certified by a professional licens-
problems in one-quarter of the time required ing body, as in medicine, law, or engineering;
by the novice with fewer errors. The nov- on the basis of academic criteria, such as
ice solved most of the problems by working graduate degrees; or simply based on years of
backward from the unknown problem solu- experience or peer evaluation (Hoffman et al.
tion to the givens of the problem statement. 1995). The concept of an expert, however,
The expert worked forward from the givens refers to an individual who surpasses com-
to solve the necessary equations and deter- petency in a domain (Sternberg and Horvath
mine the quantities they are asked to solve for. 1999). Although competent performers, for
Differences in the directionality of reasoning instance, may be able to encode relevant infor-
by levels of expertise has been demonstrated mation and generate effective plans of action
in diverse domains from computer program- in a specific domain, they often lack the speed
ming (Perkins et al. 1990) to medical diagno- and the flexibility that we see in an expert. A
sis (Patel and Groen 1986). domain expert (e.g., a medical practitioner)
The expertise paradigm spans the range possesses an extensive, accessible knowledge
of content domains including physics (Larkin base that is organized for use in practice and
et al. 1980), sports (Allard and Starkes 1991), is tuned to the particular problems at hand. In
music (Sloboda 1991), and medicine (Patel the study of medical expertise, it has been use-
et al. 1994). Edited volumes (Ericsson 2006; ful to distinguish different types of expertise.
Chi et al. 1988 Ericsson et al. 2018; Ericsson Patel and Groen (1991a, b) distinguished
and Smith 1991; Hoffman 1992) provide an between general and specific expertise, a dis-
informative general overview of the area. This tinction supported by research indicating
research has focused on differences between differences between subexperts (i.e., expert
subjects varying in levels of expertise in terms physicians who solve a case outside their field
of memory, reasoning strategies, and in par- of specialization) and experts (i.e., domain
ticular the role of domain-specific knowledge. specialist) with respect to reasoning strate-
Among the expert’s characteristics uncovered gies and organization of knowledge. General
by this research are the following: (1) experts expertise corresponds to expertise that cuts
are capable of perceiving large patterns of across medical subdisciplines (e.g., general
meaningful information in their domain, medicine). Specific expertise results from
which novices cannot perceive; (2) they are having extensive experience within a medical
fast at processing and at deployment of dif- subdomain, such as cardiology or endocrinol-
ferent skills required for problem solving; (3) ogy. An individual may possess both or only
they have superior short-term and long-term generic expertise.
memories for materials (e.g., clinical findings The development of expertise can follow
in medicine) within their domain of expertise, a somewhat unusual trajectory. It is often
but not outside of it; (4) they typically rep- assumed that the path from novice to expert
resent problems in their domain at deeper, goes through a steady process of gradual
more principled levels whereas novices show accumulation of knowledge and fine-tuning
a superficial level of representation; (5) they of skills. That is, as a person becomes more
spend more time assessing the problem prior familiar with a domain, his or her level of per-
Cognitive Informatics
135 4
formance (e.g., accuracy, quality) gradually tion, recall and explanation of laboratory data,
increases. However, research has shown that generation of diagnostic hypotheses, and prob-
this assumption is often incorrect (Lesgold lem solving (Patel and Groen 1991a, b). The
et al. 1988; Patel et al. 1994). Cross-sectional performance indicators used have included
studies of experts, intermediates, and novices recall and inference of medical-text infor-
have shown that people at intermediate levels mation, recall, and inference of diagnostic
of expertise may perform more poorly than hypotheses, generation of clinical findings
those at a lower level of expertise on some from a patient in doctor-patient interaction,
tasks. and requests for laboratory data, among oth-
Furthermore, there is a longstanding body ers. The research has also identified devel-
of research on learning that has suggested that opmental levels at which the intermediate
the learning process involves phases of error- phenomenon occurs, including senior medi-
filled performance followed by periods of sta- cal students and residents. It is important to
ble, comparatively error-free performance. In note, however, that in some tasks, the develop-
other words, human learning does not consist ment is monotonic. For instance, in diagnostic
of the gradually increasing accumulation of accuracy, there is a gradual increase, with an
knowledge and fine-tuning of skills. Rather, intermediate exhibiting a greater degree of
it requires the arduous process of continu- accuracy than the novice and the expert dem-
ally learning, re-learning, and exercising new onstrating a still greater degree than the inter-
knowledge, punctuated by periods of an appar- mediate. Furthermore, when the relevancy
ent decrease in mastery and declines in perfor- of the stimuli to a problem is considered, an
mance, which may be necessary for learning to appreciable monotonic phenomenon appears.
take place. . Figure 4.5 presents an illustration For instance, in recall studies, novices, inter-
of this learning and development phenomenon mediates, and experts are assessed in terms
known as the intermediate effect. of the total number of propositions recalled
The intermediate effect has been found showing the typical non-monotonic effect.
in a variety of tasks and with a great num- However, when propositions are divided in
ber of performance indicators. The tasks used terms of their relevance to the problem (e.g.,
include comprehension and explanation of a clinical case), experts recall more relevant
clinical problems, doctor-patient communica- propositions than intermediates and novices,
suggesting that intermediates have difficulty
separating what is relevant from what is not.
expected During the periods when the intermediate
Performance Level
45 years-old
male
4-hr history
Myocardial of chest pain
infarction
central, crushing
chest pain
4 faintness
Other
diagnosis
sweating
mild cough
.. Fig. 4.6 Problem interpretations by a novice medi- circles (filled). Forward driven or data driven inference
cal student. The given information from patient prob- arrows are shown from left to right (solid dark line).
lem is represented on the right side of the figure and the Backward or hypothesis driven inference arrows are
new generated information is given on the left side, shown from right to left (solid light line). Thick solid
information in the box represents diagnostic hypothesis. dark line represents rule out strategy
Intermediate hypotheses are represented as solid dark
45 years-old
male
4-hr history
Myocardial of chest pain
infarction
central, crushing
chest pain
faintness
Other
diagnoses
sweating
mild cough
solution (. Fig. 4.8). Thus, the intermediate The intermediate effect is not a one-time
effect can be explained as a function of the phenomenon. Rather, it repeatedly occurs at
learning process, maybe as a necessary phase strategic points in a student or physician’s
of learning. Identifying the factors involved in training and follows periods in which large
the intermediate effect may help in improving bodies of new knowledge or complex skills
performance during learning (e.g., by design- are acquired. These periods are followed by
ing decision-support systems or intelligent intervals in which there is a decrement in
tutoring systems that help the user in focusing performance until a new level of mastery is
on relevant information). achieved.
Cognitive Informatics
137 4
.. Fig. 4.8 Problem
45 year-old
interpretations by a
male
senior medical
student
4-hr history
Myocardial of chest pain
infarction
central, crushing
chest pain
faintness
Other
diagnoses
sweating
mild cough
Aortic
dissection
asymmetric BP
using patient data to lead toward a complete et al. 1980). Due to their extensive knowledge
diagnosis (i.e., reasoning from data to hypoth- base and the high-level inferences they make,
esis). This is in contrast to subjects who mis- experts typically skip steps in their reasoning.
diagnosed or partially diagnosed the patient Although experts typically use data-driven
problem. They tended to use a backward or reasoning during clinical performance, this
hypothesis-driven reasoning strategy. type of reasoning sometimes breaks down, and
Patel and Groen (1991a, b) investigated the the expert must resort to hypothesis-driven
nature and directionality of clinical reasoning reasoning. Although data-driven reasoning is
4 in a range of contexts of varying complex- highly efficient, it is often error-prone in the
ity. The objectives of this research program absence of adequate domain knowledge, since
were both to advance our understanding of there are no built-in checks on the legitimacy
medical expertise and to devise more effec- of the inferences that a person makes. Pure
tive ways of teaching clinical problem solving. data-driven reasoning is only successful in
It has been established that the patterns of constrained situations, where one’s knowledge
data-driven and hypothesis-driven reasoning of a problem can result in a complete chain
are used differentially by novices and experts. of inferences from the initial problem state-
Experts tend to use data-driven reasoning, ment to the problem solution, as illustrated
which depends on the physician possessing a in . Fig. 4.9. In contrast, hypothesis-driven
highly organized knowledge base about the reasoning is slower and may make heavy
patient’s disease (including sets of signs and demands on working memory, because one
symptoms). Because of their lack of substan- must keep track of goals and hypotheses. It is,
tive knowledge or their inability to distinguish therefore, most likely to be used when domain
relevant from irrelevant knowledge, novices knowledge is inadequate, or the problem is
and intermediates use more hypothesis-driven complex. Hypothesis-driven reasoning is usu-
reasoning, often resulting in very complex ally exemplary of a weak method of problem
reasoning patterns. The fact that experts and solving in the sense that is used in the absence
novices reason differently suggests that they of relevant prior knowledge and when there is
might reach different conclusions (e.g., deci- uncertainty about problem solution. In prob-
sions or understandings) when solving medi- lem-solving terms, strong methods engage
cal problems. Similar patterns of reasoning knowledge, whereas weak methods refer to
have been found in other domains (Larkin general strategies. Weak does not necessar-
Examination COND:
of thyroid
RSLT:
.. Fig. 4.9 Diagrammatic representation of data- finding of respiratory failure, which is inconsistent with
driven (top down) and hypothesis-driven (bottom-up) the main diagnosis, is accounted for as a result of a
reasoning. From the presence of vitiligo, a prior history hypometabolic state of the patient, in a backward-
of progressive thyroid disease, and examination of the directed fashion. COND: refers to a conditional rela-
thyroid (clinical findings on the left side of figure), the tion; CAU: indicates a causal relation; and RSLT:
physician reasons forward to conclude the diagnosis of identifies a resultive relation
Myxedema (right of figure). However, the anomalous
Cognitive Informatics
139 4
ily imply ineffectual in this context. However, cognitively taxing forms of reasoning that can
hypothesis-driven reasoning may be more resemble hypothetico-deductive methods. As
conducive to the novice learning experience in problems increase in complexity and uncer-
that it can guide the organization of knowl- tainness, expert clinicians’ resort to hybrid
edge (Patel et al. 1990). forms of reasoning that may include substan-
In the more recent literature, described tial backward-directed reasoning.
in a chapter by Patel and colleagues (2013a), The study of medical cognition has been
two forms of human reasoning that are more summarized in a series of articles (Patel et al.
widely accepted are deductive and inductive 1994, 2018) and edited volumes (e.g., Evans
reasoning. Deductive reasoning is a process of and Patel 1989). In more recent times, medi-
reaching specific conclusions (e.g., a diagno- cal cognition is discussed in the context of
sis) from a hypothesis or a set of hypotheses, informatics and in the new field of inves-
whereas inductive reasoning is the process tigation, cognitive informatics (Patel and
of generating possible conclusions based on Kannampallil 2015; Patel et al. 2014, 2015b,
available data, such as data from a patient. 2017). Furthermore, foundations of cognition
However, when reasoning in real-world clini- also play a significant role in investigations
cal situations, it is too simplistic to think of of HCI, including human factors and patient
reasoning with only these two strategies. A safety. Details of HCI in biomedicine are cov-
third form of reasoning, abductive, which ered in 7 Chap. 5.
combines deductive and inductive reason-
ing, was proposed (Peirce 1955). A physician
developing and testing explanatory hypoth- 4.5 uman Factors Research
H
eses based on a set of heuristics, may be con-
and Patient Safety
sidered abductive reasoning (Magnani 2001).
Thus, an abductive reasoning process where a
set of hypotheses are identified and then each
»» Human error in medicine and the adverse
events which may follow are problems of
of these hypotheses is evaluated on the basis psychology and engineering not of medicine
of its potential consequences (Elstein et al. “(Senders 1993)” (cited in (Woods et al.
1978; Ramoni et al. 1992). This makes abduc- 2008).
tive reasoning a data-driven process that relies
heavily on the domain expertise of the person. Human factors research is a discipline devoted
During the testing phase, hypotheses are to the study of technology systems and how
evaluated by their ability to account for the people work with them or are impacted by
current problem. Deduction helps in building these technologies (Henriksen 2010). Human
up the consequences of each hypothesis, and factors research discovers and applies infor-
this kind of reasoning is customarily regarded mation about human behavior, abilities, limi-
as a common way of evaluating diagnostic tations, and other characteristics to the design
hypotheses (Joseph and Patel 1990; Kassirer of tools, machines, systems, tasks, and jobs,
1989; Patel et al. 1994; Patel, Evans, and and environments for productive, safe, com-
Kaufman 1989). All these types of inferences fortable, and effective human use (Chapanis
play different roles in the hypothesis genera- 1996). In the context of healthcare, human
tion and testing phases (Patel and Ramoni factors are concerned with the full comple-
1997; Peirce 1955). Our inherent ability ment of technologies and systems used by a
to adapt to different kinds of knowledge diverse range of individuals including clini-
domains, situations, and problems requires cians, hospital administrators, health con-
the use of a variety of reasoning modes, and sumers and patients (Flin and Patey 2009).
this process describes the notion of abductive Human factors work approaches the study of
medical reasoning (Patel and Ramoni 1997). health practices from several perspectives or
In contrast, novices and intermediate sub- levels of analysis. A full exposition of human
jects (e.g., medical trainees) are more likely factors in medicine is beyond the scope of
to employ more deliberative, effortful, and this chapter. For a detailed treatment of these
140 V. L. Patel and D. R. Kaufman
issues, the reader is referred to the Handbook settings and challenging situations in aviation,
of Human Factors and Ergonomics in Health industrial process control, military command
Care and Patient Safety (Carayon et al. 2011). control and space operations (Woods et al.
The focus in this chapter is on cognitive work 2008). The research has elucidated empiri-
in human factors and healthcare, particularly cal regularities and provides explanatory
in relation to patient safety. We recognize that concepts and models of human performance.
patient safety is a systemic challenge at mul- This allows us to derive common underlying
tiple levels of aggregation beyond the individ- patterns in somewhat disparate settings.
4 ual. It is clear that understanding, predicting,
and transforming human performance in any
complex setting requires a detailed under- 4.5.1 Patient Safety
standing of both the setting and the factors
that influence performance (Woods et al. Patient safety refers to the prevention of
2008). healthcare errors, and the elimination or miti-
Our objective in this section is to introduce gation of patient injury caused by healthcare
a theoretical foundation, establish important errors (Patel and Zhang 2007). It has been
concepts, and discuss illustrative research an issue of considerable concern for the past
in patient safety. The field of human fac- quarter-century, but the greater community
tors is guided by principles of engineering was galvanized by the National Academy
and applied cognitive psychology (Chapanis of Medicine report “To Err is Human,”
1996). Human factors analysis applies knowl- (Kohn et al. 2000) and by a follow-up report,
edge about the strengths and limitations of “Improving Diagnosis in Health Care”
humans to the design of interactive systems (Balogh et al. 2015). The 2000 report commu-
and their environment. The objective is to nicated the surprising fact that up to 98,000
ensure their effectiveness, safety, and ease of preventable deaths every single year in the
use. Mental models and issues of decision United States are attributable to human error,
making are central to human-factors analysis. which makes it the 8th leading cause of death
Any system will be easier and less burdensome in this country. Although one may argue over
to use to the extent that it is co-extensive with the specific numbers, there is no disputing that
users’ mental models. The different dimen- too many patients are harmed or die every
sions of cognitive capacity, including memory, year as a result of human actions or absence
attention, and workload are central to human- of action.
factor analyses. Our perceptual system inun- We can only analyze errors after they
dates us with more stimuli than our cognitive happened, and they often seem to be glar-
systems can process. Attentional mechanisms ing blunders after the fact. This leads to the
enable us to selectively prioritize and attend assignment of blame or searches for a single
to certain stimuli and attenuate other ones. cause of the error. However, in hindsight, it is
They also have the property of being sharable, exceedingly difficult to recreate the situational
which enables us to multitask by dividing our context, stress, shifting attention demands,
attention between two activities. For example, and competing goals that characterized a
if we are driving on a highway, we can eas- situation prior to the occurrence of an error.
ily have a conversation with a passenger at This sort of retrospective analysis is subject
the same time. However, as the skies get dark to hindsight bias. Hindsight bias masks the
or the weather changes or suddenly you find dilemmas, uncertainties, demands, and other
yourself driving through winding mountain- latent conditions that were operative before
ous roads, you will have to allocate more of the mishap. Too often the term ‘human error’
your attentional resources to driving and less connotes blame and a search for the guilty
to the conversation. culprits, suggesting some sort of human defi-
Human factors research leverages theories ciency or irresponsible behavior. Human fac-
and methods from cognitive engineering to tors researchers recognized that this approach
characterize human performance in complex error is inherently incomplete and poten-
Cognitive Informatics
141 4
tially misleading. They argue for the need a kind of error, but not necessarily leading
for a more comprehensive systems-centered to an adverse event. For example, if there is
approach that recognizes that error could be a system of checks and balances that is part
attributed to a multitude of factors as well as of routine practice or if there is a systematic
the interaction of these factors. Error is the supervisory process in place, the vast majority
failure of a planned sequence of mental or of errors will be trapped and defused in this
physical activities to achieve its intended out- middle zone. If these measures or practices are
come when these failures cannot be attributed not in place, an error can propagate and cross
to chance (Patel and Zhang 2007; Reason the boundary to become an adverse event. At
1990). Reason (1990) introduced an impor- this point, the patient has been harmed. In
tant distinction between latent and active addition, if an individual is subject to a heavy
failures. Active failure represents the face of workload or intense time pressure, then that
error. The effects of active failure are immedi- will increase the potential for an error, result-
ately felt. In healthcare, active errors are com- ing in an adverse event.
mitted by providers such as nurses, physicians, The notion that human error should not
or pharmacists who are actively responding to be tolerated is prevalent in both the public
patient needs at the “sharp end”. The latent and personal perception of the performance
conditions are less visible but equally impor- of most clinicians. However, researchers in
tant. Latent conditions are enduring systemic other safety-critical domains have long since
problems that may not be evident for some abandoned the quest for zero defect, citing it
time, combine with other system problems to as an impractical goal, and choosing to focus
weaken the system’s defenses and make errors instead on the development of strategies to
possible. There is a lengthy list of potential enhance the ability to recover from error
latent conditions including poor interface (Morel et al. 2008). Patel and her colleagues
design of important technologies, communi- conducted empirical investigations into error
cation breakdown between key actors, gaps in detection and recovery by experts (attending
supervision, inadequate training, and absence physicians) and non-experts (resident train-
of a safety culture in the workplace—a cul- ees) in the critical care domain, using both
ture that emphasizes safe practices and the laboratory-based and naturalistic approaches
reporting of any conditions that are poten- (Patel et al. 2011). These studies show that
tially dangerous. expertise is more closely tied to the ability
Zhang, Patel, Johnson, and Shortliffe to detect and recover from errors and not so
(2004) developed a taxonomy of errors par- much to the ability not to make errors. The
tially based on the distinctions proposed study results show that both the experts and
by Reason (1990). They further classified non-experts are prone to commit and recover
errors in terms of slips and mistakes (Reason from errors, but experts’ ability to detect and
1990). A slip occurs when the actor selected recover from knowledge-based errors is better
the appropriate course of action, but it was than that of trainees. Error detection and cor-
executed inappropriately. A mistake involves rection in complex real-time critical care situ-
an inappropriate course of action reflecting ations appears to induce certain urgency for
an erroneous judgment or inference (e.g., a quick action in a high alert condition, result-
wrong diagnosis or misreading of an x-ray). ing in rapid detection and correction. Studies
Mistakes may either be knowledge-based on expertise and understanding of the limits
owing to factors such as incorrect knowl- and failures of human decision-making are
edge, or they may be rule-based, in which important if we are to build robust decision-
case the correct knowledge was available, but support systems to manage the boundaries of
there was a problem in applying the rules or risk of error in decision making (Patel et al.
guidelines. They further characterize medical 2015a; Patel and Cohen 2008). Research on
errors as a progression of events. There is a situational complexity and medical errors
period when everything is operating smoothly. is documented in a recent book by Patel,
Then an unsafe practice unfolds resulting in Kaufman, and Cohen (2014).
142 V. L. Patel and D. R. Kaufman
mance, although a given technology may do thinking skills, such as diagnostic hypoth-
all of these things. The difference is not merely esis generation and evaluation. The effects
one of quantitative change, but one that is of technology refer to enduring changes in
qualitative in nature. general cognitive capacities (knowledge and
In a distributed world, what becomes of skills) as a consequence of interaction with
the individual? We believe it is important to a technology. This effect is illustrated subse-
understand how technologies promote endur- quently in the context of the enduring effects
ing changes in individuals. Salomon, Perkins of an EHR (see 7 Chap. 10).
4 and Globerson (1991) introduced an impor- We employed a pen-based EHR system,
tant distinction in considering the mediating DCI (Dossier of Clinical Information), in
role of technology on individual performance, several of our studies (see Kushniruk et al.
the effects with technology and the effects of 1996). Using the pen or computer keyboard,
technology. The former is concerned with physicians can directly enter information into
the changes in performance displayed by the EHR, such as the patient’s chief com-
users while equipped with the technology. plaint, past history, history of present illness,
For example, when using an effective medical laboratory tests, and differential diagnoses.
information system, physicians should be able Physicians were encouraged to use the sys-
to gather information more systematically tem while collecting data from patients (e.g.,
and efficiently. In this capacity, medical infor- during the interview). The system allows the
mation technologies may alleviate some of the physician to record information about the
cognitive load associated with a given task and patient’s differential diagnosis, the ordering
permit physicians to focus on higher- order of tests, and the prescription of medication.
.. Fig. 4.10 Display of a structured electronic medical record with graphical capabilities
Cognitive Informatics
145 4
The graphical interface provides a highly diabetes clinic. The study considered the fol-
structured set of resources for representing a lowing questions (1) How do physicians man-
clinical problem, as illustrated in . Fig. 4.10. age information flow when using an EHR
We have studied the use of this EHR in system? (2) What are the differences in the way
both laboratory-based research (Kushniruk physicians organize and represent this infor-
et al. 1996) and actual clinical settings using mation using paper-based and EHR systems,
cognitive methods (Patel et al. 2000). The and (3) Are there long-term, enduring effects
laboratory research included a simulated of the use of EHR systems on knowledge rep-
doctor- patient interview. We have observed resentations and clinical reasoning? One study
two distinct patterns of EHR usage in the focused on an in-depth characterization of
interactive condition, one in which the subject changes in knowledge organization in a single
pursues information from the patient predi- subject as a function of using the system. The
cated on a hypothesis; the second strategy study first compared the contents and struc-
involves the use of the EHR display to guide ture of patient records produced by the physi-
asking the patient questions. In the screen- cian using the EHR system and paper-based
driven strategy, the clinician is using the struc- patient records, using ten pairs of records
tured list of findings in the order in which they matched for variables such as patient age and
appear on the display to elicit information problem type. After having used the system
from the patient. All experienced users of this for six months, the physician was asked to
system appear to have both strategies in their conduct his/her next five patient interviews
repertoire. using only hand-written paper records.
In general, a screen-driven strategy can The results indicated that the EHRs con-
enhance performance by reducing the cogni- tained more information relevant to the diag-
tive load imposed by information-gathering nostic hypotheses. In addition, the structure
goals and allow the physician to allocate more and content of information were found to
cognitive resources toward testing hypotheses correspond to the structured representa-
and rendering decisions. On the other hand, tion of the particular medium. For example,
this strategy can encourage a certain sense EHRs were found to contain more informa-
of complacency. We observed both effective tion about the patient’s past medical history,
as well as counter-productive uses of this reflecting the query structure of the interface.
screen-driven strategy. A more experienced The paper-based records appear to better pre-
user consciously used the strategy to structure serve the integrity of the time course of the
the information-gathering process, whereas evolution of the patient problem, whereas, this
a novice user used it less discriminately. In is notably absent from the EHR. Perhaps, the
employing this screen-driven strategy, the nov- most striking finding is that, after having used
ice elicited almost all of the relevant findings the system for six months, the structure and
in a simulated patient encounter. However, content of the physician’s paper-based records
she also elicited numerous irrelevant findings bore a closer resemblance to the organization
and pursued incorrect hypotheses. In this par- of information in the EHR than the paper-
ticular case, the subject became too reliant on based records produced by the physician prior
the technology and had difficulty imposing to exposure to the system. This finding is con-
her own set of working hypotheses to guide sistent with the enduring effects of technology
the information-gathering and diagnostic- even in the absence of the particular system
reasoning processes. (Salomon et al. 1991). The authors conclude
The use of a screen-driven strategy is evi- that given these potentially enduring effects,
dence of how technology transforms clinical the use of a particular EHR will almost cer-
cognition, as manifested in clinicians’ patterns tainly have a direct effect on medical decision
of reasoning. Patel et al. (2000) extended this making.
line of research to study the cognitive conse- The previously discussed research dem-
quences of using the same EHR system in a onstrates how information technologies can
146 V. L. Patel and D. R. Kaufman
mediate cognition and even produce endur- seven physicians. The CW analysis revealed
ing changes in how one performs a task. What that the configuration of resources (e.g., very
dimensions of an interface contribute to such long menus, complexly configured displays)
changes? What aspects of a display are more placed unnecessarily heavy cognitive demands
likely to facilitate efficient task performance, on users, especially those who were new to
and what aspects are more likely to impede the system. The resources model was also
it? Norman (1986) argued that well-designed used to account for patterns of errors pro-
artifacts could reduce the need for users to duced by clinicians. The authors concluded
4 remember large amounts of information, that the redistribution and reconfiguration of
whereas poorly designed artifacts increased resources might yield guiding principles and
the knowledge demands on the user and the design solutions in the development of com-
burden of working memory. In the distributed plex interactive systems.
approach to HCI research, cognition is viewed The distributed cognition framework has
as a process of coordinating distributed inter- proved to be particularly useful in under-
nal and external representations, and this, in standing the performance of teams or groups
effect, constitutes an indivisible information- of individuals in a particular work setting
processing system. (Hutchins 1995). Hazlehurst and colleagues
One of the appealing features of the dis- (Hazlehurst et al. 2003, 2007) have drawn on
tributed cognition paradigm is that it can be this framework to illuminate how work in
used to understand how properties of objects healthcare settings is constituted using shared
on the screen (e.g., links, buttons) can serve resources and representations. The activity
as external representations and reduce cogni- system is the primary explanatory construct.
tive load. The distributed resource model pro- It is comprised of actors and tools, together
posed by Wright, Fields, and Harrison (2000) with shared understandings among actors
addresses the question of “what information that structure interactions in a work setting.
is required to carry out some task and where The “propagation of representational states
should it be located: as an interface object or through activity systems” is used to explain
as something that is mentally represented to cognitive behavior and investigate the organi-
the user.” The relative difference in the distri- zation of the system and human performance.
bution of representations (internal and exter- Following Hazlehurst et al. (2007, p. 540), “a
nal) is central to determining the efficacy of representational state is a particular configu-
a system designed to support a complex task. ration of an information-bearing structure,
Wright, Fields, and Harrison (2000) were such as a monitor display, a verbal utterance,
among the first to develop an explicit model or a printed label, that plays some functional
for coding the kinds of resources available in role in a process within the system.” The
the environment and how they are embodied author has used the concept to explain the
on an interface. process of medication ordering in an intensive
Horsky, Kaufman, and Patel (2003a, b) care unit and the coordinated communica-
applied the distributed resource model and tions of a surgical team in a heart room.
analysis to a provider order entry system. The framework for distributed cognition
The goal was to analyze specific order-entry is still an emerging one in human-computer
tasks such as those involved in admitting interaction. It offers a novel and potentially
a patient to a hospital and then to identify powerful approach for illuminating the kinds
areas of complexity that may impede optimal of difficulties users encounter and find-
recorded entries. The research consisted of ing ways to better structure the interaction
two- component analyses: a cognitive walk- by redistributing the resources. Distributed
through evaluation that was modified based cognition analyses may also provide a win-
on the distributed resource model and a sim- dow into why technologies sometimes fail to
ulated clinical ordering task performed by reduce errors or even contribute to them.
Cognitive Informatics
147 4
4.6 Conclusion medical decision-making. Journal of
Biomedical Informatics, 35, 52–75. This rela-
Theories and methods from cognitive science tively recent article summarizes new directions
can shed light on a range of issues about the in decision-making research. The authors
design and implementation of health informa- articulate a need for alternative paradigms for
tion technologies. They can also serve an instru- the study of medical decision making.
mental role in understanding and enhancing Patel, V. L., Yoskowitz, N. A., Arocha, J. F., &
the performance of clinicians and patients as Shortliffe, E. H. (2009). Cognitive and learning
they engage in a range of cognitive tasks related sciences in biomedical and health instructional
to health. We believe that fundamental studies design: A review with lessons for biomedical
in psychology and cognitive science in general, informatics education. Journal of Biomedical
can provide general guiding principles to study Informatics, 42(1), 176–197. A review of learn-
these issues, and can be combined with field ing and cognition with a particular focus on
studies which serve to illuminate different fac- biomedical informatics.
ets and contextualize the phenomena observed Patel, V. L., Kannampallil, T. G., & Shortliffe,
in laboratory studies. The potential scope of E. H. (2015c). Role of cognition in generating
applied cognitive research in biomedical infor- and mitigating clinical errors. BMJ Quality &
matics is very broad. Significant inroads have Safety, 24, 468–474. https://doi.org/10.1136/
been made in areas such as EHRs and patient bmjqs-2014-003482.
safety. However, there are promising areas of Middleton, B., Bloomrosen, M., Dente, M. A.,
future cognitive research that remain largely Hashmat, B., Koppel, R., Overhage, J. M., et al.
uncharted. These include understanding how (2013). Enhancing patient safety and quality of
to capitalize on health information technology care by improving the usability of electronic
without compromising patient safety (particu- health record systems: Recommendations from
larly in providing adequate decision support), AMIA. Journal of the American Medical
understanding how various visual representa- Informatics Association, 20(e1), e2–e8.
tions/graphical forms mediate reasoning in
biomedical informatics and how these repre- ??Questions for Discussion
sentations can be used by patients and health 1. How can cognitive science theory
consumers with varying degrees of literacy. meaningfully inform and shape
These are only a few of the cognitive challenges design, development, and assessment
related to harnessing the potential of cutting- of health-care information systems?
edge technologies to improve patient safety. 2. Describe two or three kinds of mental
representations and briefly characterize
nnSuggested Readings their significance in understanding
Anderson, J. R. (2015). Cognitive psychology and human performance.
its implications. New York: Worth Publishers. 3. What is the purpose and value of cogni-
Carayon, P., Alyousef, B., & Xie, A. (2012). tive architectures?
Human factors and ergonomics in health care. 4. Identify three ways in which novices
In Handbook of human factors and ergo- differ from experts in medicine.
nomics (pp. 1574–1595). 5. What are the limitations of interpreting
Patel, V. L., Kaufman, D. R., & Kannampallil, retroactive data on medical errors?
T. G. (2013c). Diagnostic reasoning and deci- 6. Explain the difference between latent
sion making in the context of health informa- and active failures and their implica-
tion technology. In D. Marrow (Ed.), Reviews tions for patient safety?
of human factors and ergonomics (Vol. 8). 7. How does the field of Cognitive
Thousand Oaks, CA: SAGE Publications. Informatics capture the interaction of
Patel, V. L., Kaufman, D. R., & Arocha, J. F. cognition and informatics in biomedi-
(2002). Emerging paradigms of cognition in cine and healthcare?
148 V. L. Patel and D. R. Kaufman
8. Explain the role inductive, deductive Bloomrosen, M., Starren, J., Lorenzi, N. M., Ash,
and abductive reasoning play in medi- J. S., Patel, V. L., & Shortliffe, E. H. (2011).
Anticipating and addressing the unintended con-
cal diagnostic reasoning?
sequences of health IT and policy: A report from
9. Explain some ways in which technology- the AMIA 2009 health policy meeting. Journal of
mediated errors can compromise American Medical Informatics Association: JAMIA,
patient safety. 18(1), 82–90. http://doi.org/18/1/82 [pii]10.1136/
10. What are some of the assumptions of jamia.2010.007567.
Bruer, J. T. (1993). Schools for thought: A science of
the distributed cognition framework?
learning in the classroom. Cambridge: MIT Press.
4 What implications does this approach Carayon, P., Karsh, B.-T., Gurses, A. P., Holden, R. J.,
have for the evaluation of electronic Hoonakker, P., Hundt, A. S., Wetterneck, T. (2011).
health records? Macroergonomics in patient care and health care
11. Explain the difference between the safety. In D. G. Morrow (Ed.), Reviews of human
factors and ergonomics (Vol. 8). Santa Monica, CA:
effects of technology and the effects
Human Factors and Ergonomics Society.
with technology? How can each of these Carayon, P. (2012). Handbook of human factors and
effects contribute to improving patient ergonomics in health care and patient safety (2nd
safety and reducing medical error? ed.). Boca Raton: CRC Press.
12. The use of electronic health records Card, S. K., Moran, T. P., & Newell, A. (1983). The psy-
chology of human-computer interaction. Hillsdale:
(EHR) has been shown to differentially
L. Erlbaum Associates.
affect clinical reasoning relative to Carroll, J. M. (2003). HCI models, theories, and frame-
paper charts. Briefly characterize the works: Toward a multidisciplinary science. San
effects they have on reasoning, Francisco: Morgan Kaufmann.
including those that persist after the Chandler, P., & Sweller, J. (1991). Cognitive load theory
and the format of instruction. Cognition and
clinician ceases to use the system.
Instruction, 8(4), 293–332.
Chapanis, A. (1996). Human factors in systems engineer-
ing. New York: Wiley.
References Chase, W. G., & Simon, H. A. (1973). Perception in
chess. Cognitive Psychology, 4(1), 55–81.
Akin, O. (1982). The psychology of architecture design. Chi, M. T. H., & Glaser, R. (1981). Categorization and
London: Pion. representation of physics problems by experts and
Allard, F., & Starkes, J. L. (1991). Motor-skill experts in novices. Cognitive Science, 5, 121–152.
sports, dance, and other domains. In K. A. Ericsson Chi, M. T. H., Glaser, R., & Farr, M. J. (1988). The
& J. Smith (Eds.), Toward a general theory of exper- nature of expertise. Hillsdale: L. Erlbaum Associates.
tise: Prospects and limits (pp. 126–150). New York: Clancey, W. J., & Shortliffe, E. H. (1984). Readings in
Cambridge University Press. medical artificial intelligence: The first decade.
Anderson, J. R. (1985). Cognitive psychology and its Reading: Addison-Wesley.
implications (2nd ed.). New York: Freeman. DeGroot, A. T. (1965). Thought and choice in chess. The
Anderson, J. R. (2013). The architecture of cognition. Hague: Mouton.
New York: Psychology Press. Duch, W., Oentaryo, R. J., & Pasquier, M. (2008).
Atkinson, R. C., & Shiffrin, R. M. (1968). Human mem- Cognitive architectures: Where do we go from here?.
ory: A proposed system and its control processes. In Paper presented at the proceeding of the 2008 con-
K. W. Spence & J. T. Spence (Eds.), The psychology ference on Artificial General Intelligence 2008:
of learning and motivation (Vol. 2, pp. 89–195). Proceedings of the First AGI Conference.
New York: Academic. Elstein, K. A., Shulman, L. S., & Sprafka, S. A. (1978).
Balogh, E. P., Miller, B. T., & Ball, J. R. (2015). Improving Medical problem solving: An analysis of clinical rea-
diagnosis in health care. Washington, DC: National soning. Cambridge, MA: Harvard University Press.
Academy Press. Ericsson, K. A. (1996). The road to excellence: The
Bartolomeo, P. (2008). The neural correlates of visual acquisition of expert performance in the arts and sci-
mental imagery: An ongoing debate. Cortex, 44(2), ences sports and games. Mahwah: Lawrence
107–108. S0010-9452(07)00016-0 [pii]. Erlbaum Associates.
Bechtel, W., Abrahamsen, A., & Graham, G. (1998). Ericsson, K. A. (2006). The Cambridge handbook of
Part I: The life of cognitive science. In W. Bechtel & expertise and expert performance. Cambridge/New
G. Graham (Eds.), A companion to cognitive science York: Cambridge University Press.
Blackwell companions to philosophy (Vol. 13, Ericsson, K. A., & Simon, H. A. (1993). Protocol analy-
pp. 2–104). Malden: Blackwell. sis: Verbal reports as data (Rev. ed.). Cambridge,
MA: MIT Press.
Cognitive Informatics
149 4
Ericsson, K. A., & Smith, J. (1991). Toward a general (Eds.), Stevens’ handbook of experimental psychol-
theory of expertise: Prospects and limits. New York: ogy Vol 1: Perception and motivation; Vol 2: Learning
Cambridge University Press. and cognition (Vol. 1, 2nd ed., pp. 589–672).
Ericsson, K. A. (2009). Enhancing the development of New York: Wiley.
professional performance: Implications from the Harrington, L. (2015). Electronic health record work-
study of deliberate practice. In K. A. Ericsson (Ed.), flow: Why more work than flow? Advanced Critical
The development of professional expertise: Toward Care AACN, 26(1), 5–9.
measurement of expert performance and design of Hazlehurst, B., McMullen, C., Gorman, P., & Sittig, D.
optimal learning environments (pp. 405–431). New (2003). How the ICU follows orders: Care delivery
York: Cambridge University Press. as a complex activity system. AMIA Annual
Ericsson, K. A., Hoffman, R. R., Kozbelt, A., & Symposium Proceedings, 2003, 284–288.
Williams, A. M. (Eds.). (2018). The Cambridge D030003599 [pii].
handbook of expertise and expert performance. UK: Hazlehurst, B., McMullen, C. K., & Gorman, P. N.
Cambridge University Press. (2007). Distributed cognition in the heart room:
Ericsson, K. A. (Ed.). (1996). The road to excellence: How situation awareness arises from coordi-
The acquisition of expert performance in the arts nated communications during cardiac surgery.
and sciences, sports. Hillsdale, NJ: Lawrence Journal of Biomedical Informatics, 40(5), 539–551.
Erlbaum. https://doi.org/10.1016/j.jbi.2007.02.001. S1532-
Estes, W. K. (1975). The state of the field: General prob- 0464(07)00008-1 [pii].
lems and issues of theory and metatheory. In W. K. Henriksen, K. (2010). Partial truths in the pursuit of
Estes (Ed.), Handbook of learning and cognitive pro- patient safety. BMJ Quality & Safety Health Care,
cesses (Vol. 1). Hillsdale/New York: L. Erlbaum 19(3), i3–i7.
Associates. Hilgard, E. R., & Bower, G. H. (1975). Theories of learn-
Evans, D. A., & Gadd, C. S. (1989). Managing coher- ing (4th ed.). Englewood Cliffs: Prentice-Hall.
ence and context in medical problem-solving dis- Hoffman, R. R. (Ed.). (1992). The psychology of exper-
course. In D. A. Evans & V. L. Patel (Eds.), Cognitive tise: Cognitive research and empirical AI. Mahwah:
science in medicine: Biomedical modeling (pp. 211– Lawrence Erlbaum Associates.
255). Cambridge, MA: MIT Press. Hoffman, R. R., Shadbolt, N. R., Burton, A. M., &
Evans, D. A., & Patel, V. L. (1989). Cognitive science in Klein, G. (1995). Eliciting knowledge from experts –
medicine. Cambridge, MA: MIT Press. A methodological analysis. Organizational Behavior
Feltovich, P. J., Johnson, P. E., Moller, J. H., & Swanson, and Human Decision Processes, 62(2), 129–158.
D. B. (1984). The role and development of medical Horsky, J., Kaufman, D. R., & Patel, V. L. (2003a). The
knowledge in diagnostic expertise. In W. J. Clancey cognitive complexity of a provider order entry inter-
& E. H. Shortliffe (Eds.), Readings in medical artifi- face. AMIA Annual Symposium Proceedings, 2013,
cial intelligence: The first decade (pp. 275–319). 294–298. PMID: 14728181; PMCID: PMC1480200.
Reading: Addison Wesley. Horsky, J., Kaufman, D. R., Oppenheim, M. I., & Patel,
Fisk, A. D., Rogers, W. A., Charness, N., Czaja, S. J., & V. L. (2003b). A framework for analyzing the cogni-
Sharit, J. (2009). Designing for older adults: tive complexity of computer-assisted clinical order-
Principles and creative human factors approaches. ing. Journal of Biomedical Informatics, 36(1–2), 4–22.
Boca Raton: CRC Press. https://doi.org/10.1016/S1532-0464(03)00062-5.
Flin, R., & Patey, R. (2009). Improving patient Horsky, J., Kuperman, G. J., & Patel, V. L. (2005).
safety through training in non-technical skills. Comprehensive analysis of a medication dosing
British Medical Journal, 339, B3595. https://doi. error related to CPOE. Journal of the American
org/10.1136/Bmj.B3595. Medical Informatics Association: Journal of the
Frederiksen, C. H. (1975). Representing logical and American Medical Informatics Association JAMIA,
semantic structure of knowledge acquired from dis- 12(4), 377–382. https://doi.org/10.1197/jamia.
course. Cognitive Psychology, 7(3), 371–458. M1740. M1740 [pii].
Gardner, H. (1985). The mind’s new science: A history of Hutchins, E. (1995). Cognition in the wild. Cambridge,
the cognitive revolution. New York: Basic Books. MA: MIT Press.
Gillan, D. J., & Schvaneveldt, R. W. (1999). Applying Joseph, G. M., & Patel, V. L. (1990). Domain knowledge
cognitive psychology: Bridging the gulf between and hypothesis generation in diagnostic reasoning.
basic research and cognitive artifacts. In F. T. Durso, Medical Decision Making, 10, 31–46.
R. Nickerson, R. Schvaneveldt, S. Dumais, M. Chi, Karsh, B., Weinger, M. B., Abbott, P. A., & Wears, R. L.
& S. Lindsay (Eds.), Handbook of applied cognition (2010). Health information technology: Fallacies
(pp. 3–31). Chichester/New York: Wiley. and sober realities. Journal of the American Medical
Glaser, R. (Ed.). (2000). Advances in instructional psy- Informatics Association, 17(6), 617–623.
chology: Education design and cognitive science (Vol. Kassirer, J. P. (1989). Diagnostic reasoning. Annals of
5). Mahwah: Lawrence Erlbaum and Associates. Internal Medicine, 110, 893–900.
Greeno, J. G., & Simon, H. A. (1988). Problem solving Kaufman, D. R., Patel, V. L., & Magder, S. (1996). The
and reasoning. In R. C. Atkinson & R. J. Herrnstein explanatory role of spontaneously generated analo-
150 V. L. Patel and D. R. Kaufman
gies in a reasoning about physiological concepts. resilience: The decision-making process of profes-
International Journal of Science Education, 18, 369– sional sea-fishing skippers. Human Factors, 50(1),
386. 1–16.
Kintsch, W. (1998). Comprehension: A paradigm for cog- Newell, A. (1990). Unified theories of cognition.
nition. Cambridge/New York: Cambridge University Cambridge, MA: Harvard University Press.
Press. Newell, A., & Simon, H. A. (1972). Human problem solv-
Klienmuntz, B., & McLean, R. S. (1968). Diagnostic ing. Englewood Cliffs: Prentice-Hall.
interviewing by digital computer. Behavioral Norman, D. A. (1986). Cognitive engineering. In D. A.
Science, 13(1), 75–80. Norman & S. W. Draper (Eds.), User centered sys-
Kohn, L. T., Corrigan, J., & Donaldson, M. S. (2000). tem design: New perspectives on human-computer
4 To err is human: Building a safer health system (Vol. interaction (pp. 31–61). Hillsdale: Lawrence
6). Washington, DC: National Academy Press. Erlbaum Associates.
Koppel, R., Metlay, J. P., Cohen, A., Abaluck, B., Norman, D. A. (1993). Things that make us smart:
Localio, A. R., Kimmel, S. E., & Strom, B. L. defending human attributes in the age of the
(2005). Role of computerized physician order entry machine. Reading, Mass.: Addison-Wesley Pub. Co.
systems in facilitating medication errors. Journal of Patel, V. L., & Cohen, T. (2008). New perspectives on
the American Medical Association JAMA, 293(10), error in critical care. Current Opinions in Critical
1197–1203. Care, 14(4), 456–459.
Kushniruk, A. W., Kaufman, D. R., Patel, V. L., Patel, V. L., & Groen, G. J. (1986). Knowledge based
Levesque, Y., & Lottin, P. (1996). Assessment of a solution strategies in medical reasoning. Cognitive
computerized patient record system: A cognitive Science, 10(1), 91–116.
approach to evaluating medical technology. MD Patel, V. L., & Groen, G. J. (1991a). Developmental
Computing, 13(5), 406–415. accounts of the transition from medical student to
Kushniruk, A. W., Triola, M. M., Borycki, E. M., Stein, doctor: Some problems and suggestions. Medical
B., & Kannry, J. L. (2005). Technology induced Education, 25(6), 527–535.
error and usability: The relationship between usabil- Patel, V. L., & Groen, G. J. (1991b). The general and spe-
ity problems and prescription errors when using cific nature of medical expertise: A critical look. In
a handheld application. International Journal of K. A. Ericsson & J. Smith (Eds.), Toward a general
Medical Informatics, 74(7–8), 519–526. https:// theory of expertise: Prospects and limits (pp. 93–125).
doi.org/10.1016/j.ijmedinf.2005.01.003. S1386- New York: Cambridge University Press.
5056(05)00011-0 [pii]. Patel, V. L., & Kannampallil, T. G. (2015). Cognitive
Larkin, J., McDermott, J., Simon, D. P., & Simon, H. A. informatics in biomedicine and healthcare. Journal
(1980). Expert and novice performance in solving of Biomedical Informatics, 53, 3–14.
physics problems. Science, 208(4450), 1335–1342. Patel, V. L., & Kaufman, D. R. (1998). Medical infor-
Ledley, R. S., & Lusted, L. B. (1959). Probability, logic matics and the science of cognition. Journal of the
and medical diagnosis. Science, 130(3380), 892–930. American Medical Informatics Association JAMIA,
Lesgold, A., Rubinson, H., Feltovich, P., Glaser, R., 5(6), 493–502.
Klopfer, D., & Wang, Y. (1988). Expertise in a com- Patel, V. L., & Ramoni, M. F. (1997). Cognitive models
plex skill: Diagnosing x-ray pictures. In M. T. of directional inference in expert medical reasoning.
H. Chi, R. Glaser, & M. J. Farr (Eds.), The nature of In P. J. Feltovich, K. M. Ford, & R. R. Hoffman
expertise (pp. 311–342). Hillsdale: Lawrence (Eds.), Expertise in context: Human and machine
Erlbaum Associates. (pp. 67–99). Cambridge: The MIT Press.
Lieto, A., Lebiere, C., & Oltramari, A. (2018). The Patel, V. L., & Zhang, J. (2007). Cognition and patient
knowledge level in cognitive architectures: Current safety in healthcare. In F. T. Durso, R. S. Nickerson,
limitations and possible developments. Cognitive S. Dumais, S. Lewandowsky, & T. Perfect (Eds.),
Systems Research, 48, 39–55. Handbook of applied cognition (2nd ed., pp. 307–
Lin, L., Isla, R., Doniz, K., Harkness, H., Vicente, K. J., 331). New York: Wiley.
& Doyle, D. J. (1998). Applying human factors to Patel, V. L., Groen, G. J., & Arocha, J. F. (1990). Medical
the design of medical equipment: Patient-controlled expertise as a function of task-difficulty. Memory
analgesia. Journal of Clinical Monitoring and and Cognition, 18(4), 394–406.
Computing, 14(4), 253–263. Patel, V. L., Arocha, J. F., & Kaufman, D. R. (1994).
Magnani, L. (2001). Abduction, reason, and science: Diagnostic reasoning and medical expertise. In
Processes of discovery and explanation. Dordrecht: D. L. Medin (Ed.), The psychology of learning and
Kluwer Academic. motivation: Advances in research and theory (Vol. 31,
Mayes, T. J., Draper, S. W., McGregor, A. M., & Koatley, pp. 187–252). San Diego: Academic Press.
K. (1988). Information flow in a user interface: The Patel, V. L., Kushniruk, A. W., Yang, S., & Yale, J. F.
effect of experience and context on the recall of (2000). Impact of a computer-based patient record
MacWrite screens. Paper presented at the confer- system on data collection, knowledge organiza-
ence on people and computers IV, Cambridge. tion, and reasoning. Journal of the American
Morel, G., Amalberti, R., & Chauvin, C. (2008). Medical Informatics Association JAMIA, 7(6),
Articulating the differences between safety and 569–585.
Cognitive Informatics
151 4
Patel, V. L., Cohen, T., Murarka, T., Olsen, J., Kagita, Reason, J. T. (1990). Human error. Cambridge/New
S., Myneni, S., et al. (2011). Recovery at the edge of York: Cambridge University Press.
error: Debunking the myth of the infallible expert. Rimoldi, H. J. (1961). The test of diagnostic skills.
Journal of Biomedical Informatics, 44(3), 413–424. Journal of Medical Education, 36, 73–79.
Patel, V. L., Kaufman, D. R., & Kannampallil, T. G. Ritter, F. E., Tehranchi, F., & Oury, J. D. (2019). ACT-
(2013a). Diagnostic reasoning and decision making R: A cognitive architecture for modeling cognition.
in the context of health information technology. In Wiley Interdisciplinary Reviews: Cognitive Science,
D. Marrow (Ed.), Reviews of human factors and 10(3), e1488.
ergonomics (Vol. 8). Thousand Oaks: SAGE Rogers, Y. (2004). New theoretical approaches for HCI.
Publications. Annual Review of Information Science and
Patel, V. L., Kaufman, D. R., & Kannampallil, T. G. Technology, 38, 87–143.
(2013b). Diagnostic reasoning and decision making Salomon, G., Perkins, D. N., & Globerson, T. (1991).
in the context of health information technology. Partners in cognition: Extending human intelligence
Reviews of Human Factors and Ergonomics, 8(1), with intelligent technologies. Educational
149–190. Researcher, 20(3), 2–9. https://doi.org/10.3102/0013
Patel, V. L., Kaufman, D. R., & Cohen, T. (2014). 189x020003002.
Cognitive informatics in health and biomedicine: Senders JW. (1993) Theory and analysis of typical errors
Case studies on critical care, complexity and errors in a medical setting. Hospital Pharmacy. 1993
(pp. 1–13). London: Springer. Jun;28(6):505–508.
Patel, V. L., Kannampallil, T. G., & Shortliffe, E. H. Sharp, H., Preece, J., & Rogers, Y. (2019). Interaction
(2015a). Role of cognition in generating and miti- design: Beyond human-computer interaction.
gating clinical errors. BMJ Quality and Safety, 24, Hoboken, NJ: Wiley.
468–474. Shortliffe, E. H., & Blois, M. S. (2001). The computer
Patel, V. L., Kannampallil, T. G., & Kaufman, D. R. meets medicine and biology: Emergence of a disci-
(Eds.). (2015b). Cognitive informatics in health and pline. In E. H. Shortliffe & L. E. Perreault (Eds.),
biomedicine: Human computer interaction. London: Medical informatics: Computer applications in health
Springer. care and biomedicine (2nd ed., pp. 3–40). New York:
Patel, V. L., Arocha, J. F., & Ancker, J. (2017). Cognitive Springer.
informatics in health and biomedicine: Understanding Simon, D. P., & Simon, H. A. (1978). Individual differ-
and modeling health behaviors. London: Springer. ences in solving physics problems. In R. S. Siegler
Patel, V. L., Kaufman, D. R., & Kannampallil, T. G. (Ed.), Children’s thinking: What develops? (Vol. xi,
(2018). Diagnostic reasoning and expertise in pp. 325–348). Hillsdale, NJ: Lawrence Erlbaum
healthcare. In P. Ward, J. M. Schraagen, J. Gore, & Associates.
E. Roth (Eds.), The Oxford handbook of expertise: Sloboda, J. (1991). Musical expertise. In K. A. Ericsson
Research & application. UK: Oxford University & J. Smith (Eds.), Toward a general theory of exper-
Press. tise: Prospects and limits (pp. 153–171). New York:
Patel, V. L., Evans, D. A., & Kaufman, D. R. (1989). Cambridge University Press.
Cognitive framework for doctor-patient communi- Sternberg, R. J., & Horvath, J. A. (Eds.). (1999). Tacit
cation. In D. A. Evans & V. L. Patel (Eds.), Cognitive knowledge in professional practice: Researcher and
science in medicine: Biomedical modeling (pp. 257– practitioner. Mahwah: Lawrence Erlbaum
312). Cambridge, MA: MIT Press.. Associates.
Payne, S. J. (2003). Users’ mental models: The very Suchman, L. A. (1987). Understanding computers and
ideas. In J. M. Carroll (Ed.), HCI models, theories, cognition: A new foundation for design – Winograd,
and frameworks: Toward a multidisciplinary science T., Flores, F. Artificial Intelligence, 31(2), 227–232.
(1st ed., pp. 135–156). San Francisco: Morgan Sussman, S. Y. (2001). Handbook of program develop-
Kaufmann. ment for health behavior research & practice.
Peirce, C. S. (1955). Philosophical writings of Peirce. Ed. Thousand Oaks: Sage.
by Justus Buchler. New York: Dover. van Dijk, T. A., & Kintsch, W. (1983). Strategies of dis-
Peleg, M., Gutnik, L. A., Snow, V., & Patel, V. L. (2006). course comprehension. New York: Academic.
Interpreting procedures from descriptive guidelines. Vicente, K. J. (1999). Cognitive work analysis: Toward
Journal of Biomedical Informatics, 39, 184–195. safe, productive & healthy computer-based work.
Perkins, D. N., Schwartz, S., & Simmons, R. (1990). A Mahwah: Lawrence Erlbaum Associates Publishers.
view from programming. In M. Smith (Ed.), Toward Weinger, M. B., & Slagle, J. (2001). Human factors
a unified theory of problem solving: Views from con- research in anesthesia patient safety. Proceedings of
tent domains (pp. 45–67). Hillsdale: Lawrence the AMIA Symposium, 756–760. PMID: 11825287;
Erlbaum Associates. PMCID: PMC2243459.
Ramoni, M., Stefanelli, M., Magnani, L., & Barosi, G. White, B. Y., & Frederiksen, J. R. (1990). Causal model
(1992). An epistemological framework for medical progressions as a foundation for intelligent learning
knowledge-based systems. IEEE Transactions on environments. In W. J. Clancey & E. Soloway (Eds.),
Systems, Man, and Cybernetics, 22(6), 1361–1375. Artificial intelligence and learning environments spe-
152 V. L. Patel and D. R. Kaufman
cial issues of “Artificial Intelligence: An International Zhang, J., Johnson, T. R., Patel, V. L., Paige, D. L., &
Journal” (pp. 99–157). Kubose, T. (2003). Using usability heuristics to eval-
Woods, D. D., Patterson, E. S., & Cook, R. I. (2008). uate patient safety of medical devices. Journal of
Behind human error: Taming complexity to improve Biomedical Informatics, 36(1–2), 23–30.
patient safety. In P. Carayon (Ed.), Handbook of S1532046403000601 [pii].
human factors and ergonomics in health care and Zhang, J., Patel, V. L., Johnson, T. R., & Shortliffe,
patient safety (pp. 459–476). Mahwah: Lawrence E. H. (2004). A cognitive taxonomy of medical
Erlbaum Associates. errors. Journal of Biomedical Informatics, 37(3),
Wright, P. C., Fields, R. E., & Harrison, M. D. (2000). 193–204.
Analyzing human-computer interaction as distrib- Zuriff, G. E. (1985). Behaviorism: A conceptual recon-
4 uted cognition: The resources model. Human struction. New York: Columbia University Press.
Computer Interaction, 15(1), 1–41.
153 5
Human-Computer
Interaction, Usability,
and Workflow
Vimla L. Patel, David R. Kaufman, and Thomas Kannampallil
Contents
References – 171
HCI research in healthcare emerged at a time In recent years, there has been a significant
when health information technology and elec- growth in research and application regarding
tronic health records (EHRs) were becoming HCI and healthcare systems. They have pro-
more central to the practice of medicine (Patel duced a collective body of experiential and
et al. 2015). Much HCI work has been devoted practical knowledge about user experience,
to creating or enhancing design in healthcare adoption and implementation to guide future
systems. However, the focus of most of our design work. Some of the work is not specifi-
work has been on the cognitive mediation of cally guided by a theoretical foundation and
technology in healthcare practice (Patel et al. these efforts have proven to be useful in eluci-
2015). Most of the early HCI research focused dating problems and contributing to user-
on the solitary user of technology. Although centered design efforts. Human-computer
such research is still commonplace, the focus interaction work is at least partly an empirical
has extended to distributed health information science in which local knowledge derived from
systems (Hazlehurst et al. 2007; Horsky et al. a small body of studies will suffice in solving a
2003) and analysis of unintended sociotechni- problem. However, it is also necessary that we
cal consequences with a particular focus on extrapolate knowledge from one context to
computerized provider order entry systems another. Concentrated efforts in HCI are time-
(Koppel et al. 2005). HCI studies in biomedi- consuming, tend to employ small numbers of
cine extend across clinical and consumer subjects and are conducted in a limited num-
health informatics, addressing a range of user ber of settings. For example, it is simply not
populations including providers, biomedical possible to conduct an HCI research project in
scientists, and patients. While the implications many different hospitals or to thoroughly test
of HCI principles for the design of HIT are every facet of an electronic health record sys-
acknowledged, the adoption of the tools and tem. Knowledge solely based on practical
techniques among clinicians, informatics experience or empirical studies are not ade-
researchers and developers of HIT are limited. quate to account for the immense variety of
There is a consensus that HIT has not realized health information technologies and the rich
its potential as a tool that facilitates clinical array of contexts that constitute the practice
decision-making, coordination of care, and of medicine (Kaufman et al. 2015).
improvement of patient safety (Middleton There are many facets to technology use
et al. 2013). and a range of theories that address them. For
The field of human computer interaction example, the technology acceptance model
intersects behavioral, and computer and infor- that focuses on user’s perceived usefulness and
mation science. Thus, this field involves the usage intentions has been widely used in
study of interaction between people and com- healthcare research (Venkatesh 2000).
puters. Computing systems includes both soft- Sociotechnical systems theory is very broad in
ware and hardware. In addition, devices from scope. It views all organizations as having the
156 V. L. Patel et al.
following elements that comprise its organiza- external representations (e.g., visual displays,
tional design: technological (including the post-it notes). DCog has two lines of analysis,
actual IT system, usability, and unintended one that emphasizes the social and collabora-
consequences), social (doctors, staff, patients, tive nature of cognition (e.g., surgeons, nurses
etc.), and external environment (e.g., political, and respiratory therapists in cardiothoracic
economic, cultural, and legal influences) surgical setting jointly contributing to a deci-
(Hendrick and Kleiner 1999). These subsys- sion process), and one that characterizes the
tems are intricately connected, such that mediating effects of technology (e.g., EHRs,
changes to any one affects others, sometimes paper charts, mobile devices, apps) or other
in unanticipated or dysfunctional ways (Aarts artifacts on cognition. DCog constitutes a fam-
5 et al. 2007; Ash et al. 2004). One of the most
influential theories in clinical informatics was
ily of interrelated theories rather than a single
approach (Cohen et al. 2006). The approaches
offered by Sittig and Singh (2010). They collectively offer a penetrating view of the
proposed an 8-dimensional model of interre- complexities embodied in human-computer
lated concepts that can be used to explain per- interaction. However, there is no “off-the-
formance in complex adaptive systems in the shelf” methodology for using it in research or
healthcare arena. The model has been applied as a practitioner (Furniss et al. 2015). The
in a range of settings model to understand application of DCog theory and methods are
and improve HIT applications at various complicated by the fact that there are no set of
stages of development and implementation. features to attend to and no checklist or pre-
Cognitive engineering (CE) is an interdisci- scribed method to follow (Rogers 2012). In
plinary approach to the development of prin- addition, the analysis and abstraction requires
ciples, methods, and tools to assess and guide a high level of skill and training. More in-depth
the design of systems to support human perfor- reviews of DCog can be found in (Rogers 2004)
mance (Hettinger et al. 2017). The approach is and as applied to healthcare in (Hazlehurst
rooted in both cognitive science and engineer- et al. 2008; Kaufman et al. 2015). DCog
ing and has been used to support design of dis- approaches have been particularly useful in the
plays, decision support and training in analysis of teamwork and EHR-mediated
numerous high-risk domains (Kushniruk et al. workflow in complex environments (Blandford
2004). A computational theory of mind pro- and Furniss 2006; Hazlehurst et al. 2007;
vides the fundamental underpinning for most Kaufman et al. 2009). It is not unusual for HCI
contemporary cognitive theories. The basic researchers to engage multiple theories depend-
premise is that much of human cognition can ing on the area of focus.
be characterized as a series of operations, com-
putations on mental representations. At a
higher level of cognitive analysis, CE also 5.4 Usability of Health
focuses on the discrepancy between user’s goals Information Technology1
and the physical controls embodied in a system
(Norman 1986). Interface design choices differ- Theories of cognitive science meaningfully
entially mediates task performance and various inform and shape design, development and
methods of analysis including those described assessment of health-care information sys-
below endeavor to measure this impact. tems by providing insight into principles of
Distributed cognition (DCog) represents a
shift in the study of cognition from an exclusive
focus on the mind of the individual to being
“stretched” across groups, material artifacts 1 Parts of the section, have been adapted, with permis-
and cultures (Hutchins 1995). This paradigm sion, from Kannampallil, T. G., & Abraham, J.
has gained substantial currency in HCI (2015). Evaluation of health information technol-
ogy: Methods, frameworks and challenges. In V. L.
research. In the distributed approach, cogni- Patel, T. G. Kannampallil, & D. Kaufman (Eds.),
tion is viewed as a process of coordinating dis- Cognitive informatics in health and biomedicine:
tributed internal (i.e., what’s in the mind) and Human computer interaction. London: Springer.
Human-Computer Interaction, Usability, and Workflow
157 5
Evaluation
Methods
Field/
Observational
[Lab or Field]
Shadowing Time and
Task Model- Motion Studies Focus Groups, Verbal Think Eye-tracking,
Inspection-
Analytic based Surveys and Interviews Aloud Screen capture
based
Questionnaires
Heuristic Cognitive [Lab or Field
Evaluation Walkthrough or Online]
Hierarchical Task Cognitive
Analysis (HTA) Task Analysis Motor-based Keystroke-
theories (e.g., level Models
Fitts Low)
system usability and learnability, as well as the The question then becomes how we evalu-
design of a safer workplace. ate and study the various attributes of usabil-
Usability methods, most often drawn from ity. We classified usability evaluation methods
cognitive science, have been used to evaluate a into two categories: analytic evaluation
wide range of medical information technolo- approaches and usability testing. Analytic
gies including infusion pumps (Karat 1994), evaluation studies use experts as partici-
ventilator management systems, physician pants—usability experts, domain experts,
order entry (Ash et al. 2003; Horsky et al. software designers—or in some cases, are con-
2003; Koppel et al. 2005), pulmonary graph ducted without participants using task-ana-
displays (Wachter et al. 2003), information lytic, inspection-based or model- based
retrieval systems, and research web environ- approaches and are conducted in laboratory-
ments for clinicians (Elkin et al. 2002). In based settings.
addition, usability techniques are increasingly We categorized usability testing into field-
used to assess patient-centered environments based studies that capture situated and con-
(Chan and Kaufman 2011; Cimino et al. 2000; textual aspects of HIT use, and a general
Kaufman et al. 2003a, b). The methods category of methods (e.g., interviews, focus
include observations, focus groups, surveys groups, surveys) that solicit user opinions and
and experiments. Collectively, these studies can be administered in different modes (e.g.,
make a compelling case for the instrumental face-to-face or online). A brief categorization
value of such research to improve efficiency, of the evaluation approaches can be found in
user acceptance and relatively seamless inte- . Fig. 5.1. In the following sections, we pro-
gration with current workflow and practices. vide a detailed description of each of the eval-
What do we mean by usability? Nielsen uation approaches along with research
suggests that usability includes the following examples of its use.
five attributes: (1) learnability: system should
be relatively easy to learn, (2) efficiency: an
experienced user can attain a high level of 5.4.1 Analytical Approaches
productivity, (3) memorability: features sup-
ported by the system should be easy to retain Analytical approaches rely on analysts’ judg-
once learned, (4) errors: system should be ments and analytic techniques to perform
designed to minimize errors and support error evaluations on user interfaces, and often do
detection and recovery, and (5) satisfaction: not directly involve the participation of end
the user experience should be subjectively sat- users. These approaches employ experts—
isfying. general usability, human factors, or soft-
158 V. L. Patel et al.
5.4.1.1 Task Analysis2 In this task analysis, the task can be decom-
Task analysis is one of most commonly used posed as follows: moving to your desktop,
techniques to evaluate “existing practices” in searching for the document (either visually or
order to understand the rationale behind peo- by using the search function and typing in the
5 ple’s goals of performing a task, the motiva-
tions behind their goals, and how they perform
search criteria), selecting the document, open-
ing and printing it using the appropriate
these tasks (Preece et al. 1994). As described printer. The order in which these tasks are
by Vicente (1999), task analysis is an evalua- performed may change based on specific situ-
tion of the “trajectories of behavior.” There ations. For example, if the document is not
are several variants of task analysis—hierar- immediately visible on the desktop (or if the
chical task analysis (HTA) and cognitive task desktop has several documents making it
analysis (CTA) being the most commonly impossible to identify the document visually),
used in biomedical informatics research. then a search function is necessary. Similarly,
HTA is the simplest task analytic approach if there are multiple printer choices, then a rel-
and involves the breaking down of a task into evant printer must be selected. The plans
sub-tasks and smaller constituted parts (e.g., include a set of tasks that a user must under-
sub-sub-tasks). The tasks are organized take to achieve the goal (i.e., print the docu-
according to specific goals. This method, orig- ment). In this case, there are two plans: plan 0
inally designed to identify specific training and plan 2 (all plans are conditional on tasks
needs, has been used extensively in the design having pertinent sub-tasks associated with it).
and evaluation of interactive interfaces For example, if the user cannot find a docu-
(Annett and Duncan 1967). The application ment on the desktop, plan 2 is instantiated,
of HTA can be explained with an example: where a search function is used to identify the
consider the goal of printing a Microsoft document (steps 2.1, 2.2 and 2.3). . Figure 5.2
Word document that is on your desktop. The depicts the visual form of the HTA for this
sub-tasks for this goal would involve finding particular example.
(or identifying) the document on your desk- HTA has been used in evaluating inter-
top, and then print it by selecting the faces and medical devices. For example,
appropriate printer. The HTA for this task Chung et al. (2003) used HTA to compare the
can be organized as follows: differences between six infusion pumps. Using
0. Print document on the desktop HTA, they identified potential sources for the
1. Go to the desktop generation of human errors during various
2. Find the document tasks. While exploratory, their use of HTA
2.1. Use “Search” function provided insights into how the HTA can be
2.2. Enter the name of the document used for evaluating human performance and
2.3. Identify the document for predicting potential sources of errors.
3. Open the document Alternatively, HTA has been used to model
4. Select the “File” menu and then “Print” information and clinical workflow in ambula-
tory clinics (Unertl et al. 2009). Unertl et al.
(2009) used direct observations and semi-
structured interviews to create a HTA of the
2 While GOMS (See 7 Sect. 5.4.1.3) is considered workflows. The HTA was then used to iden-
a task-analytic approach, we have categorized it
as a model-based approach for predictions of task
tify the gaps in existing HIT functionality for
completion times. It is based on a task analytic supporting clinical workflows, and the needs
decomposition of tasks. of chronic disease care providers.
Human-Computer Interaction, Usability, and Workflow
159 5
.. Fig. 5.2 Graphical representation of task analysis of printing a document: the tasks are represented in the
boxes; the line underneath certain boxes represents the fact that there are no sub-tasks for these tasks
CTA is an extension of the general task through direct (e.g., verbal think aloud) or
analysis technique to develop a comprehensive indirect (e.g., unobtrusive screen recording)
understanding regarding the knowledge, cog- data capture methods. Whereas the process-
nitive/thought processes and goals that under- tracing approach is generally used to capture
lie observable task activities (Chipman et al. expert behaviors, it has also been used to eval-
2000). Although the focus is on knowledge uate general users. In a study on experts’
and cognitive components of the task activi- information seeking behavior in critical care,
ties and performance, CTA relies on observ- Kannampallil et al. (2013a) used the process-
able human activities to draw insights on the tracing approach to identify the nature of
knowledge-based constraints and challenges these activities including the information
that impair effective task performance. sources, cognitive strategies, and shortcuts
CTA techniques are broadly classified into used by critical care physicians in decision-
three groups based: (a) interviews and obser- making tasks. The CTA approach relied on
vations, (b) process tracing and (c) conceptual the verbalizations of physicians, their access
techniques (Cooke 1994). CTA using inter- to various sources, and the time spent on
views and observations involve developing a accessing these sources to identify the strate-
comprehensive understanding of tasks gies of information seeking.
through discussions with, and task observa- Finally, CTA supported by conceptual
tions of experts. For example, a researcher techniques rely on the development of repre-
observes an expert physician performing the sentations of a domain (and their related con-
task of medication order entry into a CPOE cepts) and the potential relationships between
(Computerized Physician Order Entry) sys- them. This approach is often used with experts
tem and asks to follow up questions regarding and different methods are used for knowledge
the specific aspects of the task. In a study on elicitation including concept elicitation, struc-
understanding providers’ management of tured interviews, ranking approaches, card
abnormal test results, Hysong et al. (2010) sorting, structural approaches such as multi-
conducted CTA-based interviews with 28 pri- dimensional scaling, and graphical associa-
mary care physicians on how and when they tions (Cooke 1994).
manage alerts, and how they use the various
features on the EHR system to filter and sort 5.4.1.2 Inspection-Based Evaluation
their alerts. Inspection methods involve experts appraising
CTA supported by process-tracing a system, playing the role of a user to identify
approaches relies on capturing task activities potential usability and interaction issues with
160 V. L. Patel et al.
a system. Inspection methods are often con- usability experts evaluate the user interface
ducted on fully developed systems or inter- against the identified heuristics. After evaluat-
faces but may also be used for prototypes. ing the heuristics, the potential violations are
Inspection methods rely on a usability expert, rated according to a severity score (1–5, where
i.e., a person with significant training and 1 indicates a cosmetic problem and 5 indicates
experience in evaluating interfaces, to go a catastrophic problem). This process is itera-
through a system and identify whether the tive and continues until the expert feels that a
user interface elements conform to a pre- majority (if not all) of the violations are iden-
determined set of usability guidelines and tified. It is also generally recommended that a
design requirements (or principles). The most set of 4–5 usability experts are required to
5 commonly used inspection methods are heu-
ristic evaluations (HE) and walkthroughs.
identify 95% of the perceived violations or
problems with a user interface. However, it is
HE techniques utilize a small set of experts not uncommon to employ fewer experts (e.g.,
to evaluate a user interface (or a set of inter- 3). It should be acknowledged that the HE
faces in a system) based on their understand- approach may not lead to the identification of
ing of a set of heuristic principles regarding all problems and the identified problems may
interface design (Johnson et al. 2005). This be localized (i.e., specific to a particular inter-
technique was developed by Jakob Nielsen face in a system). An example of an HE evalu-
and colleagues (Nielsen and Molich 1990), ation form is shown in . Fig. 5.3.
and has been used extensively in the evalua- In the healthcare domain, HE has been
tion of user interfaces. The original set of used in the evaluation of medical devices and
heuristics was developed by Nielsen based on HIT interfaces. For example, Zhang et al.
an abstraction of 249 usability problems. In (2003) used a modified set of 14 heuristics to
general, the following ten heuristic principles compare the patient safety characteristics of
(or a subset of these) are most often consid- two 1-channel volumetric infusion pumps.
ered for HE studies: system status visibility; Four independent usability experts evaluated
match between system and real world; user both infusion pumps using the list of heuris-
control and freedom; consistency and stan- tics and identified 89 usability problems cate-
dards; error prevention; recognition rather gorized as 192 heuristic violations for pump 1,
than recall; flexibility and efficiency of use; and 52 usability problems categorized as 121
aesthetic and minimalist design; help users heuristic violations for pump 2. The heuristic
recognize, diagnose and recover from errors; violations were also classified based on their
and help and documentation (retrieved from: severity. In another study, Allen et al. (2006)
7 http://www.n ngroup.c om/articles/ten- developed a simplified list of heuristics to
usability-heuristics/). Conducting an HE evaluate web-based healthcare interfaces
involves a usability expert going through an (printouts of each interface). Multiple usabil-
interface to identify potential violations to a ity experts assigned severity ratings for each
set of usability principles (referred to as “heu- of the identified violations and the severity
ristics”). These perceived violations could ratings were used to re-design the interface.
involve a variety of interface elements such as Walkthroughs are another inspection-
windows, menu items, links, navigation, and based approach that relies on experts to evalu-
interaction. ate the cognitive processes of users performing
Evaluators typically select a relevant sub- a task. It involves employing a set of potential
set of heuristics for evaluation (or add more stakeholders (designers, usability experts) to
based on the specific needs and context). The characterize a sequence of actions and goals
selection of heuristics is based on the type of for completing a task. Most commonly used
system and interface being evaluated. For walkthrough, referred to as cognitive walk-
example, the relevant heuristics for evaluating through (CW), involves observing, recording
an EHR interface would be different from and analyzing the actions and behaviors of
that of an app on a mobile device. After select- users as they complete a scenario of use. CW
ing a set of applicable heuristics, one or more is focused on identifying the usability and
Human-Computer Interaction, Usability, and Workflow
161 5
comprehensibility of a system (Polson et al. for working with an interface or system. For
1992). The aim of CW is to investigate and example, for an interface for entering demo-
determine whether the user’s knowledge and graphic and patient history details, partici-
skills and the interface cues are sufficient to pants (e.g., physicians) are asked to enter the
produce an appropriate goal-action sequence age, gender, race and clinical history informa-
that is required to perform a given task tion. As the participants perform their
(Kaufman et al. 2003a, b). CW is derived from assigned task, their task sequences, errors and
the cognitive theory of how users work on other behavioral aspects are recorded. Often,
computer-based tasks, using the exploratory follow up interviews or think aloud (described
learning approach, where system users con- in a later section) are used to identify partici-
tinually appraise their goals and evaluate their pants’ interpretation of the tasks, how they
progress against these goals (Kahn and Prail make progress, and potential points of mis-
1994). matches in the system. Detailed observations
While performing CW, the focus is on sim- and recordings of these mismatches are docu-
ulating the human-system interaction, and mented for further analysis. While in most
evaluating the fit between the system features situations CWs are performed by individuals,
and the user’s goals. Conducting CW studies sometimes groups of stakeholders perform
involves multiple steps. Potential participants the walkthrough together. For example,
(e.g., users, designers, usability experts) are usability experts, designers and potential users
provided a set of task sequences or scenarios could go through systems together to identify
162 V. L. Patel et al.
the potential issues and drawbacks. Such choosing among alternative available meth-
group walkthroughs are often referred to as ods depending on features of the task at hand,
pluralistic walkthroughs. keeping track of what has been done and what
In biomedical informatics domain, it must needs to be done, and executing the motor
be noted that CW has been used extensively in movements necessary for the keyboard and
evaluating situations other than human- mouse” (Olson and Olson 2003). In other
computer interaction. For example, the CW words, GOMS assumes that the execution of
method (and its variants) has been used to tasks can be represented as a serial sequence
evaluate diagnostic reasoning, decision- of cognitive operations and motor actions.
making processes and clinical activities. For GOMS is used to describe an aggregate of
5 example, Kushniruk et al. (1996) used the CW the task and the user’s knowledge regarding
method to perform an early evaluation of the how to perform the task. This is expressed
mediating role of HIT in clinical practice. The regarding the Goals, Operators, Methods and
CW was not only used to identify usability Selection rules. Goals are the expected out-
problems but was instrumental in the develop- comes that a user wants to achieve. For exam-
ment of a coding scheme for subsequent ple, a goal for a physician could be
usability testing. Hewing et al. (2013) used documenting the details of a patient interac-
CW to evaluate an expert ophthalmologist’s tion on an EHR interface. Operators are the
reasoning regarding retinal disease in infants. specific actions that can be performed on the
Using images, clinical experts were indepen- user interface. For example, clicking on a text
dently asked to rate the presence and severity box or selecting a patient from a list in a drop-
of retinal disease and provide an explanation down menu. Methods are sequential combina-
of how they arrived at their diagnostic deci- tions of operators and sub-goals that need to
sions. Similar approaches were used by be achieved. For example, in the case of select-
Kaufman et al. (2003a, b) to evaluate the ing a patient from a dropdown list, the user
usability of a home-based, telehealth system. has to move the mouse over to the dropdown
menu, click on the arrow using the appropri-
5.4.1.3 Model-Based Evaluation ate mouse key to retrieve the list of patients.
Model-based evaluation approaches use pre- Finally, selection rules are used to ascertain
dictive modeling approaches to characterize which methods to choose when several choices
the efficiency of user interfaces. Model-based are available. For example, using the arrow
approaches are often used for evaluating rou- keys on the keyboard to scroll down a list ver-
tine, expert task performance. For example, sus using the mouse to select.
how can the keys of a medical device interface One of the simplest and most commonly
be optimally organized such that the users can used GOMS approaches is the Keystroke-
complete their tasks quickly and accurately? Level Model (KLM), which was first described
Similarly, predictive modeling can be used to in Card et al. (1983). As opposed to the gen-
compare the data entry efficiency between eral GOMS model, the KLM makes several
interfaces with different layouts and organiza- assumptions regarding the task. In KLM,
tion. We describe two commonly used predic- methods are limited to keystroke level opera-
tive modeling techniques in the evaluation of tions and task duration is predicted based on
interfaces. these estimates. For the KLM, there are six
Card et al. (1980) proposed the GOMS types of operators: K for pressing a key; P for
(Goals, Operators, Methods and Selection pointing the mouse to a target; H for moving
Rules) analytical framework for predicting hands to the keyboard or pointing device; D
human performance with interactive systems. for drawing a line segment; M for mental
Specifically, GOMS models predict the time preparation for an action; and R for system
taken to complete a task by a skilled/expert response. Based on experimental data or other
user based on “the composite of actions of predictive models (e.g., Fitts Law), each of
retrieving plans from long-term memory, these operators is assigned a value or a param-
Human-Computer Interaction, Usability, and Workflow
163 5
eterized estimate of execution time. We details in Card et al. (1980). GOMS models
describe an example from Saitwal et al. (2010) can be applied only to the error-free, routine
on the use of the KLM approach. tasks of skilled users. Hence, it is not possible
In a study investigating the usability of to make time predictions for non-skilled users,
EHR interfaces, Saitwal et al. (2010) used the who are likely to take considerable time to
KLM approach to evaluate the time is taken, learn to use a new system. For example, the
and the number of steps required to complete use of the GOMS approach to predict the
a set of 14 EHR-based tasks. The purpose of potential time spent by physicians in using a
the study was to characterize the issues with new EHR would be inaccurate owing to rela-
the user interface and also to identify poten- tive lack of knowledge of the physicians
tial areas for improvement. The evaluation regarding the use of the various interfaces,
was performed on the AHLTA (Armed Forces and the learning curve required to be up-to-
Health Longitudinal Technology Application) speed with the new system. The complexity of
user interface. A set of 14 prototypical tasks clinical work processes and tasks, and the
was first identified. Sample tasks included variability of the user population create sig-
entering the patient’s current illness, history nificant challenges for the effective use of
of present illness, social history and family GOMS in measuring the effectiveness of clini-
history. KLM analysis was performed on each cal tasks.
of the tasks: this involved breaking each of Fitts Law is used to predict human motor
the tasks into its component goals, operators, behavior; it is used to predict the time taken to
methods and selection rules. The operators acquire a target (Fitts 1954). On computer-
were also categorized as physical (e.g., move based interfaces, it has been used to develop a
the mouse to a button) or mental (e.g., locate predictive model of time it takes to acquire a
an item from a dropdown menu). For exam- target using a mouse (or another pointing
ple, the selection of a patient name involved device). The time taken to acquire a target
eight steps (M – mental operation; P – physi- depends on the distance between the pointer
cal operation): (1) think of location on the and target (referred to as amplitude, A) and
menu [M, 1.2s], (2) move hand to the mouse the width of the target (W). The movement
[P, 0.4s], (3) move the mouse to “Go” in the time (MT) is mathematically represented as
menu [P, 0.4s], (4) extend the mouse to follows:
“Patient” [P, 0.4s], (5) retrieve the name of the
A
patient [M, 1.2s], (6) locate patient name on MT k .log 2 1
the list [M, 1.2s], (7) move mouse to the identi- W
fied patient [P, 0.4s] and (8) click on the iden- where k is a constant, A – amplitude, W –
tified patient [P, 0.4s]. In this case, there were width of the target.
a total of 8 steps that would take 5.2s to com- In summary, based on Fitts law, one can
plete. Similarly, the number of steps and the say that the larger objects are easier to acquire
time taken for each of the 14 considered while smaller, closely aligned objects are much
AHLTA tasks were computed. more difficult to acquire with a pointing
In addition, GOMS and its family of device. While the direct application of Fitts
methods can be productively used to make law is not often found in the evaluation stud-
comparisons regarding the efficiency of per- ies of HIT or health interfaces in general, it
forming tasks interfaces. However, such has a profound influence in the design of
approaches are approximations and have sev- interfaces. For example, the placement of
eral disadvantages. Although GOMS provides menu items and buttons, such that a user can
a flexible and often reliable mechanism for easily click on them for selection, are based on
predicting human performance in a variety of Fitts law parameters. Similarly, in the design
computer-based tasks, there are several poten- of number keypads for medical devices, the
tial limitations. A brief summary is provided size of the buttons and their location can be
here, and interested readers can find further effectively predicted by Fitts law parameters.
164 V. L. Patel et al.
are reflected in the study methods. However, it route taken to complete a task including the
is quite clear that EHRs typically add signifi- action steps and the trajectory through space
cantly to the documentation burden and that (e.g., sequence of tabs or display screens). In a
has resulted in frustration among users. systematic review, Roman et al. found that
EHR usability problems and their impact navigation actions (e.g., scrolling through a
on workflow are well documented. For exam- patient list) were often linked to specific
ple, the Healthcare Information and usability heuristic violations, including flexi-
Management Systems Society (HIMSS) sur- bility and efficiency of use, and lack of an
vey found that workflow was clinicians’ num- emphasis on recognition rather than recall
ber one EHR usability “pain point” (Ribitzky (Roman et al. 2017).
5 et al. 2010). Respondents also reported frus-
tration due to numerous alerts and difficulties
Medication reconciliation (MedRec) tools
are an essential part of a strategy to reduce
with navigation resulting from the need to medication errors and prevent adverse events
negotiate too many displays in the EHR to (Agrawal 2009). MedRec tools enable clini-
access information (Carayon et al. 2010). As cians to compare lists of medications in
referred to previously, Saitwal and colleagues patients history and revise the lists so that
(Saitwal et al. 2010) employed a cognitive task they are up-to-date and accurate. There have
analytic approach to evaluate an EHR inter- been several recent studies that have applied
face and quantified interactive behavior (e.g., cognitive methods of analysis to MedRec
task duration, number of steps). Challenges (Boockvar et al. 2011; Lesselroth et al. 2013).
of employing the user interface were: (a) large Horsky and colleagues et al. investigate the
number of average total steps to complete accuracy of two different medication recon-
routine tasks, (b) slow execution time, and (c) ciliation tools integrated into EHRs in a simu-
overall mental workload. Similarly, Carayon lated study (Horsky et al. 2017). They found
et al. (6) summarized a range of usability that the reconciled records were significantly
issues in their systematic review of the work- more accurate when clinicians used the sec-
flow literature, including the large number of ond tool. Specifically, the comparison showed
mouse clicks to complete a task, difficulties in clinicians made three times as many errors in
navigating between many screens to input and EHRs with single column medication lists, as
retrieve information and cluttered screens. compared to using side-by-side lists. The
These usability problems serve to increase authors concluded that the better outcome
cognitive load. Information overload, a form using the second tool was strongly facilitated
of cognitive load, is a problem that occurs by a design that was more effective in support-
when attentional, perceptual and cognitive ing a cognitively demanding task. The system
capacity is exceeded by the quantity of data made less demands on working memory than
presented via an interface to the extent that the first. Plaisant and colleagues similarly
errors occur in users’ information processing contrasted a conventional interface design
(Zahabi et al. 2015). (control) with a novel prototype (Twinlist)
Cognitive overload can be partially attrib- (Plaisant et al. 2015). The Twinlist interface
uted to poor user design. Small interface dif- divided information into five columns, while
ferences can be consequential and have a the control used two side- by-side lists.
significant impact on task efficiency. Evaluation showed that in Twinlist partici-
According to Gray and colleagues, interactive pants completed MedRec significantly faster
behavior is constrained by the design and con- and more accurately than the control. Both
figuration of displays “as well as by the ways studies demonstrated the comparative advan-
in which elementary cognitive, perceptual, tage of having access to needed information
and motor operations can be combined” on a single screen as opposed to having to
(Gray and Boehm-Davis 2000). Poorly config- toggle between two screens or tabs.
ured interfaces can increase navigational com- The studies described above employed
plexity. Navigation in this context refers to simulated methods. Duncan compared
Human-Computer Interaction, Usability, and Workflow
169 5
MedRec interfaces in three Mayo Clinic cam- bottlenecks in EHR-mediated workflow can
puses in which different EHR systems were lead to system redesigns that minimize cogni-
used (Duncan et al. 2018b). Although they tive load and may also improve patient safety
support the same set of functions, the inter- and efficiency.
faces differed in important respects.
Specifically, the steps to access the medication
list and perform the addition of a last dose 5.7 Future Directions
differ. System 2 necessitated a three-step pro-
cess as opposed to a single step needed to The role of HCI, and more specifically, usabil-
execute the same reconciliation goal (system ity, is likely to be more embedded within the
1). The systems were compared using a pre- biomedical informatics research paradigm
dictive model (KLM) and with live observa- over the next decade. This is primarily because
tions captured to video. As described earlier of the role health information technologies
in this chapter, the keystroke-level model play in transforming the practice of medicine.
(KLM) is a widely used analysis where execu- Such a transformation has led to the wide-
tion time of routine tasks (performed without spread use of technology in patient care (e.g.,
errors) are estimated. Duncan found that the the use of EHRs) and the development of
KLM estimates for the MedRec task closely patient-facing applications for self-manage-
approximated the observations. Both meth- ment of care (e.g., mobile devices). As a result,
ods found that the time required to reconcile the role of usability is likely to have an
a single medication was more than 2 seconds increased focus in potentially two areas: devel-
greater for system 2, which also required opment of new approaches for unobtrusive
more mouse clicks and screen transitions. The evaluation of user interactions, and in the
two systems differ in terms of interactive evaluation of consumer-facing applications
complexity and demands on working mem- for health.
ory. The difference highlights the importance As previously described, usability and user
of emphasizing recognition rather than recall interaction studies are expensive, in terms of
to minimize the memory load on clinicians. time, and effort that are involved. Recent efforts
Duncan and colleagues employed a similar on usability evaluation have focused on utiliz-
approach with the same EHR systems in rela- ing logs of user interactions—including audit
tion to a vital signs documentation and medi- trails, and other unobtrusively collected inter-
cation administration record tasks (Duncan action data (e.g., key stroke or eye-tracker). One
et al. 2018a; Duncan et al., 2020). Vital signs classic example of such data is the EHR-based
are used to gauge a patient’s hemodynamic audit logs. In a recent study on EHR use in an
stability and, in this case, provide a point of emergency department, Kannampallil and col-
reference prior to surgery. The objectives were leagues used user log files to track the physician
to: (1) analyze aspects of vital signs charting interactions during patient care activities
interfaces and determine how these aspects (Kannampallil et al. 2018b). Similar efforts on
differentially mediate task performance and tracing and modeling interaction and naviga-
(2) investigate variations in vital signs docu- tion behaviors are ongoing. The availability of
mentation across clinical sites. Analyses more powerful tools for data capture and anal-
revealed that accessing displays and the orga- ysis will create new opportunities for computer
nization of interface elements are often unin- scientists, psychologists and clinicians to col-
tuitive and inefficient, creating unnecessary laborate on HCI-related investigations of tech-
complexities when interacting with the sys- nology-mediated clinical practice. In a recent
tem. The study documented the ways in which study of such a collaboration, Vankipuram and
the systems differed in their modes of interac- colleagues showed that the process of data-
tion, organization of patient information and driven iterative workflow redesign using visual-
cognitive support. The authors noted that ization of overlaid data from quantitative and
identifying barriers to interface usability and qualitative sources could be used to identify
170 V. L. Patel et al.
inefficiencies and bottlenecks in clinical work- Pub. Co. This book addresses significant issues
flow and potentially contribute to process in human-computer interaction in a very read-
improvement (Vankipuram et al., 2019). able and entertaining fashion.
The growth of wearable devices and Patel, V. L., Kannampallil, T., & Kaufman, D.
mobile sensing technologies have led to the (Eds.). (2015). Cognitive informatics in health
development and use of a large number of and biomedicine: Human computer interac-
consumer-facing applications and tools. tion. London: Springer. This edited book
Although much of these are on mobile addresses the key gaps on the applicability of
devices, other patient-facing applications such theories, models and evaluation frameworks
as portals and social networking tools have of HCI and human factors for research in bio-
5 gained prominence in recent years. These
applications are rapidly evolving and, in many
medical Informatics.
Preece, J., Rogers, Y., & Sharp, H. (2015).
cases, still require significant improvements Interaction design: Beyond human-computer
for translation into a usable and sufficiently interaction (4th ed.). West Sussex: Wiley. A
robust product. very readable and relatively comprehensive
introduction to human-computer interaction.
A new edition will be available in April, 2019.
5.8 Conclusion Zheng, K., Westbrook, J., Kannampallil, T., &
Patel, V. L. (Eds.). (2018). Cognitive informat-
The implementation and broad use of health ics: Reengineering clinical workflow for more
information technologies have grown rapidly efficient and safer care. London: Springer.
over the course of the last decade. Clinical This edited book offers a comprehensive
applications and increasingly patient-facing aspect of clinical workflow, supported by the
systems are beginning to transform healthcare theoretical, methodological, empirical, and
practices. They offer significant potential for pragmatic perspectives from experts in the
transforming the quality of patient care as field.
well as enabling patient to become agents of
change in managing their own heath. Usability ??Questions for Discussion
challenges continue to provide significant 1. What role do the theories of HCI and
impediments to productive use of technology cognitive science play in providing insight
and efficient workflow. The focus of this chap- into principles of system usability, as well
ter has been on methods of usability evalua- as the design of a safer workplace?
tion. There is a wealth of different methods 2. A large urban hospital is planning to
available for researchers and professional implement a provider order entry system.
usability analysts to deploy in view to opti- You have been asked to advise them on
mize the user experience. The methods have system usability and to study the cogni-
contributed to a growing body of knowledge tive effects of the system on performance.
to inform user-centered design and best prac- Discuss the issues involved and suggests
tices across the range of health technologies. some of the steps you would take to study
system usability.
nnSuggested Readings 3. What are the primary differences between
Carroll, J. M. (2003). HCI models, theories, and analytic usability evaluation methods and
frameworks: Toward a multidisciplinary sci- usability testing?
ence. San Francisco: Morgan Kaufmann. An 4. What are some of the considerations for
edited volume on the theoretical foundations choosing analytic approaches for usability
of HCI. evaluation?
Norman, D. A. (1993). Things that make us 5. How does usability impact clinical work-
smart: Defending human attributes in the age flow? How can we provide better cognitive
of the machine. Reading: Addison-Wesley support for workflow?
Human-Computer Interaction, Usability, and Workflow
171 5
References Burrows, D., & Kendall, S. (1997). Focus groups: What
are they and how can they be used in nursing and
health care research? Social Sciences in Health, 3,
Aarts, J., Ash, J., & Berg, M. (2007). Extending the
244–253.
understanding of computerized physician order
Carayon, P., Karsh, B., Cartmill, R., Hoonakker, P.,
entry: Implications for professional collaboration,
Hundt, A. S., Krueger, D., & Wetterneck, T. (2010).
workflow and quality of care. International Journal
Incorporating health information technology into
of Medical Informatics, 76(Suppl 1), S4–S13.
workflow redesign-summary report. Rockville:
Agrawal, A. (2009). Medication errors: Prevention using
Agency for Healthcare Research and Quality.
information technology systems. British Journal of
Carayon, P., Xie, A., & Kianfar, S. (2014). Human fac-
Clinical Pharmacology, 67(6), 681–686.
tors and ergonomics as a patient safety practice.
Allen, M., Currie, L. M., Bakken, S., Patel, V. L., &
BMJ Quality and Safety, 23(3), 196–205.
Cimino, J. J. (2006). Heuristic evaluation of paper-
Card, S. K., Moran, T. P., & Newell, A. l. (1980). The
based Web pages: A simplified inspection usability
keystroke-level model for user performance time
methodology. Journal of Biomedical Informatics,
with interactive systems. Communications of the
39(4), 412–423. https://doi.org/10.1016/j.
ACM, 23(7), 396–410. https://doi.
jbi.2005.10.004.
org/10.1145/358886.358895.
Annett, J., & Duncan, K. D. (1967). Task analysis and
Card, S. K., Newell, A., & Moran, T. P. (1983). The psy-
training design. Occupational Psychology, 41, 211–
chology of human-computer interaction. Hillsdale:
221.
Lawrence Erlbaum Associates.
Ash, J. S., Gorman, P. N., Lavelle, M., Payne, T. H.,
Carroll, J. M. (2003). HCI models, theories, and frame-
Massaro, T. A., Frantz, G. L., & Lyman, J. A.
works: Toward a multidisciplinary science. San
(2003). A cross-site qualitative study of physician
Francisco: Morgan Kaufmann.
order entry. Journal of the American Medical
Caulton, D. A. (2001). Relaxing the homogeneity
Informatics Association, 10(2), 188–200.
assumption in usability testing. Behaviour &
Ash, J. S., Berg, M., & Coiera, E. (2004). Some unin-
Information Technology, 20, 1–7.
tended consequences of information technology in
Chan, C. V., & Kaufman, D. R. (2011). A framework for
health care: The nature of patient care information
characterizing eHealth literacy demands and barri-
system-related errors. Journal of the American
ers. Journal of Medical Internet Research, 13(4), e94.
Medical Informatics Association, 11(2), 104–112.
Chipman, S. F., Schraagen, J. M., & Shalin, V. L. (2000).
https://doi.org/10.1197/jamia.M1471.
Introduction to cognitive task analysis. In J. M.
Assila, A., de Oliveira, K. M., & Ezzedine, H. (2014).
Schraagen, S. F. Chipman, & V. J. Shute (Eds.),
Towards qualitative and quantitative data integra-
Cognitive task analysis (pp. 3–23). Mahwah:
tion approach for enhancing HCI quality evalua-
Lawrence Erlbaum Associates.
tion. In M. Kurosu (Ed.), Human-computer
Chung, P., Zhang, J., Johnson, T., & Patel, V. (2003). An
interaction. theories, methods, and tools (pp. 469–
extended hierarchical task analysis for error
480). Cham: Springer International Publishing.
prediction in medical devices. AMIA Annu Symp
Ben-Assuli, O., Shabtai, I., & Leshno, M. (2015). Using
Proc, 2003, 165–169.
electronic health record systems to optimize admis-
Cimino, J. J., Li, J., Mendonça, E. A., Sengupta, S.,
sion decisions: The creatinine case study. Health
Patel, V. L., & Kushniruk, A. W. (2000). An evalua-
Informatics Journal, 21(1), 73–88.
tion of patient access to their electronic medical
Blandford, D., & Furniss, D. (2006). DiCoT: A method-
records via the World Wide Web. Paper presented at
ology for applying distributed cognition to the
the Proceedings of the AMIA Symposium.
design of teamworking systems. In S. W. Gilroy &
Collins, S., Couture, B., Dykes, P., Schnock, K.,
M. D. Harrison (Eds.), Interactive systems: Design,
Knaplund, C., Frank Chang, F., & Cato, K. (2018).
specification, and verification (pp. 26–38). Berlin:
Quantifying and visualizing nursing flowsheet docu-
Springer.
mentation burden in acute and critical care. Paper
Boockvar, K. S., Santos, S. L., Kushniruk, A., Johnson,
presented at the AMIA 2018 Annual Symposium,
C., & Nebeker, J. R. (2011). Medication reconcilia-
San Francisco.
tion: Barriers and facilitators from the perspectives
Cohen, T., Blatter, B., Almeida, C., Shortliffe, E., &
of resident physicians and pharmacists. Journal of
Patel, V. (2006). Distributed cognition in the
Hospital Medicine, 6(6), 329–337.
Psychiatric Emergency Department: A cognitive
Brooke, J. (1996). SUS: A “quick and dirty” usability
blueprint of a collaboration in context. Artificial
scale. In P. W. Jordan, B. Thomas, & I. L. McClell
Intelligence in Medicine; 37:73–83.
(Eds.), Usability evaluation in industry (pp. 189–
Cooke, N. J. (1994). Varieties of knowledge elicitation
195). London: Taylor & Francis.
techniques. International Journal of Human-
Burke, H. B., Sessums, L. L., Hoang, A., Becher, D. A.,
Computer Studies, 41, 801–849.
Fontelo, P., Liu, F., et al. (2014). Electronic health
Dalai, V. V., Khalid, S., Gottipati, D., Kannampallil,
records improve clinical note quality. Journal of the
T. G., John, V., Blatter, B., et al. (2014). Evaluating
American Medical Informatics Association, 22(1),
the effects of cognitive support on psychiatric clini-
199–205.
172 V. L. Patel et al.
cal comprehension. Aritificial Intelligence in design and use through layers of distributed cogni-
Medicine, 62(2), 91–104. tion: How a glucometer is coupled with its context.
Denton, C. A., Soni, H. C., Kannampallil, T. G., Journal of Biomedical Informatics, 53, 330–341.
Serrichio, A., Shapiro, J. S., Traub, S. J., & Patel, V. Gibbs, A. (1997). Focus groups. Social Research Update,
L. (2018). Emergency Physicians’ Perceived 19, 1–7.
Influence of EHR Use on Clinical Workflow and Gray, W. D., & Boehm-Davis, D. A. (2000). Milliseconds
Performance Metrics. Applied clinical informatics, matter: An introduction to microstrategies and to
9(03), 725–733. their use in describing and predicting interactive
Duncan, B., Zheng, L., Furniss, S., Solomon, A., behavior. Journal of Experimental Psychology:
Doebbeling, B., Grando, M., et al. (2018a). In search Applied, 6(4), 322.
of vital signs: A comparative study of EHR documen- Grobe, S. J., Drew, J. A., & Fonteyn, M. E. (1991). A
tation. Paper presented at the AMIA 2018 Annual descriptive analysis of experienced nurses’ clinical
MacKenzie, I. S., Zhang, S. X., & Soukoreff, R. W. Association, 8(4), 324–343. https://doi.org/10.1136/
(1999). Text entry using soft keyboards. Behaviour jamia.2001.0080324.
and Information Technology, 18, 235–244. Patel, V. L., Kannampallil, T. G., & Kaufman, D. R.
Mason, J. (2002). Qualitative researching. London: Sage (2015). Cognitive informatics for biomedicine:
Publications. Human computer interaction in healthcare. Cham:
McDonald, C. J., Callaghan, F. M., Weissman, A., Springer.
Goodwin, R. M., Mundkur, M., & Kuhn, T. (2014). Payne, T. H., Corley, S., Cullen, T. A., Gandhi, T. K.,
Use of internist’s free time by ambulatory care Harrington, L., Kuperman, G. J., et al. (2015).
Electronic Medical Record systems. JAMA Internal Report of the AMIA EHR-2020 Task Force on the
Medicine, 174(11), 1860–1863. status and future direction of EHRs. Journal of the
Middleton, B., Bloomrosen, M., Dente, M. A., Hashmat, American Medical Informatics Association, 22(5),
B., Koppel, R., Overhage, J. M., et al. (2013). 1102–1110.
Software Engineering
for Health Care
and Biomedicine
Adam B. Wilcox, David K. Vawdrey, and Kensaku Kawamoto
Contents
References – 203
and develop a treatment plan. John is admitted by a study coordinator. Because GHS investiga-
to the hospital, and physicians, nurses, and oth- tors are conducting the study, relevant parts of
ers caring for him access his longitudinal medi- John’s EHR are easily shared with the clinical
cal records and document new observations and trials management system.
treatments. They are also able to electronically This fictional case study highlights many of
reconcile his outpatient prescriptions with his the current opportunities for improving health
inpatient medications to ensure continuity. care delivery, including improved access to
After a brief hospital stay, John is stabilized care, increased patient engagement, shared
and ready to be discharged, with an updated list patient-provider decision-making, better care
of medications. management, medication reconciliation,
Because Dr. Stark is listed as John’s pri- improved transitions of care, population
mary care physician, she is notified electroni- health management, and research recruitment.
cally of both the hospital admission and In the case study, each of these goals required
6 discharge. She reviews his discharge summary software to make health information accessible
in the EHR and instructs her staff to send a to the correct individuals at the proper time.
message through the patient portal to John, to In today’s health care system, few individ-
let him know she reviewed his inpatient record uals enjoy the interaction with software
and to schedule a follow-up appointment. depicted in the John Miller case study.
The GMS EHR is also part of a statewide Although the functions described in the sce-
health information exchange (HIE), which nario exist at varying levels of maturity, most
allows medical records to be easily shared with health care delivery institutions have not con-
health care providers outside the GMS system. nected all the functions together as described.
This means that if John should need to visit a The current role of software engineering in
hospital, emergency department or specialty health care is therefore two-fold: to design and
care clinic outside the GMS network, his record implement software applications that provide
would be available for review and any informa- required functions, and to connect these func-
tion entered by these outside providers would be tions in a seamless experience for both the cli-
similarly available to Dr. Stark and others nicians and the patients.
within the GMS network. The local and state The case study highlights the usefulness of
health departments where John lives are also several functions provided by health care soft-
linked to the HIE. This allows clinics, hospitals ware applications for clinicians, patients, and
and labs to electronically submit information to administrators. Some of these functions
the health departments for disease surveillance include:
and case reporting purposes. 1. Acquiring and storing data
Back at home, John’s wife, Gina, is able to 2. Summarizing and displaying data
view his medical records on the GHS patient 3. Facilitating communication and informa-
portal because he has granted her proxy access tion exchange
to his account. This allows her to see notes from 4. Generating alerts, reminders, and other
Dr. Stark and schedule appointments. Gina also forms of decision support
views the hospital discharge instructions that 5. Supporting educational, research, and
were electronically sent to John’s patient record. public health initiatives
As she reviews the information about diabetes
that GHS had automatically linked to John’s
record, Gina sees a notification about a clinical 6.2.2 Acquiring and Storing Data
research study involving genetic links with dia-
betes. Concerned about their two children, Gina The amount of data needed to describe the
discusses the study with John, and they review health and health care of even a single person
more online materials about the study. is huge. Health professionals require assis-
Interested in the possible benefits of the tance with data acquisition to deal with the
research, John electronically volunteers to par- data that must be collected and processed.
ticipate in the study, and he is later contacted One of the first uses of computers in a medi-
Software Engineering for Health Care and Biomedicine
181 6
cal setting was the automatic analysis of 6.2.3 Summarizing
blood specimens and other body fluids by and Displaying Data
instruments that measure chemical concentra-
tions or that count cells and organisms. These Computers are well suited to performing
systems generated printed or electronic results tedious and repetitive data-processing tasks,
to health care workers and identified values such as collecting and tabulating data, com-
that were outside normal limits. Computer- bining related data, and formatting and pro-
based patient monitoring that collected physi- ducing reports. They are particularly useful
ological data directly from patients were for processing large volumes of data.
another early application of computing tech- Data acquired by computer systems can be
nology (see 7 Chap. 19). These systems pro- detailed and voluminous. Data analysis sys-
vided frequent, consistent collection of vital tems must aid decision makers by reducing
signs, electrocardiograms (ECGs), and other and presenting the intrinsic information in a
indicators of patient status. More recently, clear and understandable form. Software can
researchers have developed medical imaging be used to create useful visualizations that
applications as described in 7 Chaps. 9 and facilitate trend analysis and compute second-
20, including computed tomography (CT), ary parameters (e.g., means, standard devia-
magnetic resonance imaging (MRI), and digi- tions, rates of change) to help identify
tal subtraction angiography. The calculations abnormalities. Clinical research systems have
for these computationally intensive applica- modules for performing powerful statistical
tions cannot be performed manually; comput- analyses over large sets of patient data. When
ers are required to collect and manipulate employing such tools, research investigators
millions of individual observations. should have insight into the methods being
Early computer-based medical instru- used. For clinicians, graphical displays are
ments and measurement devices provided useful for interpreting data and identifying
results only to human beings. Today, most trends.
instruments can transmit data directly into the Fast retrieval of information is essential
EHR, although the interfaces can still be awk- for any computer system. Data must be well
ward and poorly standardized (see 7 Chaps. 5 organized and indexed so that information
and 8). Computer-based systems that acquire recorded in an EHR system can be easily
information directly from patients are also retrieved. Here the variety of users must be
data-acquisition systems; they free health pro- considered. Obtaining recent information
fessionals from the need to collect and enter about a patient entering the office differs from
demographic and health history information. the needs that a research investigator will have
Various departments within a hospital use in accessing the same data. The query inter-
computer systems to store clinical data. For faces provided by EHRs and clinical research
instance, clinical laboratories use information systems assist researchers in retrieving perti-
systems to keep track of orders and specimens nent records from the huge volume of patient
and to report test results; most pharmacy and information. Recently, there has been increas-
radiology departments use computers to per- ing industry adoption of the Health Level 7
form analogous functions. Their systems may International (HL7) Fast Healthcare
connect to outside services (e.g., pharmacy Interoperability Resources (FHIR) standard
systems are typically connected to one or for sharing data on a patient-by-patient basis.
more drug distributors so that ordering and The FHIR standard is being adapted for
delivery are rapid and local inventories can be population- level data sharing through the
kept small). By automating processing in FHIR Bulk Data initiative. As discussed in
areas such as these, health care facilities are 7 Chap. 21, bibliographic retrieval systems
able to provide efficient service, reduce labor are also an essential component of health
costs, and minimize errors. information services.
182 A. B. Wilcox et al.
the software product (usually an application). ware. In-house development can have less
SDLC is a subset of the systems development detailed requirements, as the contract to build
life cycle, focusing on the software component the software is with the organization itself,
of a larger system. In practice, and particu- and can allow some evolution of the require-
larly in heath care, software development ments as the project progresses. However, the
encompasses more than just the software, more flexibility that is allowed and the longer
often stretching into areas such as process re- changes or enhancements are permitted, the
engineering in order to maximize the benefits higher the likelihood of “scope creep” causing
of the software product. Although SDLC schedule and cost overruns.
most literally applies to an in-house develop- Other tasks performed during analysis
ment project, all or most of the life cycle include an examination of existing products
framework is also relevant to shared develop- and potential alternative solutions, and, par-
ment and even purchase of commercial off- ticularly for large projects, a cost/benefit anal-
6 the-shelf (COTS) software. The following is ysis. A significant, and frequently overlooked,
an overview of the phases of the SDLC. aspect of the planning and analysis phase is to
determine outcome measures that can be used
6.3.1.1 Planning/Analysis during the life cycle to demonstrate progress
The software development life cycle begins and evaluate success or failure of the project.
with the formation of a project goal during These measures can be refined and details
the planning phase. This goal typically derives added as the project progresses. The planning
from an organization’s or department’s mis- and analysis phase typically ends when a deci-
sion/vision, focusing on a particularly need or sion to proceed is made, along with at least a
outcome. This is sometimes called project rough plan of how to implement the next
conceptualization. Planning includes some steps in the SDLC. If the organization decides
initial scoping of the project as well as resource to purchase a solution, a request for proposals
identification (including funding). It is impor- (RFP) that contains the requirements docu-
tant to address what is not included in the ment is released to the vendor community.
project in order to create appropriate expecta- The planning and analysis stage of soft-
tions for the final product. A detailed analysis ware development is perhaps both the most
of current processes and needs of the target difficult and the most important stage in the
users is often done. As part of the analysis, development lifecycle as it is applied to health
specific user requirements are gathered. care. Requirements for software in health care
Depending on the development process, this are inherently difficult to define for many rea-
might include either detailed instructions on sons. Health care practice is constantly chang-
specific functions and operating parameters ing, and as new therapies or approaches are
or more general user stories that explain in discovered and validated, these new
simple narrative the needs, expected workflow advancements can change the way care is
and outcomes for the software. It is important delivered. In addition, the end-users of health
that users of the system are consulted, as well care software are comparatively advanced
as those in the organization who will imple- relative to other industries. Unlike industries
ment and maintain the software. The decision where front-line workers may be directed by
of whether to develop the software in-house, supervisors with more advanced training and
partner with a developer, or purchase a ven- greater flexibility in decision making, in health
dor system will likely determine the level of care the front-line workers are often physi-
detail needed in the requirements. Vendors cians, who are often the most highly-trained
will want very specific requirements that allow workers in the system (although not necessar-
them to properly scope and price their work. ily the most advanced with respect to com-
The requirements document will usually puter literacy) and require the greatest
become part of a contract with a vendor and flexibility for decisions. This flexibility makes
will be used to determine if the final product it difficult to define workflows or even get
meets the agreed specification for the soft- indications of the workflows being followed,
Software Engineering for Health Care and Biomedicine
185 6
since physicians will not always make explicit details, such as security, performance, and
what actions or plans are being pursued. This internationalization are also addressed during
flexibility is important for patient care, design. Analysts with domain knowledge in
because it allows front-line clinicians to adapt the target environment are often employed
appropriately to different settings, staffing lev- during this phase in order to translate user
els, and specialties. The need for flexibility is requirements into suitable proposals. Simple
such that defining requirements for software mock-ups of the proposed system may be
that could reduce flexibility is criticized as developed, particularly for user-facing com-
“cookbook” medicine, constituting a com- ponents, in order to validate the design and
mon reason for resistance to software adop- identify potential problems and missing infor-
tion. However, this resistance is not just mation. Closely related to this, an integrated,
characteristic of software – clinical guidelines automated testing architecture, with appro-
and other approaches to structured or formal- priate testing scripts/procedures, may be
ized care processes can also be criticized, and designed in this phase in order to ensure the
the challenge of applying discovered software being developed meets quality stan-
knowledge to clinical care processes remains dards and is responsive to the requirements.
difficult. The depth and completeness of the design is
Over time, however, there have been some contingent on the software development pro-
successful efforts that have defined standard cess, as well as other factors. In some cases,
requirements for health information software. the entire design is completed before moving
Among the most notable efforts have been in on to software coding. In other development
EHRs, where organizations have created lists strategies, a high-level system architecture is
of requirements and certified systems that designed but the details of the software com-
match those requirements. The Certification ponents are delayed until each component or
Commission for Health Information component feature is being created. The pros
Technology (CCHIT) began in 2004 and and cons of these approaches are discussed
defined criteria for electronic health records’ later in this chapter. For vendor-developed
functionality, interoperability and security systems, the purchasing organization will
(Leavitt and Gallagher 2006). Later, the certi- often hold design reviews and demonstrations
fication approach was adopted by the Office of mock-ups or prototypes with the vendor to
of the National Coordinator of Health assess the solutions. In the case of COTS soft-
Information Technology (ONC) in 2010, ware, the purchasing organization relies on
when a list of EHR functions that were most the vendor’s system description and reviews
related to “meaningful use” of EHRs was from third parties, supplemented by system
established (Blumenthal and Tavenner 2010). demonstrations, to determine the appropri-
Such “meaningful use” of EHRs came with ateness of the design. As with the Analysis
significant financial incentives administered phase, it is important to include the target
by the Centers for Medicare and Medicaid users and IT operations personnel in the
Services (CMS), leading to a rapid increase in design reviews.
the adoption of EHRs meeting these require- Ideally, the software could be designed
ments (Washington et al. 2017). solely around the care requirements and the
use of information. However, rarely are the
6.3.1.2 Design clinical requirements of the use case the only
During the Design phase, potential software consideration. In the design phase, other
solutions are explored. System architectures requirements are considered, such as the soft-
are examined for their abilities to meet the ware cost and how it integrates with an exist-
needs stated in the requirements. Data storage ing health IT strategy of an organization.
and interface technologies are assessed for Resources applied to a development project
appropriate fit. User front-end solutions are are not available for other potential projects,
investigated to assess capabilities for required so costs are always influential. The design
user input and data display functions. Other phase must consider various alternatives to
186 A. B. Wilcox et al.
meet the most important requirements, recog- software, to complete development of docu-
nizing trade-offs and contingency approaches. mentation templates, order sets, clinical deci-
Additional considerations are how the soft- sion support rule, reports, and so on. In fact,
ware will support long-term needs, not just configuration can be so considerable that
the immediate requirements that have been institutions may use an internal brand name
identified. Clinicians and clinical workflow for the software and configuration project
analysts are often the primary participants in that is different from the name of the COTS
the requirements analysis stage, whereas software, which represents their local configu-
informaticians are more prominent in the ration. This configuration is often done using
design phase. This is because during this latter tools built specifically for the commercial soft-
phase the clinical goals and strategies are con- ware, which facilitate the integration of the
sidered together with what can be vastly dif- configuration products into the software
ferent design approaches, and the ability to infrastructure. The tools can be complex,
6 consider the various strengths and weaknesses requiring significant training for developers.
of these different approaches is critical. Often, Typically, tools work well for basic configura-
design considerations are between custom tion and may also have advanced functional-
development, purchasing niche applications, ity that can be used to configure more
or purchasing components of a monolithic complicated functionality. The most intensive
EHR. The considerations of development time investment for configuration is typically
versus COTS software is discussed in more when the tools do not directly support certain
detail in the Acquisition Strategy section configurations, and developers must find
(7 6.3.3.1) below. approaches to creatively adapt the develop-
ment “around the tools.”
6.3.1.3 Development
Coding of the software is done during the 6.3.1.4 Integration and Test
Development phase of the SDLC. The soft- For complex software projects consisting of
ware engineers use the requirements and sys- several components and/or interfaces with
tem designs as they program the code. outside systems, an Integration phase in the
Analysts help resolve questions about require- SDLC is employed to tie together the various
ments and designs for the programmers when pieces. Some aspects of the software integra-
it is unclear how software might address a tion are likely done during the Development
particular feature. The software process phase by simulating or mocking the outputs
defines the pace and granularity of the devel- to, and inputs from, other systems. During
opment. In some cases, an entire software Integration, these connections are finalized.
component or system is developed at once by Simulations are run to demonstrate functional
the team. In other cases, the software is bro- integration of the various system components.
ken down into logical pieces and the program- Once the various components are integrated,
mers only work on the features that are a thorough testing regimen is conducted in
relevant to the piece they are currently work- order to prove the end-to-end operation of
ing on. As software components are com- the entire software system. Specific test sce-
pleted, unit tests are run to confirm the narios are run with known inputs and expected
component is free of known bugs and pro- outputs. This is typically done in a safe, non-
duces expected outputs or results. operational environment in order to avoid
In health care, development includes cod- conflicts and unnecessary risk to production
ing of custom software as well as configura- environments, although some inbound infor-
tion of COTS software. Health care practices mation from live systems may be used to ver-
across institutions (and even within larger ify scenarios that are difficult to simulate.
organizations) are so variable that all software Testing and integration in health care are
requires some – often substantial – configura- similar to other complex environments, in that
tion. Configuration can range from assigning it can be difficult to create a testing environ-
local values to generic variables within the ment that matches the dynamics of the
Software Engineering for Health Care and Biomedicine
187 6
real-world setting. Generally, testing is done tion, or if the efficiency does not improve
around multiple use cases or case studies, quickly enough after the initial implementa-
using data to support the cases. In a produc- tion, they may choose to disregard the soft-
tion environment, however, there may be data ware or even revolt against its implementation.
and information that do not match the case There have been prominent examples in bio-
studies, since both people and health care are medical informatics of software implementa-
complex. As a result, internally-developed tions failing during implementation (Bates
applications are often provisionally used in a 2006; Smelcer et al. 2009; Sullivan 2017), and
“pilot” phase as part of testing. For COTS even studies demonstrating harm (Han et al.
software, companies may use simulation labo- 2005). Because of these risks, health IT pro-
ratories that try to mimic the clinical environ- fessionals need to be flexible in implementa-
ment, or work with specific health care tion, and adapt the implementation strategies
organizations as development and testing to how the system is adopted. Users have been
partners. Later, however, this can lead to chal- shown to use health IT software in different
lenges if data representing the dynamics of ways for different benefits, and may need
one organization are not easily transferable, incentives or prodding to advance to different
and software must be further tested with new levels of use.
environments. Software transferability
between institutions has been demonstrated 6.3.1.6 Verification and Validation
in studies, even for specific applications To ensure that the software satisfies the origi-
(Hripcsak et al. 1998). Another challenge is nal requirements for the system and meets the
that with current privacy laws, organizations need of the organization, a formal verification
are more reluctant to release data to vendors and validation of the software is performed.
for testing. The implementing organization verifies that
the software has the features and performs all
6.3.1.5 Implementation the functions specified in the requirements
Once the software passes integration testing it document. The software is also validated to
moves to the implementation phase. In this show that it performs according to specified
phase, the software is installed in the live envi- operational requirements, that it produces
ronment. In preparation for installation, valid outputs, and that it can be operated in a
server hardware, user devices, network infra- safe manner. For purchased software, the ver-
structure, changes specific to individual facili- ification and validation phase is used by the
ties, etc., may need to be implemented and purchasing organization in order to officially
tested as well. In addition, user training will accept the software.
be performed in the weeks before the software Since clinicians often use software at dif-
goes live. Any changes to policies and proce- ferent levels or in different ways, tracking
dures required by the software will also be patterns of use can be an important approach
implemented in the build-up to installation. for verification and validation of software in
Health care presents interesting consider- health care. Additionally, because they have
ations in each phase of the software develop- experience working in complicated environ-
ment cycle, but the challenges have been more ments, users can be good at identifying incon-
visible in implementation than any other sistencies in data or software functions. Two
phase. This may be because health IT, while approaches that have been used and can be
intended to facilitate more efficient workflows successful for validation are monitoring use,
with information, is still disruptive. Disruption and facilitating user feedback.
happens most during implementation, when
clinicians actually begin using the software, 6.3.1.7 Operations and Maintenance
and studies have shown that during this time Software eventually enters an operations and
clinical productivity does decline (Shekelle maintenance (O&M) phase where it is being
et al. 2006). If users do not perceive that the regularly used to support the operational
benefits are sufficient to justify this disrup- needs of the organization. During this phase,
188 A. B. Wilcox et al.
an O&M team will ensure that the software is Verification and Validation or O&M, a sum-
operating as desired and will be fielding the mative evaluation may be performed to assess
support needs of the users. Updates may need the outcome effects, organizational impact,
to be installed as new versions of the software and cost-benefit of the software solution.
are released. This may require new integration Health IT is considered an intervention
and testing, implementation, and verification into the health care delivery system, so evalu-
and validation steps. Ongoing training will be ations have been done and published as com-
required for new users and system updates. parative studies in the clinical literature (Bates
The O&M team may conduct regular security et al. 1998; Campanella et al. 2016; Evans
reviews of the system and its use. Data reposi- et al. 1998; Hunt et al. 1998; Jones et al. 2014).
tories and software interfaces will be moni- These evaluations, and syntheses of multiple
tored for proper operation and continued studies, have identified areas of impact and
information validity. Software bugs and fea- areas where the effect of health IT software is
6 ture enhancement requests will be collected. inconsistent. Researchers have also noted that
These may drive an entire new development most of these studies have occurred in institu-
life cycle as new requirements persuade an tions where software was developed internally,
organization to explore significant upgrades with disproportionate under-representation
to its current software or even an entirely new of COTS software systems in evaluations,
system. especially considering that most health care
Maintenance is a demanding task in health institutions use COTS rather than internal
information software. It involves correcting development (Chaudhry et al. 2006). It is
errors; adapting configurations and software hoped that the existing evaluations can be a
to growth, new standards, and new regula- model for software evaluations of COTS, to
tions; and linking to other information clarify their impact on care.
sources. Maintenance tasks can exceed by
more than double the initial acquisition costs,
making it a substantial consideration that 6.3.2 Software Development
should affect software design. COTS suppliers Models
often provide maintenance services for
15–30% of the purchase price annually, but Different software development processes or
custom development or configuration mainte- methods can be used in an SDLC. The soft-
nance must be supported by the purchasing ware development process describes the day-
organization. If the software is not main- to-
day methodology followed by the
tained, it can quickly become unusable in a development team, while the life cycle
health care setting. Indeed, optimization of describes a higher-level view that encompasses
COTS EHRs is a central and ongoing focus aspects that take place well before code is ever
of applied clinical informatics, and this is written and after an application is in use. The
likely to continue for the foreseeable future. following are two of the most common exam-
ples of different development processes in
6.3.1.8 Evaluation clinical information systems development.
An important enhancement to the SDLC sug-
gested by Thompson et al. (1999) is the inclu- 6.3.2.1 Waterfall Model
sion of an evaluation process during each of The Waterfall model of software development
the phases of the life cycle. The evaluation is suggests that each step in the process happens
influenced by risk factors that may affect a par- sequentially, as shown in . Fig. 6.1. The term
ticular SDLC segment. An organization might “Waterfall” refers to the analogy of water cas-
perform formative evaluations during each cading downward in stages. A central concept
phase, depending on specific needs, in order to of the Waterfall methodology is to solidify all
assess the inputs, processes and resources of the requirements, establish complete func-
employed during development. During tional specifications, and create the final soft-
Software Engineering for Health Care and Biomedicine
189 6
Requirements
Design
Implementation
Verification
Maintenance
ware design prior to performing programming throughout the process. In 2001, a group of
tasks. This concept is referred to as “Big software developers published the Manifesto
Design Up Front,” and reflects the thinking for Agile Software Development, which
that time spent early-on making sure require- emphasizes iterative, incremental develop-
ments and design are correct saves consider- ment and welcomes changes to software
able time and effort later. Steve McConnell, requirements even late in the development
an expert in software development, estimated process (Beck et al. 2001).
that “...a requirements defect that is left unde- Agile development eschews long-term
tected until construction or maintenance will planning in favor of short iterations that usu-
cost 50–200 times as much to fix as it would ally last from 1 to 4 weeks. During each itera-
have cost to fix at requirements time” tion, a small collaborative team (typically
(McConnell 1996). 5–10 people) conducts planning, requirements
The waterfall model provides a structured, analysis, design, coding, unit testing, and
linear approach that is easy to understand. acceptance testing activities with direct
Application of the model is best suited to soft- involvement of a customer representative.
ware projects with stable requirements that Multiple iterations are required to release a
can be completely designed in advance. In product, and larger development efforts
practice, it may not be possible to create a involve several small teams working toward a
complete design for software a priori. common goal. The agile method is value-
Requirements and design specifications can driven, meaning that customers set priorities
change even late in the development process. at the beginning of each iteration based on
Clients may not know exactly what require- perceived business value.
ments they need before reviewing a working Agile methods emphasize face-to-face
prototype. In other cases, software developers communication over written documents.
may identify problems during the implemen- Frequent communication exposes problems
tation that necessitate reworking the design or as they arise during the development process.
modifying the requirements. Typically, a formal meeting is held each morn-
ing during which team members report to
6.3.2.2 Agile Models each other what they did the previous day,
In contrast to the Waterfall model, modern what they intend to do today, and what their
software development approaches have roadblocks are. The brief meeting, sometimes
attempted to provide more flexibility, particu- called a “stand-up,” “scrum,” or “huddle,”
larly in terms of involving the customer usually lasts 5–15 minutes, and includes the
190 A. B. Wilcox et al.
100%
75%
50%
25%
0
2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2017
100%
75%
50%
25%
0
2008 2009 2010 2011 2012 2013 2014 2015 2016 2017
All (Basic*) Small (Basic*) Rural (Basic*) Critical Access (Basic*) All (Certified EHR^)
.. Fig. 6.3 Non-federal acute care hospital EHR adoption, from ONC (2017b)
.. Fig. 6.4 Health IT developers product use by hospitals participating in the Medicare EHR, from ONC (2017a)
Software Engineering for Health Care and Biomedicine
193 6
when the needs are not standard components n” approach alleviates one of the main barri-
o
of EHR software. ers to the user, by facilitating the login and
After a commercial product is selected, an patient selection, while retaining all the func-
organization must then choose how extensive tionality of each system.
the software will be. EHR companies typically A deeper level of integration is at the
have a core EHR system, with additional application view. In this case, a primary appli-
modules that have either been developed or cation provides a portal that views another
acquired and integrated into their system. The application; the second application shares
set of modules used by each institution varies. user and patient context and is accessible
One organization may use the core EHR sys- through the user’s main workflow system. The
tem and accompanying modules for certain second application may use data from the pri-
specialties, such as internal medicine and fam- mary application and/or other data sources.
ily practice, while choosing to purchase sepa- Rapidly expanding in adoption for this type
rate best-of breed software for other of integration is the HL7 Substitutable
specialties, like obstetrics/gynecology and Medical Applications and Reusable
emergency medicine, even when the core EHR Technologies (SMART) framework, in par-
vendor has functional modules for those ticular in combination with HL7 Fast
areas. Another organization may choose to Healthcare Interoperability Resources
purchase and implement all specialty systems (FHIR) for data interchange. This approach,
offered from the core EHR vendor, and only known as SMART on FHIR, is enabling an
purchase other software if a similar module is ecosystem wherein applications developed by
not available from the vendor. These decisions health care organizations or third-party ven-
also must be made for all ancillary systems, dors can be seamlessly integrated within the
including laboratory, pharmacy, radiology, EHR (Kawamoto et al. 2019; Mandel et al.
etc. This is both a pre-implementation 2016).
decision and a long-term strategy. Once the The deepest level of integration is where l
EHR is implemented, many specialties that data elements from one system are also stored
were not included in the initial implementa- in the other system. With this approach, one
tion plan may request software and data inte- system is determined to be the main reposi-
gration, depending on the success of the EHR tory, and data from the other systems are
implementation. automatically stored into the repository. This
For organizations that choose components approach has the advantage of the most com-
of multiple vendor offerings to any degree, plete use of data, e.g., decision support logic
they will need to address how to integrate the can use data from multiple systems, which can
components together to minimize disruption be more accurate. The disadvantage is that the
to the users’ workflow. There are various strat- integration can be expensive, requiring new
egies that can be pursued to integrate mod- interfaces for each integrated system.
ules, either at the level of user context (user Another and often overlooked consider-
authentication credentials are maintained), ation of EHR software modules is data ana-
the level of the application view (one applica- lytics capabilities, usually discussed in
tion is viewable as a component within conjunction with a data warehouse. EHR sys-
another application), or at the level of data tems generally include reporting functional-
sharing (data are exchanged between the ity, where specific reports can be configured to
applications). If components are not inte- summarize and display data stored in the sys-
grated, a user must access each application tem. However, these systems often do not
separately, by opening the software applica- facilitate ad hoc data extracts that are com-
tion, logging in to each separately, and select- monly needed for more complicated data
ing the patient within each. When data are analysis. Additionally, if modules from multi-
integrated at the user context, a user moves ple software vendors are used, the data report-
between both applications, but the user and ing functions will be limited according to how
patient context are shared. This “single sign- well data are integrated. One typical approach
194 A. B. Wilcox et al.
is to use a separate data warehouse and analy- related specialties. However, over time there
sis system, with functions to create ad hoc was disproportionate development in these
reports, that can combine data from multiple areas, and clinicians in other specialties com-
systems. Data integration with warehouses is plained about the rudimentary functionality,
less expensive than with repositories, because especially when compared to existing vendor
the data do not need to be synchronized. systems for their specialty. As a result, the
Instead, data can be extracted in batches from organization decided to purchase a new vendor
source systems, transformed to the warehouse system. This made the other specialties happy,
data model, and then loaded into the ware- but was a big concern to the groups that had
house at periodic intervals. The greatest cost been using A-Chart for years. These clinicians
of integration is the data transformation, but feared that they would have to reconfigure their
this transformation is similar to what is complicated decision support rules with a new
required when receiving data through a real- system, or worse, that functionality would no
6 time interface. longer be supported. To alleviate concerns, rep-
The Meaningful Use program, which has resentatives from each department were asked
evolved to become the Promoting to participate in both drafting a Request for
Interoperability program, has greatly influ- Proposals and then reviewing the proposals
enced the systems that are installed by an from four different vendors. Many clinicians
institution. Initially, the ONC created a list of liked System X, but in the end the hospital
important EHR functions. They also created chose System Y, which seemed to have most of
a requirement that hospitals and physician the same capabilities but was perceived to be
practices use a “certified” system – i.e., one more affordable than System X. However,
that has demonstrated it provides those func- System Y did not include a laboratory system,
tions – to receive the incentives, and other cri- so the medical center purchased a separate lab-
teria that the functions must be used in clinical oratory system and built interfaces to connect it
care (Washington et al. 2017). As a result, with the core EHR.
health care organizations rapidly imple- Patients’ Choice is an integrated delivery
mented EHRs that were certified and best ful- system with a long history of EHR use.. Years
filled regulatory requirements. ago, it existed as two separate systems of hospi-
tals and clinics. Shortly before the merger of
6.3.3.2 Case Studies of EHR Adoption these systems, the hospitals and clinics pur-
Consider the following case studies of institu- chased separate EHRs, InPatSys and CliniCare.
tions adopting EHR systems. All examples At the time of the merger, the institution felt
are fictional, but reflect the complexity of the that each would be best served by a best-of-
issues with EHR software. breed system, to support the different work-
Best-Care Medical Center had been using flows, and there was no single system that both
information systems for many years, dating sides of the organization could agree to use.
back to when some researchers in the cardiology Years later, as Patients’ Choice began to inte-
department built a small system to integrate grate care between the hospitals and clinics, the
data from the purchased laboratory and phar- clinicians and administrators became increas-
macy information systems. Eventually, the ingly frustrated at how different the InPatSys
infection control group for the hospital began and CliniCare systems were, and that they had
using the system, and contributed efforts to to use two separate systems to care for the same
expand its functionality. Other departments patients. A team was formed to evaluate the
began developing decision support rules, and the options, and the CliniCare system was eventu-
system continued to grow. Eventually, the insti- ally replaced by OutPatSys, the outpatient ver-
tution made a commitment to redevelop the sion of InPatSys. To prevent losing data as they
infrastructure to support a much larger group moved from one system to the other, the
of users and functions, and named it A-Chart. Patients’ Choice IT department prepared the
Satisfaction with the system was high where it OutPatSys system by loading existing labora-
had been initially developed, and with other tory results and vital sign measurements from
Software Engineering for Health Care and Biomedicine
195 6
CliniCare. A SMART on FHIR application available to interface externally developed
provided an integrated view of historical alerts and reminders into the EHR.3
CliniCare data inside the OutPatSys system. One consideration that is not always stated
Hometown Community Hospital histori- in the software selection process, but is signifi-
cally used various niche information systems, cant in its influence over the decision, is how
but no EHR. With the availability of an organization will pay for the application.
Meaningful Use incentives, the hospital decided In organizations where software purchases are
to acquire a commercial EHR. A leadership requested from the information technology
team visited six hospitals to investigate how department and budget, overall maintenance
various EHRs were used. Ultimately, the hospi- costs are considered more prominently, and
tal made the decision to purchase eCompu- software that integrates with and is a compo-
Chart, because it was highly rated and seemed nent of the overall EHR vendor offering is
best adapted to their needs. Hometown hired a often selected. However, if a clinical depart-
new Chief Information Officer who had recently ment has direct control over their spending for
implemented eCompuChart at a community the software, functionality may become a
hospital in a neighboring state. They also pro- more focal concern. An additional case study
moted Dr. Jones, who had recently moved from illustrates this situation.
another hospital that had also used eCompu- Downtown Hospital recently decided to pur-
Chart, to Chief Medical Information Officer chase eCompuChart as the centerpiece of its
(CMIO). The CIO and CMIO negotiated a overall clinical information system strategy.
contract with DigiHealth, a consulting com- eCompuChart has award-winning modules for
pany with experience in implementing EHRs, the emergency department and intensive care
to plan and coordinate the implementation. units. However, there are strong complaints
Among other recommendations from about its capabilities for labor and delivery
DigiHealth, most existing systems were management and radiology. After considering
replaced with modules from eCompuChart to capabilities of best-of-breed options and their
simplify maintenance. ability to integrate with eCompuChart,
In practice, organizations may not adopt a Downtown Hospital eventually made a split
complete “build” or a complete “buy” strat- decision. The labor and delivery module for
egy. EHR vendors have advanced consider- eCompuChart was purchased because other
ably in their ability to create systems that meet systems with more elaborate functionality could
common needs in health care. Still, no system not integrate data as well with the overall
exists to date that can fully address all infor- EHR. On the other hand, a separate best-of-
mation needs for an organization, in part breed system was purchased for radiology,
because the information needs expand as because interfaces between the systems were
more data and new technologies become seen as an acceptable solution for integrating
available. Additionally, EHR strategies data.
become malleable over time, as commercial
software capabilities increase and data become 6.3.3.3 Enhancing Acquired Software
more consistent. As indicated through some Although most institutions will choose to
of the examples above, organizational strate- acquire a system rather than building it from
gies may change over time to adapt to these scratch, software engineering is still required
capabilities and needs. Expanding options for to make the systems function effectively. This
health care organizations is the emergence of involves more than just installing and config-
the notion of the EHR as a platform, where uring the software to the local environment.
HL7 FHIR data interfaces can be used to There is still a significant need for software
read data from, and in some cases write data development in implementing COTS, because
back to the EHR; HL7 SMART is available to
integrate external applications into the native
EHR user interface; and more recently, a 3 7 https://cds-hooks.org/ (last accessed June 3,
framework known as HL7 CDS Hooks is 2020).
196 A. B. Wilcox et al.
(1) applications must be integrated with exist- 2. The physician orders a set of routine blood
ing systems, and (2) leading healthcare institu- tests for the patient in the inpatient EHR
tions are increasingly developing custom computerized order entry module.
applications, such as SMART on FHIR appli- 3. The request for blood work is sent elec-
cations, that supplement commercial systems. tronically to the laboratory information
system, where the blood specimen is
6.3.3.4 Integration with Existing matched to the patient using a barcode.
Systems 4. The results of the laboratory tests are sent
In all but the most basic health care informa- to the results review module of the EHR
tion technology environments, multiple soft- Message exchange is an effective means
ware applications are used for treatment, of integrating disparate software applica-
payment, and operations purposes. A partial tions in healthcare when the users rely pri-
list of applications that might be used in a marily on a single “workflow system” (e.g.,
6 hospital environment is shown in . Table 6.1. a physician uses the inpatient EHR and a
To facilitate the sharing of information laboratory technician uses the LIS).
among various software applications, standards Because message exchange is handled by a
have emerged for exchanging messages and sophisticated “interface engine” (see
defining clinical terminology (see 7 Chap. 8). 7 Chap. 8), little software development, in
Message exchange between different software the traditional sense, is typically required.
applications enables the following scenario: When a user accesses multiple workflow
1. A patient is admitted to the hospital. A systems to perform a task, message
registration clerk uses the bed manage- exchange may not be sufficient and a
ment system to assign the patient’s loca- deeper level of integration may be required.
tion and attending physician of record. For example, consider the following addi-
tion to the previously described scenario:
5. The physician reviews the patient’s blood
.. Table 6.1 Partial list of software work and notes that the patient may be
applications that may be used in a hospital suffering from renal insufficiency as evi-
setting
denced by his elevated creatinine level.
System Primary Users 6. The physician would like to review a trend
of the patient’s creatinine over the past
Inpatient EHR (Results Physicians, nurses, 3 years. Because the hospital installed their
Review, Order Entry, allied health commercial EHR less than a year ago,
Documentation) professionals
data from prior to that time are available
Pharmacy Information Pharmacists, in a legacy results review system that was
System pharmacy developed locally. The physician logs into
technicians
the legacy application (entering her user-
Laboratory Information Laboratory name and password), searches for the cor-
System technicians, rect patient, and reviews the patient’s
phlebotomists
creatinine history.
Radiology Information Radiologists,
System radiology While it may seem preferable in this scenario
technicians
to load all data from the legacy system into
Pathology Information Pathologists the new EHR, commercial applications may
System not support importing such data for various
Registration/Bed Registration staff reasons. To simplify and improve the user
Management experience for reviewing information from a
Hospital Billing System Medical coders legacy application within a commercial EHR,
one group of informaticians created the cus-
Professional Services Physicians, medical
Billing System coders
tom application shown in . Fig. 6.5. The
application is accessed by clicking a link
Software Engineering for Health Care and Biomedicine
197 6
.. Fig. 6.5 Example screen from a custom lab summary display application integrated into a commercial EHR. The
application shows a longitudinal view of laboratory results that can span multiple patient encounters
.. Fig. 6.6 Example screen from a custom billing application integrated into a commercial EHR. This replaced a
separate application that was not integrated into the clinicians’ workflow
within the commercial EHR and does not an inpatient commercial EHR. Users of the
require login or patient look-up. application were part of a physician practice
In an example of a more sophisticated that used a different outpatient EHR with a
level of “workflow integration” is shown in professional billing module with which they
. Fig. 6.6. In this example, informaticians were already familiar. When the physicians in
developed a custom billing application within the practice rounded on their patients who
198 A. B. Wilcox et al.
were admitted to the hospital, they docu- Modules (MLMs), which can be triggered by
mented their work by writing notes within the various events within the EHR (e.g., the plac-
inpatient EHR, and then used their outpa- ing of a medication order) and execute serially
tient EHR to submit their professional service as a sequence of instructions to access and
charges. This practice not only required a manipulate data and generate output. MLMs
separate login to submit a bill, but also have been used to generate clinical alerts and
required duplicate patient lists to be main- reminders, to screen for eligibility in clinical
tained in each application, as well as a dupli- research studies, to perform quality assurance
cate problem list for each patient to be functions, and to provide administrative sup-
managed in each application. The integrated port (Dupuits 1994; Jenders 2008; Jenders and
charge application was accessed from the Shah 2001; Ohno-Machado et al. 1999).
inpatient EHR but provided the same look- Although one goal of the Arden Syntax was
and-feel as the outpatient EHR billing mod- to make knowledge portable, MLMs devel-
6 ule. Charges were submitted through the oped for one environment are not easily trans-
outpatient EHR infrastructure and would ferable to another. Developers of clinical
appear as normal charges in the outpatient decision support logic require skills in both
system, with the substantial improvement of computer programming as well as medical
displaying the information (note name, knowledge representation.
author, and time) for the documentation that An example of a standalone, locally devel-
supported the charge. oped software application that relies on EHR
data is shown in . Fig. 6.7. The Web-based
6.3.3.5 Development of Custom application, EpiPortal™, provides a compre-
Applications that Supplement hensive, electronic hospital epidemiology
or Enhance Commercial decision support system. The application can
Systems be accessed from a Web browser or directly
Commercial EHRs frequently provide cus- from within the EHR. It relies on EHR data
tomers with the ability to develop custom such as microbiology results, clinician orders,
software modules. Some EHRs provide a flex- and bed tracking information to provide users
ible clinical decision support infrastructure with timely information related to infection
that allows customers to develop modules that control and prevention.
execute medical logic to generate alerts, In some cases, it is desirable to develop
reminders, corollary orders, and so on. custom applications to address specific clini-
Vendors may also provide customers with cal needs that are not met by a commercial
tools to access the EHR database, which EHR. For example, most commercial EHRs
allows development of stand-alone applica- lack dedicated tools to support patient hand-
tions that make use of EHR data. Additionally, off activities. For hospitalized patients, hand-
vendors may foster development of custom offs between providers affect continuity of
user interfaces within the EHR by providing care and increase the risk of medical errors.
an application programming interface Informaticians at one academic medical cen-
through which developers can obtain infor- ter developed a collaborative application sup-
mation on user and patient context. porting patient handoff that is fully integrated
The ability to provide patient-specific clin- with a commercial EHR (Fred et al. 2009). An
ical decision support is one of the key benefits example screen from the application is shown
of EHRs. Many commercial EHRs either in . Fig. 6.8. The application creates user-
directly support or have been influenced by customizable printed reports with automatic
the Arden Syntax for Medical Logic Modules inclusion of patient allergies, active medica-
(Pryor and Hripcsak 1993). The Arden Syntax tions, 24-hour vital signs, recent common lab-
is part of the HL7 family of standards. It oratory test results, isolation requirements,
encodes medical knowledge as Medical Logic code status, and other EHR data.
Software Engineering for Health Care and Biomedicine
199 6
.. Fig. 6.7 Example screen from a standalone, soft- right 2012 The New York and Presbyterian Hospital
ware application that relies on EHR data to provide a and Columbia University – All rights reserved. EpiPor-
comprehensive, electronic hospital epidemiology deci- tal is a trademark of The New York and Presbyterian
sion support system. (Reused with permission. Copy- Hospital.)
.. Fig. 6.8 Example screen from a custom patient reports with automatic inclusion of patient allergies,
handoff application integrated into a commercial active medications, 24-hour vital signs, recent common
EHR. The application creates user-customizable printed laboratory test results, isolation requirements, code sta-
tus, and other EHR data
application programming interface (API), third-party applications and services, will play
which uses modern Internet technologies and an important role in the health IT ecosystem
approaches for data exchange, as well as HL7 in the years to come.
SMART for application integration and HL7 Another important consideration in clini-
CDS Hooks for integrating decision support cal information systems is infrastructure to
services. While this notion of EHR as a plat- support data sharing, such as through a health
form is still in its early stages and still matur- information exchange (HIE). HIE infrastruc-
ing, many EHR vendors are strongly ture allows organizations to share informa-
supportive of this type of an ecosystem, and tion about patients through a common
promising examples are emerging of how electronic framework. Robust HIE capabili-
these technologies can be used to deliver value ties, which are now being implemented in
to health care organizations in an EHR- commercial EHR systems, make it much more
agnostic manner. We anticipate that this efficient to share patient information between
approach to health information systems, organizations versus creating point-to-point
wherein core EHR systems are augmented by interfaces between all the clinical information
Software Engineering for Health Care and Biomedicine
201 6
systems a particular provider might need to with user-centered, workflow- integrated
communicate with. Effective sharing of infor- interventions, such insights have the poten-
mation is predicated on the use standard mes- tial to improve clinical decision making and
sage formats and terminologies (see 7 Chap. enhance patient care.
8), or the use of a shared EHR vendor. Where
HIE functionality does not exist or is declin-
ing (Adler-Milstein et al. 2016), sharing can
be coordinated directly between organizations 6.5 Summary
with incentives for sharing, such as in an
ambulatory care network (ACN). As health The goal of software engineering in health
care in the United States increasingly shifts to care is to create a system that facilitates deliv-
a payment model that rewards value over vol- ery of care. Much has changed in the past
ume, data sharing capabilities and the capac- decade with EHRs, and today most institu-
ity for population-level analytics and health tions will purchase rather than build an
management will become increasingly critical. EHR. But engineering these systems to facili-
Moreover, there are also emerging efforts to tate care is still challenging, and following
scale health information exchange to a appropriate software development practices is
national scale, and to facilitate patient access increasingly important. The success of a sys-
to their information using APIs (ONC 2019b). tem depends on interaction among designers
Software engineering is an ever-evolving of healthcare software applications and those
discipline, and new ideas are emerging rap- that use the systems. Communication among
idly in this field. It is less than 30 years since the participants is very difficult when it comes
the first graphical browser was used to access to commercial applications. Informaticians
the World Wide Web, but today Web-based have an important role to play in bridging the
applications are the standard. Access to gaps among designers and users that result
information through search engines has from the wide variety in background, educa-
changed the way that people find and evalu- tion, experience, and styles of interaction.
ate information. Social networking applica- They can improve the process of software
tions have altered our views on privacy and development by specifying accurately and real-
personal interaction. All of these develop- istically the need for a system and of designing
ments have shaped the development of workable solutions to satisfy those needs.
healthcare software, too. Today it is unimagi-
nable that an EHR would not support a nnSuggested Reading
Web-based patient portal. Clinicians and Carter, J. H. (2008). Electronic health records (2nd
consumers use the Web to search for health- ed.). Philadelphia: ACP Press. Written by a
related information in growing numbers and clinician and for clinicians, this is a practical
with growing expectations. It is not atypical guide for the planning, selection, and imple-
for patients to discuss health issues in online mentation of an electronic health record. It
forums and share intimate details on patient first describes the basic infrastructure of an
networking sites. EHR, and then how they can be used effec-
Another development that is impacting tively in health care. The second half of the
virtually all industries, including health book is written more as a workbook for some-
care, is advanced analytics. Coupled with one participating in the selection and imple-
the rapid increase in the adoption of EHR mentation of an EHR.
systems, health care represents a golden KLAS Reports. http://www.klasresearch.com/reports.
opportunity for leveraging powerful com- These reports are necessary tools for a project
puting approaches with large data sets to manager who needs to know the latest indus-
identify and apply new insights. For exam- try and customer information about vendor
ple, deep learning techniques can be applied health information technology products. The
to EHR data to predict important outcomes reports include information on functionality
such as in-hospital mortality. If coupled available from vendors as well as customer
202 A. B. Wilcox et al.
opinions about how vendors are meeting the Wiley & Sons. This is a textbook giving a good
needs of organizations and whose products overview of healthcare information systems,
are the best in a particular user environment. used in many academic courses on the subject.
McConnell, S. (1996). Rapid development: Taming It reviews the different environmental factors
wild software schedules. Redmond: Microsoft and contexts that influence the health infor-
Press. For those who would like a deeper under- mation landscape nationally, as well as giving
standing of software development and project guidance on implementation, management
methodologies like Agile, this is an excellent and evaluation of systems.
source. It is targeted to code developers, system
architects, and project managers. ??Questions for Discussion
President’s Council of Advisors on Science and 1. Reread the hypothetical case study in
Technology (December 2010). Report to the 7 Sect. 6.2.1.
President Realizing the Full Potential of (a) What are three primary benefits of
6 Health Information Technology to Improve the software used in James’s care?
Healthcare for Americans: the Path Forward. (b) How many different ways is James’s
http://www.whitehouse.gov/sites/default/files/micro- information used to help manage
sites/ostp/pcast-health-it-report.pdf. This PCAST his care?
report focuses on what changes could be made (c) Without the software and informa-
in the field of electronic health records to make tion, how might his care be d
ifferent?
them more useful and transformational in the (d) How has health care that you have
future. It gives a good summary of the current experienced similar or different to
state of EHRs in general, and compares the this example?
barriers to those faced in adopting information 2. For what types of software
technology in other fields. Time will tell if the development projects would an agile
suggestions really become the solution. development approach be better than a
Stead, W. W., & Lin, H. S. (Eds.). (2009). waterfall approach? For what types of
Computational technology for effective health development would waterfall be
care: immediate steps and strategic directions. preferred?
Washington, DC: National Academies Press. 3. What are reasons an institution would
This is a recent National Research Council choose to develop software instead of
report about the current state of health infor- purchase it from a vendor?
mation technology and the vision of the 4. How is would various stages in the soft-
Institute of Medicine about how such technol- ware development life cycle be different
ogy could be used. It can help give a good when developing software versus config-
understanding of how health IT could be used uring or adding enhancements to an
in health care, especially to technology profes- existing software program?
sionals without a health care background. 5. Reread the case studies in 7 Sect.
Tang, P. C. (2003). Key capabilities of an electronic 6.3.3.2.
health record system. Washington, DC: (a) What are the benefits and
National Academies Press. This is a short, let- advantages of the different
ter report from an Institute of Medicine com- approaches to development and
mittee that briefly describes the core acquisition among the scenarios?
functionalities of an electronic health record (b) What were the initial costs for
system. Much of the report is tables that list each institution for the software?
specific capabilities of EHRs in some core Where will most of the long-term
functional areas, and indicate their maturity in costs be?
hospitals, ambulatory care, nursing homes, 6. In what ways might new trends in
and personal health records. software (small “apps” that accomplish
Wager, K. A., Lee, F. W., & Glaser, J. P. (2017). focused tasks) change long-term
Health care information systems: a practical strategies for electronic health record
approach for health care management. John architectures?
Software Engineering for Health Care and Biomedicine
203 6
References infection surveillance. In Proceedings of the AMIA
Symposium (pp. 289–293). American Medical
Informatics Association.
Adler-Milstein, J., Lin, S. C., & Jha, A. K. (2016). The
Jones, S. S., Rudin, R. S., Perry, T., & Shekelle, P. G.
number of health information exchange efforts is
(2014). Health information technology: An updated
declining, leaving the viability of broad clinical data
systematic review with a focus on meaningful use.
exchange uncertain. Health Affairs, 35(7), 1278–
Annals of Internal Medicine, 160(1), 48–54.
1285.
Kawamoto, K., Kukhareva, P., Shakib, J. H., Kramer,
Bates, D. W. (2006). Invited commentary: The road to
H., Rodriguez, S., Warner, P. B., Shields, D., Weir,
implementation of the electronic health record.
C., Fiol, G. D., Taft, T., & Stipelman, C. H. (2019).
Proceedings (Baylor University. Medical Center),
Association of an electronic health record add-on
19(4), 311–312.
app for neonatal bilirubin management with physi-
Bates, D. W., Leape, L. L., et al. (1998). Effect of com-
cian efficiency and care quality. JAMA Network
puterized physician order entry and a team inter-
Open, 2(11), e1915343–e1915343.
vention on prevention of serious medication errors.
Leavitt, M., & Gallagher, L. (2006). The EHR seal of
JAMA, 280(15), 1311–1316.
approval: CCHIT introduces product certification
Beck, K., Beedle, M., et al. (2001). Manifesto for agile
to spur EHR adoption. Journal of AHIMA, 77(5),
software development. 2012, from agilemanifesto.org.
26–30, quiz 33–24.
Blumenthal, D., & Tavenner, M. (2010). The “meaning-
Mandel, J. C., Kreda, D. A., Mandl, K. D., Kohane,
ful use” regulation for electronic health records. The
I. S., & Ramoni, R. B. (2016). SMART on FHIR: A
New England Journal of Medicine, 363(6), 501–504.
standards-based, interoperable apps platform for
Campanella, P., Lovato, E., Marone, C., Fallacara, L.,
electronic health records. Journal of the American
Mancuso, A., Ricciardi, W., & Specchia, M. L.
Medical Informatics Association, 23(5), 899–908.
(2016). The impact of electronic health records on
McConnell, S. (1996). Rapid development : Taming wild
healthcare quality: A systematic review and meta-
software schedules/Steve McConnell. Redmond,
analysis. European Journal of Public Health, 26(1),
Wash: Microsoft Press.
60–64.
Office of the National Coordinator for Health
Chaudhry, B., Wang, J., et al. (2006). Systematic review:
Information Technology (ONC). (2017a). Certified
Impact of health information technology on qual-
health IT developers and editions reported by hospi-
ity, efficiency, and costs of medical care. Annals of
tals participating in the medicare EHR incentive pro-
Internal Medicine, 144(10), 742–752.
gram, Health IT Quick-Stat #29.
Dupuits, F. M. (1994). The use of the Arden Syntax for
dashboard.h ealthit.g ov/quickstats/pages/FIG-
MLMs in HIOS+, a decision support system for
Vendors-of-EHRs-to-Participating-Hospitals.php.
general practitioners in The Netherlands. Computers
July 2017.
in Biology and Medicine, 24(5), 405–410.
Office of the National Coordinator for Health
Evans, R. S., Pestotnik, S. L., et al. (1998). A computer-
Information Technology (ONC). (2017b). Non-
assisted management program for antibiotics and
federal acute care hospital electronic health record
other antiinfective agents. The New England Journal
adoption, Health IT Quick-Stat #47. dashboard.
of Medicine, 338(4), 232–238.
healthit.gov/quickstats/pages/FIG-Hospital-EHR-
Fred, M., Vawdrey, D., et al. (2009). Enhancing a com-
Adoption.php. September 2017.
mercial electronic health record with a novel patient
Office of the National Coordinator for Health
handoff application. Society of Hospital Medicine
Information Technology (ONC). (2019a). Office-
(SHM) Annual Meeting, Chicago, IL.
based physician electronic health record adoption,
Han, Y. Y., Carcillo, J. A., et al. (2005). Unexpected
Health IT Quick-Stat #50. dashboard.healthit.gov/
increased mortality after implementation of a com-
quickstats/pages/physician-ehr-adoption-trends.
mercially sold computerized physician order entry
php. January 2019.
system. Pediatrics, 116(6), 1506–1512.
Office of the National Coordinator for Health
Hripcsak, G., Kuperman, G. J., et al. (1998). Extracting
Information Technology (ONC). (2019b). Trusted
findings from narrative reports: Software transfer-
exchange framework and common agreement |
ability and sources of physician disagreement.
HealthIT.gov. (2019). Retrieved February 2020,
Methods of Information in Medicine, 37(1), 1–7.
from https://www.healthit.gov/topic/interoperabil-
Hunt, D. L., Haynes, R. B., et al. (1998). Effects of
ity/trusted-exchange-framework-and-common-
computer- based clinical decision support systems
agreement
on physician performance and patient outcomes: A
Ohno-Machado, L., Wang, S. J., et al. (1999). Decision
systematic review. JAMA, 280(15), 1339–1346.
support for clinical trial eligibility determination in
Jenders, R. A. (2008). Suitability of the Arden Syntax
breast cancer. In Proceedings of the AMIA sympo-
for representation of quality indicators. American
sium (pp. 340–344). American Medical Informatics
Medical Informatics Association Annual Symposium
Association.
Proceedings, 991.
Pryor, T. A., & Hripcsak, G. (1993). The Arden syntax
Jenders, R. A., & Shah, A. (2001). Challenges in using
for medical logic modules. International Journal of
the Arden Syntax for computer-based nosocomial
Clinical Monitoring and Computing, 10(4), 215–224.
204 A. B. Wilcox et al.
6
205 7
Standards in Biomedical
Informatics
Charles Jaffe, Viet Nguyen, Wayne R. Kubick, Todd Cooper,
Russell B. Leftwich, and W. Edward Hammond
Contents
References – 239
records systems (although this term was not institutions. Various management techniques,
used then), early standards were created in such as continuous quality improvement and
ASTM (formerly, the American Society for case management, require up-to-date, accu-
Testing and Materials, see 7 Sect. 8.3.2) for rate abstracts of patient data. Post hoc analy-
the transfer of laboratory data from local and ses for clinical and outcomes research require
commercial laboratories (American Society for comprehensive summaries across patient
Testing and Materials 1999). In the late 1980s, populations. Advanced tools, such as clini-
Simborg and others developed an HIS by cal workstations (7 Chap. 5) and decision-
interfacing independent systems using a “Best support systems (7 Chap. 3), require ways to
of Breed” approach to create an integrated translate raw patient data into generic forms
HIS (Simborg et al. 1983) Unfortunately, the for tasks as simple as summary reporting and
cost of developing and maintaining those as complex as automated medical diagnosis.
interfaces was prohibitive, and the need for All these needs must be met in the existing
a broader set of standards was realized. This setting of diverse, interconnected information
effort resulted in the creation of the standards systems—an environment that cries out for
7 developing organization (SDO) Health Level implementation of standards.
Seven (HL7) in 1987. Other SDOs were cre- One obvious need is for standardized
ated in this same time frame: EDIFACT by the identifiers for individuals, health care provid-
United Nations and ASC X12N by ANSI to ers, health plans, and employers so that such
address standards for claims and billing, IEEE participants can be recognized across systems.
for device standards, ACR/NEMA (later Choosing such an identifier is much more
DICOM) for imaging standards, and NCPDP complicated than simply deciding how many
for prescription standards. Internationally, digits the identifier should have. Ideal attri-
the 1990s saw the creation of the European butes for these sets of identifiers have been
Normalization Committee (CEN) and described in a publication from the ASTM
ISO Technical Committee 215 for Health (1999). The identifier must include a check
Informatics (TC251). These organizations are digit to ensure accuracy when the identifier
described in more detail in 7 Sect. 8.3.2. is entered by a human being into a system.
A standardized solution must also determine
mechanisms for issuing identifiers to individu-
7.2.2 Transitioning Standards als, facilities, and organizations; for maintain-
to Meet Present Needs ing databases of identifying information; and
for authorizing access to such information
Early standards were usually applied within a (also see 7 Chap. 5).
single unit or department in which the stan- The Centers for Medicare and Medicaid
dards addressed mainly local requirements. Services (CMS), has defined a National
Even then, data acquired locally came from Provider Identifier (NPI) as a national stan-
another source introducing the need for addi- dard. This number is a seven-character alpha-
tional standards. These many pressures caused numeric base identifier plus a one-character
health care information systems to change check digit. No meaning is built into the num-
the status quo such that data collected for a ber, each number is unique, it is never reissued,
primary purpose could be reused in a multi- and alpha characters that might be confused
tude of ways. Newer models for health care with numeric characters have been eliminated
delivery, such as integrated delivery networks, (e.g., 0, 1, 2, 4, and 5 can be confused with O,
health maintenance organizations (HMOs), I or L, Z, Y, and S). CMS was tasked to define
preferred provider organizations (PPOs), and a Payer ID for identifying health care plans.
now accountable care organizations (ACOs) The Internal Revenue Service’s employer
have increased the need for coordinated, identification number has been adopted as the
integrated, and consolidated information Employer Identifier.
(see 7 Chap. 15), even though the informa- The most controversial issue is identify-
tion comes from disparate departments and ing each individual or patient. Many people
Standards in Biomedical Informatics
209 7
consider assignment and use of such a num- The inclusion of medical knowledge in
ber to be an invasion of privacy and are con- clinical systems is becoming increasingly
cerned that it could be easily linked to other important and commonplace. Sometimes, the
databases. Public Law 104–191, passed in knowledge is in the form of simple facts such
August 1996 (see 7 Sect. 8.3.2), required that as the maximum safe dose of a medication or
Congress formally define suitable identifiers. the normal range of results for a laboratory
Pushback by privacy advocates and negative test. Much medical knowledge is more com-
publicity in the media resulted in Congress plex, however. It is challenging to encode such
declaring that this issue would not be moved knowledge in ways that computer systems can
forward until privacy legislation was in place use (see 7 Chap. 26), especially if one needs
and implemented (see 7 Chap. 14). The to avoid ambiguity and to express logical rela-
Department of Health and Human Services tions consistently. Thus the encoding of clini-
has recommended the identifiers discussed cal knowledge using an accepted standard
above, except for the person identifier. This would allow many people and institutions to
problem has still not been resolved, although share the work done by others. One standard
the momentum for creating such a unique designed for this purpose is the Arden Syntax,
personal identified seems to be increasing. discussed in 7 Chap. 3, as well as the HL7
A work-around non-solution is algorithmic standard Clinical Quality Language (Odigie
patient matching based on electronic health et al. 2019).
record (EHR) data. The United States is one Because the tasks we have described
of the few developed countries without such require coordination of systems, methods
an identifier Stead et al. 2005). are needed for transferring information from
one system to another. Such transfers were
traditionally accomplished through custom-
7.2.3 Settings Where Standards tailored point-to-point interfaces, but this
Are Needed technique has become unworkable as the
number of systems and the resulting permu-
The patient care process, which can be varied tations of necessary connections have grown.
and complicated, also include numerous pro- A current approach to solving the multiple-
cesses that can be improved with standardiza- interface problem is through the development
tion. A hospital admissions system records of messaging standards. Such messages must
that a patient has the diagnosis of diabetes depend on the preexistence of standards for
mellitus, a pharmacy system records that the patient identification and data encoding.
patient has been given gentamicin, a labora- Over the past decade, non-healthcare
tory system records that the patient had cer- domains such as travel, package delivery and
tain results on kidney function tests, and a e-commerce have adopted, implemented and
radiology system records that a doctor has published standard application programing
ordered an X-ray examination for the patient interfaces (APIs) in order to streamline their
that requires intravenous iodine dye. Other business processes and improve efficiency. The
systems need ways to store these data, to pres- adoption of open APIs especially the HL7
ent the data to clinical users, to send warn- Fast Healthcare Interoperability Resources
ings about possible drug-drug interactions, (FHIR®) has increased dramatically and
to recommend dosage changes, and to follow cited proposed regulations as an enabler of
the patient’s outcome. A standard for coding improved data sharing (Braunstein 2018).
patient data is nontrivial when one consid- Data sharing has become an expected
ers the need for agreed-on definitions, use of functionality for any health IT system. Many
qualifiers, differing (application-specific) lev- of the new initiatives in health require data
els of granularity in the data, and synonymy, sharing. Data sharing is essential not only for
not to mention the breadth and depth that patient care, but for aggregating data across
such a standard would need to have. multiple sites for research. Security must also
210 C. Jaffe et al.
be addressed before such exchanges can be standards; and new groups continue to be
allowed to take place. Before a system can formed as they become aware of the need for
divulge patient information, it must ensure standards and do not look to see what stan-
that requesters are who they say they are and dards exist. Yet, the process of creating stan-
that they are permitted access to the requested dards largely works, and effective standards
information (see 7 Chap. 5). Standards exist are created.
for this functionality. Although each clinical There are four ways in which a standard
system can have its own security features, sys- can be produced:
tem builders would rather draw on available 1. Ad hoc method: A group of interested
standards and avoid reinventing the wheel. people and organizations (e.g., laboratory-
Besides, the secure exchange of information system and hospital-system vendors) agree
requires that interacting systems use stan- on a standard specification. These specifi-
dard technologies. Electronic Health Record cations are informal and are accepted as
systems (EHRs) are increasingly adopt- standards through mutual agreement of
ing standard authorization (OAuth2) and the participating groups. A standard
7 identification (OpenID) by implementing example produced by this method is the
Substitutable Medical Applications Reusable DICOM standard for medical imaging.
Technology (SMART) on FHIR which allows 2. De facto method: A single vendor controls
platform independent applications to be a large enough portion of the market to
launched within the EHR workflow and uti- make its product the market standard. An
lize EHR data via FHIR (Payne et al. 2015). example is Microsoft’s Windows. A more
recent example are the Argonaut
Implementation Guides.4 In this case, a
7.3 Standards Undertakings collaborative of vendors and academic
and Organizations health systems are creating consensus
standards for their requirements.
It is helpful to separate our discussion of the 3. Government-mandate method: A govern-
general process by which standards are cre- ment agency, such as CMS or the National
ated from our discussion of the specific orga- Institute for Standards and Technology
nizations and the standards that they produce. (NIST) creates a standard and legislates its
The process is relatively constant, whereas use. An example is the HIPAA standard.
the organizations form, evolve, merge, and Another example is the Consolidated
are disbanded. This section will discuss how Clinical Data Architecture (CCDA),5 a
standards are created then identify the many standard that resulted from the US
SDOs and an overview of the types of stan- Government’s creating a set of require-
dards they create. This section will also iden- ments and driving a standard to meet
tify other groups and organizations that those requirements.
contribute or relate to standards activities. 4. Consensus method: A group of volunteers
representing interested parties works in an
open process to create a standard. Most
7.3.1 he Standards Development
T health care standards are produced by this
method. An example is the Health Level 7
Process
(HL7) standard for clinical-data inter-
change (. Fig. 7.1).
The process of creating standards is biased
and highly competitive. Most standards are
created by volunteers who represent multiple,
disparate stakeholders. They are influenced
by direct or indirect self-interest rather than 4 7 http://fhir.org/guides/argonaut/ (accessed
judgment about what is best or required. The 12/2/19)
5 7 https://www.healthit.gov/topic/standards-tech-
process is generally slow and inefficient; mul- nology/consolidated-cda-overview (accessed
tiple international groups create competitive 12/2/19)
Standards in Biomedical Informatics
211 7
be its format? In the early years of standards
development, this approach led the develop-
ment of standards, and the process was sup-
ported by vendors and providers.
As those early standards have become
successful, the need for “gap-standards” has
arisen. These gap-standards have no cham-
pion but are necessary for completeness of
an interoperable data exchange network. The
need for these standards is not as obvious
as for the primary standards, people are less
likely to volunteer to do work, putting stress
on the voluntary approach. In such cases, the
need of such standards must be sold to the
volunteers or developed by paid professionals.
Let us consider, for purposes of illustra-
tion, how a standard might be developed for
sending laboratory data in electronic form
from one computer system to another in the
form of a message. The volunteers for the
development might include laboratory system
vendors, clinical users, and consultants. One
.. Fig. 7.1 Standards development meetings. The key discussion would be on the scope of the
development of effective standards often requires the standard. Should the standard deal only with
efforts of dedicated volunteers, working over many
the exchange of laboratory data, or should
years. Work is often done in small committee meetings
and then presented to a large group to achieve consen- the scope be expanded to include other types
sus. Here we see meetings of the HL7. Vocabulary Tech- of data exchange? Should the data elements
nical Committee (top) and an HL7 plenary meeting being exchanged be sent with a XML tag
(bottom). See 7 Sect. 8.5.2 for a discussion on HL7; identifying the data element, or should the
Photo courtesy of Ken Rubin Photography
data be defined positionally? In the ensuing
discussion stage, the participants will begin to
The process of creating a standard proceeds create an outline that defines content, identi-
through several stages (Libicki 1995). It begins fies critical issues, and produces a timeline. In
with an identification stage, during which the discussion, the pros and cons of the vari-
someone becomes aware that there exists a ous concepts are discussed. What will be the
need for a standard in some area and that tech- specific form for the standard? For example,
nology has reached a level that can support will it be message based or document based?
such a standard. For example, suppose there Will the data exchange be based on a query
are several laboratory systems sending data to or on a trigger event? Will the standard define
several central hospital systems—a standard the message content, the message syntax, the
message format would allow each laboratory terminology, the network protocol, or a sub-
system to talk to all the hospital systems with- set of these issues? How will a data model or
out specific point-to-point interface programs information model be incorporated?
being developed for each possible laboratory- The participants are generally well
to-laboratory or laboratory-to-hospital com- informed in the domain of the standard, so
bination. If the time for a standard is ripe, they appreciate the needs and problems that
then several individuals can be identified and the standard must address. Basic concepts
organized to help with the conceptualization are usually topics for heated discussion; sub-
stage, in which the characteristics of the stan- sequent details may follow at an accelerated
dard are defined. What must the standard do? pace. Many of the participants will have expe-
What is the scope of the standard? What will rience in solving problems to be addressed
212 C. Jaffe et al.
by the standard and will protect their own cific fields (do we include the time the test was
approaches. The meanings of words are often ordered? specimen drawn? test performed?).
debated. Compromises and loosely defined A standard will generally go through sev-
terms are often accepted to permit the pro- eral versions on its path to maturity. The first
cess to move forward. For example, the likely attempts at implementation are frequently
participants would be vendors of competing met with frustration as participating ven-
laboratory systems and vendors of competing dors interpret the standard differently and
HISs. All participants would be familiar with as areas not addressed by the standard are
the general problems but would have their encountered. These problems may be dealt
own proprietary approach to solving them. with in subsequent versions of the standard.
Definitions of basic concepts normally taken Backward compatibility is a major concern as
for granted, such as what constitutes a test or the standard evolves. How can the standard
a result, would need to be clearly stated and evolve, over time, and still be economically
agreed on. responsible to both vendors and users? An
The writing of the draft standard is usu- implementation guide is usually produced to
7 ally the work of a few dedicated individuals— help new vendors profit from the experience
typically people who represent the vendors in of the early implementers.
the field. Other people then review that draft; Connectathons have become increasingly
controversial points are discussed in detail and important in the standards development. In
solutions are proposed and finally accepted. the past, standards development and stan-
Writing and refining the standard is further dards implementation have been generally
complicated by the introduction of people separated. Today, standards are tested during
new to the process who have not been privy one or two day connectathons where imple-
to the original discussions and who want to menters bring client and server applications to
revisit points that have been resolved earlier. test against one another. The successes rein-
The balance between moving forward and force the validity of the standard while fail-
being open is a delicate one. Most standards- ures identify gaps and errors which need to be
writing groups have adopted an open standards addressed.
development policy: Anyone can join the pro- A critical stage in the life of a standard is
cess and can be heard. Most standards devel- early implementation, when acceptance and
opment organizations—certainly those by rate of implementation are important to suc-
accredited groups— support an open ballot- cess. This process is influenced by accredited
ing process. A draft standard is made a vailable standards bodies, by the federal government, by
to all interested parties, inviting comments major vendors, and the marketplace. The main-
and recommendations. All comments are con- tenance and promulgation of the standard are
sidered. Negative ballots must be addressed also important to ensure widespread availabil-
specifically. If the negative comments are per- ity and continued value of the standard. Some
suasive, the standard is modified. If they are form of conformance testing is ultimately nec-
not, the issues are discussed with the submit- essary to ensure that vendors adhere to the
ter in an attempt to convince the person to standard and to protect its integrity.
remove the negative ballot. If neither of these Producing a standard is an expensive pro-
efforts is successful, the comments are sent to cess in terms of both time and money. Vendors
the entire balloting group to see whether the and users must be willing to support the many
group is persuaded to change its vote. The hours of work, usually on company time; the
resulting vote then determines the content of travel expense; and the costs of documenta-
the standard. Issues might be general, such tion and distribution. In the United States,
as deciding what types of laboratory data to the production of a consensus standard is
include (pathology? blood bank?), or specific, voluntary, in contrast to in Europe and else-
such as deciding the specific meanings of spe- where, where most standards development
Standards in Biomedical Informatics
213 7
is funded by governments. In the US, a new dor. There is, however, the certification of an
model for funding standards development application that uses standards. In 2010, the
work has emerged. The Da Vinci Project6 and Office of the National Coordinator (ONC)8
the CARIN Alliance7 are two collaboratives engaged with the CCHIT to certify EHR
funded by payers and technology vendors products. That certification process evolved in
to address interoperability needs between 2011 to include eight groups that could cer-
patients, providers and payers. These group tify EHR products, and to date over 500 EHR
share the cost of standards development and products have been certified. The certification
benefit from the accelerated pace. process is still undergoing change.
An important aspect of standards is con-
formance, a concept that covers compliance
with the standard and also usually includes 7.3.2 Data Standards Organizations
specific agreements among users of the stan-
dard who affirm that specific rules will be fol- Sometimes, standards are developed by orga-
lowed. A conformance document identifies nizations that need the standard to carry
specifically what data elements will be sent, out their principal functions; in other cases,
when, and in what form. Even with a perfect coalitions are formed for the express purpose
standard, a conformance document is neces- of developing a particular standard. The lat-
sary to define business relationships between ter organizations are discussed later, when
two or more partners. Unlike past standards, we examine the particular standards devel-
recent standards have built conformance test- oped in this way. There are also standards
ing directly into the standard artifacts which organizations that exist for the sole purpose
are not only human readable documents but of fostering and promulgating standards. In
also machine readable. some cases, they include a membership with
The creation of the standard is only the expertise in the area where the standard is
first step. Ideally the first standard would be needed. In other cases, the organization pro-
a Standard for Trial Use (STU), and two or vides the rules and framework for standard
more vendors would implement and test the development but does not offer the expertise
standard to identify problems and issues. needed to make specific decisions for specific
Those items would be corrected, and in a standards, relying instead on participation by
short period of time (usually 1 year), the knowledgeable experts when a new standard
standards would be advanced to a normative is being studied.
stage. A normative standard specifies to what This section describes several of the best
implementers must conform. Even then, the known and most influential health-related
process is only beginning. Implementation SDOs. Since most standards continue to
that conforms to the standard is essential if evolve to accommodate changes in technol-
the true value of the standard is to be realized. ogy, policy, regulations, and requirements,
The use of most standards is enhanced by a links are provided to selected standards and
certification process in which a neutral body SDO information. For a detailed understand-
certifies that a vendor’s product, in fact, does ing of an organization or the standards it has
comply and conform to the standard. developed, you will need to refer to current
There is currently no body that certifies primary resources. Many of the organizations
conformance of specific standards from a ven- maintain Web sites with excellent current
information on their status.
6 7 http://www.hl7.org/about/davinci/ (accessed
12/2/19)
7 7 https://www.carinalliance.com/ (accessed 8 7 http://www.healthit.gov/newsroom/about-onc
12/2/19) (accessed 12/2/19)
214 C. Jaffe et al.
U.S. government as part of the certification participating country. For most work, there are
requirements of Meaningful Use. a defined series of steps, beginning with a New
Link: 7 www.hl7.org Work Item Proposal and getting five coun-
Link to HL7 FHIR: 7 www.hl7.org/ tries to participate; a Working D ocument, a
FHIR Committee Document; a Draft International
Standard, a Final Draft International Standard
7.3.2.9 Institute of Electrical (FDIS); and finally an International Standard.
and Electronic Engineering This process, if fully followed, takes several
(IEEE) years to produce an International Standard.
IEEE is an international SDO organiza- Under certain conditions, a fast track to FDIS
tion that is a member of both ANSI and is permitted. Technical Reports and Technical
ISO. Through IEEE, many of the world’s Specifications are also permitted.
standards in telecommunications, electronics, The United States has been assigned the
electrical applications, and computers have duties of Secretariat, and that function is car-
been developed. ried out, at this time, by ANSI. ANSI also
7 IEEE 1073, Standard for Medical Device serves as the U.S. Technical Advisory Group
Communications, has produced a family of Administrator, which represents the U.S. posi-
documents that defines the entire seven-layer tion in ISO.
communications requirements for the Medical Link: 7 https://www.iso.org/standards.
Information Bus (MIB; Gottschalk 1991). The html
MIB is a robust, reliable communication ser-
vice designed for bedside devices in the inten- 7.3.2.11 Integrating the Healthcare
sive care unit, operating room, and emergency Enterprise
room. These standards have been harmonized The goal of the Integrating the Healthcare
with work in CEN, and the results are released Enterprise (IHE) initiative is to stimulate inte-
as ISO standards. IEEE and HL7 have col- gration of healthcare information resources.
laborated on several key standards, including IHE is sponsored jointly by the Radiological
those for mobile medical devices. Society of North America (RSNA) and the
Link: 7 www.ieee.org HIMSS. Using established standards and
Link to standards: 7 https://standards. working with direction from medical and
ieee.org/standard/index.html information technology professionals, indus-
try leaders in healthcare information and
7.3.2.10 ISO Technical Committee imaging systems cooperate under IHE to
215—Health Informatics agree upon implementation profiles for the
In 1989, interests in the European Committee transactions used to communicate images
for Standardization (CEN) and the United and patient data within the enterprise. Their
States led to the creation of Technical incentive for participation is the opportunity
Committee (TC) 215 for Health Information to demonstrate that their systems can operate
within ISO. efficiently in standards-based, multi-vendor
TC 215 meets once in a year as a TC and environments with the functionality of real
once as a Joint Working Group. TC 215 fol- HISs. Moreover, IHE enables vendors to
lows rather rigid procedures to create ISO direct product development resources toward
standards. Thirty-five countries are active par- building increased functionality rather than
ticipants in the TC with another 23 countries redundant interfaces.
acting as observers. While the actual work Link: 7 https://www.ihe.net/
is done in the working groups, the balloting Link to IHE domains: 7 https://www.ihe.
process is very formalized—one vote for each net/ihe_domains/
Standards in Biomedical Informatics
217 7
7.3.2.12 National Council secure and reliable exchange of data to and
for Prescription Drug from personal health devices.
Program (NCPDP) Link to guidelines: 7 www.pchalliance.
NCPDP is a not-for-profit, multi-stakeholder org/continua-design-guidelines
forum for developing and promoting industry
standards and business solutions that improve
7.3.2.15 SNOMED International
patient safety and health outcomes, while SNOMED International (previously called
also decreasing costs. NCPDP is an ANSI IHTSDO) was founded in 2007 with nine
accredited SDO and uses a consensus-build- charter members. Currently 19 countries,
ing process to create national standards for including the United States, are members.
real-time, electronic exchange of healthcare The primary purpose of IHTSDO is the con-
information. Their primary focus is on infor- tinued development and maintenance of the
mation exchange for prescribing, dispensing, Systematized Nomenclature of Medicine –
monitoring, managing and paying for medica- Clinical Terms (SNOMED-CT; see 7 Sect.
tions and pharmacy services crucial to quality 7.4.4.4, below). Member countries make
healthcare. SNOMED- CT freely available to its citi-
Link: 7 www.ncpdp.org/home zens. SNOMED has a number of Special
Link to standards: 7 www.ncpdp.org/ Interest Groups: Anesthesia, Concept Model,
standards-Development Education, Implementation, International
Pathology & Laboratory Medicine, Mapping,
7.3.2.13 OpenEHR Nursing, Pharmacy, and Translation.
OpenEHR is the name of a technology for Link: 7 http://www.snomed.org/
e-health, consisting of open specifications,
clinical models and software that can be used
7.3.2.16 OHDSI
to create standards and build information and The Observational Health Data Sciences and
interoperability solutions for healthcare. The Informatics (OHDSI) program is a multi-
various artefacts of openEHR are produced stakeholder, interdisciplinary open-source
by the openEHR community and managed by collaborative to leverage the value of health
the openEHR Foundation, an international data through large-scale analytics. OHDSI
non-profit organization established in the year has established an international network of
2003. researchers and observational health data-
Link: 7 openehr.org/ bases. ODHSI enables active engagement
Link to Clinical Models: 7 www.openehr. across multiple disciplines, including clinical
org/clinicalmodels medicine, biostatistics, computer science, epi-
Link to Clinical Knowledge Manager: demiology, and life sciences (. Fig. 7.2).
7 www.openehr.org/ckm Link: 7 https://www.ohdsi.org/
Link to Software Programs: 7 www.
openehr.org/programs/software
Link to Specification Program: 7 openehr.
7.4 etailed Clinical Models,
D
org/programs/specification Coded Terminologies,
Nomenclatures,
7.3.2.14 Personal Connected Health and Ontologies
Alliance (PCHA)
The PCHA publishes the Continua Design The capture, storage, and use of clinical data
Guidelines to enable a flexible implementa- in computer systems is complicated by lack of
tion framework for end-to-end interoperabil- agreement on terms and meanings. In recent
ity of personal connected health devices and years there has also been a growing recogni-
systems. These Guidelines are recognized by tion that just standardizing the terms and
the International Telecommunications Union codes used in medicine is not sufficient to
(ITU) as the international standard for safe, enable interoperability. The structure or form
218 C. Jaffe et al.
.. Fig. 7.2 Extensive International Vocabularies, continues to be the requirement for integrating data
George Hripscak, MD (by permission). A critical chal- sources among multiple vocabularies. (Photo courtesy
lenge to data analytics across international domains of George Hripcsak, MD, with permission)
of medical data provides important context to create their own. Second, using commonly
for computable understanding of the data. accepted standards can facilitate exchange of
Terms and codes need to be interpreted in data, applications, and clinical decision sup-
the context of clinical information models. port logic among systems. For example, if a
The many terminologies and detailed clinical central database is accepting clinical data from
modeling activities discussed in this section many sources, the task is greatly simplified if
have been developed to ease the communica- each source is using the same logical data struc-
tion of coded medical information. ture and coding scheme to represent the data.
System developers often ignore available stan-
dards and continue to develop their own solu-
7.4.1 Motivation for Structured tions. It is easy to believe that the developers
and Coded Data have resisted adoption of standards because it
is too much work to understand and adapt to
The structuring and encoding of medical any system that was “not invented here.” The
information is a basic function of most clini- reality, however, is that the available standards
cal systems. Standards for such structuring are often inadequate for the needs of the users
and encoding can serve two purposes. First, (in this case, system developers). As a result, no
they can save system developers from reinvent- standard terminology enjoys the wide accep-
ing the wheel. For example, if an application tance sufficient to facilitate the second func-
allows caregivers to compile problem lists tion: exchange of coded clinical information.
about their patients, using a standard structure The need for detailed clinical models
and terminology saves developers from having is directly related to the second goal dis-
Standards in Biomedical Informatics
219 7
cussed above, that of creating interoperabil- are specified. The HL7 Vocabulary Working
ity between systems. The subtle relationship Group has created a comprehensive discus-
between terminologies and models is best sion of how value sets can be defined and used
understood using a couple of examples. If with information models.
a physician wants to record the idea that a There are many clinical information mod-
patient had “chest pain that radiates to the eling activities worldwide. Some of the most
back”, the following coded terms could be important activities are briefly listed below.
used from SNOMED-CT. 55 HL7 Activities
–– HL7 Detailed Clinical Models – This
group has developed a method for spec-
7.4.2 Detailed Clinical Models ifying clinical models based on the HL7
Reference Information Model (RIM)
The creation of unambiguous data represen- that guarantees that data that conforms
tation is a combination of creating appropri- to the model could be sent in HL7
ate structures (models) for representing the Version 3 messages.
form of the data and then linking or “bind- –– HL7 Clinical Document Architecture
ing” specific sets of codes to the coded ele- (CDA) Templates – This group has
ments in the structures. Several modeling defined a standard way of specifying
languages or formalisms have been found to the structure of data to be sent in XML
be useful in describing the structure of the documents that conform to the CDA
data. They include: standard.
55 UML – the Unified Modeling Language, –– HL7 TermInfo – This Workgroup of
Object Management Group HL7 has specified a set of guidelines for
55 ADL – Archetype Definition Language, how SNOMED-CT codes and concepts
OpenEHR Foundation should be used in conjunction with the
55 CDL – Constraint Definition Language, HL7 RIM to represent data sent in HL7
General Electric and Intermountain Version 3 messages.
Health care –– HL7 Clinical Information Modeling
55 MIF – Model Interchange Format, Health Initiative – A formerly independent
Level Seven International Inc. group now organized as and HL7
55 OWL – Web Ontology Language, World Workgroup is chartered to develop
Wide Web Consortium implementable clinical information
models.
All languages used for clinical modeling need 55 The openEHR Foundation is developing
to accomplish at least two major things: they models based on a core reference model
need to show the “logical” structure of the and the Archetype Definition Language.
data, and they need to show how sets of codes This approach has been adopted by several
from standard terminologies participate in national health information programs.
the logical structure. Defining the logical 55 EN 13606 is developing models based on
structure is simply showing how the named the ISO/CEN 13606 standard and core
parts of a model relate to one another. Model reference model.
elements can be contained in other elements, 55 The US Veterans Administration (VA) is
creating hierarchies of elements. It is also creating models for integrating data across
important to specify which elements of the all VA facilities and for integration with
model can occur more than once (cardinal- military hospitals that are part of the US
ity), which elements are required, and which Department of Defense. The modeling is
are optional. Terminology binding is the act done primarily using Unified Modeling
of creating connections between the elements Language.
in a model and concepts in a coded terminol- 55 US Department of Defense is creating
ogy. For each coded element in a model, the models for integrating data across all DoD
set of allowed values for the coded element facilities and for integration with VA
220 C. Jaffe et al.
facilities. The modeling is done primarily are often used interchangeably by creators
using Unified Modeling Language. of coding systems and by authors discussing
55 The National Health Service in the United the subject. Fortunately, although there are
Kingdom (UK) is developing the Logical few accepted standard terminologies, there
Record Architecture to provide models for is a generally accepted standard about termi-
interoperability across all health care nology: ISO Standard 1087 (Terminology—
facilities in the UK. The modeling is done Vocabulary).
primarily using UML. Finally, we should consider the methods by
55 Clinical Element Models – Intermountain which the terminology is maintained. Every
Health care and General Electric have standard terminology must have an ongoing
created a set of detailed clinical models maintenance process, or it will rapidly become
using a core reference model and obsolete. The process must be timely and must
Constraint Definition Language. The not be too disruptive to people using an older
models are free-for-use and are available version of the terminology. For example, if the
for download from the Internet. creators of the terminology choose to rename
7 55 SHARE Models – CDISC is creating a code, what happens to the data previously
models to integrate data collected as part recorded with that code?
of clinical trials.
55 SMART Team – This group at Boston
Children’s Hospital is defining standard 7.4.4 Specific Terminologies
application programming interfaces
(APIs) for securely connecting with EHRs. With these considerations in mind, let us sur-
55 Clinical Information Modeling Initiative vey some of the available controlled terminol-
(CIMI) – This is an international ogies. There are introductory descriptions of a
consortium that has the goal of establishing few current and common terminologies. New
a free-for-use repository of detailed terminologies appear annually, and existing
clinical models, where the models are proprietary terminologies often become pub-
expressed in a single common modeling licly available. When reviewing the following
language with explicit bindings to standard descriptions, try to keep in mind the back-
terminologies. ground motivation for a development effort.
55 OMOP – The OMOP Common Data All these standards are evolving rapidly, and
Model enables the systematic analysis of one should consult the Web sites or other pri-
disparate observational claims-based data. mary sources for the most recent information.
The initial approach was to transform data
contained within claims databases into a
7.4.4.1 International Classification
common format (data model) as well as a of Diseases and Its Clinical
common representation (terminologies, Modifications
vocabularies, coding schemes). OMOP has One of the most recognized terminologies is
been primarily used by the OHDSI the International Classification of Diseases
community (see 7 Sect. 7.3.2.16), but is (ICD). First published in 1893, it has been
being examined by other consortia seeking revised at roughly 10-year intervals, first by
to share patient-level clinical information. the Statistical International Institute and later
by the World Health Organization (WHO).
The Ninth Edition (ICD-9) was published in
1977 (World Health Organization 1977) and
7.4.3 Vocabularies, Terminologies, the Tenth Edition (ICD-10) in 1992 (World
and Nomenclatures Health Organization 1992). The ICD-9 cod-
ing system consists of a core classification
In discussing coding systems, the first step is to of three-digit codes that are the minimum
clarify the differences among a terminology, a required for reporting mortality statistics
vocabulary, and a nomenclature. These terms to WHO. A fourth digit (in the first decimal
Standards in Biomedical Informatics
221 7
place) provides an additional level of detail; therapy” (75894). Although limited in scope
usually .0 to .7 are used for more specific forms and depth (despite containing over 8000 terms),
of the core term, .8 is usually “other,” and .9 CPT-4 is the most widely accepted nomencla-
is “unspecified.” Codes in the ICD-10 coding ture in the United States for reporting physi-
system start with an alpha character and con- cian procedures and services for federal and
sist of three to seven characters. In both sys- private insurance third-party reimbursement.
tems, terms are arranged in a strict hierarchy,
based on the digits in the code. In addition to 7.4.4.3 Diagnostic and Statistical
diseases, ICD includes several “families” of Manual of Mental Disorders
terms for medical-specialty diagnoses, health The American Psychiatric Association pub-
status, disease-related events, procedures, and lished the Fifth Edition of the Diagnostic
reasons for contact with health care providers. and Statistical Manual of Mental Disorders
In June of 2018, the World Health (DSM-5) in May 2013 (American Psychiatric
Organization (WHO) replaced ICD-10 with Association Committee on Nomenclature
ICD-11, with the intention to make a more and Statistics (2013)). DSM-5 is the standard
comprehensive and computable version that classification of mental disorders used by
could be maintained without the need for mental health professionals in the U.S. and
major version changes in the future. ICD-11 contains a listing of diagnostic criteria for
includes many new features, such as a semantic every psychiatric disorder recognized by the
network, a polyhierarchy, a formal information U.S. healthcare system.9 The previous edition,
model, and a set of "linearization" that con- DSM-IV, was originally published in 1994
strain the terms in strict hierarchies, designed to and revised in 2000 as DSM-IV-TR. DSM-5
support various functions (Chute (2018)). The is coordinated with ICD-10.
US National Committee for Vital and Health
Statistics (NCVHS) is currently considering the 7.4.4.4 SNOMED Clinical Terms
timing and method for transition to ICD-11, so and Its Predecessors
close on the 2015 adoption of ICD-10. Drawing from the New York Academy
of Medicine’s Standard Nomenclature of
7.4.4.2 Current Procedural Diseases and Operations (SNDO) (Plunkett
Terminology 1952; Thompson and Hayden 1961; New York
The American Medical Association developed Academy of Medicine 1961), the College
the Current Procedural Terminology (CPT) in of American Pathologists (CAP) developed
1966 (American Medical Association, updated the Standard Nomenclature of Pathology
annually) to provide a pre coordinated cod- (SNOP) as a multiaxial system for describ-
ing scheme for diagnostic and therapeutic ing pathologic findings (College of American
procedures that has since been adopted in the Pathologists 1971) through post-coordination
United States for billing and reimbursement. of topographic (anatomic), morphologic,
Like the DRG codes, CPT codes specify infor- etiologic, and functional terms. SNOP has
mation that differentiates the codes based on been used widely in pathology systems in the
cost. For example, there are different codes for United States; its successor, the Systematized
pacemaker insertions, depending on whether Nomenclature of Medicine (SNOMED) has
the leads are “epicardial, by thoracotomy” evolved beyond an abstracting scheme to
(33200), “epicardial, by xiphoid approach” become a comprehensive coding system.
(33201), “transvenous, atrial” (33206), “trans- Largely the work of Roger Côté and David
venous, ventricular” (33207), or “transvenous, Rothwell, SNOMED was first published in
atrioventricular (AV) sequential” (33208). CPT 1975, was revised as SNOMED II in 1979,
also provides information about the reasons and then greatly expanded in 1993 as the
for a procedure. For example, there are codes
for arterial punctures for “withdrawal of blood
for diagnosis” (36600), “monitoring” (36620),
9 7 https://www.psychiatry.org/psychiatrists/prac-
“infusion therapy” (36640), and “occlusion tice/dsm (last accessed 12/2/2019)
222 C. Jaffe et al.
Metathesaurus, which contains over 8.9 mil- 7.5 Data Interchange Standards
lion terms collected from over 160 different
sources (including many of those that we have The recognition of the need to interconnect
discussed), and attempts to relate synony- health care applications led to the develop-
mous and similar terms from across the dif- ment and enforcement of data interchange
ferent sources into over 2.6 million concepts standards. The conceptualization stage began
(. Fig. 7.6). . Figure 7.7 lists the preferred in 1980 with discussions among individu-
names for many of the pneumonia concepts als in an organization called the American
in the Metathesaurus; . Fig. 7.8 shows how Association for Medical Systems and
like terms are grouped into concepts and are Informatics (AAMSI). In 1983, an AAMSI
tied to other concepts through semantic rela- task force was established to pursue those
tionships. . Figure 7.9 shows some of the interests in developing standards.
information available in the Unified Medical The development phase was multifaceted.
Language System about selected pneumonia The AAMSI task force became subcommittee
concepts. E31.11 of the ASTM and developed and pub-
Standards in Biomedical Informatics
227 7
Respiratory Tract Diseases lished ASTM standard 1238 for the exchange
Lung Diseases of clinical-laboratory data. Two other groups
Pneumonia were formed to develop standards, each with a
Bronchopneumonia
Pneumonia, Aspiration slightly different emphasis: HL7 and Institute
Pneumonia, Lipid of Electrical and Electronics Engineering
Pneumonia, Lobar (IEEE) Medical Data Interchange (“Medix”)
Pneumonia, Mycoplasma Standard. The American College of Radiology
Pneumonia, Pneumocystis carinii (ACR) joined with the National Electronic
Pneumonia, Rickettsial
Pneumonia, Staphylococcal Manufacturers Association (NEMA) to
Pneumonia, Viral develop a standard for the transfer of image
Lung Diseases, Fungal data. Two other groups developed related
Pneumonia, Pneumocystis carinii standards independent of the biomedical
Respiratory Tract Infections informatics community: (1) ANSI X12 for
Pneumonia
Pneumonia, Lobar the transmission of commonly used business
Pneumonia, Mycoplasma transactions, including health care claims and
Pneumonia, Pneumocystis carinii benefit data, and (2) National Council for
Pneumonia, Rickettsial Prescription Drug Programs (NCPDP) for
Pneumonia, Staphylococcal
the transmission of third-party drug claims.
Pneumonia, Viral
Lung Diseases, Fungal
Pneumonia, Pneumocystis carinii
7.5.1 General Concepts
.. Fig. 7.6 Partial tree structure for the Medical Sub- and Requirements
ject Headings showing pneumonia terms. Note that
terms can appear in multiple locations, although they
The purpose of a data-interchange standard
may not always have the same children, implying that
they have somewhat different meanings in different con- is to permit one system, the sender, to trans-
texts. For example, Pneumonia means “lung inflamma- mit to another system, the receiver, all the
tion” in one context (line 3) and “lung infection” in data required to accomplish a specific com-
another (line 16) munication, or transaction set, in a precise,
unambiguous fashion. To complete this task
successfully, both systems must know what
format and content is being sent and must
understand the words or terminology, as well
as the delivery mode.
A communications model, called the
Open Systems Interconnection (OSI) refer-
ence model (ISO 7498–1), has been defined
by the ISO (see 7 Chap. 5 and the discussion
of software for network communications). It
describes seven levels of requirements or spec-
ifications for a communications exchange:
physical, data link, network, transport, ses-
sion, presentation, and application (Rose
1989; Stallings 1987; Tanenbaum 1987). Level
.. Fig. 7.7 Growth of the UMLS. The UMLS Metath- 7, the application level, deals primarily with
esaurus contains 3.85 million concepts and 14.6 million
the semantics or data-content specification
unique concept names from 210 source vocabularies.
The content continues to grow dynamically in response of the transaction set or message. For the
to user needs (Source: U.S. National Library of data-interchange standard, HL7 requires the
Medicine) definition of all the data elements to be sent in
228 C. Jaffe et al.
response to a specific task, such as the admis- this level across the various standards bodies.
sion of a patient to a hospital. In many cases, Two philosophies are used for defining syntax:
the data content requires a specific terminol- one proposes a position-dependent format;
ogy that can be understood by both sender the other uses a tagged-field format. In the
and receiver. position-dependent format, the data content is
Presentation, the sixth level of specified and defined by position.
Interoperability, addresses the syntax of The remaining OSI levels—session, trans-
the message, or how the data are formatted. port, network, data link, and physical—gov-
There are both similarities and differences at ern the communications and networking
Standards in Biomedical Informatics
229 7
.. Fig. 7.9 Some of the Bacterial pneumonia
information available in the Source: CSP93/PT/259S.5280; DOR27/DT/U000523;
Unified Medical Language ICD91/PT/482.9; ICD91/IT/482.9
System about selected Parent: Bacterial Infections; Pneumonia; Influenza with Pneumonia
pneumonia concepts. Concept’s Child: Pneumonia, Mycoplasma
preferred names are shown in Narrower: Pneumonia, Lobar; Pneumonia, Rickettsial; Pneumonia,
italics. Sources are identifiers for Staphylococcal; Pneumonia due to Klebsiella pneumoniae;
the concept in other Pneumonia due to Pseudomonas; Pneumonia due to Hemophilus
influenzae
terminologies. Synonyms are
Other: Klebsiella pneumoniae, Streptococcus pneumonlae
names other than the preferred
name. ATX is an associated Pneumonia, Lobar
Medical Subject Heading Source: ICD91/IT/481 ; MSH94/PM/D011018; MSH94/MH/D011018;
SNM2/RT/M-40000; ICD91/PT/481 ; SNM2/PT/D-0164;
expression that can be used for
DXP92/PT/U000473; MSH94/EP/D011018;
Medline searches. The remaining
INS94/MH/D011018;1NS94/SY/D011018
fields (Parent, Child, Broader, Synonym: Pneumonia, diplococcal
Narrower, Other, and Semantic) Parent: Bacterial Infections; Influenza with Pneumonia
show relationships among Broader: Bacterial Pneumonia; Inflammation
concepts in the Metathesaurus. Other: Streptococcus pneumoniae
Note that concepts may or may Semantic: inverse-is-a: Pneumonia
not have hierarchical relations has-result: Pneumococcal Infections
to each other through Pneumonia, Staphylococcal
Parent–Child, Broader– Source: ICD91/PT/482.4; ICD91/IT/482.4; MSH94/MH/D011023;
Narrower, and Semantic MSH94/PMID011023; MSH94/EP/D011023; SNM2/PT/D-017X;
(is-a and inverse is-a) relations. INS94/MH/D011023; INS94/SY /D011023
Note also that Pneumonia, Parent: Bacterial Infections; Influenza with Pneumonia
Streptococcal and Pneumonia Broader: Bacterial Pneumonia
due to Streptococcus are treated Semantic inverse-is-a: Pneumonia; Staphylococcal Infections
as separate concepts, as are Pneumonia, Streptococcal
Pneumonia in Anthrax and Source: ICD91/IT/482.3
Pneumonia, Anthrax Other: Streptococcus pneumoniae
Pneumonia due to Streptococcus
Source: ICD91/PT/482.3
ATX: Pneumonia AND Streptococcal Infections AND NOT Pneumonia, Lobar
Parent: lnftuenza with Pneumonia
Pneumonia in Anthrax
Source: ICD91/PT/484.5; ICD91/IT/022.1 ; ICD91/IT/484.5
Parent: Influenza with Pneumonia
Broader: Pneumonia in other infectious diseases classified elsewhere
Other: Pneumonia, Anthrax
Pneumonia, Anthrax
Source: ICD91/IT/022.1; ICD91/IT/484.5
Other: Pneumonia in Anthrax
protocols and the physical connections made data segments; each data segment consists of
to the system. Obviously, some understanding one or more data fields. Data fields, in turn,
at these lower levels is necessary before a link- consist of data elements that may be one of
age between two systems can be successful. several data types. The message must identify
Increasingly, standards groups are defining the sender and the receiver, the message num-
scenarios and rules for using various proto- ber for subsequent referral, the type of mes-
cols at these levels, such as TCP/IP. Much of sage, special rules or flags, and any security
the labor in making existing standards work requirements. If a patient is involved, a data
lies in these lower levels. segment must identify the patient, the circum-
Typically, a transaction set or message is stances of the encounter, and additional infor-
defined for a particular event, called a trig- mation as required. A reply from the receiving
ger event. This trigger event, such as a hos- system to the sending system is mandatory in
pital admission, then initiates an exchange of most circumstances and completes the com-
messages. The message is composed of several munications set.
230 C. Jaffe et al.
It is important to understand that the sole new governmental policies, new science (both
purpose of the data-interchange standard is to clinical and pre-clinical), new care paradigms,
allow data to be sent from the sending system and new models for payment models.
to the receiving system; the standard is not
intended to constrain the application system Compendium of HL7 Standards
that uses those data. Application indepen- zz Introduction to HL7 Standards
dence permits the data-interchange standard HL7 provides a framework, as well as related
to be used for a wide variety of applications. standards, for the exchange, integration,
However, the standard must ensure that it sharing, and retrieval of electronic health
accommodates all data elements required by information. These standards define how
the complete application set. information is packaged and communicated
from one party to another, setting the lan-
guage, structure and data types required for
7.5.2 pecific Data Interchange
S seamless integration between systems. HL7
Standards standards support clinical practice and the
7 management, delivery, and evaluation of
As health care increasingly depends on health services, and are recognized as the most
the connectivity within an institution, an commonly used in the world.
enterprise, an integrated delivery system, a 7 http://www.h l7.o rg/implement/stan-
geographic system, or even a national inte- dards/index.cfm?ref=nav
grated system, the ability to interchange
data in a seamless manner becomes criti- zz HL7 Primary Standards
cally important. The economic benefits of Primary standards are the most widely
data-interchange standards are immediate implemented standards and are fundamen-
and obvious. Consequently, it is in this area tal for system integrations, inter-operability
of healthcare standards that most effort has and compliance. The most frequently used
been expended. All of the SDOs in health standards are defined in this category
care have some development activity in data- 7 h t t p : / / w w w.h l 7 .o rg / i m p l e m e n t /
interchange standards. standards/product_section.c fm?section
The following sections summarize many =1&ref=nav
of the current standards for data-interchange.
Examples are provided to give you a sense zz HL7 Foundational Standards
of the technical issues that arise in defining Foundational standards define the fundamen-
a data-exchange standard, but details are tal tools and building blocks used to create the
beyond the scope of this effort. In fact, the standards, and the technology infrastructure
pace of change is so great that many of the that implementers of HL7 standards must
referenced standards will have been improved manage.
at the time of publication. Rather than pro- 7 h t t p : / / w w w.h l 7 .o rg / i m p l e m e n t /
viding an exhaustive list of standards, links standards/product_section.c fm?section
to the standards and standards platforms will =2&ref=nav
provide access to the most recent technical
information and its implementation. zz HL7 Clinical & Administrative Domains
Messaging and document standards for clini-
7.5.2.1 HL7 Standards cal specialties and groups are found in this
HL7 has provided standards that have been section. These standards are usually imple-
adopted world-wide. In the United States and mented once primary standards for the orga-
in many other countries, these standards are nization are operational.
codified in legislation and in regulation. The 7 http://www.h l7.o rg/implement/stan-
changes in these standards most often reflect dards/product_section.cfm?section=3
Standards in Biomedical Informatics
231 7
zz HL7 EHR Profiles ence. There is also an explanation of the IP
These standards provide functional models Policy that provides more information about
and profiles that enable the constructs for how members and non-members can use the
management of electronic health records. standard
EHR System Functional Model (EHR-S FM) 7 http://www.h l7.o rg/implement/stan-
outlines important features and functions that dards/product_matrix.cfm?ref=nav
should be contained in an EHR system.
7 http://www.h l7.o rg/implement/stan- Clinical Document Architecture
dards/product_section.cfm?section=4 Since its initial development in 2001, the
Clinical Document Architecture (CDA)
zz HL7 Implementation Guides standard has become globally adopted for a
Implementation guides and their supporting broad range of use (Ferranti et al. 2006). Now
documents are intended to be used in conjunc- an ISO standard and advanced to Release
tion with an existing standard. The support- 2, CDA is a document markup standard for
ing documents serve as supplemental material the structure and semantics of an exchanged
for a parent standard. Implementation guides “clinical document.” CDA is built upon the
provide the road map for transforming the RIM and relies upon reusable templates for
technical standard into an effective working its ease of implementation. A CDA document
solution. is a defined and complete information object
7 http://www.h l7.o rg/implement/stan- that can exist outside of a message and can
dards/product_section.cfm?section=5 include text, images, sounds, and other mul-
timedia content. CDA supports the following
zz HL7 Standards Rules & References features: persistence, stewardship, potential
These references provide the technical speci- for authentication, context, wholeness, and
fications, programming structures and guide- human-readability. In the US, CDA is one
lines for software and standards development. of the core components of data exchange for
They are not stand alone solutions, but rather Meaningful Use. The competing implementa-
provide support for a standard or for a family tion processes for CCD profile development
of standards. were successfully harmonized into a broadly
7 http://www.h l7.o rg/implement/stan- adopted Consolidated Continuity of Care
dards/product_section.cfm?section=6 Document (CCCD).
In order to ease the path to implementa-
zz HL7 Current Projects & Education tion of CDA, HL7 has developed a more nar-
This is a resource for Standards for Trial Use rowly defined specification called greenCDA,
(STUs) and for ongoing projects and stan- which limits the requirements of the RIM,
dards. The link also provides helpful resources provides greater ease of template composi-
and tools to further supplement understand- tion, and consumes much less bandwidth for
ing and adoption of HL7 standards. transmission. An additional effort to promote
7 http://www.h l7.o rg/implement/stan- CDA adoption was achieved with the release
dards/product_section.cfm?section=7 of the CDA Trifolia repository, which, in
addition to offering a library of templates,
zz HL7 Standards Master Grid includes tooling for template modification as
This is a convenient navigation tool for all well as a template-authoring language. This
HL7 standards. Because HL7 encompasses has enabled the adoption of native CDA for
the complete life cycle of a standards specifi- exchange of laboratory data, clinical summa-
cation, including the development, adoption, ries, and electronic prescriptions and well as
market recognition, utilization, and adher- for clinical decision support.
232 C. Jaffe et al.
EDIFACT is independent of the machine, and formats became essential. This led to the
media, system, and application and can be development of claims attachment standards
used with any communications protocol or (see X12, above) that enabled more complex
with physical magnetic tape. adjudication, comparative effectiveness, and
Link: 7 https://www.edistaffing.com/ accountable care. These standards will most
resources/unedifact-standards/ certainly require structured, coded data rather
than free-text and unstructured narrative.
Complexity of data requirements is con-
7.6 Today’s Reality stantly growing to better support evidence-
and Tomorrow’s Directions based medicine, clinical decision support,
personalized medicine, and accountable care.
In the current environment, the seamless Each of these has overlapping, but fundamen-
exchange of data that can be used for any tally unique data streams. Moreover, the data
purpose remains a challenge. As we move provided at the point of care, if unfiltered, is
closer to sematic interoperability (7 https:// likely to overwhelm the clinical decision mak-
7 en.w ikipedia.o rg/wiki/Semantic_interoper- ing process. Elements of clinical data, such
ability) for healthcare and biomedical data, as events in pediatric years, must not com-
the challenge of true plug-and-play interoper- pete for the attention of the caregiver. To an
ability is still elusive. The meaning of many extent, this was solved with specifications,
concepts and terms remains ambiguous, con- such as FHIRcast, which were developed to
troversial, disputed, or poorly understood. provide context aware data to that process.
Instead, the standards community has built There are growing demands for increasing the
a system of interchange that often requires depth and breadth of data delivered to that
mapping between terminologies and stan- clinical environment. In addition, these stan-
dards to overcome these issues. dards must support the implicit policy deci-
sions about the nature of this data.
To date, clinical and preclinical informa-
7.6.1 he Interface: Standards
T tion populates many of the alerts that clini-
and Systems cians receive at the point of care. Typically,
these range from information supporting
Historically, interchange standards evolved complex decision trees to the selection of test-
to support sharing of information over com- ing and interventions. This has been abetted
plex networks of distributed systems. This by increasing knowledge of genomic data
served a simple business model in which data and implication for therapeutic decisions.
was pushed from disparate repositories with Although this has had its greatest impact on
inconsistent architectures and data structures. the chemotherapy of cancer, the importance
This permitted the exchange of data for both in many other clinical domains, for more com-
business needs and patient care. mon conditions (including the treatment of
In today’s medical environment, there are diabetes, hypertension, and arthritis) is now
several competing forces that place a burden recognized. Current architectural systems are
on standards requirements. The traditional ill-prepared to manage this process. Moreover,
scope of data sources included business level data formats for genomic and genetic infor-
information, principally for payment needs. mation are disparate and often incompatible.
These were developed utilizing coding meth- Data privacy requirements, and the vari-
odologies and business architecture that did ability of these requirements among legal
not rely upon inclusion of primary clinical entities, currently pose a different set of
data into the reimbursement decision. With demands for information access technologies.
the advent of statutory requirements that For example, some states permit line-item
demand justification of insurance claims exclusion of clinical data that is transferred
and reimbursement, additional data forms between providers, based on the primacy of
Standards in Biomedical Informatics
235 7
the information and the role of the caregiver. tions. Perhaps justification for that lies in the
Other jurisdictions allow participation of fact that this patient derived data is neither
health information exchanges to those indi- quantifiable nor codeable. This is supported
viduals who agree only to dissemination of by valid concerns about the patient’s health
data from complete sources. care literacy, or lack thereof, but is no less
Existing data architectures enable a required than validated decision support for
constant stream of data to be passed in an caregivers.
untended and unmonitored fashion. In evolv- Data obtained from clinical research
ing models, data request and acknowledge- and clinical data provided to inform clinical
ment require a more complex query and studies suffer from other concerns of failed
response logic. In fact, most inquiries demand interoperability. This is attributed, and right-
the validation of the provider system and fully so, to disparities of terminologies uti-
privileges that are afforded to both the care- lized for patient care and those used in clinical
giver and the primary data repository. This research. This is most dramatically highlighted
places another component of interface design in the terminology deployed by regulation for
between the respective systems and necessi- adverse event reporting (MedDRA; Medical
tated the development of analogous provider Dictionary of Regulatory Affairs). Mapping
indices and provider repositories. Concerns between the MedDRA dictionary and other
of both privacy and security must be met by clinical terminologies (SNOMED- CT,
these specifications. The system effectively LOINC, ICD, and CPT) has not proven suc-
asks not only who you are but why you want cessful. Moreover, many aspects pertaining
the information. to study subject inquiries in clinical research
Much of this process overhead has been are often designed to elicit yes-no responses
addressed by the design and architecture of (Have you smoked in the last 5 years), rather
health information exchanges. Often the busi- than data that many caregivers deem relevant.
ness case supersedes the demand for clinical Yet, today, it is more critical than ever to
knowledge. At the same time, these exchanges enable clinical research to inform patient care
are designed to behave in an entirely agnos- and care derived data to enable clinical trials.
tic fashion, placing no demand on either the The business model of developing drugs for
sender or recipient for data quality, other than billon dollar markets (“blockbuster drugs”)
source identification. In fact, the metadata, so has proven itself to be unsustainable, as the
responsible for the value of the information, cost of developing a new drug entity has now
is often capable of specifying only its origin exceeded a billion dollars. From the clinical
and value sets. perspective, current estimates suggest that
In today’s clinical environment, there has information from basic science research expe-
been very little attention paid to the cap- riences delays of nearly 17 years before that
ture and validation of patient-initiated data. knowledge can be incorporated into clinical
While so very critical to diagnosis and ongo- care (Balas and Boren 2000).
ing management, only scant standards exist Semantic interoperability of clinical data
for embedding patient derived information inherently requires data reuse. It is not suffi-
into the clinical record without intermediate cient for systems to unambiguously exchange
human interaction and adjudication. When machine readable data. Data, once required
allowed by current systems, data provided only for third party payment, must be shared
by patients often lies within the audit trail, by other partners in the wellness and health
as a comment, rather than in the record as care delivery ecosystems. Certainly, these
source data. Steps are sorely needed to define data must be presented to research systems,
and attribute such data since it is so critical as noted above. The data must also be avail-
to many aspects of accountable care. Data able for public health reporting and analy-
obtained directly from patient sources is often sis, for comparative effectiveness research,
attributed to “subjective” status, but it is no for accountable care measurement and for
less objective that many clinician observa- enhancement of decision support systems
236 C. Jaffe et al.
(including those for patients and their fami- the EHR. The new emphasis on translational
lies). The immediate beneficiaries have been informatics will require new standards for the
systems developed to support biosurveillance transport, inclusion into the EHR, and use of
and pharmacovigilance. In practical terms, genetic information including genes, biomark-
the business practices that govern our delivery ers, and phenotypic data. Imaging, videos,
systems (and the government policies and reg- waveforms, audio, and consumer-generated
ulations that enable them) must enable these data will require new types of standards.
data streams to both enhance care and control Effective use of these new types of data as
costs. well as exponential increases in the volume of
data will require standards for decision sup-
port, standards for creating effective filters
7.6.2 Future Directions for presentation and data exchange, and new
forms of presentation including visualization.
The new models for health care require a New sources of data will include geospatial
very different approach. The concept of a coding, health environmental data, social
7 patient-centric EHR (7 Chap. 19) requires and community data, financial data, and cul-
the aggregation of any and all data created tural data. Queries and navigation of very
from, for, and about a patient into a single large databases will require new standards.
real or virtual record that provides access Establishing quality measures and trust will
to the required data for effective care at the require new standards. Ensuring integrity and
place and time of care. Health information trust as data is shared and used by other than
exchange (HIE; see 7 Chap. 20) at regional, the source of data will require new standards
state, national, and potentially at a global addressing provenance and responsibility.
level is now the goal. This goal can only be The third trend area is the use of mobile
reached through the effective use of infor- devices, smart devices, and personal health
mation technology, and that use can only be devices. How, when and where such devices
accomplished through the use of common should be used is still being explored.
global standards that are ubiquitously imple- Standards will be required for safe design,
mented across all sites of care. presentation, interface, integrity, and protec-
Three other future trends influence the tion from interference.
need for new and different standards. The first True global interoperability will require
is secondary use of data by multiple stake- a suite of standards starting with the plan-
holders. This requirement can only be met ning of systems, the definition and packaging
through semantic interoperability – a univer- of the data, collection of the data including
sal ontology that covers all aspects of health, usability standards, the exchange of data,
health care, clinical research, management, the storage and use of data, and a wealth of
and evaluation. Standards for expressing applications that enable the EHR for bet-
what is to be exchanged and under what cir- ter care. IT systems must turn data collected
cumstances are important as well as standards into information for use – a process that will
for the exchange of data. Included in multiple require the use of knowledge in real time with
uses of data is reporting to other organiza- data to produce information for patient care.
tions such as immunization and infectious Selecting the correct knowledge from litera-
disease reports to the Centers for Disease ture, clinical trials, and other forms of docu-
Control and Prevention (CDC), performance mentation will require standards. Knowledge
reports to the National Quality Forum (NQF) representation, indexing, and linkages will
and audit reports to The Joint Commission require standards.
(TJC). Such systems as described also enable A major, and challenging, requirement to
population health studies and health surveil- address these new types and use of data will
lance for natural and bioterrorism outbreaks. be effective standards for privacy and security.
The second trend area is the expansion of These standards must protect, but not restrict
the types of data that are to be included in the use of data and access to that data for
Standards in Biomedical Informatics
237 7
determining and giving the best care possible. frame. Other approaches to standards devel-
The aggregation of data requires an error-free opment, such as those focused upon services,
way for patient identification that will per- are rapidly evolving. These services-aware
mit merger of data across disparate sources. architectures are governed by strict develop-
Sharing data also requires standards for the ment principles that help ensure both interop-
de-identification of patient data. erability and the ability of components to be
The effective management of all of these reused.
resources will require additional output from Increasingly large data stores (“big data”)
the standards communities. Standards for have demanded some of these changes. These
defining the required functionality of systems data have emanated from a highly diverse
and ways for certifying adherence to required universe of scientific development. In fact,
functionality is essential for connecting a some of the new bio-analytic platforms for in
seamless network of heterogeneous EHRs vitro cellular research are generating data at a
from multiple vendors. Testing of standards, rate, which by some estimates, is faster than
including IHE Connectathons before wide- the data can be analyzed. Medical images,
spread dissemination and perhaps mandated for which storage requirements are growing,
use of standards is critical to use and accep- must now be principally evaluated by human
tance. Standards for registries, standards for inspection. Newly evolving algorithms and
the rules that govern the sharing of research the technologies to support them, initially
data, standards for patient consent, and stan- developed for “star wars” type image analy-
dards for identification of people, clinical sis, are replacing radiologist and pathologists
trials, collaboration, and other similar areas for the establishment of diagnoses. These
are necessary. Profiles for use and applica- machines have proven to be faster and more
tion from the suite of standards are a neces- accurate than their human counterparts. In
sity. Detailed implementation guides are key the very near future, such instrumentation
to use and implementation of standards. will supplant medical scientists the same way
Tools that enable content population and use that comparable technologies replaced human
of standards are mandatory for easy use of inspection in the estimation of cell differen-
standards. tials for blood counts. These new technologies
Standards for these new and evolving busi- are demanding the development of specifica-
ness and social needs must be supported by tions and the terminologies to support them.
changes in standards development methodol- Tomorrow’s technologies will transition
ogies and harmonization. Legacy systems are from early vision through prototyping to com-
not easily discarded. Recommendations for mercial products in a more compressed life
complete replacement of existing standards cycle. A model for this process in biomedical
are neither politically expedient nor fiscally science was established with the emergence of
supportable. Currently, there is increasing the Human Genome Project (see 7 Chap. 11).
attention to new approaches to standards Within the next decade, routine genome deter-
development that speeds the creation process mination and archiving, as well as their appli-
and improves the quality of standards that cation to disease management, will require
are developed. These evolving development greatly enhanced solutions for data manage-
platforms pay appropriate homage to existing ment and analysis. Innovative strategies for
standards and leverage previously developed recognizing and validating biomarkers will
models of development and analysis. grow exponentially from the current stable of
The use of the FHIR may provide a much imaging and cell surface determinants. These
needed solution while relying upon historically data streams will require adaption of exist-
developed and refined interoperability specifi- ing decision support systems and compara-
cations, it hides the complexity of authoring tive effectiveness paradigms. Lastly, scientific
messages within the FHIR development pro- evidence supporting the diagnosis and man-
cess. This leads to more usable specifications, agement with the field of behavioral medi-
created in a dramatically abbreviated time cine will change the entire clinical spectrum
238 C. Jaffe et al.
and approach to evaluation and care. As we 37(4–5), 394–403. This article enumerates a set
emerge from the dark ages of behavioral med- of desirable characteristics for controlled ter-
icine, we will certainly require new systems for minologies in health care.
recognizing, diagnosing, naming and inter- Executive Office of the President; President’s
vening on behalf of our patients. Council of Advisors on Science and
In some sense, the development of stan- Technology (2010). Report to the President
dards is just beginning. The immediate future realizing the full potential of health information
years will be important to create effective technology to Improve healthcare for
organizations that include the right experts Americans: the path forward.
in the right setting to produce standards that Henderson, M. (2003). HL7 messaging. Silver
are in themselves interoperable. That goal still Spring: OTech Inc.. Description of HL7 V2
remains in the future. with examples. Available from HL7.
Holland, C., & Shostak, J. (2016). Implementing
nnSuggested Reading CDISC using SAS (2nd ed.). SAS Institute.
Abbey, L. M., & Zimmerman, J. (Eds.). (1991). ICD-10-CM 2020: The Complete Official
7 Dental informatics: Integrating technology into Codebook, AMA, 2019
the dental environment. New York: Springer. Institute of Medicine. (2003). Patient safety:
This text demonstrates that the issues of stan- Achieving a new standard for care. Washington,
dards extend throughout the areas of applica- DC: National Academy Press. Discusses
tion of biomedical informatics. The standards approaches to the standardization of collec-
issues discussed in this chapter for clinical tion and reporting of patient data.
medicine are shown to be equally pertinent for Jamie, G. (2019). Starting SNOMED: A begin-
dentistry. ner’s guide to the Snomed CT healthcare termi-
American Psychiatric Association Committee on nology. SNOMED.
Nomenclature and Statistics. (1994). Kahn, A. N., et al. (2006). Standardizing labora-
Diagnostic and statistical manual of mental tory data by mapping to LOINC. JAMIA,
disorders (4th ed.). Washington, DC: The 13(3), 353–355.
American Psychiatric Association. Argonaut NCPDP Standards-based Facilitator Model for
Project – HL7 FHIR, https://argonautwiki. PDMP White Paper, NCPDP, 2019 ISO
hl7.org/Main_Page. Technical Committee 215 Healthcare
Benson, T. (2012). Principles of health interopera- Informatics, https://en.wikipedia.org/wiki/
bility HL7 and SNOMED (2nd ed.). London: ISO/TC_215
Springer. This book presents a detail discus- New York Academy of Medicine. (1961). Standard
sion of the HL7 version 2 messaging standard nomenclature of diseases and operations (5th
and a detail presentation of SNOMED-CT. ed.). New York: McGraw-Hill.
Braunstein, M. (2018). Health informatics on Pianykh, O. (2011). Digital imaging and communi-
FHIR: How HL7’s new API is transforming cations in medicine (DICOM): A practical
healthcare. Springer. introduction and survival guide. Springer.
Boone, K. W. (2012). The CDATMbook. London: Richesson, R. L., & Andrews, J. E. (2012). Clinical
Springer. This book provides an excellent pre- research informatics. London: Springer. This
sentation of the HL7 Clinical Document book includes a discussion of standards and
Architecture and related topics. applications from the clinical research per-
Chute, C. G. (2000). Clinical classification and spectives.
terminology: some history and current obser- Stallings, W. (1987). Handbook of computer-
vations. Journal of the American Medical communications standards. New York:
Informatics Association, 7(3), 298–303. This Macmillan. This text provides excellent details
article reviews the history and current status on the Open Systems Interconnection model
of controlled terminologies in health care. of the International Standards Organization.
Cimino, J. J. (1998). Desiderata for controlled Stallings, W. (1997). Data and computer communi-
medical vocabularies in the twenty-first cen- cations. Englewood Cliffs: Prentice-Hall. This
tury. Methods of Information in Medicine, text provides details on communications
Standards in Biomedical Informatics
239 7
architecture and protocols and on local and of the 1998 AMIA Annual Fall Symposium (pp. 785–
wide area networks. 789). Orlando: American Medical Informatics
Association.
Chute, C. G. (2018). The rendering of human phenotype
??Questions for Discussion and rare diseases in ICD-11. Journal of Inherited
1. Who should be interested in Metabolic Disease, 41(3), 563–569.
interoperability and health data Coenen, A., McNeil, B., Bakken, S., Bickford, C.,
standards? Warren, J. J., & American Nurses Association
2. What are the five possible approaches to Committee on Nursing Practice Information
Infrastructure. (2001). Toward comparable nursing
accelerating the creation of standards? data: American Nurses Association criteria for data
3. Define five health care standards, not sets, classification systems, and nomenclatures.
mentioned in the chapter, which might Computers in Nursing, 19(6), 240–246.
also be needed? College of American Pathologists. (1971). Systematized
4. What role should the government play in nomenclature of pathology. Chicago: The College of
American Pathologists.
the creation of standards? Côté, R. A., & Rothwell, D. J. (1993). The systema-
5. At what level might a standard interfere tised nomenclature of human and veterinary medi-
with a vendor’s ability to produce a cine, 1993. Northfield: College of American
unique product? Pathologits.
6. Define a hypothetical standard for one Ferranti, J. M., Musser, R. C., Kawamoto, K., &
Hammond, W. E. (2006). The clinical document
of the areas mentioned in the text for architecture and the continuity of care record: A
which no current standard exists. critical analysis. Journal of the American Medical
Include the conceptualization and Informatics Association, 13(3), 245–252.
discussion points. Specifically state the Gottschalk, H. W. (1991). The medical information bus:
scope of the standard. Overview of the medical device data language.
Journal of Clinical Engineering, 16(4), 323–329.
Harris, M. A., for the Gene Ontology Consortium, et al.
(2004). The gene ontology (GO) database and infor-
matics resource. Nucleic Acids Research,
References 32(Database issue), D258–D256.
Hudson, K. L., & Collins, F. S. (2017). The 21st century
American Psychiatric Association Committee on cures act — A view from the NIH. The New England
Nomenclature and Statistics. (2013). Diagnostic and Journal of Medicine, 376(2), 111–113.
statistical manual of mental disorders (4th ed.). Humphreys, B. L. (Ed.). (1990). UMLS knowledge
Washington, DC: The American Psychiatric sources – First experimental edition documentation.
Association. Bethesda: National Library of Medicine.
American Society for Testing and Materials. (1999). International Standards Organization. (1987).
Standard guide for properties of a universal healthcare Information processing systems-concepts and termi-
identifier (UHID) (E1714-95.). West Conshohocken: nology for the conceptual schema and the information
American Society for Testing and Materials. base. (ISO TR 9007:1987). International Standards
Balas, E., & Boren, S. (2000). Managing clinical knowl- Organization.
edge for health care improvement. In J. H. van Jollis, J. G., Ancukiewicz, M., DeLong, E. R., Pryor,
Bemmel & M. C. AT (Eds.), Yearbook of medical D. B., Muhlbaier, L. H., & Mark, D. B. (1993).
informatics (pp. 65–70). Stuttgart: Schattauer Discordance of databases designed for claims pay-
Verlagsgesellschaft mbH. ment versus clinical information systems.
Becnel, L. B., Hastak, S., Ver Hoef, W., Milius, R. P., Implications for outcomes research. Annals of
Slack, M. A., Wold, D., Glickman, M. L., Brodsky, Internal Medicine, 119(8), 844–850.
B., Jaffe, C., Kush, R., & Helton, E. (2017). BRIDG: Libicki, M. C. (1995). Information technology standards:
A domain information model for translational and Quest for the common byte. Boston: Digital Press.
clinical protocol-driven research. Journal of the National Council for Prescription Drug Programs.
American Medical Informatics Association, 24(5), (1994). Data dictionary.
882–890. Nelson, S. J., Brown, S. H., Erlbaum, M. S., Olson, N.,
Braunstein, M. (2018). Health informatics on FHIR: Powell, T., Carlsen, B., Carter, J., Tuttle, M. S., &
How HL7’s new API is transforming healthcare. Hole, W. T. (2002). A semantic normal form for
Springer. clinical drugs in the UMLS: Early experience with
Campbell, K. E., Tuttle, M. S., & Spackman, K. A. the VANDF. In Proceedings of the AMIA Fall
(1998). A “lexically-suggested logical closure” met- Symposium (pp. 557–561). American Medical
ric for medical terminology maturity. In Proceedings Informatics Association.
240 C. Jaffe et al.
New York Academy of Medicine. (1961). Standard Spackman, K. A. (2000). SNOMED RT and
nomenclature of diseases and operations (5th ed.). SNOMEDCT. Promise of an international clinical
New York: McGraw-Hill. terminology. MD Computing, 17(6), 29.
NHS Centre for Coding and Classification. (1994). Read Spackman, K. A., Campbell, K. E., & Cote, R. A.
codes (Version 3) (April ed.). London: NHS (1997a). SNOMED RT: a reference terminology for
Management Executive, Department of Health. health care. In Proceedings of the AMIA Annual
Odigie, E., Lacson, R., Raja, A., Osterbur, D., Ip, I., Fall Symposium (pp. 640–644). American Medical
Schneider, L., & Khorasani, R. (2019). Fast health- Informatics Association.
care interoperability resources, clinical quality lan- Spackman, K. A., Campbell, K. E., et al. (1997b).
guage, and systematized nomenclature of SNOMED-RT: A reference terminology for health
medicine—Clinical terms in representing clinical care. In D. R. Masys (Ed.), Proceedings, AMIA
evidence logic statements for the use of imaging pro- annual fall symposium (pp. 640–644). Philadelphia:
cedures: Descriptive study. JMIR Medical Hanley and Belfus.
Informatics, 7(2), e13590. Spineth, M., Rappelsberger, A., & Adlassnig, K. P.
Park, H. A., & Cho, I. (2009). Education, practice, and (2018). Implementing CDS hooks communication
research in nursing terminology: Gaps, challenges, in an Arden-syntax-based clinical decision support
and opportunities. Yearbook of Medical Informatics, platform. Studies in Health Technology and
103–108. Informatics, 255, 165–169.
7 Payne, T. H., Corley, S., Cullen, T. A., Gandhi, T. K.,
Harrington, L., Kuperman, G. J., Mattison, J. E.,
Stallings, W. (1987). The open systems interconnection
(OSI) model and OSI-related standards (Vol. 1).
McCallie, D. P., McDonald, C. J., Tang, P. C., New York: Macmillian.
Tierney, W. M., Weaver, C., Weir, C. R., & Stead, W. W., Kelly, B. J., & Kolodner, R. M. (2005).
Zaroukian, M. H. (2015). Report of the AMIA Achievable Steps toward Building a National Health
EHR-2020 Task Force on the status and future Information Infrastructure in the United States.
direction of EHRs. Journal of the American Medical Journal of the American Medical Informatics
Informatics Association, 22(5), 1102–1110. Association, 12(2), 113–120.
Plunkett, R. J. (1952). Standard nomenclature of diseases Tanenbaum, A. S. (1987). Computer networks (2nd ed.).
and operations (4th ed.). New York: McGraw-Hill. Englewood Cliffs: Prentice Hall.
Rector, A. L., Glowinski, A. J., Nowlan, W. A., & Rossi- Thompson, E. T., & Hayden, A. C. (1961). Standard
Mori, A. (1995). Medical-concept models and medi- nomenclature of diseases and operations (5th ed.).
cal records: An approach based on GALEN and New York: McGraw-Hill.
PEN & PAD. Journal of the American Medical World Health Organization. (1992). International clas-
Informatics Association: JAMIA, 2(1), 19–35. sification of diseases index. Tenth revision (Vol.
Rose, M. T. (1989). The open book: A practical perspec- Tabular list, Vol. 1). Geneva: The World Health
tive on OSI. Upper Saddle River: Prentice Hall. Organization.
Simborg, D. W., Chadwick, M., Whiting-O’Keefe, Q. E., World Health Organization. (1977). International clas-
Tolchin, S. G., Kahn, S. A., & Bergan, E. S. (1983). sification of diseases index. Ninth revision. Geneva:
Local area networks and the hospital. Computers The World Health Organization.
and Biomedical Research, 16(3), 247–259.
241 8
Natural Language
Processing for
Health-Related Texts
Dina Demner-Fushman, Noémie Elhadad, and Carol Friedman
Contents
References – 269
Natural Language Processing for Health-Related Texts
243 8
nnLearning Objectives tics and variability of language. While in most
After reading this chapter, you should know automated applications, structured, standard-
the answers to these questions: ized data are readily available for process-
55 What are the potential uses for NLP in ing, there is a significant amount of manual
the biomedical, clinical, and health work currently devoted to mapping textual
domains? information to coded representations in bio-
55 What are the principal computational medicine and health care: Professional coders
tasks of NLP for health-related texts? assign billing codes corresponding to diag-
55 What are the different knowledge noses and procedures to hospital admissions
resources and linguistic representations based on discharge summaries and admission
that can support the development of information; indexers at the National Library
NLP techniques? of Medicine assign MeSH (Medical Subject
55 What are the near-future directions for Headings) terms to represent the main top-
health-related NLP research and appli- ics of scientific articles; and database curators
cations? extract genomic and phenotypic information
on organisms from the literature. Because of
the overwhelmingly large amount of textual
8.1 Motivation information in health domains, manual work
is costly, time-consuming, and impossible to
Language is the primary means of human keep up to date. One aim of Natural Language
communication, and it is no surprise clini- Processing (NLP) is to facilitate these tasks by
cians, biomedical researchers, patients, and enabling use of automated methods with high
health consumers alike rely on language validity and reliability.
extensively. Clinicians document the care of Another aim of NLP is to help advance
their patients in the electronic health record many of the fundamental aims of biomedical
and use patient record notes to determine next informatics, which are the discovery and vali-
steps of care. Biomedical scientists write and dation of scientific knowledge, improvement
read articles to keep abreast of research prog- in the quality and cost of health care, and
ress from their peers. Patients rely on online support to patients and health consumers.
platforms to learn from and exchange infor- The considerable amounts of texts amassed
mational and emotional support from their through clinical care, published in the scien-
peers, and health consumers view health con- tific biomedical literature, and discussed by
tent online as a primary source of informa- patients and caregivers online can help acquire
tion to manage their health and increase their new knowledge and promote discovery of new
health literacy. In fact, there are continuously phenomena. For instance, the information in
growing, unprecedented amounts of these patient notes, while not originally entered for
biomedical and health-related texts available. discovery purposes, but rather for the care of
The field of health and biomedical natu- individual patients, can be processed, aggre-
ral language processing is concerned with gated and mined to discover patterns across
the theories, principles, and computational patients. This process of leveraging obser-
approaches to building tools that exploit these vational health data has shown much suc-
textual data and support these stakeholders – cess stories when applied to health data that
patients and health consumers, clinicians, and are highly structured (OHDSIPNAS), and
biomedical researchers – in their information there is promise that incorporating learned
needs. representations of language can help as well
While there is valuable information con- (Ghassemi et al. 2014).
veyed in biomedical, clinical, and health texts, For clinicians interacting with an elec-
it is not in a format directly amenable to further tronic health record and treating a particular
processing. These texts are difficult to process patient, NLP can support several points in a
reliably because of the inherent characteris- clinician workflow: when reviewing the patient
244 D. Demner-Fushman et al.
.. Fig. 8.1 Active (initiated by the application) and passive (initiated by the users) decision support applications to
which NLP tools have contributed and have potential to contribute in the future
chart, NLP can be leveraged to aggregate and in VA EHR clinical notes (Lynch et al. 2019)
consolidate information spread across many or Medical Text Indexer that supports index-
notes and reports, and to highlight relevant ers in assigning MeSH terms at the National
facts about the patient. During the decision- Library of Medicine (Mork et al. 2017). New
making and actual care phase, information requirements arise in the domain of consumer
extracted through NLP from the notes can language, due to the changes in consumers’
contribute to the decision support systems in behavior, which in turn, are changing the
the EHR as shown in . Fig. 8.1 (Demner- dynamics of healthcare interaction (7 Chap.
Fushman et al. 2009). Finally, when health 11). NLP is crucial in enabling consumers to
care professionals are documenting patient get to the right information, whether through
information, higher quality notes can be gen- access to clinical information or to informa-
erated with the help of NLP-based methods. tion generated by their peers. NLP can sup-
For quality and administrative purposes, port health consumers and patients looking
NLP can signal potential errors, conflicting for information about a particular disease or
information, or missing documentation in the treatment, by providing better access to rel-
chart. For public health administrators, EHR evant information, targeted to their informa-
patient information can be monitored for syn- tion needs, and to their health literacy levels
dromic surveillance through the analysis of through the analysis of the topics conveyed in
ambulatory notes or chief complaints in the a document as well as the vocabulary used in
emergency room (Hripcsak et al. 2009). the document.
The expectations and requirements for When approaching the above health-
NLP support evolve and grow due to suc- related tasks or use cases for NLP, it is
cesses and demonstrated potential, such as beneficial to align the human tasks and end-
a tool to identify tests for EGFR (epidermal goals with the tasks that NLP has to per-
growth factor receptor) mutations deployed form. For example, giving a clinician a full
Natural Language Processing for Health-Related Texts
245 8
and concise description of a patient’s course 8.2 LP Applications and Their
N
relevant to the patient’s current state is the Context
task of summarization. The high-level NLP
tasks or applications described later in this In this section, we describe specific hNLP
chapter include, but are not limited to, ques- applications that have been and continue to
tion answering (QA), summarization, text prove being useful to biomedical, clinical, and
labeling and text generation. These tasks are health stakeholders in their information needs
bringing the field closer to the ultimate goal (7 Sect. 8.2.1). We then abstract away from
of text understanding, which encompasses them and emphasize the essential role that
not only direct understanding of textual context in which these applications exist play
information, but also the author’s attitude, (7 Sect. 8.2.2).
such as sentiment polarity and modality at
the document level, and temporal reasoning
at a document and document sequence levels 8.2.1 NLP Applications
(7 Chap. 23).
Across all these use cases of health NLP Natural language processing has a wide range
(hNLP), the techniques of natural language of potential applications in the biomedical
processing provide a means to bridge the and health domains. The following are impor-
gap between unstructured text and data by tant applications of NLP technology for bio-
transforming the text to data in a computable medicine and health:
format, allowing humans to interact using 55 Information extraction locates and struc-
familiar natural language, while enabling tures specific information in text, some-
computer applications to process data effec- times by performing a complete linguistic
tively and to provide users with easy access analysis of the text, but more frequently,
and synthesis of the raw textual information. by looking for patterns in the text. This is
This chapter is organized with two types the most common application in biomedi-
of readers in mind: students and researchers cine. It is also one of the earliest: In the
looking for a broad introduction to health 1970s, the Linguistic String Project (LSP)
NLP prior to delving into this active field under the leadership of Dr. Naomi Sager,
of research, and informatics practitioners a pioneer in NLP, developed a comprehen-
looking to use hNLP technology for specific sive computer grammar and parser of
tasks or types of text. 7 Section 8.2 presents English (Grishman et al. 1973; Sager
a more in-depth description of hNLP appli- 1981), and also began work in NLP of
cations and emphasizes the critical role that clinical reports (Sager 1972, 1978; Sager
the context in which these applications are et al. 1987) that continued into the 1990s.
deployed plays when developing hNLP solu- Other clinical and biomedical NLP sys-
tions. In 7 Sect. 8.3 we establish the basic tems followed, e.g., MedLee (Medical
computational tasks involved in most hNLP Language Extraction and Encoding
applications. 7 Section 8.4 is concerned with System) has been used successfully pri-
the different linguistic knowledge resources marily for clinical information extraction,
and types of linguistic representations that but also adapted to literature processing
can enable and facilitate these basic NLP (Friedman et al. 1994; Hripcsak et al.
tasks. 7 Section 8.5 provides further practi- 1995; Friedman 2000). Other systems that
cal considerations for users of hNLP tech- have been successfully used for extraction
nology, while 7 Sect. 8.6 provides further of information from clinical notes and the
research considerations for hNLP research, literature include MetaMap (Aronson and
including evaluation methodology. Finally, Lang 2010), MetaMap Lite (Demner-
7 Section 8.7 briefly outlines future direc- Fushman et al. 2017) and SemRep
tions of hNLP. (Kilicoglu et al. 2012). The latter three sys-
246 D. Demner-Fushman et al.
be viewed as a post-processing of IR and tion on the Web, but need support because
QA: the relevant passages corresponding their health literacy levels do not match
to an input question are further processed the ones of the documents they read
into a single, coherent text. Several steps (Elhadad 2006; Keselman et al. 2007).
are involved in the summarization process:
content selection (identifying salient pieces Finally, sentiment analysis and emotion detec-
of information in the input document(s)), tion belong to the general task of automated
content organization (identifying redun- content analysis (Zunic et al. 2020).
dancy and contradictions among the
selected pieces of information, and order-
ing them so the resulting summary is 8.2.2 Context for NLP Applications
coherent), and content re-generation (pro-
ducing a text output from the organized Understanding the context and intent of the
pieces of information). Text summariza- speaker on meaning (pragmatics) is crucial
tion in the biomedical domain has focused to performing many NLP tasks correctly.
on the literature (Elhadad et al. 2005; Although the context of health-related NLP
Zhang et al. 2011), with some forays into applications is broad and varied, it is bounded
summarization of clinical text and Web by specific tasks and environments that can
8 resources (Pivovarov and Elhadad 2015; be relatively easily enumerated and therefore
Mane et al. 2015). taken into account during processing. It is
55 Other tasks: defined by those who produce the text, by the
Text classification (labeling) involves purposes for which the text is produced, and
categorizing text into known types. by the intended readers. For example, clini-
Sentences in scientific articles can be clas- cians can write a patient status report for their
sified as key sentences indicative of the colleagues, or a simplified summary for the
article’s content (Ruch et al. 2007). patient. Clinical researchers can describe the
Sections of a clinical note can be labeled, arms of a clinical trial in a scientific publica-
as, e.g., social or family history (Denny tion or describe the inclusion criteria of the
et al. 2008). At the document level, texts same trial in a simpler language for recruit-
can be classified into different genres, etc. ment and patient education purposes. The
Related to classification is clustering, style of communication is often determined by
which involves grouping texts based on cultural conventions and ecosystems in which
some intrinsic similarity without knowing these texts are written and read, and inference
a priori what these similar properties are. is needed for correct interpretation and gen-
Text generation formulates natural lan- eration of language. Powerful context models
guage sentences from a given source of are missing in the open domain (Bunt 2017)
information, which is not directly readable but can be approximated through the seman-
by humans. Generation can be used to cre- tic lexicon and rules about the discourse of
ate a text from a structured database, such a text in the biomedical domain. Biomedical
as summarizing trends and patterns in sublanguages are easier to interpret than
laboratory data (Hüske-Kraus 2003). general languages because they exhibit more
Machine translation converts text in restrictive semantic patterns that can be repre-
one language (e.g. English) into another sented more easily (Harris et al. 1989; Harris
(e.g. Spanish). These applications are 1991; Sager et al. 1987). Sublanguages tend to
important in multilingual environments in have a relatively small number of well-defined
which human translation is too expensive semantic types (e.g. medication, gene, disease,
or time consuming (Deléger et al. 2009a). body part, or organism) and a small number
Text readability assessment and simpli- of semantic patterns (e.g. medication-treats-
fication is becoming relevant to the health disease, gene-interacts with-gene).
domain, as patients and health consumers The fact that texts belong to a particular
access more and more medical informa- domain, be it clinical, biological or related
Natural Language Processing for Health-Related Texts
249 8
to health-consumers, allows us to capture approaches. An evaluation of the application
domain-specific characteristics in the lexi- then helps determine which design was the
con, the grammar, and the discourse struc- more appropriate. There is not always a single
ture. Thus, the more specific the domain of a solution to the application casting step. For
text, the more knowledge can be encoded to instance, take the task of identifying within a
help its processing, but then the NLP system large pool of patient records the notes with
would be extremely limited and specialized. documented heart attack in the past year.
For instance, in the domain of online patient We can cast this application as a note label-
discourse, patients discussing breast cancer ing task: given a note, label it according to a
among their peers online rely on a very dif- binary label: documentation present or absent.
ferent set of terms than caregivers of children In this case, the application will retrieve notes
on the autism spectrum. One can develop a of patients that are labeled as a whole as likely
lexicon for each subdomain, online breast to contain documentation of heart attack
cancer patients and online autism caregivers. within the past year or not. To achieve the
But maintaining separate lexicons can be inef- task, we will then need to compile a training
ficient and error prone, since there can be a set of notes and their gold-standard labels
significant amount of overlap among terms and train a document-level classifier. An alter-
across subdomains. Conversely, if a single native approach is to cast the same applica-
lexicon is developed for all subdomains, ambi- tion as an information extraction or template
guity can increase as terms can have different filling task. There, the template would contain
meanings in different subdomains. For exam- for instance the location in the note that men-
ple, in the emergency medicine domain shock tions the condition heart attack, along with
will more likely refer to a procedure used for the temporal expression documenting the
resuscitating a patient, or to a critical condi- time at which the heart attacks occurred in
tion brought about by a drop in blood flow, the patient. In this case, all heart attacks and
whereas in psychiatry notes it will more likely temporal aspects will be extracted in our pool
denote an emotional condition or occasionally of patient notes, and only the ones that sat-
electric shock therapy. Deciding on whether isfy our temporal constraint will be kept. Both
to model a domain as a whole or to focus on tasks will achieve the same goal (identify the
its subdomains independently of each other is patients who have a documented heart attack
a tradeoff. Careful determination of the use in the past year), but will do so in different
cases of a system can help determine the best ways and with slightly different outputs: the
choice for the system. labeling approach will not be able to provide
data provenance, but might be more feasible
because it is easier to label a document as a
whole than it is to annotate templates and
8.3 Basic Computational extract information as shown in . Fig. 8.3.
NLP Tasks In this section, we review the basic tasks
that are most common when processing
The different applications we reviewed in the health-related texts and in biomedical and
previous sections have in common the need to health applications. These tasks do not con-
process text, but they differ widely in the types stitute a pipeline, but rather a portfolio of
of input and output they produce. Tasked the tasks one can rely on depending on their
with an application that entails text process- application and goals. They each take specific
ing, one can think in these terms: is the goal to inputs (sometimes a collection of documents
cluster, label, extract, generate or a combina- as in topic modeling, or a sequence of words
tion of these? And what is my input: a corpus, as in sequence labeling) and produce specific
a document, a fragment of text, or a sequence outputs (groups of words in topic modeling,
of words? The step of casting an application one label in document labeling, triples in rela-
into one or a set of NLP tasks is important tion extraction, or complex frames in event
in determining the choice and design of the extraction, as shown in . Fig. 8.3).
250 D. Demner-Fushman et al.
.. Fig. 8.3 Two different approaches to identifying patients who suffered a heart attack in the past year. The docu-
ment labeling/classification approach and the template filling/information extraction approach
.. Fig. 8.4 Topic modeling is an unsupervised basic over words in the corpus, and an inference mechanism
computational tasks in hNLP. Given a corpus, two types to assign topic assignments to any new document
of output are generated: topics defined as distributions
In our example, we see that our documents some are already discussed in 7 Sect. 8.2, but
are a mixture of articles in the fields of clini- we list here a few examples:
cal, public health, informatics, and biological 55 Automated coding of discharge summary
domains. Themes that emerge from the analy- according to diagnostic codes. This is an
sis include obesity (topic 1), microbiology example of text labeling task where the
(topic 19), and electronic health records (topic input is an entire document – a discharge
4). Because the topic modeling task is unsu- summary, the clinical note authored at the
pervised, the discovered topics do not always end of a hospital admission by a physi-
depict actual themes, but can sometimes elicit cian – and the output is a set of diagnostic
groups of words representative of the genre codes, where the codes are chosen from a
of texts analyzed. Topic 3 in our example is taxonomy, e.g., ICD-10. This task is typi-
such a topic, where the most highly ranked cally carried out in hospital for billing and
words are common to the genre of biomedi- administrative goals, and the tasks are
cal studies. done in a semi-automated fashion with
Beyond utility of topic modeling as explor- coders, professionals trained to select the
atory technique for large corpora, it is a clus- appropriate codes (Resnik et al. 2006).
tering technique which can be found useful 55 Sentiment analysis of hospital reviews
in many other applications. For instance, in written by health consumers is another
a machine learning application that leverages example of text labeling task (Greaves
both text and other types of data. et al. 2013). There, a short text written in
lay language is labeled according to a sin-
gle label – its polarity or sentiment. In fact,
8.3.2 Text Labeling it might even be a fragment of the text at a
time that gets labeled according to its
In Text Labeling tasks, the input is text, either sentiment.
in its entirety or a fragment, and the output is
a label or a set of labels. Examples of appli- Most approaches to this task use classification
cations in healthcare, consumer health, and techniques from the field of machine learning
biomedicine that use text labeling abound and and thus require a training set of text inputs
252 D. Demner-Fushman et al.
INPUT SENTENCE/LINE
exam for metastatic disease
TEXT SEGMENTATION
History of History O
TERM
Hodgkin’s of O
NORMALIZATION RESULTS
Iymphoma Hodgkin’s B
Iymphoma I
8 INPUT TEXT RESULTS
.. Fig. 8.5 Two approaches to named entity recogni- − C0582103: exam: Medical Examination : [hlca]: 0 : 4
static disease, Impression: 1.5 cm nodule in the left mid- : [neop]: 9 : 18 : 0.0
lung zone may contain calcium.” MetaMap will output
the following:
and their associated labels, although rule- ognition (NER). NER is a classic task of gen-
based approaches are also a possibility, e.g., eral NLP and in the health domain. The task
in labeling sections of clinical notes (Denny consists of identifying within a text the dif-
et al. 2008). Which features to include and ferent mentions of different types of entities
how to represent them depends on the task as shown in . Fig. 8.5. The task is difficult
at hand. For instance, in automated diagnosis because we don’t know in advance how many
coding, word-level representations described words comprise the entity, e.g., lymphoma is
in 7 Sect. 8.4 have been found helpful, as well a single word, Hodgkin’s lymphoma is a two-
as simple bag-of-words approaches. word phrase, but many other terms include
these entities:
55 non-Hodgkin’s lymphoma of lung,
8.3.3 Sequence Labeling 55 Hodgkin’s lymphoma of lung (without the
non), and
The task of sequence labeling is a specific case 55 non-Hodgkin’s lymphoma.
of text labeling. In sequence labeling, we are
keeping track of the order in which textual units Each one of these phrases is a different term,
occur (be it words or text fragments), and the and sometimes nested recognition might be
approaches work better at labeling the entire needed, e.g., if a clinician is looking for all
sequence jointly. Here are two examples of patients with lymphoma grouped by the type
sequence labeling tasks in health-related texts. and location of the lesion, we will need to
We focus on a traditional example for identify the full names, as well as the nested
sequence labeling, namely, named entity rec- mentions of lymphoma. Sequence labeling is
Natural Language Processing for Health-Related Texts
253 8
also difficult because terms are often ambig- resources created for BioCreative evaluations
uous with respect to the type of entity they (Krallinger et al. 2008). In the clinical domain,
instantiate. The context in which the term relations of interest are those between medical
occurs might help identify the correct entity problems and treatments, problems and tests,
type and link the entity to its class or an problems and genes, and genes and treatments
appropriate identifier in a terminology. For (the latter two becoming increasingly impor-
instance, the term “ca” in itself in the clinical tant due to interest in translational research
text is ambiguous towards two frequent entity and precision medicine). The resources for
types: (as in calcium) measurement or the extraction of some of these relations are avail-
neoplasm UMLS semantic type (as in cancer). able due to the i2b2 relation extraction chal-
The output provides UMLS identifiers, lenge (Uzuner et al. 2011). Another source of
positional information of the terms in the text, literature partially annotated for entities and
and their semantic types. Note that in addi- relations is the pharmacogenomics knowledge
tion to ambiguity of Ca that will not be easy base (PharmGKB) (Barbarino et al. 2018).
to disambiguate to cancer because the note Among other activities, PharmGKB provides
also contains the term calcium, Exam is also literature annotated for genetic variants and
ambiguous due to the different semantic types gene-drug-disease relationships and anno-
assigned to physical exam. Also note the fine- tates associations between genetic variants
grained differences in normalizing metastatic and drugs, and drug pathways.
disease. The output of the machine learning Initially, the existence of relations between
model trained to label disease name sequences entities was assumed if the entities co-occurred
will identify Hodgkin’s lymphoma as disease. more frequently than by chance in a unit of
Like in text labeling, sequence labeling text, such as a MEDLINE abstract or a sen-
rely on supervised approaches. Training data tence. Mutual information, chi-square and
with sequences fully annotated are required log-likelihood ratio were often used to extract
as training examples, see . Fig. 8.5. While co-occurrence-based relations (Hakenberg
probabilistic approaches like Hidden Markov et al. 2012). This approach has two draw-
Models and Conditional Random Fields backs: (1) the results are often noisy and (2)
were historically used for this task, neural the nature of the relations is undefined.
architectures have shown impressive results Both the knowledge-based and statistical
in sequence learning. Recurrent neural net- approaches are used to extract specific rela-
works (LSTM, bi-LSTM, GRUs) in particu- tions. Knowledge-based approaches often
lar encode in their architecture the ability to involve lexical-semantic or syntactic-semantic
handle almost arbitrarily long sequences and patterns. For example, to extract a relation-
keep track of the relevant words within them ship between a complication of a patient’s
(whether long-term dependency or short-term health condition and its cause, we can define a
dependency) (Habibi et al. 2017). “Complications of ” relation. The expressions:
“status post”, “secondary to”, and others
indicate the presence of the “Complications
8.3.4 Relation Extraction of ” relation. These expressions can be com-
bined with semantic categories to form pat-
Similarly to NER, relation extraction can be terns for a rule-based system. For example,
decomposed into relation detection and deter- the “[concept Problem][s/p][concept any|word
mination of the relation type. Until recently, noun]” pattern, in which “s/p” represent the
research in biomedical relation extraction was set of the indicator expressions. A rule-based
limited to protein-protein interactions and system that extracts “treats” relations can
gene binding, primarily due to availability of include the following rule:
where treatment indicators are stored in a . Fig. 8.6, exemplify relatively simple events:
gazetteer and Procedure and Problem are a medication event involves a drug, its form,
identified by one of the NER tools presented dosage, administration route, duration, and
above. As common for the rule-based meth- indications.
ods, this approach has relatively low recall The complex biomolecular events are usu-
and is potentially brittle. The currently better- ally more involved. For example, consider the
performing systems use supervised machine phrase “SYK-TLR4 binding increases upon
learning. Most supervised machine learning TLR4 dimerization and phosphorylation”
methods assume that the entities are identified (PMID: 22776094) that introduces a complex
by a NER tool and require positive examples positive regulation event in which binding of
in which the relations and entities are anno- spleen tyrosine kinase (SYK) to the toll-like
tated and negative examples in which there receptor-4 (TLR4) is regulated by two simple
are no relations between annotated entities. events: TLR4 dimerization and phosphoryla-
Given these examples, classifiers (Support tion. Secondary arguments of the events may
Vector Machines in the near past, and cur- provide additional information about the
rently Deep Learning frameworks (Peng et al. event, such as the specific domain or region
2018)) are trained to determine if a specific of the theme of the event. For example, in
relationship between the candidate entities
8 exists. If several relations are possible between
“binding of SYK to the cytoplasmic domain
of toll- like receptor-4 (TLR4)”, cytoplasmic
the two entities, for example, drug causing a domain is the secondary argument associated
problem or drug treating a problem, a two- with the TLR4 theme of the binding event.
step approach can be applied: first determin- Many systems for event recognition are
ing if a relation exists; and then determining built as pipelines that start with recognizing
the type of the relation. As with other clas- protein names. For example, one approach to
sification tasks, feature selection is one of extracting biomedical events on PubMed scale
the factors determining the success of the (Björne et al. 2010) is to use publicly available
method. In addition to the standard “bag of NER tools as the first step in extraction of
words” in the windows preceding, following, protein events from PubMed abstracts. In the
and between the two concepts, features often subsequent steps, the extraction pipelines rely
used in relation extraction are: the semantic on the graph representations of sentence syn-
types of the concepts; the distance between tax and semantics, which we will present in the
the concepts, parts of speech, paths to the next section. The syntactic graph generated by
root of the parse tree and dependency rela- a dependency parser and the identified named
tions between the concepts. entities are used to generate a semantic graph
for each sentence independently. The system
uses the graphs as the source of features to
8.3.5 Template Filling supervised machine learning for event trigger
and edge detection, which are followed by the
Often, n-ary relations, e.g., the size, the loca- rule-based event construction step.
tion and the borders of a lesion, are of inter-
est. Capturing such information requires
event (frame) extraction and is viewed as tem-
plate filling task. 8.4 Linguistic Knowledge
Biomedical events involve a change in the and Representations
state of biomedical objects. Examples of the
events are gene expression, protein binding It is important to know the principles of lin-
and regulation in the biological domain and guistics and computational linguistics that,
medication or phenotype events in the clini- when incorporated in computational tasks,
cal domain. Events usually involve multiple help achieve better results. Processing of lan-
entities and relations between the entities and guage is not as simple as applying a pipeline
can be nested. Medication events, as shown in of independent modules: one to determine
Administration4 Administration3
Administration2 B-Medication B-Strength I-Strength B-Frequency I-Frequency
Dose Strength Administration Medication Dose
CRF
A dose of 3 mg of WR-2721 per mouse (167 mg/kg,
Contextural
Administration3 representation
Route
Glove +
intraperitoneally) was given 30 min before irradiation, Character
embeddings
Administration3 .....ritonavir 100 mg once daily....
Natural Language Processing for Health-Related Texts
Administration4
Administration Administration2
Strength Medication Dose
and 30 Mg of PGN per mouse (1.7 mg/kg) was
Administration3
Administration4
Form Route
r i t o n a v i r
injected intraperitoneally 24 h after 10 Gy irradiation.
.. Fig. 8.6 Annotation of a medication event on the left using brat annotation tool using Bi-directional LSTM and concatenated with word embeddings. These word rep-
(Stenetorp et al. 2012) and on the right, a Long Short-Term Memory (LSTM) neural resentations feed into another Bi-LSTM layer to extract contextual representations of
network with a Conditional Random Field (CRF) layer using character embeddings the words. The contextual representations are fed to the CRF layer to decode the best
trained to extract medication frames. Character level word representations are extracted label sequence
8 255
256 D. Demner-Fushman et al.
tokens, one to assign part-of-speech tags to Lexicon) – can be used as a knowledge base
tokens and to parse the syntax, one to inter- and resource for NLP tasks. MedDRA and
pret the meaning of a sentence, and one to RxNorm, two of the over 130 sources contrib-
resolve the discourse-level characteristics of uting to the UMLS, are examples of terminolo-
the text. In reality, all linguistic levels influ- gies specific to adverse events and medications,
ence each other. Low-level decisions about respectively. They are particularly helpful in
how to tokenize a string impact named-entity the clinical domain, in biosurveillance, phar-
recognition; determining which sense to attri- macovigilance and in pharmacogenomics. The
bute to a named entity depends on its place UMLS Metathesaurus is organized by con-
in the syntactic tree, the pragmatics of the cept. It preserves the meaning and structure
text, and its place in the discourse structure. of the contributing sources and links alterna-
How to model these interactions is one of tive surface representations of a concept in
the primary open research questions of natu- many languages. It also establishes relation-
ral language processing, which is currently ships between concepts. All concepts in the
addressed by modeling the tasks jointly, e.g., Metathesaurus are assigned to at least one
using deep learning approaches (James et al. Semantic Type from the Semantic Network.
2013). Although language processing is not a Most English strings in the Metathesaurus
simple pipeline, the practical applications still also appear in the SPECIALIST Lexicon.
8 often approximate the process as one, and lin- The Specialist lexicon provides detailed syn-
guistic knowledge contributes to performing tactic knowledge for words and phrases and
the basic tasks and modeling the interactions. includes a comprehensive medical vocabulary.
Linguistic levels consist of word-level It also provides a set of tools to assist in NLP,
representations (tokens and morphology; e.g., a lexical variant generator.
sentence-level representations (syntax and Although ontologies and lexicons are
semantics); and document-level representa- maintained and regularly updated, the lan-
tions (pragmatics and discourse). Linguistic guage changes, new concepts enter the lan-
knowledge is captured in lexicons, domain guage, and terms fall out of use and become
knowledge, e.g., lists of diseases and drugs obsolete (although data represented with
can be found in terminologies, and domain the terms may persist). The biomedical and
semantics, such as relations among diseases health domains are highly dynamic in the
and drugs comprises ontologies. influx of new terms (e.g. new drug names,
but also sometimes new disease names, like
COVID-19, SARS and H1N1). Modern cor-
8.4.1 Terminological pus-based approaches compensate for that
and Ontological Knowledge lag leveraging the existing knowledge. For
example, all diseases in the UMLS can be
In the biomedical and clinical domains, inter- marked in PubMed and the representation of
preting text might require extensive back- disease can be learned as described in the sec-
ground knowledge, as many facts are implied, tion on word embeddings. A new disease such
e.g., inferring that a patient is likely hyper- as COVID-19 can then be recognized as such
tensive or has an edema if loop diuretics are using its context and the learned representa-
prescribed, even though there are no explicit tion of Disease.
mentions of either high blood pressure or
swelling in the text. Ontologies contain some
of the background knowledge, and some can 8.4.2 Word-Level Representations
be learned from text.
The Unified Medical Language System zz Tokens
(UMLS), including the Metathesaurus, Tokens are basic language units defined based
Semantic Network, and the Specialist on their utility for solving a specific language
Natural Language Processing for Health-Related Texts
257 8
processing task. The units include mor- ►►Example 8.2 Rückschmerzen nach Brustwir
phemes, words (often morpheme sequences), belbruch
numbers, symbols (e.g. mathematical opera- Ich habe oft Schmerzen im Rücken und in
tors), and punctuation. The notion of what der Schulter. [Back pain after thoracic frac-
constitutes a token is far from trivial. The pri- ture: I often have pain in the back and in the
mary indication of a token in general English shoulder]. ◄
is the occurrence of white space before and
after it; however there are many exceptions: In this example blog post, Rückschmerzen in
a token may be followed by certain punctua- the title needs to be split into Rück[en] (back)
tion marks without an intervening space, such and Schmerzen (pains). In addition, to identify
as by a period, comma, semicolon, or ques- the reason for back pain, information about
tion mark, or may have a “–” in the middle, as the fracture (bruch) has to be separated from
shown in 7 Example 8.1: its location (thoracic spine – Brust – wirbel.)
Some thought should be given to upper
►►Example 8.1 and lower case in the original text. In some
… a decoy receptor for IL-2 in the T cell … situations, it makes sense to keep all tokens
… IL 2-regulated genes … in lowercase, as it reduces variations in
… Il2 and Csf2 are increased as T-cells … ◄ vocabulary. But in others, it might hinder
further NLP: in both the biology and clini-
The three snippets in 7 Example 8.1 contain cal domains, there are many acronyms, which
three different spellings for IL2. Ideally, we when lowercased, might be confused with reg-
would like to link all three surface forms to ular words, such as CAT (computerized axial
interleukin-2. To achieve this normalization, tomography) and cat or FISH (fluorescent in
we need a tokenizer that will treat the dash situ hybridization) and fish.
and the white space equally and segment IL2 Morphology concerns the combination of
into IL and 2 (or merge IL and 2 in the first morphemes (roots, prefixes, suffixes) to pro-
two example snippets.) duce words or lexemes, where a lexeme gener-
In biomedicine, periods and other punc- ally constitutes several forms of the same word
tuation marks can be part of words (e.g., p.o. (e.g. activate, activates, activating, activated,
means per os (by mouth; orally) in the clinical activation). There has been little work con-
domain, and M03F4.2A, is a gene name that cerning morphology in the field of NLP in the
includes a period.) Moreover, punctuation biomedicine and health domains, especially
marks are used inconsistently, thereby com- for the English language. In other languages
plicating the tokenization process: In clinical that are morphologically rich (e.g., Turkish,
text, it is common to abbreviate “discontinue” German, and Hebrew), encoding morpho-
as d/c, without as w/o, but it is also common logical knowledge is necessary. For example,
to write s/p for status post, and, finally, use morphological proximity can identify impor-
slashes in measurements and units. In addi- tant terminological relations (Claveau and
tion, chemical and biological names often L’Homme 2005) or generate definitions of
include parentheses, commas and hyphens, medical terms (Deléger et al. 2009b).
for example (w)adh-2, which also compli-
cate the tokenization process. For example, zz Word Embeddings
replacing non-alphanumeric characters with Word embedding is a feature learning tech-
spaces will prevent us from correctly identify- niques in NLP in which words or phrases from
ing entities in the following sentence: “PBMC the vocabulary are numeric vectors such that
of HLA-DR3(+) but not HLA-DR3(−) cured similar words will have similar vectors. Word
TB patients.” For agglutinative languages, embeddings have two functions: they capture
tokenization needs to be augmented by seg- the meaning of a word using its context and,
mentation as shown in 7 Example 8.2. at the same time, condense the representation
258 D. Demner-Fushman et al.
of the word into a vector, e.g., a vocabulary of posting content online, misspellings, typos,
thousands of words can be compressed into a and non-standard abbreviations are pervasive
300-dimensional vector. The idea of “recog- like in the rest of the social Web. Ignoring
nizing a word by the company it keeps” origi- these variations may cause an NLP system to
nated from Firth (1957). One of the earliest lose or misinterpret information. At the same
corpus-based implementations, Brown clus- time, errors can be introduced when correct-
tering, aggregated words into classes using ing the typos automatically. For instance, it
hierarchical clustering (Turian et al. 2010), is not trivial to correct hypetension automati-
and now deep learning provides more robust cally without additional knowledge because
approaches to pre-computing representations it may refer to hypertension or hypotension.
of words using large corpora, e.g., 2.5 bil- This type of error is troublesome not only
lion Wiki words. The growing family of text for automated systems, but also for clinicians
embeddings and pre-trained language mod- when reading a note, as this phenomenon is
els started with Word2Vec (Mikolov et al. aggravated by the large amount of short, mis-
2013), and now includes GloVe (Pennington spelled words in notes. In the clinical domain,
et al. 2014), fastText (Bojanowski et al. 2016), misspellings can be found even in the defini-
ELMo (Peters et al. 2018), BERT (Devlin tions of clinical variables.
et al. 2018), GPT (Radford et al. 2019), and
8 BART (Lewis et al. 2019), to name a few. The
language models, such as BERT, have been
shown to be open to fine-tuning for specific 8.4.3 Sentence-Level
NLP tasks. Domain-specific embeddings and Representations
models trained on PubMed and clinical text
also exist, and have been shown superior to zz Sentence Boundary Detection
those pre-trained on the open-domain text. Detecting the beginning and end of a sentence
BioBert, for example, significantly advanced may seem like an easy task, but it is highly
the state-of-the-art for biomedical named domain dependent. Not all sentences end
entity recognition, relation extraction, and with a punctuation mark (this is especially
question answering (Lee et al. 2019). true in texts with minimal editing, such as
online patient posts and clinical notes entered
zz Spelling Variants and Errors by physicians). Sentences in scientific publica-
In addition to common American English tions are usually well-formed and delimited by
and British spelling variants, such as -or/- final punctuation, primarily a period. Some
our (e.g., color/colour), e/ae, e/oe, er/-re (e.g., care has to be taken to avoid breaking up sen-
liter/litre), and -ize/-ise, generic drug names tences on periods used in abbreviations (e.g.,
may also differ (e.g., adrenaline (British) vs. vs.) and in honorifics, chemical names, and
epinephrine). The complex origins and spell- decimal numbers (as discussed in tokeniza-
ing of the biomedical terms often lead to tion). In most cases an off-the-shelf sentence
misspellings. Misspellings in the published tokenizer is expected to be highly accurate.
literature are relatively rare, but the queries The informal biomedical text is much
submitted to the search engines are often harder to split into meaningful utterances.
misspelled. Clinical notes and informal com- Clinical notes often contain table-like struc-
munications also often contain misspelled tures and lists. Moreover, some electronic
terms. Clinicians, when typing free text in the health records might enforce a certain line
EHR, do so under time pressure and gener- length, which could be violated by, for exam-
ally do not have the time to proofread their ple, a de-identification tool. Therefore, cus-
notes carefully. In addition, they frequently tom solutions might be needed to detect end
use abbreviations (e.g. HF for Hispanic female of sentences in these texts.
or heart failure, 2/2 for secondary to or a date), Syntax concerns the categorization of the
many of which are non-standard and ambigu- words in the language, and the structure of the
ous. For patients and health consumers, when phrases and sentences. Each word belongs to
Natural Language Processing for Health-Related Texts
259 8
one or more parts of speech in the language, S
such as noun (e.g. chest), adjective (e.g. mild), VP
or tensed verb (e.g. improves) Lexemes can NP
consist of more than one word as in foreign PP
phrases (ad hoc), prepositions (along with),
and idioms (follow up, on and off). Lexemes NP NP
combine in well-defined ways and according
to their parts of speech, to form sequences of DT NN VBD NN IN JJ NNS
words or phrases, such as noun phrases (e.g.
The patient had pain in lower extremities
severe chest pain), adjectival phrases (e.g.
NMOD SUBJ OBJ NMOD PMOD
painful to touch), or verb phrases (e.g. has
NMOD
increased). Each phrase generally consists of a
main part of speech and modifiers, e.g. nouns DT NN VBD NN IN JJ NNS
are frequently modified by adjectives, while
verbs are frequently modified by adverbs. The .. Fig. 8.7 A syntactic parse tree for the sentence “The
phrases then combine in well-defined ways to patient had pain in lower extremities” according to the
form sentences (e.g. “he complained of severe context-free grammar is shown above the sentence.
Notice that the terminal nodes in the tree correspond to
chest pain” is a well-formed sentence, but
the syntactic categories of the words in the sentence. The
“pneumococcal vaccine how often?” is not). parse tree below the sentence, is in a dependency gram-
General English imposes many restric- mar framework
tions on the formation of sentences, e.g. every
sentence requires a verb, and count nouns basically directed relations between words.
(like cough) require an article (e.g., a or the). For example, in the sentence, “The patient
Clinical language, in contrast, is often tele- had pain in lower extremities”, the head of
graphic, relaxing many of these restrictions the sentence is the verb “had”, which has two
of the general language to achieve a highly arguments, a subject noun “patient” and an
compact form. For example, clinical language object noun “extremities”, that modifies or is
allows all of the following as sentences: the dependent on “patient”, while “in” is depen-
cough worsened, cough worsening, and cough. dent on “pain”, “extremities” is dependent on
Because the community widely uses and “in”, and “lower” is dependent on “extremi-
accepts these alternate forms, they are not ties”. As such, in a dependency grammar, the
considered ungrammatical, but constitute a relations among words and the concept of
sublanguage (Friedman et al. 2004). head in particular (e.g. “extremities” is the
head of “lower”) is closer to the semantics of
zz Representation of Syntactic Knowledge a sentence.
Phrases and sentences can be represented as
a sequence where each word is coupled with zz Semantics
its corresponding part of speech as shown in Semantics concerns the meaning or interpre-
. Fig. 8.7. For example, Severe joint pain can tation of words, phrases and sentences, gen-
be represented as Severe/adjective joint/noun erally associated with real-world applications.
pain/noun. Formalisms that can be used There are many different theories for repre-
to represent syntactic linguistic knowledge sentation of meaning, such as logic-based
include probabilistic context free grammars (e.g., first order logic and lambda calculus),
(Jurafsky and Martin 2019), which, along frame-based, conceptual graph formalisms,
with dependency formalisms are widely used and distributional semantics, i.e., vector rep-
in language processing. resentations learned from data (Jurafsky and
Dependency is a binary asymmetrical rela- Martin 2019).
tion between a head and its dependents or Each word has one or more meanings –
modifiers. The head of a sentence is usually a word senses (e.g. resistance, as in psycho-
tensed verb. Thus dependency structures are logical resistance, social resistance, multidrug
260 D. Demner-Fushman et al.
resistance, and capillary resistance), and other not critical. For example, slightly improved
terms may modify the senses (e.g. no, as in may not be clinically different from improved.
no fever, or last week as in fever last week). Since this type of information is fuzzy and
Recognizing polysemy or ambiguity of the imprecise, the loss of information may not be
word is important. Complementary to the significant. However, the loss of a negation
phenomenon of polysemy, there are often modifier would be significant. Another such
terms that are different variations of the same example concerns hedging, which frequently
concepts (synonymy). For instance, the term occurs in radiology reports as well as in the
blood sugar is often used by health consum- scientific articles. Implementing a semantic
ers to refer to a glucose measurement, but it is grammar would require a large corpus that
used rarely if ever in the clinical literature or has been annotated with both syntactic and
in clinical notes. semantic information. Since a semantic gram-
Additionally, the meanings of the words mar is domain and/or application specific,
combine to form a meaningful sentence, as in annotation involving the phrase structure
“there was thickening in the renal capsule”). would be costly and not portable, and there-
Representation of the semantics of general fore is not generally done.
language is extremely important, but the
underlying concepts are not as clear or uni-
8 form as those concerning syntax. Interpreting 8.4.4 Document-Level
the meaning of words and text in general is Representations
very challenging. In biomedical informat-
ics, interpreting the meaning of text focuses, As is the case for text in general, documents
largely, on entity linking (i.e., representing a within the biomedical domain are expected
word or group of words with a unique seman- to have a certain structure. Text or discourse
tic concept from a relatively small number of structure refers to the way in which authors
well-defined semantic types, e.g. medication, organize information within documents. The
gene, disease, body part, or organism.) The organization of biomedical text is reflective of
semantics of phrases and sentences is also both the type of information being conveyed
restricted to a smaller set of patterns than as well as its intended audience. The structure
in general language (e.g. medication-treats- of biomedical text aids in its comprehension
disease, gene-interacts-with-gene). by the reader, and can be utilized in perform-
There are often several ways to express a ing various natural language processing tasks.
particular medical concept as well as numer- The structure of biomedical text can be exam-
ous ways to express modifiers for that concept. ined at a local or global level. The local level
For example, ways to express severity include primarily concerns coherence and cohesion,
faint, mild, borderline, 1+, 3rd degree, severe, aspects of structure that connect text and give
extensive, and moderate. Often, to complicate it meaning, whereas the global level concerns
matters, modifiers can be composed or nested. aspects of the overall organization and rhe-
For instance, in the phrase “no improvement torical structure of a document, such as its
in pneumonia,” improvement is a change sectioning.
modifier that modifies the concept pneumo- The cohesive devices responsible for the
nia, and no is a negation marker that modi- local structure of text play an important role
fies improvement (not pneumonia). Complex in comprehension of biomedical documents.
semantic structures containing nesting can be The recognition of phenomena such as coor-
represented using a semantic grammar, which dinating constructions, anaphora, as well as
is a context free grammar based on seman- ellipsis is needed to represent biomedical text
tic categories. An alternative representation at the document level. In this section, we exem-
would facilitate processing by flattening the plify discourse processing with automated
nesting. In this case, some information may resolution of referential expressions and then
be lost but ideally only information that is discuss pragmatics, which concerns everything
Natural Language Processing for Health-Related Texts
261 8
extraneous to the text that contributes to its of changes in centers. Semantic information
meaning and context. In clinical and health- for resolving referential expressions involves
consumers language processing, the context is consideration of the semantic type of the
sometimes readily accessible and sometimes expression and the way it relates to potential
easily extractable from datasets, e.g., medica- referents (Hahn et al. 1999; Kilicoglu 2016).
tion orders, who wrote the text, at what time,
etc. Interaction patterns can be inferred from
the datasets as well. 8.4.5 Pragmatics
8.4.4.1 Automated Resolution Pragmatics concerns how the intent of the
of Referential Expressions author of the text, or, more generally, the con-
Determining which words or phrases in a text text in which the text is written, influences the
referring to the same entity, called coreference meaning of a sentence or a text. For example,
resolution can draw on both syntactic and in a mammography report “mass” generally
semantic information in the text. denotes breast mass, whereas a radiological
Syntactic information for resolving refer- report of the chest denotes mass in lung. In
ential expressions includes: yet a different genre of texts, like a religious
55 Agreement of syntactic features between journal, it is likely to denote a ceremony.
the referential phrase and potential Similarly, in a health care setting “he drinks
referents heavily” is assumed to be referring to alcohol
55 Recency of potential referents (nearness to and not water. In these two examples, prag-
referential phrase) matics influences the meaning of individual
55 Syntactic position of potential referents words. It can also influence the meaning of
(e.g. subject, direct object, object of larger linguistic units. For instance, when
preposition) physicians document the chief-complaint sec-
55 The pattern of transitions of topics across tion of a note, they list symptoms and signs,
the sentences as reported by the patient. The presence of a
particular symptom, however, does not imply
Syntactic features that aid the resolution that the patient actually has the symptom.
include such distinctions as singular/plural, Rather, it is understood implicitly by both the
animate/inanimate, and subjective/objective/ author of the note and its reader that this is
possessive. For instance, the inanimate pro- the patient’s impression rather than the truth.
noun “it” usually refers to things, but some- Thus, the meaning of the chief-complaint
times does not refer to anything when it section of a note is quite different from the
occurs in cleftconstructions, such as “it was assessment and plan, for instance.
noted”, “it was decided to” and “it seemed Another pragmatic consideration is the
likely that”. interpretation of pronouns and other ref-
Referential expressions are usually very erential expressions (there, tomorrow). For
close to their referents in the text. The syn- example, in the two following sentences “An
tactic position of a potential referent is an infiltrate was noted in right upper lobe. It was
important factor. For example, a referent in patchy”, the pronoun “it” refers to “infiltrate”
the subject position is a more likely candi- and not “lobe”. In a sentence containing the
date than the direct object, which in turn is term “tomorrow”, it would be necessary to
more likely than an object of a preposition. know when the note was written in order to
Centering theory accounts for reference by interpret the actual date denoted by “tomor-
noting how the center (focus of attention) of row”. As mentioned above, in the biomedical
each sentence changes across the discourse domain, pragmatics can be encoded through
(Grosz et al. 1995). In this approach, resolu- the semantic lexicon and rules about the dis-
tion rules attempt to minimize the number course of a text.
262 D. Demner-Fushman et al.
.. Fig. 8.8 Document annotation process for detection of Adverse Drug Reactions
vice versa. The F-measure is a combination and the research considerations for moving
of both measures and can be used to weigh the field forward.
the importance of one measure over the other Although NLP continues to advance
by giving more weight to one. If both mea- towards practical applications and more
sures are equally important, the F measure NLP methods are used in large-scale real-life
is the harmonic mean of the two measures. health information applications, more needs
When reporting the results, an error analysis to be done to make NLP use in biomedical
provides insights into ways to improve a sys- and clinical applications a routine widespread
tem. This process involves determining rea- reality. Some of the applications described
sons for errors in recall and in precision. In in this chapter, are already used in practice,
an extrinsic evaluation, some errors can be e.g., named entity recognition and text label-
due to the NLP system and other errors can ing are used to support MEDLINE index-
be due to the subsequent application com- ers. Some research approaches are already
ponent. Some NLP errors in recall (i.e. false outperforming humans on research datasets
negatives) can be due to failure of the NLP that approximate real-life tasks, for example,
system to tokenize the text correctly, to rec- on reading comprehension tests (SQUAD).
ognize a word, to detect a relevant pattern, or This does not mean, however, that NLP in
to interpret the meaning of a word or a struc- general and health NLP are solved. In addi-
8 ture correctly. Some errors in precision can be tion to improvements in the existing applica-
due to errors in interpreting the meaning of tions, new areas are emerging, and some of
a word or structure or to loss of important the well- known impediments still need to
information. Errors caused by the application be addressed. The impediments include the
component can be due to failure to access the data access challenges, which are partially
extracted information properly or failure of addressed by synthetic data and transfer
the reasoning component. learning; the lack of interoperability and
Understanding the errors is the first step in standards, particularly in the evaluation of
bringing the NLP applications closer to being tools included in the clinical workflows. The
incorporated in a wider range of biomedical emerging areas and areas of active research
and clinical text processing tasks. include, but are not limited to: multi-modal
data integration, interpretability of machine
learning results, understanding machine
8.7 Recapitulation and Future learning models bias, and distributed large-
Directions scale computational models.
Natural language processing to identify pneumonia discourse. Computational Linguistics, 2(21), 203–
from radiology reports. Pharmacoepidemiology and 225.
Drug Safety, 8(22), 834–841. Habibi, M., Weber, L., Neves, M., Wiegandt, D. L., &
Eichstaedt, J. C., Schwartz, H. A., Kern, M. L., Park, Leser, U. (2017). Deep learning with word embed-
G., Labarthe, D. R., Merchant, R. M., et al. (2015). dings improves biomedical named entity recogni-
Psychological language on Twitter predicts county- tion. Bioinformatics, 33(14), i37–i48.
level heart disease mortality. Psychological Science, Hahn, U., Romacker, M., & Schulz, S. (1999). Discourse
26(2), 159–169. https://doi. structures in medical reports – watch out! The gen-
org/10.1177/0956797614557867. eration of referentially coherent and valid text
Elhadad, N. (2006). Comprehending technical texts: knowledge bases in the MEDSYNDIKATE system.
Predicting and defining unfamiliar terms. International Journal of Medical Informatics, 53(1),
Proceedings AMIA Symposium, 239–243. 1–28.
Elhadad, N., Kan, M. Y., Klavans, J. L., & McKeown, Hakenberg, J., Voronov, D., Nguyên, V. H., Liang, S.,
K. R. (2005). Customization in a unified framework Anwar, S., Lumpkin, B., et al. (2012). A SNPshot of
for summarizing medical literature. Artificial PubMed to associate genetic variants with drugs,
Intelligence in Medicine, 33(2), 179–198. diseases, and adverse reactions. Journal of
Evans, D. A., Cimino, J. J., Hersh, J. J., Huff, S. M., & Biomedical Informatics, 45(5), 842–850.
Bell, D. S. (1994). Toward a medical-concept repre- Harris, Z. (1991). A theory of language and information –
sentation language. The Canon Group. Journal of a mathematical approach. New York: Oxford
the American Medical Informatics Association: University Press.
JAMIA, 1(3), 207–217. Harris, Z., Gottfried, M., Ryckman, T., Mattick, P.,
8 Eysenbach, G., & Till, J. E. (2001). Ethical issues in Daladier, A., Harris, T., & Harris, S. (1989). The
qualitative research on internet communities. BMJ, form of information in science – analysis of an immu-
323(7321), 1103–1105. nology sublanguage. Dordrecht: Kluwer Academic.
Firth, J. R. (1957). A synopsis of linguistic theory. In Haug, P. J., Ranum, D. L., & Frederick, P. R. (1990).
Studies in linguistic analysis. Oxford: Blackwell. Computerized extraction of coded findings
Friedman, C. (2000). A broad-coverage natural lan- from free-text radiology reports. Radiology, 174,
guage processing system. American Medical 543–548.
Informatics Association Annual Symposium Haug, P., Koehler, S., Lau, L. M., Wang, P., Rocha, R.,
Proceedings, 2000, 270–274. & Huff, S. (1994). A natural language understand-
Friedman, C., Alderson, P. O., Austin, J., Cimino, J. J., & ing system combining syntactic and semantic tech-
Johnson, S. B. (1994). A general natural language niques. Proceedings of the Annual Symposium on
text processor for clinical radiology. Journal of the Computer Applications in Medical Care, 247–251.
American Medical Informatics Association: JAMIA, Hofmann, T. (1999). Probabilistic latent semantic index-
1(2), 161–174. ing. Proceedings of the Twenty-Second Annual
Friedman, C., Shagina, L., Lussier, Y., & Hripcsak, G. International SIGIR Conference.
(2004). Automated encoding of clinical documents Hripcsak, G., Friedman, C., Alderson, P. O.,
based on natural language processing. Journal of the DuMouchel, W., Johnson, S. B., & Clayton, P. D.
American Medical Informatics Association, 11(5), (1995). Unlocking data from narrative reports: A
392–402. study of natural language processing. Annals of
Ganiz, M. C., Pottenger, W. M., & Janneck, C. D. (2005). Internal Medicine, 122(9), 681–688.
Recent advances in literature based discovery. Hripcsak, G., Soulakis, N. D., Li, L., Morrison, F. P.,
Journal of the American Society for Information Lai, A. M., Friedman, C., et al. (2009). Syndromic
Science and Technology: JASIST (Submitted). surveillance using ambulatory electronic health
Ghassemi, M., Naumann, T., Doshi-Velez, F., Brimmer, records. Journal of the American Medical Informatics
N., Joshi, R., Rumshisky, A., & Szolovits, P. (2014). Association, 16(3), 354–361.
Unfolding physiological state: Mortality modelling Hüske-Kraus, D. (2003). Text generation in clinical
in intensive care units. In Proceedings of the 20th medicine – a review. Methods of Information in
ACM SIGKDD international conference on Medicine, 42(1), 51–60.
Knowledge discovery and data mining (pp. 75–84). Institute of Medicine (US) Committee for Evaluating
New York: ACM. Medical Technologies in Clinical Use. (1985).
Greaves, F., Ramirez-Cano, D., Millett, C., Darzi, A., & Assessing medical technologies. Washington, DC:
Donaldson, L. (2013). Use of sentiment analysis for National Academies Press.
capturing patient experience from free-text com- James, G., Witten, D., Hastie, T., & Tibshirani, R.
ments posted online. Journal of Medical Internet (2013). An introduction to statistical learning: With
Research, 15(11), e239. applications in R (Springer texts in statistics).
Grishman, R., Sager, N., Raze, C., & Bookchin, B. New York: Springer Science+Business Media.
(1973). The linguistic string parser. Proceedings of Keselman, A., Tse, T., Crowell, J., Browne, A., Ngo, L.,
the National Computer Conference, 42, 427–434. & Zeng, Q. (2007). Assessing consumer health
Grosz, B., Joshi, A., & Weinstein, S. (1995). Centering: vocabulary familiarity: An exploratory study.
A framework for modeling the local coherence of Journal of Medical Internet Research, 9(1), e5.
Natural Language Processing for Health-Related Texts
271 8
Kilicoglu, H., & Demner-Fushman, D. (2016). Bio- ensembles of SVM and deep learning models.
SCoRes: A smorgasbord architecture for corefer- Database, 2018, bay073.
ence resolution in biomedical text. PLoS One, 11(3), Pennington, J., Socher, R., & Manning, C. D. (2014,
e0148538. Oct). Glove: Global vectors for word representa-
Kilicoglu, H., Shin, D., Fiszman, M., Rosemblat, G., & tion. Proceedings of the 2014 Conference on
Rindflesch, T. C. (2012). SemMedDB: A PubMed- Empirical Methods in Natural Language Processing
scale repository of biomedical semantic predica- (EMNLP), 1532–1543.
tions. Bioinformatics, 28(23), 3158–3160. Peters, M. P., Neumann, M., Iyyer, M., Gardner, M.,
Krallinger, M., Morgan, A., Smith, L., Leitner, F., Clark, C., Lee, K., & Zettlemoyer, L. (2018). Deep
Tanabe, L., Wilbur, J., et al. (2008). Evaluation of contextualized word representations.
text-mining systems for biology: Overview of the arXiv:1802.05365.
Second BioCreative community challenge. Genome Pivovarov, R., & Elhadad, N. (2015, Sep). Automated
Biology, 9(2), S1. methods for the summarization of electronic health
Lee, J., Yoon, W., Kim, S., Kim, D., Kim, S., So, C. H., records. Journal of the American Medical Informatics
& Kang, J. (2019, Jan 25). BioBERT: Pre-trained Association, 22(5), 938–947. https://doi.org/10.1093/
biomedical language representation model for bio- jamia/ocv032.
medical text mining. arXiv preprint Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., &
arXiv:1901.08746. Sutskever, I. (2019). Language models are unsuper-
Lewis, M., Liu, Y., Goyal, N., Ghazvininejad, M., vised multitask learners. OpenAI Blog, 1(8), 9.
Mohamed, A., Levy, O., et al. (2019, Oct 29). Bart: Resnik, P., Niv, M., Nossal, M., Schnitzer, G., Stoner, J.,
Denoising sequence-to-sequence pre-training for Kapit, A., & Toren, R. (2006). Using intrinsic and
natural language generation, translation, and com- extrinsic metrics to evaluate accuracy and facilita-
prehension. arXiv preprint arXiv:1910.13461. tion in computer-assisted coding. Perspectives in
Lindberg, D. A. B., Humphreys, B. L., & McCray, A. T. Health Information Management Computer Assisted
(1993a). The unified medical language system. Coding Conference Proceedings, Fall, 6 Sept 2006.
Methods of Information in Medicine, 32, 281–291. Roberts, K., & Patra, B. G. (2018). A semantic parsing
Lindberg, D. A., Siegel, E. R., Rapp, B. A., Wallingford, method for mapping clinical questions to logical
K. T., & Wilson, S. R. (1993b). Use of MEDLINE by forms. American Medical Informatics Association
physicians for clinical problem solving. Journal of the Annual Symposium Proceedings, 2017, 1478–1487.
American Medical Association, 269(24), 3124–3129. Roberts, K., Demner-Fushman, D., & Tonning, J. M.
Lynch, J. A., Kelley, M. J., Lee, K. M., Hung, A., Li, Y., (2017). Overview of the TAC 2017 adverse reaction
Hintze, B. J., et al. (2019). An NLP tool to identify extraction from drug labels track. Proceedings of the
molecular diagnostic testing in veterans with stage 2017 Text Analysis Conference, 13 Nov 2017.
IV NSCLC. Journal of Clinical Oncology, 37(27_ Ruch, P., Boyer, C., Chichester, C., Tbahriti, I.,
suppl), 318. https://ascopubs.org/doi/abs/10.1200/ Geissbühler, A., Fabry, P., et al. (2007). Using argu-
JCO.2019.37.27_suppl.318. mentation to extract key sentences from biomedical
Mane, V. L., Panicker, S. S., & Patil, V. B. (2015, Jan 8). abstracts. International Journal of Medical
Summarization and sentiment analysis from user Informatics, 76(2–3), 195–200.
health posts. In 2015 International Conference on Ruder, S. (2019). Neural transfer learning for natural lan-
Pervasive Computing (ICPC) (pp. 1–4). IEEE. guage processing (Diss). NUI Galway.
Maroto, M., Reshef, R., Munsterberg, A. E., Koester, Sager, N. (1972). Syntactic formatting of science infor-
S., Goulding, M., & Lassar, A. B. (1997). Ectopic mation. Proceedings of the AFIPS (pp. 791–800). In
Pax-3 activates MyoD and Myf-5 expression in R. Kittredge & J. Lehrberger (Eds.). (1982).
embryonic mesoderm and neural tissue. Cell, 89, Reprinted in Sublanguage: Studies of language in
139–148. restricted semantic domains (pp. 9–26). Berlin:
Meystre, S. M., Friedlin, F. J., South, B. R., Shen, S., & Walter de Gruyter.
Samore, M. H. (2010). Automatic de-identification Sager, N. (1978). Natural language information format-
of textual documents in the electronic health record: ting: The automatic conversion of texts to a struc-
A review of recent research. BMC Medical Research tured data base. In M. C. Yovits (Ed.), Advances in
Methodology, 10(1), 70. computers (Vol. 17, pp. 89–162). New York:
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013, Academic Press.
Jan 16). Efficient estimation of word representations Sager, N. (1981). Natural language information process-
in vector space. arXiv preprint arXiv:1301.3781. ing: A computer grammer of English and its applica-
Mork, J., Aronson, A., & Demner-Fushman, D. (2017). tions. Reading: Addison-Wesley.
12 years on – is the NLM medical text indexer still Sager, N., Friedman, C., & Lyman, M. (1987). Medical
useful and relevant? Journal of Biomedical language processing – computer management of nar-
Semantics, 8(1), 8. rative data. Reading: Addison-Wesley.
OHDSIPNAS, Lancet – Noemie? Savova, G. K., Masanz, J. J., Ogren, P. V., Zheng, J.,
openNLP. http://opennlp.apache.org/index.html. Sohn, S., Kipper-Schuler, K. C., & Chute, C. G.
Peng, Y., Rios, A., Kavuluru, R., & Lu, Z. (2018). (2010). Mayo clinical text analysis and knowledge
Extracting chemical–protein relations with extraction system (cTAKES): Architecture, compo-
272 D. Demner-Fushman et al.
Bioinformatics
Sean D. Mooney, Jessica D. Tenenbaum, and Russ B. Altman
Contents
References – 297
Bioinformatics
275 9
nnLearning Objectives ing how basic biological systems conspire
After reading this chapter, you should know to create molecules, organelles, living cells,
the answers to these questions: organs, and entire organisms. Remarkably,
55 Why are sequence, structure, and bio- however, the two disciplines share significant
logical pathway information relevant to methodological elements, so an understand-
medicine? ing of the issues in bioinformatics can be valu-
55 Where on the Internet should you look able for the student of clinical informatics and
for a DNA sequence, a protein sequence, vice versa.
or a protein structure? The discipline of bioinformatics continues
55 What are two problems encountered in to be in a period of rapid growth, because the
analyzing biological sequence, struc- needs for information storage, retrieval, and
ture, and function? analysis in biology—particularly in molecular
55 How has the age of genomics changed biology and genomics—have increased dra-
the landscape of bioinformatics? matically over the past two decades. History
55 What are two computational challenges has shown that scientific developments within
in bioinformatics for the future? the basic sciences tend to have a delayed effect
on clinical care and there is typically a lag of a
decade before the influence of basic research
9.1 he Problem of Handling
T on clinical medicine is realized. It cannot be
Biological Information understated the impact that genomics and
bioinformatic approaches are having in the
Bioinformatics is the study of how informa- clinic and the point of care. Indeed, chapters
tion is represented and analyzed in biological focusing on “Translational Bioinformatics”
systems, especially information derived at the and “Precision Medicine and Informatics”
molecular level. Whereas clinical informatics (7 Chaps. 28 and 30) describe how these foun-
deals with the management of information dational advances are leading toward impacts
related to the delivery of health care, bio- on human health and improved approaches to
informatics focuses on the management of clinical care.
information related to the underlying basic
biological sciences. As such, the two disciplines
are closely related—more so than generally 9.1.1 Many Sources
appreciated (see 7 Chap. 1). Bioinformatics of Biological Data
and clinical informatics share a concentration
on systems that are inherently uncertain, diffi- There are many sources of information that
cult to measure, and the result of complicated are revolutionizing our understanding of
interactions among multiple complex com- human biology and that are creating signifi-
ponents. Both deal with living systems that cant challenges for computational processing.
generally lack straight edges and right angles. New technologies are enabling the miniatur-
Although reductionist approaches to studying ization of laboratory experiments, increased
these systems can provide valuable lessons, it automation of experiments and through
is often necessary to analyze those systems advanced computer processing, and the inter-
using integrative models that are not based pretation of data quickly. These technolo-
solely on first principles. Nonetheless, the two gies are producing data at a staggering rate.
disciplines approach the patient from oppo- The data produced can interrogate different
site directions. Whereas applications within views into the Central Dogma of Biology, the
clinical informatics usually are concerned with metabolome, the metagenome and ancillary
the social systems of medicine, the cognitive molecular processes.
processes of medicine, and the technologies The most dominant new type of informa-
required to understand human physiology, tion is the sequence information produced by
bioinformatics is concerned with understand- genetic studies. This was enabled by the Human
276 S. D. Mooney et al.
Genome Project, an international undertaking This enabled the study of the expression of
intended to determine the complete sequence large numbers of genes with one another (Bai
of human DNA as it is encoded in each of and Elledge 1997) and to study multiple varia-
the 23 human chromosomes. The first draft of tions on a genome to explore the implications
the sequence was published in 2001 (Lander of changes in genome function on human dis-
et al. 2001) and a final version was announced ease. This work has led to the field of genom-
in 2003 coincident with the 50th anniversary ics, the study of the molecular state of a cell,
of the solving of the Watson and Crick struc- tissue or organism through the state and activ-
ture of the DNA double helix. The sequence ity of its genome. With technology advance-
continues to be revised and refined and now ments, gene expression can now be measured
the sequence the genomes of many differ- by directly sequencing messenger RNA mol-
ent individuals have been realized. Initially, ecules in a cell and counting the number of
the 1000 genomes consortium provided copies of that RNA molecule that is observed.
>1000 genomes of healthy individuals (1000 While some scientists are studying the
Genomes Consortium, 2010), and now datas- human genome, other researchers are study-
ets exist with >100,000 genomes of individu- ing the functions of the genomes of numerous
als with a variety of conditions.1 Essentially, other biological organisms, including impor-
the entire set of genetically driven events tant model organisms (such as mouse, rat,
from conception through embryonic develop- fruit fly and yeast) as well as important human
ment, childhood, adulthood, and aging are pathogens (such as Mycobacterium tuberculo-
9 encoded by the DNA blueprints within most sis or Haemophilus influenzae). The genomes
human cells. Given a complete knowledge of of these organisms have been determined, and
these DNA sequences, we are in a position to efforts are underway to characterize them.
understand these processes at a fundamen- These allow two important types of analysis:
tal level and to consider the possible use of the analysis of mechanisms of pathogenicity
DNA sequences for diagnosing and treating and the analysis of animal models for human
disease. This has led to the application of bio- disease. In both cases, the functions encoded
informatics (and other foundational domains) by genomes can be studied, classified, and
as Translational Bioinformatics and Precision categorized, allowing us to decipher how
Medicine Informatics (7 Chaps. 28 and 30). genomes affect human health and disease.
Additionally, large-scale experimental These ambitious scientific projects are not
methodologies are used to collect data on only proceeding at a furious pace, but also
thousands or millions or more molecules are often accompanied by another approach
simultaneously. Scientists apply these metho to biology, which produces another source
dologies longitudinally over time and across a of biomedical information: proteomics, the
wide variety of organisms or within an organ- study of the protein gene products of the
ism to observe the development of various genome—the proteome. Proteomics enables
physiological phenomena. Technologies give researchers to discover the state (quantity
us the ability to follow the production and and configuration) of proteins within an
degradation of molecules, such as the expres- organism. These protein states can be corre-
sion (transcription) of large numbers of genes lated with different physiological conditions,
simultaneously, the presence of proteins or including disease states. Some of these protein
metabolites in a biosample, or the populations states can be used as identifying markers of
of microorganisms in a sample. human disease. Similar approaches are being
The first high throughput experiments applied to understanding the diversity, con-
measured the expression of genes on gene centration levels and functions of non-DNA,
expression microarrays (Lashkari et al. 1997). RNA or protein molecules such as metabo-
lites through the study of the small molecules
in the metabolome.
Using these technologies together, we can
1 7 https://www.nhlbiwgs.org/ (accessed December 1,
2018).
now study the epigenome, the non-genetic
Bioinformatics
277 9
effects that influence genome function. These be highly specific and sensitive indicators
include molecules that directly alter the struc- of the subtype of disease and of that sub-
ture of DNA but not its sequence (such as type’s probable responsiveness to different
DNA methylation) or proteins that bind to therapeutic agents. Several genotype-based
DNA and affect how that DNA expresses databases have been developed to identify
genes. Epigenomics gives us a more complete markers that are associated with specific phe-
picture of how biology functions and what its notypes and identify how genotype affects a
implications are for human health. patient’s response to therapeutics. ClinVar2
All these technologies, along with the and The Human Gene Mutation Database
genome-sequencing projects, are conspiring (HGMD)3 both annotate mutations with
to produce a volume of biological informa- disease phenotype. This resource has become
tion that at once contains secrets to age-old invaluable for genetic counselors, basic
questions about health and disease and threat- researchers, and clinicians. Additionally,
ens to overwhelm our current capabilities of the Pharmacogenomics Knowledge Base
data analysis. Thus, bioinformatics is becom- (PharmGKB) collects genetic information
ing critical for medicine in the twenty-first that is known to affect a patient’s response to
century. a drug (more on PharmGKB is described in
Translational Bioinformatics, 7 Chap. 26).4
3. Ethical considerations. One of the critical
9.1.2 Implications for Clinical questions facing the genome- sequencing
Informatics and other related projects is “Can genetic
or other molecular information be mis-
The effects of this new biological information used?” The answer is certainly yes. With
on clinical medicine and clinical informat- knowledge of a complete genome for an
ics are still evolving. It is already clear, how- individual, it may be possible in the future
ever, that some major changes to medicine to predict the types of disease for which
will have to be accommodated. These efforts that individual is at risk years before the
have emerged as important areas of bio- disease actually develops. If this informa-
medical informatics that have become their tion fell into the hands of unscrupulous
own domains, Translational Bioinformatics employers or insurance companies, the
(7 Chap. 26) and Precision Medicine and individual might be denied employment or
Informatics (7 Chap. 28) and use of bio- coverage due to the likelihood of future dis-
technology data is now common in Clinical ease, however distant. There is even debate
Research Informatics (7 Chap. 27). about whether such information should
1. Genetic information in the medical record. be released to a patient even if it could
With the first set of human genomes now be kept confidential. Should a patient be
available and prices for gene sequencing informed that he or she is likely to get a
rapidly decreasing, it is now cost-effective disease for which there is no treatment?
to consider sequencing every patient What about that patient’s relatives, who
genome or at least genotyping key sections share genetic information with the patient?
of the genomes and integrating that with This is a matter of intense debate, and
the medical record. such questions have significant implica-
2. New diagnostic and prognostic information tions for what information is collected and
sources. One of the main contributions of for how and to whom that information
the genome-sequencing projects (and of the
associated biological innovations) is that
we are likely to have unprecedented access 2 7 https://www.ncbi.nlm.nih.gov/clinvar/ (accessed
to new diagnostic and prognostic tools. November 1, 2018).
Diagnostically, the genetic markers from a 3 7 http://www.hgmd.org/ (accessed November 1,
2018).
patient with an autoimmune disease, or of 4 7 http://www.pharmgkb.org/ (accessed November
an infectious pathogen within a patient, will 1, 2018).
278 S. D. Mooney et al.
is disclosed (Durfy 1993). Passage of the for the translation of proteins. Ribosomal
Genetic Information Nondiscrimination RNA, for example, is used in the construction
Act in 2008 set initial federal guidelines on of the ribosome, the huge molecular engine
use of genetic information.5 Additionally, that translates mRNA sequences into protein
the Personal Genome Project (PGP) has sequences. Additionally, mRNAs can be mod-
been working to define open consent mod- ified through alternative splicing, degradation,
els for releasing genetic information.6 and formation of secondary structures that
The Clinical Sequencing and Exploratory influence transcriptions. Once expressed, pro-
Research Consortium (CSER) has been teins are frequently modified (e.g. phosphory-
tackling the difficult issues in translation lated), and these modifications can change the
of genomic data to the clinic broadly.7 function of the protein. This process of DNA
being transcribed to RNA and RNA being
translated to protein is commonly referred to
9.2 The Rise of Bioinformatics as the Central Dogma of Biology.
Understanding the basic building blocks
A brief review of the biological basis of medi- of life requires understanding the function of
cine will bring into focus the magnitude of genomic sequences, genes, and proteins. When
the revolution in molecular biology and the are genes expressed? Once genes are transcribed
tasks that are created for the discipline of and translated into proteins, into what cellular
bioinformatics. The genetic material that we compartment are the proteins directed? How
9 inherit from our parents, that we use for the do the proteins function once there? Do the
structures and processes of life, and that we proteins need to be modified in order for them
pass to our children is contained in a sequence to become active? How are the proteins turned
of chemicals known as deoxyribonucleic acid off ? Experimentation and bioinformatics have
(DNA).8 The total collection of DNA for a divided the research into several areas, and
single person or organism is referred to as the largest are: (1) DNA and protein sequence
the genome. DNA is a long polymer chemi- analysis, (2) macromolecular structure–func-
cal made of four basic subunits. The sequence tion analysis, (3) gene expression analysis, (4)
in which these subunits occur in the poly- proteomics, (5) metabolomics, (6) metagenom-
mer distinguishes one DNA molecule from ics, and (5) systems biology.
another and directs a cell’s production of
proteins and all other basic cellular processes.
Genes are discreet units encoded in DNA 9.2.1 Roots of Modern
and they are transcribed into ribonucleic Bioinformatics
acid (RNA), which has a composition very
similar to DNA. Genes are transcribed into Practitioners of bioinformatics have come
messenger RNA (mRNA) and a majority of from many backgrounds, including medicine,
mRNA sequences are translated by complex molecular biology, chemistry, physics, statis-
macromolecular machines, called ribosomes, tics, mathematics, engineering, and computer
into protein. Not all RNAs are messengers science. It is difficult to define precisely the
ways in which this discipline emerged. There
are, however, two main developments that have
created opportunities for the use of informa-
5 7 http://www.genome.gov/10002328 (accessed
November 1, 2018).
tion technologies in biology. The first is the
6 7 http://www.personalgenomes.org/ (accessed progress in our understanding of how biologi-
November 1, 2018). cal molecules are constructed and how they
7 7 https://cser-consortium.org/ (accessed November perform their functions. This dates back as far
1, 2018). as the 1930s with the invention of electropho-
8 If you are not familiar with the basic terminology of
molecular biology and genetics, reference to an
resis, and then in the 1950s with the elucidation
introductory textbook in the area would be helpful of the structure of DNA and the subsequent
before you read the rest of this chapter. sequence of discoveries in the relationships
Bioinformatics
279 9
among DNA, RNA, and protein structure. critical elements. This is because advances in
The second development has been the paral- research methods such as genetic sequenc-
lel increase in the availability of computing ing, experimental robotics and microfluid-
power. Starting with mainframe computer ics, X-ray crystallography, nuclear magnetic
applications in the 1950s and moving to mod- resonance spectroscopy, cryoelectron micros-
ern workstations, and ‘the Cloud’, there have copy, proteomic mass spectrometry and other
been hosts of biological problems addressed high throughput experiments have resulted in
with computational methods. experiments that generate massive amounts
of data. These data pose new problems for
basic researchers on how the data are properly
9.2.2 The Genomics Explosion stored, analyzed, and disseminated.
The volume of data being produced by
The benefit of the human genome sequence genomics projects is staggering. There are now
to medicine is both in the short and in the more than 211 million sequences in GenBank
long term. The short-term benefits lie prin- comprising more than 285 billion digits. Since
cipally in diagnosis; the availability of 2008, sequencing has bested Moore’s law (see
sequences of normal and variant human 7 Chap. 1).9 But these data do not stop with
genes will allow for the rapid identification sequence data: PubMed contains over 28
of these genes in any patient (e.g., Babior million literature citations, the Protein Data
and Matzner 1997). The long-term benefits Bank (PDB) contains three-dimensional
will include a greater understanding of the structural data for over 45,538 distinct protein
proteins produced from the genome: how the structures, and the Gene Expression Omnibus
proteins interact with drugs; how they mal- (GEO) contains over 2.8 million arrayed sam-
function in disease states; and how they par- ples. These data are of incredible importance
ticipate in the control of development, aging, to biology, and in the following sections we
and responses to disease. introduce and summarize the importance of
The effects of genomics on biology and sequences, structures, gene expression experi-
medicine cannot be overstated. We now have ments, systems biology, and their computa-
the ability to measure the activity and func- tional components to medicine.
tion of genes within living cells. Genomics
data and experiments have changed the way
biologists think about questions fundamen- 9.3.1 Sequences in Biology
tal to life. Whereas in the past, reductionist
experiments probed the detailed workings of Sequence information (including DNA
specific genes, we can now assemble those data sequences, RNA sequences, and protein
together to build an accurate understanding sequences) is critical in biology: DNA, RNA,
of how cells work. and protein can be represented as a set of
sequences of basic building blocks (bases for
DNA and RNA, amino acids for proteins).
9.3 Biology Is Now Data-Driven Computer systems within bioinformatics thus
must be able to handle biological sequence
Nearly 30 years ago, the use of computers information effectively and efficiently. To
was proving to be useful to the laboratory that end, the bioinformatics community has
researcher. Today, computers are an essential developed central databases to store sequence
component of modern research. This has led information, data models to represent that
to a change in thinking about the role of com- information and software analysis tools to pro-
puters in biology. Before, they were optional cess sequence data.
tools that could help provide insight to expe-
rienced and dedicated enthusiasts. Today,
they are required by most investigators, and 9 7 http://www.genome.gov/sequencingcosts/
experimental approaches rely on them as (accessed November 1, 2018).
280 S. D. Mooney et al.
sate for environmental changes, how patho- changes have been associated with sponta-
gens fight antibiotics, and how cells grow neous mutations in cancer, complex genetic
uncontrollably (as is found in cancer). A chal- diseases, and Mendelian inherited genetic
lenge for biological computing is to develop diseases. Second, cytosine bases in the DNA
methods to analyze these data, tools to store can be methylated and this can affect gene
these data, and computer systems to collect expression. DNA methylation patterns can
the data automatically. be passed on when DNA is replicated. Like
chromosome structure, these modifications
have been associated with human disease
9.3.5 Metabolomics Data in Biology (Bird 2002).
for experimental groups. Numerous other data- edge about the evolution of the molecules that
bases store the sequences of protein molecules15 we typically do not have. There are now, how-
and information about human genetic diseases.16 ever, well-established algorithms for finding
Included among the databases that have the mathematically optimal alignment of two
accelerated the development of bioinformatics sequences. These algorithms require the two
is the Medline database of the biomedical lit- sequences and a scoring system based on (1)
erature and its paper-based companion Index exact matches between amino acids that have
Medicus (see 7 Chap. 23).17 Including articles not mutated in the two sequences and can be
as far back as 1809 and brought online free on aligned perfectly; (2) partial matches between
the Web in 1997, Medline provides the glue that amino acids that have mutated in ways that
relates many high-level biomedical concepts have preserved their overall biophysical prop-
to the low-level molecule, disease, and experi- erties; and (3) gaps in the alignment signifying
mental methods. In fact, this “glue” role was places where one sequence or the other has
the basis for creating the NCBI suite of data- undergone a deletion or insertion of amino
bases and software and PubMed systems (see acids. The algorithms for determining opti-
7 Sect. 9.5) for integrating access to literature mal sequence alignments are based on a tech-
references and the associated d
atabases. nique in computer science known as dynamic
programming and are at the heart of many
computational biology applications (Gusfield
9.4.2 Sequence Alignment 1997). . Figure 9.1 shows an example of a
9 and Genome Analysis Smith-Waterman matrix, the first described
local alignment algorithm that utilizes a
Perhaps the most basic activity in computa- dynamic programming approach. The algo-
tional biology is comparing two biological rithm works by calculating a similarity matrix
sequences to determine (1) whether they are between two sequences, then finding optimal
similar and (2) how to align them. The prob- paths through the matrix that maximize a
lem of alignment is not trivial but is based on a similarity score between the two sequences.
simple idea. Sequences that perform a similar Unfortunately, the dynamic programming
function should, in general, be descendants of algorithms are too computationally expensive
a common ancestral sequence, with mutations to apply to large numbers of sequences, so a
over time. These mutations can be replace- number of faster, more heuristic methods have
ments of one amino acid with another, dele- been developed. The most popular algorithm
tions of amino acids, or insertions of amino is the Basic Local Alignment Search Tool
acids. The goal of sequence alignment is to (BLAST) (Altschul et al. 1990). BLAST is
align two sequences so that the evolutionary based on the observation that sections of pro-
relationship between the sequences becomes teins are often conserved without gaps (so the
clear. If two sequences are descended from gaps can be ignored—a critical simplification
the same ancestor and have not mutated too for speed) and that there are statistical analy-
much, then it is often possible to find corre- ses of the occurrence of small subsequences
sponding locations in each sequence that play within larger sequences that can be used to
the same role in the evolved proteins. The prune the search for matching sequences
problem of solving correct biological align- in a large database. These tools work well
ments is difficult because it requires knowl- for both protein and nucleic acid sequences.
Other tools have been developed that are bet-
ter suited for nucleic acid sequence assembly
and mapping of short read high throughput
15 7 http://www.uniprot.org/ (accessed December 1, sequencing data including BLAT (Kent 2003),
2018). SOAP (Li et al. 2008), and others.
16 7 http://www.ncbi.nlm.nih.gov/omim (accessed
December 1, 2018).
Protein 3D structures can be aligned, visu-
17 7 http://www.ncbi.nlm.nih.gov/pubmed (accessed alized and compared in a similar way to lin-
December 1, 2018). ear protein sequences (. Fig. 9.2). Tools such
Bioinformatics
285 9
a) Pairwise alignment between human chymotrypsin and human trypsin.
CTRB_HUMAN MAFLWLLSCWALLGTTFGCGVPAIHPVLSGLSRIVNGEDAVPGSWPWQVSLQDKTGFHFC
TRY1_HUMAN MNPLLILTFVA- - - - - - - - - - - - AALAAPFDDDDKIVGGYNCEENSVPYQVSLN- - SGFHFC
CTRB_HUMAN GGSLISEDWVVTAAHCGVRTSDDVVVAGEFDQGSDEENIQVLKIAKVFKNPKFSILTVNND
TRY1_HUMAN GGSLINEQWVVSAGHC- YKSRIQVRLGEHNIEVLEGNEQFINAAKIIRHPQYDRKTLNND
CTRB_HUMAN ITLLKLATPARFSQTVSAVCLPSADDDFPAGTLCATTGWGKTKYNANKTPDKLQQAALPL
TRY1_HUMAN IMLIKLSSRAVINARVSTISLPTAPP - - ATGTKCLISGWGNTASSGADYPDYPDELQCLDAPV
CTRB_HUMAN LSNAECKKSWGRRITDVMICAG - - ASGVSSCMGDSGGPLVCQKDGAWTLVGIVSWGSDTC
TRY1_HUMAN LSQAKCEASYPGKITSNMFCVGFLEGGKDSCQGDSGGPVVCNG - - - - QLQGVVSWGDGCA
CTRB_HUMAN STSSPGVYARVTKLIPWVQKILLAN -
TRY1_HUMAN QKNKPGVYTKVYNYVKWIKNTIAANS
b ) S m i t h Wa t e r m a n m a t r i x i l l u s t r a t i n g t h e a l i g n e d r e g i o n i n A , u s i n g t h e B L O S U M 6 2
m u t a t i o n m a t r i x ( H e n i k ff a n d H e n i k o ff, 1 9 9 4 ) .
G F L E G G K D S C Q G D S G G P V V C N G Q L Q
G 6 -3 -4 -2 6 6 -2 -1 0 -3 -2 6 1 0 6 6 -2 -3 -3 -3 0 6 -2 -4 -2
A 0 -2 -1 -1 0 0 -1 -2 1 0 -1 0 -2 1 0 0 -1 0 0 0 -2 0 -1 -1 -1
S 0 -2 -2 0 0 0 0 0 4 -1 0 0 0 4 0 0 -1 -2 -2 -1 1 0 0 -2 0
G 6 -3 -4 -2 6 6 -2 -1 0 -3 -2 6 1 0 6 6 -2 -3 -3 -3 0 6 -2 -4 -2
V -3 -1 1 -2 -3 -3 -2 -3 -2 -1 -2 -3 -3 -2 -3 -3 -2 4 4 -1 -3 -3 -2 1 -2
S 0 -2 -2 0 0 0 0 0 4 -1 0 0 0 4 0 0 -1 -2 -2 -1 1 0 0 -2 0
S 0 -2 -2 0 0 0 0 0 4 -1 0 0 0 4 0 0 -1 -2 -2 -1 1 0 0 -2 0
C -3 -2 -1 -4 -3 -3 -3 -3 -1 9 -3 -3 -3 -1 -3 -3 -3 -1 -1 9 -3 -3 -3 -1 -3
M -3 0 2 -2 -3 -3 -1 -3 -1 -1 0 -3 -3 -1 -3 -3 -2 1 1 -1 -2 -3 0 2 0
G 6 -3 -4 -2 6 6 -2 -1 0 -3 -2 6 1 0 6 6 -2 -3 -3 -3 0 6 -2 -4 -2
D -1 -3 -4 2 -1 -1 -1 -6 0 -3 0 -1 6 0 -1 -1 -1 -3 -3 -3 1 -1 0 -4 0
S 0 -2 -2 0 0 0 0 0 4 -1 0 0 0 4 0 0 -1 -2 -2 -1 1 0 0 -2 0
G 6 -3 -4 -2 6 6 -2 -1 0 -3 -2 6 1 0 6 6 -2 -3 -3 -3 0 6 -2 -4 -2
G 6 -3 -4 -2 6 6 -2 -1 0 -3 -2 6 1 0 6 6 -2 -3 -3 -3 0 6 -2 -4 -2
P -2 -4 -3 -1 -2 -2 -1 -1 -1 -3 -1 -2 -1 -1 -2 -2 7 -2 -2 -3 -2 -2 -1 -3 -1
L -4 0 4 -3 -4 -4 -2 -4 -2 -1 -2 -4 -4 -2 -4 -4 -3 1 1 -1 -3 -4 -2 4 -2
V -3 -1 1 -2 -3 -3 -2 -3 -2 -1 -2 -3 -3 -2 -3 -3 -2 4 -4 -1 -3 -3 -2 1 -2
C -3 -2 -1 -4 -3 -3 -3 -3 -1 9 -3 -3 -3 -1 -3 -3 -3 -1 -1 9 -3 -3 -3 -1 -3
Q -2 -3 -2 -2 -2 -2 1 0 0 -3 5 -2 0 0 -2 -2 -1 -2 -2 -3 0 -2 5 -2 5
K -2 -3 -2 1 -2 -2 5 -1 0 -3 1 -2 -1 0 -2 -2 -1 -2 -2 -3 0 -2 -1 -1 1
D -1 -3 -4 2 -1 -1 -1 6 0 -3 0 -1 6 0 -1 -1 -1 -3 -3 -3 1 -1 1 -4 1
G 6 -3 -4 -2 6 6 -2 -1 0 -3 -2 6 1 0 6 6 -2 -3 -3 -3 0 6 -2 -4 -2
A 0 -2 -1 -1 0 0 -1 -2 1 0 -1 0 -2 1 0 0 -1 0 0 0 -2 0 -1 -1 -1
W -2 1 -2 -3 -2 -2 -3 -4 -3 -2 -2 -2 -4 -3 -2 -2 -4 -3 -3 -2 -4 -2 -2 -2 -2
T -2 -2 -1 -1 -2 -2 -1 -1 1 -1 -1 -2 -1 1 -2 -2 -1 0 0 -1 0 -2 -1 -1 -1
L -4 0 4 -3 -4 -4 -2 -4 -2 -1 -2 -4 -4 -2 -4 -4 -3 1 1 -1 -3 -4 -2 -4 -2
V -3 -1 1 -2 -3 -3 -2 -3 -2 -1 -2 -3 -3 -2 -3 -3 -2 4 4 -1 -3 -3 -2 1 -2
.. Fig. 9.1 Example of sequence alignment using the Smith Waterman algorithm
286 S. D. Mooney et al.
will be clusters that are merged. One way to 9.4.4.1 Classification and Prediction
determine whether the chosen k is correct is to A high level description of some common
estimate the average distance from any mem- approaches to classification or supervised
ber profile to the center of mass. By varying learning are described below, but note that
k, it is best to choose the lowest k where this entire courses could be, and are, taught on
average is minimized for each cluster. Another each of these methods. For further details we
drawback of K-means is that different initial refer readers to the suggested texts at the end
conditions can give different results, therefore of this chapter.
it is often prudent to test the robustness of the One of the simplest methods for clas-
results by running multiple runs with different sification is that of k-nearest-neighbor, or
starting configurations (. Fig. 9.3). KNN. Essentially, KNN uses the classifica-
The future clinical usefulness of these algo- tion of the k closest instances to a given input
rithms cannot be overstated. In 2002, van’t Veer as a set of votes regarding how that instance
et al. (2002) found that a gene expression pro- should be classified. Unfortunately, KNN
file could predict the clinical outcome of breast tends not to be useful for omics-based classifi-
cancer. The global analysis of gene expression cation because it tends to break down in high-
showed that some cancers were associated with dimensional space. For high-dimensional
different prognosis, not detectable using tradi- data, KNN has difficulty in finding enough
tional means. Another exciting advancement neighbors to make prediction, which will
in this field is the potential use of microarray lead to large variation in the classification.
9 expression data to profile the molecular effects This breakdown is one aspect of the “curse
of known and potential therapeutic agents. This of dimensionality,” described in more detail
molecular understanding of a disease and its below (Hastie et al. 2009).
treatment will soon help clinicians make more A more general statistical approach to
informed and accurate treatment choices (for supervised learning, and one which encom-
more, see 7 Chap. 26). passes a number of popular methods, is that
of function approximation. In this approach,
one attempts to find a useful approximation of
the function f(x) that underlies the actual rela-
tion between the inputs and outputs. In this
case, one chooses a metric by which to judge
the accuracy of the approximation, for exam-
ple the residual sum of squares, and uses this
metric to optimize the model to fit the training
data. Bayesian modeling, logistic regression,
and Support Vector Machines all use varia-
tions on this approach.
Finally, there is the class of rule-based clas-
sifiers. This type of classifier may be thought
of as a series of rules, each of which splits the
set of instances based on a given characteris-
tic. Details such as what criteria are used to
choose the feature on which to base a rule,
and whether the algorithm uses enhancements
such ensemble learning (i.e., multiple models
together) determine the specifics of the clas-
.. Fig. 9.3 The exponential growth of GEN- sifier type, for example decision trees, random
BANK. This plot shows that since 1982 the number of forests, or covering rules.
bases in GENBANK has grown by five full orders of
Which approach to use depends both on
magnitude and continues to grow by a factor of 10 every
4 years the nature of the data and the question being
Bioinformatics
289 9
asked. The question might prioritize sensitivity for multiple hypothesis testing.24 It entails
over specificity or vice versa. For example, for a dividing the threshold p-value one would use,
test to detect a life-threatening infection that is traditionally 0.05, by the number of hypoth-
easily treatable by readily available antibiotics, eses. So, for a test of 20,000 genes, one would
one might want to err on the side of sensitivity. require a p-value of 2.5 × 10−6 to call a gene
In addition, data may be numeric or categori- significant. Typically, analyses using high
cal or have differing degrees of noise, missing dimensional data such as gene expression are
values, correlated features or non-linear inter- not sufficiently powered to pass this stringent
actions among features. These different quali- test. One would need thousands of samples to
ties are better handled by different methods. In be sufficiently powered. Another approach is
many cases the best approach is actually to try to use q-value, or false discovery rate (Storey
a number of different methods and to compare and Tibshirani 2003), rather than p-value.
the results. Such comparative analysis is facili- This approach relies on empirical permuta-
tated through freely available software pack- tion to determine the expected number of
ages such as R/Bioconductor22 and Weka.23 false positives if indeed the null hypothesis
is correct, which enables approximation of
the proportion of false positives among all
9.4.5 The Curse of Dimensionality reported positives. Consider again the micro-
array experiment above in which each array
In the post-genomic era, there is no shortage includes 20,000 genes. We want to know
of data to analyze. Rather, many researchers whether gene X was differentially expressed
have more data than they know what to do in cases versus controls. Choosing a threshold
with. However this overabundance tends to p-value, or false positive rate, of 0.05 means
be a factor of the dimensionality of the data, that 1 time in 20 we will erroneously reject
rather than the number of subjects. This mis- the null hypothesis and predict a false posi-
match can lead to challenges for experimental tive. If a statistical test returns 2000 positives,
design and statistical analysis. Type 1 error, i.e. 2000 genes appear to be significantly dif-
or the tendency to incorrectly reject the null ferentially expressed, we expect 1 in 20 of the
hypothesis and say that indeed there is statisti- genes being analyzed (20,000 × (1/20) = 1000)
cal significance to a pattern (see 7 Chap. 13), or approximately half of them to be false
is amplified by looking at high-dimensional positives. A false discovery rate of 0.05, on
data. This is one aspect of what is known as the other hand, would mean that 5% of those
the “curse of dimensionality” (Hastie et al. called positive, in this case 100 out of 2000,
2009). Consider analysis of gene expression are false positives. Q-value is thus less strin-
data for 20,000 genes, trying to detect a pattern gent than p-value, but may be of greater util-
that can predict outcome. In a sample of, say, ity in a high-dimensional omics context than
30 subjects—a reasonable number when test- a traditional p-value or correction for multiple
ing a single hypothesis—by random chance, hypotheses.
some number of genes will correlate with Another approach to analysis of high
outcome. Essentially one is testing not one dimensional data sets is to use dimensionality
but 20,000 hypotheses simultaneously. One reduction methods such as feature selection
must therefore correct for multiple hypoth- or feature extraction. Feature selection entails
esis testing. The Bonferroni method is a com- extracting only a subset of the features at
mon and straightforward approach to correct hand, in this case genes. This may be done in
a number of ways, based on which genes vary
the most, or on which genes seem to best pre-
dict the categorization at hand. In contrast,
22 7 http://bioconductor.org/ (accessed December 1,
2018).
23 7 http://www.cs.waikato.ac.nz/ml/weka/ (accessed 24 7 http://en.wikipedia.org/wiki/Bonferroni_correc-
December 1, 2018). tion (accessed December 1, 2018).
290 S. D. Mooney et al.
feature extraction creates a new smaller set of gives integrated access to the biomedical litera-
features that captures the essence of the origi- ture, protein, and nucleic acid sequences, mac-
nal variation. As an example, imagine a plane romolecular and small molecular structures,
flight from Seattle, WA to Key West, FL. One and genome project links (including both
could use a 3-dimensional vector consist- the Human Genome Project and sequenc-
ing of latitude and longitude to describe the ing projects that are attempting to determine
plane’s position at any given point along the the genome sequences for organisms that are
way. In this case, one value would describe either human pathogens or important experi-
how far the plane had gone in the north/south mental model organisms) in a manner that
direction, and one would indicate how far takes advantages of either explicit or com-
the plane had gone in the east/west direction. puted links between these data resources.25
However, if we change the axis along which Newer technologies are being developed that
we are measuring to instead be the direct route will allow multiple heterogeneous databases
along which the plane is flying, then we only to be accessed by search engines that can com-
need 1 dimension to describe where the plane bine information automatically, thereby pro-
is located. The distance flown tells us where cessing even more intricate queries requiring
the plane is located at any given time. This knowledge from numerous data sources. One
approach of changing the axes is the basis for example is the Bioconductor project, a tool-
principle components analysis (PCA), a com- box for bioinformatics in the R programming
mon method for feature extraction. Instead of language.26
9 going from two dimensions to one, PCA on
gene expression data typically goes from tens
of thousands of features to just a few. Both 9.5.1 Data Sharing
for feature selection and feature extraction,
it is important to replicate the findings in an In 1996, the First International Strategy
independently generated data set in order to Meeting on Human Genome Sequencing was
be sure the model is not over fitting the data held in Bermuda. In this meeting, a set of prin-
on which it was trained. ciples was agreed upon regarding sharing of
human genome sequencing data. These prin-
ciples came to be known as the Bermuda prin-
9.5 urrent Application Successes
C ciples. They stipulated that (1) all sequence
from Bioinformatics assemblies larger than 1 kb should be released
as soon as possible, ideally within 24 h; (2)
Biologists have embraced the Internet in a finished annotated sequences should be pub-
remarkable way and have made access to data lished immediately to public databases; and
a normal and expected mode for doing busi- (3) that all human sequence data generated in
ness. Hundreds of databases curated by indi- large-scale sequencing centers should be made
vidual biologists create a valuable resource available in the public domain.27
for the developers of computational methods Increasingly, journals and funders require
who can use these data to test and refine their that researchers deposit all types of research
analysis algorithms. With standard Internet data in publicly available repositories (Fischer
search engines, most biological databases can and Zigmond 2010). In 2009, President
be found and accessed within moments. The Obama announced an Open Government
large number of databases has led to the devel-
opment of meta-databases that combine infor-
mation from individual databases to shield the 25 7 https://www.ncbi.nlm.nih.gov/search/ (accessed
user from the complex array that exists. There December 7th, 2020).
are various approaches to this task. 26 7 http://bioconductor.org/ (accessed December 1,
The National Center for Biotechnology 2018).
27 7 http://www.ornl.gov/sci/techresources/Human_
Information (NCBI) suite of databases and Genome/research/bermuda.shtml (accessed Decem-
software (previously known as the ‘Entrez’ ber 1, 2018).
Bioinformatics
291 9
Directive that included plans to make fed- ment of such schemes necessitates the cre-
erally funded research data available to the ation of terminology standards, just as in
public.28 This announcement describes the clinical informatics. There are now many con-
NIH’s policy regarding published manuscripts trolled vocabularies (or ontologies) and meta-
in particular, but also notes that the results of data standards for annotation of genomic
vgovernment-funded research can take many or proteomic data. Metadata standards help
forms, including data sets. Currently the NIH define information which should be collected
requires that proposals for funding of over and annotated upon various types of datas-
$500,000 include a data sharing plan.29 ets. Furthermore, a great many tools have
To that end, a significant advancement in been developed to help researchers access and
bioinformatics is in making research datasets analyze this data. For example, the previously
more available and reusable. From the com- mentioned Bioconductor project provides
munity of researchers who are enabling this bioinformatic tools in the R language for
effort the concept of FAIR data has emerged. solving common problems. Other commonly
FAIR datasets are Findable, Accessible, used tools include BioPerl, BioPython and
Interoperable and Reusable. FAIR data MATLAB.32
principles lay out a framework to encourage Biomedical ontologies have become a key
increased sharing and use of scientific data- component in the development of metadata
sets. Findable data includes the use of global standards for the management and exchange
persistent identifiers and metadata standards. of bioinformatic datasets and in making data
Accessible data is available on the Internet more FAIR (see 7 Sect. 9.5.1). The open bio-
and searchable through metadata usage. medical ontologies consortium (OBO) has
Interoperable data use a “formal, accessible, developed a number of reference ontologies
shared and broadly applicable language for that are in wide use in bioinformatics including
knowledge representation”. Finally, reusable Gene Ontology, Human Phenotype Ontology
data have clear attribution and license that and the UBERON anatomy ontology (Smith
enables reuse. The webportal FAIRsharing et al. 2007). For example, Gene Ontology
provides curated resources on datasets, stan- (GO) is an ontology used for annotation of
dards and collections that are more FAIR.30 gene function, and arguably the most widely
Resources such as BioCaddie DataMed used ontology in basic research. Ontologies
enable discovery of datasets through a Data enable indexing, exchange and computing
Discovery Index.31 with biomedical datasets and metadata.
Metadata standards for bioinformat-
ics datasets are an intellectual challenge for
9.5.2 ata Standards, Metadata
D researchers to enable the sharing and interop-
and Biomedical Ontologies erability of data and to make data more
FAIR. There are a number of tools and web
7 Chapter 7 on standards in biomedical portals such as the Center for Expanded Data
informatics addresses standardized terminol- Annotation and Retrieval (CEDAR) provide
ogies as well as standards for data exchange, tools for creation and sharing of metadata
and terminologies for translational research about datasets.33 Metadata can include infor-
are discussed in 7 Chap. 27. The develop- mation about an experiment such as the pro-
tocol, the time the experiment was performed,
who performed the experiment and technology
28 7 http://edocket.access.gpo.gov/2009/E9-29322. used to generate or analyze the experiment, but
htm (accessed December 1, 2018).
29 7 http://grants.nih.gov/grants/guide/notice-files/
NOT-OD-03-032.html (accessed December 1,
2018). 32 7 http://www.open-bio.org/ (accessed December 1,
30 7 https://fairsharing.org/ (accessed December 1, 2018).
2018). 33 7 https://metadatacenter.org/ (accessed December
31 7 https://datamed.org/ (accessed April 20, 2019). 1, 2018).
292 S. D. Mooney et al.
.. Fig. 9.4 The NCBI Gene entry for the digestive of the key regions in the sequence and the complete
enzyme chymotrypsin. Basic information about the sequence of DNA bases (a, g, t, and c) is provided as a
original report is provided, as well as some annotations link. (Courtesy of NCBI)
can also include information such as organism, interpreted cautiously. GENBANK can be
disease model, tissue, conditions, etc. searched efficiently with a number of algo-
rithms and is usually the first stop for a scien-
9.5.2.1 Sequence and Genome tist with a new sequence who wonders “Has a
Databases sequence like this ever been observed before?
The main types of sequence information If one has, what is known about it?” There are
that must be stored are DNA and protein. increasing numbers of stories about scientists
One of the largest DNA sequence databases using GENBANK to discover unanticipated
is GENBANK, which is managed by the relationships between DNA sequences, allow-
NCBI.23 GENBANK is growing rapidly as ing their research programs to leap ahead
genome- sequencing projects feed their data while taking advantage of information col-
(often in an automated procedure) directly lected on similar sequences.
into the database. . Figure 9.3 shows the A database that has become very useful
logarithmic growth of data in GENBANK recently is the University of California Santa
since 1982. NCBI Gene curates some of the Cruz Genome Browser34 (. Fig. 9.5). This
many genes within GENBANK and presents data set allows users to search for specific
the data in a way that is easy for the researcher sequences in the UCSC version of the human
to use (. Fig. 9.4). genome. Powered by the similarity search tool
In addition to GENBANK, there are BLAT, users can quickly find annotations on
numerous special-purpose DNA databases the human genome that contain their sequence
for which the curators have taken special care of interest. These annotations include known
to clean, validate, and annotate the data. The
work required of such curators indicates the
degree to which raw sequence data must be 34 7 http://genome.ucsc.edu/ (accessed December 1,
2018).
Bioinformatics
293 9
.. Fig. 9.5 Screen from the UC Santa Cruz genome t are represented from left to right (5–3). The annota-
browser showing the chymotrypsin C gene. The rows in tions include gene predictions and annotations as well
the browser show annotations on the gene sequence. as an alignment of the similarity of this region of the
The browser window here shows a small segment of genome when compared with the mouse genome
human chromosome 15, as if the sequence of a, g, c and
variations (mutations and SNPs), genes, com- molecules (usually less than 100 atoms) and
parative maps with other organisms, and the PDB36 for macromolecules (see 7 Sect.
many other important data. 9.3.2), including proteins and nucleic acids,
and combinations of these macromolecules
with small molecules (such as drugs, cofac-
9.5.3 Structure Databases tors, and vitamins). The PDB has approxi-
mately 75,000 high-resolution structures, but
Although sequence information is obtained this number is misleading because many of
relatively easily, structural information them are small variants on the same struc-
remains expensive on a per-entry basis. The tural architecture. There are approximately
experimental protocols used to determine 100,000 proteins in humans; therefore, many
precise molecular structural coordinates structures remain unsolved (e.g., Burley and
are expensive in time, materials, and human Bonanno 2002). In the PDB, each structure is
power. Therefore, we have only a small num- reported with its biological source, reference
ber of structures for all the molecules char- information, manual annotations of interest-
acterized in the sequence databases. The two ing features, and the Cartesian coordinates of
main sources of structural information are the each atom within the molecule. Given knowl-
Cambridge Structural Database35 for small edge of the three-dimensional structure of
35 7 https://www.ccdc.cam.ac.uk/solutions/csd-sys-
tem/components/csd/ (accessed December 15, 36 7 https://www.rcsb.org/ (accessed December 18,
2018). 2018).
294 S. D. Mooney et al.
molecules, the function sometimes becomes way were removed (would other pathways be
clear. For example, the ways in which the med- used to create the desired function, or would
ication methotrexate interacts with its biologi- the organism lose a vital function and die?),
cal target have been studied in detail for two and (3) the ability to provide a rich user inter-
decades. Methotrexate is used to treat cancer face to the literature on E. coli metabolism.
and rheumatologic diseases, and it is an inhib- Similarly, the Kyoto Encyclopedia of Genes
itor of the protein dihydrofolate reductase, an and Genomes (KEGG) provides pathway
important molecule for cellular reproduction. datasets for organism genomes.39
The three-dimensional structure of dihydro-
folate reductase has been known for many
years and has thus allowed detailed studies 9.5.5 Integrative Databases
of the ways in which small molecules, such as
methotrexate, interact at an atomic level. As A integrative database is a postgenomic data-
the PDB increases in size, it becomes impor- base that bridges the gap between molecular
tant to have organizing principles for thinking biological databases with those of clinical
about biological structure. SCOP237 provides importance. One excellent example of a post-
a classification based on the overall structural genomic database is the Online Mendelian
features of proteins. It is a useful method for Inheritance in Man (OMIM) database, which
accessing the entries of the PDB. is a compilation of known human genes and
genetic diseases, along with manual annota-
9 tions describing the state of our understanding
9.5.4 Analysis of Biological of individual genetic disorders.40 Each entry
Pathways and Understanding contains links to special-purpose databases and
of Disease Processes thus provides links between clinical syndromes
and basic molecular mechanisms (. Fig. 9.6).
The ECOCYC project is an example of a com-
putational resource that has comprehensive
information about biochemical pathways. 9.6 Future Challenges
ECOCYC is a knowledge base of the meta- as Bioinformatics and Clinical
bolic capabilities of E. coli; it has a repre- Informatics Converge
sentation of all the enzymes in the E. coli
genome and of the chemical compounds Bioinformatics didn’t solve all of its problems
those enzymes transform.38 It also links these with the sequencing of the human genome.
enzymes to their genes, and genes are mapped There is a series of challenges for which
to the genome sequence. the completion of the first human genome
EcoCyc also encodes the genetic regula- sequence is only the beginning.
tory network of E. coli, describing all protein
and RNA regulators of E. coli genes. The
network of pathways within ECOCYC pro- 9.6.1 Linkage of Molecular
vides an excellent substrate on which useful Information with Symptoms,
applications can be built. For example, they
Signs, and Patients
provide: (1) the ability to guess the function
of a new protein by assessing its similarity to
There is currently a gap in our understand-
E. coli genes with a similar sequence, (2) the
ing of disease processes. Although we have
ability to ask what the effect on an organism
a good understanding of the principles by
would be if a critical component of a path-
39 7 http://www.genome.jp/kegg/pathway.html
37 7 http://scop2.mrc-lmb.cam.ac.uk/ (accessed Dece (accessed December 1, 2018).
mber 15, 2018). 40 7 http://www.ncbi.nlm.nih.gov/omim/ (accessed
38 7 http://ecocyc.org/ (accessed December 15, 2018). December 1, 2018).
Bioinformatics
295 9
.. Fig. 9.6 Screen from the Online Mendelian Inheri- in which chymotrypsin (NCBI Gene entry shown in
tance in Man (OMIM) database showing an entry for . Fig. 9.2) is totally absent (as are some other key
pancreatic insufficiency, an autosomal recessive disease digestive enzymes). (Courtesy of NCBI)
which small groups of molecules interact, we cal literature provides exciting opportunities
are not able to explain fully how thousands of for making data easily available to scientists.
molecules interact within a cell to create both Already, certain types of simple data that
normal and abnormal physiological states. As are produced in large volumes are expected
the databases continue to accumulate infor- to be included in manuscripts submitted for
mation ranging from patient-specific data to publication, including new sequences that are
fundamental genetic information, a major required to be deposited in GENBANK and
challenge is creating the conceptual links new structure coordinates that are deposited
among these databases to create an audit trail in the PDB. However, there are many other
from molecular-level information to macro- experimental data sources that are currently
scopic phenomena, as manifested in disease. difficult to provide in a standardized way,
The availability of these links will facilitate either because the data are more intricate than
the identification of important targets for those stored in GENBANK or PDB or they
future research and will provide a scaffold for are not produced in a volume sufficient to fill a
biomedical knowledge, ensuring that impor- database devoted entirely to the relevant area.
tant literature is not lost within the increasing Knowledge base technology can be used,
volume of published data. however, to represent multiple types of highly
interrelated data.
Knowledge bases can be defined in many
9.6.2 Computational ways (see 7 Chap. 24); for our purposes, we
Representations can think of them as databases in which (1)
of the Biomedical Literature the ratio of the number of tables to the num-
ber of entries per table is high compared with
An important opportunity within bioinfor- usual databases, (2) the individual entries (or
matics is the linkage of biological experimen- records) have unique names, and (3) the values
tal data with the published papers that report of many fields for one record in the database
them. Electronic publication of the biologi- are the names of other records, thus creating
296 S. D. Mooney et al.
a highly interlinked network of concepts. The informatics and clinical informatics because
structure of knowledge bases often leads to they both focus on representing, storing,
unique strategies for storage and retrieval of and analyzing biological or biomedical data.
their content. To build a knowledge base for These technologies include the creation and
storing information from biological experi- management of standard terminologies and
ments, there are some requirements. First, data representations, the integration of het-
the set of experiments to be modeled must erogeneous databases, the organization and
be defined. Second, the key attributes of each searching of the biomedical literature, the use
experiment that should be recorded in the of machine learning techniques to extract new
knowledge base must be specified. Third, the knowledge, the simulation of biological pro-
set of legal values for each attribute must be cesses, and the creation of knowledge-based
specified, usually by creating a controlled ter- systems to support advanced practitioners in
minology for basic data or by specifying the the two fields.
types of knowledge-based entries that can
serve as values within the knowledge base. nnSuggested Readings
Altman, R. B., Dunker, A. K., Hunter, L., & Klein,
T. E. (2003). Pacific symposium on
9.6.3 Computational Challenges Biocomputing’03. Singapore: World Scientific
Publishing. The proceedings of one of the prin-
with an Increasing Deluge cipal meetings in bioinformatics, this is an
9 of Biomedical Data excellent source for up-to-date research reports.
Other important meetings include those spon-
An increasing challenge in biomedicine is stor- sored by the International Society for
ing, interpreting and integrating the massive Computational Biology (ISCB, http://www.
amount of datasets the biomedical commu- iscb.org/), Intelligent Systems for Molecular
nity is generating, largely from modern tech- Biology (ISMB, http://iscb.org/conferences.
nologies in high throughput experimentation. shtml.35), and the RECOMB meetings on
The amount of DNA sequence data being computational biology (http://www.ctw-
generated over time has dwarfed Moore’s congress.de/recomb/). ISMB and PSB have
Law, for example. This issue is important for their proceedings indexed in PubMed.
all areas of biomedical informatics, and is dis- Baldi, P., & Brunak, S. (2001). Bioinformatics:
cussed in more detail in the on Translational The machine learning approach. Cambridge,
Bioinformatics (7 Chap. 26). MA: MIT Press. This introduction to the field
of bioinformatics focuses on the use of statis-
tical and artificial intelligence techniques in
9.7 Conclusion machine learning.
Baldi, P., & Hatfield, G. W. (2002). DNA microar-
Bioinformatics is closely allied to transla- rays and gene expression. Cambridge:
tional and clinical informatics. It differs in its Cambridge University Press. Introduces the
emphasis on a reductionist view of biologi- different microarray technologies and how
cal systems, starting with sequence informa- they are analyzed.
tion and moving to structural and functional Berg, J. M., Tymoczko, J. L., & Stryer, L. (2010).
information. The emergence of the genome Biochemistry. New York: W.H. Freeman. The
sequencing projects and the new technolo- textbook by Stryer and colleagues is well writ-
gies for measuring metabolic processes within ten, and is illustrated and updated on a regu-
cells is beginning to allow bioinformaticians lar basis. It provides an excellent introduction
to construct a more synthetic view of bio- to basic molecular biology and biochemistry.
logical processes, which will complement the Durbin, R., Eddy, S. R., Krogh, A., & Mitchison,
whole-organism, top-down approach of clini- G. (1998). Biological sequence analysis:
cal informatics. More importantly, there are Probabilistic models of proteins and nucleic
technologies that can be shared between bio- acids. Cambridge: Cambridge University Press.
Bioinformatics
297 9
This edited volume provides an excellent intro- problems are arising in basic biomedical
duction to the use of probabilistic representa- research?
tions of sequences for the purposes of 6. Why have biologists and bioinformati-
alignment, multiple alignment, and analysis. cians embraced the Web as a vehicle for
Gusfield, D. (1997). Algorithms on strings, trees disseminating data so quickly, whereas
and sequences: Computer science and compu- clinicians and clinical informaticians
tational biology. Cambridge: Cambridge have been more hesitant to put their pri-
University Press. Gusfield’s text provides an mary data online?
excellent introduction to the algorithmics of 7. If a patient’s entire genome were present
sequence and string analysis, with special atten- in their medical record how would one go
tion paid to biological sequence analysis prob- about interpreting it clinically? Similarly,
lems. if we had an entire electronic health
Malcolm, S., & Goodship, J. (Eds.). (2007). record database that included human
Genotype to phenotype (2nd ed.). Oxford: genomes, how would a researcher go
BIOS Scientific Publishers. This volume illus- about finding new or novel genetic asso-
trates the different efforts to understand how ciations?
diseases are linked to genes. 8. With the many high throughput
Pevsner, P. (2009). Bioinformatics and functional experiments that are used in biomedical
genomics. Hoboken: Wiley. A widely used research, how are some ways to integrate
excellent introduction to bioinformatics algo- those datasets using systems biology?
rithms. For example, if you had a microarray
dataset that annotated gene expression
??Questions for Discussion levels and a proteomics dataset that
1. How are DNA and protein sequence identified protein interactions, how
information changing the way that med- could you jointly use both datasets to
ical records are managed? Which types identify markers for a disease?
of systems are or will be most affected
(laboratory, radiology, admission and
discharge, financial, order entry)? References
2. It has been postulated that clinical infor-
matics and bioinformatics are working Altschul, S. F., Gish, W., Mille, W., Myers, E. W., &
on the same problems, but in some areas Lipman, D. J. (1990). Basic local alignment search
one field has made more progress than tool. Journal of Molecular Biology, 215(3), 403–410.
Ashley, E. A., Butte, A. J., Wheeler, M. T., Chen, R.,
the other. Identify three common themes. Klein, T. E., Dewey, F. E., et al. (2010). Clinical
Describe how the issues are approached assessment incorporating a personal genome. The
by each subdiscipline. Lancet, 375, 1525–1535.
3. Why should an awareness of bioinfor- Babior, B. M., & Matzner, Y. (1997). The familial
matics be expected of clinical informatics Mediterranean fever gene—cloned at last. The New
England Journal of Medicine, 337(21), 1548–1549.
professionals? Should a chapter on bioin- Bai, C., & Elledge, S. J. (1997). Gene identification using
formatics appear in a clinical informatics the yeast two-hybrid system. Methods in Enzymology,
textbook? Explain your answers. 283, 141–156.
4. Why should an awareness of clinical Bird, A. (2002). DNA methylation patterns and epigen-
informatics be expected of bioinformat- etic memory. Genes & Development, 16(1), 6–21.
Brown, D. G., Rao, S., Weir, T. L., O’Malia, J., Bazan,
ics professionals? Should a chapter on M., Brown, R. J., & Ryan, E. P. (2016). Metabolomics
clinical informatics appear in a bioinfor- and metabolic pathway networks from human
matics textbook? Explain your answers. colorectal cancers, adjacent mucosa, and stool.
5. One major problem with introducing Cancer and Metabolism, 4, 11.
computers into clinical medicine is the Burley, S. K., & Bonanno, J. B. (2002). Structuring the
universe of proteins. Annual Review of Genomics
extreme time and resource pressure and Human Genetics, 3, 243–262.
placed on physicians and other health Cech, T. R. (2000). Structural biology. The ribosome is a
care workers. Do you think that the same ribozyme. Science, 289(5481), 878–879.
298 S. D. Mooney et al.
Davies, K. (2010). Physicians and their use of informa- Li, R., Li, Y., Kristiansen, K., & Wang, J. (2008). SOAP:
tion: A survey comparison between the United Short oligonucleotide alignment program.
States, Canada, and the United Kingdom. Journal Bioinformatics, 24(5), 713–714.
of the Medical Library Association, 99, 88–91. Needleman, S. B., & Wunsch, C. D. (1970). A general
Dayhoff, M. O. (1974). Computer analysis of protein method applicable to the search for similarities in
sequences. Federation Proceedings, 33(12), 2314– the amino acid sequence of two proteins. Journal of
2316. Molecular Biology, 48(3), 443–453.
Durfy, S. J. (1993). Ethics and the human genome proj- Ng, S. B., Buckingham, K. J., Lee, C., Bigham, A. W.,
ect. Archives of Pathology & Laboratory Medicine, Tabor, H. K., Dent, K. M., et al. (2010). Exome
117(5), 466–469. sequencing identifies the cause of a mendelian dis-
Fischer, B. A., & Zigmond, M. J. (2010). The essential order. Nature Genetics, 42, 30–35.
nature of sharing in science. Science and Engineering Qin, J., Li, R., Raes, J., Arumugam, M., Burgdorf, K. S.,
Ethics, 16(4), 783–799. Manichanh, C., et al. (2010). A human gut micro-
Gibson, K., & Scheraga, H. (1967). Minimization of bial gene catalogue established by metagenomic
polypeptide energy. I. Preliminary structures of sequencing. Nature, 464(7285), 59–65.
bovine pancreatic ribonuclease S-peptide. Richardson, J. S. (1981). The anatomy and taxonomy of
Proceedings of the National Academy of Sciences, protein structure. Advances in Protein Chemistry, 34,
58(2), 420–427. 167–339.
Goldberg, A. D., Allis, C. D., & Bernstein, E. (2007). Senior, A.W., Evans, R., Jumper, J., Kirkpatrick, J.,
Epigenetics: A landscape takes shape. Cell, 128(4), Sifre, L., Green, T., et al. (2020) Improved protein
635–638. structure prediction using potentials from deep
Gusfield, D. (1997). Algorithms on strings, trees and learning. Nature, 577, 706–710.
sequences: Computer science and computational biol- Shapiro, E., Biezunner, T., & Linnarsson, S. (2013).
ogy. Cambridge: Cambridge University Press. Single-cell sequencing-based technologies will revo-
9 Hastie, T., Tibshirani, R., & Friedman, J. (2009). The lutionize whole-organism science. Nature Reviews
elements of statistical learning: Data mining, Genetics, 14, 618–630.
inference, and prediction. New York: Springer. Smith, T., & Waterman, M. (1981). Identification of
Karplus, M., & Weaver, D. L. (1976). Protein-folding common molecular subsequences. Journal of
dynamics. Nature, 260(5550), 404–406. Molecular Biology, 147(1), 195–197.
Karr, J. R., Sanghvi, J. C., Macklin, D. N., Gutschow, Smith, B., Ashburner, M., Rosse, C., Bard, J., Bug, W.,
M. V., Jacobs, J. M., Bolival, B., Jr., et al. (2012). A Ceusters, W., et al. (2007). The OBO foundry:
whole-cell computational model predicts phenotype Coordinated evolution of ontologies to support bio-
from genotype. Cell, 150(2), 389–401. medical data integration. Nature Biotechnology,
Kent, W. J. (2003). BLAT—the BLAST-like alignment 25(11), 1251–1255.
tool. Genome Research, 12(4), 656–664. Storey, J. D., & Tibshirani, R. (2003). Statistical signifi-
Lander, E. S., Linton, L. M., Birren, B., Nusbaum, C., cance for genomewide studies. Proceedings of the
Zody, M. C., Baldwin, J., et al. (2001). Initial National Academy of Sciences of the United States
sequencing and analysis of the human genome. of America, 100(16), 9440–9445.
Nature, 409, 860–921. Van’t Veer, L. J., Dai, H., van de Vijver, M. J., He, Y. D.,
Langridge, R. (1974). Interactive three-dimensional Hart, A. A. M., Mao, M., et al. (2002). Gene expres-
computer graphics in molecular biology. Federation sion profiling predicts clinical outcome of breast
Proceedings, 33(12), 2332–2335. cancer. Nature, 415(6871), 484–485.
Lashkari, D. A., DeRisi, J. L., McCusker, J. H., Wei, L., & Altman, R. B. (1998). Recognizing protein
Namath, A. F., Gentile, C., Hwang, S. Y., et al. binding sites using statistical descriptions of their
(1997). Yeast microarrays for genome wide parallel 3D environments. In Proceedings of the pacific sym-
genetic and gene expression analysis. Proceedings posium on Biocomputing ’98 (pp. 497–508),
of the National Academy of Sciences, 94(24), Singapore.
13057–13062. Yan, J., & Gu, W. (2009). Gene expression microarrays.
Levitt, M. (1983). Molecular dynamics of native pro- In Y. Lu & R. I. Mahato (Eds.), Cancer research
tein. I. Computer simulation of trajectories. Journal pharmaceutical perspectives of cancer therapeutics
of Molecular Biology, 168(3), 595–617. (pp. 645–672). New York: Springer.
299 10
Biomedical Imaging
Informatics
Daniel L. Rubin, Hayit Greenspan, and Assaf Hoogi
Contents
References – 348
Image Image
Acquisition Management/
Storage
Image Content
Representation
.. Fig. 10.1 The major topics in biomedical imaging by image content representation, management/storage
informatics follow a workflow of activities and tasks of images, image processing, and image interpretation/
commencing with include image acquisition, followed computer reasoning
of the common issues that pertain to all image content representation, management/storage
modalities and applications once the images of images, image processing, and image inter-
are converted to digital form. pretation/computer reasoning (. Fig. 10.1).
Biomedical imaging informatics is a Image acquisition is the process of generat-
dynamic field, recently evolving from primar- ing images from the modality and converting
ily focusing on image processing to broader them to digital form if they are not intrinsi-
informatics topics such as representing and cally digital. Image content representation
processing the semantic contents (Rubin and makes the information in images accessible
Napel 2010) and integrating image data with to machines for processing. Image manage-
other types of data (Scheckenbach et al. 2017; ment/storage includes methods for storing,
Pujara et al. 2018; Valdora et al. 2018; Weaver transmitting, displaying, retrieving, and orga-
and Leung 2018). At the same time, imaging nizing images. Image processing comprises
informatics shares common methodologies methods to enhance, segment, visualize, fuse,
and challenges with other domains in bio- or analyze the images. Image interpretation/
medical informatics. By trying to understand computer reasoning is the process by which
these common issues, we can develop general the individual viewing the image renders an
solutions that can be applied to all images, impression of the medical significance of the
regardless of the source. results of imaging study, potentially aided by
The major topics in biomedical imaging computer methods. 7 Chapter 22 is primar-
informatics include image acquisition, image ily concerned with information systems for
302 D. L. Rubin et al.
image management and storage, whereas this (Rubin and Napel 2010), as well as process-
chapter concentrates on these other core top- ing entire images to learn certain semantic
ics in biomedical imaging informatics. image content (Hosny et al. 2018; Yamashita
An important concept when thinking et al. 2018). Many of the topics in this chapter
about imaging from an informatics perspec- therefore involve how to represent, extract and
tive is that images are an unstructured data characterize the information that is present in
type; though they are readily understood and images, such as anatomy and abnormalities.
interpreted by knowledgeable human experts, Once that task is completed, useful applica-
their contents are not readily machine under- tions that process the image contents can be
standing except at the granular pixel level. As developed, such as image search and decision
such, while machines can readily manage the support to assist with image interpretation.
raw image data in terms of storage/retrieval, While we seek generality in discussing bio-
they cannot easily access image contents (rec- medical imaging informatics, many examples
ognize the type of image, annotations made in this chapter are taken from a few selected
on the image, or anatomy or abnormalities domains such as brain imaging, which is
within the image), except for newer deep learn- part of the growing field of neuroinformat-
ing methods (7 Sect. 10.4.5). In this regard, ics (Koslow and Huerta 1997). Though our
biomedical imaging informatics shares much examples are specific, we attempt to describe
in common with natural language processing the topics in generic terms so that the reader
(NLP; 7 Chap. 8). In fact, as the methods of can recognize parallels to other imaging
computationally representing and processing domains and applications.
images is presented in this chapter, parallels
10 to NLP should be considered, since there is
synergy from an informatics perspective. 10.2 Image Acquisition
As in NLP, a major purpose of the meth-
ods of imaging informatics is to extract par- In general, there are two different strategies
ticular information; in biomedical informatics in imaging the body: (1) delineate anatomic
the goal is often to extract information about structure (anatomic/structural imaging), and
the structure of the body and to collect fea- (2) determine tissue composition or function
tures that will be useful for characterizing (functional imaging) (. Fig. 10.2). In real-
abnormalities based on morphological altera- ity, one does not choose between anatomic
tions. In fact, imaging provides detailed and and functional imaging; many modalities pro-
diverse information very useful for character- vide information about both morphology and
izing disease, providing an “imaging pheno- function. However, in general, each imaging
type” useful for characterizing disease, since modality is characterized primarily as being
“a picture is worth a thousand words,1” and able to render high-resolution images with
the informatics methods for capturing imag- good contrast resolution (anatomic imaging)
ing phenotypes complement the informat- or to render images that depict tissue function
ics methods that are now being applied to (functional imaging).
electronic medical records data to capture
“electronic phenotypes” of diseased patients.
However, to overcome the challenges posed 10.2.1 Anatomic (Structural)
by the unstructured image data type, recent Imaging
work is applying semantic methods from
biomedical informatics to images to make Imaging the structure of the body has been
their content explicit for machine processing and continues to be the major application of
medical imaging, although, as described in
7 Sect. 10.2.2, functional imaging is a very
active area of research. The goal of anatomic
1 Frederick Barnard, “One look is worth a thousand
words,” Printers’ Ink, December, 1921.
imaging is to accurately depict the structure
Biomedical Imaging Informatics
303 10
Radiography
Spatial resolution (anatomic detail)
PET-CT
CT
MRI
US
PET
VLA HLA SA
Planar NM ED
ES
.. Fig. 10.2 The various radiology imaging methods tional information depicted (which represents the tissue
differ according to two major axes of information of composition—e.g., normal or abnormal). A sample of
images, spatial resolution (anatomic detail) and func- the more common imaging modalities is shown
of the body—the size and shape of organs— to show the functioning of the heart by depict-
and to visualize abnormalities clearly. Since ing wall motion, and ultrasound doppler
the goal in anatomic imaging is to depict and can image both normal and disturbed blood
understand the structure of anatomic entities flow (Mehta et al. 2000). Molecular imaging
accurately, high spatial resolution is an impor- (7 Sect. 10.2.3) is increasingly able to depict
tant requirement of the imaging method the expression of particular genes superim-
(. Fig. 10.2). Conversely, in anatomic imag- posed on structural images, and thus can also
ing, recognizing tissue function (e.g., tissue be seen as a form of functional imaging.
ischemia, neoplasm, inflammation, etc.) is not A particularly important application of
the goal, though this is crucial to functional functional imaging is for understanding the
imaging and to patient diagnosis. In most cognitive activity in the brain. It is now rou-
cases, imaging will be done using a combina- tinely possible to put a normal subject in a
tion of methods or modalities to derive both scanner, to give the person a cognitive task,
structural/anatomic information as well as such as counting or object recognition, and
functional information. to observe which parts of the brain light
up. This unprecedented ability to observe
the functioning of the living brain opens up
10.2.2 Functional Imaging entirely new avenues for exploring how the
brain works.
Many imaging techniques not only show the Functional brain imaging modalities can
structure of the body, but also the function, be classified as image-based or non-image
where for imaging purposes function can be based. In both cases it is taken as axiomatic
inferred by observing changes of structure that the functional data must be mapped to the
over time. In recent years this ability to image individual subject’s anatomy, where the anat-
function has greatly accelerated. For example, omy is extracted from structural images using
ultrasound and angiography are widely used techniques described in the previous sections.
304 D. L. Rubin et al.
Once mapped to anatomy, the functional data ing and analyzing functional data (Frackowiak
can be integrated with other functional data et al. 1997). The output of most of these tech-
from the same subject, and with functional niques is a low-resolution 3-D image volume
data from other subjects whose anatomy has in which each voxel value is a measure of the
been related to a template or probabilistic amount of activation for a given task. The
atlas. Techniques for generating, mapping and low-resolution volume is then mapped to
integrating functional data are part of the anatomy guided by a high-resolution struc-
field of Functional Brain Mapping, which has tural MR dataset, using one of registration
become very active in the past few years, with techniques described in 7 Sect. 10.4.7.
several conferences (Organization for Human Many of these and other techniques are
Brain Mapping 2001) and journals (Fox 2001; implemented in the SPM program (Friston
Toga et al. 2001) devoted to the subject. et al. 1995), the AFNI program (Cox 1996),
the Lyngby toolkit (Hansen et al. 1999), and
kImage-Based Functional Brain Imaging several commercial programs such as Medex
Image-based functional data generally come (Sensor Systems Inc. 2001) and BrainVoyager
from scanners that generate relatively low- (Brain Innovation B.V. 2001). The FisWidgets
resolution volume arrays depicting spatially- project at the University of Pittsburgh is
localized activation. For example, positron developing an approach that allows custom-
emission tomography (PET) (Heiss and Phelps ized creation of graphical user interfaces in
1983; Aine 1995; Alberini et al. 2011) and an integrated desktop environment (Cohen
magnetic resonance spectroscopy (MRS) 2001). A similar effort (VoxBox) is underway
(Ross and Bluml 2001) reveal the uptake of at the University of Pennsylvania (Kimborg
10 various metabolic products by the function- and Aguirre 2002).
ing brain; and functional magnetic resonance The ultimate goal of functional neuroim-
imaging (fMRI) reveals changes in blood oxy- aging is to observe the actual electrical activ-
genation that occur following neural activity ity of the neurons as they perform various
(Aine 1995). The raw intensity values gener- cognitive tasks. fMRI, MRS and PET do not
ated by these techniques must be processed directly record electrical activity. Rather, they
by sophisticated statistical algorithms to sort record the results of electrical activity, such
out how much of the observed intensity is due as (in the case of fMRI) the oxygenation of
to cognitive activity and how much is due to blood supplying the active neurons. Thus,
background noise. there is a delay from the time of activity to the
As an example, one approach to fMRI measured response. In other words these tech-
imaging is language mapping (Corina et al. niques have relatively poor temporal resolution
2000). The subject is placed in the magnetic (7 Sect. 10.2.4). Electro-encephalography
resonance imaging (MRI) scanner and told to (EEG) or magnetoencephalography (MEG),
silently name objects shown at 3-second inter- on the other hand, are more direct measures
vals on a head-mounted display. The actual of electrical activity since they measure the
objects (“on” state) are alternated with non- electromagnetic fields generated by the electri-
sense objects (“off ” state), and the fMRI sig- cal activity of the neurons. Current EEG and
nal is measured during both the on and the off MEG methods involve the use of large arrays
states. Essentially the voxel values at the off of scalp sensors, the output of which are pro-
(or control) state are subtracted from those at cessed in a similar way to CT in order to local-
the on state. The difference values are tested ize the source of the electrical activity inside
for significant difference from non-activated the brain. In general this “source localization
areas, then expressed as t-values. The voxel problem” is under-constrained, so informa-
array of t-values can be displayed as an image. tion about brain anatomy obtained from MRI
A large number of alternative methods is used to provide further constraints (George
have been and are being developed for acquir- et al. 1995).
Biomedical Imaging Informatics
305 10
10.2.3 Imaging Modalities
the various body tissues based on the thick- CR systems is that the cassettes are of stan-
ness and atomic number of the tissues, the dard size, so they can be used in any equip-
X-rays produce varying degrees of brightness ment that holds film-based cassettes (Horii
and darkness on the radiographic image. The 1996). More recently, digital radiography
differential amounts of brightness and dark- uses charge-coupled device (CCD) arrays to
ness on the image are referred to as “image capture the image directly. Currently, nearly
contrast;” differential contrast among struc- all radiology departments no longer acquire
tures on the image is the basis for recognizing radiographic images on film (analog images)
anatomic structures. Since the image in radi- and instead use digital radiography (Korner
ography is a projection, radiographs show a et al. 2007) to acquire digital images. This
superposition of all the structures traversed evolution was driven by the cost of film and
by the X-ray beam. Much of the art (and technological advances in digital image acqui-
difficulty) in interpretation of radiographs sition detectors and monitors whose resolu-
is understanding the imaging patterns that tion approached that of film. At the same
result from these superimposed structured time, digitization of radiology drove the evo-
and how to differentiate pathologies from lution of methods of imaging informatics we
normal structures or artifacts. describe in this chapter.
Radiographic images have very high spatial Computed Tomography (CT) is an impor-
resolution because a high photon flux is used tant imaging method that uses X-rays to pro-
to produce the images, and a high resolution duce cross sectional and volumetric images of
detector (film or digital image array) that cap- the body (Lee 2006). Similar to radiography,
tures many line pairs per unit area is used. On X-rays are projected through the body onto
10 the other hand, since the contrast in images is an array of detectors; however, the beam and
due to differences in tissue density and atomic detectors rotate around the patient, making
number, the amount of functional informa- numerous views at different angles of rota-
tion that can be derived from radiographic tion. Using computer reconstruction algo-
images is limited (. Fig. 10.2). Radiography rithms, an estimate of absolute density at each
is also limited by relatively poor contrast reso- point (volume element or voxel) in the body is
lution (compared with other modalities such computed. Thus, the CT image is a computed
as computed tomography (CT) or magnetic image (. Fig. 10.4); CT did not become prac-
resonance imaging (MRI) described below), tical for generating high quality images until
their use of ionizing radiation, the challenge the advent of powerful computers and devel-
of spatial localization due to projection ambi- opment of computer-based reconstruction
guity, and their limited ability to depict physi- techniques, which represent one of the most
ological function. As described below, newer
imaging modalities have been developed to
increase contrast resolution, to eliminate the
need for X-rays, and to improve spatial local-
ization. A benefit of radiographic images is
that they can be generated in real time (fluo-
roscopy) and can be produced using portable
devices.
Digital radiography (DR) is an imaging
technique that directly creates digital radio-
graphs from the imaging procedure. Storage
phosphor replaces film by substituting a
reusable phosphor plate in a standard film .. Fig. 10.4 A CT image of the upper chest. CT images
cassette. The exposed plate is processed by a are slices of a body plane; in this case, a cross sectional
(axial) image of the chest. Axial images are viewed from
reader system that scans the image into digital below the patient, so that the patient’s left is on viewer’s
form, erases the plate, and packages the cas- right. This image shows a cancer mass in the left upper
sette for reuse. An important advantage of lobe of the lung
Biomedical Imaging Informatics
307 10
spectacular applications of computers in all (B-scans) by displaying the echoes from pulses
of medicine (Buxton 2009). The spatial reso- of multiple adjacent one-dimensional paths
lution of images is not as high in CT as it is in (A-scans). Current ultrasound machines
radiography, due to the computed nature of are essentially specialized computers with
the images; however, the contrast resolution attached peripherals, with active development
and ability to derive functional information of three-dimensional imaging. The ultra-
of tissues in the body are superior for CT than sound transducer now often sweeps out a 3-D
for radiography (. Fig. 10.2). volume rather than a 2-D plane, and the data
are written directly into a three-dimensional
zz Ultrasound array memory, which is displayed using vol-
A common energy source used to pro- ume or surface-based rendering techniques
duce images is ultrasound, which developed (Ritchie et al. 1996).
from research performed by the Navy dur- Ultrasound images are acquired as digi-
ing World War II in which sonar was used tal images from the outset. They may also be
to locate objects of interest in the ocean. recorded as frames in rapid succession (cine
Ultrasonography uses pulses of high-fre- loops) for real-time imaging. Ultrasound
quency sound waves rather than ionizing imaging captures not only structural informa-
radiation to image body structures (Kremkau tion but also functional information. Doppler
2006). The basis of image generation is due methods in ultrasound are used to measure
to a property of all objects called acoustical and characterize the blood flow in blood ves-
impedance. As sound waves encounter differ- sels in the body (. Fig. 10.5). More recently,
ent types of tissues in a patient’s body (par- ultrasound techniques called elastography
ticularly interfaces where there is a chance in have been developed to measure tissue stiff-
acoustical impedance), a portion of the wave ness, which is improving the ability of ultra-
is reflected and a portion of the sound beam sound to diagnose a variety of pathology
(which is now attenuated) continues to tra- conditions such as liver fibrosis (Pawlus et al.
verse into deeper tissues. The time required 2015; Zaleska-Dorobisz et al. 2015). The low
for the echo to return is proportional to the cost of ultrasound and the fact it doesn’t use
distance into the body at which it is reflected; ionizing radiation makes it very attractive as a
the amplitude (intensity) of a returning echo primary modality for imaging worldwide, par-
depends on the acoustical properties of the ticularly for obstetrical and pediatric imaging.
tissues encountered and is represented in the Since the image contrast in ultrasound is
image as brightness (more echoes returning to based on differences in the acoustic imped-
the source is shown as image brightness). The ance of tissue, ultrasound provides functional
system constructs two- dimensional images information (e.g., tissue composition and
Mahmood 2001; Massoud and Gambhir of the image. These characteristics provide an
2003; Biswal et al. 2007; Hoffman and objective means for comparing images formed
Gambhir 2007; Margolis et al. 2007; Ray by digital imaging modalities.
and Gambhir 2007; Willmann et al. 2008; 55 Spatial resolution is related to the sharpness
Pysz et al. 2010). Molecularly-tagged mol- of the image; it is a measure of how well
ecules are increasingly being introduced into the imaging modality can distinguish
the living organism, and imaged with optical, points on the object that are close together.
radioactive, or magnetic energy sources, often For a digital image, spatial resolution is
10 using reconstruction techniques and often in generally related to the number of pixels
3-D. It is becoming possible to combine gene per image area. Spatial resolution is critical
sequence information, gene expression array for detecting abnormalities in very small
data, and molecular imaging to determine structures, such as microcalcifications on
not only which genes are expressed, but where mammograms or diffuse lung diseases on
they are expressed in the organism (Kang and chest radiographs.
Chung 2008; Min and Gambhir 2008; Singh 55 Contrast resolution is a measure of the
et al. 2008; Lexe et al. 2009; Smith et al. 2009; ability to distinguish small differences in
Harney and Meade 2010). These capabilities intensity in different regions of the image,
will become increasingly important in the which in turn are related to differences in
post-genomic era for determining exactly how measurable parameters, such as X-ray
genes generate both the structure and func- attenuation. For digital images, the number
tion of the organism. of bits per pixel is related to the contrast
resolution of an image. Contrast resolution
is critical to image interpretation, since
10.2.4 Image Quality differences in contrast are the basis for an
object (of sufficient size) to be appreciated
zz Characteristics of Image Quality by the human eye or by a computerized
The imaging modalities described above are image detection algorithm.
complex devices with many parameters that 55 Temporal resolution is a measure of the
need to be specified in generating the image, time needed to create an image. We consider
and most of the parameters can have sub- an imaging procedure to be a real-time
stantial impact on the following key charac- application if it can generate images
teristics of the final image appearance: spatial concurrent with the physical process it is
resolution, contrast resolution, and tempo- imaging. At a rate of at least 30 images per
ral resolution, all of which have substantial second, it is possible to produce unblurred
impact on image quality and diagnostic value images of the beating heart.
Biomedical Imaging Informatics
311 10
Other parameters that are specifically relevant netic contrast agents such as gadolinium have
to medical imaging are the degree of invasive- been introduced to enhance contrast in MR
ness, the dosage of ionizing radiation, the images.
degree of patient discomfort, the size (porta- Recently, contrast agents have been devel-
bility) of the instrument, the ability to depict oped for ultrasound to greatly enhance image
physiologic function as well as anatomic contrast (Durot et al. 2018). Ultrasound con-
structure, and the availability and cost of the trast agents generally comprise microbub-
procedure at a specific location. bles—bubbles in the blood that are too small
A perfect imaging modality would pro- to cause damage to tissues, but that in aggre-
duce images with high spatial, contrast, and gate alter the impedance mismatch between
temporal resolution; it would be available, low blood and tissue to enhance image contrast.
in cost, portable, free of risk, painless, and Although contrast agents have been very
noninvasive; it would use nonionizing radia- successful and they are commonly used, their
tion; and it would depict physiological func- enhancement tends to be non-specific in that
tion as well as anatomic structure. As seen any vascularized structure will be enhanced.
above, the different modalities differ in these In recent years, advances in molecular biol-
characteristics and none is uniformly strong ogy have led to the ability to design contrast
across all the parameters (. Fig. 10.2). agents that are highly specific for individual
molecules. In addition to radioactively tagged
zz Contrast Agents molecules used in nuclear medicine, molecules
One of the major motivators for develop- are tagged for imaging by magnetic resonance
ment of new imaging modalities is the desire and optical energy sources. Tagged molecules
to increase contrast resolution. A contrast are imaged in 2-D or 3-D, often by applica-
agent is a substance introduced into the body tion of reconstruction techniques developed
to enhance the imaging contrast of structures for clinical imaging (Pysz et al. 2010; Jokerst
or fluids in medical imaging. Contrast agents and Gambhir 2011; Weissleder et al. 2016).
can be introduced in various ways, such as by Tagged molecules have been used for several
injection, inspiration, ingestion, or enema. years in vitro by such techniques as immuno-
The chemical composition of contrast agents cytochemistry (binding of tagged antibodies
vary with modality so as to be optimally visible to antigen) (Van Noorden 2002) and in situ
based on the physical basis of image forma- hybridization (binding of tagged nucleotide
tion. For example, iodinated contrast agents sequences to DNA or RNA) (King et al.
are used in radiography and CT because 2000). More recently, methods have been
iodine has high atomic number, greatly atten- developed to image these molecules in the liv-
uating X-rays, and thus greatly enhancing ing organism, thereby opening up entirely new
image contrast in any tissues that accumulate avenues for understanding the functioning of
the contrast agent. Contrast agents for radi- the body at the molecular level (Biswal et al.
ography are referred to as “radiopaque” since 2007; Hoffman and Gambhir 2007; Margolis
they absorb X-rays and obscure the beam. et al. 2007; Ray and Gambhir 2007; Willmann
Contrast agents in radiography are used to et al. 2008; Pysz et al. 2010). Recent work in
highlight the anatomic structures of interest altering microbubbles of ultrasound contrast
(e.g., stomach, colon, urinary tract). In an agents to target them to particular tissues and
imaging technique called angiography, a con- types of disease raises the exciting prospects
trast agent is injected into the blood vessels to for ultrasound imaging to provide even greater
opacify them on the images. In pathology, his- functional information about tissues in a
tological staining agents such as haematoxylin minimally invasive and cost effective manner
and eosin (H&E) have been used for years to (Deshpande et al. 2010; Abou-Elkacem et al.
enhance contrast in tissue sections, and mag- 2015; Zhang et al. 2017).
312 D. L. Rubin et al.
to a digital representation or by generating it defined as a set of three values for each pixel
directly in digital form. Once an image is in in the image, where the intensity of each pixel
digital form, it can be handled just like all in each of the three MRI images is extracted
other data. It can be transmitted over com- and recorded (e.g., [Intensity(Sequence 1),
munications networks, stored compactly in Intensity(Sequence 2), Intensity(Sequence 3)].
databases on magnetic or optical media, and Any imaging performed over time (e.g., car-
displayed on graphics monitors. In addition, diac echo videos) can be represented by the set
the use of computers has created an entirely of values at each time point, thus the time is
new realm of capabilities for image generation added as an additional dimension to the rep-
and analysis: images can be computed rather resentation.
than measured directly. Furthermore, digi- Finally, in addition to representing the
tal images can be manipulated for display or visual content, medical images also need
analysis in ways not possible with film-based to represent certain information about that
images. visual content (referred to as image meta-
In addition to the 2D (slice) and 3D (vol- data). Image metadata include such things as
ume) representation for image data, there can the name of the patient, date the image was
be additional dimensions to representing the acquired, the slice thickness, the modality that
visual content of images. It is often the case was used to acquire the image, etc. All image
that multi-modality data are required for the metadata are usually stored in the header of
diagnosis; this can be a combination of vary- the image file. Given that there are many dif-
ing modalities, (e.g., CT and PET, CT and ferent types of equipment and software that
MRI) and can be a combination of imaging produce and consume images, standards are
10 sequences within a modality (e.g., T1, T2, or crucial. For images, the Digital Imaging and
other sequences in MRI) (. Fig. 10.10). Pixel Communications in Medicine (DICOM)
(or voxel) content, from each of the respec- standard is for distributing and viewing any
tive acquisition modalities, are combined in kind of medical image regardless of the origin
what is known as a “feature-vector” in the (Bidgood Jr. and Horii 1992). DICOM has
multi-dimensional space. For example, a become pervasive throughout radiology and
3-dimensional intensity-based feature vec- is becoming a standard in other domains such
tor, based on 3 MRI pulse sequences, can be as pathology, ophthalmology, and dermatol-
.. Fig. 10.10 Multi-modality imaging. Images of the features on each of these modalities, and the combina-
brain from three modalities (T1 without contrast, T1 tion of these different features on different modalities
with contrast, and T2) are shown. The patient has a establishes characteristic patterns useful in diagnosis
lesion in the left occipital lobe that has distinctive image
Biomedical Imaging Informatics
315 10
ogy. In addition to specifying a standard file ontologies of physiology and pathology they
syntax and metadata structure, DICOM spec- can provide increasingly sophisticated knowl-
ifies a standard protocol for communicating edge about the meaning of the various images
images among imaging devices. and other data that are increasingly becom-
ing available in online databases. This kind
of knowledge (by the computer, as opposed
10.3.2 Representing Knowledge to the scientist) will be required in order to
Content in Digital Images achieve the seamless integration of all forms
of imaging and non-imaging data.
As noted above, the knowledge content related At the most fundamental level, Nomina
to images is not directly encoded in the images, Anatomica (International Anatomical
but it is recognized by the observer of the Nomenclature Committee 1989) and its suc-
images. This knowledge includes recognition cessor, Terminologia Anatomica (Federative
of the anatomic entities in the image, imaging Committee on Anatomical Terminology 1998)
observations and characteristics of the observa- provide a classification of officially sanctioned
tions (sometimes called “findings”), and inter- terms that are associated with macroscopic
pretations (probable diseases). Representing and microscopic anatomical structures. This
this knowledge in the imaging domain is similar canonical term list, however, has been substan-
to knowledge representation in other domains tially expanded by synonyms that are current
of biomedical informatics (see 7 Chap. 24). in various fields, and has also been augmented
Specifically, for representing the entities in the by a large number of new terms that desig-
domain of discourse, we adopt terminologies nate structures omitted from Terminologia
or ontologies. To make specific statements Anatomica. Many of these additions are pres-
about individuals (images), we use “informa- ent in various controlled terminologies (e.g.,
tion models” (described below) that reference MeSH (National Library of Medicine 1999),
ontological entities as necessary. As described SNOMED (Spackman et al. 1997), Read Codes
below, d ifferent aspects of the knowledge con- (Schultz et al. 1997), GALEN (Rector et al.
tent of images is stored in different ways (which 1993)). Unlike Terminologia these vocabular-
is one of the challenges of leveraging this infor- ies are entirely computer-based, and therefore
mation). lend themselves for incorporation in computer-
based applications.
zz Knowledge Representation of Anatomy Classification and ontology projects to
Given segmented anatomical structures, date have focused primarily on arranging the
whether at the macroscopic or microscopic terms of a particular domain in hierarchies.
level, and whether represented as 3-D surface As noted with respect to the evaluation of
meshes or extracted 3-D regions, it is often Terminologia Anatomica (Rosse 2000), insuf-
desirable to attach labels (names) to the struc- ficient attention has been paid to the relation-
tures in images. If the names are drawn from a ships among these terms. These relationships
controlled terminology or ontology, they can are named (e.g., “is-a” and “part-of”) to indi-
be used as an index into a database of seg- cate how the entities connected by them are
mented structures, thereby providing a quali- related (e.g., Left Lobe of Liver part-of Liver).
tative means for comparing structures from Linking entities with relations encodes knowl-
multiple subjects or retrieving images contain- edge and is used by computer reasoning appli-
ing particular structures. cations in making inferences. Terminologia,
If the terms in the vocabulary are orga- as well as anatomy sections of the controlled
nized so as to assert relationships that are true medical terminologies, mix -is a- and -part of-
for all instances (the case in “ontologies”), relationships in the anatomy segments of their
they can support systems that manipulate and hierarchies. Although such heterogeneity does
retrieve image contents in intelligent ways. not interfere with using these term lists for
If anatomical ontologies are linked to other keyword-based retrieval, these programs will
316 D. L. Rubin et al.
fail to support higher level knowledge (rea- cover other portions of the radiology domain.
soning) required for knowledge-based appli- As is discussed in 7 Chap. 7, there are con-
cations. To meet this gap, the Foundational trolled terminologies in other domains, such
Model of Anatomy (FMA) was developed to as MeSH, SNOMED, and related terminolo-
define a comprehensive symbolic description gies in the UMLS (Cimino 1996; Bodenreider
of the structural organization of the body, 2008); however, these lack terminology spe-
including anatomical concepts, their preferred cific to radiology for describing the features
names and synonyms, definitions, attributes seen in imaging. The Radiological Society
and relationships (Rosse et al. 1998a, b; Rosse of North America (RSNA) recently devel-
and Mejino 2003). oped RadLex, a controlled terminology
In the FMA, anatomical entities are for radiology (Langlotz 2006; Rubin 2008).
arranged in class-subclass hierarchies, with The primary goal of RadLex is to provide a
inheritance of defining attributes along the means for radiologists to communicate clear,
is-a link, and other relationships (e.g., parts, concise, and orderly descriptions of imaging
branches, spatial adjacencies) represented findings in understandable, unambiguous lan-
as additional descriptors associated with guage. Another goal is to promote an orderly
the concept. The FMA currently consists of thought process and logical assessments and
over 75,000 concepts, represented by about recommendations based on observed imaging
120,000 terms, and arranged in over 2.1 mil- features based on terminology-based descrip-
lion links using 168 types of relationships. tion of radiology images and to enable deci-
These concepts represent structures at all sion support (Baker et al. 1995; Burnside et al.
levels: macroscopic (to 1 mm resolution), cel- 2009). Another goal of RadLex is to enable
10 lular and macromolecular. Brain structures radiology research; data mining is facilitated
have been added by integrating NeuroNames by the use of standard terms to code large col-
with the FMA as a Foundational Model of lections of reports and images (Channin et al.
Neuroanatomy (FMNA) (Martin et al. 2001). 2009a, b).
The FMA can be useful for symboli- RadLex includes thousands of descrip-
cally organizing and integrating biomedical tors of visual observations and characteris-
information, particularly that obtained from tics for describing imaging abnormalities, as
images. But in order to answer non-trivial well as terms for naming anatomic structures,
queries in neuroscience and other basic sci- radiology imaging procedures, and diseases
ence areas, and to develop “smart tools” (. Fig. 10.11). Each term in RadLex con-
that rely on deep knowledge, additional tains a unique identifier as well as a variety
ontologies must also be developed (e.g., for of attributes such as definition, synonyms,
physiological functions mediated by neu- and foreign language equivalents. In addition
rotransmitters, and pathological processes to a lexicon of standard terms, the RadLex
and their clinical manifestations, as well for ontology includes term relationships—links
the radiological appearances with which between terms to relate them in various ways
they correlate). The relationships that exist to encode radiological knowledge. For exam-
among these concepts and anatomical parts ple, the is-a relationship records subsumption.
of the body must also be explicitly modeled. Other relationships include part-of, connec-
Next-generation informatics efforts that link tivity, and blood supply. These relationships
the FMA and other anatomical ontologies are enabling computer-reasoning applications
with separately developed functional ontolo- to process image-related data annotated with
gies will be needed in order to accomplish RadLex.
this type of integration. RadLex has been used in several imag-
ing informatics applications, such as to
zz Knowledge Representation of Radiology improve search for radiology information.
Imaging Features RadLex- based indexing of radiology jour-
While FMA provides a comprehensive knowl- nal figure captions achieved very high preci-
edge representation for anatomy, it does not sion and recall, and significantly improved
Biomedical Imaging Informatics
317 10
.. Fig. 10.11 RadLex controlled terminology eases (“pathophysiologic process”), treatments (“treat-
(7 http://radlex.org). RadLex includes term hierarchies ment”), and components of radiology reports
for describing anatomy (“anatomical entity”), imaging (“report”). Each term includes definitions, preferred
observations (“imaging observation”) and characteris- name, image exemplars, and other term metadata and
tics (“imaging observation characteristic”), imaging relationships such as subsumption
procedures and procedure steps (“procedure step”), dis-
image retrieval over keyword-based search et al. 2009) and transcatheter therapy for
(Kahn and Rubin 2009). RadLex has been hepatic malignancy (Brown et al. 2009), and
used to index radiology reports (Marwede the CT Colonography Reporting and Data
et al. 2008). Work is underway to introduce System (Zalis et al. 2005).
RadLex controlled terms into radiology
reports to reduce radiologist variation in zz Knowledge Representation of Radiology
use of terms for describing images (Kahn Jr. Procedures
et al. 2009). Tools are beginning to appear A very important type of knowledge represen-
enabling radiologists to annotate and query tation for images is the type of imaging proce-
image databases using RadLex and other dure that produced it. While RadLex contains
controlled terminologies (Rubin et al. 2008b; atomic terms for the various modalities, such
Channin et al. 2009a, b). as CT and MRI, it lacks the full spectrum of
In addition to RadLex, there are other types of procedures that can be performed
important controlled terminologies for on a patient. The RadLex Playbook (Wang
radiology. The Breast Imaging Reporting et al. 2017) is a project of the Radiological
and Data System (BI-RADS) is a lexicon Society of North America that provides a
of descriptors and a reporting structure standard system for naming radiology pro-
comprising assessment categories and man- cedures, based on atomic terms (usually from
agement recommendations created by the RadLex) that define an imaging procedure,
American College of Radiology (D’Orsi and such as “CT Head.” By providing standard
Newell 2007). Terminologies are also being names and codes for radiologic studies, the
created in other radiology imaging domains, RadLex Playbook can facilitate a variety of
including the Fleischner Society Glossary operational and quality improvement efforts,
of terms for thoracic imaging (Hansell et al. including workflow optimization, chargemas-
2008), the Nomenclature of Lumbar Disc ter management, radiation dose tracking,
Pathology (Appel 2001), terminologies for enterprise integration and image exchange
image guided tumor ablation (Goldberg (Wang et al. 2017).
318 D. L. Rubin et al.
.. Fig. 10.12 Semantic annotation of images. The (right), comprising terms from controlled terminologies
radiologist’s image annotation (left) and interpretation (Systematized Nomenclature of Medicine (SNOMED)
(middle) associated with the annotation are not repre- and RadLex) as well as numeric values (coordinates and
sented in a form such that the detailed content is directly measurements). (Figure reprinted with permission from
accessible. The same information can be put into a (Rubin and Napel 2010). © Thieme)
structured representation as a semantic annotation
So-called “semantic annotation” methods image, such as the name of imaging procedure
are being developed to adapt the semantic and how or when the image was acquired.
content about images that would have been AIM supports controlled terminologies,
put into narrative text so that it can instead enabling semantic interoperability. AIM has
be put in structured annotations compliant recently been incorporated into the DICOM
with the information model. The information Structured Report (DICOM SR) standard
model conveys the pertinent image informa- (DICOM Standards Committee - Working
tion explicitly and in human-readable and Group 8 – Structured Reporting 2017), with
machine accessible format. For example, a specifications for saving AIM in DICOM-SR
semantic annotation might record the coor- (DICOM Standards Committee 2017). Given
dinates of the tip of an arrow and indicate that DICOM is the international standard for
the organ (anatomic location) and imaging specifying image data, it is hoped that there
observations (e.g., mass) in that organ. These will be widespread adoption of the AIM/
annotations can be recorded in a standard, DICOM-SR format to enable interoperability
searchable format, such as the Annotation of image annotations across systems.
and Image Markup (AIM) schema, developed The AIM information model includes use
by the National Cancer Imaging Program of of controlled terms as semantic descriptors
NCI for storing and sharing image metadata of lesions (e.g., RadLex). It also provides a
(caBIG In-vivo Imaging Workspace 2008; syntax associating an ROI in an image with
Rubin et al. 2008a, 2009a). AIM captures a the aforementioned information, enabling raw
variety of information about image annota- image data to be linked with semantic infor-
tions, e.g., regions of interest, a lesion iden- mation, and thus bridges the current discon-
tification label, lesion location, measurements nect between semantic terms and the lesions
of image regions, method of measurement, in images being described. In conjunction
radiologist observations, anatomic locations with RadLex, the AIM information model
of abnormalities, calculations, inferences, provides a standard syntax (in XML schema)
and other qualitative and quantitative image to create a structured representation of the
features (Channin et al. 2009a, b). The image semantic contents of images (. Fig. 10.12).
metadata also include information about the Once the semantic contents are recorded in
320 D. L. Rubin et al.
.. Fig. 10.13 The electronic Imaging Physician Anno- annotations (pull down panel on right). As they make
AIM (as XML instances of the AIM XML semantic image annotation methods are
schema), applications can be developed for also being pursued (Carneiro et al. 2007;
image query and analysis. Mechouche et al. 2008; Yu and Ip 2008) that
AIM has been gaining traction in the will ultimately make the process of generating
research community. A number of diverse this structured information efficient.
research projects have embraced and have The electronic Imaging Physician Anno
been enabled by AIM (Levy et al. 2009, Napel tation Device (ePAD (Rubin and Snyder 2011))
et al. 2010, Gevaert et al. 2011, Gimenez et al. is a freely available web-based image viewing
2011a, b, Hoang et al. 2011, Levy and Rubin and AIM-compliant annotation tool. ePAD
2011a, b, Napel et al. 2011, Plevritis et al. 2011, permits the user to draw image annotations in
Gevaert et al. 2012a, b; Levy et al. 2012). An a manner in which they are accustomed while
increasing number of tools are supporting AIM viewing images, while simultaneously collecting
to facilitate creating semantic annotations on semantic information about the image and the
images as part of the image viewing workflow image region directly from the image itself as
are being developed, including open source well as from the user using a structured report-
projects such as Osirix (Rubin and Snyder ing template (. Fig. 10.13). The tool also fea-
2011), ClearCanvas (Klinger 2010; National tures a panel to provide feedback so as to ensure
Cancer Institute 2012), Slicer (Pieper et al. complete and valid annotations. Image annota-
2004; Fedorov 2012; Fedorov et al. 2012), and tions are saved in the AIM XML format.
ePAD (Rubin and Snyder 2011). There are By making the semantic content of images
also several commercial applications using explicit and machine-accessible, these struc-
AIM that are in development (Rubin et al. tured annotations of images will help radiolo-
2010; Zimmerman et al. 2011). Automated gists analyze data in large databases of images.
Biomedical Imaging Informatics
321 10
.. Fig. 10.14 ePAD and AIM within the clinical/ the AIM Data Service, and (4) a variety of software
research environment. (1) Images are acquired and applications can use the AIM Data Service to access the
stored in the hospital PACS, (2) the Radiologist uses image metadata for different purposes, such as listing
ePAD to review the images and to make measurements the lesion measurements or generating a summary of
on cancer lesions, (3) The measurements (saved as AIM patient response assessment
XML in ePAD) with links to the images are stored by
. Figure 10.14 shows how image annota- ated at each time point. Automated tools such
tions in AIM can be integrated into rou- as ePAD can use semantic image annotations
tine research and clinical workflows. Images to identify the measurable lesions at each time
acquired from imaging devices flow into the point and produce a summary of, and auto-
PACS and can be viewed on an imaging work- matically reason about, the total tumor bur-
station. If the imaging workstation supports den over time, helping physicians to determine
AIM (e.g., ePAD as shown in the figure), then how well patients are responding to treatment
image annotations and radiologist observa- (Levy and Rubin 2008).
tions are saved in the AIM format and stored
in database of AIM annotation files (an XML kAtlases
database in the case of ePAD). The AIM Spatial representations of anatomy, in the
annotations have a pointer to their corre- form of segmented regions on 2-D or 3-D
sponding images and queries and analyses can images, or 3-D surfaces extracted from image
be done on the image annotations for clinical volumes, are often combined with symbolic
applications, such as summarizing the change representations to form digital atlases. A digi-
in cancer lesion sizes over time for assessing tal atlas (which for this chapter refers to an
response assessment to cancer (. Fig. 10.14). atlas created from 3-D image data taken from
Cancer patients often have many serial imag- real subjects, as opposed to artists’ illustra-
ing studies in which a set of lesions is evalu- tions) is generally created from a single indi-
322 D. L. Rubin et al.
10
.. Fig. 10.15 The Digital Anatomist Dynamic Scene shown here was created by first requesting one of the
Generator. The user can select a set of 3-D anatomical muscles of the thorax, then exploring FMA to find the
models, in which each model is associated with an entity heart, aorta and branches, and thoracic vertebral col-
from the Foundational Model of Anatomy (FMA) to umn. In the scene on the left, the first thoracic vertebra
select a starting structure, load relations from the FMA was clicked, which caused it to be highlighted, and the
between that structure and related structures, and, if relations between it and other structures are then shown
3-D models are available, to add those models to the in the panel on the right. The top pane of the right
scene. The evolving scene can be manipulated in real- panel = shows the structures with which the first tho-
time on the web and can be saved as a standalone scene racic vertebra articulates. (Figure used with permission
that can be saved locally or accessed via URL, thus per- from Jim Brinkley)
mitting it to be embedded in other web apps. The scene
vidual, which therefore serves as a “canonical” active browsing, where the names of struc-
instance of the species. Traditionally, atlases tures are given in response to mouse clicks;
have been primarily used for education, and dynamic creation of “pin diagrams”, in which
most digital atlases are used the same way. selected labels are attached to regions on the
As an example in 2-D, the Digital images; and dynamically-generated quizzes, in
Anatomist Interactive Atlases (Sundsten which the user is asked to point to structures
et al. 2000) were created by outlining ROIs on the image (Brinkley et al. 1997).
on 2-D images (many of which are snapshots As an example 3-D, the Digital
of 3-D scenes generated by reconstruction Anatomist Dynamic Scene Generator (DSG,
from serial sections) and labeling the regions . Fig. 10.15) creates interactive 3-D atlases
with terminology from the FMA. The atlases, “on-the-fly” for viewing and manipulation
which are available on the web, permit inter- over the web (Brinkley et al. 1999; Wong et al.
Biomedical Imaging Informatics
323 10
1999). An example of a 3-D brain atlas cre- 10.4 Image Processing
ated from the Visible Human is Voxelman
(Hohne et al. 1995), in which each voxel in the Image processing is a form of signal pro-
Visible Human head is labeled with the name cessing in which computational methods
of an anatomic structure in a “generalized are applied to an input image to produce an
voxel model” (Hohne et al. 1990), and highly- output image or a set of characteristics or
detailed 3-D scenes are dynamically gener- parameters related to the image. Most image
ated. Several other brain atlases have also processing techniques involve treating the
been developed, primarily for educational image as a two-dimensional signal and ana-
use (Johnson and Becker 2001; Stensaas and lyzing it using signal-processing techniques
Millhouse 2001). or a variety of other transformations or com-
Atlases have also been developed for putations. There are a broad variety of image
integrating functional data from multiple processing methods, including transforma-
studies (Bloom and Young 1993; Toga et al. tions to enhance visualization, computations
1994, 1995; Swanson 1999; Fougerousse to extract features, and systems to automate
et al. 2000; Rosen et al. 2000; Martin and detection or diagnose abnormalities in the
Bowden 2001). In their original published images. The latter two methods, referred to
form these atlases permit manual drawing as computer-assisted detection and diagnosis
of functional data, such as neurotransmit- (CAD) is discussed in 7 Sect. 10.5.2. In this
ter distributions, onto hardcopy printouts section we discuss the former methods, which
of brain sections. Many of these atlases are more elemental and generic processing
have been or are in the process of being methods.
converted to digital form. The Laboratory The rapidly increasing number and types
of Neuroimaging (LONI) at the University of digital images has created many opportu-
of California Los Angeles has been particu- nities for image processing, since one of the
larly active in the development and analy- great advantages of digital images is that they
sis of digital atlases (Toga 2001), and the can be manipulated just like any other kind
California Institute of Technology Human of data. This advantage was evident from the
Brain Project has released a web-accessible early days of computers, and success in pro-
3-D mouse atlas acquired with micro-MR cessing satellite and spacecraft images gener-
imaging (Dhenain et al. 2001). ated considerable interest in biomedical image
The most widely used human brain atlas processing, including automated image anal-
is the Talairach atlas, based on post mortem ysis to improve radiological interpretation.
sections from a 60-year-old woman (Talairach Beginning in the 1960s, researchers devoted
and Tournoux 1988). This atlas introduced a a large amount of work to this end, with the
proportional coordinate system (often called hope that eventually much of radiographic
“Talairach space”) which consists of 12 rect- image analysis could be improved. One of the
angular regions of the target brain that are first areas to receive attention was automated
piecewise affine transformed to corresponding interpretation of chest X-ray images, because,
regions in the atlas. Using these transforms previously, most patients admitted to a hos-
(or a simplified single affine transform based pital were subjected to routine chest X-ray
on the anterior and posterior commissures) examinations. (This practice is no longer
a point in the target brain can be expressed considered cost effective except for selected
in Talairach coordinates, and thereby related subgroups of patients.) Interestingly, recent
to similarly transformed points from other research in deep learning (discussed below),
brains. Other human brain atlases have also however, has raised enthusiasm and hopes
been developed (Schaltenbrand and Warren for automating certain tasks of radiographic
1977; Hohne et al. 1992; Caviness et al. 1996; image interpretation, such as detection of
Drury and Van Essen 1997; Van Essen and pneumonia (Rajpurkar et al. 2017). While
Drury 1997). most of the emphasis of image processing
324 D. L. Rubin et al.
continues to be on systems that aid the user ment systems such as PACS (7 Chap. 22)
in viewing and manipulating images, there is and powerful workstations, has led to many
a quickly growing body of work completely applications of image processing techniques.
automated analysis of images, particularly In general, routine techniques are available
for lesion detection, image segmentation, and on the manufacturer’s workstations (e.g., a
image classification (diagnosis). vendor-provided console for an MR machine
Medical image processing utilizes tools or an ultrasound machine), whereas more
similar to general image processing. But there advanced image-processing algorithms are
are unique features to the medical imagery available as software packages that run on
that present different, and often more diffi- independent workstations.
cult, challenges from those that exist in gen- The primary uses of image processing
eral image processing tasks. To begin with, the in the clinical environment are for image
images analyzed all represent the 3D body; enhancement, screening, and quantitation.
thus, the information extracted (be it in 2D Software for such image processing is pri-
or 3D) is based on a 3D volumetric object. marily developed for use on independent
The images themselves are often taken from workstations. Several journals are devoted
multi-modalities (CT, MRI, PET), where each to medical image processing (e.g., IEEE
modality has its own unique physical charac- Transactions on Medical Imaging, Journal of
teristics, leading to unique noise, contrast and Digital Imaging, Neuroimage), and the num-
other issues that need to be addressed. The ber of journal articles is rapidly increasing
fusion of information across several modali- as digital images become more widely avail-
ties is a challenge that needs to be addressed able. Several books are devoted to the spec-
10 as well. trum of digital imaging processing methods
When analyzing the data, it is often desir- (Yoo 2004; Gonzalez et al. 2009), and the
able to segment and characterize specific reader is referred to these for more detailed
organs. The human body organs, or vari- reading on these topics. We describe a few
ous tissue of interest within them, cannot be examples of image-processing techniques in
described with simple geometrical rules, as the remainder of this section.
opposed to objects and scenes in non-medi-
cal images that usually can be described with
such representations. This is mainly because 10.4.1 Types of Image-Processing
the objects and free-form surfaces in the body Methods
cannot easily be decomposed into simple
geometric primitives. There is thus very little Image processing methods are applied to rep-
use of geometric shape models that can be resentations of image content (7 Sect. 10.3).
defined from a-priori knowledge. Moreover, One may use the very low-level, pixel represen-
when trying to model the shape of an organ tation. The computational effort is minimal in
or a region, one needs to keep in mind that the representation stage, with substantial effort
there are large inter-person variations (e.g., (computational cost) in further analysis stages
in the shape and size of the heart, liver and such as segmentation of the image, matching
so on), and, as we are frequently analyzing between images, registration of images, etc.
images of patients, there is a large spectrum A second option is to use a very high-level
of abnormal states that can greatly modify image content representation, in which each
tissue properties or deform structures. Finally, image is labeled according to its semantic
especially in regions of interest that are close content (medical image categories such as
to the heart, complex motion patterns need to “abdomen vs chest”, “healthy vs pathol-
be accounted for as well. These issues make ogy”). In this scenario, a substantial compu-
medical image processing a very challenging tational effort is needed in the representation
domain. stage, including the use of automated image
The widespread availability of digi- segmentation methods to recognize ROIs as
tal images, combined with image manage- well as advanced learning techniques to clas-
Biomedical Imaging Informatics
325 10
sify the regions of image content. Further methods, except for image reasoning which is
analysis can utilize knowledge resources such discussed in 7 Sect. 10.5.
as ontologies, linked to the images using cat-
egory labels. A mid-level representation exists,
that balances the above two options, in which 10.4.2 Global Processing
a transition is made from pixels to semantic
features. Feature vectors are used to represent Global processing involves computations on
the spectrum of image content compactly and the entire image, without regard to specific
subsequent analysis is done on the feature regional content. The purpose is generally to
vector representation. enhance an image for human visualization or
Image processing is the foundation for cre- for further analysis by the computer (“pre-
ating image-based applications, such as image processing”). A simple but important example
enhancement to facilitate human viewing, to of global image processing is gray-scale win-
show views not present in the original images, dowing of CT images. The CT scanner gener-
to flag suspicious areas for closer examination ates pixel values (Hounsfield numbers, or CT
by the clinician, to quantify the size and shape numbers) in the range of −1000 to +3000.
of an organ, and to prepare the images for Humans, however, cannot distinguish more
integration with other information. To create than about 100 shades of gray. To appreciate
such applications, several types of image pro- the full precision available with a CT image,
cessing are generally performed sequentially the operator can adjust the midpoint and
in an image processing pipeline, although some range of the displayed CT values. By chang-
processing steps may feed back to earlier ones, ing the level and width (i.e., intercept and
and the specific methods used in a pipeline slope of the mapping between pixel value and
varies with the application. Most image pro- displayed gray scale or, roughly, the bright-
cessing pipelines and applications generalize ness and contrast) of the display, radiologists
from two-dimensional to three-dimensional enhance their ability to perceive small changes
images, though three-dimensional images in contrast resolution within a subregion of
pose unique image processing opportunities interest.
and challenges. Image processing pipelines are Other types of global processing change
generally built using one or more of the fol- the pixel values to produce an overall enhance-
lowing fundamental image processing meth- ment or desired effect on the image: histogram
ods: global processing, image enhancement, equalization, convolution, and filtering. In
image rendering/visualization, image quanti- histogram equalization, the pixel values are
tation, image segmentation, image registra- changed, spreading out the most frequent
tion, and image reasoning (e.g., classification). intensity values to increase the global contrast
Those steps are shown in . Fig. 10.16. In the of the image. It is most effective when the
remainder of this section we describe these usable data of the image are represented by a
326 D. L. Rubin et al.
narrow range of contrast values. Through this image are isotropic, then a variety of arbitrary
adjustment, the intensities can be better dis- projections can be derived from the volume,
tributed on the histogram, improving image such as a sagittal or coronal view, or even
contrast by allowing for areas of lower local curved planes. A technique called maximum
contrast to gain a higher contrast. In convo- intensity projection (MIP) and minimum
lution and filtering, mathematical functions intensity projection (MinIP) can also be cre-
are applied to the entire image for a variety of ated in which imaginary rays are cast through
purposes, such as de-noising, edge enhance- the volume, recording the maximum or mini-
ment, and contrast enhancement. mum intensity encountered along the ray
path, respectively, and displaying the result as
a 2-D image.
10.4.3 Image Enhancement In addition to these planar visualizations,
the volume can be visualized directly in its
Image enhancement uses global processing to entirety using volume rendering techniques
improve the appearance of the image either (Lichtenbelt et al. 1998) (. Fig. 10.17) which
for human use or for subsequent process- project a two-dimensional image directly from
ing by computer. The consoles of all vendor a three- dimensional voxel array by casting
image viewing platforms and independent rays from the eye of the observer through the
image-processing workstations provide some volume array to the image plane. Because each
form of image enhancement. We have already ray passes through many voxels, some form
mentioned CT windowing. Another tech- of segmentation (usually simple threshold-
nique is unsharp masking, in which a blurred, ing) often is used to remove obscuring struc-
10 or “unsharp,” positive is created to be used as tures. As workstation memory and processing
a “mask” that is combined with the original power have advanced, volume rendering has
image, creating the illusion that the resulting become widely used to display all sorts of
image is sharper than the original. The tech- three-dimensional voxel data—ranging from
nique increases local contrast and enhances cell images produced by confocal microscopy,
the visibility of fine-detail (high-frequency) to three-dimensional ultrasound images, to
structures. Histogram equalization spreads brain images created from MRI or PET.
the image gray levels throughout the vis- Volume images can also be given as input to
ible range to maximize the visibility of those image-based techniques for warping the image
gray levels that are used frequently. Temporal volume of one structure to other. However,
subtraction subtracts a reference image from more commonly the image volume is processed
later images that are registered to the first. in order to extract an explicit spatial (or quan-
A common use of temporal subtraction is titative) representation of anatomy (7 Sect.
digital-subtraction angiography (DSA) in 10.4.5). Such an explicit representation permits
which a background image is subtracted from improved visualization, quantitative analysis
an image taken after the injection of contrast of structure, comparison of anatomy across a
material. population, and mapping of functional data. It
is thus a component of most research involving
3-D image processing.
10.4.4 Image Rendering/
Visualization
10.4.5 Image Quantitation
Image rendering and visualization refer to a
variety of techniques for creating image dis- Image quantitation is the process of extracting
plays, diagrams, or animations to display useful numerical parameters or deriving cal-
images more in a different perspective from culations from the image or from ROIs in the
the raw images. Image volumes are comprised image (often as part of “radiomics” analyses,
of a stack of 2-D images. If the voxels in each described below) (Scheckenbach et al. 2017;
Biomedical Imaging Informatics
327 10
a b
.. Fig. 10.17 Three-dimensional CT scan of the full isotropic voxels (the dimension of pixels in the x,y plane
abdomen and pelvis a and of the liver b. The volumetric is the same as in the z dimension). The volume is ren-
CT scan comprises a set of contiguous slices containing dered directly using volume-rendering techniques
Lohmann et al. 2018; Valdora et al. 2018). domain knowledge, since they are designed to
These values are also referred to as “quantita- capture specific characteristics of the image or
tive imaging features.” These parameters may image region, such as texture, shapes, lesion
themselves be informative—for example, the margin characteristics, and image noise. Pre-
volume of the heart or the size of the fetus. defined image features are generally designed
They also may be used as input into an auto- to capture characteristics of the image that
mated classification procedure, which deter- reflect underlying biology, tissue function, or
mines the type of object found. For example, disease. For example, heart size, shape, and
small round regions on chest X-ray images motion are subtle indicators of heart function
might be classified as tumors, depending on and of the response of the heart to therapy
such features as intensity, perimeter, and area. (Clarysse et al. 1997). Similarly, fetal head size
Mathematical models often are used in and femur length, as measured on ultrasound
conjunction with image quantitation. In images, are valuable indicators of fetal well-
classic pattern-recognition applications, the being (Brinkley 1993).
mathematical model is a classifier (learned Pre-defined image features are quantitative
using some type of supervised machine learn- representations of visual signals contained in
ing) that assigns a label to the image; e.g., to an image. Two types of pre-defined image fea-
indicate if the image contains an abnormal- tures are photometric features, which exploit
ity, or indicates the diagnosis underlying an color and texture cues, derived directly from
abnormality. raw pixel intensities, and geometric features,
which use shape-based cues. While color is
zz Quantitative Image Features one of the visual cues often used for content
Quantitation of images uses global process- description (Hersh et al. 2009), most medi-
ing and segmentation to characterize regions cal images are grayscale. Texture features
of interest in the image with numerical values. encode spatial organization of pixel values
There are two kinds of quantitative image fea- of an image region. Shape features describe
tures, pre-defined image features and learned in quantitative terms the contour of a lesion
features. Pre-defined image features encode and complement the information captured by
328 D. L. Rubin et al.
color or texture (Depeursinge et al. 2014). In have the capability to identify the entire image
addition, the histogram of pixel values within content (similar to a puzzle being formed
an ROI or transforms on those values is com- from its pieces).
monly performed to compute quantitative Patch extraction approaches include using
image features. a regular sampling grid, a random selection
Pre-defined image features are com- of points, or the selection of points with
monly represented by feature-vectors in a high information content using salient point
N-dimensional space, where each dimension detectors, such as SIFT (Lowe 1999). Once
of the feature vector describes an aspect of patches are selected, the information content
the individual pixel (e.g., color, texture, etc.) within a patch is extracted. It is possible to
(Haralick and Shapiro 1992) Image analysis take the patch information as a collection of
tasks that use the quantitative features, such pixel values, or to shift the representation to
as segmentation and classification are then a different set of features based on the pix-
approached in terms of distance measure- els, such as SIFT features. A final step in the
ments between points (samples) in the chosen process is to learn a dictionary of words over
N-dimensional feature space. a large collection of patches, extracted from
In contrast to pre-defined image features, a large set of images. The vector represented
learned features are derived by computa- patches are converted into “visual words”
tional analysis of the image itself without which form a representative “dictionary”.
incorporating any domain knowledge. Deep A visual word can be considered as a rep-
learning methods (described below) are a resentative of several similar patches. A fre-
common and popular approach for deriving quently-used method is to perform K-means
10 learned features. In deep learning, the goal clustering (Bishop 1995) over the vectors of
is to learn a task directly from a large col- the initial collection, and then cluster them
lection of images; the parameters of deep into K groups in the feature space. The resul-
learning models reflect image features that tant cluster centers serve as a vocabulary of
are extracted during the learning of these K visual words, with K often in the hundreds
models (Milletari et al. 2016; Yasaka et al. and thousands.
2018). Once a global dictionary is learned, each
image is represented as a collection of words
zz Image Patches (also known as a “bag of words”, or “bag of
In the last several years, “patch-based” repre- features”), using an indexed histogram over
sentations and “bag-of-features” classification the defined words. Various image processing
techniques have been proposed and used as an tasks can then be undertaken, ranging from
approach to processing image contents (Jurie the categorization of the image content, giving
and Triggs 2005; Nowak et al. 2006; Avni the image a “high-level,” more semantic label,
2009). An overview of the methodology is the matching between images, or between an
shown in . Fig. 10.18, and represents one of image and an image class, using patches for
the types of “feature learning” being used for image segmentation and region- of-
interest
automated computer analysis of images (the detection within an image. For these various
other type of feature learning, deep learning, is tasks, images are compared using a distance
discussed below). In image patch approaches, measure between the representative histo-
a shift is made from the pixel as being the grams. In categorizing an image as belonging
atomic entity of analysis to a “patch” – a small to a certain image class, well-known classifi-
window centered on the pixel, thus region- ers, such as the k- nearest neighbor and sup-
based information is included. A very large set port-vector machines (SVM) (Vapnik 2000),
of patches is extracted from an image. Each can be used.
small patch shows a localized “glimpse” at the Using patches in a bag-of-visual-words
image content; the collection of thousands (BoW) representation was shown to be suc-
and more such patches, randomly selected, cessful in general scene and object recognition
Biomedical Imaging Informatics
329 10
.. Fig. 10.18 A block diagram of the patch-based individual pixels in the image. A dictionary of visual
image representation. A radiographic image is shown words is learned from a large set of images, and their
with a set of patches indicated for processing the image respective patches. Further analysis of the image con-
data. Subsequent image processing is performed on each tent can then be pursued based on a histogram across
patch, and on the entire set of patches, rather than on the dictionary words
tasks (Fei-Fei and Perona 2003, Varma and benchmarking tool to assess different feature
Zisserman 2003, Sivic and Zisserman 2003, sets as well as classification schemes on large
Nowak et al. 2006, Jiang et al. 2007). A few archives of Radiographs. For several years,
research studies were conducted in the medical approaches based on local patch representa-
domain as well. For example, in (André et al. tion achieved the highest scores for categoriza-
2009) BoW was used as the representation of tion accuracy (Deselaers et al. 2006; Caputo
endomicroscopic images and achieved high et al. 2008; Greenspan et al. 2011).
accuracy in the tasks of classifying the images
into neoplastic (pathological) and benign. In zz Radiomics, Machine Learning and Deep
(Bosch et al. 2006) an application to texture Learning
representation for mammography tissue classi- Radiomics describes a broad set of com-
fication and segmentation was presented. The putational methods that extract quantita-
use of BoW techniques for large scale radio- tive features from radiology images (though
graph archive categorization can be found in similar approaches can be applied to other
the ImageCLEF competition, in a task to clas- image types like histopathology or ophthal-
sify over 12,000 X-ray images to 196 different mology images) (Kumar et al. 2012; Lambin
(organ-level) categories (Tommasi et al. 2010). et al. 2012; Grossmann et al. 2017). The term
This competition provides an important “radiomics” has been used to mean a variety
330 D. L. Rubin et al.
of concepts, being wide in scope to include selections. This permits a quantitative mea-
several fields, including clinical radiology (eg, surement of the tumor shape so that subtle
imaging interpretation), computer vision (eg, variations during treatment can be observed
quantitative feature extraction), and machine and quantified. Local-level feature extraction
learning (eg, classifier evaluation). In recent provides an image descriptor used to compare
years, radiomics is commonly used to refer to a pixel being tested with its immediate pixel
the quantitation of image features in large col- neighborhood. This allows identification of a
lections of images, akin to the use of “omics” small area within an otherwise homogeneous,
for large scale collection and analysis of other larger tumor region. This can be achieved, for
types of biomedical data. It includes all the example, with local binary patterns (LBP).
quantification techniques described in this These are local image descriptors sensitive to
Chap.. A more distinctive and unique formal- small monotonic gray-level differences (Ojala
ism for the term is to view the quantitation et al. 2002). Texture descriptors, such as the
that it represents as one that focuses on the LBP are very common in ROI descriptions,
identification of quantitative imaging indica- including gray-level co-occurrence matrices
tors that predict important clinical outcomes, (Haralick et al. 1973) that examine the spatial
e.g. prognosis and response or resistance to a relationships of pixels through a series of sta-
specific cancer treatment (Zhou et al. 2018). tistical measures, and histogram of oriented
One motivation of radiomics is informa- gradients (HOG) (Dalal and Triggs 2005)
tion integration—to merge image features features to quantify image-gradient statistics
with other known quantitative descriptors, with multiple directions not obvious to radi-
including patient information and genomic ologists.
10 data to generate a unique patient signature (or Machine Learning is commonly used for
electronic phenotype). Initially, pre-defined discovering predictive radiomics features.
image features were extracted, including for In machine learning, the parameter space
example a large number of quantifications is searched for an imaging feature statisti-
defined from texture features, SIFT features, cally associated with clinical outcome (Zhou
etc. (Napel et al. 2010) Once machine learning et al. 2018). Before one evaluates machine-
tools, specifically deep learning tools emerged, learning models, a specification for the medi-
the latter have begun to take over several of cal diagnostic task is needed so that models
the stages within the radiomics processing can be appropriately trained. For example,
cycle – specifically, the generation of large sets supervised, unsupervised, and semisuper-
of automatically extracted features for the vised learning models are fundamental
quantification of the data, since deep learn- learning strategies used in accordance with
ing models learn image features as part of the the different levels of available clinical out-
training process (Kontos et al. 2017; Giger come labels.
2018). In supervised learning, the goal is to learn
Radiomics relies on computational tech- from a certain portion of trained samples
niques in computer vision to extract many with known class labels and to predict classes
quantitative features from radiologic images. or numeric values for unknown patterns from
The extracted quantitative features are typi- large and noisy datasets. Conversely, unsuper-
cally within a defined ROI that could include vised learning finds the natural structure from
the whole tumor or specific regions within it. data without having any prior labels. As a
Computational image descriptors quantify hybrid setting, semisupervised learning needs
visual characteristics at different scales from only a small portion of labeled training data.
ROIs. For example, the scale-invariant feature The unlabeled data samples,
transform (SIFT) (Lowe 1999) is computed instead of being discarded, are also used in
through key point detection using a differ- the learning process.
ence of Gaussian function and local image Deep Learning as a new frontier in machine
gradient measurement with radius and scale learning that is quickly rising as a primary
Biomedical Imaging Informatics
331 10
.. Fig. 10.19 Above is a simply 5-layer fully convolu- unit, or ReLU. Finally, should we need to conserve some
tional network. The first convolutional block is sepa- memory, we can use pooling operations, such as a Max-
rated out into three components. The first is the actual Pool layer to decrease the size of our tensors. This fun-
2-dimensional convolution. Different convolutional damental combination of convolutions, nonlinearities,
weights act on the input tensor to create the output ten- and pooling operations is used in every convolutional
sor, each weight acting as a feature extractor. This is fol- neural network since its rise to prominence in 2012.
lowed by a nonlinearity. Common nonlinearities include (Figure courtesy of Darvin Yi)
the sigmoid, hyperbolic tangent, and the rectified linear
approach to many image analysis problems, Deep learning methods have achieved
and it has advanced large-scale medical image record-breaking performances for numerous
analysis. (Greenspan et al. 2016; Shen et al. computer vision applications when the num-
2017). Excitement for using deep learning for ber of available training samples is sufficiently
attacking many image analysis problems in large (Deng et al. 2009).
medical imaging has grown quickly because Among the network architectures and
these methods were the first to be top per- models, convolutional neural networks
forming methods in the ImageNet classifica- (CNNs) have proven to be powerful tools for
tion challenge (Krizhevsky et al. 2012), and a broad range of computer vision tasks. The
much of medical imaging analysis is an image typical CNN architecture for image process-
classification problem. The development of ing consists of a series of layers of convolu-
deep learning, as part of the machine-learning tion filters, followed by or interspersed with
field, provided a new approach in which the a series of data reduction or pooling layers
input data is automatically quantified while (. Fig. 10.19). The convolution filters are
being analyzed. applied to small patches of the input image.
Deep learning has been termed one of Like the low-level vision processing in the
the 10 breakthrough technologies as of 2013 human brain, the convolution filters detect
(MIT Technology Review 2013). It is an increasingly more relevant image features, for
improvement of artificial neural networks, example lines or circles that may represent
architectures of computational units (“neu- straight edges (such as for organ detection) or
rons”), which are designed in several (all the circles (such as for round objects like colonic
way to thousands) of layers (“deep”) – where polyps), and then higher order features like
it was found that more layers permit higher local and global shape and texture. The out-
levels of abstraction and improved predic- put of the CNN is typically one or more
tions from data (LeCun et al. 2015). To date, probabilities or class labels. The convolution
it is emerging as the leading machine-learning filters are learned from training data. This is
tool in the general imaging and computer desirable because it reduces the necessity of
vision domains. the time-consuming hand-crafting of features
332 D. L. Rubin et al.
.. Fig. 10.20 Image segmentation. This figure illus- RA right atrium, LA left atrium, RV right ventricle, LV
trates the process of segmenting and labeling the cham- left ventricle). The boundary of each circumscribed ana-
bers of the heart. On the left, a cross sectional atlas tomic region can be converted into a digital mask (right)
image of the heart has been segmented by hand and which can be used in different applications where label-
each chamber was labeled (RAA right atrial appendage, ing anatomic structures in the image is needed
of pixels to one of the model components or tion now becoming an image segment. In this
segments. In order to be directly relevant for approach, global considerations determine
a segmentation task, the clusters in the model localized decisions. Moreover, such optimiza-
should represent homogeneous regions of the tion procedures are often compute-intensive.
image. In general, the better the image mod- Consider an example application in brain
eling, the better the segmentation produced. image segmentation using parametric mod-
Since the number of clusters in the feature eling and clustering. The tissue and lesion
space is often unknown, segmentation can be segmentation problem in Brain MRI is a
regarded as an unsupervised clustering task in well-studied topic of research. In such images,
the high-dimensional feature space. there is interest in three main tissue types:
There are many works on clustering algo- white matter (WM), gray matter (GM) and
rithms. We can categorize them into three cerebro-spinal fluid (CSF). The volumetric
broad classes: (a) deterministic algorithms, analysis of such tissue types in various part
(b) probabilistic model-based algorithms, of the brain is useful in assessing the prog-
and (c) graph-theoretic algorithms. The sim- ress or remission of various diseases, such
plest of these are the deterministic algorithms as Alzheimer’s disease, epilepsy, sclerosis
such as k-means (Bishop 1995), mean-shift and schizophrenia. A segmentation example
(Comaniciu and Meer 2002), and agglomera- is shown in . Fig. 10.21. In this example,
tive methods (Duda et al. 2001). For certain images from 3 MRI imaging sequences are
data distributions, i.e., distributions of pixel input to the system, and the output is a seg-
feature vectors in a feature space, such algo- mentation map, with different colors repre-
rithms perform well. For example, k-means senting three different normal brain tissues, as
10 provides good results when the data are convex well as a separate color to indicate regions of
or blob-like and the agglomerative approach abnormality (multiple-sclerosis lesions).
succeeds when clusters are dense and there is Various approaches to the segmenta-
no noise. These algorithms, however, have a tion task are reviewed in (Pham et al. 2000).
difficult time handling more complex struc- Among the approaches used are pixel-level
tures in the data. The probabilistic algorithms, intensity based clustering, such as K-means
on the other hand, model the distribution in and Mixture of Gaussians modeling (e.g.,
the data using parametric models (McLachlan (Kapur et al. 1996)). In this approach, the
and Peel 2000). Such models include auto- intensity feature is modeled by a mixture of
regressive (AR) models, Gaussian mixture Gaussians, where each Gaussian is assigned a
models (GMM), Markov random fields semantic meaning, such as one of the tissue
(MRF), conditional random fields, and oth- regions (or lesion). Using pattern recognition
ers. Efficient ways of estimating these models methods and learning, the Gaussians can be
are available using maximum likelihood algo- automatically extracted from the data, and
rithms such as the Expectation-maximization once defined, the image can be segmented into
(EM) algorithm (Dempster et al. 1977). While the respective regions.
probabilistic models offer a principled way to Algorithms for tissue segmentation using
explain the structures present in the data, they pixel-level intensity-based classification often
could be restrictive when more complex struc- exhibit high sensitivity to various noise arti-
tures are present. facts, such as intra-tissue noise, inter-tissue
Another type of clustering algorithms is intensity contrast reduction, partial-volume
non-parametric in that this class imposes no effects and others. Due to the artifacts pres-
prior shape or structure on the data. Examples ent, classical voxel-wise intensity-based clas-
of these are graph-theoretic algorithms based sification methods, including the K-means
on spectral factorization (e.g., (Ng et al. 2001, modeling and Mixture of Gaussians model-
Shi and Malik 2000)). Here, the image data ing, often give unrealistic results, with tissue
are modeled as a graph. The entire image class regions appearing granular, fragmented,
data along with a global cost function are or violating anatomical constraints. Specific
used to partition the graph, with each parti- works can be found addressing various aspects
Biomedical Imaging Informatics
335 10
Ground
T1 T2 PD Truth
.. Fig. 10.21 Brain MRI segmentation example. Brain of the images is shown on the right: Blue: CSF; Green:
slice from multiple acquisition sequences (with 9% Gray matter (GM); Yellow: white matter (WM); Red:
noise) was taken from BrainWEB (7 http://www.bic. Multiple-sclerosis lesions (MSL). (Friefeld et al. 2009).
mni.mcgill.ca/brainweb/). From left to right: T1-, T2-, (Reused with permission © Brainweb)
and proton density (PD)-weighted image. Segmentation
of these concerns (e.g., partial-volume effect tures) and the intensity characteristic per
quantification (Dugas-Phocion et al. 2004)). region (T1 intensity feature). Incorporating
One way to address the smoothness issue is the spatial information within the feature
to add spatial constraints. This is often done space is novel, as is using a large number of
during a pre-processing phase by using a sta- Gaussians per brain tissue to capture the
tistical atlas, or as a post-processing step via complicated spatial layout of the individual
Markov Random Field models. A statistical tissues. Two key features of the proposed
atlas provides the prior probability for each framework are: 1) combining global intensity
pixel to originate from a particular tissue class modeling with localized spatial modeling, as
(e.g., (Van Leemput et al. 1999; Marroquin an alternative scheme to MRF modeling, and
et al. 2002; Prastawa et al. 2004)). 2) segmentation is entirely unsupervised; thus
Algorithms exist that use the maximum- eliminating the need for atlas registration, or
a-posteriori (MAP) criterion to augment any intensity model standardization.
intensity information with the atlas. However, Segmentation can also be improved using
registration between a given image and the a post-processing phase in which smoothness
atlas is required, which can be computation- and immunity to noise can be achieved by
ally prohibitive (Rohlfing and Maurer Jr. modeling the interactions among neighbor-
2003). Further, the quality of the registration ing voxels. Such interactions can be modeled
result is strongly dependent on the physiologi- using a Markov Random Field (MRF), and
cal variability of the subject and may converge thus this technique has been used to improve
to an erroneous result in the case of a diseased segmentation (Held et al. 1997; Van Leemput
or severely damaged brain. Finally, the regis- et al. 1999; Zhang et al. 2001).
tration process is applicable only to complete
volumes. A single slice cannot be registered zz Segmentation Using Deep Learning
to the atlas. Therefore it cannot be segmented As noted earlier in this chapter, one of the fast-
using these state-of-the-art algorithms. In est emerging research field over the last few
(Greenspan et al. 2006) a robust, unsuper- years is deep learning. Deep learning can help
vised, parametric method for segmenting outperform classical machine learning algo-
3D (or 2D) MR brain images with a high rithms due to its ability to learn latent vari-
degree of noise and low contrast, is presented. ables within the features space, features that
A Constrained Gaussian Mixture Model the user can barely detect. On the other hand,
(CGMM) framework is proposed, in which deep learning requires a huge labeled training
each tissue is modeled with multiple four- size that is not always available, which makes
dimensional Gaussians, where each Gaussian developing robust classification models chal-
represents a localized region (3 spatial fea- lenging. However, there are methods to help
336 D. L. Rubin et al.
overcome these challenges, such as data aug- tion of the body. These opportunities are par-
mentation (Greenspan et al. 2016), transfer ticularly widely exploited in brain imaging.
learning (Hoo-Chang et al. 2016), and metric Therefore, this section concentrates on 3-D
learning (Yang et al. 2010) that were specifi- brain imaging, with the recognition that many
cally designed to handle these challenges. In of the methods developed for the brain have
transfer learning one uses a network that was been or will be applied to other areas as well.
pre-trained on another set of images, and we The basic 2-D image processing opera-
use the weights of this network as weights tions of global processing, segmentation, fea-
initialization for the current network that is ture detection, and classification generalize
analyzed. The initial weights are fine-tuned on to higher dimensions, and are usually part of
the relevant current dataset, which should give any image processing application. However,
better results than just using random weights. 3-D and higher dimensionality images give
In metric learning, one actually learns the best rise to additional informatics issues, which
metric that can represent and classify the data – include image registration (which also occurs
instead of learning the classes themselves. In to a lesser extent in 2-D), spatial representa-
that way, we actually teach the network how to tion of anatomy, symbolic representation of
learn. In addition, metric learning is a kind of anatomy, integration of spatial and symbolic
“image ontology,” and as a result, it sketches anatomic representations in atlases, anatomi-
the distances between different instances. cal variation, and characterization of anatomy.
Therefore, if a new test case will not be part of All but the first of these issues deal primar-
the training classes, it will not be misclassified ily with anatomical structure, and therefore
to one of those classes. could be considered part of the field of struc-
10 The core idea of deep learning is to con- tural informatics. They could also be thought
volve the input image with different filters and of as being part of imaging informatics and
within different scales (i.e. pooling), such as neuroinformatics.
they will able to detect both low-level and high- As noted previously, 3-D image volume
level features. Many deep learning architectures data are represented in the computer by a
are considered as patch-wise techniques. 3-D volume array, in which each voxel repre-
Convolutional neural networks such as sents the image intensity in a small volume of
U-Net (Ronneberger et al. 2015; Trebeschi space. In order to depict anatomy accurately,
et al. 2017) and V-Net (Trebeschi et al. 2017) the voxels must be accurately registered (or
were designed specifically to deal with the typ- located) in the 3-D volume (voxel registra-
ical challenges of the medical domain such as tion), and separately acquired image volumes
small amount of labeled data. Autoencoders, from the same subject must be registered with
Variational Autoencoders and stacked-each other (volume registration).
autoencoders can be used for image denois-
ing and as for unsupervised feature extraction zz Voxel Registration
(Vincent et al. 2010; Bengio et al. 2013). Other Imaging modalities such as CT, MRI, and con-
methods were designed to handle with various focal microscopy (7 Sects. 10.2.3 and 10.2.5)
of classifications tasks such as lesion detection are inherently 3-D: the scanner generally out-
(Wang et al. 2016), segmentation (Kayalibay puts a series of image slices that can easily be
et al. 2017; Trebeschi et al. 2017) and disease reformatted as a 3-D volume array, often fol-
classification (Esteva et al. 2017). lowing alignment algorithms that compensate
for any patient motion during the scanning
procedure. For this reason, almost all CT and
10.4.7 Image Registration MR manufacturers’ consoles contain some
form of three-dimensional reconstruction and
The growing availability of 3-D and higher visualization capabilities.
dimensionality structural and functional As noted in 7 Sect. 10.4.4, two-
images leads to exciting opportunities for dimensional images can be converted to 3-D
realistically observing the structure and func- volumes if they are closely spaced parallel
Biomedical Imaging Informatics
337 10
sections through a tissue or whole specimen by a Human Brain Project collaboration
and contain isotropic voxels. In this case, the between computer scientists and biologists at
problem is how to align the sections with the University of Maryland (Agrawal et al.
each other. For whole sections (either frozen 2000).
or fixed), the standard method is to embed a
set of thin rods or strings in the tissue prior zz Volume Registration
to sectioning to manually indicate the loca- A related problem to that of aligning individ-
tion of these fiducials on each section, then to ual sections is the problem of aligning sepa-
linearly transform each slice so that the cor- rate image volumes from the same subject,
responding fiducials line up in 3-D (Prothero that is, intra-subject alignment. Because differ-
and Prothero 1986). An example of this tech- ent image modalities provide complementary
nique is the Visible Human, in which a series information, it is common to acquire more
of transverse slices were acquired, then recon- than one kind of image volume on the same
structed to give a full 3-D volume (Spitzer and individual. This approach has been particu-
Whitlock 1998) (7 Chap. 22). larly useful for brain imaging because each
It is difficult to embed fiducial markers modality provides different information. For
at the microscopic level, so intrinsic tissue example, PET (7 Sect. 10.2.3) provides use-
landmarks are often used as fiducials, but the ful information about function, but does not
basic principle is similar. However, in this case provide good localization with respect to the
tissue distortion may be a problem, so non- anatomy. Similarly, MRV and MRA (7 Sect.
linear transformations may be required. For 10.2.3) show blood flow but do not pro-
example Fiala and Harris (Fiala and Harris vide the detailed anatomy visible with stan-
2001) developed an interface that allows the dard MRI. By combining images from these
user to indicate, on electron microscopy sec- modalities with MRI, it is possible to show
tions, corresponding centers of small organ- functional images in terms of the underlying
elles such as mitochondria. A non-linear anatomy, thereby providing a common neuro-
transformation (warp) is then computed to anatomic framework.
bring the landmarks into registration. The primary problem to solve in multimo-
An approach being pursued (among dality image fusion is volume registration—
other approaches) by the National Center for that is, the alignment of separately acquired
Microscopy and Imaging Research (7 http:// image volumes. In the simplest case, separate
ncmir.ucsd.edu/) combines reconstruction image volumes are acquired during a single
from thick serial sections with electron tomog- sitting. The patient’s head may be immo-
raphy (Soto et al. 1994). In this case the tomo- bilized, and the information in the image
graphic technique is applied to each thick headers may be used to rotate and resample
section to generate a 3-D digital slab, after the image volumes until all the voxels corre-
which the slabs are aligned with each other spond. However, if the patient moves, or if
to generate a 3-D volume. The advantages of examinations are acquired at different times,
this approach over the standard serial section other registration methods are needed. When
method are that the sections do not need to be intensity values are similar across modalities,
as thin, and fewer of them need be acquired. registration can be performed automatically
An alternative approach to 3-D voxel reg- by intensity- based optimization methods
istration from 2-D images is stereo-matching, (Woods et al. 1992; Collins et al. 1994). When
a technique developed in computer vision that intensity values are not similar (as is the case
acquires multiple 2-D images from known with MRA, MRV and MRI), images can be
angles, finds corresponding points on the aligned to templates of the same modalities
images, and uses the correspondences and that are already aligned (Woods et al. 1993;
known camera angles to compute 3-D coor- Ashburner and Friston 1997). Alternatively,
dinates of pixels in the matched images. The landmark-based methods can be used. The
technique is being applied to the reconstruc- landmark-based methods are similar to those
tion of synapses from electron micrographs
338 D. L. Rubin et al.
used to align serial sections (see earlier dis- image retrieval is images with specific con-
cussion of voxel registration in this section), tent—typically to find images that are simi-
but in this case the landmarks are 3-D points. lar in some ways to a query image (e.g., to
The Montreal Register Program (MacDonald find images in the PACS containing similar-
1993) is an example of such a program. appearing abnormalities to that in an image
Techniques and applications of volume regis- being interpreted). Finding images containing
tration in other domains have been described similar content is referred to as content based
(Pelizzari 1998; Ferrante and Paragios 2017). image retrieval (CBIR). By retrieving similar
images and then looking at the diagnosis of
those patients, the radiologist can gain greater
10.5 Image Interpretation confidence in interpreting the images from
and Computer Reasoning patients whose diagnosis is not yet known.
As with the task of medical diagnosis
The preceding sections of this chapter as well (7 Chap. 24), radiological diagnosis can be
as 7 Chap. 22 describe informatics aspects of enhanced using computer-based inference
image generation, storage, manipulation, and systems, the commonest type of which is deci-
display of images. Rendering an interpretation sion support systems, which assist the physi-
is a crucial final stage in the chain of activi- cian in making clinical decisions. In computer
ties related to imaging. Image interpretation inference (also referred to as “reasoning”), the
is this final stage in which the physician has machine takes in the available data (the images
direct impact on the clinical care process, by and possibly other clinical information), per-
rendering a professional opinion as to whether forms a variety of image processing methods
10 abnormalities are present in the image and the (7 Sect. 10.4), and uses one or more types of
likely significance of those abnormalities. The knowledge resources and/or mathematical
process of image interpretation requires “rea- models to render an output comprising either
soning”— drawing inferences from facts; the a decision or a ranked list of possible choices
facts are the image abnormalities detected and (e.g., diagnoses or locations on the image sus-
the known clinical history, and the inferred pected of being abnormal).
information is the diagnosis and management In this section we describe informatics
decision (what to do next, such as another test methods for image retrieval and computer
or surgery, etc.). Such reasoning usually entails inference with images.
uncertainty, and optimally would be carried
out using probabilistic approaches (7 Chap.
3), unless certain classic imaging patterns are 10.5.1 Content-Based Image
recognized. In reality, radiology practice is Retrieval
usually carried out without formal probabi-
listic models that relate imaging observations Since a key aspect of radiological inter
to the likelihood of diseases. However, varia- pretation is recognizing characteristic patt
tion in practice is a known problem in image erns in the imaging features which suggest
interpretation (Robinson 1997), and methods the diagnosis, searching databases for simi-
to improve this process are desirable. lar images with known diagnoses could be
Informatics methods can enhance radio an effective strategy to improving diagnostic
logical interpretation of images in two major accuracy. CBIR is the process of performing a
ways: (1) image retrieval systems and (2) match between images using their visual con-
computer-based inference systems. The con- tent. A query image can be presented as input
cept of image retrieval is similar to that of to the system (or a combination of a query
information retrieval (see 7 Chap. 23), in image and the patient’s clinical record), and
which the user retrieves a set of documents the system searches for similar cases in large
pertinent to a question or information need. archive settings (such as PACS) and returns
The information being sought when doing a ranked list of such similar data (images).
Biomedical Imaging Informatics
339 10
This task requires an informative representa- The qualitative image features reported by the
tion for the image data, along with similarity radiologist (“semantic features”) are comple-
measures across image data. CBIR methods mentary to the quantitative data contained
are already useful in non-medical applica- in image pixels. One approach to capturing
tions such as consumer imaging and on the image semantics is analyzing and processing
Web (Wang et al. 1997; Smeulders et al. 2000; “visual words” in images, captured as image
Datta et al. 2008). patches or codebooks (7 Sect. 10.4.5). These
There has also been ongoing work to techniques have been shown to perform well
develop CBIR methods in radiology and sev- in CBIR applications (Qiu 2002). Another
eral reviews on this subject have been pub- approach to capture image semantics is to
lished (Akgul et al. 2011; Endo et al. 2012; use the radiologist’s imaging observations as
Kumar et al. 2013; Muramatsu 2018). The image features. Several studies have found
approach generally is based on deriving quan- that combining the semantic information
titative characteristics from the images (e.g., obtained from radiologists’ imaging reports
pixel statistics, spatial frequency content, or annotations with the pixel-level features
etc.; 7 Sect. 10.4.5), followed by applica- can enhance performance of CBIR systems
tion of similarity metrics to search databases (Ruiz 2006; Zhenyu et al. 2009; Napel et al.
for similar images (Lehmann et al. 2004; 2010). The knowledge representation meth-
Muller et al. 2004; Greenspan and Pinhas ods described in 7 Sects. 10.3.2 and 10.4.5
2007; Datta et al. 2008; Deserno et al. 2009; make it possible to combine these types of
Napel et al. 2010; Faruque et al. 2013, 2015). information.
The focus of the current work is on entire
images, describing them with sets of numeri-
cal features, with the goal of retrieving similar 10.5.2 Computer-Based Inference
images from medical collections (Hersh et al.
2009; Napel et al. 2010; Faruque et al. 2013, Though image retrieval described above (and
2015) that provide benchmarks for image information retrieval in general) can be help-
retrieval. However, in many cases only a par- ful to a practitioner interpreting images, it
ticular region of the image is of interest when does not directly answer a specific question at
seeking similar images (e.g., finding images hand, such as, “what is the diagnosis in this
containing similar-appearing lesions to those patient” or “what imaging test should I order
in the query image). More recently, “local- next?” Answering such questions requires
ized” CBIR methods are being developed in inference, either by the physician with all
which a part of the image containing a region the available data, or by a computer, using
of interest is analyzed (Deselaers et al. 2007; physician inputs and the images. As the use
Rahmani et al. 2008; Napel et al. 2010). of imaging proliferates and the number of
There are several unsolved challenges in images being produced by imaging modali-
CBIR. First, CBIR has been largely focused ties explodes, it is becoming a major challenge
on query based on single 2-D images; methods for practicing radiologists to integrate the
need to be developed for 3D retrieval in which multitude of imaging data, clinical data, and
a volume is the query “image.” A second chal- soon molecular data, to formulate an accu-
lenge is the need to integrate images with non- rate diagnosis and management plan for the
image clinical data to permit retrieval based patient. Computer-based inference systems—
on entire patient cases and not single images specifically decision support systems—can
(e.g., the CBIR method should take into con- help radiologists understand the biomedical
sideration the clinical history in addition to import of this information and to provide
the image appearance in retrieving a similar guidance (Hudson and Cohen 2009).
“case”). There are two major approaches to
Another limitation of current CBIR is that computer- based inference using images: (1)
image semantics is not routinely included. using quantitative image features only (quan-
340 D. L. Rubin et al.
10
.. Fig. 10.22 Bayesian network-based system for deci- terior probabilities of disease. A list of diseases, ranked
sion support in mammography CADx. The radiologist by the probability of each disease, is return to the user
interpreting the image enters the radiology observations who can make a decision based on a threshold of prob-
and clinical information (patient history) in a structured ability of malignancy, or based on shared decision mak-
reporting Web-based data capture form to render the ing with the patient. (Figure reprinted with permission
report. This form is sent to a server which inputs the from (Rubin 2011). © Radiological Society of North
observations into the Bayesian network to calculate pos- America)
et al. 2006). Deploying CADx systems, how- standard formats and with controlled termi-
ever, can be challenging. Since the inputs nologies (7 Sect. 10.3.2).
to CADx generally need to be structured
(semantic features from the radiologist and/ zz Computerized Prediction
or quantitative features from the image), a The goal of computerized prediction using
means of capturing the structured image images is to analyze characteristics of the
information as part of the routine clinical disease manifest in the image and use that
workflow is required. A promising approach (without or with additional clinical data)
is to combine structured reporting with to make predictions about the disease (e.g.,
CADx (. Fig. 10.22); the radiologist records life expectancy of the patient, whether or
the imaging observations with a data capture not the patient’s disease will respond to a
form, which provides the structured image particular treatment, or whether the disease
content required to the CADx system. Ideally will recur or progress at some time in the
the output would be presented immediately future). Many methods have been developed
to the radiologist as the report is generated to predict such future event, using both pre-
so that the output of decision support can be defined image features and unsupervised fea-
incorporated into the radiology report. Such ture learning (7 Sect. 10.4.5) (Huang et al.
implementations will be greatly facilitated by 2016; Jun et al. 2016; Li et al. 2016; Nie et al.
informatics methods to extract and record 2016; Bogowicz et al. 2017; Fave et al. 2017;
the image information in structured and van Timmeren et al. 2017; Wu et al. 2017;
Biomedical Imaging Informatics
343 10
Zhou et al. 2017b; Betancur et al. 2018; Cha to directly answer particular questions about
et al. 2018; Gastounioti et al. 2018; Shi et al. how those entities relate to each other. For
2018). example, by traversing the part-of relationship
in an anatomy ontology, a reasoning applica-
zz Knowledge-Based Image Inference tion can infer that the left ventricle and right
Systems ventricle are part-of the chest (given that the
The CAD and CADx systems do not require ontology asserts they are each part of the
processing radiological knowledge (e.g., ana- heart, and that the heart is part of the chest),
tomic knowledge) in order to carry out their without our needing to specify this fact explic-
tasks; they are based on quantitative modeling itly in the ontology.
of relationships of images features to diagno- In reasoning by logical inference, ontolo-
ses. However, not all image-based reasoning gies that encode sufficient information
problems are amenable to this approach. In (“explicit semantics”) to apply generic rea-
particular, knowledge-based tasks such as rea- soning engines are used. The Web Ontology
soning about anatomy, physiology, and pathol- Language (OWL) (Bechhofer et al. 2004; Smith
ogy—tasks that entail symbolic manipulations et al. 2004; Motik et al. 2008) is an ontology
of biomedical knowledge and application of language recommended by the World Wide
logic—are best handled using different meth- Web Consortium (W3C) as a standard lan-
ods, such as ontologies and logical inference guage for the Semantic Web (WorldWideWeb
(see 7 Chap. 24). Consortium W3C Recommendation 10 Feb
Knowledge-based computer reasoning 2004). OWL is similar to other ontology lan-
applications use knowledge representations, guages in that it can capture knowledge by
generally ontologies, in conjunction with representing the entities (“classes”) and their
rules of logic to deduce information from attributes (“properties”). In addition, OWL
asserted facts (e.g., from observations in the provides the capability of defining “formal
image). For example, an anatomy ontology semantics” or meaning of the entities in the
may express the knowledge that “if a seg- ontology. Entities are defined using logic state-
ment of a coronary artery is severed, then ments that provide assertions about entities
branches distal to the severed branch will not (“class axioms”) using description logics (DL)
receive blood,” and “the anterior and lateral (Grau et al. 2008). DLs provide a formalism
portions of the right ventricle are supplied enabling developers to define precise seman-
by branches of the right coronary artery, tics of knowledge in ontologies and to per-
with little or no collateral supply from the form automated deductive reasoning (Baader
left coronary artery.” Using this knowledge, et al. 2003). For example, an anatomy ontol-
and recognition via image processing that the ogy in OWL could provide precise semantics
right coronary artery is severed in an injury, a for “hemopericardium,” by defining it as a
computer reasoning application could deduce pericardial cavity that contains blood.
that the anterior and lateral portions of the Highly optimized computer reasoning
right ventricle will become ischemic (among engines (“reasoners”) have been developed for
other regions; . Fig. 10.23). In performing OWL, helping developers to incorporate rea-
this reasoning task, the application uses the soning efficiently and effectively in their appli-
knowledge to draw correct conclusions by cations (Tsarkov and Horrocks 2006; Motik
manipulating the anatomical concepts and et al. 2009). These reasoners work with OWL
relationships using the rules of logical infer- ontologies by evaluating the asserted logical
ence during the reasoning process. statements about classes and their proper-
Computer reasoning with ontologies is ties in the original ontology (the “asserted
performed by one of two methods: (1) rea- ontology”), and they create a new ontology
soning by ontology query and (2) reasoning by structure that is deduced from the asserted
logical inference. In reasoning by ontology knowledge (the “inferred ontology”). This
query, the application traverse relationships reasoning process is referred to as “auto-
that link particular entities in the ontology matic classification.” The inferences obtained
344 D. L. Rubin et al.
10
.. Fig. 10.23 Knowledge-based reasoning with images semantic annotations on the image based on the trajec-
in a task to predict the portions of the heart that will tory of injury (injured anatomic structures shown in
become ischemic after a penetrating injury that injures bold in the left panel). b The anatomic structures that
particular anatomic structures. The application allows are predicted to be initially injured are displayed in the
the user to draw a trajectory of penetrating injury on the volume rendering (dark gray = total ischemia; light
image, a 3-D rendering of the heart obtained from seg- gray = partial ischemia). In this example, the right coro-
mented CT images. The reasoning application automati- nary artery was injured, and the reasoning application
cally carries out two tasks. a The application first correctly inferred there will be total ischemia of the
deduces the anatomic structures that will be injured con- anterior and lateral wall of the right ventricle and par-
sequent to the trajectory (arrow, right) by interrogating tial ischemia of the posterior wall of the left ventricle
Biomedical Imaging Informatics
345 10
from the reasoning process are obtained by between the heart chambers and will pro-
querying the inferred ontology and looking duce abnormal physiological blood flow.
for classes (or individuals) that have been The simulation community has created
assigned to classes of interest in the ontol- mathematical models to predict the physi-
ogy. For example, an application was created ological signals, such as time-varying pres-
to infer the consequences of cardiac injury in sure and flow, given particular parameters
this manner (. Fig. 10.23). in the model such as capacitance, resistance,
Several knowledge-based image reasoning etc. The knowledge in these mathematical
systems have been developed that use ontolo- models can be represented ontologically, in
gies as the knowledge source to process the which the entities correspond to nodes in
image content and derive inferences from the simulation model; the advantage is that
them. These include: (1) reasoning about the a graphical representation of the ontology,
anatomic consequences of penetrating injury, corresponding to a graphical representation
(2) inferring and simulating the physiologi- of the mathematical model, can be created.
cal changes that will occur given anatomic Morphological alterations seen in images
abnormalities seen in images, (3) automated can be directly translated into alterations in
disease grading/staging to infer the grade and/ the ontological representation of the ana-
or stage of disease based on imaging features tomic structures, and simultaneously can
of disease in the body (4) surgical planning by update the simulation model appropriately
deducing the functional significance of dis- to simulate the physiological consequences
ruption of white matter tracts in the brain, of the morphological anatomic alteration
(5) inferring the types of information users (Rubin et al. 2006b). Such knowledge-based
seek based on analyzing query logs of image image reasoning methods could greatly
searches, and (6) inferring the response of dis- enable functional evaluation of the static
ease in patients to treatment based on analysis abnormalities seen in medical imaging.
of serial imaging studies. We briefly describe
these applications. kAutomated disease grading/staging
A great deal of image-based knowledge is
kReasoning about anatomic consequences encoded in the literature and not readily avail-
of penetrating injury able to clinicians needing to apply it. A good
In this system, images were segmented and example of this is the criteria used to grade
semantic annotations applied to identify car- and stage disease based on imaging crite-
diac structures. An ontology of cardiac anat- ria. For example, there are detailed criteria
omy in OWL was used to encode knowledge specified for staging tumors and grading the
about anatomic structures and the portions severity of disease. This knowledge has been
of them that are supplied by different arte- encoded in OWL ontologies and used to
rial branches. Using knowledge about part-of automate grading of brain gliomas (Marquet
relationships and connectivity, the applica- et al. 2007) and staging of cancer (Dameron
tion uses the anatomy ontology to infer the et al. 2006) based on the imaging features
anatomic consequences of injury that are rec- detected by radiologists. This ontology-based
ognized on the input images (. Fig. 10.23) paradigm could provide a good model for
(Rubin et al. 2004, 2005, 2006a). delivering current biomedical knowledge
to practitioners “ just-in-time” to help them
kInferring and simulating the physiological grade and stage disease as they view images
changes and record their observations.
Morphological changes in anatomy have
physiological consequences. For example, kSurgical planning
if a hole appears in the septum dividing Understanding complex anatomic relati
the atria or ventricles of the heart (a sep- onships and their functional significance in
tal defect), then blood will flow abnormally the patient is crucial in surgical planning, par-
346 D. L. Rubin et al.
ticular for brain surgery, since there are many This application demonstrates the poten-
surgical approaches possible, and some will tial for a streamlined workflow of radiology
have less severe consequences to patients than image interpretation and lesion measurement
others. It can be challenging to be aware of all automatically feeding into decision support
these relationships and functional dependen- to guide patient care.
cies; thus, surgical planning is an opportune
area to develop knowledge-based image rea-
soning systems. The anatomic and functional 10.6 Conclusions
knowledge can be encoded in an ontology and
used by an application to plan the optimal This chapter focuses on methods for com-
surgical approach. In recent work, such an putational representation and for processing
ontological model was developed to assess the images in biomedicine, with an emphasis on
functional sequelae of disruptions of motor radiological imaging and the extraction and
pathways in the brain, which could be used characterization of anatomical structure and
in the future to guide surgical interventions abnormalities. It has been emphasized that
(Talos et al. 2008; Rubin et al. 2009b). the content of images is complex—compris-
ing both quantitative and semantic informa-
kInferring types of information users seek tion. Methods of making that content explicit
from images and computationally-accessible have been
Knowledge-based reasoning approaches described, and they are crucial to enable com-
have been used to evaluate image search puter applications to access the “biomedical
logs on Web sites that host image databases meaning” in images; presently, the vast archives
10 to ascertain the types of queries users sub- of images are poorly utilized because the image
mit. RadLex (7 Sect. 10.3.2) was used as the content is not explicit and accessible. As the
ontology, and by mapping the queries to leaf methods to extract quantitative and semantic
classes in RadLex and then traversing the sub- image information become more widespread,
sumption relations, the types of queries could image databases will be as useful to the discov-
be deduced by interrogating the higher-level ery process as the biological databases (they
classes in RadLex (such as “visual observa- will even likely become linked), and an era of
tion” and “anatomic entity”) (Rubin et al. “data-driven” and “high-throughput imaging”
2011). will be enabled, analogous to modern “high-
throughput” biology. In addition, the computa-
kInferring the response of disease treat- tional imaging methods will lead to applications
ment that leverage the image content, such as CAD/
As mentioned above, the complex knowl- CADx and knowledge-based image reasoning
edge required to grade and stage disease can that use image content to improve physicians’
be represented using an ontology. Similarly, capability to care for patients.
the criteria used to assess the response of Though this chapter has focused on radi-
patients to treatment is also complex, evolv- ology, we stress that the biomedical imaging
ing, and dependent on numerous aspects of informatics methods presented are generaliz-
image information. The knowledge needed to able and either have been or will be applied
apply criteria of disease response assessment to other domains in which visualization and
have been encoded ontologically, specifically imaging are becoming increasingly impor-
in OWL, and used to determine automati- tant, such as microscopy, pathology, ophthal-
cally the degree of cancer response to treat- mology, and dermatology. As new imaging
ment in patients (Levy et al. 2009, Levy and modalities increasingly become available
Rubin 2011). The inputs to the computer- for imaging other and more detailed body
ized reasoning method are the quantitative regions, the techniques presented in this chap-
information about lesions seen in the images, ter will increasingly be applied in all areas of
recorded as semantic annotations using the biomedicine. For example, the development
AIM information model (7 Sect. 10.3.2).
Biomedical Imaging Informatics
347 10
of molecular imaging methods is analogous Robb, R. A. (2000). Biomedical imaging, visualiza-
to functional brain imaging, since functional tion, and analysis. New York: Wiley-Liss.
data, in this case from gene expression rather Overview of biomedical imaging modalities
than cognitive activity, can be mapped to an and processing techniques.
anatomical substrate. Sahiner, B., Pezeshk, A., Hadjiiski, L. M., Wang,
Thus, the general biomedical imaging X., Drukker, K., Cha, K. H., Summers, R. M.,
informatics methods described here will & Giger, M. L. (2019). Deep learning in medi-
increasingly be applied to diverse areas of bio- cal imaging and radiation therapy. Medical
medicine. As they are applied, and as imaging Physics, 46(1), e1–e36.
modalities continue to proliferate, a growing Shapiro, L. G., & Stockman, G. C. (2001).
demand will be placed on leveraging the con- Computer vision. Upper Saddle River:
tent in these images to characterize the clinical Prentice-Hall. Detailed description of many
phenotype of disease and relate it to genotype of the representations and methods used in
and clinical data from patients to enhance image processing. Not specific to medicine,
research and clinical care. but most of the methods are applicable to
medical imaging.
nnSuggested Reading Soffer, S., Ben-Cohen, A., Shimon, O., Amitai,
Brinkley, J. F. (1991). Structural informatics and M. M., Greenspan, H., & Klang, E. (2019
its applications in medicine and biology. Mar). Convolutional neural networks for
Academic Medicine, 66(10), 589–591 Short radiologic images: A Radiologist’s guide.
introduction to the field. Radiology, 290(3), 590–606.
Brinkley, J. F., & Rosse, C. (2002). Imaging and Sonka, M., & Fitzpatrick, J. M. (2000). Handbook
the human Brain project: A review. Methods of medical imaging, volume 2. Medical image
of Information in Medicine, 41, 245–260 processing and analysis. Bellingham: SPIE
Review of image processing work related to Press. Overview of biomedical imaging modal-
the brain. Much of the brain-related material ities and processing techniques.
for this chapter was adapted from this article. Wang, S., & Summers, R. M. (2012). Machine
Deserno, T. M. (2011). Biomedical image process- learning and radiology. Medical Image
ing. Berlin: Springer-Verlag. Edited book of Analysis, 16(5), 933–951.
current approaches to variety of biomedical Yoo, T. S. (2004). Insight into images: Principles
image processing tasks. and practice for segmentation, registration, and
Dreyer, K. J., & Geis, J. R. (2017 Dec). When image analysis. Wellesley: A K Peters
machines think: Radiology’s next frontier. Comprehensive overview of digital image pro-
Radiology, 285(3), 713–718. cessing methods with examples from the
Gonzalez, R. C., Woods, R. E., et al. (2009). Insight Toolkit (ITK).
Digital Image processing using MATLAB. S.I.,
Gatesmark Publishing. A comprehensive ??Questions for Discussion
overview of image processing methods focus- 1. How might you create an image
ing on Matlab examples. processing pipeline to build an
Pham, D. L., Xu, C., & Prince, J. L. (2000). image-analysis program looking for
Current methods in medical image segmenta- abnormal cells in a PAP smear? How
tion. Annual Review of Biomedical Engineering, would you collect and incorporate
2, 315–337 Overview of medical image seg- semantic features into the program?
mentation. 2. Why is segmentation so difficult to per-
Potchen, E. J. (2000). Prospects for progress in form? Give two examples of ways by
diagnostic imaging. Journal of Internal which current systems avoid the prob-
Medicine, 247(4), 411–424 Nontechnical lem of automatic segmentation.
description of newer imaging methods such as 3. How might you build a decision-support
cardiac MRI, diffusion tensor imaging, fMRI, system that is based on searching the
and molecular imaging. Current and potential hospital image archive for similar images
use of these methods for diagnosis. and returning the diagnosis associated
348 D. L. Rubin et al.
with the most similar images? How computer-aided diagnosis system for digital X-ray
might you make use of the semantic mammograms via deep learning detection, segmen-
tation, and classification. International Journal of
information in images in images to
Medical Informatics, 117, 44–54.
improve the accuracy of retrieval? Alberini, J. L., Edeline, V., Giraudet, A. L., Champion,
4. What are limitations of deep learning L., Paulmier, B., Madar, O., Poinsignon, A., Bellet,
compared with other machine learning D., & Pecking, A. P. (2011). Single photon emission
methods for creating image interpreta- tomography/computed tomography (SPET/CT) and
positron emission tomography/computed tomogra-
tion systems? When might a classical
phy (PET/CT) to image cancer. Journal of Surgical
machine learning approach be better Oncology, 103(6), 602–606.
than deep learning? André, B., Vercauteren, T., Perchant, A., Buchner,
5. Give an example of how knowledge A. M., Wallace, M. B., & Ayache, N. (2009).
about the problem to be solved (e.g., Introducing space and time in local feature-based
Endomicroscopic image retrieval. Medical content-
local anatomy in the image) could be
based retrieval for clinical decision support.
used in future systems to aid in auto- B. Caputo, H. Mller, T. Syeda-Mahmood et al.
matic segmentation. Berlin, Heidelberg, Springer. Lecture Notes in
6. Both images and free text share the Computer Science, 5853, 18–30.
characteristic that they are unstructured Appel, B. (2001). Nomenclature and classification of
lumbar disc pathology. Neuroradiology, 43(12),
information; image processing methods
1124–1125.
to make the biomedical content in Armstrong, R. A. (2010). Review paper. Quantitative
images explicit are very similar to methods in neuropathology. Folia Neuropathologica,
related problems in natural language 48(4), 217–230.
processing (NLP; 7 Chap. 8). How are Ashburner, J., & Friston, K. J. (1997). Multimodal
image coregistration and partitioning - a unified
10 image processing methods and NLP
framework. NeuroImage, 6(3), 209–217.
similar in terms of (1) computer Avni. (2009). Addressing the ImageClef 2009 challenge
representation of the raw content? (2) using a patch-based visual words representation
representation of the semantic content? %U. http://www.clef-campaign.org/2009/working_
(3) processing of the content (e.g., what notes/avni-paperCLEF2009.pdf. Working notes
CLEF2009.
is the NLP equivalent of segmentation,
Baader, F. E., McGuinness, D. E., Nardi, D. E.,
or the image processing equivalent of Schneider, P. P. E., & Calvanese, D. E. (Eds.). (2003).
named entity recognition)? The description logic handbook: Theory, implementa-
tion and applications. Cambridge University Press:
New York.
Baker, J. A., Kornguth, P. J., Lo, J. Y., Williford,
References M. E., & Floyd, C. E., Jr. (1995). Breast cancer:
Prediction with artificial neural network based on
Abou-Elkacem, L., Bachawal, S. V., & Willmann, J. K. BI-RADS standardized lexicon. Radiology,
(2015). Ultrasound molecular imaging: Moving 196(3), 817–822.
toward clinical translation. European Journal of Baumann, B., Gotzinger, E., Pircher, M., Sattmann, H.,
Radiology, 84(9), 1685–1693. Schuutze, C., Schlanitz, F., Ahlers, C., Schmidt-
Agrawal, M., Harwood, D., Duraiswami, R., Davis, Erfurth, U., & Hitzenberger, C. K. (2010).
L. S., & Luther, P. W. (2000). Three-dimensional Segmentation and quantification of retinal lesions
ultrastructure from transmission electron microp- in age-related macular degeneration using
scope tilt series. Proceedings, Second Indian polarization- sensitive optical coherence tomogra-
Conference on Vision, Graphics and Image phy. Journal of Biomedical Optics, 15(6), 061704.
Processing. Bangaore, India. Bechhofer, S., van Harmelen, F., Hendler, J., Horrocks,
Aine, C. J. (1995). A conceptual overview and critique of I., McGuinness, D. L., Patel-Schneider, P. F., &
functional neuroimaging techniques in humans Stein, L. A. (2004). OWL web ontology language
I. MRI/fMRI and PET. Critical Reviews in reference. Technical report REC-owl-ref-20040210,
Neurobiology, 9, 229–309. the WorldwideWeb consortium, Available from
Akgul, C. B., Rubin, D. L., Napel, S., Beaulieu, C. F., h t t p : / / w w w.w 3 .o r g / T R / 2 0 0 4 / R E C - o w l -
Greenspan, H., & Acar, B. (2011). Content-based ref-20040210/
image retrieval in radiology: Current status and Becich, M. J. (2000). The role of the pathologist as tissue
future directions. Journal of Digital Imaging, 24(2), refiner and data miner: The impact of functional
208–222. genomics on the modern pathology laboratory and
Al-Antari, M. A., Al-Masni, M. A., Choi, M. T., Han, the critical roles of pathology informatics and bio-
S. M., & Kim, T. S. (2018). A fully integrated informatics. Molecular Diagnosis, 5(4), 287–299.
Biomedical Imaging Informatics
349 10
Bengio, Y., Yao, L., Alain, G., & Vincent, P. (2013). Brinkley, J. F., Bradley, S. W., Sundsten, J. W., & Rosse,
Generalized Denoising auto-encoders as generative C. (1997). The digital anatomist information system
models. In C. J. C. Burges, L. Bottou, M. Welling, and its use in the generation and delivery of web-
Z. Ghahramani, & K. Q. Weinberger (Eds.), based anatomy atlases. Computers and Biomedical
Advances in neural information processing systems 26 Research, 30, 472–503.
(pp. 899–907). Curran Associates, Inc. Brinkley, J. F., Wong, B. A., Hinshaw, K. P., & Rosse, C.
Bennett, T. J., & Barry, C. J. (2009). Ophthalmic imaging (1999). Design of an anatomy information system.
today: An ophthalmic photographer’s viewpoint – Computer Graphics and Applications, 19(3), 38–48.
A review. Clinical & Experimental Ophthalmology, Brown, D. B., Gould, J. E., Gervais, D. A., Goldberg,
37(1), 2–13. S. N., Murthy, R., Millward, S. F., Rilling, W. S.,
Betancur, J., Commandeur, F., Motlagh, M., Sharir, T., Geschwind, J. F. S., Salem, R., Vedantham, S.,
Einstein, A. J., Bokhari, S., Fish, M. B., Ruddy, Cardella, J. F., Soulen, M. C., Techn, S. I. R., &
T. D., Kaufmann, P., Sinusas, A. J., Miller, E. J., Tumor, I. W. G. I.-G. (2009). Transcatheter therapy
Bateman, T. M., Dorbala, S., Di Carli, M., for hepatic malignancy: Standardization of termi-
Germano, G., Otaki, Y., Tamarappoo, B. K., Dey, nology and reporting criteria (reprinted from J Vasc
D., Berman, D. S., & Slomka, P. J. (2018). Deep Interv Radiol, vol 18, pg 1469-1478, 2007). Journal
learning for prediction of obstructive disease from of Vascular and Interventional Radiology, 20(7),
fast myocardial perfusion SPECT: A multicenter S425–S434.
study. JACC: Cardiovascular Imaging. Burnside, E., Rubin, D., & Shachter, R. (2000). A
Bidgood, W. D., Jr., & Horii, S. C. (1992). Introduction Bayesian network for mammography. Proceedings
to the ACR-NEMA DICOM standard. of the AMIA Symposium, 106–110.
Radiographics, 12(2), 345–355. Burnside, E. S., Rubin, D. L., & Shachter, R. D. (2004a).
Billingsley, A. (2020). The Latest in Coronavirus Using a Bayesian network to predict the probability
(COVID-19) Testing Methods and Availability. and type of breast cancer represented by microcalci-
Retrieved 5/17/2020, from https://www.goodrx.com/ fications on mammography. Studies in Health
blog/coronavirus-covid-19-testing-updates-meth- Technology and Informatics, 107(Pt 1), 13–17.
ods-cost-availability/ Burnside, E. S., Rubin, D. L., Shachter, R. D., Sohlich,
Bishop, C. M. (1995). Neural networks for pattern recog- R. E., & Sickles, E. A. (2004b). A probabilistic
nition. Oxford/New York: Clarendon Press; Oxford expert system that provides automated
University Press. mammographic-histologic correlation: Initial expe-
Biswal, S., Resnick, D. L., Hoffman, J. M., & Gambhir, rience. AJR. American Journal of Roentgenology,
S. S. (2007). Molecular imaging: Integration of 182(2), 481–488.
molecular imaging into the musculoskeletal imaging Burnside, E. S., Rubin, D. L., Fine, J. P., Shachter, R. D.,
practice. Radiology, 244(3), 651–671. Sisney, G. A., & Leung, W. K. (2006). Bayesian net-
Bittorf, A., Bauer, J., Simon, M., & Diepgen, T. L. work to predict breast cancer risk of mammo-
(1997). Web-based training modules in dermatol- graphic microcalcifications and reduce number of
ogy. MD Computing, 14(5), 371–376. 381. benign biopsy results: Initial experience. Radiology,
Bloom, F. E., & Young, W. G. (1993). Brain browser. 240(3), 666–673.
New York: Academic. Burnside, E. S., Ochsner, J. E., Fowler, K. J., Fine, J. P.,
Bodenreider, O. (2008). Biomedical ontologies in action: Salkowski, L. R., Rubin, D. L., & Sisney, G. A.
Role in knowledge management, data integration (2007). Use of microcalcification descriptors in
and decision support. Yearbook of Medical BI-RADS 4th edition to stratify risk of malignancy.
Informatics, 67–79. Radiology, 242(2), 388–395.
Bogowicz, M., Riesterer, O., Stark, L. S., Studer, G., Burnside, E. S., Davis, J., Chhatwal, J., Alagoz, O.,
Unkelbach, J., Guckenberger, M., & Tanadini- Lindstrom, M. J., Geller, B. M., Littenberg, B.,
Lang, S. (2017). Comparison of PET and CT Shaffer, K. A., Kahn, C. E., Jr., & Page, C. D.
radiomics for prediction of local tumor control in (2009). Probabilistic computer model developed
head and neck squamous cell carcinoma. Acta from clinical data in national mammography data-
Oncologica, 56(11), 1531–1536. base format to classify mammographic findings.
Bosch, A., Munoz, X., Oliver, A., & Marti, J. (2006). Radiology, 251(3), 663–672.
Modeling and classifying breast tissue density in Buxton, R. B. (2009). Introduction to functional mag-
mammograms. Computer Vision and Pattern netic resonance imaging: Principles and techniques.
Recognition, IEEE Computer Society Conference on, Cambridge. In UK. New York: Cambridge
2, 1552–1558. University Press.
Brain Innovation B.V. (2001). BrainVoyager. From caBIG In-vivo Imaging Workspace. (2008). Annotation
http://www.BrainVoyager.de/ and Image Markup (AIM). Retrieved December 26,
Brinkley, J. F. (1993). The potential for three-dimen- 2008, from https://cabig.nci.nih.gov/tools/AIM
sional ultrasound. In F. A. Chervenak, G. C. Cabrera Fernandez, D., Salinas, H. M., & Puliafito,
Isaacson, & S. Campbell (Eds.), Ultrasound in C. A. (2005). Automated detection of retinal layer
obstetrics and gynecology. Boston: Little, Brown structures on optical coherence tomography images.
and Company. Optics Express, 13(25), 10200–10216.
350 D. L. Rubin et al.
Caputo, B., Tornmasi, T., & Orabona, F. (2008). space. Journal of Computer Assisted Tomography,
Discriminative cue integration for medical image 18(2), 192–205.
annotation. Pattern Recognition Letters, 29(15), Collins, D. L., Holmes, D. J., Peters, T. M., & Evans,
1996–2002. A. C. (1995). Automatic 3-D model-based neuro-
Carneiro, G., Chan, A. B., Moreno, P. J., & Vasconcelos, anatomical segmentation. Human Brain Mapping, 3,
N. (2007). Supervised learning of semantic classes 190–208.
for image annotation and retrieval. IEEE Comaniciu, D., & Meer, P. (2002). Mean shift: A robust
Transactions on Pattern Analysis and Machine approach toward feature space analysis. IEEE
Intelligence, 29(3), 394–410. Transactions on Pattern Analysis and Machine
Carpenter, A. E., Jones, T. R., Lamprecht, M. R., Intelligence, 24(5), 603–619.
Clarke, C., Kang, I. H., Friman, O., Guertin, D. A., Corina, D. P., Poliakov, A. V., Steury, K., Martin, R. F.,
Chang, J. H., Lindquist, R. A., Moffat, J., Golland, Brinkley, J. F., Mulligan, K. A., & Ojemann, G. A.
P., & Sabatini, D. M. (2006). CellProfiler: Image (2000). Correspondences between language cortex
analysis software for identifying and quantifying identified by cortical stimulation mapping and
cell phenotypes. Genome Biology, 7(10), R100. fMRI. Neuroimage (Human Brain Mapping Annual
Caviness, V. S., Meyer, J., Makris, N., & Kennedy, D. N. Meeting, June 12–16), 11(5), S295.
(1996). MRI-based topographic parcellation of Cox, R. W. (1996). AFNI: Software for analysis and
human neocortex: An anatomically specified visualization of functional magnetic resonance neu-
method with estimate of reliability. Journal of roimages. Computers and Biomedical Research, 29,
Cognitive Neuroscience, 8(6), 566–587. 162–173.
Cha, Y. J., Jang, W. I., Kim, M. S., Yoo, H. J., Paik, Dalal, N., & Triggs, B. (2005). Histograms of oriented
E. K., Jeong, H. K., & Youn, S. M. (2018). Prediction gradients for human detection. 2005 IEEE Computer
of response to stereotactic radiosurgery for Brain Society Conference on Computer Vision And Pattern
metastases using convolutional neural networks. Recognition (CVPR'05).
Anticancer Research, 38(9), 5437–5445. Dale, A. M., Fischl, B., & Sereno, M. I. (1999). Cortical
Chakraborty, A., Staib, L. H., & Duncan, J. S. (1996). surface-based analysis. I. Segmentation and surface
Deformable boundary finding in medical images by reconstruction. NeuroImage, 9(2), 179–194.
10 integrating gradient and region information. IEEE Dameron, O., Roques, E., Rubin, D. L., Marquet, G., &
Transactions on Medical Imaging, 15(6), Burgun, A. (2006). Grading lung tumors using
859–870. OWL-DL based reasoning. 9th International
Channin, D. S., Mongkolwat, P., Kleper, V., & Rubin, Protégé Conference. Stanford, CA.
D. L. (2009a). Computing human image annota- Datta, R., Joshi, D., Li, J., & Wang, J. Z. (2008). Image
tion. Conference Proceedings: Annual International retrieval: Ideas, influences, and trends of the new
Conference of the IEEE Engineering in Medicine and age. ACM Computing Surveys, 40(2).
Biology Society, 1, 7065–7068. Davatzikos, C., & Bryan, R. N. (1996). Using a deform-
Channin, D. S., Mongkolwat, P., Kleper, V., Sepukar, able surface model to obtain a shape representation
K., & Rubin, D. L. (2009b). The caBIG annotation of the cortex. IEEE Transactions on Medical
and image markup project. Journal of Digital Imaging, 15(6), 785–795.
Imaging. de Figueiredo, E. H., Borgonovi, A. F., & Doring, T. M.
Chen, S., Liu, W., Qin, J., Chen, L., Bin, G., Zhou, Y., (2011). Basic concepts of MR imaging, diffusion
Wang, T., & Huang, B. (2017). Research progress of MR imaging, and diffusion tensor imaging.
computer-aided diagnosis in cancer based on deep Magnetic Resonance Imaging Clinics of North
learning and medical imaging. Sheng Wu Yi Xue America, 19(1), 1–22.
Gong Cheng Xue Za Zhi, 34(2), 314–319. de Sisternes, L., Simon, N., Tibshirani, R., Leng, T., &
Choi, H. S., Haynor, D. R., & Kim, Y. (1991). Partial Rubin, D. L. (2014). Quantitative SD-OCT imaging
volume tissue classification of multichannel mag- biomarkers as indicators of age-related macular
netic resonance images – a mixel model. IEEE degeneration progression. Investigative
Transactions on Medical Imaging, 10(3), 395–407. Ophthalmology & Visual Science, 55(11), 7093–7103.
Cimino, J. J. (1996). Review paper: Coding systems in de Sisternes, L., Jonna, G., Greven, M. A., Chen, Q.,
health care. Methods of Information in Medicine, Leng, T., & Rubin, D. L. (2017a). Individual Drusen
35(4–5), 273–284. segmentation and repeatability and reproducibility
Clarysse, P., Friboulet, D., & Magnin, I. E. (1997). of their automated quantification in optical coher-
Tracking geometrical descriptors on 3-D deform- ence tomography images. Translational Vision
able surfaces: Application to the left-ventricular Science & Technology, 6(1), 12.
surface of the heart. IEEE Transactions on Medical de Sisternes, L., Jonna, G., Moss, J., Marmor, M. F.,
Imaging, 16(4), 392–404. Leng, T., & Rubin, D. L. (2017b). Automated intra-
Cohen, J. D. (2001). FisWidgets. 2001, from http:// retinal segmentation of SD-OCT images in normal
neurocog.lrdc.pitt.edu/fiswidgets/ and age-related macular degeneration eyes.
Collins, D. L., Neelin, P., Peters, T. M., & Evans, A. C. Biomedical Optics Express, 8(3), 1926–1949.
(1994). Automatic 3-D intersubject registration of Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977).
MR volumetric data in standardized Talairach Maximum likelihood from incomplete data via the
Biomedical Imaging Informatics
351 10
EM algorithm. Journal of the Royal Statistical Drury, H. A., & Van Essen, D. C. (1997). Analysis of
Society Series B, 39, 1–38. functional specialization in human cerebral cortex
Deng, J., Dong, W., Socher, R., Li, L., Kai, L., & Li, using the visible man surface based atlas. Human
F.-F. (2009). ImageNet: A large-scale hierarchical Brain Mapping, 5, 233–237.
image database. 2009 IEEE Conference on Computer Duda, R. O., Hart, P. E., & Stork, D. G. (2001). Pattern
Vision and Pattern Recognition. classification. New York: Wiley.
Depeursinge, A., Foncubierta-Rodriguez, A., Van De Dugas-Phocion, G., Ballester, M. A. G., Malandain, G.,
Ville, D., & Muller, H. (2014). Three-dimensional Lebrun, C., & Ayache, N. (2004). Improved
solid texture analysis in biomedical imaging: Review EM-based tissue segmentation and partial volume
and opportunities. Medical Image Analysis, 18(1), effect quantification in multi-sequence brain MRI.
176–196. Medical Image Computing and Computer-Assisted
Deselaers, T., Hegerath, A., Keysers, D., & Ney, H. Intervention – Miccai 2004, Pt 1, Proceedings, 3216,
(2006). Sparse patch-histograms for object classifi- 26–33.
cation in cluttered images. In: DAGM 2006, Pattern Durot, I., Wilson, S. R., & Willmann, J. K. (2018).
Recognition, 27th DAGM Symposium, Lecture Notes Contrast-enhanced ultrasound of malignant liver
in Computer Science, pp. 202–211. lesions. Abdominal Radiology (NY), 43(4), 819–847.
Deselaers, T., Muller, H., Clough, P., Ney, H., & Ehteshami Bejnordi, B., Veta, M., Johannes van Diest, P.,
Lehmann, T. M. (2007). The CLEF 2005 automatic van Ginneken, B., Karssemeijer, N., Litjens, G., van
medical image annotation task. International der Laak, J., The, C. C., Hermsen, M., Manson, Q. F.,
Journal of Computer Vision, 74(1), 51–58. Balkenhol, M., Geessink, O., Stathonikos, N., van
Deserno, T. M., Antani, S., & Long, R. (2009). Ontology Dijk, M. C., Bult, P., Beca, F., Beck, A. H., Wang, D.,
of gaps in content-based image retrieval. Journal of Khosla, A., Gargeya, R., Irshad, H., Zhong, A., Dou,
Digital Imaging, 22(2), 202–215. Q., Li, Q., Chen, H., Lin, H. J., Heng, P. A., Hass, C.,
Deshpande, N., Needles, A., & Willmann, J. K. (2010). Bruni, E., Wong, Q., Halici, U., Oner, M. U., Cetin-
Molecular ultrasound imaging: Current status and Atalay, R., Berseth, M., Khvatkov, V., Vylegzhanin,
future directions. Clinical Radiology, 65(7), 567– A., Kraus, O., Shaban, M., Rajpoot, N., Awan, R.,
581. Sirinukunwattana, K., Qaiser, T., Tsang, Y. W., Tellez,
Dhenain, M., Ruffins, S. W., & Jacobs, R. E. (2001). D., Annuscheit, J., Hufnagl, P., Valkonen, M.,
Three-dimensional digital mouse atlas using high- Kartasalo, K., Latonen, L., Ruusuvuori, P.,
resolution MRI. Developmental Biology, 232(2), Liimatainen, K., Albarqouni, S., Mungal, B., George,
458–470. A., Demirci, S., Navab, N., Watanabe, S., Seno, S.,
DICOM Standards Committee. (2017). DICOM PS3.21 Takenaka, Y., Matsuda, H., Phoulady, H. A., Kovalev,
2017e – Transformations between DICOM and V., Kalinovsky, A., Liauchuk, V., Bueno, G.,
other representations; A.6 AIM v4 to DICOM TID Fernandez-Carrobles, M. M., Serrano, I., Deniz, O.,
1500 mapping. From http://dicom.nema.org/medi- Racoceanu, D., & Venancio, R. (2017). Diagnostic
cal/Dicom/2017e/output/chtml/part21/ assessment of deep learning algorithms for detection
sect_A.6.html of lymph node metastases in women with breast can-
DICOM Standards Committee – Working Group 8 – cer. JAMA, 318(22), 2199–2210.
Structured Reporting. (2017). Digital Imaging and Endo, M., Aramaki, T., Asakura, K., Moriguchi, M.,
Communications in Medicine (DICOM); sup 200 – Akimaru, M., Osawa, A., Hisanaga, R., Moriya, Y.,
Transformation of NCI annotation and image Shimura, K., Furukawa, H., & Yamaguchi, K.
markup (AIM) and DICOM SR measurement tem- (2012). Content-based image-retrieval system in
plates. From ftp://medical.nema.org/medical/dicom/ chest computed tomography for a solitary pulmo-
Supps/LB/sup200_lb_AIM_DICOMSRTID1500.pdf nary nodule: Method and preliminary experiments.
Diepgen, T. L., & Eysenbach, G. (1998). Digital images International Journal of Computer Assisted
in dermatology and the dermatology online atlas on Radiology and Surgery, 7(2), 331–338.
the world wide web. The Journal of Dermatology, Esteva, A., Kuprel, B., Novoa, R. A., Ko, J., Swetter,
25(12), 782–787. S. M., Blau, H. M., & Thrun, S. (2017).
Doi, K. (2007). Computer-aided diagnosis in medical Dermatologist-level classification of skin cancer with
imaging: Historical review, current status and future deep neural networks. Nature, 542(7639), 115–118.
potential. Computerized Medical Imaging and Eysenbach, G., Bauer, J., Sager, A., Bittorf, A., Simon,
Graphics, 31(4–5), 198–211. M., & Diepgen, T. (1998). An international derma-
Donovan, T., & Manning, D. J. (2007). The radiology tological image atlas on the WWW: Practical use for
task: Bayesian theory and perception. The British undergraduate and continuing medical education,
Journal of Radiology, 80(954), 389–391. patient education and epidemiological research.
D'Orsi, C. J., & Newell, M. S. (2007). BI-RADS decoded: Studies in Health Technology and Informatics, 52(Pt
Detailed guidance on potentially confusing issues. 2), 788–792.
Radiologic Clinics of North America, 45(5), 751–763. Faruque, J., Rubin, D. L., Beaulieu, C. F., & Napel, S.
Drude, N., Tienken, L., & Mottaghy, F. M. (2017). (2013). Modeling perceptual similarity measures in
Theranostic and nanotheranostic probes in nuclear CT images of focal liver lesions. Journal of Digital
medicine. Methods, 130, 14–22. Imaging, 26(4), 714–720.
352 D. L. Rubin et al.
Faruque, J., Beaulieu, C. F., Rosenberg, J., Rubin, D. L., Friefeld, O., Greenspan, H., & Jacob, G. (2009).
Yao, D., & Napel, S. (2015). Content-based image Multiple sclerosis lesion detection using constrained
retrieval in radiology: Analysis of variability in GMM and curve evolution. Journal of Biomedical
human perception of similarity. Journal of Medical Imaging, 2009, 1–13.
Imaging (Bellingham), 2(2), 025501. Friston, K. J., Holmes, A. P., Worsley, K. J., Poline, J. P.,
Fave, X., Zhang, L., Yang, J., Mackin, D., Balter, P., Frith, C. D., & Frackowiak, R. S. J. (1995). Statistical
Gomez, D., Followill, D., Jones, A. K., Stingo, F., parametric maps in functional imaging: A general
Liao, Z., Mohan, R., & Court, L. (2017). Delta- linear approach. Human Brain Mapping, 2, 189–210.
radiomics features for the prediction of patient out- Gabril, M. Y., & Yousef, G. M. (2010). Informatics for
comes in non-small cell lung cancer. Scientific practicing anatomical pathologists: Marking a new
Reports, 7(1), 588. era in pathology practice. Modern Pathology, 23(3),
Federative Committee on Anatomical Terminology 349–358.
(1998). Terminologia Anatomica. Stuttgart, Thieme. Gastounioti, A., Oustimov, A., Hsieh, M. K., Pantalone,
Fedorov, A. (2012). 3D Slicer Annotation Image L., Conant, E. F., & Kontos, D. (2018). Using con-
Markup. Accessed 11/11/2012, from http://www.na- volutional neural networks for enhanced capture of
mic.o rg/Wiki/index.p hp/Projects:QIN:3D_Slicer_ breast parenchymal complexity patterns associated
Annotation_Image_Markup with breast cancer risk. Academic Radiology, 25(8),
Fedorov, A., Beichel, R., Kalpathy-Cramer, J., Finet, J., 977–984.
Fillion-Robin, J. C., Pujol, S., Bauer, C., Jennings, George, J. S., Aine, C. J., Mosher, J. C., Schmidt, D. M.,
D., Fennessy, F., Sonka, M., Buatti, J., Aylward, S., Ranken, D. M., Schlitz, H. A., Wood, C. C., Lewine,
Miller, J. V., Pieper, S., & Kikinis, R. (2012). 3D J. D., Sanders, J. A., & Belliveau, J. W. (1995).
Slicer as an image computing platform for the quan- Mapping function in human brain with magnetoen-
titative imaging network. Magnetic Resonance cephalography, anatomical magnetic resonance
Imaging, 30(9), 1323–1341. imaging, and functional magnetic resonance imag-
Fei-Fei, L., & Perona, P. (2003) A Bayesian hierarchical ing. Journal of Clinical Neurophysiology, 12(5), 406–
model for learning natural scene categories. Proc. of 431.
IEEE Computer Vision and Pattern Recognition, Gerstner, E. R., & Sorensen, A. G. (2011). Diffusion and
10 pp. 524–531. diffusion tensor imaging in brain cancer. Seminars
Ferrante, E., & Paragios, N. (2017). Slice-to-volume in Radiation Oncology, 21(2), 141–146.
medical image registration: A survey. Medical Image Gevaert, O., Xu, J., Hoang, C., Leung, A., Quon, A.,
Analysis, 39, 101–123. Rubin, D. L., Napel, S., & Plevritis, S. (2011).
Fiala, J. C., & Harris, K. M. (2001). Extending unbiased Integrating medical images and transcriptomic data
stereology of brain ultrastructure to three- in non-small cell lung cancer. AACR 102nd annual
dimensional volumes. Journal of the American meeting. Orlando, FL.
Medical Association, 8(1), 1–16. Gevaert, O., Mitchell, L. A., Xu, J., Yu, C., Rubin, D.,
Figurska, M., Robaszkiewicz, J., & Wierzbowska, J. Zaharchuk, G., Napel, S., & Plevritis, S. (2012a).
(2010). Optical coherence tomography in imaging Radiogenomic analysis indicates MR images are
of macular diseases. Klinika Oczna, 112(4–6), potentially predictive of EGFR mutation status in
138–146. glioblastoma multiforme. AACR 103nd annual
Firmino, M., Morais, A. H., Mendoca, R. M., Dantas, meeting. Chicago, IL.
M. R., Hekis, H. R., & Valentim, R. (2014). Gevaert, O., Hoang, C. D., Leung, A. N., Xu, J., Quon,
Computer-aided detection system for lung cancer in A., Rubin, D. L., Napel, S., & Plevritis, S. (2012b).
computed tomography scans: Review and future Non-small cell lung cancer: identifying prognostic
prospects. Biomedical Engineering Online, 13, 41. imaging biomarkers by leveraging public gene
FMRIDB Image Analysis Group. (2001). FSL – The expression microarray data–methods and prelimi-
FMRIB Software Libarary. From http://www.fmrib. nary results. Radiology, 264:387–396.
ox.ac.uk/fsl/index.html PMID:22723499. PMCID:PMC3401348.
Fougerousse, F., Bullen, P., Herasse, M., Lindsay, S., Giger, M. L. (2018). Machine learning in medical imag-
Richard, I., Wilson, D., Suel, L., Durand, M., Robson, ing. Journal of the American College of Radiology,
S., Abitbol, M., Beckmann, J. S., & Strachan, T. (2000). 15(3 Pt B), 512–520.
Human-mouse differences in the embryonic expression Giger, M., & MacMahon, H. (1996). Image processing
of developmental control genes and disease genes. and computer-aided diagnosis. The Radiologic
Human Molecular Genetics, 9(2), 165–173. Clinics of North America, 34(3), 565–596.
Fox, P. T. (Ed.). (2001). Human brain mapping. Gimenez, F., Xu, J., Liu, T. T., Beaulieu, C., Rubin,
New York: John Wiley & Sons. D. L., Napel, S., & Liu, Y. (2011a). Prediction of
Frackowiak, R. S. J., Friston, K. J., Frith, C. D., Dolan, radiologist observations using computational image
R. J., & Mazziotta, J. C. (Eds.). (1997). Human brain features: Method and preliminary results. Ninety-
function. Academic Press: New York. seventy annual scientific meeting of the RSNA,
Freton, A., & Finger, P. T. (2011). Spectral domain- Chicago, IL.
optical coherence tomography analysis of choroidal Gimenez, F., Xu, J., Liu, Y., Liu, T. T., Beaulieu, C.,
osteoma. The British Journal of Ophthalmology. Rubin, D. L, & Napel, S. (2011b). On the Feasibility
Biomedical Imaging Informatics
353 10
of Predicting Radiological Observations from Hansen, L. K., Nielsen, F. A., Toft, P., Liptrot, M. G.,
Computational Imaging Features of Liver Lesions Goutte, C., Strother, S. C., Lange, N., Gade, A.,
in CT scans. First IEEE Conference on Healthcare Rottenberg, D. A., & Paulson, O. B. (1999).
Informatics, Imaging, and Systems Biology (HISB), Lyngby – Modeler’s Matlab toolbox for spatio-
IEEE Computer Society, San Jose, CA. temporal analysis of functional neuroimages.
Goldberg, S. N., Grassi, C. J., Cardella, J. F., NeuroImage, 9(6), S241.
Charboneau, J. W., Dodd, G. D., 3rd, Dupuy, D. E., Haralick, R. M. (1988). Mathematical morphology.
Gervais, D. A., Gillams, A. R., Kane, R. A., Lee, University of Washington.
F. T., Jr., Livraghi, T., McGahan, J., Phillips, D. A., Haralick, R. M., & Shapiro, L. G. (1992). Computer and
Rhim, H., Silverman, S. G., Solbiati, L., Vogl, T. J., robot vision. Reading: Addison-Wesley.
Wood, B. J., Vedantham, S., & Sacks, D. (2009). Haralick, R. M., Shanmugam, K., & Dinstein, I. (1973).
Image-guided tumor ablation: Standardization of Textural features for image classification. IEEE
terminology and reporting criteria. Journal of Transactions on Systems, Man, and Cybernetics,
Vascular and Interventional Radiology, 20(7 Suppl), SMC-3(6), 610–621.
S377–S390. Harney, A. S., & Meade, T. J. (2010). Molecular imaging
Gombas, P., Skepper, J. N., Krenacs, T., Molnar, B., & of in vivo gene expression. Future Medicinal
Hegyi, L. (2004). Past, present and future of digital Chemistry, 2(3), 503–519.
pathology. Orvosi Hetilap, 145(8), 433–443. Hasan, K. M., Walimuni, I. S., Abid, H., & Hahn, K. R.
Gonzalez, R. C., Woods, R. E., & Eddins, S. L. (2009). (2010). A review of diffusion tensor magnetic reso-
Digital image processing using MATLAB. S.I., nance imaging computational methods and soft-
Gatesmark Publishing. ware tools. Computers in Biology and Medicine.
Grau, B., Horrocks, I., Motik, B., Parsia, B., Heiss, W. D., & Phelps, M. E. (Eds.). (1983). Positron
Patelschneider, P., & Sattler, U. (2008). Chapter 3: emission tomography of the brain. Berlin/New York:
Description logics. In B. Porter, V. Lifschitz, & Springer.
F. Van Harmelen (Eds.), Handbook of knowledge Held, K., Rota Kops, E., Krause, B. J., Wells, W. M., 3rd,
representation (p. 1005). Amsterdam/Boston: Kikinis, R., & Muller-Gartner, H. W. (1997).
Elsevier: xxviii. Markov random field segmentation of brain MR
Greenspan, H., & Pinhas, A. T. (2007). Medical image images. IEEE Transactions on Medical Imaging,
categorization and retrieval for PACS using the 16(6), 878–886.
GMM-KL framework. IEEE Transactions on Hersh, W., Muller, H., & Kalpathy-Cramer, J. (2009).
Information Technology in Biomedicine, 11(2), 190– The ImageCLEFmed medical image retrieval task
202. test collection. Journal of Digital Imaging, 22(6),
Greenspan, H., Ruf, A., & Goldberger, J. (2006). 648–655.
Constrained Gaussian mixture model framework Hinshaw, K. P., Poliakov, A. V., Martin, R. F., Moore,
for automatic segmentation of MR brain images. E. B., Shapiro, L. G., & Brinkley, J. F. (2002).
IEEE Trans Med Imaging, 25(9): 1233–1245. Shape-based cortical surface segmentation for visu-
Greenspan, H., Avni, U., Konen, E., Sharon, M., & alization brain mapping. NeuroImage, 16(2), 295–
Goldberger, J. (2011). X-ray categorization and 316.
retrieval on the organ and pathology level, using Hoang, C., Napel, S., Gevaert, O., Xu, J., Rubin, D. L.,
patch-based visual words. IEEE Transactions on Leung, A., Merritt, R., Whyte, R., Shrager, J., &
Medical Imaging, 30(3), 733–746. Plevritis, S. (2011). NSCLC gene profiles correlate
Greenspan, H., Ginneken, B. V., & Summers, R. M. with specific CT characteristics: Image-omics.
(2016). Guest editorial deep learning in medical Philadelphia: American Association for Thoracic
imaging: Overview and future promise of an excit- Surgery (AATS).
ing new technique. IEEE Transactions on Medical Hoffman, J. M., & Gambhir, S. S. (2007). Molecular
Imaging, 35(5), 1153–1159. imaging: The vision and opportunity for radiology
Greenspan, H., Estepar, R. S., Niessen, W. J., Siegel, E., in the future. Radiology, 244(1), 39–47.
& Nielsen, M. (2020). Position paper on COVID-19 Hohne, K., Bomans, M., Pommert, A., Riemer, M.,
imaging and AI: From the clinical needs and tech- Schiers, C., Tiede, U., & Wiebecke, G. (1990). 3-D
nological challenges to initial AI solutions at the lab visualization of tomographic volume data using the
and national level towards a new era for AI in generalized voxel model. The Visual Computer, 6(1),
healthcare. Medical Image Analysis, 66. 28–36.
Grossmann, P., Stringfield, O., El-Hachem, N., Bui, Hohne, K. H., Bomans, M., Riemer, M., Schubert, R.,
M. M., Rios Velazquez, E., Parmar, C., Leijenaar, Tiede, U., & Lierse, W. (1992). A volume-based ana-
R. T., Haibe-Kains, B., Lambin, P., Gillies, R. J., & tomical atlas. IEEE Computer Graphics and
Aerts, H. J. (2017). Defining the biological basis of Applications, 72–78.
radiomic phenotypes in lung cancer. eLife, 6. Hohne, K. H., Pflesser, B., Riemer, M., Schiemann, T.,
Hansell, D. M., Bankier, A. A., MacMahon, H., Schubert, R., & Tiede, U. (1995). A new representa-
McLoud, T. C., Muller, N. L., & Remy, J. (2008). tion of knowledge concerning human anatomy and
Fleischner society: Glossary of terms for thoracic function. Nature Medicine, 1(6),
imaging. Radiology, 246(3), 697–722. 506–510.
354 D. L. Rubin et al.
Hoo-Chang, S., Roth, H. R., Gao, M., Lu, L., Xu, Z., Jokerst, J. V., & Gambhir, S. S. (2011). Molecular imag-
Nogues, I., Yao, J., Mollura, D., & Summers, ing with theranostic nanoparticles. Accounts of
R. M. (2016). Deep convolutional neural net- Chemical Research, 44(10), 1050–1060.
works for computer-aided detection: CNN archi- Jun, W., Xia, L., Di, D., Jiangdian, S., Min, X., Yali, Z.,
tectures, dataset characteristics and transfer & Jie, T. (2016). Prediction of malignant and benign
learning. IEEE Transactions on Medical Imaging, of lung tumor using a quantitative radiomic
35(5), 1285–1298. method. Conference Proceedings: Annual
Hoogi, A., Subramaniam, A., Veerapaneni, R., & International Conference of the IEEE Engineering in
Rubin, D. L. (2017). Adaptive estimation of active Medicine and Biology Society, 2016, 1272–1275.
contour parameters using convolutional neural net- Jurie, F., & Triggs, B. (2005). Creating efficient code-
works and texture analysis. IEEE Transactions on books for visual recognition. Proceedings of the
Medical Imaging, 36(3), 781–791. Tenth IEEE International Conference on Computer
Hosny, A., Parmar, C., Quackenbush, J., Schwartz, Vision (ICCV'05) Volume 1 – Volume 01, IEEE
L. H., & Aerts, H. (2018). Artificial intelligence in Computer Society: 604–610%@ 600–7695-2334-X-
radiology. Nature Reviews. Cancer, 18(8), 500–510. 7601.
Horii SC. Image acquisition. Sites, technologies, and Kahn, C. E., Jr., Langlotz, C. P., Burnside, E. S.,
approaches. Radiol Clin North Am, 1996;34:469– Carrino, J. A., Channin, D. S., Hovsepian, D. M., &
494. PMID:8657867. Rubin, D. L. (2009). Toward best practices in radiol-
Hu, Z., Abramoff, M. D., Kwon, Y. H., Lee, K., & ogy reporting. Radiology, 252(3), 852–856.
Garvin, M. K. (2010a). Automated segmentation of Kahn, C. E., & Rubin, D. L. (2009). Automated seman-
neural canal opening and optic cup in 3D spectral tic indexing of figure captions to improve radiology
optical coherence tomography volumes of the optic image retrieval. Journal of the American Medical
nerve head. Investigative Ophthalmology & Visual Informatics Association, 16(3), 380–386.
Science, 51(11), 5708–5717. Kang, J. H., & Chung, J. K. (2008). Molecular-genetic
Hu, Z., Niemeijer, M., Abramoft, M. D., Lee, K., & imaging based on reporter gene expression. Journal
Garvin, M. K. (2010b). Automated segmentation of of Nuclear Medicine, 49(Suppl 2), 164S–179S.
3-D spectral OCT retinal blood vessels by neural Kapur, T., Grimson, W. E., Wells, W. M., 3rd, & Kikinis,
10 canal opening false positive suppression. Medical R. (1996). Segmentation of brain tissue from mag-
Image Computing and Computer-Assisted netic resonance images. Medical Image Analysis,
Intervention, 13(Pt 3), 33–40. 1(2), 109–127.
Huang, Y. Q., Liang, C. H., He, L., Tian, J., Liang, C. S., Kass, M., Witkin, A., & Terzopoulos, D. (1987). Snakes:
Chen, X., Ma, Z. L., & Liu, Z. Y. (2016). Active contour models. International Journal of
Development and validation of a radiomics nomo- Computer Vision, 1(4), 321–331.
gram for preoperative prediction of lymph node Katragadda, C., Finnane, A., Soyer, H. P., Marghoob,
metastasis in colorectal cancer. Journal of Clinical A. A., Halpern, A., Malvehy, J., Kittler, H.,
Oncology, 34(18), 2157–2164. Hofmann-Wellenhof, R., Da Silva, D., Abraham, I.,
Hudson, D. L., & Cohen, M. E. (2009). Multidimensional Curiel-Lewandrowski, C., & G. International
medical decision making. Conference Proceedings: Society of Digital Imaging of the Skin -International
Annual International Conference of the IEEE Skin Imaging Collaboration. (2016). Technique
Engineering in Medicine and Biology Society, 1, standards for skin lesion imaging: A Delphi consen-
3405–3408. sus statement. JAMA Dermatology.
International Anatomical Nomenclature Committee. Kayalibay, B., Jensen, G., & van der Smagt, P. (2017).
(1989). Nomina Anatomica. Edinburgh: Churchill CNN-based segmentation of medical imaging data.
Livingstone. arXiv:1701.03056 [cs].
Ishioka, J., Matsuoka, Y., Uehara, S., Yasuda, Y., Kennedy, D. (2001). Internet brain segmentation reposi-
Kijima, T., Yoshida, S., Yokoyama, M., Saito, K., tory. 2001, from http://neuro-www.mgh.harvard.
Kihara, K., Numao, N., Kimura, T., Kudo, K., edu/cma/ibsr
Kumazawa, I., & Fujii, Y. (2018). Computer-aided Kevles, B. (1997). Naked to the bone: Medical imaging in
diagnosis of prostate cancer on magnetic reso- the twentieth century. New Brunswick: Rutgers
nance imaging using a convolutional neural net- University Press.
work algorithm. BJU International, 122(3), Kimborg, D. Y., & Aguirre, G. K.. (2002). A flexible
411–417. architecture for neuroimaging data analysis and pre-
Jiang, Y.-G., Ngo, C.-W., & Yang, J. (2007). Towards opti- sentation. From http://www.nimh.nih.gov/neuroin-
mal bag-of-features for object categorization and formatics/kimberg.cfm
semantic video retrieval. Proceedings of the 6th ACM King, W., Proffitt, J., Morrison, L., Piper, J., Lane, D., &
international conference on Image and video retrieval. Seelig, S. (2000). The role of fluorescence in situ
Amsterdam, The Netherlands, ACM, pp. hybridization technologies in molecular diagnostics
494–501. and disease management. Molecular Diagnosis, 5(4),
Johnson, K. A., & Becker, J. A.. (2001). The Whole 309–319.
Brain Atlas. 2001, from http://www.med.harvard. Klinger, C. (2010). AIM on ClearCanvas Workstation
edu/AANLIB/home.html Documentation. Accessed 11/11/2012, from https://
Biomedical Imaging Informatics
355 10
wiki.nci.nih.gov/display/AIM/AIM+on+ClearCanv Ledley, R. S., & Lusted, L. B. (1991). Reasoning founda-
as+Workstation+Documentation tions of medical diagnosis. MD Computing, 8(5),
Kontos, D., Summers, R. M., & Giger, M. (2017). 300–315.
Special section guest editorial: Radiomics and deep Lee, D. H. (2003). Magnetic resonance angiography.
learning. Journal of Medical Imaging (Bellingham), Advances in Neurology, 92, 43–52.
4(4), 041301. Lee, J. K. T. (2006). Computed body tomography with
Korner, M., Weber, C. H., Wirth, S., Pfeifer, K. J., Reiser, MRI correlation. Philadelphia: Lippincott Williams
M. F., & Treitl, M. (2007). Advances in digital radi- & Wilkins.
ography: Physical principles and system overview. Lee, Y., Kim, N., Cho, K. S., Kang, S. H., Kim, D. Y.,
Radiographics, 27(3), 675–686. Jung, Y. Y., & Kim, J. K. (2009). Bayesian classifier
Koslow, S. H., & Huerta, M. F. (Eds.). (1997). for predicting malignant renal cysts on MDCT:
Neuroinformatics: An overview of the human Brain Early clinical experience. AJR. American Journal of
project. Mahwah: Lawrence Erlbaum. Roentgenology, 193(2), W106–W111.
Kremkau, F. W. (2006). Diagnostic ultrasound principles Lee, J. H., Baek, J. H., Kim, J. H., Shim, W. H., Chung,
and instruments. St. Louis: Saunders Elsevier. S. R., Choi, Y. J., & Lee, J. H. (2018). Deep learning-
Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). based computer-aided diagnosis system for localiza-
ImageNet classification with deep convolutional tion and diagnosis of metastatic lymph nodes on
neural networks. Advances in Neural Information ultrasound: A pilot study. Thyroid, 28(10), 1332–
Processing Systems 25, Curran Associates, Inc., 1338.
pp. 1097–1105. Lehmann, T. M., Guld, M. O., Thies, C., Fischer, B.,
Kulikowski, C. A. (1997). Medical imaging informatics: Spitzer, K., Keysers, D., Ney, H., Kohnen, M.,
Challenges of definition and integration. Journal of Schubert, H., & Wein, B. B. (2004). Content-based
the American Medical Informatics Association, 4(3), image retrieval in medical applications. Methods of
252–253. Information in Medicine, 43(4), 354–361.
Kumar, V., Gu, Y., Basu, S., Berglund, A., Eschrich, Leong, F. J., & Leong, A. S. (2003). Digital imaging
S. A., Schabath, M. B., Forster, K., Aerts, H. J., applications in anatomic pathology. Advances in
Dekker, A., Fenstermacher, D., Goldgof, D. B., Anatomic Pathology, 10(2), 88–95.
Hall, L. O., Lambin, P., Balagurunathan, Y., Levy, M. A., & Rubin, D. L. (2008). Tool support to
Gatenby, R. A., & Gillies, R. J. (2012). Radiomics: enable evaluation of the clinical response to treat-
The process and the challenges. Magnetic Resonance ment. American Medical Informatics Association
Imaging, 30(9), 1234–1248. Annual Symposium Proceedings, 399–403.
Kumar, A., Kim, J., Cai, W., Fulham, M., & Feng, D. Levy, M. A., & Rubin, D. L. (2011a). Computational
(2013). Content-based medical image retrieval: A approaches to assist in the evaluation of cancer
survey of applications to multidimensional and treatment response. Imaging in Medicine, 3(2), 233–
multimodality data. Journal of Digital Imaging, 246.
26(6), 1025–1039. Levy, M. A., & Rubin, D. L. (2011b). Current and future
Kumar, A., Gupta, P. K., & Srivastava, A. (2020). A trends in imaging informatics for oncology. Cancer
review of modern technologies for tackling COVID-19 Journal, 17(4), 203–210.
pandemic. Diabetes and Metabolic Syndrome: Clinical Levy, M. A., O'Connor, M. J., & Rubin, D. L. (2009).
Research and Reviews, 14(4), 569–573. Semantic reasoning with image annotations for
Lambin, P., Rios-Velazquez, E., Leijenaar, R., Carvalho, tumor assessment. American Medical Informatics
S., van Stiphout, R. G., Granton, P., Zegers, C. M., Association Annual Symposium Proceedings, 2009,
Gillies, R., Boellard, R., Dekker, A., & Aerts, H. J. 359–363.
(2012). Radiomics: Extracting more information Levy, M. A., Freymann, J. B., Kirby, J. S., Fedorov, A.,
from medical images using advanced feature analysis. Fennessy, F. M., Eschrich, S. A., Berglund, A. E.,
European Journal of Cancer, 48(4), 441–446. Fenstermacher, D. A., Tan, Y., Guo, X., Casavant,
Langlotz, C. P. (2006). RadLex: A new method for T. L., Brown, B. J., Braun, T. A., Dekker, A.,
indexing online educational materials. Roelofs, E., Mountz, J. M., Boada, F., Laymon, C.,
Radiographics, 26(6), 1595–1597. Oborski, M., & Rubin, D. L. (2012). Informatics
Langlotz, C. P. (2009). Structured radiology reporting: methods to enable sharing of quantitative imaging
Are we there yet? Radiology, 253(1), 23–25. research data. Magnetic Resonance Imaging, 30(9),
Larabell, C. A., & Nugent, K. A. (2010). Imaging cellu- 1249–1256.
lar architecture with X-rays. Current Opinion in Lexe, G., Monaco, J., Doyle, S., Basavanhally, A., Reddy,
Structural Biology, 20(5), 623–631. A., Seiler, M., Ganesan, S., Bhanot, G., &
Le Bihan, D., Mangin, J. F., Poupon, C., Clark, C. A., Madabhushi, A. (2009). Towards improved cancer
Pappata, S., Molko, N., & Chabriat, H. (2001). diagnosis and prognosis using analysis of gene
Diffusion tensor imaging: Concepts and applica- expression data and computer aided imaging.
tions. Journal of Magnetic Resonance Imaging, Experimental Biology and Medicine (Maywood,
13(4), 534–546. N.J.), 234(8), 860–879.
LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learn- Li, B. N., Chui, C. K., Chang, S., & Ong, S. H. (2011a).
ing. Nature, 521(7553), 436–444. Integrating spatial fuzzy clustering with level set
356 D. L. Rubin et al.
methods for automated medical image segmenta- MacDonald, D. (1993). Register, McConnel Brain
tion. Computers in Biology and Medicine, 41(1), Imaging Center, Montreal Neurological
1–10. Institute.
Li, C., Huang, R., Ding, Z., Gatenby, J. C., Metaxas, MacDonald, D., Kabani, N., Avis, D., & Evans, A. C.
D. N., & Gore, J. C. (2011b). A level set method for (2000). Automated 3-D extraction of inner and
image segmentation in the presence of intensity outer surfaces of cerebral cortex from MRI.
Inhomogeneities with application to MRI. IEEE NeuroImage, 12(3), 340–356.
Transactions on Image Processing, 20(7), 2007–2016. Margolis, D. J., Hoffman, J. M., Herfkens, R. J., Jeffrey,
Li, H., Zhu, Y., Burnside, E. S., Huang, E., Drukker, K., R. B., Quon, A., & Gambhir, S. S. (2007). Molecular
Hoadley, K. A., Fan, C., Conzen, S. D., Zuley, M., imaging techniques in body imaging. Radiology,
Net, J. M., Sutton, E., Whitman, G. J., Morris, E., 245(2), 333–356.
Perou, C. M., Ji, Y., & Giger, M. L. (2016). Marquet, G., Dameron, O., Saikali, S., Mosser, J., &
Quantitative MRI radiomics in the prediction of Burgun, A. (2007). Grading glioma tumors using
molecular classifications of breast cancer subtypes OWL-DL and NCI thesaurus. American Medical
in the TCGA/TCIA data set. NPJ Breast Cancer, 2. Informatics Association Annual Symposium
Liang, W., Liang, H., Ou, L., Chen, B., Chen, A., Li, C., Proceedings, 508–512.
Li, Y., Guan, W., Sang, L., Lu, J., Xu, Y., Chen, G., Marroquin, J. L., Vemuri, B. C., Botello, S., Calderon,
Guo, H., Guo, J., Chen, Z., Zhao, Y., Li, S., Zhang, F., & Fernandez-Bouzas, A. (2002). An accurate and
N., Zhong, N., He, J., & C. China Medical Treatment efficient bayesian method for automatic segmenta-
Expert Group for. (2020). Development and valida- tion of brain MRI. IEEE Transactions on Medical
tion of a clinical risk score to predict the occurrence Imaging, 21(8), 934–945.
of critical illness in hospitalized patients with Martin, R. F., & Bowden, D. M. (2001). Primate Brain
COVID-19. JAMA Internal Medicine. maps: Structure of the macaque Brain. New York:
Lichtenbelt, B., Crane, R., & Naqvi, S. (1998). Elsevier Science.
Introduction to volume rendering. Prentice Hall: Martin, R. F., Mejino, J. L. V., Bowden, D. M., Brinkley,
Upper Saddle River. J. F., & Rosse, C. (2001). Foundational model of
Liu, Y. I., Kamaya, A., Desser, T. S., & Rubin, D. L. neuroanatomy: implications for the Human Brain
10 (2009). A controlled vocabulary to represent sono- Project. Proc AMIA Annu Fall Symp. Washington,
graphic features of the thyroid and its application in DC, pp. 438–442.
a Bayesian network to predict thyroid nodule malig- Marwede, D., Schulz, T., & Kahn, T. (2008). Indexing
nancy. Summit on Translat Bioinforma, 2009, 68–72. thoracic CT reports using a preliminary version of a
Liu, Y. I., Kamaya, A., Desser, T. S., & Rubin, D. L. standardized radiological lexicon (RadLex). Journal
(2011). A bayesian network for differentiating of Digital Imaging, 21(4), 363–370.
benign from malignant thyroid nodules using sono- Massoud, T. F., & Gambhir, S. S. (2003). Molecular
graphic and demographic features. AJR. American imaging in living subjects: Seeing fundamental bio-
Journal of Roentgenology, 196(5), W598–W605. logical processes in a new light. Genes and
Liu, Z., Jin, C., Wu, C. C., Liang, T., Zhao, H., Wang, Development, 17, 545–580.
Y., Wang, Z., Li, F., Zhou, J., Cai, S., Zeng, L., & McLachlan, G. J., & Peel, D. (2000). Finite mixture mod-
Yang, J. (2020). Association between initial chest els. New York: Wiley.
CT or clinical features and clinical course in patients Mechouche, A., Golbreich, C., Morandi, X., & Gibaud,
with coronavirus disease 2019 pneumonia. Korean B. (2008). Ontology-based annotation of brain
Journal of Radiology, 21(6), 736–745. MRI images. AMIA Annu Symp Proc, pp. 460–464.
Lohmann, P., Kocher, M., Steger, J., & Galldiks, N. Mehta, T. S., Raza, S., & Baum, J. K. (2000). Use of
(2018). Radiomics derived from amino-acid PET Doppler ultrasound in the evaluation of breast car-
and conventional MRI in patients with high-grade cinoma. Seminars in Ultrasound, CT, and MR, 21(4),
gliomas. The Quarterly Journal of Nuclear Medicine 297–307.
and Molecular Imaging, 62(3), 272–280. Mei, X., H.-C. Lee, K. Diao, M. Huang, B. Lin, C. Liu,
Lowe, D. (1999). Object recognition from local scale invari- Z. Xie, Y. Ma, P. M. Robson, M. Chung, A. Bernheim,
ant features. Proceedings of the International Conference V. Mani, C. Calcagno, K. Li, S. Li, H. Shan, J. Lv,
on Computer Vision, Greece, pp. 1150–1157. T. Zhao, J. Xia, Q. Long, S. Steinberger, A. Jacobi,
Lowe, H. J., Antipov, I., Hersh, W., & Smith, C. A. T. Deyer, M. Luksza, F. Liu, B. P. Little, Z. A. Fayad
(1998). Towards knowledge-based retrieval of medi- and Y. Yang (2020). Artificial intelligence-enabled
cal images. The role of semantic indexing, image rapid diagnosis of COVID-19 patients. medRxiv:
content representation and knowledge-based 2020.2004.2012.20062661.
retrieval. Proc AMIA Symp, pp. 882–886. Milletari, F., Navab, N., & Ahmadi, S. A. (2016). V-Net:
Luo, Z., Wang, N., Liu, P., Guo, Q., Ran, L., Wang, F., Fully convolutional neural networks for volumetric
Tang, Y., & Li, Q. (2020). Association between chest medical image segmentation. 2016 Fourth
CT features and clinical course of coronavirus dis- International Conference on 3D Vision (3DV).
ease 2019. Respiratory Medicine, 168, 105989. Min, J. J., & Gambhir, S. S. (2008). Molecular imaging
Lusted, L. B. (1960). Logical analysis in roentgen diag- of PET reporter gene expression. Handbook of
nosis. Radiology, 74, 178–193. Experimental Pharmacology, 185 Pt 2, 277–303.
Biomedical Imaging Informatics
357 10
MIT Technology Review. (2013). Deep Learning. With Neural Information Processing Systems
massive amounts of computational power, machines (NIPS 13).
can now recognize objects and translate speech in Nie, D., Zhang, H., Adeli, E., Liu, L., & Shen, D. (2016).
real time. Artificial intelligence is finally getting 3D deep learning for multi-modal imaging-guided
smart. Retrieved 10/5/2018, from https://www. survival time prediction of brain tumor patients.
technologyreview.com/s/513696/deep-learning/ Medical Image Computing and Computer-Assisted
Modayur, B., Prothero, J., Ojemann, G., Maravilla, K., Intervention, 9901, 212–220.
& Brinkley, J. F. (1997). Visualization-based map- Nielsen, B., Albregtsen, F., & Danielsen, H. E. (2008).
ping of language function in the brain. NeuroImage, Statistical nuclear texture analysis in cancer
6, 245–258. research: A review of methods and applications.
Motik, B., Grau, B. C., Horrocks, I., Parsia, B., Patel- Critical Reviews in Oncogenesis, 14(2–3),
Schneider, P., & Sattler, U. (2008). OWL 2: The next 89–164.
step for OWL. Journal of Web Semantics, 6(4), Nishio, M., Sugiyama, O., Yakami, M., Ueno, S., Kubo,
309–322. T., Kuroda, T., & Togashi, K. (2018). Computer-
Motik, B., Shearer, R., & Horrocks, I. (2009). aided diagnosis of lung nodule classification
Hypertableau reasoning for description logics. between benign nodule, primary lung cancer, and
Journal of Artificial Intelligence Research, 36, metastatic lung cancer at different image size using
165–228. deep convolutional neural network with transfer
Muller, H., Michoux, N., Bandon, D., & Geissbuhler, A. learning. PLoS One, 13(7), e0200721.
(2004). A review of content-based image retrieval Niu, S., de Sisternes, L., Chen, Q., Leng, T., & Rubin,
systems in medical applications-clinical benefits and D. L. (2016). Automated geographic atrophy seg-
future directions. International Journal of Medical mentation for SD-OCT images using region-based
Informatics, 73(1), 1–23. C-V model via local similarity factor. Biomedical
Muramatsu, C. (2018). Overview on subjective similar- Optics Express, 7(2), 581–600.
ity of images for content-based medical image Nowak, E., Jurie, F., & Triggs, B. (2006). Sampling strat-
retrieval. Radiological Physics and Technology. egies for bag-of-features image classification.
Nagarkar, D. B., Mercan, E., Weaver, D. L., Brunye, Computer Vision – Eccv 2006, Pt 4, Proceedings,
T. T., Carney, P. A., Rendi, M. H., Beck, A. H., 3954, 490–503.
Frederick, P. D., Shapiro, L. G., & Elmore, J. G. Ojala, T., Pietikainen, M., & Maenpaa, T. (2002).
(2016). Region of interest identification and diag- Multiresolution gray-scale and rotation invariant
nostic agreement in breast pathology. Modern texture classification with local binary patterns.
Pathology, 29(9), 1004–1011. IEEE Transactions on Pattern Analysis and Machine
Napel, S. A., Beaulieu, C. F., Rodriguez, C., Cui, J., Xu, Intelligence, 24(7), 971–987.
J., Gupta, A., Korenblum, D., Greenspan, H., Ma, Organization for Human Brain Mapping (2001). Annual
Y., & Rubin, D. L. (2010). Automated retrieval of Conference on Human Brain Mapping. Brighton,
CT images of liver lesions on the basis of image United Kingdom.
similarity: Method and preliminary results. Paddock, S. W. (1994). To boldly glow. Applications of
Radiology, 256(1), 243–252. laser scanning confocal microscopy in developmen-
Napel, S., Hoang, C., Xu, J., Gevaert, O., Rubin, D. L., tal biology. BioEssays, 16(5), 357–365.
Plevritis, S., Xu, Y., Leung, A., & Quon, A. (2011). Panwar, N., Huang, P., Lee, J., Keane, P. A., Chuan,
Computational and semantic annotation of CT and T. S., Richhariya, A., Teoh, S., Lim, T. H., &
PET images and integration with genomic assays of Agrawal, R. (2016). Fundus photography in the 21st
tumors in non-small cell lung cancer (NSCLC) for century–A review of recent technological advances
decision support and discovery: method and pre- and their implications for worldwide healthcare.
liminary results. Ninety-seventy annual scientific Telemedicine Journal and E-Health, 22(3), 198–208.
meeting of the RSNA, Chicago, IL. Pawlus, A., Sokolowska-Dabek, D., Szymanska, K.,
National Cancer Institute. (2012). Annotation and Inglot, M. S., & Zaleska-Dorobisz, U. (2015).
image markup on clear canvas. Retrieved Accessed Ultrasound Elastography–Review of techniques
11/7/2012, from https://wiki.nci.nih.gov/display/ and its clinical applications in pediatrics–Part 1.
AIM/Annotation+and+Image+Markup+-+AIM Advances in Clinical and Experimental Medicine,
National Library of Medicine. (1999). Medical subject 24(3), 537–543.
headings – Annotated alphabetic list. Bethesda: Pelizzari, C. A. (1998). Image processing in stereotactic
U.S. Department of Health and Human Services, planning: Volume visualization and image
Public Health Service. registration. Medical Dosimetry, 23(3), 137–145.
Neumann, H., Kiesslich, R., Wallace, M. B., & Neurath, Perkins, G., Renken, C., Martone, M. E., Young, S. J.,
M. F. (2010). Confocal laser endomicroscopy: Ellisman, M., & Frey, T. (1997). Electron tomogra-
Technical advances and clinical applications. phy of neuronal mitochondria: Three-dimensional
Gastroenterology, 139(2), 388–392. 392 e381-382. structure and organization of cristae and menbrane
Ng, A. Y., Jordan, M., & Weiss, Y. (2001). On spectral contacts. Journal of Structural Biology, 119(3),
clustering: analysis and an algorithm. In: Advances in 260–272.
358 D. L. Rubin et al.
Permuth, J. B., Choi, J., Balarunathan, Y., Kim, J., genes in living subjects. Methods in Molecular
Chen, D. T., Chen, L., Orcutt, S., Doepker, M. P., Biology, 411, 131–144.
Gage, K., Zhang, G., Latifi, K., Hoffe, S., Jiang, K., Rector, A. L., Nowlan, W. A., & Glowinski, A. (1993).
Coppola, D., Centeno, B. A., Magliocco, A., Li, Q., Goals for concept representation in the GALEN
Trevino, J., Merchant, N., Gillies, R., Malafa, M., & project. Proceedings of the 17th Annual Symposium
Florida Pancreas, C. (2016). Combining radiomic on Computer Applications in Medical Care (SCAMC
features with a miRNA classifier may improve pre- 93). C. Safran. New York: McGraw Hill, pp. 414–
diction of malignant pathology for pancreatic intra- 418.
ductal papillary mucinous neoplasms. Oncotarget, Ribaric, S., Todorovski, L., Dimec, J., & Lunder, T.
7(52), 85785–85797. (2001). Presentation of dermatological images on
Pham, D. L., Xu, C. Y., & Prince, J. L. (2000). Current the internet. Computer Methods and Programs in
methods in medical image segmentation. Annual Biomedicine, 65(2), 111–121.
Review of Biomedical Engineering, 2, 315. Ritchie, C. J., Edwards, W. S., Cyr, D. R., & Kim, Y.
Pieper, S., Halle, M., & Kikinis, R. (2004). 3D SLICER. (1996). Three-dimensional ultrasonic angiography
IEEE International Symposium on Biomedical using power-mode Doppler. Ultrasound in Medicine
Imaging ISBI, 2004, 632–635. and Biology, 22(3), 277–286.
Plevritis, S., Gevaert, O., Xu, J., Hoang, C., Leung, A., Robinson, P. J. (1997). Radiology’s Achilles’ heel: Error
Xu, Y., Quon, A., Rubin, D. L., & Napel, S. (2011). and variation in the interpretation of the Rontgen
Rapid Identification of Prognostic Imaging image. The British Journal of Radiology, 70(839),
Biomarkers for Non-small Cell Lung Carcinoma 1085–1098.
(NSCLC) by integrating image features and gene Rohlfing, T., & Maurer, C. R., Jr. (2003). Nonrigid
expression and leveraging public gene expression image registration in shared-memory multiproces-
databases. Ninety-seventy annual scientific meeting sor environments with application to brains, breasts,
of the RSNA, Chicago, IL. and bees. IEEE Transactions on Information
Pouratian, N., Sheth, S. A., Martin, N. A., & Toga, Technology in Biomedicine, 7(1), 16–25.
A. W. (2003). Shedding light on brain mapping: Ronneberger, O., Fischer, P., & Brox, T. (2015). U-Net:
Advances in human optical imaging. Trends in Convolutional networks for biomedical image seg-
10 Neurosciences, 26(5), 277–282. mentation. Springer International Publishing.
Prastawa, M., Gilmore, J., Lin, W. L., & Gerig, G. Rosen, G. D., Williams, A. G., Capra, J. A., Connolly,
(2004). Automatic segmentation of neonatal brain M. T., Cruz, B., Lu, L., Airey, D. C., Kulkarni, K.,
MRI. Medical Image Computing and Computer- & Williams, R. W. (2000). The Mouse Brain Library
Assisted Intervention – Miccai 2004, Pt 1, @ www.mbl.org. Int. Mouse Genome Conference
Proceedings, 3216, 10–17. 14, p. 166.
Prothero, J. S., & Prothero, J. W. (1986). Three- Ross, B., & Bluml, S. (2001). Magnetic resonance spec-
dimensional reconstruction from serial sections troscopy of the human brain. Anatomical Record
IV. The reassembly problem. Computers and (New Anat.), 265(2), 54–84.
Biomedical Research, 19(4), 3610373. Rosse, C. (2000). Terminologia Anatomica; considered
Pujara, A. C., Kim, E., Axelrod, D., & Melsaether, from the perspective of next-generation knowledge
A. N. (2018). PET/MRI in breast cancer. Journal of sources. Clinical Anatomy, 14, 120–133.
Magnetic Resonance Imaging. Rosse, C., & Mejino, J. L. V. (2003). A reference ontol-
Pysz, M. A., Gambhir, S. S., & Willmann, J. K. (2010). ogy for bioinformatics: The foundational model of
Molecular imaging: Current status and emerging anatomy. Journal of Bioinformatics, 36(6), 478–500.
strategies. Clinical Radiology, 65(7), 500–516. Rosse, C., Mejino, J. L., Modayur, B. R., Jakobovits,
Qiu, G. (2002). Indexing chromatic and achromatic pat- R. M., Hinshaw, K. P., & Brinkley, J. F. (1998a).
terns for content-based colour image retrieval. Motivation and organizational principles for ana-
Pattern Recognition, 35(8), 1675–1686. tomical knowledge representation: The digital anat-
Rahmani, R., Goldman, S. A., Zhang, H., Cholleti, S. R., omist symbolic knowledge base. Journal of the
& Fritts, J. E. (2008). Localized content-based image American Medical Informatics Association, 5(1),
retrieval. IEEE Transactions on Pattern Analysis and 17–40.
Machine Intelligence, 30(11), 1902–1912. Rosse, C., Shapiro, L. G., & Brinkley, J. F. (1998b). The
Rajpurkar, P., J. Irvin, K. Zhu, B. Yang, H. Mehta, Digital Anatomist foundational model: Principles
T. Duan, D. Ding, A. Bagul, C. Langlotz, for defining and structuring its concept domain.
K. Shpanskaya, M. P. Lungren and A. Y. Ng (2017). Proceedings, American Medical Informatics
CheXNet: Radiologist-Level Pneumonia Detection Association Fall Symposium. Orlando, Florida,
on Chest X-Rays with Deep Learning. CoRR pp. 820–824.
abs/1711.05225. Rubin, D. L. (2008). Creating and curating a terminol-
Ray, P. (2011). Multimodality molecular imaging of dis- ogy for radiology: Ontology modeling and analysis.
ease progression in living subjects. Journal of Journal of Digital Imaging, 21(4), 355–362.
Biosciences, 36(3), 499–504. Rubin, D. L. (2011). Measuring and improving quality
Ray, P., & Gambhir, S. S. (2007). Noninvasive imaging in radiology: Meeting the challenge with informat-
of molecular events with bioluminescent reporter ics. Radiographics, 31(6), 1511–1527.
Biomedical Imaging Informatics
359 10
Rubin, D. L., & Napel, S. (2010). Imaging informatics: Informatics Association Annual Symposium
Toward capturing and processing semantic informa- Proceedings, 674–678.
tion in radiology images. Yearbook of Medical Sandor, S., & Leahy, R. (1997). Surface-based labeling
Informatics, 34–42. of cortical anatomy using a deformable atlas. IEEE
Rubin, D. L., & Snyder, A. (2011). ePAD: A cross- Transactions on Medical Imaging, 16(1), 41–54.
platform semantic image annotation tool ninety- Schaltenbrand, G., & Warren, W. (1977). Atlas for
seventh annual scientific meeting of the RSNA. Stereotaxy of the human Brain. Stuttgart: Thieme.
Chicago, IL. Scheckenbach, K., Colter, L., & Wagenmann, M. (2017).
Rubin, D. L., Bashir, Y., Grossman, D., Dev, P., & Radiomics in head and neck cancer: Extracting
Musen, M. A. (2004). Linking ontologies with valuable information from data beyond recognition.
three-dimensional models of anatomy to predict the ORL: Journal for Otorhinolaryngology and Its
effects of penetrating injuries. Conference Related Specialties, 79(1–2), 65–71.
Proceedings: Annual International Conference of the Schimel, A. M., Fisher, Y. L., & Flynn, H. W., Jr. (2011).
IEEE Engineering in Medicine and Biology Society, Optical coherence tomography in the diagnosis and
5, 3128–3131. management of diabetic macular edema: Time-
Rubin, D. L., Bashir, Y., Grossman, D., Dev, P., & domain versus spectral-domain. Ophthalmic
Musen, M. A. (2005). Using an ontology of human Surgery, Lasers & Imaging, 42(4), S41–S55.
anatomy to inform reasoning with geometric mod- Schultz, E. B., Price, C., & Brown, P. J. B. (1997).
els. Studies in Health Technology and Informatics, Symbolic anatomic knowledge representation in the
111, 429–435. read codes version 3: Structure and application.
Rubin, D. L., Dameron, O., Bashir, Y., Grossman, D., Journal of the American Medical Informatics
Dev, P., & Musen, M. A. (2006a). Using ontologies Association, 4, 38–48.
linked with geometric models to reason about pen- Schwartz, L. H., Panicek, D. M., Berk, A. R., Li, Y., &
etrating injuries. Artificial Intelligence in Medicine, Hricak, H. (2011). Improving communication of
37(3), 167–176. diagnostic radiology findings through structured
Rubin, D. L., Grossman, D., Neal, M., Cook, D. L., reporting. Radiology, 260(1), 174–181.
Bassingthwaighte, J. B., & Musen, M. A. (2006b). Seidenari, S., Pellacani, G., & Grana, C. (2003).
Ontology-based representation of simulation mod- Computer description of colours in dermoscopic
els of physiology. AMIA Annu Symp Proc, pp. 664– melanocytic lesion images reproducing clinical
668. assessment. The British Journal of Dermatology,
Rubin, D. L., Mongkolwat, P., Kleper, V., Supekar, K., 149(3), 523–529.
& Channin, D. S. (2008a). Medical imaging on the Sensor Systems Inc. (2001). "MedEx." From http://
semantic web: Annotation and image markup. 2008 medx.sensor.com/products/medx/index.html
AAAI Spring Symposium Series, Semantic Scientific Shao, L., Zhang, H., & de Haan, G. (2008). An overview
Knowledge Integration. Stanford University. and performance evaluation of classification-based
Rubin, D. L., Rodriguez, C., Shah, P., & Beaulieu, C. least squares trained filters. IEEE transactions on
(2008b). iPad: Semantic annotation and markup of image processing: a publication of the IEEE Signal
radiological images. American Medical Informatics Processing Society, 17(10), 1772–1782.
Association Annual Symposium Proceedings, 626–630. Shapiro, L. G., & Stockman, G. C. (2001). Computer
Rubin, D. L., Mongkolwat, P., & Channin, D. S. (2009a). vision. Prentice Hall: Upper Saddle River.
A semantic image annotation model to enable inte- Shattuck, D. W., & Leahy, R. M. (2001). Automated
grative translational research. Summit on Translat graph-based analysis and correction of cortical vol-
Bioinforma, 2009, 106–110. ume topology. IEEE Transactions on Medical
Rubin, D. L., Talos, I. F., Halle, M., Musen, M. A., & Imaging, 20(11), 1167–1177.
Kikinis, R. (2009b). Computational neuroanatomy: Shen, D., Wu, G., & Suk, H. I. (2017). Deep learning in
Ontology-based representation of neural compo- medical image analysis. Annual Review of Biomedical
nents and connectivity. BMC Bioinformatics, Engineering, 19, 221–248.
10(Suppl 2), S3. Shi, J. B., & Malik, J. (2000). Normalized cuts and image
Rubin, D. L., Korenblum, D., Yeluri, V., Frederick, P., & segmentation. IEEE Transactions on Pattern
Herfkens, R. J. (2010). Semantic annotation and Analysis and Machine Intelligence, 22(8), 888–905.
image markup in a commercial PACS workstation. Shi, B., Grimm, L. J., Mazurowski, M. A., Baker, J. A.,
Scientific Paper, Ninety-sixth annual scientific meet- Marks, J. R., King, L. M., Maley, C. C., Hwang,
ing of the RSNA. Chicago, IL. E. S., & Lo, J. Y. (2018). Prediction of occult inva-
Rubin, D. L., Flanders, A., Kim, W., Siddiqui, K. M., & sive disease in ductal carcinoma in situ using deep
Kahn, C. E., Jr. (2011). Ontology-assisted analysis of learning features. Journal of the American College of
web queries to determine the knowledge radiologists Radiology, 15(3 Pt B), 527–534.
seek. Journal of Digital Imaging, 24(1), 160–164. Shin, H. C., Roth, H. R., Gao, M., Lu, L., Xu, Z.,
Ruiz, M. E. (2006). Combining image features, case Nogues, I., Yao, J., Mollura, D., & Summers, R. M.
descriptions and UMLS concepts to improve (2016). Deep convolutional neural networks for
retrieval of medical images. American Medical computer-aided detection: CNN architectures,
360 D. L. Rubin et al.
dataset characteristics and transfer learning. IEEE Subramaniam, B., Hennessey, J. G., Rubin, M. A.,
Transactions on Medical Imaging, 35(5), 1285–1298. Beach, L. S., & Reiss, A. L. (1997). Software and
Simpson, S., Kay, F. U., Abbara, S., Bhalla, S., Chung, methods for quantitative imaging in neuroscience:
J. H., Chung, M., Henry, T. S., Kanne, J. P., the Kennedy Krieger Institute Human Brain
Kligerman, S., Ko, J. P., & Litt, H. (2020). Project. In S. H. Koslow & M. F. Huerta (Eds.),
Radiological Society of North America Expert Neuroinformatics: an overview of the Human Brain
Consensus Statement on Reporting Chest CT Project (pp. 335–360). Mahwah: Lawrence Erlbaum.
Findings Related to COVID-19. Endorsed by the Sundsten, J. W., Conley, D. M., Ratiu, P., Mulligan,
Society of Thoracic Radiology, the American K. A., & Rosse, C.. (2000). Digital Anatomist web-
College of Radiology, and RSNA. Journal of based interactive atlases. From http://www9.biostr.
Thoracic Imaging. washington.edu/da.html
Singh, A., Massoud, T. F., Deroose, C., & Gambhir, S. S. Swanson, L. W. (1999). Brain maps: Structure of the rat
(2008). Molecular imaging of reporter gene expres- brain. New York: Elsevier Science.
sion in prostate cancer: An overview. Seminars in Talairach, J., & Tournoux, P. (1988). Co-planar stereo-
Nuclear Medicine, 38(1), 9–19. taxic atlas of the human brain. New York: Thieme
Sivic, J., & Zisserman, A. (2003). Video Google: A text Medical Publishers.
retrieval approach to object matching in videos. Talos, I. F., Rubin, D. L., Halle, M., Musen, M., &
Proceedings of the International Conference on Kikinis, R. (2008). A prototype symbolic model of
Computer Vision., 2, 1470–1477. canonical functional neuroanatomy of the motor
Smeulders, A. W. M., Worring, M., Santini, S., Gupta, system. Journal of Biomedical Informatics, 41(2),
A., & Jain, R. (2000). Content-based image retrieval 251–263.
at the end of the early years. IEEE Transactions on Toga, A. W. (2001). UCLA Laboratory for Neuro
Pattern Analysis and Machine Intelligence, 22(12), Imaging (LONI). From http://www.loni.ucla.edu/
1349–1380. Toga, A. W., Ambach, K. L., & Schluender, S. (1994).
Smith, M. K., Welty, C., & McGuinness, D. (2004). High-resolution anatomy from in situ human brain.
OWL Web Ontology Language Guide. http://www. NeuroImage, 1(4), 334–344.
w3.org/TR/owl-guide/ Toga, A. W., Santori, E. M., Hazani, R., & Ambach, K.
10 Smith, M. Q., Staley, C. A., Kooby, D. A., Styblo, T., (1995). A 3-D digital map of rat brain. Brain
Wood, W. C., & Yang, L. (2009). Multiplexed fluo- Research Bulletin, 38(1), 77–85.
rescence imaging of tumor biomarkers in gene Toga, A. W., Frackowiak, R. S. J., & Mazziotta, J. C.
expression and protein levels for personalized and (Eds.). (2001). Neuroimage: A journal of brain func-
predictive medicine. Current Molecular Medicine, tion. Academic Press: New York.
9(8), 1017–1023. Tommasi, T., Caputo, B., Welter, P., Güld, M. O., &
Sohrab, M. A., Smith, R. T., & Fawzi, A. A. (2011). Deserno, T. M. (2010). Overview of the CLEF 2009
Imaging characteristics of dry age-related macular medical image annotation track. Proceedings of the
degeneration. Seminars in Ophthalmology, 26(3), 10th international conference on Cross-language eval-
156–166. uation forum: multimedia experiments. Corfu,
Solomon, M., Liu, Y., Berezin, M. Y., & Achilefu, S. Greece, Springer-Verlag, pp. 85–93.
(2011). Optical imaging in cancer research: Basic Toomre, D., & Bewersdorf, J. (2010). A new wave of cel-
principles, tumor detection, and therapeutic monitor- lular imaging. Annual Review of Cell and
ing. Medical Principles and Practice, 20(5), 397–415. Developmental Biology, 26, 285–314.
Soto, G. E., Young, S. J., MArtone, M. E., Deerinick, Trebeschi, S., Griethuysen, J. J. M. V., Lambregts, D. M.
T. J., Lamont, S. L., Carragher, B. O., Hamma, K., & J., Lahaye, M. J., Parmar, C., Bakers, F. C. H.,
Ellisman, M. H. (1994). Serial section electron Peters, N. H. G. M., Beets-Tan, R. G. H., & Aerts,
tomography: A method for three-dimensional recon- H. J. W. L. (2017). Deep learning for fully-automated
struction of large structures. NeuroImage, 1, 230–243. localization and segmentation of rectal Cancer on
Spackman, K. A., Campbell, K. E., & Cote, R. A. multiparametric MR. Scientific Reports, 7(1), 5301.
(1997). SNOMED-RT: A reference terminology for Tsarkov, D., & Horrocks, I. (2006). FaCT++ description
health care. Proceedings, AMIA Annual Fall logic reasoner: System description. Automated
Symposium. D. R. Masys. Philadelphia, Hanley and Reasoning, Proceedings, 4130, 292–297.
Belfus, pp. 640–644. Valdora, F., Houssami, N., Rossi, F., Calabrese, M., &
Sperrin, M., Grant, S. W., & Peek, N. (2020). Prediction Tagliafico, A. S. (2018). Rapid review: Radiomics
models for diagnosis and prognosis in Covid-19. and breast cancer. Breast Cancer Research and
BMJ, 369, m1464. Treatment, 169(2), 217–229.
Spitzer, V. M., & Whitlock, D. G. (1998). The visible Van Essen, D. C., & Drury, H. A. (1997). Structural and
human dataset: The anatomical platform for human functional analysis of human cerebral cortex using
simulation. The Anatomical Record, 253(2), a surface-basec atlas. The Journal of Neuroscience,
49–57. 17(18), 7079–7102.
Stensaas, S. S., & Millhouse, O. E.. (2001). Atlases of the Van Essen, D. C., Drury, H. A., Dickson, J., Harwell, J.,
Brain. From http://medstat.med.utah.edu/kw/ Hanlon, D., & Anderson, C. H. (2001). An inte-
brain_atlas/credits.htm grated software suite for surface-based analysis of
Biomedical Imaging Informatics
361 10
cerebral cortex. Journal of the American Medical Weissleder, R., & Mahmood, U. (2001). Molecular
Association, 8(5), 443–459. imaging. Radiology, 219, 316–333.
Van Leemput, K., Maes, F., Vandermeulen, D., & Weissleder, R., Schwaiger, M. C., Gambhir, S. S., &
Suetens, P. (1999). Automated model-based tissue Hricak, H. (2016). Imaging approaches to optimize
classification of MR images of the brain. IEEE molecular therapies. Science Translational Medicine,
Transactions on Medical Imaging, 18(10), 897–908. 8(355), 355ps316.
Van Noorden, S. (2002). Advances in immunocytochem- Wessels, J. T., Yamauchi, K., Hoffman, R. M., &
istry. Folia Histochemica et Cytobiologica, 40(2), Wouters, F. S. (2010). Advances in cellular, subcel-
121–124. lular, and nanoscale imaging in vitro and in vivo.
van Timmeren, J. E., Leijenaar, R. T. H., van Elmpt, W., Cytometry. Part A, 77(7), 667–676.
Reymen, B., Oberije, C., Monshouwer, R., Bussink, Willmann, J. K., van Bruggen, N., Dinkelborg, L. M., &
J., Brink, C., Hansen, O., & Lambin, P. (2017). Gambhir, S. S. (2008). Molecular imaging in drug
Survival prediction of non-small cell lung cancer development. Nature Reviews. Drug Discovery, 7(7),
patients using radiomics analyses of cone-beam CT 591–607.
images. Radiotherapy and Oncology, 123(3), Wilson, T. (1990). Confocal Microscopy. San Diego:
363–369. Academic Press Ltd.
Vapnik, V. N. (2000). The nature of statistical learning Wong, B. A., Rosse, C., & Brinkley, J. F. (1999). Semi-
theory. New York: Springer. automatic scene generation using the Digital
Varma, M., & Zisserman, A. (2003). Texture classifica- Anatomist Foundational Model. Proceedings,
tion: are filter banks necessary?, in. Proceedings of American Medical Informatics Association Fall
International Conference on Computer Vision and Symposium. Washington, D.C., pp. 637–641.
Pattern Recognition, pp. 91–698. Woods, R. P., Cherry, S. R., & Mazziotta, J. C. (1992).
Venkatesh, S. K., & Ehman, R. L. (2015). Magnetic Rapid automated algorithm for aligning and reslic-
resonance elastography of abdomen. Abdominal ing PET images. Journal of Computer Assisted
Imaging, 40(4), 745–759. Tomography, 16, 620–633.
Vincent, P., Larochelle, H., Lajoie, I., Bengio, Y., & Woods, R. P., Mazziotta, J. C., & Cherry, S. R. (1993).
Manzagol, P.-A. (2010). Stacked Denoising autoen- MRI-PET registration with automated algorithm.
coders: Learning useful representations in a deep Journal of Computer Assisted Tomography, 17,
network with a local Denoising criterion. Journal of 536–546.
Machine Learning Research, 11(Dec), 3371–3408. World Health Organization. (2020.) WHO Director-
Vreeman, D. J., & McDonald, C. J. (2005). Automated General's opening remarks at the media briefing on
mapping of local radiology terms to LOINC. AMIA COVID-19 – 11 March 2020. Retrieved 5/15/2020,
Annu Symp Proc, pp. 769–773. from https://www.who.int/dg/speeches/detail/who-
Vreeman, D. J., Abhyankar, S., Wang, K. C., Carr, C., director-general-s-opening-remarks-at-the-media-
Collins, B., Rubin, D. L., & Langlotz, C. P. (2018). briefing-on-covid-19%2D%2D-11-march-2020
The LOINC RSNA radiology playbook – A unified WorldWideWeb Consortium. (W3C Recommendation
terminology for radiology procedures. Journal of the 10 Feb 2004). OWLWeb Ontology Language
American Medical Informatics Association, 25(7), Reference.
885–893. Wu, S., Zheng, J., Li, Y., Yu, H., Shi, S., Xie, W., Liu, H.,
Wang, L., & Wong, A.. (2020). COVID-Net: A tailored Su, Y., Huang, J., & Lin, T. (2017). A Radiomics
deep convolutional neural network design for detec- nomogram for the preoperative prediction of lymph
tion of COVID-19 cases from chest X-ray images. node metastasis in bladder Cancer. Clinical Cancer
Wang, J. Z., Wiederhold, G., Firschein, O., & Wei, S. X. Research, 23(22), 6904–6911.
(1997). Content-based image indexing and search- Wynants, L., Van Calster, B., Bonten, M. M. J., Collins,
ing using Daubechies’ wavelets. International G. S., Debray, T. P. A., De Vos, M., Haller, M. C.,
Journal on Digital Libraries, 1(4), 311–328. Heinze, G., Moons, K. G. M., Riley, R. D., Schuit,
Wang, D., A. Khosla, R. Gargeya, H. Irshad and A. H. E., Smits, L. J. M., Snell, K. I. E., Steyerberg, E. W.,
Beck (2016). Deep learning for identifying meta- Wallisch, C., & van Smeden, M. (2020). Prediction
static breast cancer. arXiv:1606.05718 [cs, q-bio]. models for diagnosis and prognosis of covid-19
Wang, K. C., Patel, J. B., Vyas, B., Toland, M., Collins, infection: Systematic review and critical appraisal.
B., Vreeman, D. J., Abhyankar, S., Siegel, E. L., BMJ, 369, m1328.
Rubin, D. L., & Langlotz, C. P. (2017). Use of radi- Yamashita, R., Nishio, M., Do, R. K. G., & Togashi, K.
ology procedure codes in health care: The need for (2018). Convolutional neural networks: an overview
standardization and structure. Radiographics, 37(4), and application in radiology. Insights Imaging, 9(4),
1099–1110. 611–629.
Weaver, O., & Leung, J. W. T. (2018). Biomarkers and Yang, L., Jin, R., Mummert, L., Sukthankar, R., Goode,
imaging of breast cancer. AJR. American Journal of A., Zheng, B., Hoi, S. C. H., & Satyanarayanan, M.
Roentgenology, 210(2), 271–278. (2010). A boosting framework for visuality-preserving
Weiss, D. L., & Langlotz, C. P. (2008). Structured report- distance metric learning and its application to medi-
ing: Patient care enhancement or productivity cal image retrieval. IEEE Transactions on Pattern
nightmare? Radiology, 249(3), 739–747. Analysis and Machine Intelligence, 32(1), 30–44.
362 D. L. Rubin et al.
Yang, Z., Shi, J., He, Z., Lu, Y., Xu, Q., Ye, C., Chen, S., Zhang, H., Ingham, E. S., Gagnon, M. K., Mahakian,
Tang, B., Yin, K., Lu, Y., & Chen, X. (2020). L. M., Liu, J., Foiret, J. L., Willmann, J. K., &
Predictors for imaging progression on chest CT Ferrara, K. W. (2017). In vitro characterization and
from coronavirus disease 2019 (COVID-19) patients. in vivo ultrasound molecular imaging of nucleolin-
Aging (Albany NY), 12(7), 6037–6048. targeted microbubbles. Biomaterials, 118,
Yasaka, K., Akai, H., Kunimatsu, A., Kiryu, S., & Abe, 63–73.
O. (2018). Deep learning with convolutional neural Zhenyu, H., Yanjie, Z., Tonghui, L., & Jianguo, Z.
network in radiology. Japanese Journal of Radiology, (2009). Combining text retrieval and content-based
36(4), 257–272. image retrieval for searching a large-scale medical
Yoo, T. S. (2004). Insight into images : Principles and image database in an integrated RIS/PACS environ-
practice for segmentation, registration, and image ment, SPIE.
analysis. Wellesley: A K Peters. Zhou, S. K., Greenspan, H., & Shen, D. (2017a). Deep
Yu, F., & Ip, H. H. (2008). Semantic content analysis learning for medical image analysis. London/San
and annotation of histological images. Computers in Diego: Elsevier/Academic Press.
Biology and Medicine, 38(6), 635–649. Zhou, Y., He, L., Huang, Y., Chen, S., Wu, P., Ye, W., Liu,
Yuan, M., Yin, W., Tao, Z., Tan, W., & Hu, Y. (2020). Z., & Liang, C. (2017b). CT-based radiomics signa-
Association of radiologic findings with mortality of ture: A potential biomarker for preoperative predic-
patients infected with 2019 novel coronavirus in tion of early recurrence in hepatocellular carcinoma.
Wuhan, China. PLoS One, 15(3), e0230548. Abdominal Radiology (NY), 42(6), 1695–1704.
Zaleska-Dorobisz, U., Pawlus, A., Szymanska, K., Zhou, M., Scott, J., Chaudhury, B., Hall, L., Goldgof,
Lasecki, M., & Ziajkiewicz, M. (2015). Ultrasound D., Yeom, K. W., Iv, M., Ou, Y., Kalpathy-Cramer,
Elastography–Review of techniques and its clinical J., Napel, S., Gillies, R., Gevaert, O., & Gatenby, R.
applications in pediatrics–Part 2. Advances in (2018). Radiomics in brain tumor: Image assess-
Clinical and Experimental Medicine, 24(4), ment, quantitative feature descriptors, and machine-
725–730. learning approaches. AJNR. American Journal of
Zalis, M. E., Barish, M. A., Choi, J. R., Dachman, Neuroradiology, 39(2), 208–216.
A. H., Fenlon, H. M., Ferrucci, J. T., Glick, S. N., Zijdenbos, A. P., Evans, A. C., Riahi, F., Sled, J., Chui,
10 Laghi, A., Macari, M., McFarland, E. G., Morrin, J., & Kollokian, V. (1996). Automatic quantification
M. M., Pickhardt, P. J., Soto, J., & Yee, J. (2005). CT of multiple sclerosis lesion volume using stereotac-
colonography reporting and data system: A consen- tic space. Proc. 4th Int. Conf. on Visualization in
sus proposal. Radiology, 236(1), 3–9. Biomedical Computing. Hamburg. pp. 439–448.
Zhang, Y. Y., Brady, M., & Smith, S. (2001). Zimmerman, S. L., Kim, W., & Boonn, W. W. (2011).
Segmentation of brain MR images through a hid- Informatics in radiology: Automated struc-
den Markov random field model and the tured reporting of imaging findings using the
expectation-maximization algorithm. IEEE AIM standard and XML. Radiographics, 31(3),
Transactions on Medical Imaging, 20(1), 45–57. 881–887.
363 11
Contents
References – 385
Personal Health Informatics
365 11
nnLearning Objectives disease. This recognition has given rise to a
After reading this chapter, you should know movement that underpins the birth of per-
the answers to these questions: sonal health informatics, as described below.
1. What is the role of the patient or con-
sumer in healthcare decisions?
2. How does patient empowerment come 11.2 Patient-Centered Care
into play in the various care delivery and Personal Health
settings and phases of health care? Informatics
3. What are some examples of sensors that
can be used to assist in personal health Patient-centered care has become a core com-
management? ponent of medical care. From early work in
4. How can you ensure patient privacy the 1960s and 70s (Balint 1969; Waitzkin and
and the security of patient generated Stoeckle 1972), to the concept of the chronic
data in the home and environment? disease model (Bodenheimer et al., 2002c;
5. What are the various features of per- Coleman et al. 2009), the National Academy
sonal health technologies (e.g., personal of Medicine landmark report Crossing the
health records, mobile applications, Quality Chasm (Institute of Medicine (US)
etc.)? Committee on Quality of Health Care in
6. How do individuals obtain various America 2001) making “patient-centered”
types of health information? one of the six aims of health care, and the
development and incorporation of the medi-
cal home (Kellerman and Kirk 2007) and
11.1 Introduction patient engagement (Dentzer 2013),1 patient-
centered care has taken center stage in
Complexity and collaboration characterize medicine.
health care in the early twenty-first century. As a result of this visibility, healthcare
Complexity arises from our deeper and more institutions, health planners, congressional
sophisticated understanding of health and representatives, and hospital public relations
disease, including the addition of molecular/ departments are among many promoters of
genomic processes and social/behavioral patient-centered care, a concept rooted in
determinants. Complexity also arises from the “deep respect for patients as unique living
myriad of new treatments available for many beings, and the obligation to care for them on
diseases, and emerging data about the role of their terms”(Epstein and Street Jr 2011). To
nutrition, exercise, sleep, and stress in preserv- be patient-centered, one must accept people
ing health. Collaboration begins with the real- seeking care as persons with a unique social
ization that successful attainment of optimal world, who should be listened to, informed,
wellbeing and effective management of dis- respected, and involved in their own care—
ease processes necessitate active engagement and whose wishes are heard, if not acted
of clinicians, laypersons, support systems, and upon, during their healthcare journey. Patient-
society as a whole. Collaboration extends centered care complements evidence-based
beyond the societally-focused opportunities medicine by including patient preferences into
into the healthcare system itself, where care is the decision making about treatment options.
more fragmented, leading to greater needs for Berwick, described three maxims of patient-
collaboration and communication.
Now more than ever before, the healthcare
system recognizes the role of the person who 1 Health CfA. A New Definition of Patient Engage-
interacts with this system, who is increasingly ment. What is Engagement and Why is it Impor-
interested in engaging or called upon to tant? 2010. Available from: 7 http://www.cfah.org/
file/CFAH_Engagement_Behavior_Framework_
engage through various states of health and
current.pdf
366 R. M. Cronin et al.
centered care: (1) “The needs of the patient 11.2.1 Using Biomedical
come first.”(2) “Nothing about me without Informatics to Impact
me.” (3) “Every patient is the only patient”
(Berwick 2009).
Patient-Centered Medicine
Patient-centered care represents a shift in
To conclude this section, we provide a few
the physician’s role from paternalistic and
illustrative examples of how patient-centered
authoritative to collaborative--leveraging the
care and personal health informatics can help
perspectives of people and their support
shape the care delivered.
system, whether the support system consists
1. Patient-centered ambulatory care.
of family, caregivers, or even technology, as
Routine ambulatory care, by its very
partners in making decisions and delivering
nature, focuses on the whole person, and
care. Patient-centered care requires the entire
healthcare team to be more mindful, informa- not just their diagnosis. Caring for the
tive, and empathic, and for patients to actively whole person requires the ability to utilize
participate in their care. Patient-centered care resources such as social workers, financial
encourages inclusiveness and engagement for counselors, mental health providers, trans-
shared decision making among the stakehold- portation, peer support programs, daily
ers to develop a comprehensive care plan living assistance, and language and liter-
aligned with the whole person. acy education and resources. Making the
The maxims of patient-centered care form provider aware of the needs of the person
the basis of the field personal health informat- being cared for can be enabled through
ics, as originally proposed by Warner Slack electronic health records (EHRs) and clin-
and Tom Ferguson in 1993 (Demiris 2016). In ical decision support that work within the
particular, these maxims translate into the fol- provider workflow, potentially utilizing
lowing desiderata: data provided by the patient or the patient’s
11 55 People are able to access care that is coor- social and family network. In addition,
dinated and collaborative using tools like a patient portal, the health-
55 Care is focused on the whole person, not care system can provide alerts and remind-
just the physical comfort ers to patients for care such as the Influenza
55 Care considers people’s values, culture, vaccine, nutrition counseling, and medica-
and socioeconomic status tion refill reminders.
55 People and their support system are active Access to care can be facilitated
partners in care, not passive listeners through telemedicine and telehealth (see
55 People’s goals within the healthcare sys- 7 Chap. 20), as well as through apps that
tem align with the system’s mission, val- could enable daily living assistance and
ues, and quality metrics peer support programs (see 7 Chap. 19).
55 People and their caregivers participate in All of these applications need to consider
shared decision making with their provid- language and literacy (health and technol-
ers and play a role in the decisions at the ogy), as well as challenges created by asyn-
personal, population, and system level chronous healthcare-related discussions.
55 Sharing health information with people Example: Jane uses her mobile app to
and caregivers enables informed decision remind her when to fill her asthma medica-
making tions and communicate with her social
55 Support systems’ presence in the care set- worker and financial counselor to help her
ting are encouraged and facilitated purchase those medications. She uses the
same app to talk with her support system
As described in 7 Sect. 13.5, these desiderata and other people with asthma who can
provide a new lens through which the whole understand and support her journey
of biomedical informatics should be viewed. through the stages of her disease.
Personal Health Informatics
367 11
2. Patient-centered acute care and care tran- porting, and behavioral factors can affect
sitions. a person’s health care and management.
Acute care settings are characterized Applications like patient portals or per-
by sudden and rapid changes in patient sonal health records can enable a person to
status, ad hoc appointments, and frequent review lab results and clinical notes, com-
handoffs of care. To help patients under- municate with their healthcare providers,
stand what is happening to them and to schedule appointments, pay bills, and
facilitate decision-making, patients often obtain educational information. Wearable
request unrestricted and continuous access technologies enable self-management and
to their social support network in this set- monitoring of critical information, such as
ting. Patients and their support system weight, blood pressure, glucose levels, and
should be present during rounds, which medication adherence (Marcolino et al.
are performed at the bedside, and at shift 2018). Finally, informatics applications
changes. Family interaction should take could deliver information that can help
place in an environment that is as comfort- augment knowledge about disease status
able as possible, equipped with access to that can enable people to make informed
information and to experts as necessary. decisions about the need to escalate home
Information technology can help make care or schedule a return visit (Asnani
this scenario possible through inpatient et al. 2016).
personal health records, where patients Example: Richard knows a lot about
and their families can view information in his heart failure, but when given the right
real time to help participate in rounds and personal health informatics tools, he
make informed decisions (Huerta et al. learned about signs and symptoms when
2017; O’Leary et al. 2016; Prey et al. 2014). his heart failure gets worse. He avoids
Mobile devices can improve communica- costly readmissions to the hospital by
tion, care plan management, and knowl- remembering to take his medications daily
edge transfer, even when family members thanks to reminders from his wearable
are not present. Other personal informat- technologies, and now knows when to
ics tools, such as wearable sensors or smart communicate with his providers through
scales, could be introduced during a hospi- his patient portal when his heart failure
tal stay to develop behavior changes that worsens.
could carry over into the home environ- 4. Personalized medicine.
ment (Steinberg et al. 2013). At its core, the maxims of patient-cen-
Example: Through her mobile phone, tered care require that any management
Beverly’s family was present remotely in plan provided by healthcare providers to
the hospital during rounds to ask the patients will need to be personalized (see
health care team questions. Her family 7 Chap. 28). Medications, procedures,
also encouraged Beverly to learn to use the supportive or curative plans all should be
hospital’s smart scale, which can monitor tailored to the person/family receiving
her weight and keep her heart failure under them. However, the augmentation of our
control. Beverly now uses her smart scale knowledge about the impact of a person’s
daily at home to send updates to her fam- omics (genomics, proteomics, metabolo-
ily and healthcare providers who can mics, etc.), and environment can now be
encourage her and help her stay out of the used to personalize therapy (Collins 2004).
hospital. Example: Using sensors on James’
3. Patient-centered care at home. phone along with his genetic information,
A significant amount of health care his providers can use targeted medications
can and should occur in the home setting. to treat his cancer and can help him meet
This setting is where social, financial, sup- his treatment goal of golfing again.
368 R. M. Cronin et al.
people in their handling of health challenges. General Hospital in the late 1950’s, computer-
Patient education companies such as Krames driven telephone systems were used to con-
would partner with organizations like the duct home-based follow-up with post-surgery
American Heart Association to provide gen- cardiac patients, calling them daily to obtain
eral printed material on heart disease or with pulse readings. This was followed by an era of
the American Cancer Society to provide infor- interactive video systems that augmented
mation on cancer. delivery of information and helped patients
Personal Health Informatics applications understand the risks and benefits associated
followed a similar trend as patient-centered with treatment options, but also to help define
medicine with early applications in the 1950’s their values for possible future health out-
and 1960’s. Collen and colleagues at Kaiser comes. The prime examples of this type of
Permanente created one of the earliest patient system originated with the Foundation for
data collection applications--a health Informed Medical Decision Making. As early
appraisal system that prompted for patient as 1973, Wennberg and others discovered that
data and returned a systematic risk appraisal the rates of many expensive surgeries and
(Collen et al. 1964). Warner Slack and other treatments would vary from location to
colleagues at the University of Wisconsin
location throughout the U.S. (Wennberg and
used a mainframe computer system as a Gittelsohn 1973). This variation seemed to
health assessment tool. Patients sat at a cath- occur for medical conditions where there were
ode ray tube (CRT) terminal and responded multiple viable treatment options and choices
to text questions, receiving a printed summary depended more on physician and resource
of their health appraisal at the end of the ses- availability than need or patient characteris-
sion (Slack et al. 1966). . Figure 11.1 shows tics. Barry et al. focused on developing inter-
an early example of the mainframe-based tool active video consumer decision aids that
developed by Slack. At Massachusetts focused on these conditions (e.g., prostate
11 cancer, breast cancer, back pain, etc.), discov-
ering that patient preferences and priorities
for possible health outcomes could vary dra-
matically from person to person, and could be
critical to defining an optimal decision (Barry
et al. 1995).
In the 1980s, clinicians and health educa-
tors capitalized on the increasingly common
personal computers as vehicles for health edu-
cation. As shown in . Fig. 11.2 The Body
Awareness Resource Network (BARN),
developed in the 1980s by Gustafson and col-
leagues at the University of Wisconsin,
engaged adolescents in game-like interactions
to help them learn about growth and develop-
ment, develop healthy attitudes towards
avoidance of risky behaviors, and rehearse
strategies for negotiating the complex inter-
personal world of adolescence (Bosworth
et al. 1983).
Another influential early personal health
informatics application was the
Comprehensive Health Enhancement Support
.. Fig. 11.1 Early use of computing in consumer
health informatics, here taking a medical history directly System (CHESS) developed by Gustafson
from a patient (Slack WV et al. (1966). NEJM with per- and colleagues in 1989 at the University of
mission) Wisconsin (Gustafson et al. 2001). CHESS
Personal Health Informatics
371 11
.. Fig. 11.2 BARN topic index a BARNY’S TOPIC INDEX
and use by teens (Bosworth et al. WHO TO
1983). This picture shows teens CALL FOR BODY CARE STRESS
interacting with a game on an
early graphical computer. The
figure on the left is the topic
index as displayed on the screen. HELP 2 3
(With permission from 1
Gustafson, D, personal
communication)
4
5
BERR
DRUGS SMOKING
6
or 7
SEX EXIT
provided women with breast cancer informa- of development, continuing through today
tion through curated articles and directories (Eysenbach et al. 2004). During this time,
of cancer services, decision-making through computer games also increased in uptake, and
charts, decision aids, and action plans, and while time-consuming and expensive to build,
emotional support through online support they were relatively easy to disseminate, and
groups. Ferguson was also heavily influential were associated with measurable changes in
in the 1980’s and 90’s in creating and analyz- knowledge (Lieberman 1988) and, in some
ing online social support groups for patients cases, symptom management (Patel et al.
(Ferguson 1996). 2006; Redd et al. 1987).
As the Internet became more available in Some major areas of growth in personal
homes, Internet support groups gathered health informatics over the past 20 years
momentum. One such example, Hopkins Teen include home telehealth, mobile health, per-
Central developed by johnson et al. (2001), sonal health records, and personal genomics.
allowed otherwise isolated children with cys- The growth of home telehealth technologies
tic fibrosis to meet virtually and to discuss has grown rapidly over the past 20 years (see
health and developmental issues that impacted 7 Chap. 20). Some notable randomized
healthy decision making. The idea of the controlled trials include Informatics for
Internet support group became an active area Diabetes Education and Telemedicine
372 R. M. Cronin et al.
(IDEATel), the Telemonitoring Study for to customers of these companies for concern
Chronic Obstructive Pulmonary Disease about false information (see 7 Chap. 28).
(COPD), and the Tele-ERA study. Large ran- Patient-centered medicine and Personal
domized controlled trials demonstrated Health Informatics have become and will con-
improved effects of these interventions. tinue to be a bigger and more important driv-
Personal health records started in the late ing force in medicine. In 1998 the Institute of
1990s with the Patient- Centered Access to Medicine (IOM) established a major program
Secure Systems Online (PCASSO) portal on Quality of Health Care in America and
(Masys and Baker 1997). Tethered personal placed patient-centered as one of the six aims
health records, commonly referred to as in their landmark paper on improving the
patient portals, have been implemented by quality of health care, Crossing the Quality
hundreds of institutions, with increasing Chasm (Institute of Medicine (US) Committee
adoption being driven by governmental policy on Quality of Health Care in America 2001).
such as the Affordable Care Act and Many studies have demonstrated improve-
Meaningful Use in the US, and the Power of ment in care using patient-centered medicine
Information strategy in the UK. There has including classic medical outcomes (Epstein
been increasing literature demonstrating and Street Jr 2011), improved shared decision
increased uptake and use of patient portals, making (Golomb et al. 2007), and reducing
and also improvement in patient satisfaction, unnecessary surgical operations.2 Personal
communication, and outcomes (Ammenwerth Health Informatics has also demonstrated
et al. 2012; Goldzweig et al. 2012). With the important improvements in medical care
advent of mobile technologies, such as smart (Gibbons et al. 2009). In a Medline search of
phones and tablets, mobile health (mHealth) “patient-centered care”, tens of thousands of
has exploded over the past 5 years (see articles have been published about patient
7 Chap. 19). While significant work has been centered medicine, with only 59 from 1950 to
11 done in the U.S. utilizing this technology, a 1992. A similar search for “consumer health
significant push of mHealth has occurred in informatics” OR “personal health informat-
low- and middle- income countries because of ics” demonstrates hundreds of articles, with
the ubiquitous nature and low cost of mobile only 4 articles from 1950 to 1992. In recogni-
phones as compared to other forms of tech- tion of the growth of scientific studies in this
nology. With the increased ability to record domain, in 2008 the MeSH term “Consumer
and review daily activities through mobile Health Information” was introduced, defined
technologies, a movement called the as “information intended for potential users
Quantified Self has evolved (Appelboom et al. of medical and health care services” (Demiris
2014). The Quantified Self movement, driven 2016). As people increase their engagement in
by the theory of patient engagement, is a fast their health, and technologies improve their
growing practice of self-monitoring driven by ability to do so, personal health informatics
technological advances and breakthroughs in will become a bigger and more important part
miniaturization of wearable and environmen- of biomedical informatics and the sub-
tal sensors. Personal genomics first became disciplines within it.
available in the early 2000s because of direct
to consumer genetic testing. Companies like
23andMe, 7 Ancestry.com, and Pathway 2 O’Connor A, Stacey, D, Rovner, D, Holmes-Rovner,
genomics provide the ability to test one’s own M, Tetroe, J, Llewellyn-Thomas, H, Entwistle, V.,
Tait, V, Rostom, A, Fiset, V, Barry, M. Institute for
genetic composition at home to discover
Healthcare Improvement: Patient decision aids for
genetic risk for diseases like breast cancer, balancing the benefits and harms of health care
ancestry, and pharmacogenomic information. options: A systematic review and meta-analysis
Issues with regulatory bodies such as the Food 2018 [cited 2018 June 22]. Available from: 7 http://
and Drug Administration has prevented all www.ihi.org/resources/Pages/Publications/Patient-
DecisionAidsforBalancingBenefitsHarmsofHealth-
information about genetic risk to be provided
CareOptions.aspx
Personal Health Informatics
373 11
Patient Characteristics
• Age Patient Activation
• Gender • Self-Efficacy
• Education • Readiness-to-Change
• Income • Motivation
Overall
• Ethnicity • Barriers
Patient Outcomes
• Culture • Knowledge
• Length of Life
• Location Medical Outcomes
• Quality of Life
• Language • Independence
Health Behaviors
• Social Support Mental Health • Satisfaction
• Physical Exercise • Cognitive
• Diet • Emotional
• Smoking • Social
Use of Technology • Alcohol/Drug Abuse
Personal Health Informatics • Medication Adherence
Physical Health Overall
• Cognitive Exercise Payer/Provider
• Physical Function
• Sleep Hygiene Outcomes
Environmental Characteristics • Physical Role
• Stress Management
• Reimbursement Policies • Pain • Health Costs
• Access to Medical Care • Clinician Satisfaction
• Internet Access • Ratings
• Environmental Quality
.. Fig. 11.3 Analytic framework showing the interplay between social, cultural, and behavioral features and the
opportunities for personal health informatics on outcomes
outcome disparities related to race and ethnic- to overcome these literacy issues such as
ity, income, and education. With the increas- adaptable interfaces and intuitive icons and
ing availability and use of personal health graphics; however, designers need to give these
technologies, we have an opportunity to considerations priority.
reduce these disparities with appropriate tar-
geted design choices. The following sections
identify some of the design challenges. 11.4.2 Digital Divide
Self-efficacy [Bandura An individual’s impression of one’s own knowledge and skill to perform any
1977] task, based on prior success, physical ability, and outside sources of persuasion.
Predicts the amount of effort a person will expend to change behavior. It is a key
component of other theories, such as the Theory of Planned Behavior.
Social cognitive theory Behavior change is determined by personal, environmental and behavioral
[Bandura 1989] elements, which are interdependent.
Theory of planned A link between attitudes and behavior. It asserts that behaviors viewed positively
behavior [Ajzen 1985] and supported by others (subjective norm) are more likely to have higher levels
of motivation and more likely to be performed.
Transtheoretical/stages of This model asserts that behavioral change is a 5-step process, between which a
change model [Prochaska person may oscillate before achieving complete change.
2005]
Patient engagement/patient Patient engagement describes the actions one can take to achieve maximum
activation [Dentzer 2013; benefits from the healthcare services available. Patient activation is a person’s
Hibbard 2004] knowledge, confidence, and skills used to manage their health. Improved
engagement and activation have been associated with improved healthcare
outcomes
Early games targeted at behavior change (described above) attempted to remove the stigma (attitude) associ-
ated with engaging in healthy behaviors (such as taking medications to combat a chronic illness)
11 teristics impact both the development of ity). Data collected by patients who may be
methods to acquire and store data, as well as less familiar with healthcare terminology add
applications of these methods to impact care a layer of complexity to the variety, veracity,
and discovery. To provide a frame of reference and vulnerability challenges inherent in these
about the impact of person-centered care on data. Data collected using commonly avail-
the field, we will describe the magnitude of able sensor technologies add to the velocity
change using selected examples covered else- and volume challenges, but also present new
where in the book. opportunities. These, and other new chal-
lenges imposed by the addition of personal
data into the healthcare/discovery system are
11.5.1 Data Science new opportunities for personal health infor-
matics research.3
The field of data science has been impacted
greatly by personal health informatics meth-
ods and advances. This discipline, which is 11.5.2 Precision Medicine
focused on the use of scientific processes,
algorithms and systems to extract knowledge Perhaps no area has received greater atten-
and insights from both structured and tion in the last 10 years than precision medi-
unstructured data, addresses a number of cine. The goal of precision medicine as noted
concerns inherent in so-called “big data”
(Provost and Fawcett 2013). These data come
in large amounts (volume), with often fast 3 Technology OotNCfHI. Conceptualizing a Data
speeds of arrival and (velocity), varying for- Infrastructure for the Capture, Use, and Sharing of
Patient-Generated Health Data in Care Delivery
mats (variety), varying timing (variability), and Research through 2024, 2018. Available from:
unclear accuracy/validity (veracity), and sig- 7 https://www.healthit.gov/sites/default/files/onc_
nificant risks to patient privacy (vulnerabil- pghd_final_white_paper.pdf
Personal Health Informatics
377 11
in the Mission statement of the NIH’s All of Search engines regularly exploit this opportu-
Us initiative, is “to enable a new era of medi- nity to create a profile of each searcher and to
cine through research, technology and polices improve the relevance of retrieved results
that empower patients, researchers and pro- (Frisse 1996). We can expect the use of these
viders to work together toward develop- data to impact how medical care is personal-
ment of individualized treatments” (All of ized. Data created by consumers, coupled
Us Research Program Investigators 2019) with ubiquitous computing, might provide
(7 https://obamawhitehouse.a rchives.g ov/ just–in-time nutritional consults, over the
precision-medicine) (see 7 Chap. 28). Patient counter medication advice, or advice that
empowerment in this new era of medicine will might prevent illnesses, such as convenient
take many forms, and will question how we locations to receive a flu vaccine or when to
help patients with concepts such as contribut- begin medications for seasonal allergies (Guo
ing personal data to empower discovery, under- et al. 2016; Pellegrini et al. 2018; Swendeman
standing a radically new way to subcategorize et al. 2015).
diseases according to patient-specific attributes, This technology, which is likely to improve
learning a new “trust” model to understand the user experience and functionality available
why two patients with the same disease may with consumer-facing technologies, also has
have very different treatment plans, and even significant downside risks to patient privacy.
recognizing an expanding role for consum- In particular, data from email, internet
ers in establishing and critiquing health policy searches, support group chats, genome risk
(Adams and Petersen 2016; Juengst et al. 2012). prediction sites, and mobile or cloud-based
apps all may be directly identifiable, or may be
combined with data from other sources to be
identifiable. Together, they may be used to cre-
11.5.3 thical, Legal and Social
E ate a profile of an individual, his or her health
Issues status, the health status of related individuals,
or other profiles, all of which may be useful
As our understanding about the implications for targeted advertising, insurance risk,
of adopting a person-centered care philoso- employability, or other purposes. Biomedical
phy mature, so do the ways in which this phi- informatics research in the person-centered
losophy should govern the treatment of the care era focuses on such topics as the bound-
person and her data and information. This aries of HIPAA protection, genomic privacy,
philosophy introduces new ethical, legal and and re-identification risk.
social issues that are deserving of thought to A digital divide can lead to serious ethical
maximize the effective and safe use of these issues. If technological interventions are only
data and applications (see 7 Chap. 12). For available or usable to a segment of the popula-
example, concerns raised by the access to tion, this imbalance can threaten the impact
decision-making applications became the of these technologies on improving healthcare
impetus for thoughtful discussion about the for all and increase healthcare disparities.
timing of FDA regulation of mHealth apps Certain interventions, such as mHealth and
(Lye et al. 2018). the Internet of Things technologies, require a
One of the most exciting, though poten- certain digital literacy. Installing and main-
tially alarming consequences of our extensive taining these devices with the tremendous
use of the Web for shopping, communicating, amount of data they generate can be daunting
and learning is that each of us leaves behind a to populations with lower literacy and numer-
profile of who we are, what we like or dislike, acy as well as those with low proficiency levels
what we know or don’t know, and what we for problem solving in technology-rich envi-
want or already have. When combined with ronments. Also, mobile phones made by dif-
data mining and natural language processing ferent companies can have different
techniques, it is possible to create highly tar- capabilities. At the present time, Android
geted and predictive personal knowledge. phones are widely used by people with lower
378 R. M. Cronin et al.
socioeconomic status. If Apple iOS apps are 11.5.5 obile Health Care
M
shown to be more effective than Android, lim- (mHealth)
ited access may increase the disparities and
effect of these interventions for many people. Perhaps the most significant change in the
Finally, many mHealth apps are only available landscape of personal health informatics has
in English, which further increases the digital been the adoption of “smartphone” technol-
divide. ogy into society. Smartphones are mobile
While the direction that personal health phones that perform many functions found
informatics will take in the future is at best, on present-day computers. They typically
educated speculation, it is clear that as long as contain a touch screen interface, camera,
patient-provider partnerships are endorsed, Internet access, short-range wireless intercon-
technology will be a third partner in ensuring nection technology, and an operating system
that activated people manage their health and capable of executing downloaded applica-
disease effectively. tions. Smartphone ownership has grown
worldwide, with an estimated 81% ownership
by adults in the United States.4 When com-
11.5.4 Communication bined with a new generation of wireless or
connected peripheral technologies (imaging
Inherent in all aspects of information man- tools, wearable sensors, monitoring systems,
agement is the recognition about the audi- etc.) downloadable applications (apps) have
ence to whom information is being revolutionized information collection and use
communicated. Perhaps no field has a larger by people, and have defined a new field called
gap between what is understood by its pro- mobile health care (mHealth) (see 7 Chap.
fessionals and its consumers than health 19) (Cameron et al. 2017).
care. Numerous studies have demonstrated
11 issues that persist with the age-old challenge
One of the main byproducts of the
mHealth era has been a radical improvement
of how to educate people about diseases. in consumer empowerment, coupled with
These challenges are magnified as the scope information sharing that enables individual
of consumer engagement broadens to include groups to make “informed” decisions about
many of the topics listed above, and as we their care with or without the assistance of a
use data and evidence for care enters every- healthcare professional. With these new capa-
day discourse with patients (McCormack bilities come enormous opportunities for bio-
et al. 2013). Furthermore, the separation of medical informatics to influence the entire
information communication to and from healthcare system. Terms such as “quantified
medical professionals creates opportunities self ” (Dudley et al. 2015) and “Internet of
for misunderstanding at best, and inappro- Things” (Dimitrov 2016) begin to characterize
priate actions being taken by patients at the potential of mHealth.
worse (Isaacs and Creinin 2003; Morgan It is through the use of mHealth applica-
2013). This is an area ripe for research and tions that the notion of personal health infor-
evaluation by biomedical informatics profes- matics has grown beyond the individual’s
sionals, much of which is already underway clinical needs to population-level care needs.
in areas such as methods to circumvent liter-
acy and numeracy challenges. It is clear that
there is a correlation between health literacy
and quality of life (Zheng et al. 2018), but 4 Pew Research Center. Smartphone ownership is
also clear that more research needs to be growing rapidly around the world, but not always
equally. Available from: 7 https://www.pewre-
done to understand how to communicate in
search.org/global/2019/02/05/smartphone-owner-
the face of this reality (Newnham et al. 2017; ship-is-growing-rapidly-around-the-world-but-not-
Fisher et al. 2016). always-equally/. Accessed November 9, 2019.
Personal Health Informatics
379 11
Indeed, innovations such as Apple’s attributes, establish or break connections to
ResearchKit © and the explosion of wearable other members, communicate, and share
fitness trackers connected to social networks information. This simple strategy creates a
are designed for smartphones and mHealth virtually unlimited method to connect similar
technologies. These technologies also are people to one another, and has been shown to
being positioned improve the structure of be an effective tool to connect people with
healthcare delivery, through innovations such specific health needs (Moorhead et al. 2013).
as appointment self-scheduling, direct-to- The for-profit online health-related social net-
consumer e-consults, and peripheral devices working community Patients Like Me has
that make home diagnoses commonplace demonstrated that individuals with a severe
(Topol 2015; Kawano et al. 2012). chronic disease—amyotrophic lateral sclero-
Like any foundational change in biomedi- sis—are highly willing, even without compen-
cal informatics, the advent of mHealth creates sation, to contribute data and observations to
new paradigms for concepts such as usability a patient community (Frost and Massagli
and usefulness. Unique characteristics of peo- 2008) to accelerate learning about their dis-
ple (literacy, numeracy, language differences) ease. The site has no ties to the conventional
must be kept in mind and considered, along healthcare system and short-circuits the tradi-
with the capabilities of people and their liv- tional research enterprise, rewarding partici-
ing, working, and social environments. pants, not just researchers, with knowledge.
Research in user-centered design, usability The patient outcomes of diverse therapies are
assessment, failure modes and effects analysis, collected using crowd sourcing, where patients
and other techniques to assure safe and effec- contribute their information to a common
tive use are increasingly critical to advances in database that can be queried to obtain sum-
mHealth (see 7 Chap. 5) (Overdijkink et al. maries of an aggregated experience of their
2018; Matthew-Maich et al. 2016). peers.
Another significant challenge for mHealth Social networking Web sites share most or
is integration into clinical workflows. If all of the features of electronic support
healthcare providers are not prescribing groups, and even some data commonly pro-
mHealth apps or using their data, patients vided through a portal (through creating an
may be less likely to use them if they cannot affiliation with a group who externalizes pub-
engage their provider in shared decision mak- lic or private information). Social networking
ing. In other personal health informatics platforms combined with personal health
tools, such as patient portals, adoption and records provide a means for social network
promotion of patient portal usage by provid- members to share and aggregate data obtained
ers leads to increased usage by patients from the traditional health system, and to do
(Cronin et al. 2015). It will be critical to so in a private manner (Eysenbach 2008;
improve usefulness of the vast amount of Weitzman et al. 2011a).
data generated by mHealth, aid provider and One of the features of health-related
patients in choosing and using mHealth apps, online social networks is the rapid dissemina-
and determine the appropriate touch points tion of information across a network; how-
of these interventions between providers and ever, there is great variability in the quality of
patients, which will enable the potential of discourse on health-related social networking
mHealth in the future. sites. Conversations may be moderated, in cer-
tain cases by a health coach (Jimison et al.
2007). Conversations also may be unmoder-
11.5.6 Social Network Systems ated and commercial influences may enter the
discourse without transparency. There are
Social network systems, epitomized by also concerns around privacy. Compared with
Facebook (7 www.facebook.com), are online the restrictive institutional consents and com-
virtual communities where participants pacts with patients that limit use of data and
describe themselves with member-entered specimens under federal regulations applica-
380 R. M. Cronin et al.
ble to much federally sponsored research, change, along with new EHR portal
online social networks are generally governed application capabilities.
by no more than a terms of use statement,
often subject to change without notice in
30 days. These privacy policies may be diffi- 11.5.8 Application Example:
cult to find and not written in language acces- Personal Health Records
sible by a population with a broad range of
health literacy (Weitzman et al. 2011b). According to the Markle Foundation, a per-
Industry standards governing safety and pri- sonal health record (PHR) is “an electronic
vacy of online health-related social network- application through which individuals can
ing are yet to emerge. access, manage and share their health infor-
mation, and that of others for whom they are
authorized, in a private, secure, and confiden-
11.5.7 Application Example: EHR tial environment” Connecting for Health
Portals Personal Health Working Group (2003). Like
the EHR portal, the PHR has become the
As the electronic health record gains accep- foundation for developing tools to store data
tance, its relevance to individual people also and to facilitate its reuse in ways people find
grows. Many hospitals and clinics have begun engaging.
providing direct patient access to the clinical The idea of a personal repository for med-
record. These portals are defined as person- ical information is far from new. Families with
facing systems tethered to electronic health infants have used an immunization record
records, allowing them views of clinical or book for decades. The immunization blue
claims data in institutional electronic health book is a quintessential, efficient system with
11 record systems or payer systems (Tang et al. portable information that supports entry by
2006; Kim and Johnson 2002). Portals pro- multiple providers and storage by the patient.
vide motivated people with a way to electron- Clayton Christensen, who invented the con-
ically access sections of their records to recall cept of “disruptive innovation,” summarizes
salient instructions or obtain results of tests. the widely held promise of this technology in
Some of the first such personal health portals his book, the Innovator’s Prescription: A
were Columbia’s PatCIS system (Cimino Disruptive Solution for Health Care
et al. 2002) and Beth Israel Deaconess’s (Christensen et al. 2009). “We cannot over-
PatientSite, developed in 1999 (Weingart state how important PHRs are to the efficient
et al. 2006). Two of the most widely deployed functioning of a low-cost, high quality health-
portals are Epic’s MyChart (Serrato et al. care system.” PHRs enable users to acquire
2007) and MyHealtheVet (Nazi and Woods copies of their data from every site of care. In
2008). Many of these portals provide capa- some ways, this model advances information
bilities besides simply viewing EHR informa- flow far more than models requiring inter-
tion, such as secure physician-patient institutional data sharing agreements. Data
messaging, appointment scheduling, provid- from two competing healthcare networks may
ing educational information, and viewing and reside in the same PHR without cumbersome
managing medical bills. One of the more agreements between those two networks. The
recent additions to this set of capabilities is patient asserts her claim to the data for each
the OpenNotes effort started by network independently. This model of data
MyHealtheVet, which exposes every progress aggregation may promote data liquidity far
note to patients, instead of exposing only dis- more than competing approaches, such as
charge summaries. As new data types are pro- health information exchanges, which require
vided to the healthcare system by people in centralized management of data sharing
support of their health, we can expect the agreements between networks and institutions
data types exposed through EHR portals to (Adler-Milstein et al. 2008).
Personal Health Informatics
381 11
In part fueled by the knowledge gained management, and voice statistics for mood
during the Robert Wood Johnson management).
Foundation’s Project HealthDesign, various In health care there is an increasing need
companies in the early 2000’s developed com- to manage chronic conditions more effectively
mercially available personal health records. by empowering patients and family caregivers
While some of these remain viable, many with more active roles in self management.
from that era were discontinued, largely due Sensors in the home and environment (includ-
to the complex nature of establishing interop- ing wearables) provide important input to
erability with external data sources, as well as algorithms that infer patient state and deliver
unsustainable financial models.5 Recently, tailored feedback and motivational messaging.
however, as data liquidity and standards pro- . Figure 11.4 shows patient-generated sensor
moting interoperability have been mandated data being aggregated in a local device (typi-
through Federal legislation, there has been a cally a smartphone) and transferred with
resurgence of activity to create PHRs from strict security protocols to a secure server. The
both small start-up companies and large inference algorithms in real time then gener-
EHR vendors. The future is still uncertain, ate messaging and summary content for the
but this suite of applications continues to be a patient, as well as a health coach, clinician,
likely foundation for more sophisticated ser- and remote or local family caregiver (Pavel
vices and applications used by people in sup- et al. 2015).
port of their health or illness management For disease management interventions
(Staccini et al. 2018). typical sensors include wireless weight scales
for fluid management, wireless blood pressure
cuffs for cardiac disease, blood glucose meters
11.5.9 Application Example: for patients with diabetes, and peak flow
Sensors for Home meters for those with asthma. Additionally,
Monitoring and Tailored many disease management protocols include
weight management, physical exercise and
Health Interventions medication management. . Figure 11.5a–c
show examples of sensor technology used in
With rapidly advancing technologies, sensors
home health settings. Motion sensors, as
that measure and monitor are everywhere. We
shown in . Fig. 11.5a, can be used to deter-
see an increasing population interested in
mine real-time patient location for inferring
monitoring their activity levels with wrist
context as well as for measuring walking speed
devices that now measure movement (con-
statistics (a useful cognitive indicator) (Hagler
verted into steps or calories burned), heart
et al. 2010). Sleep quality can now be mea-
rate, and electrodermal activity (for stress
sured with varying accuracy with techniques
level). There are wireless weight scales that are
ranging from accerometers in wrist fitness
useful for weight management of everyone,
trackers to pressure sensitive bed strips placed
and for fluid management of heart failure
under the mattress that detect heart rate, heart
patients. Even smartphones have a myriad of
rate variability (for stress recovery at night),
embedded sensors useful for managing one’s
respiration, as well as total sleep time and
health (e.g., GPS for location context, motion
sleep efficiency. Another more complex
for activity level, light and noise level for sleep
approach to sensing in the home involves
imaging, as shown with the interactive video
exercise in . Fig. 11.5c. In this case, data
5 Google Official Blog. An update on Google Health from the Kinect camera is used to detect
and Google PowerMeter. Available at: 7 https:// movement compared to goal state and pro-
googleblog.blogspot.com/2011/06/update-on- vide just-in-time feedback to the user (Jimison
google-health-and-google.html. Accessed Novem-
et al. 2015; Obdrzalek et al. 2012; Ofli et al.
ber 9, 2019.
382 R. M. Cronin et al.
.. Fig. 11.4 This diagram shows how sensor data from interest in improving health, can be used as input to a
the home or environment, generated by a patient who coaching platform to provide tailored motivation and
may have a chronic condition or an individual with an feedback
11
.. Fig. 11.5 This series of images shows examples of vidual interacting with a nurse care manager using a
sensors and technology used to gain information about remote controlled Double Robot. c shows in-home chair
an individual’s state and provide tailored just-in-time exercises with real-time feedback using data from the
feedback. a shows a motion sensor near the door, sen- Kinect camera. (With permission from the Consortium
sors on a smart cane, a presence lamp, wireless blood on Technology for Proactive Care at Northeastern Uni-
pressure cuff, and an Amazon Echo. b shows an indi- versity. Photo courtesy of Dr. Holly Jimison)
2016). Inferences on strength, flexibility, bal- measurements over time of motor speed,
ance and endurance can be monitored over search time and cognitive load (Hagler et al.
time and provided to both the user and clini- 2011, 2014).
cian. Interactive voice messaging systems The streaming data from a variety of sen-
(e.g., Amazon Echo or Google Home) have an sors in the home and environment can be
important role in home health interventions, overwhelming from a clinical perspective.
both as communication devices, but also as . Figure 11.6 shows a sample phone inter-
sensors of voice affect. Finally, interactions face for coordinating information from a vari-
with computers, tablets and smartphones pro- ety of sensors (Williamson 2015). This
vide valuable information on cognitive perfor- example shows a summary main screen for a
mance, both specifically with adaptive patient or caregiver with feedback on calen-
cognitive computer games (Hagler et al. 2014; dar to-do’s, adherence to goals, medication
Jimison et al. 2010), but also with indirect taking (an issue in taking medications noted
Personal Health Informatics
383 11
Even worldwide, the access to mobile phones
is becoming nearly ubiquitous, and the afford-
ability of health sensors and devices for
continuous monitoring and just-in-time inter-
vention is also improving rapidly. However,
we also see upcoming challenges in the areas
of payment models and equity.
11.6.1 Reimbursement
and Business Models
Many countries have global budgets for health
care, usually managed at a regional level,
where it is possible to allocate funds for cost-
effective health interventions that personal
health technologies may enable. However,
medical care reimbursement in the United
States is moving slowly towards value-based
care. Healthcare systems in the U.S. require
incentives and fairly short-term business
model demonstrations to modify their work-
flow and hiring practices for a new model of
value-based care. This model of care would
bring healthcare consumers and family mem-
.. Fig. 11.6 This example shows a main screen for a bers as integral members of the care team,
patient or caregiver summary information. (Reprinted facilitated by personal health technologies.
with permission of author, S. Williamson)
from sensors could allow for improved assess- Office of the Assistant Secretary for Planning and
ment of people and their health. Evaluation, U.S. Department of Health &
Many of the innovations, however, will Human Services. Conceptualizing a data
need to be social and protocol-based. For infrastructure for the capture and use of
example, new workflow and hiring practices patient-generated health data. Available at:
will be needed to compensate for the data and https://aspe.h hs.g ov/conceptualizing-d ata-
information these innovations will create. An infrastructure-capture-and-use-patient-gener-
increased emphasis on proactive person- ated-health-data. Accessed 1 Nov 2019. This
centered care to improve outcomes and reduce paper describes a project sponsored by the
costs will necessitate better use of community Office of the National Coordinator for Health
health workers and health behavior change Information Technology (ONC) regarding a
coaches that interface with both the patient data infrastructure for patients to share their
and the clinical team. One of the most excit- data with caregivers, providers, researchers,
ing, though potentially alarming consequences and others according to their preferences.
of our extensive use of the Web for shopping, Prey, J. E., Woollen, J., Wilcox, L., Sackeim,
communicating, and learning is that each of A. D., Hripcsak, G., Bakken, S., et al. (2014).
us leaves behind a profile of who we are, what Patient engagement in the inpatient setting: A
we like or dislike, what we know or don’t know, systematic review. Journal of the American
and what we want or already have. When com- Medical Informatics Association, 21(4), 742–
bined with data mining and natural language 750. This paper reviews literature involving
processing techniques, it is possible to create patient engagement in the hospital setting.
highly targeted and predictive personal knowl- The authors identify challenges such as incon-
edge. Data created by consumers, coupled sistent use of termionology and gaps ini
with ubiquitous computing, might provide knowledge regarding impact on health out-
just–in-time nutritional consults, over the comes and cost-effectiveness.
11 counter medication advice, or advice that Topol, E. J. (2015). The patient will see you now:
might prevent illnesses, such as convenient The future of medicine is in your hands.
locations to receive a flu vaccine or when to New York: Basic Books. This popular book
begin medications for seasonal allergies. We describes the author’s vision for medicine
can expect the use of these massive data sets based on patient-centered health care, mobile
(also called “big data”) to impact how medical health, and consumer health informatics.
care is personalized. While the direction that
consumer health informatics will take in the ??Questions for Discussion
future is at best, educated speculation, it is 1. What is the role of the health system in
clear that as long as patient-provider partner- monitoring the quality of discourse on
ships are endorsed, technology will be a third online social networks?
partner in ensuring that activated consumers 2. What is the optimal model for personal
manage their health and disease effectively. health records? Should personal health
records display advertisements?
nnSuggested Readings 3. Which populations of consumers would be
Berwick, D. M. (2009). What ‘patient-centered’ most likely to use personal health records?
should mean: confessions of an extremist. 4. Which consumer technologies do you
Health Affairs, 28(4), w555–ww65. This paper think will be most influential in
is written by Dr. Donald Berwisk, an influ- consumer-focused health informatics?
ential proponent of patient-centered health 5. What is the right balance between
care and former administrator of the Centers privacy of personal health information
for Medicare and Medicaid Services (CMS). and ready access to it? For example, for
The paper describes a number of maxims of an unconscious patient in the
patient-centered care. emergency department?
Personal Health Informatics
385 11
References JAMA, 288(15), 1909–1914. Retrieved from https://
www.ncbi.nlm.nih.gov/pubmed/12377092
Bosworth, K., Gustafson, D. H., Hawkins, R. P.,
Adams, S. A., & Petersen, C. (2016). Precision medicine:
Chewning, B., & Day, T. (1983). Adolescents, health
Opportunities, possibilities, and challenges for
education, and computers: The body awareness
patients and providers. Journal of the American
resource network (BARN). Health Education, 14(6),
Medical Informatics Association, 23(4), 787–790.
58–60.
https://doi.org/10.1093/jamia/ocv215.
Brennan, P. F., Downs, S., Casper, G., & Kenron, D.
Adler-Milstein, J., McAfee, A. P., Bates, D. W., & Jha,
(2007). Project HealthDesign: Stimulating the next
A. K. (2008). The state of regional health informa-
generation of personal health records. Paper
tion organizations: Current activities and financing.
presented at the AMIA annual symposium
Health Affairs (Millwood), 27(1), w60–w69. https://
proceedings.
doi.org/10.1377/hlthaff.27.1.w60.
Cameron, J. D., Ramaprasad, A., & Syn, T. (2017). An
All of Us Research Program Investigators. (2019). The
ontology of and roadmap for mHealth research.
“All of Us” research program. New England Journal
International Journal of Medical Informatics, 100,
of Medicine, 381(7), 668–676. Chicago.
16–25. https://doi.org/10.1016/j.ijmedinf.2017.01.
Ammenwerth, E., Schnell-Inderst, P., & Hoerbst, A.
007.
(2012). The impact of electronic patient portals on
Carr, D. (2007). The global digital divide. Contexts, 6(3),
patient care: A systematic review of controlled tri-
58.
als. Journal of Medical Internet Research, 14(6).
Cassell, E. J. (1985). Talking with patients: Clinical tech-
https://doi.org/10.2196/jmir.2238.
nique (Vol. 2). Cambridge, MA: Mit Press.
Appelboom, G., LoPresti, M., Reginster, J.-Y., Sander
Christensen, C. M., Grossman, J. H., & Hwang, J.
Connolly, E., & Dumont, E. P. (2014). The quanti-
(2009). The innovator’s prescription : A disruptive
fied patient: A patient participatory culture. Current
solution for health care. New York: McGraw-Hill.
Medical Research & Opinion. Taylor & Francis,
Cimino, J., Patel, V., & Kushniruk, A. (2002). The
30(12), 2585–2587.
patient clinical information system (PatCIS):
Asnani, M. R., Quimby, K. R., Bennett, N. R., &
Technical solutions for and experience with giving
Francis, D. K. (2016). Interventions for patients and
patients access to their electronic medical records.
caregivers to improve knowledge of sickle cell dis-
International Journal of Medical Informatics, 68(1–
ease and recognition of its related complications.
3), 113–127.
Cochrane Database of Systematic Reviews, 10,
Coleman, M. T., & Newton, K. S. (2005). Supporting
Cd011175. https://doi.org/10.1002/14651858.
self-management in patients with chronic illness.
CD011175.pub2.
American Family Physician, 72(8), 1503–1510.
Balint, E. (1969). The possibilities of patient-centered
Coleman, K., Austin, B. T., Brach, C., & Wagner, E. H.
medicine. The Journal of the Royal College of
(2009). Evidence on the Chronic Care Model in the
General Practitioners, 17(82), 269.
new millennium. Health Affiars (Millwood), 28(1),
Barry, M. J., Fowler, F. J., Mulley, A. G., Henderson,
75–85. https://doi.org/10.1377/hlthaff.28.1.75.
J. V., & Wennberg, J. E. (1995). Patient reactions to
Collen, M. F., Rubin, L., Neyman, J., Dantzig, G. B.,
a program designed to facilitate patient participa-
Baer, R. M., & Siegelaub, A. (1964). Automated
tion in treatment decisions for benign prostatic
multiphasic screening and diagnosis. American
hyperplagia. Medical Care, 33(8), 771–782.
Journal of Public Health and the Nations Health,
Berkman, N. D., Sheridan, S. L., Donahue, K. E.,
54(5), 741–750.
Halpern, D. J., Viera, A., Crotty, K., et al. (2011).
Collins, F. S. (2004). The case for a US prospective cohort
Health literacy interventions and outcomes: An
study of genes and environment. Nature, 429(6990),
updated systematic review. Evidence Report
475–477. https://doi.org/10.1038/nature02628.
Technology Assessment (Full Report), 199(1), 941.
Connecting for Health Personal Health Working Group.
Retrieved from https://www.ncbi.nlm.nih.gov/
(2003). The Personal Health Working Group: final
pubmed/23126607
report. Markle Foundation.
Berwick, D. M. (2009). What ‘patient-centered’ should
Cronin, R. M., Davis, S. E., Shenson, J. A., Chen, Q.,
mean: Confessions of an extremist. Health Affairs,
Rosenbloom, S. T., & Jackson, G. P. (2015). Growth
28(4), w555–w565.
of secure messaging through a patient portal as a
Bodenheimer, T., Lorig, K., Holman, H., & Grumbach,
form of outpatient interaction across clinical spe-
K. (2002a). Patient self-management of chronic
cialties. Applied Clinical Informatics, 6(2), 288–304.
disease in primary care. JAMA, 288(19), 2469–
https://doi.org/10.4338/ACI-2014-12-RA-0117.
2475.
Demiris, G. (2016). Consumer health informatics: Past,
Bodenheimer, T., Wagner, E. H., & Grumbach, K.
present, and future of a rapidly evolving domain.
(2002b). Improving primary care for patients with
Yearbook of Medical Informatics, 25(Suppl 1), S42.
chronic illness. JAMA, 288(14), 1775–1779.
Dentzer, S. (2013). Rx for the ‘blockbuster drug’of
Bodenheimer, T., Wagner, E. H., & Grumbach, K.
patient engagement. Health Affairs (Project Hope),
(2002c). Improving primary care for patients with
32(2), 202.
chronic illness: the chronic care model, Part 2.
386 R. M. Cronin et al.
Dimitrov, D. V. (2016). Medical internet of things and online patient community: What can happen when
big data in healthcare. Healthcare Informatics patients have access to one another’s data. Journal
Research, 22(3), 156–163. https://doi.org/10.4258/ of Medical Internet Research, 10(3), e15. https://doi.
hir.2016.22.3.156. org/10.2196/jmir.1053.
Dudley, J. T., Listgarten, J., Stegle, O., Brenner, S. E., & Gallego, B., Walter, S. R., Day, R. O., Dunn, A. G.,
Parts, L. (2015). Personalized medicine: From geno- Sivaraman, V., Shah, N., et al. (2015). Bringing
types, molecular phenotypes and the quantified self, cohort studies to the bedside: Framework for a
towards improved medicine. In Pacific symposium ‘green button’ to support clinical decision-making.
on biocomputing co-chairs (pp. 342–346). Retrieved Journal of Comparative Effectiveness Research, 4(3),
from https://www.ncbi.nlm.nih.gov/pubmed/ 191–197. https://doi.org/10.2217/cer.15.12.
25592594 Gibbons, M. C., Wilson, R. F., Samal, L., Lehman,
Engel, G. L. (1977). The need for a new medical model: C. U., Dickersin, K., Lehmann, H. P., et al. (2009).
A challenge for biomedicine. Science, 196(4286), Impact of consumer health informatics applications.
129–136. Evidence Report/Technology Assessment, 188, 1.
Enthoven, A. C. (2009). Integrated delivery systems: Goldzweig, C. L., Towfigh, A. A., Paige, N. M.,
The cure for fragmentation. American Journal of Orshansky, G., Haggstrom, D. A., Beroes, J. M., et al.
Managed Care, 15(10 Suppl), S284–S290. Retrieved (2012). Systematic review: Secure messaging between
from https://www.ncbi.nlm.nih.gov/pubmed/ providers and patients, and patients’ access to their own
20088632 medical record: Evidence on health outcomes, satisfac-
Epstein, R. M., & Street, R. L., Jr. (2011). The values tion, efficiency and attitudes. Washington, DC:
and value of patient-centered care. Annals of Family Department of Veterans Affairs (US).
Medicine, 9(2), 100–103. https://doi.org/10.1370/ Golomb, B. A., McGraw, J. J., Evans, M. A., & Dimsdale,
afm.1239. J. E. (2007). Physician response to patient reports of
Ernest, J., III, Wilson, E. J., & Wilson III, E. J., III. adverse drug effects. Drug Safety, 30(8), 669–675.
(2004). The information revolution and developing Guo, J., Zhou, X., Sun, Y., Ping, G., Zhao, G., & Li, Z.
countries. Cambridge, MA: MIT Press. (2016). Smartphone-based patients’ activity recog-
Eysenbach, G. J. (2008). Medicine 2.0: Social network- nition by using a self-learning scheme for medical
ing, collaboration, participation, apomediation, and monitoring. Journal of Medical Systems, 40(6), 140.
openness. Journal of Medical Internet Research, https://doi.org/10.1007/s10916-016-0497-2.
11 10(3), e22.
Eysenbach, G., Powell, J., Kuss, O., & Sa, E.-R. (2002).
Gustafson, D. H., Hawkins, R., Pingree, S., McTavish,
F., Arora, N. K., Mendenhall, J., et al. (2001). Effect
Empirical studies assessing the quality of health of computer support on younger women with breast
information for consumers on the world wide web: cancer. Journal of General Internal Medicine, 16(7),
A systematic review. JAMA, 287(20), 2691–2700. 435–445.
Eysenbach, G., Powell, J., Englesakis, M., Rizo, C., & Hagler, S., Austin, D., Hayes, T. L., Kaye, J., & Pavel, M.
Stern, A. (2004). Health related virtual communities (2010). Unobtrusive and ubiquitous in-home moni-
and electronic support groups: Systematic review of toring: A methodology for continuous assessment
the effects of online peer to peer interactions. BMJ, of gait velocity in elders. IEEE Transactions on
328(7449), 1166. Biomedical Engineering, 57(4), 813–820.
Fehske, A., Fettweis, G., Malmodin, J., & Biczok, G. Hagler, S., Jimison, H., & Pavel, M. (2011). Unobtrusive
(2011). The global footprint of mobile communica- in-home assessment by means of everyday computer
tions: The ecological and economic perspective. mouse usage. Paper presented at the Engineering in
IEEE Communications Magazine, 49(8). Medicine and Biology Society (EMBC), 2011
Ferguson, T. (1996). Health online: How to find health annual international conference of the IEEE.
informaiton, support groups, and self help communi- Hagler, S., Jimison, H. B., & Pavel, M. (2014). Assessing
ties in cyberspace. The Millenium Whole Earth executive function using a computer game:
Catalogue. Reading, MA: Da Capo Press. Computational modeling of cognitive processes.
Fisher, A., Manicavasagar, V., Kiln, F., & Juraskova, I. IEEE Journal of Biomedical and Health Informatics,
(2016). Communication and decision-making in 18(4), 1442–1452.
mental health: A systematic review focusing on Hibbard, J. H., & Greene, J. (2013). What the evidence
bipolar disorder. Patient Education and Counseling, shows about patient activation: Better health out-
99(7), 1106–1120. https://doi.org/10.1016/j. comes and care experiences; fewer data on costs.
pec.2016.02.011. Health Affiars (Millwood), 32(2), 207–214. https://
Frisse, M. E. (1996). What is the Internet learn- doi.org/10.1377/hlthaff.2012.1061.
ing about you while you are learning about the Hibbard, J. H., Stockard, J., Mahoney, E. R., & Tusler,
Internet? Academic Medicine, 71(10), 1064–1067. M. (2004). Development of the Patient Activation
Retrieved from https://www.ncbi.nlm.nih.gov/ Measure (PAM): Conceptualizing and measuring
pubmed/9177639 activation in patients and consumers. Health
Frost, J. H., & Massagli, M. P. (2008). Social uses of per- Services Research, 39(4 Pt 1), 1005–1026. https://
sonal health information within PatientsLikeMe, an doi.org/10.1111/j.1475-6773.2004.00269.x.
Personal Health Informatics
387 11
Hibbard, J. H., Peters, E., Dixon, A., & Tusler, M. Kellerman, R., & Kirk, L. (2007). Principles of the
(2007). Consumer competencies and the use of com- patient-centered medical home. American Family
parative quality information: It isn’t just about lit- Physician, 76(6), 774.
eracy. Medical Care Research and Review, 64(4), Keselman, A., Logan, R., Smith, C. A., Leroy, G., &
379–394. Zeng-Treitler, Q. (2008). Developing informatics
Hoffman, C., Rice, D., & Sung, H.-Y. (1996). Persons tools and strategies for consumer-centered health
with chronic conditions: Their prevalence and costs. communication. Journal of the American Medical
JAMA, 276(18), 1473–1479. Informatics Association, 15(4), 473–483.
Holman, H., & Lorig, K. (2004). Patient self- Kim, M. I., & Johnson, K. B. (2002). Personal health
management: A key to effectiveness and efficiency in records: Evaluation of functionality and utility.
care of chronic disease. Public Health Reports, Journal of the American Medical Informatics
119(3), 239–243. Association, 9(2), 171–180. Retrieved from http://
Huerta, T. R., McAlearney, A. S., & Rizer, M. K. (2017). www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retri
Introducing a patient portal and electronic tablets to eve&db=PubMed&dopt=Citation&list_
inpatient care. Annals of Internal Medicine, 167(11), uids=11861632.
816–817. https://doi.org/10.7326/m17-1766. Kleinman, A. (1988). The illness narratives: Suffering,
Institute of Medicine (US) Committee on Quality of healing, and the human condition. New York: Basic
Health Care in America. (2001). Crossing the quality Books.
chasm: A new health system for the 21st century. Kontos, E., Blake, K. D., Chou, W.-Y. S., & Prestin, A.
Washington, DC: National Academy Press, ©2001. (2014). Predictors of eHealth usage: Insights on the
Isaacs, J. N., & Creinin, M. D. (2003). Miscommunication digital divide from the Health Information National
between healthcare providers and patients may Trends Survey 2012. Journal of Medical Internet
result in unplanned pregnancies. Contraception, Research, 16(7), e172.
68(5), 373–376. Retrieved from https://www.ncbi. Korsch, B. M. (1989). Current issues in communication
nlm.nih.gov/pubmed/14636942 research. Health Communication, 1(1), 5–9.
Jimison, H., Pavel, M., Larimer, N., & Mullen, P. (2007). Lieberman, M. A. (1988). The role of self-help groups in
A general architecture for computer based health helping patients and families cope with cancer. CA:
coaching. Paper presented at the international con- a Cancer Journal for Clinicians, 38(3), 162–168.
ference on technology & aging. Lorig, K. R., Ritter, P., Stewart, A. L., Sobel, D. S.,
Jimison, H., Gorman, P., Woods, S., Nygren, P., Walker, Brown, B. W., Jr., Bandura, A., et al. (2001). Chronic
M., Norris, S., & Hersh, W. (2008). Barriers and disease self-management program: 2-year health
drivers of health information technology use for the status and health care utilization outcomes. Medical
elderly, chronically ill, and underserved. Evidence Care, 1217–1223.
Report Technology Assessment (Full Report), 175, Lye, C. T., Forman, H. P., Daniel, J. G., & Krumholz,
1–1422. H. M. (2018). The 21st Century Cures Act and elec-
Jimison, H. B., McKanna, J., Ambert, K., Hagler, S., tronic health records one year later: Will patients see
Hatt, W. J., & Pavel, M. (2010). Models of cognitive the benefits? Journal of the American Medical
performance based on home monitoring data. Paper Informatics Association, 25(9), 1218–1220. https://
presented at the Engineering in Medicine and doi.org/10.1093/jamia/ocy065.
Biology Society (EMBC), 2010 annual international Marcolino, M. S., Oliveira, J. A. Q., D’Agostino, M.,
conference of the IEEE. Ribeiro, A. L., Alkmim, M. B. M., & Novillo-Ortiz,
Jimison, H. B., Hagler, S., Kurillo, G., Bajcsy, R., & D. (2018). The impact of mHealth interventions:
Pavel, M. (2015). Remote health coaching for interac- Systematic review of systematic reviews. JMIR
tive exercise with older adults in a home environment. mHealth and uHealth, 6(1), e23. https://doi.
Paper presented at the Engineering in Medicine and org/10.2196/mhealth.8873.
Biology Society (EMBC), 2015 37th annual interna- Masys, D. R., & Baker, D. B. (1997). Patient-Centered
tional conference of the IEEE. Access to Secure Systems Online (PCASSO): A
Johnson, K. B., Ravert, R. D., & Everton, A. (2001). secure approach to clinical data access via the World
Hopkins Teen Central: Assessment of an internet- Wide Web. In Proceedings: a conference of the
based support system for children with cystic fibro- American Medical Informatics Association/ ...
sis. Pediatrics, 107(2), e24-e24. AMIA annual fall symposium. AMIA fall symposium
Juengst, E. T., Flatt, M. A., & Settersten, R. A., Jr. (pp. 340–343). Retrieved from http://www.ncbi.nlm.
(2012). Personalized genomic medicine and the rhet- nih.gov/pubmed/9357644
oric of empowerment. The Hastings Center Report, Matthew-Maich, N., Harris, L., Ploeg, J., Markle-Reid,
42(5), 34–40. https://doi.org/10.1002/hast.65. M., Valaitis, R., Ibrahim, S., et al. (2016). Designing,
Kawano, K., Suzuki, M., & Araki, K. (2012). Use of implementing, and evaluating mobile health tech-
digital patient photographs and electronic medical nologies for managing chronic conditions in older
record data as diagnostic tools in Japan. Journal of adults: A scoping review. JMIR mHealth and
Medical Systems, 36(5), 3321–3326. https://doi. uHealth, 4(2), e29. https://doi.org/10.2196/
org/10.1007/s10916-012-9824-4. mhealth.5127.
388 R. M. Cronin et al.
McCormack, L., Haun, J., Sorensen, K., & Valerio, M. pregnancy: Systematic review. JMIR mHealth and
(2013). Recommendations for advancing health lit- uHealth, 6(4), e109. https://doi.org/10.2196/
eracy measurement. Journal of Health mhealth.8834.
Communication, 18(Suppl 1), 9–14. https://doi.org/1 Patel, A., Schieble, T., Davidson, M., Tran, M. C.,
0.1080/10810730.2013.829892. Schoenberg, C., Delphin, E., & Bennett, H. (2006).
Moorhead, S. A., Hazlett, D. E., Harrison, L., Carroll, Distraction with a hand-held video game reduces
J. K., Irwin, A., & Hoving, C. (2013). A new dimen- pediatric preoperative anxiety. Pediatric Anesthesia,
sion of health care: Systematic review of the uses, 16(10), 1019–1027.
benefits, and limitations of social media for health Pavel, M., Jimison, H. B., Korhonen, I., Gordon, C. M.,
communication. Journal of Medical Internet & Saranummi, N. (2015). Behavioral informatics
Research, 15(4), e85. https://doi.org/10.2196/ and computational modeling in support of proac-
jmir.1933. tive health management and care. IEEE Transactions
Morgan, S. (2013). Miscommunication between patients on Biomedical Engineering, 62(12), 2763–2775.
and general practitioners: Implications for clinical Pellegrini, C. A., Conroy, D. E., Phillips, S. M.,
practice. Journal of Primary Health Care, 5(2), 123– Pfammatter, A. F., McFadden, H. G., & Spring, B.
128. Retrieved from https://www.ncbi.nlm.nih.gov/ (2018). Daily and seasonal influences on dietary self-
pubmed/23748393 monitoring using a smartphone application. Journal
Mossberger, K., Tolbert, C. J., & Gilbert, M. (2006). of Nutrition Education and Behavior, 50(1), 56–61
Race, place, and information technology. Urban e51. https://doi.org/10.1016/j.jneb.2016.12.004.
Affairs Review, 41(5), 583–620. Prey, J. E., Woollen, J., Wilcox, L., Sackeim, A. D.,
Nazi, K. M., & Woods, S. S. (2008). MyHealtheVet Hripcsak, G., Bakken, S., et al. (2014). Patient
PHR: A description of users and patient portal use. engagement in the inpatient setting: A systematic
In AMIA annual symposium proceedings (p. 1182). review. Journal of the American Medical Informatics
Retrieved from https://www.ncbi.nlm.nih.gov/ Association, 21(4), 742–750. https://doi.org/10.1136/
pubmed/18999142 amiajnl-2013-002141.
Neter, E., & Brainin, E. (2012). eHealth literacy: Provost, F., & Fawcett, T. (2013). Data science and its
Extending the digital divide to the realm of health relationship to big data and data-driven decision
information. Journal of Medical Internet Research, making. Big Data, 1(1), 51–59. https://doi.
14(1), e19. org/10.1089/big.2013.1508.
Ethics in Biomedical
and Health Informatics:
Users, Standards,
and Outcomes
Kenneth W. Goodman and Randolph A. Miller
Contents
References – 421
Ethics in Biomedical and Health Informatics: Users, Standards, and Outcomes
393 12
nnLearning Objectives an important obligation to explore the moral
After reading this chapter, you should know underpinnings, ethical challenges and social
the answers to these questions: issues related to their research and practice.
55 Why is ethics important to informatics? Ethical questions in medicine, nursing,
55 What are the leading ethical issues that human subjects research, psychology, social
arise in health care informatics? work, and affiliated fields continue to evolve
55 What are examples of appropriate and and increase in number; nevertheless, the key
inappropriate uses and users for health- issues are generally well known. Major ques-
related software? tions in general bioethics have long been
55 Why does the establishment of stan- addressed in numerous professional, schol-
dards touch on ethical issues? arly, and educational contexts. Ethical issues
55 Why does system evaluation involve in health informatics are, for the most part,
ethical issues? less familiar, even though certain of them
55 What challenges does informatics pose have received attention for decades (Szolovits
for patient and provider confidentiality? and Pauker 1979; Miller et al. 1985a; de
55 How can the tension between the obli- Dombal 1987). Indeed, informatics now con-
gation to protect confidentiality and stitutes a source of some of the most impor-
that to share data be minimized? tant and interesting ethical debates in all the
55 How might computational health care health professions. It has even been suggested
alter the traditional provider–patient that biomedical informatics raises so many
relationship? such issues it could itself be used as the basis
55 What ethical issues arise at the intersec- for a bioethics curriculum (Goodman 2017).
tion of informatics and managed care? People often assume that the confidential-
55 What are the leading ethical and legal ity of electronically stored patient informa-
issues in the debate over governmental tion is the most important ethical issue in
regulation of health care computing informatics. Although confidentiality and pri-
tools? vacy are indeed of vital interest and signifi-
cant concern, the field is rich with other ethical
issues, including the appropriate selection and
12.1 thical Issues in Biomedical
E use of informatics tools in clinical settings; the
and Health Informatics determination of who should use such tools;
the role of system evaluation; the obligations
»» More and more the tendency is towards the of system developers, maintainers, and ven-
dors; the appropriate standards for interact-
use of mechanical aids to diagnosis; never-
theless, the five senses of the doctor do still, ing with industry; and the use of computers to
and must always, play the preponderating track clinical outcomes to guide future prac-
part in the examination of the sick patient. tice. In addition, informatics engenders many
Careful observation can never be replaced important legal and regulatory questions.
by the tests of the laboratory. The good phy- To consider ethical issues in health care
sician now or in the future will never be a informatics is to explore a significant intersec-
diagnostic robot. – Scottish surgeon Sir tion among several professions—health care
William Arbuthnot-Lane (Lane 1936) informatics per se, health care delivery and
administration, applied computing and sys-
Human values should govern research and tems engineering, and ethics—each of which
practice in the health professions. Health constitutes a vast field of inquiry. Fortunately,
informatics, like other health professions, growing interest in bioethics and computation-
encompasses issues of appropriate and inap- related ethics has produced a starting point
propriate behavior, of honorable and disrepu- for such exploration. An initial ensemble of
table actions and intentions, of right and guiding principles, or ethical criteria, has
wrong. Students and practitioners of the emerged to orient decision making in health
health sciences, including informatics, share care informatics. These criteria are of practi-
394 K. W. Goodman and R. A. Miller
cal utility to health informatics, and often determine the appropriate level of caution.
have broader implications for all of biomedi- For instance, there is considerable evidence
cal informatics. that electronic laboratory information sys-
tems improve access to clinical data when
compared with manual, paper-based test-
12.2 Health-Informatics result distribution methods. To the extent that
such systems improve care at an acceptable
Applications: Appropriate
cost in time and money, there is an obligation
Use, Users, and Contexts to use computers to store and retrieve clinical
laboratory results. There is a small but grow-
Application of computer-based technologies ing body of evidence that existing clinical
in the health professions can build on previ- expert systems can improve patient care in a
ous experience in adopting other devices, small number of practice environments at an
tools, and methods. Before clinicians perform acceptable cost in time and money (Kuperman
most health-related interventions (e.g., diag- and Gibson 2003). Nevertheless, such systems
nostic testing, prescription of medications, cannot yet uniformly improve care in typical,
surgical and other therapeutic procedures), more general practice settings, at least not
they generally evaluate appropriate evidence, without careful attention to the full range of
standards, available technologies, presupposi- managerial as well as technical issues affecting
tions, and values. Indeed, the very evolution the particular care delivery setting in which
of the health professions entails the evolution they are used (Kaplan and Harris-Salamone
of evidence, of standards, of available tech- 2009; Holroyd-Leduc et al. 2011; Shih et al.
nologies, of presuppositions, and of values. 2011).
To answer the clinical question, “What Clinical expert systems (see 7 Chap. 24)
should be done in this case?” one must pay attempt to provide decision support for diag-
attention to a number of subsidiary questions, nosis, therapy, and/or prognosis in a more
such as: detailed and sophisticated manner than do
12 1. What is the problem? simple reminder systems (Duda and Shortliffe
2. What resources are available and what am 1983). A necessary adjunction of expert sys-
I competent to do? tems – creation and maintenance of their
3. What will maintain or improve this related knowledge bases – still involves
patient’s care? leading-edge research and development.
4. What will otherwise produce the most Humans for the most part remain superior to
desirable results (e.g., in public health)? electronic systems in understanding patients
5. How strong are my beliefs in the accuracy and their problems, in efficiently interacting
of my answers to questions 1 through 4? with patients to ascertain pertinent past his-
tory and current symptoms across the spec-
Similar considerations determine the appro- trum of clinical practice, in the interpretation
priate use of informatics tools. and representation of data, and in clinical
synthesis. Humans might in the future how-
ever not hold the upper hand in these tasks,
12.2.1 The Standard View and claims of their superiority must continu-
of Appropriate Use ally be tested empirically (Blois 1980).
What has been called the “standard view”
Excitement and enthusiasm often accompany of computer-assisted clinical diagnosis (Miller
initial use of new tools in clinical settings. 1990a; cf. Friedman 2009) holds in part that
Negative emotions are also common (Sittig human cognitive processes, being more suited
et al. 2005). Based on the uncertainties that to the complex task of diagnosis than machine
surround any new technology, scientific evi- intelligence, should not be overridden or
dence counsels caution and prudence. As in trumped by computers. The standard view
other clinical areas, evidence and reason states that when adequate (and even exem-
Ethics in Biomedical and Health Informatics: Users, Standards, and Outcomes
395 12
plary) decision-support tools are developed, from it shape the obligations of practitioners.
they should be viewed and used as supple- In computer-software use, as in all other areas
mentary and subservient to human clinical of clinical practice, good intentions alone are
judgment: They support decisions by human insufficient to insulate recklessness from cul-
beings; they do not make decisions. Progress pability. Thus, the standard view may be seen
should be measured in terms of whether clini- as a tool for both error avoidance and ethi-
cians using a CDS tool perform better on spe- cally optimized action.
cific tasks than the same clinicians without the Ethical software use, then, should be eval-
tool (Miller 1990a; cf. Friedman 2009). These uated against a broad background of evidence
tools should assume subservient roles because for actions that produce favorable outcomes.
the clinician caring for the patient knows and Because informatics is a science in ongoing
understands the patient’s situation and can ferment, system improvements and evidence
make compassionate judgments better than of such improvements are constantly emerg-
computer programs. Furthermore, clinicians, ing. Clinicians have an obligation to be famil-
and not machine algorithms, are the entities iar with this evidence after attaining minimal
which the state licenses, and specialty boards acceptable levels of familiarity with informat-
accredit, to practice medicine, surgery, nurs- ics in general and with the clinical systems
ing, pharmacy, and other health-related they use in particular (. Fig. 12.1).
activities.
Corollaries of the standard view are that
(1) practitioners have an obligation to use any
computer-based tool responsibly, through 12.2.2 Appropriate Users
adequate user training and by developing an and Educational Standards
understanding of the system’s abilities and
limitations; and (2) practitioners must not Efficient and effective use of health care infor-
abrogate their clinical judgment reflexively matics systems requires prior system evalua-
when using computer-based decision aids. tions demonstrating utility, education and
The skills required for diagnosis are in training of new users, monitoring of experi-
many respects different from those required ence, and appropriate, timely updating.
for the acquisition, storage, and retrieval of Indeed, such requirements resemble those for
laboratory data. There is no contradiction in other tools used in health care and in other
urging extensive use of efficient, non-burden- domains. Inadequate preparation in the use
some laboratory information systems, and, of tools is an invitation to catastrophe. When
for the time being, cautious deployment of the stakes are high and the domain large and
expert diagnostic decision-support tools (i.e., complex—as is the case in the health profes-
not permitting their use in settings in which sions—education and training take on moral
knowledgeable clinicians cannot immediately significance.
override faulty advice). Nevertheless, U.S. Who should use a health care-related com-
policy under the HITECH act of 2009 (as dis- puter application? Consider expert decision-
cussed in 7 Chap. 29), led to widespread support systems as an example. An early
adoption of less-than ideal electronic health paper on ethical issues in informatics noted
record systems. In many settings, those sys- that potential users of such systems include
tems engendered less efficient, burdensome, physicians, nurses, physicians’ assistants,
error-prone care delivery and physician burn- paramedical personnel, students of the health
out (cf. Halamka and Tripathi 2017). sciences, patients, and insurance and govern-
More over, the standard view addresses a ment evaluators (Miller et al. 1985a). Are
key aspect of the question, “How and when members of all these groups appropriate
should computers be used in clinical prac- users? One cannot answer the question until
tice?” by capturing important moral intuitions one precisely specifies the intended use for the
about error avoidance and evolving standards. system (i.e., the particular clinical questions
Error avoidance and the benefits that follow the system will address). The appropriate level
396 K. W. Goodman and R. A. Miller
.. Fig. 12.1 The U.S. Department of Veterans Affairs demonstrates some of the system’s functions and utili-
in the 1970s developed the highly regarded “Veterans ties. (Credit: Courtesy of U.S. Department of Veterans
12 Health Information Systems and Technology Architec- Affairs, Veterans Health Administration Office of Infor-
ture” (VistA), once the largest electronic health record matics and Analytics)
system in the United States. This fictitious screen shot
of training must be correlated with the ques- Benjamin Spock’s 1950s era print-based child-
tion at hand. At one end of an appropriate- care primer, one might condone system use by
use spectrum, we can posit that medical and laypersons. There are additional legal con-
nursing students should employ decision- cerns related to negligence and product liabil-
support systems for educational purposes; ity, however, when health-related products are
this assertion is relatively free of controversy sold directly to patients rather than to licensed
once it has been verified that such tools con- practitioners, and when such products give
vey accurately a sufficient quantity and qual- patient-specific counsel rather than general
ity of educational content. But it is less clear clinical advice (Miller et al. 1985a).
that patients, administrators, or insurance Suitable use of a software program that
company gatekeepers, for example, should use helps a user to suggest diagnoses, to select
expert decision-support systems for assistance therapies, or to render prognoses must be
in making diagnoses, in selecting therapies, or plotted against an array of goals and best
in evaluating the appropriateness of health practices for achieving those goals, including
professionals’ actions or determining their consideration of the characteristics and
reimbursement. To the extent that some sys- requirements of individual patients. For
tems present general medical advice in gener- example, the multiple, interconnected inferen-
ally understandable but sufficiently nuanced tial strategies required for arriving at an accu-
formats, such as once was the case with Dr. rate diagnosis depend on knowledge of facts;
Ethics in Biomedical and Health Informatics: Users, Standards, and Outcomes
397 12
experience with procedures; and familiarity 3. All uses of informatics tools, especially
with human behavior, motivation, and values. inpatient care, should be preceded by ade-
Diagnosis is a process rather than an event quate training and instruction, which
(Miller 1990a), so even well-validated diag- should include review of applicable prod-
nostic systems must be used appropriately in uct evaluations.
the overall context of patient care.
To use a diagnostic decision-support sys- Such principles and claims should be viewed
tem (7 Chap. 24), a clinician must be able to as analogous to other standards or rules in
recognize when the computer program has clinical medicine and nursing.
erred, and, when it is accurate, what the out-
put means and how it should be interpreted.
This ability requires knowledge of both the 12.2.3 Obligations and Standards
diagnostic sciences and the software applica- for System Developers
tions, and the strengths and limitations of
each. After assigning a diagnostic label, the
and Maintainers
clinician must communicate the diagnosis,
Users of clinical programs must rely on the
prognosis, and implications to a patient, and
work of other people who are often far
must do so in ways both appropriate to the
removed from the context of use. As with all
patient’s educational background and condu-
complex technologies, users depend on the
cive to future treatment goals. It is not
developers and maintainers of a system and
enough to be able to tell patients that they
must trust evaluators who have validated a
have cancer, human immunodeficiency virus
system for clinical use. Health care software
(HIV), diabetes, or heart disease and then
applications are among the most complex
simply hand over a prescription. The care
tools in the technological armamentarium.
provider must also offer context when avail-
Although this complexity imposes certain
able, comfort when needed, and hope as
obligations on end users, it also commits a
appropriate. For instance, the reason many
system’s developers, designers, and maintain-
organizations have required counseling both
ers to adhere to reasonable standards and,
before and after HIV and genetic testing is
indeed, to acknowledge their moral responsi-
not to vex busy health professionals but to
bility for doing so.
ensure that comprehensive, high-quality care,
rather than mere diagnostic labeling, is deliv- 12.2.3.1 Ethics, Standards,
ered. and Scientific Progress
This discussion points to the following set
The very idea of a standard of care embodies
of ethical principles for appropriate use of
a number of complex assumptions linking
decision-support systems:
ethics, evidence, outcomes, and professional
1. A computer program should be used in
training. To say that nurses or physicians must
clinical practice only after appropriate
adhere to a standard is to say, in part, that
evaluation of its efficacy and the docu-
they ought not stray from procedures previ-
mentation that it performs its intended
ously shown or generally believed to work bet-
task at an acceptable cost in time and
ter than alternatives. The difficulty lies in how
money.
to determine if a procedure or device “works
2. Users of most clinical systems should be
better” than another. Such determinations in
health professionals who are qualified to
the health sciences constitute progress, and
address the question at hand on the basis
provide evidence that we now know more.
of their licensure, clinical training, and
Criteria for weighing such evidence, albeit
experience. Software systems should be
short of proof in most cases, are applied. For
used to augment or supplement, rather
example, evidence from well-designed ran-
than to replace or supplant, such individu-
domized controlled trials merits greater trust
als’ decision making.
than evidence derived from uncontrolled ret-
398 K. W. Goodman and R. A. Miller
rospective studies (see 7 Chap. 13). Typically, their diligence, creativity, and effort. Rather, it
verification by independent investigators must implies that no amount of financial benefit for
occur before placing the most recent study a designer or builder can counterbalance bad
results into common practice. outcomes or ill consequences that result from
People who develop, maintain, and sell recklessness, avarice, or inattention to the
health care computing systems and their com- needs of clinicians and their patients.
ponents have obligations that parallel those of Purchasers and users should require demon-
system users. These obligations include hold- strations that systems are worthy of such trust
ing patient care as the foremost value. The and reliance before placing patients at risk,
duty to limit or prevent harm to patients and that safeguards (human and mechanical)
applies to system developers as well as to are in place to detect, alert, and rectify situa-
practitioners. Although this principle is easy tions in which systems underperform.
to suggest and, generally, to defend, it invites Quality standards should stimulate scien-
subtle, and sometimes overt, resistance from tific progress and innovation while safeguard-
people for whom profit or fame are primary ing against system error and abuse. These goals
motivators. (This is of course also true for might seem incompatible, but they are not. Let
other medical devices, processes and indus- us postulate a standard that requires timely
tries.) To be sure, quests for fame and fortune updating and testing of knowledge bases that
often produce good outcomes and improved are used by decision-support systems. To the
care, at least eventually. Even so, some extent that database accuracy is needed to
approaches fail to take into account the role maximize the accuracy of inferential engines,
of intention as a moral criterion (cf. Goodman it is trivially clear how such a standard will
et al. 2010). help to prevent or reduce decision- support
In medicine, nursing, and psychology, a mistakes. Furthermore, the standard should be
number of models of the professional–patient seen to foster progress and innovation in the
relationship place trust and advocacy at the same way that any insistence on best possible
apex of a hierarchy of values. Such a stance accuracy helps to protect scientists and clini-
12 cannot be maintained if goals and intentions cians from pursuing false leads, or wasting
other than patient well-being are (generally) time in testing poorly wrought hypotheses. It
assigned primacy. The same principles apply will not do for database maintainers to insist
to those who produce and attend to health that they are busy doing the more productive
care information systems. Because these sys- or scientifically stimulating work of improving
tems are health care systems—and are not knowledge representation, say, or database
devices for accounting, entertainment, real design. Although such tasks are important,
estate, and so on—and because system under- they do not supplant the tasks of updating and
performance can cause pain, disability, illness, testing tools in their current configuration or
and death, it is essential that the threads of structure. Put differently, scientific and techni-
trust run throughout the fabric of clinical sys- cal standards are perfectly able to stimulate
tem design and maintenance. progress while taking a cautious or even con-
System purchasers, users, and patients servative stance toward permissible risk in
must rely upon developers and maintainers to patient care.
recognize the potentially grave consequences This approach has been described as pro-
of errors or carelessness, trust them to care gressive caution: “Medical informatics is, hap-
about the uses to which the systems will be pily, here to stay, but users and society have
put, and rely upon them to value the reduced extensive responsibilities to ensure that we use
suffering of other people at least as much as our tools appropriately. This might cause us
they value their own personal gain. This reli- to move more deliberately or slowly than
ance emphatically does not entail that system some would like” (Goodman 1998).
designers and maintainers are blameworthy or A more recent concern, with both ethical
unethical if they hope and strive to profit from and legal implications, is the responsibility of
software developers to design and implement
Ethics in Biomedical and Health Informatics: Users, Standards, and Outcomes
399 12
software programs that cannot easily be 6. How well have individuals been trained to
hacked by malicious code writers. This con- use it?
cern goes beyond privacy and confidentiality 7. What are the anticipated long-term effects
issues (discussed below), and includes the pos- on how organizational units interact?
sibility that medical devices with embedded 8. What are the long-term effects on the
software might be nefariously “repro- delivery of medical care?
grammed” in a manner that might cause harm 9. Will the system have an impact on control
to patients (see, for example, Pugh et al. 2018 in the organization?
and Sackner-Bernstein 2017). A more detailed 10. To what extent do effects depend on prac-
discussion of this topic appears under the tice setting?
7 Sect. 12.5 below.
Another way to make this important point is
12.2.3.2 System Evaluation by emphasizing that people use computer sys-
as an Ethical Imperative tems. Even the finest system might be misused,
Any move toward “best practices” in biomedi- misunderstood, or mistakenly allowed to alter
cal informatics is shallow and feckless if it or erode previously productive human rela-
does not include a way to measure whether a tionships. Evaluation of health information
system performs as intended. This and related systems in their contexts of use should be taken
measurements provide the ground for quality as a moral imperative. Such evaluations require
control and, as such, are the obligations of consideration of a broader conceptualization
system developers, maintainers, users, admin- of “what works best” and must look toward
istrators, and perhaps other players (see improving the overall health care delivery sys-
7 Chap. 13). tem rather than only that system’s technologi-
cally based components. These higher goals
»» Medical computing is not merely about
entail the creation of a corresponding mecha-
medicine or computing. It is about the
nism for ensuring institutional oversight and
introduction of new tools into environments
responsibility (Miller and Gardner 1997a, b).
with established social norms and practices.
The effects of computing systems in health
care are subject to analysis not only of accu-
12.3 Privacy, Confidentiality,
racy and performance but of acceptance by
users, of consequences for social and profes- and Data Sharing
sional interaction, and of the context of use.
We suggest that system evaluation can illu- Some of the greatest challenges of the
minate social and ethical issues in medical Information Age arise from placing computer
computing, and in so doing improve patient applications in health care settings while
care. That being the case, there is an ethical upholding traditional principles and values.
imperative for such evaluation (Anderson One challenge involves balancing two com-
and Aydin 1998). peting values: (1) free access to information,
and (2) protection of patients’ privacy and
To give a flavor of how a comprehensive confidentiality.
evaluation program can ethically optimize Only computers can efficiently manage the
implementation and use of an informatics now-vast amount of information generated
system, consider these ten criteria for system during clinical encounters and other health
scrutiny (Anderson and Aydin 1994): care transactions (see 7 Chap. 2); at least in
1. Does the system work as designed? principle, such information should be easily
2. Is it used as anticipated? available to health professionals and others
3. Does it produce the desired results? involved in the administration of the care-
4. Does it work better than the procedures it delivery system, so that they can provide effec-
replaced? tive, efficient care for patients. Yet, making
5. Is it cost effective? this information readily available creates
400 K. W. Goodman and R. A. Miller
greater opportunities for inappropriate access. privacy is violated; if someone sneaks into the
Such access may be available to curious health clinic without observing you in person and
care workers who do not need the information looks at your health care record, your record’s
to fulfill job-related responsibilities, and, even confidentiality is breached. In discussions of
more worrisome, to other people who might the electronic health record, the term privacy
use the information to harm patients physi- may also refer to individuals’ desire to restrict
cally, emotionally, or financially. Clinical sys- the disclosure of personal data (National
tem administrators must balance the goals of Research Council 1997).
protecting confidentiality by restricting use of There are several important reasons to pro-
computer systems and improving care by tect privacy and confidentiality. One is that pri-
assuring the integrity and availability of data. vacy and confidentiality are widely regarded as
These objectives are not incompatible, but rights of all people, and such protections help
there are trade-offs that cannot be avoided. to accord them respect. On this account, peo-
ple do not need to provide a justification for
limiting access to their identifiable health data;
12.3.1 Foundations of Health privacy and confidentiality are entitlements
Privacy and Confidentiality that a person does not need to earn, to argue
for, or to defend. Another reason is more prac-
Privacy and confidentiality are necessary for tical: protecting privacy and confidentiality
people to evolve and mature as individuals, to benefits both individuals and society. Patients
form relationships, and to serve as function- who know that their identifiable health care
ing members of society. Imagine what would information will not be shared inappropriately
happen if the local newspaper or gossip blog are more comfortable disclosing that informa-
produced a daily report detailing everyone’s tion to clinicians. This trust is vital for the suc-
actions, meetings, and conversations. It is not cessful physician–patient, nurse–patient, or
that most people have terrible secrets to hide psychologist-patient relationship, and it helps
but rather that the concepts of solitude, inti- practitioners to do their jobs. This insight is as
12 macy, and the desire to be left alone make no old as the Hippocratic corpus.
sense without the expectation that at least Privacy and confidentiality protections
some of our actions and utterances will be also benefit public health. People who fear
kept private or held in confidence among a disclosure of personal information are less
limited set of persons. likely to seek out professional assistance,
The “average” sentiment about the appro- increasing the risks that contagion will be
priate sphere of private vs. public may vary spread and maladies will go untreated. In
considerably from culture to culture, and even addition, people still suffer discrimination,
from generation to generation within any par- bias, and stigma when certain health data do
ticular culture; and it may differ widely among fall into the wrong hands. Financial harm
persons within a culture or generation, and may occur if insurers are given unlimited
evolve for any particular person over a life- access to family members’ records, or access
time. Even the “born digital” generation, for to patient data, because some insurers might
which social media are a fixture of everyday be tempted to increase the price of insurance
life, has – and ought to have –its boundaries for individuals at higher risk of illness or dis-
(Palfrey and Gasser 2010). criminate in other ways if such price differen-
The terms privacy and confidentiality are tiation were forbidden by law. This is, in the
not synonymous. As commonly used, “pri- United States, among the reasons the Patient
vacy” generally applies to people, including Protection and Affordable Care Act of 2010
their desire not to suffer eavesdropping, (U.S. Public Law 111-148), in prohibiting
whereas “confidentiality” is best applied to insurers from discrimination based on “pre-
information. One way to think of the differ- existing conditions,” was so important – and
ence is as follows. If someone follows you and why subsequent efforts on ideological grounds
spies on you entering an AIDS clinic, your to overturn the act are dangerously erosive.
Ethics in Biomedical and Health Informatics: Users, Standards, and Outcomes
401 12
The ancient idea that physicians should syndromic surveillance has been asserted as
hold health care information in confidence is necessary for adequate bioterrorism prepared-
therefore applicable whether the data are writ- ness, for earlier detection of naturally occur-
ten on paper or processed in silicon. The obli- ring disease outbreaks, and, most dramatically,
gations to protect privacy and to keep in the Coronavirus pandemic of 2020 (Ienca
confidences fall to system designers and main- and Vayena 2020; see also 7 Chap. 18).
tainers, to administrators, and, ultimately, to
the physicians, nurses, and others who elicit
the information in the first place. The upshot
for all of them is this: protection of privacy 12.3.2 Electronic Clinical
and confidentiality is not an option, a favor, and Research Data
or a helping hand offered to patients with
embarrassing health problems; it is a duty, Access to electronic patient records holds
regardless of the malady or the medium in extraordinary promise for clinicians and for
which information about it is stored. other people who need timely, accurate patient
Some sound clinical practice and public data (see 7 Chap. 14). Institutions that do not
health traditions run counter to the idea of yet deploy electronic health record systems
absolute confidentiality. When a patient is hos- have fallen behind; this may become blame-
pitalized, it is expected that all appropriate worthy. Failure to use such systems may also
(and no inappropriate) employees or affiliates disqualify institutions for reimbursements
of the institution—primary physicians, con- from public and private insurance, making it
sultants, nurses, therapists, and technicians— effectively an organizational death sentence.
will have access to the patient’s medical Conversely, systems that make it easy for clini-
records, when it is in the interest of the patient’s cians to access data also make it easier for
care to do so. In most communities of the people in general to access the data, and elec-
United States, the contacts of patients who tronic systems generally magnify number of
have active tuberculosis or certain sexually persons whose information becomes available
transmitted diseases are routinely identified when a system security breach occurs. Some
and contacted by public health officials so that would consider failure to prevent inappropri-
the contacts may receive proper medical atten- ate access as at least as blameworthy as failure
tion. Such disclosures serve the public interest to provide adequate and appropriate access.
and are and should be legal because they Nonetheless, there is no contradiction
decrease the likelihood that more widespread between the obligation to maintain a certain
harm to other individuals might occur through standard of care (in this case, regarding mini-
transmission of an infection unknowingly. mal levels of computer use) and ensuring that
A separate but important public health such a technical standard does not imperil the
consideration (discussed in more detail below) rights of patients. Threats to confidentiality
involves the ability of health care researchers and privacy are fairly well known. They
to anonymously pool data (i.e., pool by include economic abuses, or discrimination by
removing individual persons’ identifying third-party payers, employers, and others who
information) from patient cases that meet take advantage of the ever-burgeoning market
specified conditions to determine the natural in health data; insider abuse, or record snoop-
history of the disease and the effects of vari- ing by hospital or clinic workers who are not
ous treatments. Examples of benefits from directly involved in a patient’s care but exam-
such pooled data analyses range from the ine a record out of curiosity, for instance;
ongoing results generated by regional collab- identity theft for insurance or other forms of
orative chemotherapy trials to the discovery, financial fraud; and malevolent hackers, or
more than four decades ago, of the people who, via networks or other means,
appropriateness of shorter lengths of stay for copy, delete, or alter confidential informa-
patients with myocardial infarction (McNeer tion – or threaten to do so, a component of
et al. 1975). More recently, the need for robust “ransomware” (see, e.g., Sittig and Singh
402 K. W. Goodman and R. A. Miller
2016; and Slayton 2018). Moreover, wide- breaches are uniformly punished in a semi-
spread dissemination of information through- public manner.
out the health care system often occurs Technological efforts to improve health
without explicit patient consent. Health care system security have emerged as a kind of
providers, third-party payers, managers of sub-specialty in health informatics, with sys-
pharmaceutical benefits programs, equipment tem developers, computer scientists and oth-
suppliers, and oversight organizations collect ers working to improve confidentiality
large amounts of patient-identifiable health protections. This often entails both better fire-
information for use in managing care, con- walls against intrusion and software to pre-
ducting quality and utilization reviews, pro- vent re-identification of stored data with the
cessing claims, combating fraud, and individuals to whom the data apply (see, for
analyzing markets for health products and example, Malin and Goodman 2018).
services (National Research Council 1997).
The proper approach to such challenges is 12.3.2.2 Policy Approaches
one that will ensure both that appropriate cli- In its landmark report, the National Research
nicians and other people have rapid, easy Council (1997) recommended that hospitals
access to patient records and that others do and other health care organizations create
not have access. Is that a contradictory bur- security and confidentiality committees and
den? No. Is it easy to achieve both? No. There establish education and training programs.
are many ways to restrict inappropriate access These recommendations parallel an approach
to electronic records, but all come with a cost. that had worked well elsewhere in hospitals
Sometimes the cost is explicit, as when it for matters ranging from infection control to
comes in the form of additional security soft- bioethics. The U.S. Health Insurance
ware and hardware; sometimes it is implicit, Portability and Accountability Act (HIPAA)
as when procedures are required that increase requires the appointment of privacy and secu-
the time commitment by system users. rity officials, special policies, and the training
A well-established standard way to view of health care workforce members who have
12 the landscape of protective measures is to access to health information systems. The
divide it into technological methods and insti- European Union’s General Data Protection
tutional or policy approaches (Alpert 1998): Regulation (GDPR) requires new account-
ability and governance measures, standards
12.3.2.1 Technological Methods for access by people to data and information
Computer systems per se can optimize some about them, and rules for use of that data and
aspects of security. Typical systems verify information.
that users are who they claim to be (“authen- Such measures are all the more important
ticating”) with passwords, tokens or biomet- when health data are accessible through net-
rics. Other controls limit access to people works. The rapid growth of integrated delivery
with a professional “need to know.” Creating networks (IDNs) (see 7 Chap. 16) and Health
audit trails, or logs, to record who viewed Information Exchanges, for example, illus-
confidential records enables authorized facil- trate the need not to view health data as a well
ity administrators, automated security audit- into which one drops a bucket but rather as an
ing programs, and patients to later review irrigation system that makes its contents
who accessed what. Encryption can protect available over a broad—sometimes an
data in transit and at rest (in storage). These extremely broad—area. It is not yet clear
technical means are complemented by pro- whether privacy and confidentiality protec-
tecting the elements of the electronic infra- tions that are appropriate in hospitals will be
structure with physical barriers when fully effective in a ubiquitously networked
operations allow it. Auditing works best
environment, but it is a start. System develop-
when appropriately severe punishments are ers, users, and administrators are obliged to
widely known to be policy, and when policy identify appropriate measures in light of the
particular risks associated with a given imple-
Ethics in Biomedical and Health Informatics: Users, Standards, and Outcomes
403 12
mentation. There is no excuse for failing to mation can in combination serve as a data
make this a top priority throughout the data fingerprint that picks out an individual patient
storage and sharing environment. even though the patient’s name, Social
Security number, or other (official) unique
12.3.2.3 lectronic Data and Human
E identifiers have been removed from the record.
Subjects Research Challenges and opportunities related to de-
The use of patient information for clinical identifying and re-identifying data are among
research and for quality assessment raises the most interesting, difficult and important
interesting ethical challenges. The presump- in all health computing (Atreya et al. 2013;
tion of a right to confidentiality seems to Benitez and Malin 2010; Malin and Sweeney
include the idea that patient records are inex- 2004; Malin et al. 2011; Sweeney 1997;
tricably linked to patient names or to other Tamersoy et al. 2012).
identifying data. In an optimal environment, Such challenges point to a second means
then, patients can monitor who is looking at of balancing ethical goals in the context of
their records. But if all unique identifiers have database research: the use of institutional
been stripped from the records, is there any panels, such as medical record committees
sense in talking about confidentiality? or institutional review boards. Submission of
The benefits to public health loom large in database research to appropriate institutional
considering record-based research (7 Chap. scrutiny is one way to make the best use of
18). A valuable benefit of the electronic health more or less anonymous electronic patient
record is the ability to access vast numbers of data. Competent panel members should be
patient records to estimate the incidence and educated in the research potential of elec-
prevalence of various maladies, to track the tronic health records, as well as in ethical
efficacy of clinical interventions, and to plan issues in epidemiology and public health.
efficient resource allocation (see 7 Chap. 18). Scrutiny by such committees can also give
Such research and planning would, however, appropriate weight to competing ethical con-
impose onerous or intractable burdens if cerns in the context of internal research for
informed, or valid consent had to be obtained quality control, outcomes monitoring, and
from every patient whose record was repre- so on (Goodman 1998; Miller and Gardner
sented in the sample. Using confidentiality to 1997a, b).
impede or forbid such research fails to benefit
patients at the same time it sacrifices benefi- 12.3.2.4 Challenges in Bioinformatics
cial scientific investigations. Safeguards are increasingly likely to be chal-
A more practical course is to establish lenged as genetic information makes its way
safeguards that better balance the ethical obli- into the health care record (see 7 Chaps. 11
gations to privacy and confidentiality against and 28). The risks of bias, discrimination, and
the social goals of public health and systemic social stigma increase dramatically as genetic
efficiency. This balancing can be pursued via a data become available to clinicians and inves-
number of paths. The first is to establish tigators. Indeed, genetic information “goes
mechanisms to anonymize the information in beyond the ordinary varieties of medical
individual records or to decouple the data information in its predictive value” (Macklin
contained in the records from any unique 1992). Genetic data also may be valuable to
patient identifier. This task is not always people predicting outcomes, allocating
straightforward; it can be remarkably difficult resources, and the like. In addition, genetic
to anonymize data such that, when coupled data are rarely associated with only a single
with other data sets, the individuals are not at person; they may provide information about
risk of re-identification. A relatively rare relatives, including relatives who do not want
disease diagnosis coupled with demographic to know about their genetic risk factors or
data such as age and gender, or geographic potential maladies, as well as relatives who
data such as a postal code, may act as a sur- would love dearly to know more about their
rogate unique identifier; that is, detailed infor- kin’s genome. There is still much work to be
404 K. W. Goodman and R. A. Miller
done in sorting out and addressing the ethical 12.4 ocial Challenges and Ethical
S
issues related to electronic storage, sharing, Obligations
and retrieval of genetic data (Goodman 1996,
2016a). The expansion of evidence-based medicine
Bioinformatics or computational biology and, in the United States, of managed care
provides an exciting ensemble of new tools to (now sometimes called accountable care since
increase our knowledge of genetics, genetic the passage of health reform legislation in
diseases, and public health. Use of these tools 2010; see 7 Chap. 29) places a high premium
is accompanied by responsibilities to attend to on the tools of health informatics. The need
the ethical issues raised by new methods, for data on clinical outcomes is driven by a
applications, and consequences (Goodman number of important social and scientific fac-
and Cava 2008). Identifying and analyzing tors. Perhaps the most important among these
these issues are among the key tasks of those factors is the increasing unwillingness of gov-
who work at the intersection of ethics and ernments and insurers to pay for interventions
health information technology. The future of and therapies that do not work or that do not
genetics and genomics is utterly computa- work well enough to justify their cost.
tional, with data storage and analysis posing Health informatics helps clinicians, admin-
some of greatest financial and scientific chal- istrators, third-party payers, governments,
lenges. For instance: researchers, and other parties to collect, store,
55 How, to what extent, and by whom should retrieve, analyze, and scrutinize vast amounts
genomic databases be used for clinical or of data—though the task of documenting this
public health decision support? is itself a matter of research on what has come
55 Are special rules needed to govern the to be called “meaningful use.” The functions
study of information in digital genetic of health informatics might be undertaken
repositories (or are current human sub- not for the sake of any individual patient but
jects research protection rules adequate)? rather for cost analysis and review, quality
55 Does data mining software present new assessment, scientific research, and so forth.
12 challenges when applied to human genetic These functions are critical, and if computers
information? can improve their quality or accuracy, then so
55 What policies are required to guide and much the better.
inform the communication of patient- Challenges arise when intelligent applica-
specific and incidental findings? tions are mistaken for decision-making sur-
55 Are special protections and precautions rogates or when institutional or public policy
needed to address and transmit findings recommends or favors computer output over
about population subgroups? human cognition. This may be seen as a
question or issue arising under the rubric of
It might be that the tools and uses of compu- “appropriate uses and users.” That is, by
tational biology will eventually offer ethical whom, when, and under what constraints
challenges—and opportunities—as impor- may we elicit and invoke computational
tant, interesting and compelling as any tech- analysis in shaping or applying public policy?
nology in the history of the health sciences. The question whether an individual physi-
Significantly, this underscores the importance cian or multispecialty group, say, should be
of arguments to the effect that attention to hired or retained or reimbursed or rewarded
ethics must accompany attention to science. is information-intensive. The question that
Victories of health science research and devel- follows, however, is the key one: How should
opment will be undermined by any failures to the decision-making skills of human and
address corresponding ethical challenges. We machine be used, and balanced (cf. Glaser
must strive to identify, analyze, and resolve or 2010)?
mitigate important ethical issues.
Ethics in Biomedical and Health Informatics: Users, Standards, and Outcomes
405 12
12.4.1 Vendor Interactions While many or most contracts between
vendors and hospitals are confidential, it has
Motivated if not inspired by both technologi- been reported that some HIT vendors require
cal necessity and financial opportunity, hum- contract language that indemnifies system
ble private practices and sprawling medical developers for personal injury claims or mal-
centers have—or should have—begun the practice, even if the vendor is at fault; some
transition from a paper patient record to an vendors require system purchasers to agree
electronic one. The need to make such a tran- not to disclose system errors except to the ven-
sition is not in dispute: paper (and handwrit- dor (Koppel and Kreda 2009). Such provi-
ing) are hard to store, find, read and analyze. sions elicit concern to the extent they place or
Electronic Health Records (EHR) are not, or appear to place corporate interests ahead of
should not be. While there are important patient safety and welfare. In this case, a
debates about the speed of the transition and working group chartered by AMIA, the soci-
regarding software quality, usability and abil- ety for informatics professionals (see 7 Chap.
ity to protect patient safety, it is widely agreed 1), issued a report that provided guidance on
that the recording and storage of health infor- a number of vendor interaction issues
mation must be electronic. (Goodman et al. 2010). Importantly, the
Public policy has attempted to overcome working group comprised industry represen-
some of the reluctance to make the change tatives as well as scientists and other academ-
because of financial concerns. Notably, the ics. The group’s recommendations included
U.S. Health Information Technology for these:
Economic and Clinical Health (HITECH) 55 Contracts should not contain language
Act, a part of the American Recovery and that prevents system users, including clini-
Reinvestment Act of 2009 (Blumenthal 2010), cians and others, from using their best
authorized some $27 billion in incentives for judgment about what actions are neces-
EHR adoption. These incentives helped sary to protect patient safety. This includes
address but did not eliminate financial con- freedom to disclose system errors or flaws,
cerns in that they offset only some of the cost whether introduced or caused by the ven-
of converting to an e-system. Still, while a dor, the client, or a third party. Disclosures
number of companies had previously found made in good faith should not constitute
opportunity in developing hospital and other violations of HIT contracts. This recom-
clinical information systems, HITECH accel- mendation neither entails nor requires the
erated the pace (see 7 Chap. 29). disclosure of trade secrets or of intellec-
The firms that make and sell EHRs are not tual property.
regulated in the same way as those that manu- 55 Because vendors and their customers share
facture pharmaceutical products or medical responsibility for patient safety, contract
devices (see 7 Sect. 12.5.3). In an increasingly provisions should not attempt to circum-
competitive environment, this has led to con- vent fault and should recognize that both
troversy about the nature of vendor interac- vendors and purchasers share responsibil-
tions with the institutions that buy their ity for successful implementation. For
products. An EHR system for a mid-sized example, vendors should not be absolved
hospital can cost upwards of $100 million from harm resulting from system defects,
over time, including consulting services, hard- poor design or usability, or hard-to-detect
ware and training. It follows that it is reason- errors. Similarly, purchasers should not be
able to ask what values should guide such absolved from harm resulting from inade-
vendor interactions with clients, and whether quate training and education, inadequate
they should be similar to or different from val- resourcing, customization, or inappropri-
ues that govern other free-market dealings. ate use.
406 K. W. Goodman and R. A. Miller
While some of the debates that led to those Now suppose that most previous patients
conclusions were about political economy with a particular physiologic profile have died
(regulation vs. free enterprise) as much as eth- in critical-care units; this information might
ics (right vs. wrong), the opportunity for rap- be used to identify ways to improve care of
prochement in the service of a patient-centered such patients—or it might be used in support
approach may be seen as an affirmation of the of arguments to contain costs by denying care
utility of an applied ethics process in the evo- to subsequent patients fitting the profile (since
lution of health information technology. they are likely to die anyway).
An argument in support of such an appli-
cation might be that decisions to withdraw or
12.4.2 Computational Prognosis withhold care are often and customarily made
on the basis of subjective and fragmented evi-
Consider the utility of prognostic scoring sys- dence; so it is preferable to make such deci-
tems that use physiologic and mortality data sions on the basis of objective data of the sort
to compare new critical-care patients with that otherwise underlie sound clinical practice.
thousands of previous patients (Knaus et al. Such outcomes data are precisely what fuels
1991). Such systems allow hospitals to track the engines of managed care, wherein health
the performance of their critical-care units by, professionals and institutions compete on
say, comparing the previous year’s outcomes the basis of cost and outcomes. Why should
to this year’s or by comparing one hospital to society, or a managed-care organization, or
another. If, for instance, patients with a par- an insurance company pay for critical care
ticular profile tend to survive longer than their when seemingly objective evidence exists that
predecessors, then it might be inferred that such care will not be efficacious? Contrarily,
critical care has improved. Such scoring sys- consider the effect on future scientific insights
tems can be useful for internal research and of denying care to such patients. Scientific
for quality management (. Fig. 12.2). progress is often made by noticing that cer-
12
.. Fig. 12.2 “Severity adjusted daily data” in fictitious number of ethical issues. This image shows 10 CCU
APACHE® Outcomes screen shot. Using prognostic patients. For the second one in the leftmost column, for
scoring systems, clinicians in critical-care units can instance, the “acute physiology score” is 128; the risk of
monitor events and interventions and administrators hospital mortality is 96% and the risk of ICU mortality
can manage staffing based on patient acuity. Clinicians is 92%. (Credit: Courtesy of Cerner Corporation, with
can also use such systems to predict mortality, raising a permission)
Ethics in Biomedical and Health Informatics: Users, Standards, and Outcomes
407 12
tain patients do better under certain circum- 2. Decisions about whether to treat a given
stances, and investigation of such phenomena patient are often value laden and must be
leads to better treatments. If all patients meet- made relative to treatment goals. In other
ing certain criteria were denied therapy on the words, it might be that a treatment will
basis of a predictive tool, it would become improve the quality of life but not extend
a self-fulfilling prophecy for a much longer life, or vice versa (Youngner 1988). Whether
time that all such patients would not do well such treatment is appropriate cannot be
(Miller 1997). determined scientifically or statistically
Now consider use of a decision-support (Brody 1989). The decisions ultimately
system to evaluate, review, or challenge deci- depend on human preferences—those of
sions by human clinicians; indeed, imagine an the provider or, even more importantly, the
insurance company using a diagnostic expert patient.
system to determine whether a physician 3. Applying computational operations on
should be reimbursed for a particular proce- aggregate data to individual patients runs
dure. If the expert system has a track record the risk of including individuals in groups
for accuracy and reliability, and if the system they resemble but to which they do not
“disagrees” with the human’s diagnosis or actually belong. Of course, human clini-
treatment plan, then the insurance company cians run this risk all the time—the chal-
can contend that reimbursement for the pro- lenge of inferring correctly that an
cedure would be a mistake. Why pay for a pro- individual is a member of a set, group, or
cedure that is not indicated, at least according class is one of the oldest problems in logic
to a computational analysis? and in the philosophy of science. The point
In the two examples just offered (a prog- is that computers have not solved this
nostic scoring system is used to justify termi- problem, yet, and allowing policy to be
nation of treatment to conserve resources, guided by simple or unanalyzed correla-
and a diagnostic expert system is used to deny tions constitutes a conceptual error.
a physician reimbursement for procedures
deemed inappropriate), there seems to be jus- The idea is not that diagnostic or prognostic
tification for adhering to the computer out- computers are always wrong—we know that
put. There are, however, three reasons why it is they are not—but rather there are numerous
problematic to rely exclusively on clinical instances in which we do not know whether
computer programs to guide policy or prac- they are right. It is one thing to allow aggre-
tice in these ways: gate data to guide policy; doing so is just using
1. As we argued earlier with the standard scientific evidence to maximize good out-
view of computational diagnosis (and, by comes. But it is altogether different to require
easy extension, prognosis), human cogni- that a policy disallow individual clinical judg-
tion is, at least for a while longer, still supe- ment and expertise.
rior to machine intelligence. Moreover, the Informatics can contribute in many ways
act of rendering a diagnosis or prognosis is to health care reform. Indeed, computer-
not merely a statistical or computational based tools can help to illuminate ways to
operation performed on uninterpreted reduce costs, to optimize clinical outcomes,
data. Rather, identifying a malady and and to improve care. Scientific research, qual-
predicting its course requires understand- ity assessment, and the like are, for the most
ing a complex ensemble of causal rela- part, no longer possible without computers.
tions, interactions among a large number But it does not follow that the insights from
of variables, and having a store of salient such research apply in all instances to the
background knowledge—considerations myriad variety of actual clinical cases at which
that have thus far failed to be grasped, competent human clinicians excel.
assessed, and effectively blended into deci- The Coronavirus crisis of 2020 provided
sions made by computer programs. an opportunity to assess and review the use of
408 K. W. Goodman and R. A. Miller
prognostic scoring systems to inform or guide result. If reliance on computers impedes the
triage and rationing. Controversy inherent in abilities of health professionals to establish
resource allocation under conditions of scar- trust and to communicate compassionately,
city was magnified when decisions about ven- however, or further contributes to the dehu-
tilator allocation, for instance, were made manization of patients (Shortliffe 1993, 1994),
based on a prognostic score rendered by utili- then we may have paid too dearly for our use
ties resident in electronic health records of these machines.
(Truog et al. 2020). Although a strong case Suppose that a physician uses a decision-
can be made that such use was permissible support system to test a diagnostic hypothesis
during a crisis and in the absence of anything or to generate differential diagnoses, and sup-
better, an equally strong case must be made pose further that a decision to order a particu-
that this opportunity to assess and review the lar test or treatment is based on that system’s
use of prognostic scoring systems not be output. A physician who is not able to articu-
squandered. The in situ use of this informatics late the proper role of computational support
tool should be scrutinized and studied. in his decision to treat or test will risk alienat-
ing those patients who, for one reason or
another, will be disappointed, angered, or
12.4.3 Effects of Informatics confused by the use of computers in their
on Traditional Relationships care. To be sure, the physician might just with-
hold this information from patients, but such
Patients are often frightened and vulnerable. deception carries its own threats to trust in the
Treating illness, easing fear, and respecting relationship.
vulnerability are among the core obligations Patients are not completely ignorant about
of physicians, nurses, and other clinicians. the processes that constitute human decision
Health informatics has the potential to com- making. What they do understand, however,
plement these traditional duties and the rela- may be subverted when their doctors and
tionships that they entail. We have pointed nurses use machines to assist delicate cogni-
12 out that medical decisions are shaped by non- tive functions. We must ask whether patients
scientific considerations. This point is impor- should be told the accuracy rate of decision
tant when we assess the effects of informatics support automata—when they have yet to be
on human relationships. Thus: given comparable data for humans. Would
such knowledge improve the informed-
»» The practice of medicine or nursing is not consent process, or would it “constitute
exclusively and clearly scientific, statistical, another befuddling ratio that inspires doubt
or procedural, and hence is not, so far, com- more than it informs rationality?” (Miller and
putationally tractable. This is not to make a Goodman 1998).
hoary appeal to the “art and science” of To raise such questions is consistent with
medicine; it is to say that the science is in promoting the responsible use of computers
many contexts inadequate or inapplicable: in clinical practice. The question whether
Many clinical decisions are not exclusively computer use will alienate patients is an
medical—they have social, personal, ethi- empirical one; it is a question for which,
cal, psychological, financial, familial, legal, despite many initial studies, we lack conclu-
and other components; even art might play sive data to answer. (For example, we cannot
a role. (Miller and Goodman 1998) yet state definitively whether all categories of
patients will respond well to all specific types
12.4.3.1 Professional–Patient
of e-mail messages from their doctors.
Relationships Nevertheless, as a moral principle discussed
If computers, databases, and networks can above, one should not convey a new diagnosis
improve physician–patient or nurse–patient of a malignancy via email.) To address the
relationships, perhaps by improving commu- question now anticipates potential future
nication, then we shall have achieved a happy problems. We must ensure that the exciting
Ethics in Biomedical and Health Informatics: Users, Standards, and Outcomes
409 12
potential of health informatics is not sub- but there is a chance that someone who
verted by our forgetting that the practice of might benefit from seeing a physician will
medicine, nursing, and allied professions is not do so because of anecdotes and infor-
deeply human and fundamentally intimate mation otherwise attained. How should
and personal. this problem be addressed?
becoming more common to store the data on Traditional insurers and health care pro-
a Web site, where PHR data can also be linked viders are duty-bound by privacy laws and
to other health information relevant to the regulations to protect the information under
person. The PHR data can also be linked via their control. PHRs have a somewhat shakier
a portal to a health care provider institution’s set of protections given their relatively short
records, to allow updating in both directions, history. The legal obligations of institutions
or be free of any such tie. A flash drive can be that provide PHRs, but do not fully manage
forgotten or lost, whereas a Web site can be the content of those records nor their use, as
centrally updated and uniformly available via well as the obligations (if any) of the individu-
any properly authenticated device on the als who “manage” their own health records,
Internet. remain to be resolved (Cushman et al. 2010).
PHRs are now commonly linked to so-
called “personal health applications” (PHAs)
which provide ways of moving beyond simple
static storage of one’s medical history. Most
provide some sort of primitive decision sup-
port, if only in linking to additional informa-
tion about a particular disease or condition.
Others include more ambitious decision-
support functionality. All the concerns about
the accuracy of Web-based information recur
in this context, with concerns about the reli-
ability of decision support added to that.
Compounding concerns about accuracy are
the inherent limitations of the “owner-
.. Fig. 12.3 Project HealthDesign (Brennan et al. operator”: If it can be difficult for trained
2010) was a landmark program sponsored by the Robert
health care providers to evaluate the quality
12 Wood Johnson Foundation’s Pioneer Portfolio and
intended to foster development of personal health of advice rendered by a decision support sys-
records. Here is a barcode scanner that recognizes medi- tem, the challenges for patients will be com-
cation labels. Designed by researchers at the University mensurately greater.
of Colorado at Denver, the “Colorado Care Tablet” Traditional health care institutions may
allows elderly users to track prescriptions with such
see the PHR as a device for patient empower-
scanners and portable touch-screen tablets. (Credit:
Courtesy of Project HealthDesign; Creative Commons ment because it adds a way for persons to keep
Attribution 3.0 Unported License) track of their own data; but they can also be
chasers. Services are intangible activities pro- 1985a). Similarly, a patient might sue a practi-
vided to consumers at a price by (presumably) tioner who has not used a decision-support
qualified individuals. system when it can be shown that use of the
The practice of clinical medicine has been decision-support system is part of the current
deemed a service through well-established standard of care, and that use of the program
legal precedents. On the other hand, clinical might have prevented the clinical error that
software applications can be viewed as either occurred (Miller 1989). It is not clear whether
goods (“products”) (software programs the patients in such circumstances could also
designed, tested, debugged, placed on DVDs successfully sue the software manufacturers,
or other media, and distributed physically to as it is the responsibility of the licensed prac-
purchasers) or services (applications that titioner, and not of the software vendor, to
present data or provide advice to practitioners uphold the standard of care in the community
engaged in a service such as delivering health through exercising sound clinical judgment.
care). There are few legal precedents to deter- Based on a successful malpractice suit against
mine unequivocally how software will be a clinician who used a clinical software sys-
viewed by the courts, and it is possible that tem, it might be possible for the practitioner
clinical software programs will be treated as to sue the manufacturer or vendor for negli-
goods under some circumstances and as ser- gence in manufacturing a defective clinical
vices under others. It might be the case that software product, but cases of this sort have
that software purchased and running in a pri- not yet been filed. If there were such suits, it
vate office to handle patient records or billing might be difficult for a court to discriminate
would be deemed a product, but the same between instances of improper use of a blame-
software mounted on shared, centralized less system and proper use of a less-than-
computers and accessed over the Internet perfect system.
(and billed on a monthly basis) would be In contrast to negligence, strict product
offering a service. liability applies only to harm caused by defec-
Three ideas from tort law potentially apply tive products and is not applicable to services.
12 to the clinical use of software systems: The primary purpose of strict product liabil-
(1) Harm by intention—when a person ity is to compensate the injured parties rather
injures another using a product or service to than to deter or punish negligent individuals
cause the damage, (2) the negligence theory, (Miller et al. 1985a). For strict product liabil-
and (3) strict product liability (Miller et al. ity to apply, three conditions must be met:
1985a). Providers of goods and services are 1. The product must be purchased and used
expected to uphold the standards of the com- by an individual.
munity in producing goods and delivering 2. The purchaser must suffer physical harm
services. When individuals suffer harm due to as a result of a design or manufacturing
substandard goods or services, they may sue defect in the product.
the service providers or goods manufacturers 3. The product must be shown in court to be
to recover damages. Malpractice litigation in “unreasonably dangerous” in a manner
health care is based on negligence theory. that is the demonstrable cause of the pur-
Because the law views delivery of health chaser’s injury.
care as a service (provided by clinicians), it is
clear that negligence theory will provide the Note that negligence theory allows for adverse
minimum legal standard for clinicians who outcomes. Even when care is delivered in a
use software during the delivery of care. competent, caring, and compassionate man-
Patients who are harmed by clinical practices ner, some patients with some illnesses will not
based on imperfect software applications may do well. Negligence theory protects providers
sue the health care providers for negligence or from being held responsible for all individuals
malpractice, just as patients may sue attend- who suffer bad outcomes. As long as the qual-
ing physicians who rely on the imperfect ity of care has met the prevailing standards, a
advice of a human consultant (Miller et al. practitioner should not be found liable in a
Ethics in Biomedical and Health Informatics: Users, Standards, and Outcomes
413 12
malpractice case (Miller et al. 1985a). Strict Corresponding to potential strict product
product liability, on the other hand, is not as liability for faulty software embedded in med-
forgiving or understanding. ical devices is potential negligence liability if
No matter how good or exemplary a man- such software can easily be “hacked”
ufacturer’s designs and manufacturing pro- (Robertson 2011). Malicious code writers
cesses might be, if even one in ten million might mimic external software-based “radio”
products is defective, and that one product controllers for pacemakers and insulin pumps
defect is the cause of a purchaser’s injury, then and reprogram them to cause harm to patients.
the purchaser may collect damages (Miller While such “hackers” should face criminal
et al. 1985a). The plaintiff needs to show only prosecution if they cause harm by intention,
that the product was unreasonably dangerous the device manufacturers have a responsibility
and that its defect led to harm. In that sense, to make it difficult to change the software
the standard of care for strict product liability code embedded in devices without proper
is 100-percent perfection. To some extent, authorization.
appropriate product labeling (e.g., “Do not
use this metal ladder near electrical wiring”) 12.5.2.2 Privacy and Confidentiality
may protect manufacturers in certain strict The ethical basis for privacy and confidential-
product liability suits in that clear, visible ity in health care is discussed in 7 Sect. 12.3.1.
labeling may educate the purchaser to avoid For a long time, the legal state of affairs for
“unreasonably dangerous” circumstances. privacy and confidentiality of electronic
Appropriate labeling standards may similarly health records was chaotic (as it remains for
benefit users and manufacturers of clinical written records, to some extent). This state of
expert systems (Geissbuhler and Miller 1997). affairs in the U.S. had not significantly
Health care software programs sold to cli- changed in the three decades since it was
nicians who use them as decision-support described in a classic New England Journal of
tools in their practices are likely to be treated Medicine article (Curran et al. 1969).
under negligence theory as services. When However, a key U.S. law, the Health
advice-giving clinical programs are sold Insurance Portability and Accountability Act
directly to patients, however, and there is less (HIPAA), has prompted significant change.
opportunity for intervention by a licensed HIPAA’s privacy standards became effective
practitioner, it is more likely that the courts in 2003 for most health care entities, and its
will treat them as products, using strict prod- security standards followed 2 years later; a
uct liability, because the purchaser of the pro- breach-notification rule was expanded in 2010
gram is more likely to be the individual who is and HITECH provisions were incorporated in
injured if the product is defective. (As per- 2013. A major impetus for the law was that
sonal health records become more common, the process of “administrative simplification”
this legal theory may well be tested.) via electronic recordkeeping, prized for its
A growing number of software “bugs” in potential to increase efficiency and reduce
medical devices have been reported to cause costs, would also pose threats to patient pri-
injury to patients (Majchrowski 2010; Levis vacy and confidentiality. Coming against a
2014). The U.S. Food and Drug Administration backdrop of a variety of noteworthy cases in
(FDA) has traditionally viewed software which patient data were improperly—and
embedded within medical devices, such as car- often embarrassingly—disclosed, the law was
diac pacemakers and implantable insulin also seen as a badly needed tool to restore
pumps, as part of the physical device, and so confidence in the ability of health profession-
regulates such software as part of the device als to protect confidentiality. While the law
(FDA 2011). The courts are likely to view such has been accompanied by debate both on the
software using principles of strict product lia- adequacy of its measures and the question
bility (Miller and Miller 2007). Most recently, whether compliance was unnecessarily bur-
the FDA has contemplated wider regulatory densome, it nevertheless established the first
scope (see 7 Sect. 12.5.3). nationwide health privacy protections. At its
414 K. W. Goodman and R. A. Miller
core, HIPAA embodies the idea that individu- 12.5.2.3 Copyright, Patents,
als should have access to their own health and Intellectual Property
data, and more control over uses and disclo- Intellectual property protection afforded to
sures of that health data by others. Among its developers of software programs, biomedical
provisions, the law requires that patients be knowledge bases, and World Wide Web pages
informed about their privacy rights, including remains an underdeveloped area of law.
a right of access; that uses and disclosures of Although there are long traditions of copy-
“protected health information” generally be right and patent protections for non-electronic
limited to exchanges of the “minimum neces- media, their applicability to computer-based
sary”; that uses and disclosures for other than resources is not clear. Copyright law protects
treatment, payment and health care opera- intellectual property from being copied verba-
tions be subject to patient authorization; and tim, and patents protect specific methods of
that all employees in “covered entities” (insti- implementing or instantiating ideas. The
tutions that HIPAA legally affects) be edu- number of lawsuits in which one company
cated about privacy and information security. claimed that another copied the functionality
As noted above, the HITECH Act pro- of its copyrighted program (i.e., its “look and
vided substantial encouragement for feel”) has grown, however, and it is clear that
Electronic Health Record (EHR) develop- copyright law does not protect the “look and
ment, particularly the encouragement of bil- feel” of a program beyond certain limits.
lions of dollars in federal subsidies for Consider, for example, the unsuccessful suit in
“meaningful use” of EHRs. However, the 1980s by Apple Computer, Inc., against
HITECH also contained many changes to Microsoft, Inc., over the “look and feel” of
HIPAA privacy and security requirements, Microsoft Windows as compared with the
strengthening the regulations that affect the Apple Macintosh interface (which itself
collection, use and disclosure of health infor- resembled the earlier Xerox Alto interface).
mation not only by covered entities, but also It is not straightforward to obtain copy-
the “business associates” (contractors) of right protection for a list that is a compilation
12 those covered entities, and other types of of existing names, data, facts, or objects (e.g.,
organizations engaged in health information the telephone directory of a city), unless you
exchange. can argue that the result of compiling the
The Office of Civil Rights in the compendium creates a unique object (e.g., a
U.S. Department of Health and Human new organizational scheme for the informa-
Services remains the entity primarily charged tion) (Tysyer 1997). Even when the compila-
with HIPAA enforcement, but there is now a tion is unique and copyrightable, the
role for states’ attorneys general as well as individual components, such as facts in a
other agencies such as the Federal Trade database, might not be copyrightable. That
Commission. HITECH increases penalty they are not copyrightable has implications
levels under HIPAA and includes a mandate for the ability of creators of biomedical data-
for investigations and periodic audits, shifting bases to protect database content as intellec-
the enforcement balance away from voluntary tual property. How many individual,
compliance and remediation plans. unprotected facts can someone copy from a
HITECH’s changes to HIPAA, those from copyright-protected database before legal
other federal laws such as the Genetic protections prevent additional copying?
Information Nondiscrimination Act of 2008 A related concern is the intellectual-
(GINA) and the Patient Safety and Quality property rights of the developers of materials
Improvement Act of 2005, and the new atten- made available through the World Wide Web.
tion to information privacy and security in Usually, information made accessible to the
most states’ laws, comprise significant changes public that does not contain copyright anno-
to the legal-regulatory landscape for health tations is considered to be in the public
information. domain. It is tempting to build from the work
Ethics in Biomedical and Health Informatics: Users, Standards, and Outcomes
415 12
of other people in placing material on the The consortium recommended the
Web, but copyright protections must be following:
respected. Similarly, if you develop poten- 55 Recognition of four categories of clinical
tially copyrightable material, the act of plac- system risks and four classes of monitor-
ing it on the Web, in the public domain, would ing and regulatory actions that can be
allow other people to treat your material as applied based on the level of risk in a given
not protected by copyright. Resolution of this setting.
and related questions may await workable 55 Local oversight of clinical software sys-
commercial models for electronic publication tems, whenever possible, through the cre-
on the World Wide Web, whereby authors ation of autonomous software oversight
could be compensated fairly when other peo- committees, in a manner partially analo-
ple use or access their materials. Electronic gous to the institutional review boards
commerce might eventually provide copyright that are federally mandated to oversee pro-
protection (and perhaps revenue) similar to tection of human subjects in biomedical
age-old models that now apply to paper-based research. Experience with prototypical
print media; for instance, to use printed books software-oversight committees at pilot
and journals, you must generally borrow them sites should be gained before any national
from a library, purchase them or access them dissemination.
under Creative Commons or similar open- 55 Adoption by health care-information sys-
access platforms. tem developers of a code of good business
practices.
55 Recognition that budgetary, logistic, and
12.5.3 Regulation and Monitoring other constraints limit the type and num-
of Computer Applications ber of systems that the FDA can regulate
in Health Care effectively.
55 Concentration of FDA regulation on
In the mid-1990s, the U.S. Food and Drug those systems posing highest clinical risk,
Administration (FDA) held public meetings with limited opportunities for competent
to discuss new methods and approaches to human intervention, and FDA exemption
regulating clinical software systems as medi- of most other clinical software systems.
cal devices. In response, a consortium of pro-
fessional organizations related to health care The recommendations for combined local and
information (AMIA, the Center for Health FDA monitoring are summarized in
Care Information Management, the . Table 12.1. We do not yet know whether
Computer-Based Patient Record Institute, the improved outcomes would occur if vendors
American Health Information Management were to give qualified (i.e., informatics-
Association, the Medical Library Association, capable) institutional purchasers greater local
the Association of Academic Health Science control over system functionality.
Libraries, and the American Nurses Section 618 of the 2012 Food and Drug
Association) drafted a position paper pub- Administration Safety and Innovation Act
lished in both summary format and as a lon- (FDASIA), Public Law 112- 144, mandated a
ger discussion with detailed background and new generation of oversight guidelines for
explanation (Miller and Gardner 1997a, b). clinical software. Pursuant to the legislation,
The position paper was subsequently endorsed the FDA, the Office of the National
by the boards of directors of all the organiza- Coordinator
for Health Information
tions (except the Center for Health Care Technology, and the Federal Communications
Information Management) and by the Commission held public hearings and con-
American College of Physicians Board of ducted workshops. The ensuing April 2014
Regents. FDASIA Health IT report (FDASIA 2014)
416 K. W. Goodman and R. A. Miller
.. Table 12.1 Consortium recommendations for monitoring and regulating clinical software systems
Regulatory class
Variable A B C D
Reproduced with permission from Author(s). Miller and Gardner (1997a). ©American College of Physicians
aIncludes systems that provide factual content or simple, generic advice (such as “give flu vaccine to eligible
ences) and give ample opportunity for users to ignore or override suggestions
cSystems that have higher clinical risk (such as those that generate diagnoses or therapies ranked by score) but
allow users to ignore or override suggestions easily; net risk is therefore intermediate
dSystems that have great clinical risk and give users little or no opportunity to intervene (such as a closed-loop
met Congress’ requirements to propose “strat- The FDASIA Health IT Report specified
egy and recommendations on an appropriate, three categories of risk based on system func-
risk-
based regulatory framework pertaining tionality, rather than software product cate-
to health-information technology, including gory or on implementation platform
mobile medical applications, that promotes (FDASIA 2014). The functionality categories
innovation, protects patient safety, and avoids are:
regulatory duplication.” The Report imple- 1. Administrative health IT functions. Non-
mented many recommendations from the exhaustive examples given in the Report
National Academy of Medicine’s 2012 report, include billing and claims processing,
“Health IT and Patient Safety: Building Safer inventory management, and scheduling.
Systems for Better Care” (IOM 2012a). The FDASIA Report categorizes those
Ethics in Biomedical and Health Informatics: Users, Standards, and Outcomes
417 12
functions posing little or no risk to the as we just saw, (2) there are good reasons to
patient, and exempt from additional over- consider the adoption of software oversight
sight. committees or something similar, then it is
2. Health management IT functions. Non- worthwhile to consider the ethical utility of
exhaustive examples cited in the Report efforts to review and endorse medical soft-
include encounter documentation, elec- ware and systems.
tronic access to clinical results, non-device- Established in 2004, the Certification
related clinical decision support, Commission for Health Information
medication management, provider order Technology, in collaboration with the Office
entry, knowledge management, and elec- of the National Coordinator for health infor-
tronic communication, including health mation technology, assesses electronic health
information exchange. The FDASIA records according to an array of criteria, in
Report asserts that the actual safety risks part to determine their success in contributing
posed by this category are for the most to “meaningful use.” These criteria address
part outweighed by their potential bene- matters ranging from electronic provider
fits, and require limited national-level order entry and electronic problem lists to
oversight. Whereas the FDA previously decision support and access control (cf.
played a major regulatory and monitoring Classen et al. 2007; Wright et al. 2009). The
role for applications with this category of criteria, tests and test methods are developed
functionality, the FDASIA Report trans- in concert with the National Institute of
fers responsibility for such oversight to a Standards and Technology. Practices and
collaboration between ONC and commer- institutions that want to receive government
cial vendors. incentive payments must adopt certified elec-
3. Medical device health IT functions. Non- tronic health record technologies.
exhaustive examples listed in the Report Conceived under the American Recovery
include computer-assisted detection soft- and Reinvestment Act, these processes aim to
ware, notification of real-time alarms from improve outcomes, safety and privacy.
bedside monitors, and robotic surgery sys- Whether they can accomplish this—as
tems. The FDA will maintain oversight opposed to celebrate technology for its own
responsibility for device-related clinical sake—is an excellent source of debate
software. (Hartzband and Groopman 2008). What
should be uncontroversial is that any system
A given application or product may involve of regulation, review or certification must be
more than one of the functionality categories. based on and, as a matter of process empha-
In concordance with the National size, certain values. These might include,
Academy of Medicine 2012 recommendations among others, patient-centeredness, ethically
(IOM 2012a), the FDASIA Report also cre- optimized data management practices, and
ated a public-private Health IT Safety Center, what we have here commended as the “stan-
to be coordinated by ONC. It will promote dard view,” that is, human beings and not
innovations regarding patient safety and iden- machines practice medicine, nursing and
tify interventions to improve safety, including psychology.
education about best practices. The move to certification has unfortu-
nately engendered precious little in the way of
ethical analysis, however. To make any system
12.5.4 Software Certification of regulation, review or certification ethically
and Accreditation credible, government and industry leaders
must eventually make explicit that attention
If, as above, (1) there is an ethical obligation to ethics is a core component of their efforts.
to evaluate health information systems in the An ethical approach to certification of
contexts in which they are being used, and if, clinical applications should entail “in vivo”
418 K. W. Goodman and R. A. Miller
(on the front lines of clinical care) as well as developers could directly and efficiently
“in vitro” (laboratory-based) testing (IOM address local problems of system functional-
2012a). As stated in 7 Sect. 12.2.2, “A com- ity. Present-day commercial systems are inflex-
puter program should be used in clinical prac- ible, opaque, and maintained by vendors from
tice only after appropriate evaluation of its a distance. The high sticker price of such sys-
efficacy and the documentation that it per- tems guarantees that, once purchased, institu-
forms its intended task at an acceptable cost tions cannot afford to de-install and replace
in time and money.” Federal and other certifi- problematic systems. Government mandates
cation programs currently address, in vitro, to install such expensive, disruptive “certified”
whether certain pre-specified technical capa- software systems appear to some as unethical.
bilities exist within a software application. The certification process should be expanded
They do not, however, determine whether a to evaluate the pragmatic, local, post-
given software package will be usable at an installation aspects of system function “at an
affordable cost in time and money in vivo, acceptable cost in time and money.”
post-installation. The diminished patient-care workflows
Almost all vendors’ comprehensive, engendered by cumbersome clinical software
institutional-
level clinical information sys- systems potentially increase the time and
tems now pass the minimal federal certifica- money costs of delivering quality healthcare –
tion standards. Those certification standards costs borne by patients, third-party payors,
test algorithmic functionality while neglecting and the government. Clinicians often pay a
to assess real-world clinical impacts post- higher price – beyond lost revenues. In their
installation. The 2012 National Academy of 2017 commentary, “The HITECH Era in
Medicine report on Health IT and Patient Retrospect,” Halamka and Tripathi stated,
Safety noted, “poor usability … is one of the “we lost the hearts and minds of clinicians. …
single greatest threats to patient safety. … We expected interoperability without first
Evaluation of the impact of health IT on building the enabling tools. In a sense, we gave
usability and on cognitive workload is impor- clinicians suboptimal cars, didn’t build roads,
12 tant to determine unintended consequences and then blamed them for not driving”
and the potential for distraction, delays in (Halamka and Tripathi 2017). Verghese, Shah,
care, and increased workload in general. and Harrington, in a 2018 JAMA Viewpoint,
Usability guidelines and principles focused on added: “The nationwide implementation of
improving safety need to be put into practice” electronic medical records (EMRs) resulted in
(IOM 2012a, p. 81). The certification stan- many unanticipated consequences … the
dards also do not fully evaluate the accuracy redundancy of the notes, the burden of alerts,
or completeness of systems’ underlying infor- and the overflowing inbox has contributed to
mation/knowledge bases. Nor do they evalu- … physician reports of symptoms of burnout.
ate information/knowledge base accuracy and … Most EMRs serve their front-line users
maintainability over time. quite poorly” (Vergese et al. 2018).
Many institutional-level systems are so Installation of vendors’ massive, complex,
expensive that they financially cripple the institutional clinical software products creates
medical centers and clinicians’ offices that additional ethical dilemmas. Post-installation,
adopt them. Many sites experience substan- clinicians lack a clear or deep understanding
tial decreases in post-installation revenue of how system functions affect their patients’
(usually transient, lasting several months, but care and safety. Institutions no longer possess
sometimes persistent). System-induced dis- the level of control/autonomy to change their
ruptions of workflows diminish the number systems locally, as they did decades ago when
of patients who can be seen, and impair (at many academic medical centers had home-
least temporarily) charge capture for billable grown systems that they could manage and
services. The FDASIA Report minimizing “evolve” at will. Prior to the advent of such
decision support oversight fits better with past large, complex clinical systems, clinicians
implementations, when academic system directly responsible for a patient’s care
Ethics in Biomedical and Health Informatics: Users, Standards, and Outcomes
419 12
personally knew and supervised all important, each decision-support rule transparent. Such
relevant decision-making. By contrast, after transparency should require that, without
system installation, clinicians must obtain sub- special training, a clinical user could easily
stantial system-related technical training and (and on request during system use) determine
expertise to be able to access or alter the clini- the underlying logic and data supporting each
cal knowledge underlying a vendor system’s instance of patient-specific advice. Systems
patient-specific recommendations. should also enable display of the evidential
Even more opaque are the hidden mecha- basis for other, more general advice – infor-
nisms for how specific knowledge is brought mation that might become outdated over time.
to bear during clinical decision support. For Vendors must expose, in non-technical terms,
example, a system might have a “patient is how decision-support triggers are locally
pregnant” indicator. That indicator may trig- determined at each site.
ger warnings when someone orders medica-
tions contra-indicated in pregnancy, or when
physicians order radiological studies deemed 12.6 Summary and Conclusions
unsafe for the condition. The ethical issue is
that most vendor systems obscure the basis Ethical issues are important in biomedical
for how the “patient is pregnant” flag is set informatics, and especially so in the clinical
locally. For example, Hospital A might use the arena. An initial ensemble of guiding princi-
nurse’s admission intake interview form to set ples, or ethical criteria, has emerged to orient
the pregnancy flag. The underlying informa- decision making:
tion gathered by the nurse in such a setting 1. Specially trained human beings (e.g.,
might be the patient’s response to the question licensed practitioners) remain, so far, best
“Are you pregnant now?” The latter is inade- able to provide health care for other human
quate for patient safety. Hospital B might use beings. Computer software developers
the result of a patient’s beta-HCG test to set should strive to warn caregivers whenever
the pregnancy flag. While a more reliable indi- it appears that a mistake is imminent.
cator of pregnancy than hearsay from the However, because clinical practice involves
patient, the beta-HCG test may not be ordered as many exceptions as rules, software sys-
by every clinician in all relevant circum- tems should not be allowed to overrule a
stances; hence, that mechanism may also be clinician’s decision once a warning has
imperfect for decision support. A physician been issued.
practicing at both Hospital A and Hospital B 2. Practitioners who use informatics tools
would be unlikely to know that the flags have should be clinically qualified and ade-
different meanings at each site. Yet all the cli- quately trained in using the software
nician can see is the status of the flag. products.
This ethical problem extends far beyond 3. The tools themselves should be carefully
setting pregnancy flags. System vendors may evaluated and validated, in vitro and
claim to provide a wide range of patient- in vivo.
safety related decision support tools, but 4. Health informatics tools and applications
responsible care providers cannot trust should be evaluated not only in terms of
system-generated advice when the triggers for performance, including efficacy, but also in
decision support rules are potentially unreli- terms of their influences on institutions,
able and inaccessible. Tort law requires institutional cultures, and workplace social
clinician-providers to uphold the standard of forces.
care for their patients. Ignorance of the basis 5. Ethical obligations should extend to sys-
for important system-initiated clinical advice tem developers, maintainers, and supervi-
is inconsistent with upholding the standard of sors as well as to clinician users.
care. Certifying agencies should require that 6. Education programs and security mea-
clinical system vendors make the basis for sures should be considered essential for
420 K. W. Goodman and R. A. Miller
protecting confidentiality and privacy electronic health records, including their rela-
while improving appropriate access to per- tionships with hospitals.
sonal patient information. IOM (Institute of Medicine). (2012b). Health IT
7. Adequate oversight should be maintained and patient safety: Building safer systems for
to optimize ethical use of electronic patient better care. Washington, DC: The National
information for scientific and institutional Academies Press. A key National Academy of
research. Medicine report linking appropriate use of
EHRs and patient safety.
New sciences and technologies always raise Miller, R. A. (1990b). Why the standard view is
interesting and important ethical issues. Much standard: People, not machines, understand
the same is true for legal issues, although in patients’ problems. Journal of Medicine and
the absence of precedent or legislation any Philosophy. 15, 581–591. This contribution
legal analysis will remain vague. Similarly, lays out the standard view of health informat-
important challenges confront people who are ics. This view holds, in part, that because only
trying to determine the appropriate role for humans have the diverse skills necessary to
government in regulating health care soft- practice medicine or nursing, machine intelli-
ware. The lack of clear public policy for such gence should never override human clinicians.
software underscores the importance of ethi- Miller, R. A., Schaffner, K. F., Meisel, A. (1985b).
cal insight and education as the exciting new Ethical and legal issues related to the use of
tools of biomedical and health informatics computer programs in clinical medicine.
become more common. Annals of Internal Medicine. 102, 529–536.
This article constitutes a major early effort to
nnSuggested Readings identify and address ethical issues in informat-
Goodman, K. W. (2016b). Ethics, medicine, and ics. By emphasizing the questions of appropri-
information technology: Intelligent machines ate use, confidentiality, and validation, among
and the transformation of health care. others, it sets the stage for all subsequent
Cambridge: Cambridge University Press. work.
12 This volume contains material on informatics
and human values, electronic health records, ??Questions for Discussion
confidentiality and privacy, decision support, 1. What is meant by the “standard view”
prognostic scoring systems, research, includ- of appropriate use of medical
ing the role of learning health systems, and information systems? Identify three
governance of medical computer systems. key criteria for determining whether a
Goodman, K. W. (2020). Ethics and health infor- particular use or user is appropriate.
matics. International Yearbook of Medical 2. Can quality standards for system devel-
Informatics. https://doi. opers and maintainers simultaneously
org/10.1055/s-0040-1701966. This overview safeguard against error and abuse and
emphasizes issues raised by Big Data and stimulate scientific progress? Explain
Machine Learning/Artificial Intelligence. your answers. Why is there an ethical
Goodman, K. W., Berner, E. S., Dente, M. A., obligation to adhere to a standard of
Kaplan, B., Koppel, R., Rucker, D., Sands, care?
D. Z., & Winkelstein, P. (2011). Challenges in 3. Identify (a) two major threats to patient
ethics, safety, best practices, and oversight data confidentiality, and (b) policies or
regarding HIT vendors, their customers, and strategies that you propose for protect-
patients: a report of an AMIA special task ing confidentiality against these threats.
force. Journal of the American Medical 4. Many prognoses by human beings are
Informatics Association, 18(1), 77–81. This subjective and are based on faulty mem-
document is one of the first to examine issues ory or incomplete knowledge of previ-
related to the manufacturers and vendors of ous cases. What are the two drawbacks
Ethics in Biomedical and Health Informatics: Users, Standards, and Outcomes
421 12
to using objective prognostic scoring of personal health records. Journal of Biomedical
systems to determine whether to allo- Informatics, 43(5 Suppl), S3–S5.
Brody, B. A. (1989). The ethics of using ICU scoring
cate care to individual patients?
systems in individual patient management. Problems
5. People who are educated about their in Critical Care, 3, 662–670.
illnesses tend to understand and to Classen, D. C., Avery, A. J., & Bates, D. W. (2007).
follow instructions, to ask insightful Evaluation and certification of computerized pro-
questions, and so on. How can the vider order entry systems. Journal of the American
Medical Informatics Association: JAMIA, 14(1),
World Wide Web improve patient
48–55.
education? How, on the other hand, Curran, W. J., Stearns, B., & Kaplan, H. (1969). Privacy,
might Web access hurt traditional confidentiality, and other legal considerations in the
physician–patient and nurse–patient establishment of a centralized health-data system.
relationships? New England Journal of Medicine, 281, 241–248.
Cushman, R., Froomkin, M. A., Cava, A., Abril, P., &
Goodman, K. W. (2010). Ethical, legal and social
Acknowledgments Reid Cushman, PhD, issues for personal health records and applications.
contributed to this chapter in an earlier edi- Journal of Biomedical Informatics, 43, S51–S55.
tion. His comments are gratefully acknowl- de Dombal, F. T. (1987). Ethical considerations con-
cerning computers in medicine in the 1980s. Journal
edged. of Medical Ethics, 13, 179–184.
Duda, R. O., & Shortliffe, E. H. (1983). Expert systems
research. Science, 220, 261–268.
FDA (U.S. Food and Drug Administration). (2011).
References Infusion pump software safety research at FDA.
Retrieval 18 November 2020: http://www.fda.gov/
Alpert, S. A. (1998). Health care information: Access, MedicalDevices/ProductsandMedicalProcedures/
confidentiality, and good practice. In K. W. GeneralHospitalDevicesandSupplies/
Goodman (Ed.), Ethics, computing, and medicine: InfusionPumps/ucm202511.htm.
Informatics and the transformation of health care FDASIA. (2014). FDASIA health IT report: Proposed
(pp. 75–101). Cambridge: Cambridge University strategy and recommendations for a risk-based frame-
Press. work. Retrieval 18 November 2020: https://www.fda.
Anderson, J. G., & Aydin, C. E. (1994). Overview: g ov / d ow n l o a d s / A b o u t F DA / C e n t e r s O ff i c e s /
Theoretical perspectives and methodologies for the OfficeofMedicalProductsandTobacco/CDRH/
evaluation of health care information systems. In CDRHReports/UCM391521.pdf
J. G. Anderson, C. E. Aydin, & S. J. Jay (Eds.), Friedman, C. P. (2009). A “fundamental theorem” of
Evaluating health care information systems: Methods biomedical informatics. Journal of the American
and applications (pp. 346–354). Thousand Oaks: Medical Informatics Association, 16(2), 169–170.
Sage. Geissbuhler, A. J., & Miller, R. A. (1997). Desiderata for
Anderson, J. G., & Aydin, C. E. (1998). Evaluating med- product labeling of medical expert systems.
ical information systems: Social contexts and ethical International Journal of Medical Informatics, 47(3),
challenges. In K. W. Goodman (Ed.), Ethics, com- 153–163.
puting, and medicine: Informatics and the transfor- Glaser, J. (2010). HITECH lays the foundation for more
mation of health care (pp. 57–74). Cambridge: ambitious outcomes-based reimbursement. The
Cambridge University Press. American Journal of Managed Care, 16(12), SP19–
Atreya, R. V., Smith, J. C., McCoy, A. B., Malin, B., & SP23.
Miller, R. A. (2013). Reducing patient re- Goodman, K. W. (1996). Ethics, genomics and informa-
identification risk for laboratory results within tion retrieval. Computers in Biology and Medicine,
research datasets. Journal of the American Medical 26, 223–229.
Informatics Association, 20, 95–101. Goodman, K. W. (1998). Outcomes, futility, and health
Benitez, K., & Malin, B. (2010). Evaluating re- policy research. In K. W. Goodman (Ed.), Ethics,
identification risks with respect to the HIPAA pri- computing, and medicine: Informatics and the trans-
vacy rule. Journal of the American Medical formation of health care (pp. 116–138). Cambridge:
Informatics Association, 17, 169–177. Cambridge University Press.
Blois, M. S. (1980). Judgement and computers. New Goodman, K. W. (2016a). Ethics, medicine, and informa-
England Journal of Medicine, 303, 192–197. tion technology: Intelligent machines and the trans-
Blumenthal, D. (2010). Launching HITECH. New formation of health care. Cambridge: Cambridge
England Journal of Medicine, 362(5), 382–385. University Press.
Brennan, P. F., Downs, S., & Casper, G. (2010). Project Goodman, K. W. (2017). Health information technol-
HealthDesign: Rethinking the power and potential ogy as a universal donor to bioethics education.
422 K. W. Goodman and R. A. Miller
Cambridge Quarterly of Healthcare Ethics, 26(2), Majchrowski, B. (2010). Medical software’s increasing
342–347. impact on healthcare and technology management.
Goodman, K. W., & Cava, A. (2008). Bioethics, business Biomedical Instrumentation and Technology, 44(1),
ethics, and science: Bioinformatics and the future of 70–74.
healthcare. Cambridge Quarterly of Healthcare Malin, B., & Goodman, K. W. (2018). Between access
Ethics, 17(4), 361–372. and privacy: Challenges in sharing health data. IMIA
Goodman, K. W., Berner, E. S., Dente, M. A., Kaplan, Yearbook of Medical Informatics, 27, 55–59.
B., Koppel, R., Rucker, D., Sands, D. Z., & Malin, B., & Sweeney, L. (2004). How (not) to protect
Winkelstein, P. (2010). Challenges in ethics, safety, genomic data privacy in a distributed network:
best practices, and oversight regarding HIT ven- Using trail re-identification to evaluate and design
dors, their customers, and patients. Journal of the anonymity protection systems. Journal of Biomedical
American Medical Informatics Association, 18(1), Informatics, 37, 179–192.
77–81. Malin, B., Loukides, G., Benitez, K., & Clayton, E. W.
Halamka, J. D., & Tripathi, M. (2017). The HITECH (2011). Identifiability in biobanks: Models, mea-
era in retrospect. New England Journal of Medicine, sures, and mitigation strategies. Human Genetics,
310, 907–909. 130, 383–392.
Hartzband, P., & Groopman, J. (2008). Off the record— McNeer, J. F., Wallace, A. G., Wagner, G. S., Starmer,
Avoiding the pitfalls of going electronic. New C. F., & Rosati, R. A. (1975). The course of acute
England Journal of Medicine, 358(16), 1656–1658. myocardial infarction: Feasibility of early discharge
Holroyd-Leduc, J. M., Lorenzetti, D., Straus, S. E., of the uncomplicated patient. Circulation, 51, 410–
Sykes, L., & Quan, H. (2011). The impact of the 413.
electronic medical record on structure, process and Miller, R. A. (1989). Legal issues related to medical deci-
outcomes within primary care: A systematic review sion support systems. International Journal of
of the evidence. Journal of the American Medical Clinical Monitoring and Computing, 6, 75–80.
Informatics Association, 18, 732–737. Miller, R. A. (1990a). Why the standard view is stan-
Ienca, M., & Vayena, E. (2020). On the responsible use dard: People, not machines, understand patients’
of digital data to tackle the COVID-19 pandemic. problems. The Journal of Medicine and Philosophy,
Nature Medicine, 26, 463–464. 15, 581–591.
IOM (Institute of Medicine). (2012a). Health IT and Miller, R. A. (1997). Predictive models for primary care-
patient safety: Building safer systems for better care. givers: Risky business? Annals of Internal Medicine,
Washington, DC: The National Academies Press. 127(7), 565–567.
Kaplan, B., & Harris-Salamone, K. D. (2009). Health IT Miller, R. A., & Gardner, R. M. (1997a). Summary rec-
12 success and failure: Recommendations from litera-
ture and an AMIA workshop. Journal of the
ommendations for the responsible monitoring and
regulation of clinical software systems. Annals of
American Medical Informatics Association, 16, 291– Internal Medicine, 127(9), 842–845.
299. Miller, R. A., & Gardner, R. M. (1997b). Recommen
Knaus, W. A., Wagner, D. P., & Lynn, J. (1991). Short- dations for responsible monitoring and regulation of
term mortality predictions for critically ill hospital- clinical software systems. Journal of the American
ized adults: Science and ethics. Science, 254, Medical Informatics Association, 4, 442–457.
389–394. Miller, R. A., & Goodman, K. W. (1998). Ethical chal-
Koppel, R., & Kreda, D. (2009). Health care informa- lenges in the use of decision-support software in
tion technology vendors’ “hold harmless” clause: clinical practice. In K. W. Goodman (Ed.), Ethics,
Implications for patients and clinicians. JAMA, 301, computing, and medicine: Informatics and the trans-
1276–1278. formation of health care (pp. 102–115). Cambridge:
Kuperman, G. J., & Gibson, R. F. (2003). Computer Cambridge University Press.
physician order entry: Benefits, costs, and issues. Miller, R. A., & Miller, S. M. (2007). Legal and regula-
Annals of Internal Medicine, 139(1), 31–39. tory issues related to the use of clinical software in
Lane, W. A. (1936). What the mouth reveals. New health care delivery. In R. A. Greenes (Ed.), Clinical
Health, 11, 34–35. decision support: The road ahead (pp. 423–444).
Levis, J. (Ed.). (2014). H.I.T. or miss: Lessons learned Boston: Elsevier.
from health information technology implementations. Miller, R. A., Schaffner, K. F., & Meisel, A. (1985a).
Chicago: American Health Information Ethical and legal issues related to the use of com-
Management Association. puter programs in clinical medicine. Annals of
Macklin, R. (1992). Privacy and control of genetic Internal Medicine, 102, 529–536.
information. In G. J. Annas & S. Elias (Eds.), Gene National Research Council. (1997). For the record:
mapping: Using law and ethics as guides (pp. 157– Protecting electronic health information.
172). New York: Oxford University Press. Washington, D.C.: National Academy Press.
Ethics in Biomedical and Health Informatics: Users, Standards, and Outcomes
423 12
Palfrey, J., & Gasser, U. (2010). Born digital: Sittig, D. F., & Singh, H. (2016). A socio-technical
Understanding the first generation of digital natives. approach to preventing, mitigating, and recovering
New York: Basic Books. from ransomware attacks. Applied Clinical
Pugh, J., Pycroft, L., Sandberg, A., Aziz, T., & Savulescu, Informatics, 7(2), 624–632.
J. (2018). Brainjacking in deep brain stimulation Slayton, T. B. (2018). Ransomware: The virus attacking
and autonomy. Ethics and Information Technology, the healthcare industry. Journal of Legal Medicine,
20(3), 219–232. 38(2), 287–311.
Robertson, J. (2011). Insulin pumps, monitors vulnera- Sweeney, L. (1997). Weaving technology and policy
ble to hacking. Associated Press via various media, together to maintain confidentiality. The Journal of
e.g., The Washington Times, August 4. Retrieval Law, Medicine Ethics, 25, 98–110.
November 17, 2020: http://www.washingtontimes. Szolovits, P., & Pauker, S. G. (1979). Computers and
com/news/2011/aug/4/insulin-pumps-monitors-vul- clinical decision making: Whether, how much, and
nerable-to-hacking/?page=all. for whom? Proceedings of the IEEE, 67, 1224–1226.
Sackner-Bernstein, J. (2017). Design of hack-resistant Tamersoy, A., Loukides, G., Nergiz, M. E., Saygin, Y., &
diabetes devices and disclosure of their cyber safety. Malin, B. (2012). Anonymization of longitudinal
Journal of Diabetes Science and Technology, 11(2), electronic medical records. IEEE Transactions on
198–202. Information Technology in Biomedicine, 16, 413–423.
Shih, S. C., McCullough, C. M., Wang, J. J., Singer, J., & Truog, R. D., Mitchell, C., & Daley, G. Q. (2020). The
Parsons, A. S. (2011). Health information systems in toughest triage — Allocating ventilators in a pan-
small practices. Improving the delivery of clinical demic. New England Journal of Medicine, 382, 1973–
preventive services. American Journal of Preventive 1975.
Medicine, 6, 603–609. Tysyer, D. A. (1997). Database legal protection. Bitlaw.
Shortliffe, E. H. (1993). Doctors, patients, and comput- Retrieval 18 November 2020: http://www.bitlaw.
ers: Will information technology dehumanize com/copyright/database.html
health-care delivery? Proceedings of the American Vergese, A., Shah, N. H., & Harrington, R. A. (2018).
Philosophical Society, 137(3), 390–398. What this computer needs is a physician. Journal of
Shortliffe, E. H. (1994). Dehumanization of patient the American Medical Association, 319(1), 19–20.
care. Are computers the problem or the solution? Wright, A., Sittig, D. F., Ash, J. S., Sharma, S., Pang,
Journal of the American. Medical Informatics J. E., & Middleton, B. (2009). Clinical decision sup-
Association, 1, 76–78. port capabilities of commercially-available clinical
Sittig, D. F., Krall, M., Kaalaas-Sittig, J., & Ash, J. S. information systems. Journal of the American
(2005). Emotional aspects of computer-based pro- Medical Informatics Association, 16(5), 637–644.
vider order entry: A qualitative study. Journal of the Youngner, S. J. (1988). Who defines futility? JAMA, 260,
American Medical Informatics Association, 12(5), 2094–2095.
561–567.
425 13
Evaluation of Biomedical
and Health Information
Resources
Charles P. Friedman and Jeremy C. Wyatt
Contents
References – 462
Evaluation of Biomedical and Health Information Resources
427 13
nnLearning Objectives decision to be made and, particularly, how
After reading this chapter, you should know much is at stake. But all activities labeled as
the answers to these questions: evaluation involve the empirical process of
55 Why are empirical studies based on the collecting information that is relevant to the
methods of evaluation and technology decision at hand. For example, when choosing
assessment important to the successful a holiday destination, members of a family
implementation of information may informally ask friends which Hawaiian
resources to improve health? island they prefer and browse various websites
55 What challenges make studies in infor- including those that provide ratings of specific
matics difficult to carry out? How are destinations. After factoring in costs and con-
these challenges addressed in practice? venience, the family reaches a decision. More
55 Why can all evaluations be classified as formally, when a health care organization
empirical studies? faces the choice of a new electronic health
55 What features do all evaluations have in record system, the leadership will develop a
common? plan to collect comparable data about com-
55 What are the key factors to take into peting systems, analyze the data according to
account as part of a process of deciding the plan, and ultimately, through a predeter-
what are the most important questions mined process, make a decision.
to use to frame a study? The field of biomedical and health infor-
55 What are the major assumptions under- matics focuses on the collection, processing,
lying objectivist and subjectivist and communication of health-related infor-
approaches to evaluation? What are the mation and the implementation of informa-
strengths and weaknesses of each tion resources—usually consisting of digital
approach? technology designed to interact with people—
55 How does one distinguish measurement to facilitate these activities.1 These informa-
and demonstration aspects of objectiv- tion resources can collect, store, and process
ist studies, and why are both aspects data related to the health of individual per-
necessary? In the demonstration aspect sons (institutional or personal electronic
of objectivist studies, how are control health records), manage and reason about
strategies used to draw inferences? biomedical knowledge (knowledge acquisi-
55 What steps are followed in objectivist tion tools, knowledge bases, decision-support
and subjectivist studies? What tech- systems, and intelligent tutoring systems), and
niques are employed by investigators to support activities related to public health (dis-
ensure rigor and credibility of their ease registries and vital statistics, disease out-
findings? break detection and tracking). Thus, there is a
55 Why is communication between investi- vast range of biomedical and health informa-
gators and stakeholders central to the tion resources that can be foci of evaluation.
success of any evaluation? Information resources have many differ-
ent aspects that can be studied (Friedman
and Wyatt 2005; Chap. 3). Where safety is
13.1 Introduction an issue, as it often is, (Fox 1993; Black et al.
2011; Russ et al. 2014), we might focus on
Most people understand the term evalu-
ation to mean an assessment of an orga-
nized, purposeful activity. Evaluations are 1 In this chapter, we will use the terms “information
usually conducted to answer questions or in resource” and “information system” generally as
anticipation of the need to make decisions synonyms. However, “information system” applies
more specifically to applications of digital technol-
(Wyatt and Spiegelhalter 1990; Ammenwerth ogy whereas a “resource” is a broad term that could,
2015). Evaluations may be informal or for- for example, include informal collegial consulta-
mal, depending on the characteristics of the tions.
428 C. P. Friedman and J. C. Wyatt
process of evaluation we address in this chap- Nielsen’s invaluable resource on user testing3),
ter. Almost all of the methodological issues but we will see that all measurement processes
we raise apply to safety assessments. Casual have features that make their results more or
assessments that fail to address these issues less dependable and useful. We will also see
will not resolve the safety question, and will that the measurement processes built into
not reveal safety defects that can be remedied. evaluation studies can themselves be designed
Many of these issues are issues of sampling to make the results of the studies more helpful
that we introduce in 7 Sect. 15.4.2. For exam- to all stakeholders, including those focused on
ple, the advice or other “output” generated by safety.
most information resources depends critically
on the quality and quantity of data available
to it and on the manner in which the resource 13.3 Two Universals of Evaluation
is used by patients or practitioners. People or
practitioners who are untrained, in a hurry, or 13.3.1 he Full Range of What Can
T
exhausted at 3 A.M., are more likely to fail to Be Formally Studied
enter key data that might lead to the resource
generating misleading advice, or to fail to heed Deciding what to study is fundamentally a
an alarm that is not adequately emphasized process of winnowing down from a universe
by the user interface. Coded data automati- of potential questions to a parsimonious set
cally entered into resources may be inaccurate, of questions that can be realistically addressed
incomplete, or not coded in the manner antici- given the priorities, time, and resources avail-
pated by the resource. Thus, to generate valid able. This winnowing process can begin with
results, functional tests must put the resources the full range of what can potentially be stud-
in actual users’ hands under the most realistic ied. To both ensure that the most important
conditions possible, or in the hands of people questions do get “on the table” and to help
with similar knowledge, skills and experience eliminate the less important ones, it can be
if samples of intended users are not available. useful to start with a comprehensive list.
For example, a Facebook advertisement for While experienced evaluators do not typically
an app to help women detect when conception begin study planning from this broadest per-
13 was likely from body temperature readings spective, it is always helpful to have a broad
was withdrawn in the UK because the accu- range of options in mind.
racy quoted in publicity materials related to There are five major aspects of an infor-
ideal use rather than use in everyday practice.2 mation resource that can be studied:
[BBC news story 29-8-18]. 1. Need for the resource: In advance of any
Other safety issues are, from a methodolog- development, investigators can study the
ical perspective, issues of measurement that we status quo absent the resource, includ-
address in 7 Sect. 13.4.2. For example, should ing the nature of problems the resource
“usability” of an information resource be is intended to address and how frequently
determined by documenting that the resource these problems arise. (When an informa-
development process followed best practices tion resource is already deployed, the “sta-
to inculcate usability, asking end-users if they tus quo” might be the currently deployed
believed the resource was usable, or by docu- resource, and the resource under study is
menting and studying their “click streams” to a proposed replacement for it or enhance-
determine if end-users actually navigated the ment to it.)
resource as the designers intended? There is no 2. Design and development process:
single clear answer to this question (see Jackob Investigators study the skills of the devel-
2 7 h t t p s : / / w w w. b b c . c o . u k / n e w s / t e c h n o l - 3 7 https://www.nngroup.com/articles/ (Accessed
ogy-45328965 (Accessed 11.20.19). 11.20.19).
Evaluation of Biomedical and Health Information Resources
431 13
opment team, and the development 13.3.2 he Structure of All
T
methodologies employed by the team, Evaluation Studies,
to understand if the resulting resource is
Beginning with a
likely to function as intended.
3. Resource static structure: Here the focus Negotiation Phase
of the evaluation includes specifications,
flow charts, program code, and other rep- If the list offered in the previous section can
resentations of the resource that can be be seen as the universe of what can be studied,
inspected without actually running it. . Fig. 13.1 can be used as a framework for
4. Resource usability and dynamic functions: planning all evaluation studies. The first stage
The focus is on whether the resource has in any study is negotiation between the “inves-
the potential to be beneficial: the degree tigators” (or “evaluators”) who will be carry-
to which intended end-users can navigate ing out the study and the “stakeholders” who
the resource and how it performs when it have interests in or otherwise will be concerned
is used in pilots prior to full deployment. about the study results. Before a study can pro-
5. Resource use, effect and impact: Finally, ceed, the key stakeholders who are supporting
after deployment, the focus switches from the study financially and providing other essen-
the resource itself to the extent of its use and tial resources for it—such as the institution
its effects on professional, patient or public where the information resource is deployed—
users, and on health care organizations. must be satisfied with the general plan. The
negotiation phase identifies the broad aim and
In a theoretically “complete” evaluation, objectives of the study, what kinds of reports
sequential studies of a particular resource might and other deliverables will result and by when,
address all of these aspects, over the life cycle where the study personnel will be based, the
of the resource. In the real world, however, it is resources available to conduct the study, and
difficult, and rarely necessary, to be so compre- any constraints on what can be studied. When a
hensive. Over the course of its development and study of an information resource is being con-
deployment, a resource may be studied many ducted internally—that is, when all of the key
times, with the studies in their totality touching stakeholders represent one organization that
on many or most of these aspects. Some aspects also employs the investigators—it is still very
of an information resource will be studied infor- useful to have an internal negotiation to lay out
mally using anecdotal data collected via casual details of the study.
methods. Other aspects will be studied more The results of the negotiation phase are
formally in ways that are purposefully designed expressed in a document, generally known as
to inform specific development decisions and a contract between the evaluators and the key
that involve systematic collection and analysis stakeholders. The contact guides the planning
of data. Distinguishing those aspects that will and execution of the study and, in a very sig-
be studied formally from those left for informal nificant way, protects all parties from misun-
exploration is a challenging task facing all evalu- derstandings about intent and execution. Like
ators. any contract, an evaluation contract can be
changed later with consent of all parties.
Stakeholder
Contract
Decisions
432 C. P. Friedman and J. C. Wyatt
Following the negotiation process and its on how clinical staff spend their time, the
reflection in a contract, the planning of the rate and severity of adverse drug events and
evaluation proceeds in a sequence of logi- the length of patient stay?”).
cal steps, starting with the formulation of 55 It allows different stakeholders in the
specific questions to be addressed, then the evaluation process—patients, professional
selection of the type(s) of study that will be groups, managers – to see the extent to
used, the investigation that entails the collec- which their own concerns are being
tion and analysis of data, and ultimately the addressed, and to ensure that these feed
communication back to the stakeholders of into the evaluation process.
the findings, which typically inform a range 55 Most important, perhaps, it is hard if not
of decisions. Although . Fig. 13.1 portrays impossible to develop investigative
a one-way progression through this sequence methods without first identifying quest
of stages, in the real world of evaluation there ions, or at least focused issues, for
are often detours and backtracks. exploration. The choice of methods
follows from the evaluation questions: not
from the novel technology powering the
13.4 eciding What to Study and
D information resource or the type of
resource being studied. Unfortunately,
What Type of Study to Do:
some investigators choose to apply the
Questions and Study Types same set of the methods to any study,
irrespective of the questions to be
13.4.1 he Importance of
T addressed, or even to limit the evaluation
Identifying Questions questions addressed to those compatible
with the methods they prefer. We do not
Once the study’s objective, scope and other endorse this limiting approach.
applicable “ground rules” have been estab-
lished, the real work of study planning Consider the distinction made earlier between
can begin. The next step, as suggested by informal evaluations that people undertake
. Fig. 13.1, is to convert the perspectives of continuously as they make choices as part of
13 the concerned parties, and what these indi-
viduals or groups want to know, into a finite,
their everyday personal or professional lives,
and more formal evaluations that are planned
specific set of questions. It is important to rec- and then executed according to that plan. In
ognize that, for any evaluation setting that is short, formal evaluations are those that con-
interesting enough to merit formal evaluation, form to the architecture of . Fig. 13.1. In
the number of potential questions is infinite. these formal evaluations, the questions that
This essential step of identifying a tractable actually get addressed survive a narrowing pro-
number of questions has a number of benefits: cess that begins with a broad set of candidate
55 It helps to crystallize thinking of both questions. When starting a formal evaluation,
investigators and key members of the therefore, a major decision is whom to consult
audience who are the stakeholders in the to establish the questions that will get “on the
evaluation. table”, how to log and analyze their views, and
55 It guides the investigators and stakeholders what weight to place on each of these views.
through the critical process of assigning There is always a wide range of potential play-
priority to certain issues and thus ers in any evaluation (see 7 Box 13.1) and
productively narrowing the focus of a there is no formula that defines whom to con-
study. sult or in what order. Through this process, the
55 It converts broad statements of aim (e.g., investigators apply their common sense and,
“to evaluate a new order communications with experience, learn to follow their instincts.
system”) into specific questions that can The only universal mistake is to fail to consult
potentially be answered (e.g., “What is the one or more of the key stakeholders, especially
impact of the order communications system those paying for the study or those ultimately
Evaluation of Biomedical and Health Information Resources
433 13
making the key decisions to be informed by
the evaluation. It is often useful to establish 55 Information technology staff and
a group to advise and guide the evaluators, a leaders in the organization where the
group with broad representation that will help resource is deployed
ensure that study remains true to the interests 55 Senior managers in the organization
and preferences of the stakeholders. where the resource is deployed
Through discussions with various stake- 55 The patients whose care the resource
holder groups, the hard decisions regarding may directly or indirectly influence
the questions to be addressed in the study are 55 Staff in ancillary services whose work-
made. A significant challenge for investigators load may be affected by resource
is the risk of getting swamped by detail, result- deployment, for example laboratory or
ing from the multiplicity of questions that can imaging departments following deploy
be asked in any study. To manage through the ment of a diagnostic decision support
process, it is important to reflect on the major system
issues identified after each round of discus- 55 Quality improvement and safety pro
sions with stakeholders, and then identify fessionals in the organization in which
the questions that map to these issues. Where the resource is implemented
possible, keep questions at the same level of
granularity.
It is critical that the specific questions serv-
ing as the beacon guiding the study be deter- 13.4.2 Selecting a Study Type
mined and endorsed by all key stakeholders,
before any significant decisions about the After developing the list of evaluation ques-
detailed design of the study are made. We tions, the next step is to understand which
will see later that evaluation questions can, in study type(s) the evaluation questions natu-
many circumstances, change over the course rally invoke. The study types we will introduce
of a study; but that fact does not obviate the in this chapter are specific to the evaluation
need to specify a set of questions at the out- of information resources, and are particularly
set. 7 Appendix A describes two evaluation informative to the design of evaluation stud-
scenarios, and suggests some evaluation ques- ies in biomedical informatics. These study
tions that may be appropriate for each. types are described below and also summa-
rized in . Table 13.1. The second column
of . Table 13.1 links the study types to the
Box 13.1 Some of the Potential Players aspect of the resource that is studied—as pre-
in an Evaluation Study viously introduced in 7 Sect. 13.3.1. Each
55 Those commissioning the evaluation study type is likely to appeal to certain inter-
study, who will typically have questions ests of particular stakeholders, as suggested
or decisions that rely on the data col- in the rightmost column of the table. A wide
lected range of data collection and analysis meth-
55 Those paying for the evaluation study ods, as discussed later in 7 Sect. 13.5, can be
55 Those paying for the development used to answer the questions embraced by all
and/or deployment of the information nine study types. Choice of a study type typi-
resource cally does not constrain the methods that can
55 End-users of the resource, who are be used to collect and analyze data. And we
often providers of data for the study will see later in this chapter, and specifically in
55 Developers of the resource and their 7 Sect. 13.5.2.2, that all of these study types
managers are what can be called demonstration studies,
55 Care providers and their managers in contrast to so-called measurement studies.
55 Staff responsible for resource imple Finally, a set of studies, exemplifying many of
mentation and user training these study types, is introduced and described
in 7 Appendix B.
434 C. P. Friedman and J. C. Wyatt
.. Table 13.1 Classification of demonstration study types by broad study question and the stakeholders
most concerned
Study type Aspect studied Broad study question Audience/stakeholders primarily interested
in results
1. Needs Need for the What is the problem? Resource developers, funders of the
assessment resource resource
2. Design Design and Is the development Funders of the resource; professional and
validation development method in accord with governmental certification agencies e.g.,
process accepted practices? Food and Drug Administration, Office of
the National Coordinator for HIT
3. Structure Resource static Is the resource Professional indemnity insurers, resource
validation structure appropriately designed developers; professional and
to function as intended? governmental certification agencies
4. Usability Resource Can intended users Resource developers, users, funders
test dynamic usability navigate the resource so
and function it carries out intended
functions?
5. Laboratory Resource Does the resource have Resource developers, funders, users,
function dynamic usability the potential to be academic community
study and function beneficial?
6. Field Resource Does the resource have Resource developers, funders, users
function dynamic usability the potential to be
study and function beneficial in the real
world?
7. L
ab user Resource effect Is the resource likely to Resource developers and funders, users
effect study and impact change user behavior?
8. F
ield user Resource effect Does the resource Resource users and stakeholders, resource
effect study and impact change actual user purchasers and funders
13 behavior in ways that are
positive?
9. Problem Resource effect Does the resource have a The universe of stakeholders
impact and impact positive impact on the
study original problem?
1. Needs assessment studies seek to clarify and attitudes, as well as how they make
the information problem the resource is decisions or take actions. To ensure that
intended to solve. These studies take place developers have a clear model of how a
before the resource is designed–usually proposed information resource will fit with
in the setting where the resource is to be working practices and structures, they may
deployed, although simulated settings may also need to study health care or research
sometimes be used. Ideally, the potential processes, team functioning, or relevant
users of the resource will be studied while aspects of the larger organization in which
they work with real problems or cases, work is done (Wyatt et al. 2010).
to understand better how information is 2. Design validation studies focus on the
used and managed, and to identify the quality of the processes of information
causes and consequences of inadequate resource design and development, for
information flows. The investigator seeks example by asking experts to review these
to understand users’ skills, knowledge processes. The experts may review docu-
Evaluation of Biomedical and Health Information Resources
435 13
ments, interview the development team, ment and should improve its usability.
compare the suitability of the software Although usability testing can be per-
engineering methodology and program- formed by obtaining opinions of usability
ming tools used with others that are avail- experts who “test drive” the resource,
able, and generally apply their expertise usability can also be tested by deploying
to identify potential shortcomings in the the resource in a laboratory or classroom
approach used to develop the software, as setting, introducing users to it, and then
well as constructively to suggest how these allowing them either to navigate at will
shortcomings might be corrected. and provide unstructured comments or to
3. Structure validation studies address the attempt to complete some scripted tasks
static form of the software, usually after (see extensive material by Nielsen, 7 www.
a first prototype has been developed. useit.com). Data can be collected by the
This type of study is most usefully per- computer itself, from the user, by a live
formed by an expert or a team of experts observer, via audio or video capture of
with experience in developing software users’ actions and statements, or by spe-
for the problem domain and concerned cialized instrumentation such as eye-track-
users. For these purposes, the investiga- ing tools. Many software developers have
tors need access to both summary and usability testing labs equipped with sophis-
detailed documentation about the sys- ticated measurement systems, staffed by
tem architecture, the structure and func- experts in human computer interaction to
tion of each module, and the interfaces carry out these studies—an indication of
among them. The expert might focus on the importance increasingly attached to
the appropriateness of the algorithms this type of study (Zhang et al. 2003;
that have been employed and check that Saitwal et al. 2010).
they have been correctly implemented 5. Laboratory function studies go beyond
by examining the code and its docu- usability to explore more specific aspects
mentation. Experts might also exam- of the information resource, such as the
ine the data structures (e.g., whether quality of data captured, the speed of
they are appropriately normalized) and communication, the validity of the
knowledge bases (e.g., whether they are calculations carried out, or the
evidence-based, up to date, and modelled appropriateness of the results or advice
in a format that will support the intended given. These functions relate less to the
analyses or reasoning). Most of this will basic usability of the resource and more to
be done by inspection and discussion how the resource performs in relation to
with the development team. Sometimes what it is trying to achieve for the user or
specialized software may be used to test the organization. When carrying out any
the structure of the resource (Somerville kind of function testing, real or proxy
10th edition 2015). users are employed. The study results
Note that the study types listed up to depend crucially on what problems the
this point do not require a functioning infor- users are asked to solve, so the “tasks”
mation resource. However, beginning with employed in these studies (eg. case scenar-
usability testing below, the study types ios) should correspond as closely as possi-
require the existence of at least a function- ble to those to which the resource will be
ing prototype. applied in real working life. Such tasks very
4. Usability testing studies address whether across a set of dimensions—for example,
intended users can actually operate or difficulty, problem domain, and urgency—
navigate the software, to determine so it is very important to employ in these
whether the resource has the potential to studies a set of tasks that spans the range of
be helpful to them (see also 7 Chap. 5). In these dimensions.
this type of study, testing of a prototype 6. Field function studies are a variant of lab-
by typical users informs further develop- oratory function testing in which the
436 C. P. Friedman and J. C. Wyatt
Study type Study setting Version of the Sampled Sampled What is observed
resource users tasks
health care or decision environment, in a “labo- can sometimes be used as proxies for more
ratory” setting. Studies to elucidate the need for experienced physicians.) In other types of
a resource and studies of its impact on users studies, the users are sampled from the end-
would usually take place in settings—known users for whom the resource is ultimately
generically as the “field”—where health care designed. The type of users employed gives
practitioners, researchers, students, patients or shape to a study and can affect its results pro-
administrators are making real choices in the foundly. The usability of a resource is easily
real world. These studies can take place only in overestimated if the “users” in a study are
settings where the resource is available for use those who designed or are otherwise familiar
and where health care or health behavior activ- with the resource. As another example, volun-
ities occur and/or where other important deci- teer users of a consumer- oriented reso urce
sions are made. To an investigator planning such as a dieting app may be more motivated
such studies, an important consideration that than the general population the resource is
determines the kind of study possible is the designed to benefit.
degree of access to resource users in the field
setting. If, as a practical matter, access to the The sampled tasks For function, effect, and
field setting is very limited, then several study impact studies, the users included in the study
types listed in . Tables 13.1 and 13.2 are either actually interact with the resource. This requires
not possible, or the validity of the field studies tasks, often clinical or scientific decision prob-
that are possible will be reduced. lems, for the users to undertake. These tasks
can be invented or simulated; they can be
The version of the resource used For some abstracted versions of real cases or problems,
kinds of studies, a simulated or prototype ver- shortened to suit the specific purposes of the
sion of the resource may be sufficient (Scott study; or they can be live cases or research
et al. 2011; Russ et al. 2014), whereas for studies problems as they present to resource users in
in which the resource is employed by intended their everyday work. Clearly, the kinds of tasks
users to support real decisions and actions, a employed, and how they are sampled, in a
fully robust and reliable version is needed (e.g., study have serious implications for the study
Lester et al. 2010). results and the conclusions that can be drawn
13 from them.
The sampled resource users Information
resources nearly always function through The observations that are made All evalua-
interaction with one or more such “users” tion studies entail observations that generate
who bring to the interaction their own domain data that are subsequently analyzed to gener-
knowledge and knowledge of how to operate ate the study results. As seen in . Table 13.2,
the resource. Exceptions might include closed many different kinds of observations can be
loop control systems such as smart insulin made.
pumps and pharmacy robots, but even in these In the paragraphs above we have intro
cases, humans who set the parameters for the duced the term “sampled” for both tasks and
otherwise autonomous operations of these users. It is important to establish that in real
devices can be seen as a form of “users”. In evaluation studies, tasks and users are always
some types of evaluation studies, the users of sampled from some real or hypothetical pop-
the resource are not the end users for whom ulation. Choosing appropriate methods to
the resource is ultimately designed, but are sample users and tasks is a major challenge in
members of the development or evaluation evaluation study design since it is never pos-
teams, or other individuals who can be called sible, practical, or desirable to study everyone
“proxy users”, chosen because they are conve- doing everything possible with an information
niently available or because they are afford- resource. Sampling issues are addressed later
able. (For example, senior medical students in this chapter.
Evaluation of Biomedical and Health Information Resources
439 13
13.5 Conducting Investigations: sured, with all observations yielding the
Collecting and Drawing same result. It also assumes that an
investigator can measure these attributes
Conclusions from Data
without affecting how the resource under
study functions or is used.
13.5.1 wo Grand Approaches
T 55 Rational persons can and should agree
to Study Design, Data on what attributes of a resource are
Collection, and Analysis important to measure and what results
of these measurements would be
Several authors have developed classifica- identified as the most desirable, correct,
tions, or typologies, of evaluation methods or positive outcome. In informatics,
or approaches. Among the best is that devel- making this assertion is tantamount to
oped in 1980 by Ernest House (1980). Even stating that perfection in resource or
though it is somewhat old, a major advantage user performance can always be identified
of House’s typology is that each approach is and that all rational individuals can be
linked elegantly to an underlying philosophi- brought to consensus on what
cal model, as detailed in his book. This clas- “perfection” is.
sification divides current practice into eight 55 Because numerical measurement allows
discrete approaches, four of which may be precise statistical analysis of performance
viewed as objectivist and four of which may over time or performance in comparison
be viewed as subjectivist. While the distinc- with some alternative, numerical meas
tions between the eight approaches House urement is prima facie superior to a verbal
describes are beyond the scope of this chap- description. Verbal, descriptive data
ter, the grand distinction between objectivist (generally known as qualitative data) are
and subjectivist approaches is very important. thus useful in only preliminary studies to
Note that these approaches are not entitled identify hypotheses for subsequent, more
“objective” and “subjective”, because those precise analysis using quantitative methods.
labels carry strong and fundamentally mis- 55 Through these kinds of comparisons, it is
leading connotations of scientific precision in possible to demonstrate to a reasonable
the former case and of idiosyncratic impreci- degree that a resource is or is not superior
sion in the latter. We will see in this section to what it replaced, or to a competing
how both objectivist (often called quantita- resource.
tive) and subjectivist (often called qualitative)
approaches find rigorous application across Contrast these assumptions with the set of
the range of study types described earlier. assumptions that derives from an intuition-
To appreciate the fundamental differ- ist–pluralist or de-constructivist philosophi-
ence between the approaches, it is necessary cal position that spawns a set of subjectivist
to address their very different philosophical approaches to evaluation:
roots. The objectivist approaches derive from 55 What is observed about a resource
a logical–positivist philosophical orienta- invariably depends in fundamental ways
tion—the same orientation that underlies the on the observer. Different observers of the
classic experimental sciences. The major prem- same resource might legitimately come to
ises underlying the objectivist approaches are different conclusions. Both can be objective
as follows: in their appraisals even if they do not
55 In general, the attributes of interest are agree; it is not necessary that one is right
properties of the resource under study, or and the other wrong. Important insight
the people interacting with it. More can derive from both, and from their
specifically, this premise suggests that the juxtaposition.
merit and worth of an information 55 Merit and worth must be explored in
resource—the attributes of most interest context. The value of a resource emerges
in evaluation—can in principle be mea through study of the resource as it
440 C. P. Friedman and J. C. Wyatt
functions in a particular decision-making 1992; Sheikh et al. 2011; Wright et al. 2015)
environment. the importance and utility of these subjec-
55 Individuals and groups can legitimately tivist approaches in evaluation have been
hold different perspectives on what established within biomedical informatics.
constitutes the most desirable outcome of It is important for people trained in classic
introducing a resource into an environment. experimental methods at least to understand,
There is no reason to expect them to agree, and possibly even to embrace, the subjectivist
and it may be counterproductive to even worldview if they are to conduct fully infor-
try to lead them to consensus. An important mative evaluation studies.
aspect of an evaluation would be to
document the ways in which they disagree.
55 Verbal description can be highly illuminating. 13.5.2 Conduct of Objectivist
Qualitative data are valuable, in and of Studies
themselves, and can lead to conclusions as
convincing as those drawn from quantitative . Figure 15.2 expands the generic process for
data. The value of qualitative data, therefore, conducting evaluation studies to illustrate the
goes far beyond that of identifying issues for steps involved in conducting an objectivist study.
later more “precise” exploration using . Figure 13.2 illustrates the linear sequence in
quantitative methods. which the investigation portion of an objectivist
55 Evaluation should be viewed as an exercise evaluation study is typically carried out. We will
in argument or rhetoric, rather than as a focus in this chapter on issues of study design
demonstration, because every study can that are most challenging in objectivist stud-
“appear equivocal when subjected to ies and we will further focus on that subset of
serious scrutiny” (House 1980). objectivist study designs which are comparative
in nature. More details on the other aspects of
The approaches to evaluation that derive objectivist studies are available in standard ref-
from this subjectivist philosophical perspec- erences on experimental design (Campbell and
tive may seem strange, imprecise, and unsci- Stanley 1963; Rothman et al. 2008).
entific when considered for the first time.
13 This perception stems in large part from 13.5.2.1 Structure and Terminology
the widespread acceptance of the objectiv- of Comparative Studies
ist worldview in biomedicine. Over the last Most objectivist evaluations performed in
two decades, however, thanks to some early the world make a comparison of some type.
high quality studies (e.g., Forsythe et al. For informatics, aspects of performance of
Instrumentation
Analysis
Communicate
Contract Strakeholder Results
Decisions
Evaluation of Biomedical and Health Information Resources
441 13
individuals, groups, or organizations with the more dependent variables. In a typical study,
information resource are often compared to the dependent variables will be computed, for
those same aspects without the resource, with each participant, as an average over a number
some alternative resource, or an alternate of tasks. For example, clinicians’ diagnostic
design of the same resource. After identify- performance may be measured over a set of
ing a sample of participants for the study, the cases, or “tasks”, that provide a range of diag-
investigator assigns each participant, often nostic challenges.
randomly, to one or a set of conditions and The independent variables are included
some outcomes of interest are measured for in a study to explain the measured values
each participant. The averaged values of these of the dependent variables. For example,
outcomes are then compared across the con- whether an information resource is avail-
ditions. If all other factors are controlled, able, or not, to support certain clinical tasks
either directly through the design of the study could be the major independent variable in a
or statistically through randomization, then study designed to evaluate the effects of that
any measured difference in the averaged out- resource.
comes can be attributed to the resource. Measurement challenges almost always
This relatively simple description of a arise in the assessment of the outcome or
comparative study belies the many issues that dependent variable for a study. Often, for
affect their design, execution, and ultimate example, the dependent variable is some type
usefulness. To understand these issues, we of performance measure that invokes con-
must first develop a precise terminology. cerns about reliability (precision) and validity
The participants in a study are the enti- (accuracy) of measurement. The indepen-
ties about which/whom data are collected. It dent variables may also raise measurement
is key to emphasize that participants are often challenges. When the independent variable is
people—for example, care providers or recipi- marital status, for example, the measurement
ents—but also may be information resources, problems are relatively straightforward. If the
groups of people, or organizations. Because independent variable is an attitude or other
many of the activities in informatics are con- “state of mind”, such as computer or health
ducted in hierarchical settings with naturally literacy, profound measurement challenges
occurring groups (a “physician’s patients”; can arise.
the “researchers in a laboratory”), investiga-
tors must, for a particular study, define the 13.5.2.2 Issues of Measurement
participants carefully and consistently. Measurement is the process of assigning a
Variables are specific characteristics of value corresponding to the presence, absence,
the participants or the setting that either are or degree of a specific attribute in a specific
measured purposefully by the investigator or object. When we speak specifically of measure-
are self-evident properties that do not require ment, it is customary to use the term “object”
measurement. Some variables may take a con- to refer to the entity on which measurements
tinuous range of values while others have a are made. Measurement usually results in
discrete set of levels, corresponding to each either (1) the assignment of a numerical score
of the possible measured values. For example, representing the extent to which the attribute
in a hospital setting, physician members of a of interest is present in the object, or (2) the
ward team can be classified as residents, fel- assignment of an object to a specific category.
lows, or attending physicians. In this case, the Taking and recording the temperature (attri-
variable “physician’s level of qualification” is bute) of a person (object) is an example of the
said to have three discrete “levels”. process of measurement.
The dependent variables are those vari- From the premises underlying objectiv-
ables in the study that capture the outcomes ist studies (see 7 Sect. 13.5.1), it follows that
of interest to the investigator. (For this rea- proper execution of such studies requires
son, dependent variables are also called out- careful and specific attention to methods of
come variables.) A study may have one or measurement. It can never be assumed that
442 C. P. Friedman and J. C. Wyatt
5 See 7 https://www.nngroup.com/articles/why-you-
only-need-to-test-with-5-users/ (Accessed 11.20.18).
444 C. P. Friedman and J. C. Wyatt
Immersion
Complete Select Analysis
&
Negotiation Study Types
Identify
Questions
Reflection/
Reorganization
Communicate
Contract
Results
Strakeholder
Decisions
data or insights resulting from its analysis. the context of what is already known.
Backtracking is a legitimate step in this model. Analysis and reflection entail the contem-
1. Negotiation of the ground rules of the study: plation of the new findings during each
The understanding between the study cycle of the loop. Member checking is the
team and the persons commissioning a sharing of the investigator’s emerging
study should embrace the general aims of thoughts and beliefs with the participants
the study; the kinds of methods to be used; themselves. Reorganization results in a
access to various sources of information, revised agenda for data collection in the
including health care providers, patients, next cycle of the loop.
and various documents; and the format Although each cycle within the itera-
for interim and final reports. The aims of tive loop is depicted as unidirectional, this
the study may be formulated in a set of ini- representation is misleading. Net progress
tial orienting questions. Ideally, this under- through the loop is clockwise, as shown in
standing will be expressed in a . Fig. 13.3, but backward steps are natu-
memorandum of understanding, analo- ral and inevitable. They are not reflective
gous to a contract. of mistakes or errors. An investigator may,
2. Immersion into the environment: At this after conducting a series of interviews and
stage, the investigators begin spending studying what participants have said, decide
time in the work environment. Their activ- to speak again with multiple participants to
ities range from formal introductions to clarify their positions on a particular issue.
informal conversations, or to silent pres- 4. Communicate results: Subjectivist studies
ence at meetings and other events. tend to have a multi-staged reporting and
Investigators use the generic term field to communication process. The first draft of
refer to the setting, which may be multiple the study report should itself be viewed as
physical locations, where the work under a research instrument. By sharing this
study is carried out. Trust and openness report with a variety of individuals, the
between the investigators and the people in investigator obtains a major check on the
the field are essential elements of subjec- validity of the findings. Typically, reac-
tivist studies to ensure full and candid tions to the preliminary report will gener-
13 exchange of information. ate useful clarifications and a general
Even as immersion is taking place, sharpening of the study findings. Because
the investigator is already collecting data the report usually includes a prose narra-
to sharpen the initial questions or issues tive, it is vitally important that it be well
guiding the study. Early discussions with written in language understandable by all
people in the field, and other activities pri- intended audiences. Circulation of the
marily targeted toward immersion, inevita- report in draft, for comments by the
bly begin to shape the investigators’ views. intended recipients, can ensure that the
Almost from the outset, the investigator is final document communicates as
typically addressing several aspects of the intended. Use of anonymous quotations
study simultaneously. from interviews and documents makes a
3. Iterative loop: At this point, the procedural report highly vivid and meaningful to
structure of the study becomes akin to an readers.
iterative loop, as the investigator engages The final report, once completed,
in cycles of data collection, analysis and should be distributed as negotiated in the
reflection, “member checking”, and reor- original memorandum of understanding.
ganization. Data collection involves inter- Distribution is often accompanied by
view, observation, document analysis, and “meet the investigator” sessions that allow
other methods. Data are collected on interested persons to ask the author of the
planned occasions, as well as serendipi- report to expand or explain what has been
tously and spontaneously. The data are written.
recorded carefully and are interpreted in
Evaluation of Biomedical and Health Information Resources
451 13
13.5.3.4 Subjectivist Data-Collection are no predetermined questions. Between
and Data-Analysis Methods the extremes is the semi structured interview,
What data-collection strategies are in the where the investigator specifies in advance a
subjectivist researcher’s tool kit? There are set of topics that he/she would like to address-
several, and they are typically used in combi- -but is flexible as to the order in which these
nation. We shall discuss each one, assuming a topics are addressed, and is open to discus-
typical setting for a subjectivist study in bio- sion of topics not on the pre-specified list. At
medical informatics: the introduction of an the other extreme is the structured interview,
information resource into patient care activi- with a schedule of questions that are always
ties in a hospital. presented in the same words and in the same
order. In general, the unstructured and semi
zz Observation structured interviews are preferred in subjec-
The investigators typically immerse them tivist research. Informal interviews—sponta-
selves into the setting under study in one of neous discussions between the investigators
two ways. The investigator may act purely and members of a team that occur during
as a detached observer, becoming a trusted routine observation—are also part of the data
and unobtrusive feature of the environment collection process. Informal interviews are
but not a participant in the day-to-day work invariably considered a source of important
and thus reliant on multiple “informants” as data. Group interviews, akin to focus groups,
sources of information. True to the natural- may also be employed (e.g., Haddow et al.
istic feature of this kind of study, great care 2011). Group interviews are very efficient
is taken to diminish the possibility that the ways to reach large numbers of participants,
presence of the observer will skew any work but investigators should not assume that indi-
activities or that the observer will be rejected vidual participants will express in a group set-
outright by the team. An alternative approach ting the same sentiments they will express if
is participant observation, where the investi- interviewed one-on-one.
gator becomes a member of the work team. Sampling also enters into the interview
Participant observation is more difficult to process. There are usually more participants
engineer; it may require the investigator to to interview than resources to conduct them.
have specialized training in the study domain. Unlike in objectivist studies, where random
It is time consuming but can give the investiga- sampling is a form of gold standard to inform
tor a more vivid impression of life in the work statistical attributions of effects, subjectiv-
environment. During both kinds of observa- ist studies employ more purposeful sampling
tion, data accrue continuously. These data are strategies. Investigators might actively seek
qualitative and may be of several varieties: interviewees they suspect to have unique or
statements by health care providers, patients, particularly insightful or influential opinions.
family members, administrative staff, and oth- They might remain in more frequent contact
ers; gestures and other nonverbal expressions with key informants who, for various reasons,
of these same individuals; and characteristics have the most insight into what is happening.
of the physical setting that seem to affect the
delivery of health care. zz Document and Artifact Analysis
Every project produces a trail of papers and
zz Interviews other artifacts. These include patient charts,
Subjectivist studies rely heavily on interviews. the various versions of an information
Formal interviews are occasions where both resource and its documentation, memoranda
the investigator and interviewee are aware that prepared by the project team, perhaps a car-
the answers to questions are being recorded toon hung on an office door. Unlike the day-
(on paper or digitally) for direct contribution to-day events of health care, these artifacts
to the evaluation study. Formal interviews vary do not change once created or introduced.
in their degree of structure. At one extreme With appropriate permissions negotiated in
is the unstructured interview, where there advance, they can be examined retrospectively
452 C. P. Friedman and J. C. Wyatt
and referred to repeatedly, as necessary, over hand-held devices are changing the way sub-
the course of a study. Also included under jectivist research is carried out.
this heading are unobtrusive measures, which The subjectivist analysis process is fluid,
are the records accrued as part of the routine with analytic goals shifting as the study
use of the information resource. They include, matures. At an early stage, the goal is primar-
for example, user log files of an information ily to focus the questions that themselves will
resource. Data from these measures are often be the targets of further data elicitation. At
quantifiable and analyzed quantitatively even the later stages of study, the primary goal
though the overall study design is qualitative is to organize data that address these ques-
in nature. tions into specific themes, interpretations, and
explanations. Conclusions derive credibility
zz Anything Else That Seems Useful from a process of “triangulation”, which is the
Subjectivist investigators are supreme oppor- degree to which information from different
tunists. As questions of importance to a study independent sources generate the same theme
emerge, the investigators will collect any infor- or point to the same conclusion. Subjectivist
mation that they perceive as bearing on these analysis also employs a strategy known as
questions. This data collection could include “member checking” whereby investigators
clinical chart reviews, questionnaires, tests, take preliminary conclusions back to the per-
simulated patients, and other methods more sons in the setting under study, asking if these
commonly associated with the objectivist conclusions make sense, and if not, why not.
approaches. In subjectivist investigation, unlike objectivist
When to end data collection is another studies, the agenda is never completely closed.
challenge in otherwise open-ended subjectiv- The investigator is constantly on the alert for
ist studies. “Saturation” is important princi- new information that can require a significant
ple to help investigators know when to stop. reorganization of the findings and conclu-
Stated simply, a data collection process is sat- sions that have been drawn to date.
urated when it becomes evident that, as more
data are collected, no new findings or insights
are emerging. 13.6 Communicating Evaluation
13 Results
zz Analysis of Subjectivist Data
There are many alternative procedures for Once any study, qualitative or quantitative,
analysis of qualitative data. In general terms, is complete, the results need to be commu-
the investigator looks for insights, themes or nicated to the stakeholders and others who
trends emerging from several different sources. might be interested. In many ways, commu-
He/she collates individual statements and nication of evaluation results, a term we pre-
observations by theme, as well as by source. fer over “reporting”, is the most challenging
Investigators typically use software especially aspect of evaluation. Elementary theory tells
designed to facilitate analysis of qualitative us that, in general, successful communication
data.6 Because they allow electronic record- requires a sender, one or more recipients, and
ing of the data while the investigator is “in the a channel linking them, along with a mes-
field”, tablets, smartphone Apps and other sage that travels along this channel (Ong and
Coiera 2011).
Seen from this perspective, successful
communication of evaluation results is chal-
lenging in several respects. It requires that
the recipient of the message actually receive
it. That is, for evaluations, the recipient must
6 Examples include: Atlas.ti: 7 https://atlasti.com/
(Accessed November 18, 2019) and NVivo 7 https:// read the written report or attend the meet-
www.qsrinternational.com/nvivo/home (Accessed ing intended to convey evaluation results.
November 18, 2019).
Evaluation of Biomedical and Health Information Resources
453 13
For this reason, the investigator is invariably A written, textual report is not the sole
challenged to create a report the stakeholders medium for communicating evaluation results.
will want to read or to choreograph a meeting Verbal, graphical, or multimedia approaches
they will be motivated to attend. Successful can be helpful as ways to enhance communi-
communication also requires that the recipi- cation with specific audiences. Another useful
ent understand the message, which challenges strategy is to hold a “town meeting” to discuss
investigators to draft written documents at the a traditional written report after it has been
right reading level, with audience-appropriate released. Photographs or videos can portray
technical detail. Sometimes there must be sev- the work setting for a study, the people in the
eral different forms of the written report to setting, and the people using the resource. If
match several different audiences. Overall, we appropriate permissions are obtained, these
encourage investigators to recognize that their images—whether included as part of a writ-
obligation to communicate does not end with ten report, shown at a town meeting, or placed
the submission of a written document com- on a Web site—can be worth many thousands
prising their technical evaluation report. The of words. The same may be true for recorded
report is one channel for communication, not statements of resource users. If made avail-
an end in itself. able, with permission, as part of a multime-
Depending on the nature, number, and dia report, the voices of the participants can
location of the recipients—and permissions convey a feeling behind the words that can
which have been obtained or written into enhance the credibility of the investigator’s
evaluation agreements--many options exist conclusions (. Fig. 13.4).
for communicating the results of a study, In addition to the varying formats for
including: communication described above, investigators
55 Written reports have other decisions to make after the data
–– Document(s) prepared for specific collection and analysis phases of a study are
audience(s) complete. One key decision is what personal
–– Internal newsletter article role they will adopt after the formal investiga-
–– Published journal article, with appro- tive aspects of the work are complete. They
priate permissions may elect only to communicate the results, but
–– Monograph, picture album, or book they may also choose to persuade stakehold-
55 One-to-one or small group meetings ers to take specific actions in response to the
–– With stakeholders or specific stake study results, and perhaps even assist in the
holder groups implementation of these actions. This raises a
–– With the general public, if appropriate key question: Is the role of an evaluator sim-
55 Formal oral presentations ply to record and communicate study findings
–– To groups of project stakeholders and then to move on to the next study, or is
–– Conference presentation with a poster it also to engage with the study stakeholders
or published paper in proceedings and help them change how they work as a
–– To external meetings or seminars result of the study?
55 Internet To answer this question about the role of
–– Project Web site or blog an evaluator, we need to understand that an
–– Web “chat”, forum or Twitter feed to evaluation study, particularly a successful one,
socialize results has the potential to trigger a series of events,
–– Online preprint starting with the communication of study
–– Internet based journal results, but then including interpretation,
55 Other recommendation, and even implementation.
–– Video or podcast describing the study Some evaluators—perhaps enthused by the
and information resource clarity of their results and an opportunity to
–– Interview with a journalist on news use them to improve health care, biomedical
paper, TV, radio research, or education—prefer to go beyond
454 C. P. Friedman and J. C. Wyatt
Field User Effect Study Electronic health Amsterdam: IOS Press. This work includes an
records and health care quality over time in a fed- extensive exploration of evaluation methods
erally qualified health center. pertinent to health informatics.
Kern et al. (2015) conducted a three-year Anderson, J. G., & Aydin, C. E. (2005). Evaluating
comparative study across six sites of a fed- the organizational impact of health care infor-
erally qualified health center in New York mation systems. New York: Springer. This is
to analyze the association between post- an excellent edited volume that covers a wide
implementation of an electronic health record range of methodological and substantive
(EHR) and quality of care delivery as mea- approaches to evaluation in informatics.
sured by change in compliance with Stage 1 Brender, J. (2006). Handbook for evaluation for
Meaningful Use quality measures. This study health informatics. Burlington: Elsevier
is an exemplary field user effect study for its Academic Press. Along with the Friedman and
attention to measures of clinician behav- Wyatt text cited below, one of few textbooks
ior in care delivery through test/screening available that focuses on evaluation in health
ordering using the EHR and explicit use of informatics.
statistical analysis techniques to account for Cohen, P. R. (1995). Empirical methods for artifi-
repeated measures on patients over time. The cial intelligence. Cambridge, MA: MIT Press.
study also includes two problem impact met- This is a nicely written, detailed book that is
rics (change in HbA1c and LDL cholesterol) focused on evaluation of artificial intelligence
analyzed over the study period; however, the applications, not necessarily those operating
study intent was primarily focused on clini- in medical domains. It emphasizes objectivist
cian ordering behavior. methods and could serve as a basic statistics
course for computer science students.
Problem Impact Study Effects of a mobile Fink, A. (2004). Evaluation fundamentals: Insights
phone short message service on antiretroviral into the outcomes, effectiveness, and quality of
treatment adherence in Kenya (WelTel Kenya1): health programs (2nd ed.). Thousand Oaks:
A randomised trial. Sage Publications. A popular text that dis-
Lester et al. (2010) is an exemplar for cusses evaluation in the general domain of
problem impact studies. Authors conducted health.
13 a randomized controlled trial to measure Friedman, C. P., & Wyatt, J. C. (2006). Evaluation
improvement in patient adherence to anti- methods in biomedical informatics. New York:
retroviral therapy (ART) and suppression of Springer. This is the book on which the cur-
viral load following receipt of mobile phone rent chapter is based. It offers expanded dis-
communications with health care workers. The cussion of almost all issues and concepts
study randomized patients to the intervention raised in the current chapter.
group (receiving mobile phone messages from Jain, R. (1991). The art of computer systems per-
healthcare workers) or to the control group formance analysis: Techniques for experimental
(standard care). Outcomes were clearly identi- design, measurement, simulation, and model-
fied and focused on behavioral effects (drug ling. New York: Wiley. This work offers a tech-
adherence) and an overall intent to measure nical discussion of a range of objectivist
the extent that improvements in adherence methods used to study computer systems. The
influenced patient health status (viral load). scope is broader than Cohen’s book (1995)
The special attention to randomization and described earlier. It contains many case stud-
use of effect size metrics for analysis are ies and examples and assumes knowledge of
critical components to measuring the overall basic statistics.
impact of mobile phone communications on Lincoln, Y. S., & Guba, E. G. (1985). Naturalistic
patient health. inquiry. Thousand Oaks: Sage Publications.
This is a classic book on subjectivist methods.
nnSuggested Reading The work is very rigorous but also very easy to
Ammenwerth, E., & Rigby, M. (Eds.). (2016). read. Because it does not focus on medical
Evidence-
based health informatics.
Evaluation of Biomedical and Health Information Resources
461 13
domains or information systems, readers must pared. A panel of intensive care
make their own extrapolations. experts is asked to identify, inde-
Rossi, P. H., Lipsey, M. W., & Freeman, H. E. pendently, episodes of hypotension
(2004). Evaluation: A systematic approach (7th from each data set.
ed.). Thousand Oaks: Sage Publications. This (f) A biomedical informatics professor
is a valuable textbook on evaluation, empha- is invited to join the steering group
sizing objectivist methods, and is very well for a series of apps to support peo-
written. It is generic in scope, and the reader ple living with diabetes. The only
must relate the content to biomedical infor- documentation available to critique
matics. There are several excellent chapters at the first meeting is a statement of
addressing pragmatic issues of evaluation. the project goal, description of the
These nicely complement the chapters on sta- planned development method, and
tistics and formal study designs. the advertisements and job descrip-
tions for team members.
??Questions for Discussion (g) Developers invite educationalists to
1. Associate each of the following hypo- test a prototype of a computer-aided
thetical evaluation scenarios with one learning system as part of a user-
or more of the nine types of studies centered design workshop
listed in . Table 13.1. Note that some (h) A program is devised that generates
scenarios may include more than one a predicted 24-h blood glucose pro-
type of study. file using seven clinical parameters.
(a) An order communication system Another program uses this profile and
is implemented in a small hospital. other patient data to advise on insu-
Changes in laboratory workload lin dosages. Diabetologists are asked
are assessed. to prescribe insulin for a series of
(b) The developers of the order commu- “paper patients” given the 24-h profile
nication system recruit five potential alone, and then again after seeing the
users to help them assess how read- computer-generated advice. They are
ily each of the main functions can be also asked their opinion of the advice.
accessed from the opening screen and (i) A program to generate alerts to pre-
how long it takes users to complete vent drug interactions is installed in
them. a geriatric clinic that already has a
(c) A study team performs a thorough computer-based medical record
analysis of the information required system. Rates of clinically signifi-
by psychiatrists to whom patients cant drug interactions are com-
are referred by a community social pared before and after installation
worker. of the alerting program.
(d) A biomedical informatics expert is
2. Choose any alternative area of bio-
asked for her opinion about a PhD
medicine (e.g., drug trials) as a point of
project on a new bioinformatics
comparison, and list at least four fac-
algorithm. She requests copies of
tors that make studies in biomedical
the student’s code and documenta-
informatics more difficult to conduct
tion for review.
successfully than in that area. Given
(e) A new intensive care unit system
these difficulties, discuss whether it
is implemented alongside manual
is worthwhile to conduct empirical
paper charting for a month. At the
studies in biomedical informatics or
end of this time, the quality of the
whether we should use intuition or the
computer- derived data and data
recorded on the paper charts is com-
462 C. P. Friedman and J. C. Wyatt
marketplace as the primary indicators Brender, J. (2005). Handbook of evaluation methods for
of the value of an information resource. health informatics. Burlington: Academic Press.
Campbell, D. T., & Stanley, J. C. (1963). Experimental
3. Assume that you run a philanthropic
and quasi experimental designs for research. Boston:
organization that supports biomedi- Houghton Mifflin, reprinted often since.
cal informatics. In investing the scarce Campbell, M., Fitzpatrick, R., Haines, A., Kinmonth,
resources of your organization, you have A. L., Sandercock, P., Spiegelhalter, D., & Tyrer,
to choose between funding a new system P. (2000). Framework for design and evaluation
of complex interventions to improve health. BMJ,
or resource development, or funding
321(7262), 694–696.
empirical studies of resources already Friedman, C. P., Abbas, U. L. (2003), Is medical infor-
developed. What would you choose? matics a mature science? A review of measurement
How would you justify your decision? practice in outcome studies of clinical systems,
4. To what extent is it possible to be cer- International Journal of Medical Informatics, 69;
(2–3), Pages 261–272, ISSN 1386-5056, https://doi.
tain how effective a medical informatics
org/10.1016/S1386-5056(02)00109-0.
resource really is? What are the most Davey Smith, D. (2007). Capitalizing on Mendelian
important criteria of effectiveness? randomization to assess the effects of treatments.
5. Do you believe that independent, unbi- Journal of the Royal Society of Medicine, 100(9),
ased observers of the same behavior or 432–435. https://www.ncbi.nlm.nih.gov/pmc/arti-
cles/PMC1963388/.
outcome should agree on the quality of
Demiris, G., Speedie, S., & Finkelstein, S. (2000). A ques-
that outcome? tionnaire for the assessment of patients’ impressions
6. Many of the evaluation approaches of the risks and benefits of home telecare. Journal of
assert that a single unbiased observer Telemedicine and Telecare, 6(5), 278–284.
is a legitimate source of information Elizabeth Murray, Eric B. Hekler, Gerhard Andersson,
Linda M. Collins, Aiden Doherty, Chris Hollis,
in an evaluation, even if that observ-
Daniel E. Rivera, Robert West, Jeremy C. Wyatt,
er’s data or judgments are unsubstan- (2016) Evaluating Digital Health Interventions.
tiated by other people. Give examples American Journal of Preventive Medicine 51
drawn from our society where we vest (5):843–851.
important decisions in a single expe- Eminovic, N., Wyatt, J. C., Tarpey, A. M., Murray, G.,
& Ingrams, G. J. (2004, June 02). First evaluation
rienced and presumed impartial indi-
of the NHS direct online clinical enquiry service:
vidual. A nurse-led Web chat triage service for the public.
13 7. Do you agree with the statement that all
evaluations appear equivocal when sub-
Journal of Medical Internet Research, 6(2), E17.
European Union Medical Devices Regulatory
jected to serious scrutiny? Explain your Framework. (2018). https://ec.europa.eu/growth/
sectors/medical-devices/regulatory-framework_en.
answer.
Accessed 24 Oct 2018.
Forsythe, D. E. (1992). Using ethnography to build a
working system: Rethinking basic design assump-
tions. In Proceedings annual symposium computer
References applications in medical care (pp. 505–509).
Forsythe, D. E., Buchanan, B. G., Osheroff, J. A., &
Ammenwerth, E. (2015). Evidence-based health infor- Miller, R. A. (1992). Expanding the concept of med-
matics: How do we know what we know? Methods of ical information: An observational study of physi-
Information in Medicine, 54(04), 298–307. cians’ information needs. Computers and Biomedical
Anderson, J. G., Aydin, C. E., & Jay, S. J. (Eds.). Research, 25, 181–200.
(1994). Evaluating health care information systems. Fox, J. (1993). Decision support systems as safety-
Thousand Oaks: Sage Publications Inc. critical components: Towards a safety culture for
Beaudoin, M., Kabanza, F., Nault, V., & Valiquette, L. medical informatics. Methods of Information in
(2016). Evaluation of a machine learning capabil- Medicine, 32, 345–348.
ity for a clinical decision support system to enhance Friedman, C. P., & Wyatt, J. C. (2005). Evaluation
antimicrobial stewardship programs. Artificial methods in biomedical informatics (2nd ed., p. 386).
Intelligence in Medicine, 68, 29–36. New York: Springer-Publishing. ISBN 0-387-
Black, A. D., Car, J., Pagliari, C., Anandan, C., Cresswell, 25889-2.
K., Bokun, T., et al. (2011). The impact of eHealth Gaschnig, J., Klahr, P., Pople, H., Shortliffe, E., & Terry,
on the quality and safety of health care: A system- A. (1983). Evaluation of expert systems: Issues and
atic overview. PLoS Medicine, 8(1), e1000387. case studies. In F. Hayes-Roth, D. A. Waterman, &
Evaluation of Biomedical and Health Information Resources
463 13
D. Lenat (Eds.), Building expert systems. Reading: Lunenburg, F. C. (2010). Managing change: The role
Addison Wesley. of the change agent. International Journal of
Goddard, K., Roudsari, A., & Wyatt, J. C. (2012). Management, Business and Administration, 13(1),
Automation bias: A systematic review of frequency, 1–6.
effect mediators, and mitigators. Journal of the Mant, J., & Hicks, N. (1995). Detecting differences in
American Medical Informatics Association: JAMIA, quality of care: The sensitivity of measures of
19, 121–127. process and outcome in treating acute myocardial
Gray, E., Marti, J., Brewster, D. H., Wyatt, J. C., Piaguet- infarction. BMJ, 311, 793–796.
Rossel R., & Hall P. S. (2019). Feasibility and results McDonald, C. J., Hui, S. L., Smith, D. M., Tierney,
of four real-world evidence methods for estimating W. M., Cohen, S. J., Weinberger, M., & McCabe,
the effectiveness of adjuvant chemotherapy in early G. P. (1984). Reminders to physicians from an intro-
stage breast cancer. J Clin Epidemiol. spective computer medical record: A two-year ran-
Haddow, G., Bruce, A., Sathanandam, S., & Wyatt, domized trial. Annals of Internal Medicine, 100(1),
J. C. (2011). ‘Nothing is really safe’: A focus 130–138.
group study on the processes of anonymizing McMurry, T. L., Hu, Y., Blackstone, E. H., & Kozower,
and sharing of health data for research purposes. B. D. (2015, July). Propensity scores: Methods,
Journal of Evaluation in Clinical Practice, 17, considerations, and applications in the Journal of
1140–1146. Thoracic and Cardiovascular Surgery. The Journal
Herasevich, V., & Pickering, B. W. (2017). Health infor- of Thoracic and Cardiovascular Surgery, 150(1),
mation technology evaluation handbook: From mean- 14–19. https://doi.org/10.1016/j.jtcvs.2015.03.057.
ingful use to meaningful outcome. Boca Raton: CRC Epub 2015 Apr 2.
Press. Michaelis, J., Wellek, S., & Willems, J. L. (1990).
House, E. (1980). Evaluating with validity. San Reference standards for software evaluation.
Francisco: Sage. Methods of Information in Medicine, 29, 289–297.
Kern, L. M., Edwards, A. M., Pichardo, M., & Kaushal, Murray, M. D., Harris, L. E., Overhage, J. M., Zhou,
R. (2015). Electronic health records and health care X. H., Eckert, G. J., Smith, F. E., Buchanan, N. N.,
quality over time in a federally qualified health cen- Wolinsky, F. D., McDonald, C. J., & Tierney, W. M.
ter. Journal of the American Medical Informatics (2004, March). Failure of computerized treatment
Association, 22(2), 453–458. suggestions to improve health outcomes of outpa-
Koppel, R., Metlay, J. P., Cohen, A., Abaluck, B., tients with uncomplicated hypertension: Results of
Localio, A. R., Kimmel, S. E., & Strom, B. L. a randomized controlled trial. Pharmacotherapy,
(2005). Role of computerized physician order entry 24(3), 324–337.
systems in facilitating medication errors. JAMA: Nielsen, J. (1994). Usability inspection methods. Paper
The Journal of the American Medical Association, presented at the conference companion on human
293(10), 1197–1203. factors in computing systems, Boston.
Lester, R. T., Ritvo, P., Mills, E. J., Kariri, A., Karanja, Office of the National Coordinator for Health
S., Chung, M. H., Jack, W., Habyarimana, J., Information Technology (ONC) website. (2014).
Sadatsafavi, M., Najafzadeh, M., Marra, C. A., FDASIA Committee Report. https://www.healthit.
Estambale, B., Ngugi, E., Ball, T. B., Thabane, L., gov/sites/default/files/fdasiahealthitreport_final.
Gelmon, L. J., Kimani, J., Ackers, M., & Plummer, pdf. Accessed 25 Oct 2018.
F. A. (2010). Effects of a mobile phone short mes- Ong, M. S., & Coiera, E. (2011, June). A systematic
sage service on antiretroviral treatment adherence review of failures in handoff communication during
in Kenya (WelTel Kenya1): A randomised trial. The intrahospital transfers. Joint Commission Journal on
Lancet, 376(9755), 1838–1845. Quality and Patient Safety, 37(6), 274–284.
Littlejohns, P., Wyatt, J. C., & Garvican, L. (2003, Pinsky, P. F., Miller, A., Kramer, B. S., Church, T.,
April 19). Evaluating computerised health informa- Reding, D., Prorok, P., Gelmann, E., Schoen,
tion systems: Hard lessons still to be learnt. BMJ, R. E., Buys, S., Hayes, R. B., & Berg, C. D. (2007,
326(7394), 860–863. April 15). Evidence of a healthy volunteer effect in
Liu, J. L. Y., & Wyatt, J. C. (2011). The case for random- the prostate, lung, colorectal, and ovarian cancer
ized controlled trials to assess the impact of clini- screening trial. American Journal of Epidemiology,
cal information systems. Journal of the American 165(8), 874–881.
Medical Informatics Association: JAMIA, 18(2), Pope, C., Halford, S., Turnbull, J., Prichard, J., Calestani,
173–180. M., & May, C. (2013). Using computer decision sup-
Liu, Y. I., Kamaya, A., et al. (2011). A Bayesian net- port systems in NHS emergency and urgent care:
work for differentiating benign from malignant thy- Ethnographic study using normalisation process
roid nodules using sonographic and demographic theory. BMC Health Services Research, 13(1), 111.
features. AJR: American Journal of Roentgenology, Ramnarayan, P., Kapoor, R. R., Coren, M., Nanduri,
196(5), W598–W605. V., Tomlinson, A. L., Taylor, P. M., Wyatt, J. C.,
Lundsgaarde, H. P. (1987). Evaluating medical expert & Britto, J. F. (2003, November–December).
systems. Social Science & Medicine, 24, 805–819. Measuring the impact of diagnostic decision
support on the quality of clinical decision mak-
464 C. P. Friedman and J. C. Wyatt
ing: Development of a reliable and valid composite The Journal of Machine Learning Research, 15(1),
score. Journal of the American Medical Informatics 1929–1958.
Association: JAMIA, 10(6), 563–572. Stead, W., Haynes, R. B., Fuller, S., et al. (1994).
Ravi, D., Wong, C., Deligianni, F., Berthelot, M., Designing medical informatics research and library
Andreu-Perez, J., Lo, B., & Yang, G. Z. (2017). projects to increase what is learned. Journal of
Deep learning for health informatics. IEEE Journal theAmerican Medical Informatics Association, 1,
of Biomedical and Health Informatics, 21(1), 4–21. 28–34.
Rigby, M., Forsström, J., Ruth, R., & Wyatt, J. (2001). Streeter, A. J., Lin, N. X., Crathorne, L., Haasova, M.,
Verifying quality and safety in health informatics Hyde, C., Melzer, D., & Henley, W. E. (2017, July).
services. BMJ, 323, 552–556. Adjusting for unmeasured confounding in nonran-
Rothman, K. J., Greenland, S., & Lash, T. L. (2008). domized longitudinal studies: A methodological
Modern epidemiology (3rd ed.). Philadephia: review. Journal of Clinical Epidemiology, 87, 23–34.
Lippincott Williams & Wilkins. https://doi.org/10.1016/j.jclinepi.2017.04.022. Epub
Russ, A. L., Zillich, A. J., Melton, B. L., Russell, S. A., 2017 Apr 28.
Chen, S., Spina, J. R., et al. (2014). Applying human Szczepura, A., & Kankaanpaa, J. (1996). Assessment of
factors principles to alert design increases effi- health care technologies. London: Wiley.
ciency and reduces prescribing errors in a scenario- Talmon, J., Ammenwerth, E., Brender, J., de Keizer, N.,
based simulation. Journal of the American Medical Nykänen, P., & Rigby, M. (2009). STARE-HI—
Informatics Association, 21(e2), e287–e296. statement on reporting of evaluation studies in
Saitwal, H., Feng, X., Walji, M., Patel, V., & Zhang, health informatics. International Journal of Medical
J. (2010). Assessing performance of an electronic Informatics, 7, 1–9.
health record (EHR) using cognitive task analysis. van Gennip, E. M., & Talmon, J. L. (Eds.). (1995).
International Journal of Medical Informatics, 79(7), Assessment and evaluation of information technolo-
501–506. gies in medicine. Amsterdam: IOS Press.
Scott, G. P., Shah, P., Wyatt, J. C., Makubate, B., & Van Way, C. W., Murphy, J. R., Dunn, E. L., & Elerding,
Cross, F. W. (2011, August 11). Making electronic S. C. (1982). A feasibility study of computer-aided
prescribing alerts more effective: Scenario-based diagnosis in appendicitis. Surgery, Gynecology &
experimental study in junior doctors. Journal of the Obstetrics, 155, 685–688.
American Medical Informatics Association: JAMIA, Ventres, W., Kooienga, S., Vuckovic, N., Marlin,
18(6), 789–798. R., Nygren, P., & Stewart, V. (2006) The Annals
Scott, P. J., Brown, A. W., Adedeji, T., Wyatt, J. C., of Family Medicine, 4(2) 124-131; https://doi.
Georgiou, A., Eisenstein, E. L., & Friedman, C. P. org/10.1016/10.1370/afm.425.
(2019). A review of measurement practice in stud- Wasson, J. H., Sox, H. C., Neff, R. K., & Goldman,
ies of clinical decision support systems 1998–2017. L. (1985). Clinical prediction rules: Applications
Journal of the American Medical Informatics and methodological standards. The New England
13 Association, 26(10), 1120–1128.
Sheikh, A., Cornford, T., Barber, N., Avery, A., Takian,
Journal of Medicine, 313, 793–799.
Wolf, J. A., Moreau, J. F., Akilov, O., Patton, T., English,
A., Lichtner, V., et al. (2011). Implementation and J. C., Ho, J., & Ferris, L. K. (2013). Diagnostic inac-
adoption of nationwide electronic health records in curacy of smartphone applications for melanoma
secondary care in England: Final qualitative results detection. JAMA Dermatology, 149(4), 422–426.
from prospective national evaluation in “early Wright, A., Sittig, D. F., Ash, J. S., Erickson, J. L.,
adopter” hospitals. BMJ, 343, d6054. Hickman, T. T., Paterno, M., et al. (2015). Lessons
Sherman, R. E., Anderson, S. A., Dal Pan, G. J., Gray, learned from implementing service-oriented clinical
G. W., Gross, T., Hunter, N. L., LaVange, L., decision support at four sites: A qualitative study.
Marinac-Dabic, D., Marks, P. W., Robb, M. A., International Journal of Medical Informatics, 84(11),
Shuren, J., Temple, R., Woodcock, J., Yue, L. Q., & 901–911.
Califf, R. M. (2016, December 8). Real-world evi- Wyatt, J., & Spiegelhalter, D. (1990). Evaluating medi-
dence – what is it and what can it tell us? The New cal expert systems: What to test and how? Medical
England Journal of Medicine, 375(23), 2293–2297. Informatics (London), 15, 205–217.
Slight, S. P., & Bates, D. W. (2014). A risk-based regulatory Wyatt, J., & Wyatt, S. (2003). When and how to evaluate
framework for health IT: Recommendations of the clinical information systems ? International Journal
FDASIA working group. Journal of the American of Medical Informatics, 69, 251–259.
Medical Informatics Association, 21(e2), e181–e184. Wyatt, J. C., Batley, R. P., & Keen, J. (2010, October). GP
Somerville I. (2015). Software Engineering (10th edi- preferences for information systems: Conjoint anal-
tion). Pearson, 816 pages. ISBN-10: 0133943038. ysis of speed, reliability, access and users. Journal of
Spiegelhalter, D. J. (1983). Evaluation of medical Evaluation in Clinical Practice, 16(5), 911–915.
decision-aids, with an application to a system for Zhang, J., Johnson, T. R., Patel, V. L., Paige, D. L.,
dyspepsia. Statistics in Medicine, 2, 207–216. & Kubose, T. (2003). Using usability heuristics to
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, evaluate patient safety of medical devices. Journal of
I., & Salakhutdinov, R. (2014). Dropout: A simple Biomedical Informatics, 36(1–2), 23–30.
way to prevent neural networks from overfitting.
465 II
Biomedical
Informatics
Applications
Contents
Contents
References – 503
Degree of structure of data Narrative notes rized users, but they also provide the benefit
stored in EHRs have the advantage over their of greater control over data and user access
paper predecessors as they can be searched by and improved enforcement of applicable pri-
word or patterns. The success of such searches vacy regulations as required by the Health
is limited by the sophistication of the EHR’s Insurance Portability and Accountability Act
text mining and search capabilities along with (HIPAA) (see 7 Chap. 31).
the quality of the user’s search criteria particu- System challenges can be experienced
larly since there can often be language variabil- from several perspectives:
ity in expression of medical terms including the
use of abbreviations. Challenges with system use and train-
A great deal of valuable information is ing Physicians and other key personnel have
contained in clinical notes mostly recorded in to take time from their work to learn how to
narrative. Though often time consuming and use the system. Furthermore, clinical work-
cumbersome, one way to obtain structured flows have to be re-designed in order to utilize
data with clinical notes is to ask the clinician the EHR effectively. It is increasingly appreci-
to enter key information through structured ated that EHR usability needs to be improved
forms which restrict data entry with a con- so that clinicians can easily document and
trolled vocabulary that is standards-based. provide patient care.
Powerful automated techniques like natural
language processing can also be increasingly Readability of clinical notes One of the advan-
leveraged to extract clinical data from notes tages of EHRs is the ease with which text can
(see 7 Chap. 9). be entered compared to paper notes.
Unfortunately, this can result in something
Ubiquity of access With today’s secure net- referred to as “note bloat”, resulting from cut-
works and other distributed technologies, cli- ting and pasting text and inserting other results
nicians and patients can access a patient’s (7 Sect. 14.3.1.2), resulting in voluminous
EHR from geographically distributed sites. documentation compared to paper records,
Paper records have significant inaccessibility making EHR notes often more difficult to
issues as they can only be in one place and with review efficiently.
at most one user at any point in time. Previously,
completing discharge summaries and signing System failures and ensuring adequate redun-
orders with paper records or borrowing records dancy and security Computer-based systems
14 for administrative or research purposes from have the potential for catastrophic failures that
medical record departments was logistically could cause extended unavailability of patients’
challenging. In contrast, EHRs are ubiq computer records. To combat this, EHRs today
uitously available to users for these and other often run across distributed technologies offer-
purposes. ing redundancy such as parallel operation at
In some cases, the collective data about geographically separate sites and hot fail over
one patient from independent care systems with separate computer systems running syn-
can also be accessed through health informa- chronously with the primary system that can
tion exchanges (see 7 Chap. 17). Such avail- take over near instantaneously from the pri-
ability can also support health care continuity mary system if it were to fail. Yet, nothing pro-
during disasters. Brown et al. (2007) found a vides complete protection; contingency plans
“stark contrast” between the care VA versus with downtime procedures must be developed
non-VA patients received after Hurricane for handling brief (even planned) or longer sys-
Katrina, because appropriate and uninter- tem outages. Also, cybersecurity is increasingly
rupted care were supported by nationwide a concern that health care systems are working
access to the comprehensive VA EHRs. EHRs to improve as attacks are of increasing sophis-
not only make data more accessible to autho- tication and frequency (see 7 Chap. 18).
Electronic Health Records
471 14
14.2 Historical Perspective: Lockheed’s hospital information system (HIS)
Development of EHRs at El Camino Hospital, which became opera-
tional in 1971 (Coffey 1979).
The initial development of automated sys- Weed’s problem-oriented medical record
tems in health care was stimulated by regula- book (POMR) (1968) shaped medical think-
tory and reimbursement requirements. Early ing about both manual and automated medi-
health care systems in the inpatient setting cal records. His computer-based inpatient
provided charge capture functionality to meet system followed (Schultz et al. 1971). Morris
billing requirements in a fee-for-service Collen, who also pioneered the multiphasic
environment. screening system (1969), wrote a readable 500-
The Flexner report on medical education page history of medical informatics (1995)
was the first formal statement made about the that provides rich details about these early
function and contents of the medical record medical records systems, as does a three-
(Flexner 1910). In advocating a scientific decade summary of computer-based medical
approach to medical education, the Flexner record research projects from the U.S. Agency
report also encouraged physicians to keep a for Health Care Policy and Research
patient-oriented medical record. Three years (AHCPR) (Fitzmaurice et al. 2002).
earlier, Dr. Henry Plummer initiated the “unit EHRs can provide Clinical Decision
record” for the Mayo Clinic (including its St. Support (CDS) by suggesting needed action
Mary’s Hospital), placing all the patient’s vis- based on the patient data it carries. A few
its and types of information in a single folder. early systems: HELP (Warner 1972; Pryor
This innovation represented the first longitu- 1988) the RMRS (McDonald 1973, 1976)
dinal medical record (Melton 3rd 1996). The offered CDS as part of their initial design.
Presbyterian Hospital (New York) adopted Other early EHRs added CDS capability as
the unit record for its inpatient and outpatient they grew: the Columbia University system
care in 1916, studying the effect of the unit (Johnson et al. 1991; Hripcsak et al. 1999), the
record on length of stay and quality of care CCC (Center for Clinical Computing) system
(Openchowski 1925) and writing a series of at Beth Israel Deaconess Medical Center
letters and books about the unit record that (Rind et al. 1994; Slack and Bleich 1999;
disseminated the approach around the nation Bleich et al. 1985; Halamka and Safran 1998),
(Lamb 1955). and others (Giuse and Mickish 1996; Teich
The first record we could find of a et al. 1999; Cheung et al. 2001; Duncan et al.
computer-based medical record was a short 2001; Brown et al. 2003).
newspaper article describing a new “electronic Since those early years, hundreds of com-
brain” – to replace punched and file index mercial venders have emerged to supply out-
cards and to track hospital and medical patient practice computer systems and a few
records by the Michigan Hospital Service dozen offered for inpatient systems. However,
(Brain 1956). The first, operational EHRs as hospital systems merged into ever-larger
emerged in the early 1970’s. Some started an aggregations and pulled office practices
out patient systems, including Costar from through acquisitions, the boundaries between
Massachusetts General Hospital (Grossman outpatient and inpatient computer systems
et al. 1973; Barnett et al. 1979; Barnett 1984), have blurred and the number of health care
RMRS from the Regenstrief Institute system EHR vendors has shrunk consider-
(McDonald 1973; McDonald et al. 1975; ably. However, specialized clinical informa-
McDonald et al. 1999), Duke University tion systems that cover the special needs
(Stead 1977; Stead & Hammond 1988), STOR within large and complex health care systems
(Simborg and Whiting-O’Keefe III 1981), and including medical and imaging areas con-
others (see outpatient EHR review by Kuhn tinue. Today, a majority of health systems in
et al. 1984). Other systems began on the inpa- the US use a select few EHRs provided by
tient side including HELP (Warner 1972) and major health information technology (IT)
vendors.
472 G. B. Melton et al.
.. Fig. 14.1 A screenshot of the combined WorldVistA figure shows how clinical images can be presented with
Computer Based Patient Record System (CPRS) and laboratory test results, medications, notes and other rel-
ISI Imaging system. These systems are derived from the evant clinical information in a single longitudinal medical
Department of Veterans Affairs VistA and VistA Imaging record. (Source: Courtesy of WorldVistA (7 worldvista.
systems (7 http://www.va.gov/vista_monograph/). The org) and II Group (7 www.isigp.com), 2012)
based on name, birth date, and other patient variables, and assessments (McDonald et al.
characteristics such as address and employ- 2003; Vreeman et al. 2010); SNOMED CT7
ment. The performance of these algorithms (Wang et al. 2002) for diagnoses, symptoms,
must be monitored for data integrity issues findings, organisms and answers; UCUM8 for
and errors and associated processes created computable units of measure; and RxNorm9,10
to manage patient identity for cases where for clinical drug names, ingredients, and
algorithms are not sufficient to adjudicate orderable drug names for various purposes
potential matches (Just et al. 2016; Zech et al. (see also 7 Chaps. 8 and 31). Supporting this
2016). trend are laboratory instrument vendors,
One of the more significant barriers today which are beginning to specify LOINC codes
to the integration of health record data from to use for each of the tests results that their
different organizations are the local and idio- instruments generate.11
syncratic identifiers used to label observations As healthcare providers consolidate and
and coded observation values — recapitulat- bring together EHR data or implement a new
ing the Babel story. However, those barriers
are shrinking as Health IT regulations5 and
institutions adopt terminology standards, 7 SNOMED Clinical Terms® (SNOMED CT®) Five-
including LOINC6 for observations, questions, step briefing. 7 https://www.snomed.org/snomed-ct/
(Accessed 6/4/2020).
8 The Unified Code for Units of Measure. 7 http://
unitsofmeasure.org/ (Accessed 6/4/2020).
5 Certification of Health IT, Testing Process & Test 9 RxNorm Overview. 7 http://www.nlm.nih.gov/
Methods, 2015 Edition Test Method. 7 https:// research/umls/rxnorm/overview.html (Accessed
www.healthit.gov/topic/certification-ehrs/2015-edi- 6/4/2020).
tion-test-method (Accessed 6/4/2020). 10 RxTerms. 7 https://wwwcf.nlm.nih.gov/umlslicense/
6 Logical Observation Identifiers Names and Codes rxtermApp/rxTerm.cfm (Accessed 6/4/2020).
(LOINC®) from Regenstrief. 7 http://loinc.org/ 11 7 https://ivdconnectivity.org/fda-encourages-livd
(Accessed 6/4/2020). (Accessed 6/4/2020).
474 G. B. Melton et al.
Data Entry
Order Event
& Results
Entry Monitor
Review
Database
Pharmacy Interface
Medical
Entities
Dictionary
Radiology
Patient Research
Databases Databases
.. Fig. 14.2 A block diagram of multiple-source-data router of information to the central database. It may
systems that contribute patient data, which ultimately also provide more intelligent filtering, translating, and
reside in a computerized patient record (CPR). The alerting functions, as it does at Columbia University
database interface, commonly called an interface engine, Medical Center. (Source: Courtesy of Columbia
performs a number of functions. It may simply be a University Medical Center, New York)
EHR, organizations have used a number of Today, most clinical data sources and
approaches to load EHRs with pre-existing EHRs can send and receive clinical content as
patient data. One approach is to interface the version 2.x Health Level 7 (HL7)12 messages.
EHR from the available electronic source (e.g., Most organizations use interface engines for
dictation service, pharmacy system, and labo- HL7 messages and integration platforms (either
ratory information system) and load data from part of the integration engine or separate tech-
these sources for a pre-specified length of time nology platform) that can support other data
(e.g., 12 months). A second approach is to formats to send, receive, and, when necessary,
14 abstract select data (e.g., key laboratory results, translate the format of and the codes within
the problem lists, and active medications) and exchanged data (see 7 Chap. 17); . Fig. 14.2
either automatically or hand enter those data shows an example of architecture to integrate
into the new EHR prior to each patient’s visit data from multiple source systems. The
for a period of time. The third approach is to Columbia University Medical Center comput-
scan and store 1–2 years of the old paper erized patient record (CPR) interface depicted
records or to produce electronic “printout” in this diagram not only provides message-han-
versions (e.g., as Portable Document Format dling capability but can also automatically
[PDF]) of content stored in by preceding translate codes from the external source to the
EHR. This approach can be applied to any preferred codes of the receiving EHR. And
kind of document, including handwritten although many vendors now offer single sys-
records that predate the EHR installation. tems that serve “all” needs, they never escape
Optical Character Recognition (OCR) capabil- the need for standards- based data exchange
ity is built into most document scanners today, from ancillary systems (e.g., EKG carts, cardiol-
and converts typed text within scanned docu-
ments to computer understandable text with
98–99% character accuracy, which can make 12 Health Level Seven International, 7 http://www.
this content potentially searchable. hl7.org/ (Accessed 6/4/2020).
Electronic Health Records
475 14
ogy systems, radiology imaging systems, anes- expressions can verify that the entered data have
thesia systems, off-site laboratories, community a required pattern (e.g., the three digits, hyphen,
pharmacies and external collaborating health and four digits of a local telephone number).
systems). At least one high-capability open- Range and pattern checks (among others can
source interface engine, NextGen Connect (for- be implemented using standard browser
merly Mirth Connect),13,14 is available and used features).Computed checks can verify that val-
relatively widely for data exchange. ues have the correct mathematical relationship
HL7 Fast Healthcare Interoperability (e.g., white blood cell differential counts,
Resources (FHIR) is an elegant application- reported as percentages, must sum to 100).
programming interface for exchanging clini- Consistency checks can detect errors by com-
cal data (see 7 Chap. 17) with a recognizable paring entered data (e.g., the recording of can-
heritage from V2. In 2018, it was embraced by cer of the prostate as the diagnosis for a female
a surge of large organizations including patient). Software for accomplishing this is
Apple, (Apple Health), Microsoft and Google embedded in standard web browser). Delta
(Mandl et al. 2019), many federal agencies checks warn of large and unlikely differences
(CMS, ONC, the Veterans Administration), between the values of a new result and of the
major EHR vendors, and health insurance previous observations (e.g., a recorded weight
companies. Most of them are also using the that changes by 100 lbs. in 2 weeks). Spelling
related, SMART on FHIR App specification checks verify the spelling of individual words.
(Mandel et al. 2016), with which users can Clinician-gathered patient information
develop Apps designed to access data within requires special comment because it presents
EHRs from outside of that EHR. one of the most difficult challenge to EHR
developers and users. Physicians spend at least
14.3.1.2 Clinician Data Entry 20% of their time documenting the clinical
Clinical data may be entered as narrative free- encounter (Gottschalk and Flocke 2005;
text, as codes, or as a combination of the two. Hollingsworth et al. 1998). The burden has risen
Trade-offs exist between the use of codes and over time for several reasons (Poissant et al.
narrative text. The major advantage of struc- 2005). EHRs tend to require far more data
tured data is that it makes the data “under- entry than the pre-existing manual systems.
standable” to the computer and thus enables Many studies suggests that the EHR functions
selective retrieval, clinical research, quality taken together may consume up to 1–2 hours of
improvement, and clinical operations man- the physician’s free time per clinic day (Sinsky
agement. The coding of diagnoses, allergies, et al. 2016; McDonald et al. 2014). In one study,
problems, orders, and medications is of par- the computer system was a primary cause of
ticular importance for these purposes. clinician dissatisfaction (Edgar 2009) and their
Because of the chance of errors with the reason for leaving military medicine. In addi-
hand entry of data, EHRs apply validity checks tion, EHR documentation requirements have
scrupulously. A number of different kinds of been repeatedly been cited as a significant cause
checks apply to clinical data (Schwartz et al. of physician burnout (Gardner et al. 2019).
1985). Range checks can detect or prevent entry EHRs tend to forbid simple narrative text
of values that are out of range (e.g., a serum responses; so providers have to dig through
potassium level of 50.0 mmol/L—which is menus to find the coded term that expresses
impossibly outside the normal range of 3.5– their intended meaning. Billing requirements
5.0 mol/L). Pattern checks for including regular and fear of malpractice have fueled the demand
for ever more data entry. Adding to provider’
data entry pain is the fact that EHR user inter
13 7 https://github.com/nextgenhealthcare (Accessed faces can be clumsy and non-intuitive.
6/4/2020). The requirement that providers enter all of
14 NextGen Connect Integration Engine. 7 https://
www.nextgen.com/products-and-services/integra-
their findings, impressions and plans into
tion-engine and 7 https://github.com/nextgen their note and orders into the computer comes
healthcare (Both accessed 6/4/2020). from a dictum that says the person who cre-
476 G. B. Melton et al.
ates the content should enter it. This dictum proper attention to detail, users may copy infor-
often makes sense for prescriptions, orders, mation that is no longer pertinent or true or pos-
and perhaps diagnoses and procedure orders, sibly lose context especially with time expressions
because immediate provider entry during the (e.g., “yesterday”). Studies note high rates of
course of care makes diagnostic testing, treat- text duplicated from previous notes of over 50%
ments and check out more efficient and pro- (Wrenn et al. 2010; Zhang et al. 2014).
vides crucial grist for CDS. The justification Dictation with transcription has been com-
for direct entry of visit notes by clinicians is mon historically for entering narrative informa-
weaker because of its time cost to physicians tion into EHRs. Transcriptionists are often able
is high and the information is not a pre- to maintain a degree of structure in the tran-
requisite to the checkout process. scribed document via section headers, and the
Today, clinical notes can be entered into structure can also be delivered as an HL7 CDA
the EHR via one of three general mecha- document (Ferranti et al. 2006). Speech recogni-
nisms: (1) transcription or use of speech rec- tion software offers an approach to “dictating”
ognition systems to convert spoken word to without the cost or delay of transcription by
dictated or written notes, (2) clinic staff translating clinician speech to text automati-
(scribes) who transfer text or codes of provid- cally. Historically, these systems resulted in
ers written or spoken content into the com- errors that required significant time to find and
puter system, and (3) direct data entry by correct misunderstood words. Increasingly,
physicians themselves (potentially facilitated these solutions have improved speech recogni-
by electronic templates or macros). tion algorithms, and companies can reach accu-
Free-form narrative entry—by typing, dicta- racies better than 99% without training.
tion, or speech recognition—allows clinicians to Skeptical readers can try it themselves.15
express important clinical information in the In addition, some dictation services use
most natural manner. When clinicians commu- speech recognition to generate a draft transcrip-
nicate in narrative, they naturally prioritize find- tion, which the transcriptionist corrects while
ings and leave much information implicit. For listening to the audio dictation, thus saving tran-
example, an experienced clinician often leaves scriptionist time. Natural-language processing
out “pertinent negatives” (i.e., findings that the (NLP) (see 7 Chap. 9) offers hope for automatic
patient does not have but that nevertheless encoding of narrative text (Nadkarni et al.
inform the decision-making process) knowing 2011). Some companies are exploring the use of
that the clinician who reads the record will inter- NLP to auto-encode transcribed text, and
14 pret them properly to be absent. The result is employ the transcriptionist to correct any NLP
usually a more concise history with a high sig- coding errors (see 7 Chap. 9).
nal-to-noise ratio that not only shortens the data Some practices have scribes (a variant on
capture time but also lessens the cognitive bur- the stenographers of old) to do much of the
den on the reading clinician. Weir and colleagues physicians’ data entry work (Koshy et al. 2010;
present compelling evidence about these advan- Misra-Hebert et al. 2016). Scribes typically
tages, especially when narrative is focused and work alongside the care provider in the exami-
vivid, and emphasize that too much information nation room, or remotely through an audiovi-
interferes with inter- provider communication sual connection or recording. The Joint
(Weir et al. 2011). Commission is agnostic about the use of
Most EHRs let physicians cut and paste scribes, but provides guidelines for their use
notes from previous visits and other sources or (The Joint Commission 201816).
even have automated functionality to “bring
forward” content from previous visits. For
example, a physician can cut and paste parts of 15 7 https://cloud.google.com/speech-to-text/
a visit note into a letter to a referring physician 16 The Joint Commission: Documentation assistance
provided by scribes. 7 https://www.jointcommission.
and into an admission note, a most appropriate org/en/standards/standard-faqs/nursing-care-center/
use of this capability. However, when over-used, record-of-care-treatment-and-services-rc/000002210/
this can cause ‘note bloat.’ In addition, without (Accessed 6/4/2020).
Electronic Health Records
477 14
Another data-entry method is to have clini- cific data-capture form completed by the
cians record information on a structured form, child’s family in the waiting room
from which data and associated documenta- (. Fig. 14.3a), the CHICA computer uses the
tion are created (Downs et al. 2006; Hagen entered data to generate a physician encounter
et al. 1998). One system, called CHICA form with a tailored agenda for the encounter
(Anand et al. 2018), originally generated a (. Fig. 14.3b). The CHICA system generates
patient specific and scannable paper document a prose version of associated form responses
and used optical character and mark recogni- which can be incorporated into and used to
tion to capture the recoded data in a two-step help generate a clinical note.
process. Today, data can be data captured from The third alternative is the structured,
a handheld electronic tablet or via a webpage coded entry of data by clinicians. A major
(Anand et al. 2017). In addition to a child-spe- issue associated with direct physician entry is
.. Fig. 14.3 a The family completes the first form with played as a web form in the EHR or printed on paper
questions tailored to patient’s age and other factors. that an OCR system interprets. Coded results are stored
Form can be displayed on a tablet or printed on a tai- in the computer and a prose version is returned by the
lored paper form that is scanned by an OCR system that system to be incorporated in the physician’s note.
passes the content to the EHR. b The computer gener- (Source: Courtesy of Prof Stephen M Downs, Indiana
ates a physician encounter form based on the contents University, Indianapolis, IN)
of the first form and adds reminders. The form is dis-
478 G. B. Melton et al.
14
the physician time cost. Studies document tency with previously stored information and
that structured data entry consumes more cli- can ask for additional detail or dimensions
nician time than the traditional record keep- conditional on the information just entered.
ing (Chaudhry et al. 2006), as much as Some of these data will be entered into fields,
20 seconds per SNOMED CT coded diagno- with menu selection. For ease of entry, such
sis (Fung et al. 2011). On the other hand, this menus should not be very long, require scroll-
option has the advantage that the computer ing, or impose a rigid hierarchy (Kuhn et al.
can immediately check the entry for consis- 1984). Using a process called auto complete,
Electronic Health Records
479 14
clinicians can code items by typing in a few The long-term solution for effective data
letters of an item name, then choose the item capture of information generated by clini-
they need from the list of items that match the cians is still evolving. Semi-structured data
string they entered. In some cases this process entry combines the use of narrative text fields
can be fast and efficient, but critics have amenable to natural language processing
described it as “death by a thousand clicks” combined with structured data entry fields
(Fry and Schulte 2019). where needed. With time and better input
The use of templates and menus can speed devices, acquisition of data will become
note entry, but they can also generate excessive faster and easier. In addition, direct entry of
boilerplate and discourages specificity (i.e., it is some data by patients could reduce the data
easier to pick an available menu option than to entry burden for clinicians (Janamanchi et al.
describe a finding or event in detail). Notes 2009).
written via templates may not convey as clear,
or as accurate, a picture of the patient’s state as 14.3.1.3 Data Display
a provider note written in narrative. Developers Once stored in the computer, EHRs can pres-
can develop separate data capture forms using ent patient data in different formats for differ-
APIs specific to the manufacturer resulting in ent purposes. These systems can also present
forms that are relatively easier to use and often content in novel formats. Clinicians need
can adapt to various form factors. more than just integrated access to patient
Among its many other capabilities, FHIR data; they also need various views of these
has developed specifications for web data cap- data: in chronologic order as flowsheets or
ture forms. Questionnaire17 is FHIR’s data cap- graphs to highlight changes over time, and as
ture resource. Questionnaire supports skip snapshots that show a computer view of the
logic, nesting and repeating groups of questions patients’ current status and their most impor-
and many simple data validation checks. It also tant observations.
supports nested, and repeating, groups of ques-
tions and thus can accommodate complicated Timeline Graphs
forms such as the surgeon general is family his- A graphical presentation can help the physi-
tory (see . Fig. 14.4a). FHIR has developed cian to assimilate and draw conclusions
an enhanced version of Questionnaire called from the information quickly and draw con-
Structured Data Capture (SDC)18 in STU4 trial clusions (Fafchamps et al. 1991; Tang and
use. It adds many capabilities to Questionnaire, Patel 1994; Starren and Johnson 2000). An
including arithmetic and logical calculations anesthesia system vendor provides an espe-
(see . Fig. 14.4b – BMI and . Fig. 14.4c – cially good example of the use of numbers
Apgar score) and mechanisms for pre-populat- and graphics in a timeline to convey the
ing forms with existing patient data from an patient’s state in form that can be digested at
EHR. It also includes regular expressions for a glance (Vigoda and Lubarsky 2006).
validation of data entry along with mechanisms Sparklines – “small, high resolution graph-
for storing form content data into designated ics embedded in a context of words, num-
FHIR resources. Finally, SDC supports adap- bers, images” (Tufte 2006), which today’s
tive (CAT) survey instruments such as browsers and spreadsheets can easily gener-
PROMIS19 patient reported outcomes. ate – provide a way to embed graphic time-
lines into any report. One study found that
with sparklines, “physicians were able to
17 7 https://www.hl7.org/fhir/questionnaire.html assess laboratory data faster”. Sparklines
(Accessed 6/4/2020). enable more information to be presented
18 7 http://hl7.org/fhir/uv/sdc/2019May/ (Accessed more compactly in a single view and thus
6/4/2020).
reduce the need to scroll or flip between
19 7 http://www.healthmeasures.net/explore-measure-
ment-systems/promis/intro-to-promis (Accessed screens” (Bauer et al. 2010).
6/4/2020).
480 G. B. Melton et al.
.. Fig. 14.5 A flow sheet of radiology reports. The on the radiology image icon provides the images.
rows all report one kind of study and the columns report (Source: Courtesy of Regenstrief Institute, Indianapo-
one date. Each cell shows the impression part of the lis, IN). The CareWeb program from Regenstrief, which
radiology report as a quick summary of the content of generated this flowsheet, presents cross-institutional
that report. The cells include two icons. Clicking on the patient flowsheets from the Indiana HIE to office prac-
report icon provides the full radiology report. Clicking tices today
14 complications of hypertension, or the medica- will be available for free download20,21 under a
tions used to treat it. Physicians at the LOINC-like agreement (. Fig. 14.7). In this
University of Wisconsin and University of example of a Problem Oriented View, displays
Texas-Southwestern have developed tables of medications and lab results relevant to the
mappings from problem categories (e.g., renal problem of acute systolic heart failure. If a
failure, ischemic heart disease) to observa- user chose a different problem in the panel on
tions, and to medications, with mappings the left, it would display content relevant to
developed through a multi-institutional con- that problem.
sensus process with universal code specifica-
tions that use SNOMED CT or ICD (for
defining problem classes), LOINC for obser- 20 125 SNOMED CT groupers published as online sup-
vations, and RxNorm for medications plement to open-access article above in reference #3.
(. Fig. 14.7) (Buchanan 2017; Willett et al. 7 https://www.thieme-connect.de/media/10.
1055-s-00035026/201803/supmat/10-1055-s-0038-
2018). Physicians at the University of Wis 1668090-s180031ra.pdf (Accessed 6/4/2020).
consin and University of Texas-Southwestern 21 See Problem List MD at 7 https://problemlist.org
have developed Problem Concept Maps that (Accessed 6/4/2020).
Electronic Health Records
483 14
.. Fig. 14.6 The Pocket rounds report—so called includes the all active orders (including medications),
because when folded from top to bottom, it fits in the recent laboratory results, vital signs and the summary
clinician’s white coat pocket as a booklet. It is a dense impressions of radiology, endoscopy, and cardiology
report (12 lines per inch, 36 characters per inch), printed reports. (Source: Courtesy of L. Simonaitis, Regenstrief
in landscape mode on one 8 1/2 × 11 in. page), and Institute, Indianapolis, IN)
The LHC Flowsheet on FHIR app (named preferred unit of measure using LHC’s unit
for the NLM’s Lister Hill Center), is an open validation/converter (UCUM).22
source web app (7 https://lhcflowsheet.nlm.
nih.gov/) that generates clinical flowsheets Summaries and Snapshots
from any FHIR server with LOINC coded EHRs can highlight important components
observation identifiers (. Fig. 14.8a, b). It (e.g., active allergies, active problems, active
can display very large datasets (the example treatments, and recent observations) in clini-
presents a patient with more than 20 thou- cal summaries or snapshots (Tang et al.
sand observations), can scroll quickly in the X 1999). . Figure 14.9 shows an example from
and Y-axis and show or hide selected groups the Epic EHR where active patient problems,
of variables. This app can also provide a active medications, allergies, health main
problem-focused flowsheet as well as time axis tenance reminders, and other relevant sum-
(column) compression, variable (row) axis
compression and units’ conversion according
to user preferences. It can also convert obser- 22 7 https://ucum.nlm.nih.gov/ucum-lhc/demo.html
vations with mixed units of measure into a (Accessed 6/4/2020).
484 G. B. Melton et al.
.. Fig. 14.7 In this example of a Problem Oriented chose a different problems in the panel on the left, it
would display content relevant to that problem. (© 2019
14 View, displays medications and lab results relevant to
the problem of acute systolic heart failure If a user Epic Systems Corporation. Used with permission)
mary information are summarized. These cian’s hospital hand crafted discharge sum-
views are automatically updated and kept mary.
current as new data arrives. In the future, we Researchers are developing increasingly
can expect more sophisticated summarizing sophisticated summaries. The HARVEST sys-
and surveillance strategies, such as auto- tem (Hirsch et al. 2015), for example, pro-
mated detection of adverse events (Bates cesses all of a patient’s notes, extracts unique
et al. 2003) or automated time-series events concepts, ranks them in importance for the
(e.g., cancer chemotherapy cycles). We may patient, and displays them in a word cloud.
also see patient data views that distinguish Clicking a word reveals the notes that support
abnormal changes that have been explained, the concept, and clicking the individual note
or treated, from those that have not, and dis- reveals the relevant snippet(s) of text. The
plays that dynamically organize the support- user can pick a subset of the patient’s time-
ing evidence for existing problems (Tang and line. The system reveals information that the
Patel 1994; Tang et al. 1994a; Buchanan user may not know to ask for and has been
2017). Ultimately, computers should be able found to be especially useful for emergency
to produce concise and flowing summary department doctors and quality assurance
reports that are like an experienced physi- nurses.
Electronic Health Records
485 14
.. Fig. 14.8 LHC Flowsheet on FHIR app presenting high (red up arrow) or low (blue down arrow) using HL7
the content of a large, (more than 15,000 observations) interpretation codes and normal value ranges. A spark
de- identified medical records. The dates and values line appears graphing all of the values in a row, appears
within the dataset have been slightly shifted from their to the right of each observation name. In a, values for
original values and salted with additional data using each observation have separate rows, whereas b all of the
random methods. In this display, the user may choose to values for within one equivalence class are folded into a
collapse columns to one value per quarter to facilitate single row after converting all values with any molar or
user interpretation. Out of range results are flagged as mass unit into a common, but configurable, mass unit
Dynamic On-Demand Data Views to 81% (Tang et al. 1994) of the time, when
Anyone who has reviewed a patient’s chart looking in the paper record, physicians did not
knows how hard it can be to find a particular find important patient information that was
piece of information. From 10% (Fries 1974) present in that record. Furthermore, the ques-
486 G. B. Melton et al.
.. Fig. 14.9 Summary record. The patient’s active one-page screen provides an instant display of core clin-
medical problems, current medications, and drug aller- ical data elements as well as reminders about required
gies are among the core data that physicians must keep preventive care. (Source: Courtesy of Epic Systems,
in mind when making any decision on patient care. This Madison, WI)
tions clinicians routinely ask are often the ones (CPOE), where clinicians make and act upon
that are difficult to answer from perusal of a therapeutic and diagnostic decisions by enter-
typical medical record. Common questions ing orders. CPOE systems can reduce errors
include whether a specific test has ever been and costs compared to paper systems, in
performed, what kinds of medications have which orders are transcribed manually from
been tried, and how the patient has responded one paper form (e.g., the orders section of the
to particular treatments (e.g., a class of medi- paper chart) to another (e.g., the nurse’s work
cations) in the past. Physicians constantly ask list, a laboratory request form), or faxed to a
14 these questions as they flip back and forth in receiving area for fulfillment (e.g., transporta-
the chart searching for the facts needed to sup- tion services, pharmacy). CPOE orders pass
port or refute a given hypotheses as their think- electronically from the decision-maker to the
ing about the patient evolves. On demand order filler with minimal to no manual labor.
search tools help clinicians locate and then Order entry systems also provide opportuni-
organize relevant patient data, into flowsheets, ties to deliver CDS when providers are enter-
problem oriented displays (see section ing orders and making clinical decisions. Most
“7 Timeline Flowsheets”) or graphs (Fafch existing CPOEs provide alerts about drug
amps et al. 1991; Tang et al. 1994a; Starren and interactions, allergies, and dosing adjustments
Johnson 2000) to facilitate provider assim for renal insufficiency when new drug orders
ilation of the relevant facts. are entered. However, EHR implementers
should be selective about which alerts to evoke
and be parsimonious with the use of interrup-
14.3.2 Computerized Provider tive alerts to avoid wasting provider time on
Order Entry trivial or low-likelihood outcomes (Miller et
al. 2005a; Phansalkar et al. 2012a, b). We dis-
One of the most important components of an cuss this capability in more fully in the next
EHR is computerized provider order entry section.
Electronic Health Records
487 14
.. Fig. 14.10 Neonatal Intensive Care Unit (NICU) provider’s prescribed goal for amount of fluid,
Total Parenteral Nutrition (TPN) Advisor provides calories, nutrition, and special additives. (Source:
information about complex interactive advice and Miller et al. (2005b). Elsevier Reprint License No.
performs various calculations in response to the 2800411402464)
Order entry systems can also remind pro- care, with benefits to quality and costs.
viders about important orders, which might Because of these advantages, health care
otherwise be forgotten. Very intelligently organizations have adopted CPOE widely and
designed order entry systems can shrink the federal regulations for certified health IT
work of entering complicated orders such require core CPOE functionality for medica-
as declining dosing of prednisone, intrave- tions, laboratories, diagnostic imaging, and
nous fluid orders, and total parenteral nutri- EHRs, which include checking for drug-drug
tion (TPN) orders the last of which require and drug-allergy interactions23–though some
entry of many additives and calculations question the wisdom of those interaction
to avoid dangerous mixtures and to reach requirements (Khajouei and Jaspers 2010)
specified targets for calories each additives. and, in one study, the use of these checks with
. Figure 14.10 shows an example of a TPN alerts had no effect in the rate of adverse drug
order entry screen from Vanderbilt (Miller reactions (Nebeker et al. 2005).
et al. 2005b). However, with some order entry
systems, entry of intravenous and declining
dose orders can be more difficult than with 23 Certification of Health IT, Testing Process & Test
the manual alternative. Methods, 2015 Edition Test Method. 2015 Edition
When a CPOE system is operational, sim- Certification Regulations – 170.315(a) (1, 2, 3, 4):
Computerized Provider Order Entry and Drug-drug
ply changing the default drug or dosing based and Drug-allergy checks for CPOE 7 https://www.
on the latest scientific evidence can shift order- healthit.gov/topic/certification-ehrs/2015-edition-
ing behavior toward the optimum standard of test-method (Accessed 6/4/2020).
488 G. B. Melton et al.
14.3.3 Clinical Decision Support dose, and duration of treatment from the sys-
tem improved clinical outcomes and reduced
Clinical trials have shown that certain remind- costs of infections among patients managed
ers from CDS can improve care processes with the assistance of this system (Evans et al.
(McDonald 1976; Haynes 2011; Damiani 1998; Pestotnik 2005). Vanderbilt’s inpatient
et al. 2010; Schedlbauer et al. 2009; Ranta “WizOrder” CPOE system also addressed
et al. 2015; Clyne et al. 2012; Tajmir et al. antibiotic orders, as shown in . Fig. 14.12; it
2017) but the efficacy of CDS broadly is suggests the use of Cefepine rather than
mixed (Delvaux et al. 2017; Parshuram et al. Ceftazidine, and provides choices of dosing
2018; Muth et al. 2018; Fried et al. 2017). by indication.
EHRs can deliver CDS in batch mode at Clinical alerts attached to a laboratory test
intervals across a whole practice population result can include suggestions for appropriate
to identify patients who are not reaching to follow up or treatments for some abnormali-
treatment targets or are past due for immuni- ties (Ozdas et al. 2008; Rosenbloom et al.
zations, cancer screening, or have missed their 2005). Also, CPOE functionality can warn the
recent appointments, to cite a few examples. physician about allergies (. Fig. 14.13a) and
In batch mode, clinical practices can utilize drug interactions (. Fig. 14.13b) before the
lists of patients generated by CDS to contact provider completes a medication order, as
the patient and encourage him or her to reach exemplified by screenshots from Partner’s out-
a goal or to schedule an appointment for the patient medical record orders.
delivery of suggested care and can reach Reminders and alerts are employed widely
patients who have not kept scheduled appoint- in outpatient care. Indeed, the outpatient set-
ments. ting is where the first study of clinical remind-
Decision support—especially for preven- ers and the first randomized trial of medical
tive care—is most efficiently delivered in the informatics systems, was performed
course of routine care while the patient and (McDonald 1976) and remains the setting for
provider are together. Suggestions can be the majority of such studies (Garg et al. 2005).
delivered during the physician order entry Reminders to physicians in outpatient settings
process, which in some cases can be the best quadrupled the use of recommended vaccines
point in the workflow at which to discourage in eligible patients compared with those who
or countermand an order that might be dan- did not receive reminders (McDonald et al.
gerous or wasteful. It is also a convenient 2014; McPhee et al. 1991; Hunt et al. 1998;
14 point to offer reminders about needed tests or Teich et al. 2000). Reminder systems can also
treatments, which can easily be initiated dur- suggest needed tests and treatments for eligi-
ing that order session. ble patients (Overhage 1997). . Figure 14.14
One of the best ways for CDS systems to shows an Epic system screen with reminders
remind providers about tests or treatments is to consider ordering a cardiac echocardio-
by presenting pre-constructed order(s) to the gram and starting an ACE inhibitor—in an
provider who can confirm or reject the order(s) outpatient patient with a diagnosis of heart
with a single keystroke or mouse click. It is best failure but no record of a cardiac echo
to annotate such suggestions with their ratio- cardiogram or treatment with one of the most
nale (e.g., “the patient is due for his pneumonia beneficial drugs for heart failure.
vaccine because he has emphysema and is over Though the outpatient setting is the pri-
65”) so that the provider understands the ratio- mary setting for preventive care reminders,
nale for the suggestion (Mamlin et al. 2007). preventive reminders have also been applied
. Figure 14.11 shows some suggestions effectively in the hospital setting (Dexter et al.
from a sophisticated inpatient CDS system 2001). Furthermore, reminders directed to
developed by Intermountain Health Care. inpatient nurses improve preventive care even
This system used a wide range of clinical more than reminders directed to physicians
information to recommend antibiotic choice, (Dexter et al. 2004).
Electronic Health Records
489 14
.. Fig. 14.11 Example of the main screen a from the the culture results, and b disclaimers. (Source: Courtesy
Intermountain Health Care Antibiotic Assistant pro- of R. Scott Evans, Robert A. Larsen, Stanley L. Pestot-
gram needed. The program displays evidence of an nik, David C. Classen, Reed M. Gardner, and John
infection-relevant patient data (e.g., kidney function, P. Burke, LDS Hospital, Salt Lake City, UT (Larsen
temperature), recommendations for antibiotics based on et al. 1989) © Cambridge University Press)
.. Fig. 14.12 User ordered an antibiotic for which the through ordering an alternative antibiotic. Links to
Vanderbilt’s former inpatient “WizOrder” CPOE “package inserts” (via buttons) detailed how to prescribe
system, based on their Pharmaceuticals and recommended drug under various circumstances.
Therapeutics (P and T) Committee input, recommended (Source: Miller et al. (2005b). Elsevier Reprint License
a substitution. This educational advisor guided clinician No. 2800411402464)
14
UpToDate, Micromedex, and a variety of Health IT requirements24 (see . Fig. 14.15).
electronic textbooks. Some EHRs are proac- To support this function, HL7 Version 3
tive and routinely present short informational standard has produced the Context Aware
nuggets adjacent to the order item that the Knowledge Retrieval Application
clinician has chosen. Through an Infobutton
designed to pull context-specific information,
EHRs can also pull literature, textbook or
other sources of information relevant to a 24 Certification of Health IT, Testing Process & Test
particular clinical situation, and present that Methods, 2015 Edition Test Method, Clinical Deci-
sion Support. 2015 Edition Certification Regula-
information to the clinician on the fly (Del
tions - 170.315(a)(9): Clinical Decision Support
Fiol et al. 2012). The Infobutton standard is 7 https://www.healthit.gov/test-method/clinical-
now a core CDS functionality within certified decision-support-cds#ccg (Accessed 6/4/2020).
Electronic Health Records
491 14
.. Fig. 14.13 Drug-alert display screens from Partners alert for captopril, and b a drug-drug interaction between
former outpatient medical record application (Longitudinal ciprofloxacin and warfarin. (Source: Courtesy of Partners
Medical Record, LMR). The screens show a a drug-allergy Health Care System, Chestnut Hill, MA)
.. Fig. 14.14 Example of CDS alerts to order an echocardiogram and to start an ACE inhibitor in a patient with
diagnosed congestive heart failure. (Source: Courtesy of Epic Systems, Madison, WI)
492 G. B. Melton et al.
.. Fig. 14.15 This figure shows the use of Columbia tions. When the user clicks on one of the questions, the
University Medical Center’s Infobuttons during results Infobutton delivers the answers. (Source: Courtesy of
review. Clicking on the Infobutton adjacent to the Iron Columbia University Medical Center, New York)
result generates a window (image) with a menu of ques-
(“Infobutton” standard).25 HL7 FHIR also the one in front of the clinician. Conceptually,
has powerful CDS capabilities; namely, sup- this “green button” (Longhurst et al. 2014),
port for CDS Hooks and two scripting lan- like other CDS, would be accessible to clini-
14 guages: FHIRPath26 and CQL27 for algebraic cians at the point of care and provide aggre-
and logical calculations (See 7 Chap 8). gate patient data (e.g., outcomes to a particular
One idea increasingly of interest is to pro- medication for disease treatment according to
vide to clinicians at the point of care opti- similar patient characteristics) to help support
mized treatment plans or customized treatment decisions in the absence of high
information, derived from patients “just like” quality evidence.
25 HL7 Version 3 Standard: Context Aware Knowledge 14.3.5 Care Team and Patient
Retrieval Application (“Infobutton”), Knowledge Communication
Request, Release 2. 7 http://www.hl7.org/imple-
ment/standards/product_brief.cfm?product_id=208
Communication tools, that support timely
(Accessed 6/4/2020).
26 FHIRPath STU1 Release. [Internet]. 2019 [cited and efficient communication between patients
01/29/2019]. Available from: 7 http://hl7.org/fhir- and the health care team and amongst team
path/ (Accessed 6/4/2020). members, can enhance coordination of care
27 Health Level 7. Clinical quality language (CQL) and disease management. Patients are pro-
standard. [Internet]. 2018 [cited 01/29/2019]. Avail-
vided secure online access to their EHR and
able from: 7 http://www.hl7.org/implement/stan-
dards/product_brief.cfm?product_id=400 (Accessed integrated communication tools to ask medi-
6/4/2020). cal questions or conveniently perform other
Electronic Health Records
493 14
clinical (e.g., renew a prescription) or admin- of such reports up to date and accurate can be
istrative tasks (e.g., schedule an appointment) a challenge (Arsoniadis et al. 2017).
(Tang 2003). Increasingly, the delivery of Although most patient encounters are
optimal patient care also requires multiple defined by scheduled face-to-face visits (e.g.,
health care professionals that may cross sev- outpatient visit, home health visit), provider
eral organizations; thus, it is important that decision-making also occurs during non-face-
communication among team members and to-face and nonscheduled events (e.g., patient
organizations is delivered effectively, effi- telephone calls, prescription renewal requests,
ciently, and on time. Such communications and the arrival of new test results). Recently,
usually focus on a single patient and may CMS has indicated they will pay for another
require a care provider to assess or inter- kind of non-face-to-face event, namely virtual
change information from several systems and visits.28 EHRs and other Health IT will sup-
providers in order to coordinate relevant care. port and facilitate these non-face-to-face
Direct connectivity to patients is and will events, and video-visit capabilities will become
be increasingly important to patient-provider part of many EHR vendor system offerings.
communication. It will permit direct-to- EHRs are traditionally bounded by the
patient reminders (Sherifali et al. 2011) and institution in which they reside. The National
deliver home health monitoring data (such as Health Information Infrastructure (NHII)
home blood pressure measurements and glu- (NCVHS 2001) has proposed a future in
cose testing results) to the EHR and other which providers caring for a patient could
information systems (Earle 2011; Green et al. reach beyond his or her local institution to
2008). The patient’s personal health record automatically obtain patient information
(PHR) will also become an important desti- from all relevant sources (see 7 Chap. 13).
nation for clinical messages and test results Today, examples of such regional “EHRs,”
(see 7 Chap. 13). Relevant information can often referred to as Health Information
be “pushed” to the patient or their PHR via Exchanges (HIE), serve routine and emer-
e-mail, pager services, or other secure texting gency care, public health and other functions.
or closed loop communication (Major et al. The first HIE was the IHIE (Indiana Health
2002; Poon et al. 2002; Gulacti and Lok 2017; Information Exchange) (McDonald et al.
Rief et al. 2017; Przvbvlo et al. 2014) or 2005) which started in 1994 with 3 Indianapolis
“pulled” by users on demand during their hospitals and now includes hospitals from
routine interactions with the computer. most of Indiana. Other early HIEs include
EHRs can also provide electronic func- Ontario, Canada (electronic Child Health
tionality to assist in the transfer (or hand-off) Network),29 Kentucky (Kentucky Health
of care responsibility from one clinician to Information Exchange),30 and Memphis
another. When the transfer is between settings (Frisse et al. 2008) Today, scores of HIEs are
(e.g., from hospital to nursing home), the in operation 7 https://strategichie.com/mem-
sending clinician usually provides a brief ver- bership/member-list/. A study from the
bal or written turnover note to the receiving Memphis HIE showed that the extra patient
clinician(s) summarizing the patient’s prob- information provided by this HIE saves
lems, treatments, and other relevant clinical resource use and costs (Frisse et al. 2011).
issues. . Figure 14.16 shows an example of a
“turn-over report” that includes instructions
from the “sending” physician, as well as rele- 28 7 https://www.cms.gov/outreach-and-education/
vant recent laboratory test results and other medicare-learning-network-mln/mlnproducts/
data pulled from the patient’s EHR and a “to- downloads/telehealthsrvcsfctsht.pdf (Accessed
do” list, that ensures that critical tasks are 6/4/2020).
complete (Stein et al. 2010). Such reports 29 eCHN electronic Child Health Network. 7 http://
www.echn.ca/ (Accessed 6/4/2020).
facilitate communication among team mem- 30 Kentucky Health Information Exchange Frequently
bers, and can improve both coordination and Asked Questions. 7 http://khie.ky.gov/Pages/faq.
patient safety. However, keeping the contents aspx?fc=010 (Accessed 6/4/2020).
494 G. B. Melton et al.
.. Fig. 14.16 Patient handoff report—a user- was developed by a customer within a vendor EHR
customizable hard copy report with automatic inclusion product (Sunrise Clinical Manager, Allscripts, Chicago,
of patient allergies, active medications, 24-h vital signs, IL) and was disseminated among other customers
recent common laboratory test results, isolation require- around the nation. (Source: Courtesy of Columbia Uni-
14 ments, code status, and other EHR data. This system versity Medical Center, New York)
According to a December 2018 ONC report, included NHIN Connect and NHIN Direct,33
half of all US hospitals now share some data the former with a special focus on large enter-
through one or more HIEs.31 prises and government organizations, the
In 2010, the Office of the National latter with a focus on simpler or local net-
Coordinator (ONC) proposed the Nationwide works. The Direct protocol34 is an email
Health Information Network (NwHIN) to (SMTP)-based protocol designed to deliver
connect regional HIEs and promote health encrypted messages and attachments securely
data exchange (see 7 Chaps. 13 and 31).32 It among pre-arranged groups of individuals or
organizations. ONC has nurtured the devel-
opment of the Direct Protocol and required
31 7 https://www.ruralcenter.org/resource-library/
methods-used-to-enable-interoperability-among- 33 7 https://www.healthdatamanagement.com/news/
us-non-federal-acute-care-hospitals (Accessed connect-nhin-direct-what-are-they (Accessed
6/4/2020). 6/4/2020).
32 eHealth Exchange. 7 https://ehealthexchange.org/ 34 7 h t t p : / / w i k i . d i re c t p ro j e c t . o rg / M a i n _ Pag e
(Accessed 6/4/2020). (Accessed 6/4/2020).
Electronic Health Records
495 14
EHR vendors to support it, and they have, but coders is more streamlined. The same system
minimally in some cases. can be used to review records, code, and pro-
A least seven organizations, provide gen- vide feedback to clinicians on documentation,
eral tools and trust policies to create new as well as allow clinicians to drop charges
secure health care networks or connect exist- directly into the EHR. In some cases, EHR
ing ones. Some of them are direct descendants coding and billing can also be augmented
of NHIN Connect. These seven are described with 3rd party systems, such as those leverag-
and compared in an excellent 2017 ONC ing computer-assisted coding with clinical
report.35 Among the seven, Direct Trust and NLP technologies (see 7 Chap. 9).
National Association for Trusted Exchange
(NATE) use the Direct Protocol exclusively,
and Surescripts and eHealth Exchange use it 14.4 EHRs for Secondary
optionally. The other three use a mix of non- and Population-Based Uses
Direct Protocols. Some of these organizations
are discussing partnerships or mergers. Unfor This section considers secondary uses of
tunately, support for delivering stru ctured EHRs, which are increasing greatly. Some of
data is limited in many of these systems, and this functionality can be done by or in concert
EHR vendors have not chosen a common with other systems and platforms. We expect
approach. that EHRs will continue to evolve and trans-
With the goal of one national network of form as information systems over time with
networks, ONC has proposed an overarching greater analysis and secondary use functions.
set of policies and protocols, called “The Medical personnel, quality and patient safety
trusted exchange framework and common professionals, and administrators use these
agreement” (TEFCA).36 It has been met with capabilities to find particular patterns and
both praise and criticism. FHIR offers addi events that predict patient outcomes. Public
tional mechanisms for linking independent health professionals can use reporting func-
networks and is highlighted in the TEFCA tions of computer-stored records for surveil-
proposal. To see how this all evolves, stay lance, including looking for emergence of new
tuned. diseases or other health threats that warrant
medical attention.
generate an alert or reminder message retrospective studies of existing data have con-
(McDonald 1976) to that patient. tributed much to medical progress (see
Organizations can use these systems at a pop- 7 Chap. 29). Retrospective studies can also
ulation level for care coordination, patient obtain answers at a small fraction of the time
empanelment for primary care providers, and and cost of comparable prospective studies.
other tasks. EHRs can often provide much of the data
For example, a cross-population query can required for a retrospective study. They can,
be used to identify patients who are due for for example, identify study cases and compa-
periodic screening examinations such as immu- rable control cases, and provide data needed
nizations, mammograms, and cervical Pap tests for statistical analysis of the comparison cases
and then can generate letters to patients or call (Brownstein et al. 2007). Combined with
lists for office staff to encourage the preventive access to discarded specimens, they also offer
care. This can also be especially useful for con- powerful approaches to retrospective genome
ducting ad hoc searches such as those required association studies that researcher can do
to identify and notify patients who have been much faster and at fraction of the comparable
receiving a recalled drug. Such systems can also prospective studies (Kohane 2011; Roden
facilitate quality management and patient et al. 2008).
safety activities, identify candidate patients for Computer-stored records do not eliminate
concurrent review and gather many of the data all the work required to complete an epide-
required to complete such audits. miologic study; chart reviews and patient
interviews may still be necessary. Computer-
stored records are likely to be most complete
14.4.2 Clinical Research and accurate with respect to visit diagnoses
that are carefully coded for administrative
Researchers can use EHRs particularly asso- purpose, as well as to prescribed drugs, and
ciated cross-patient queries and alerting capa- laboratory tests, because the latter two usually
bilities to identify patients who meet or have a come directly from automated laboratory and
high chance of meeting eligibility require- pharmacy systems, respectively. Consequently,
ments for a prospective clinical trial. For computer-stored records are likely to contrib-
example, an investigator could identify all ute to research on a physician’s practice pat-
patients seen in a medical clinic who have a terns, on the efficacy of tests and treatments,
particular diagnosis and satisfy the eligibility and on the toxicity of drugs. The research
14 requirements specified in a given study proto- opportunities will only improve with FHIR
col (Kho et al. 2007). These approaches can API and coding standards required by pro-
sometimes be applied in real time. At one posed CMS and ONC rules. NIH and AHRQ
institution, the physician’s workstation was have both encouraged research interest in
programmed to ask permission to invite the FHIR37,38 Also, improvements in NLP tech-
patient into a study, when that physician niques may make the content of narrative text
entered a problem that suggested the patient more accessible to automatic searches (see
might be a candidate for a local clinical trial. 7 Chap. 9).
If the physician gave permission, an auto-
mated electronic page could then be triggered
and sent to the nurse recruiter who would
then invite the patient to participate in the
study. One early such study was for patients
37 7 https://grants.nih.gov/grants/guide/notice-files/
with back pain (Damush et al. 2002).
NOT-HS-19-020.html (Accessed 6/4/2020).
Randomized prospective studies are the 38 7 https://grants.nih.gov/grants/guide/notice-files/
gold standard for clinical investigations, but NOT-OD-19-122.html (Accessed 6/4/2020).
Electronic Health Records
497 14
14.4.3 Quality Reporting and diagnoses coded from that record, we
know that the accuracy of coding varies by
EHRs are also increasingly important in the kind of diagnosis, setting and hospital size,
production or autonomation of quality and variability in granularity. Claims data can
reports that are used for both internal quality also provide structured and coded records of
improvement activities and for external regu- ambulatory prescriptions, but generally pro-
latory or public reporting. Although it is dif- vide no test results or clinical measurement
ficult for paper-based records to incorporate values. So, considering only claims-based
patient-generated input and it requires careful diagnostic codes can lead to inappropriate
tagging of data sources, EHRs can increas- policymaking and conclusions (Tang et al.
ingly include data contributed by patients 2007).
(e.g., patient-reported outcomes such as func- EHRs complement claim- and adminis-
tional status, pain scores, and symptom; trative-based data and can provide informa-
review of systems). Patient-reported data is tion about the relationships among diagnoses,
also being incorporated in future quality mea- severity of illness indication, and resource
sures particularly for disease specific or epi- consumption. Thus, these systems are impor-
sodic specific conditions, e.g., use of the tant tools for administrators who wish to
Seattle Angina Questionnaire Short Form make informed decisions in the increasingly
(SAQ-7) and Rose Dyspnea Scale (RDS) fol- value-based world of health care. On the
lowing Non-Emergent Percutaneous other hand, the use of EHR data for billing
Coronary Intervention (PCI).39 and administrative purposes can incentiv-
With changing reimbursement payment ize clinicians to bias their documentation
models focusing more on outcomes measures for maximal payment, and possibly reducing
instead of volume of transactions, generating the clinical accuracy of the diagnoses. It may
efficient and timely reports of clinical quality therefore be best to base financial decisions
measures will play an increasingly important on variables that are not open to interpreta-
role in management and payment. FHIR will tion.
likely play a role here, as well. FHIR supports Despite Reiser’s (1991) clinically oriented
the Clinical Quality Language (CQL)40 and goals for EHRs, much of what these systems
FHIRPath,41 a subset CQL. CQL was devel- are currently is driven by complex and pre-
oped for CMS quality reporting but has wider scriptive medical-legal, reimbursement, and
applicability to other initiatives. regulatory requirements (Cusack et al. 2013).
These requirements may lead to redundant
data capture, cumbersome documentation
14.4.4 Administration processes, and information that is biased
towards optimized billing. One potential solu-
In the past, administrators had to rely on data tion would be policy changes such that the tie
from billing systems to understand practice between payment and documentation ele-
patterns and resource utilization. However, ments are less emphasized, including the pro-
claims data have their limits, including their posed CMS rule “Patients Over Paperwork”42
delayed and retrospective nature. From direct which features decreased documentation bur-
comparisons between medical record content den for office visits.
39 7 h t t p s : / / c m i t . c m s . g o v / C M I T _ p u b l i c /
ViewMeasure?MeasureId=3516 (Accessed 6/4/2020).
40 7 https://ecqi.healthit.gov/cql-clinical-quality-lan-
guage (Accessed 6/4/2020). 42 7 CMS.gov. Patients Over Paperwork. 7 https://
41 7 https://www.hl7.org/fhir/fhirpath.html (Accessed www.cms.gov/About-CMS/story-page/patients-
6/4/2020). over-paperwork.html (Accessed 6/4/2020).
498 G. B. Melton et al.
14.5.2 Standards
46 7 https://www.lri.fr/presentation_en.php (Accessed
6/4/2020).
We alluded to the importance of standards 47 7 https://www.ncpdp.org/NCPDP/media/pdf/NCP-
earlier in this chapter, when we discussed the DPEprescribingBasics.pdf (Accessed 6/4/2020).
architectural requirements of integrating data 48 7 https://searchhealthit.techtarget.com/definition/
from multiple sources. Standards are the focus DICOM-Digital-Imaging-and-Communications-in-
Medicine (Accessed 6/4/2020).
49 7 https://www.edibasics.com/edi-resources/docu-
ment-standards/hipaa (Accessed 6/4/2020).
45 7 https://www.regenstrief.org/resources/clinical- 50 HIPAA for professionals. 7 https://www.hhs.gov/
learning/ (Accessed 6/4/2020). hipaa/for-professionals/index.html (Accessed 6/4/2020).
500 G. B. Melton et al.
.. Fig. 14.17 An example of NLP and rule-based inferred by NLP analysis to suggest possible matches in
c onversion of provider notes to order items from Blain the order items box
Takasu. Notes written in free text in the A/P section are
HITECH Act51 (see 7 Chaps. 8 and 31) and 4004)53 and these positions are present in
subsequently with the Medicare Access and proposed ONC and CMS rules.
CHIP Reauthorization Act of 2015
(MACRA),52 have stimulated significant
efforts to increase the adoption and function- 14.5.3 Costs and Benefits
ality of EHRs, as well as leverage these sys-
14 tems for quality reporting. More recently, the The National Academy of Medicine (for-
twenty-first Century Cure Act has significant merly Institute of Medicine) declared the
provisions around delivery of Health IT EHR an essential infrastructure for the deliv-
usability (Sect. 4001), Conditions of ery of health care, and the protection of
Certification (Sect. 4002(a)), Trusted patient safety (IOM Committee on Improving
Exchange Framework and Common Agree the Patient Record 2001). Like any infrastruc-
ment (Sect. 4003(b)), and guidelines around ture project, the benefits specifically attribut-
reasonable and necessary activities that do able to infrastructure are not immediate and
not constitute information blocking (Sect. sometimes difficult to establish; an infrastruc-
ture plays an enabling role in all projects that
take advantage of it. Early randomized con-
trolled clinical studies showed that computer-
based decision-support systems reduce costs
51 7 https://www.healthit.gov/sites/default/files/ and improve quality compared with usual
hitech_act_excerpt_from_arra_with_index.pdf care supported with a paper medical record
(Accessed 6/4/2020).
52 7 CMS.gov: MACRA. 7 https://www.cms.gov/
Medicare/Quality-Initiatives-Patient-Assessment-
Instruments/Value-Based-Programs/MACRA- 53 H.R.34 – 114th Congress. 7 https://www.congress.
MIPS-and-APMs/MACRA-MIPS-and-APMs.html gov/bill/114th-congress/house-bill/34 (Accessed
(Accessed 6/4/2020). 6/4/2020).
Electronic Health Records
501 14
(Tierney et al. 1993; Bates et al. 1997, 2003; federal government in health care—as a
Classen et al. 1997), and meta-analyses of payer, provider, policymaker, and regula-
Health IT have demonstrated quality benefits tor—federal leadership to create incentives
(Buntin et al. 2011; Lau et al. 2010; Clyne for developing and adopting standards and
et al. 2012). However, others have not found for promoting the implementation and use
consistent associations between EHRs and of EHRs remains crucial. Technological
CDS and better quality (Romano and Stafford change will continue to occur at a rapid
2011; Delvaux et al. 2017; Parshuram et al. pace, driven by consumer demand for enter-
2018; Muth et al. 2018). tainment, retail, games, and business tools.
Because of the significant resources needed Nurturing the use of IT in health care
and the significant broad-based potential ben- requires leaders including informatician
efits of these systems, the decision to imple- leaders who promote the use of EHRs and
ment an EHR is a strategic one for most work to overcome the obstacles that impede
healthcare organizations. Hence, the evalua- widespread use of computers for the benefit
tion of the costs and benefits must consider of health care.
the effects on the organization’s strategic
goals, as well as the objectives for individual nnSuggested Reading
health care (Samantaray et al. 2011). Today, Barnett, G. O. (1984). The application of
there are a number of Open Source options computer- based medical-record systems in
for EHR software with a range of capabilities ambulatory practice. New England Journal of
(Syzdykkova et al. 2017). Medicine, 310(25), 1643–1650. This seminal
The cost of installing an EHR in a large article compares the characteristics of manual
health system can exceed $100 million and and automated ambulatory patient record sys-
even $1 billion for the largest imple tems, discusses implementation issues, and
mentations.54 The cost of the system itself in predicts future developments in technology.
license fees and related items is usually only a Bates, D. W., Kuperman, G. J., Wang, S., et al.
portion of that number. Other costs include (2003). Ten commandments for effective clini-
configuration, training, and lost revenue as cal decision support: Making the practice of
care providers learn to use the system. The evidence-based medicine a reality. Journal of
benefits of such an investment are often the American Medical Informatics Association,
related to the integration of a health system’s 10(6), 523–530. The authors present ten very
diverse components into a single, coordinated practical tips to designers and installers of
enterprise. clinical decision support systems.
Berner, E. S. (Ed.). (2010). Clinical decision sup-
port systems, theory and practice: Health infor-
14.5.4 Leadership matics series (3rd ed.). New York: Springer.
This text focuses on the design, evaluation,
Leaders from all segments of the health care and application of Clinical Decision Support
industry must work together to articulate systems, and examines the impact of com-
the needs, to continue to define and expand puter-based diagnostic tools both from the
upon the standards, to fund the develop- practitioner’s and the patient’s perspectives. It
ment, to implement the social change, and is designed for informatics specialists, teachers
to write the laws to accelerate the develop- or students in health informatics, and clini-
ment and routine use of EHRs in health cians.
care. Because of the prominent role of the Collen, M. F. (1995). A history of medical infor-
matics in the United States, 1950–1990.
Indianapolis: American Medical Informatics
Association, Hartman Publishing. This rich
54 EHR Intelligence. Top 5 Most Expensive Imple-
history of medical informatics from the late
mentations of 2017. 7 https://ehrintelligence.com/
news/top-5-most-expensive-ehr-implementations- 1960s to the late 1980s includes an extremely
of-2017 (Accessed 6/5/2020). detailed set of references.
502 G. B. Melton et al.
Collen, M. F., & Ball, M. J. (Eds.). (2015). The his- Scottsdale: Scottsdale Institute, AMIA,
tory of medical informatics in the United States AMDIS and SHM. This text provides guid-
(2nd ed.). London: Springer-Verlag, Springer ance on using clinical decision support inter-
Nature. This rich history of medical informat- ventions to improve care delivery and
ics from the 1990s to mid 2010s (25 years) pro- outcomes in a hospital, health system or phy-
vides an updated medical informatics sician practice. The book also presents consid-
perspective. It includes an extremely detailed erations for health IT software suppliers to
set of references. effectively support their CDS implementer cli-
Hartley, C. P., & Jones, E. D. (2011). EHR imple- ents.
mentation: A step-by-step guide for the medical Sittig, D. F., & Ash, J. S. (2011). Clinical informa-
practice (2nd ed.). Chicago: American Medical tion systems: Overcoming adverse conse-
Association. This book provides rich details quences (Jones and Bartlett series in
for implementing an EHR. It is a great biomedical informatics) (1st ed.). Burlington:
resource for anyone trying to learn about Jones and Bartlett Learning. This book
EHR deployments, covering topics related to explores the challenges and obstacles with
preparation, support, and implementation. implementation of clinical information sys-
Institute of Medicine (IOM) Roundtable on Value tems including the nine categories of unin-
and Science-Driven Health Care. (2011). tended adverse consequences with
Digital infrastructure for the learning health implementation and optimization of these
system: The foundation for continuous improve- systems as well as best practices.
ment in health and health care – workshop Weed, L. L. (1969). Medical records, medical eval-
series summary. Washington, DC: National uation and patient care: The problem-oriented
Academy Press. This report summarizes three record as a basic tool. Chicago: Year Book
workshops that presented new approaches to Medical Publishers. In this classic book, Weed
the construction of advanced medical record presents his plan for collecting and structuring
system that would gather the crucial data patient data to produce a problem-oriented
needed to improve the health care system. medical record.
Kuperman, G. J., Gardner, R. M., & Pryor, T. A.
(1991). The HELP system. Berlin/Heidelberg: ??Questions for Discussion
Springer-Verlag GmbH and Co. K. The HELP 1. What is the definition of an EHR?
(Health Evaluation through Logical What, then, is an EHR? What are five
Processing) system was a computerized hospi- advantages of an EHR over a
14 tal information system developed by the paper-based record? Name three
authors at the LDS Hospital at the University limitations of an EHR.
of Utah, USA. It provided clinical, hospital 2. What are the five functional compo-
administration and financial services through nents of an EHR? Think of the infor-
the use of a modular, integrated design. This mation systems used in health care
book thoroughly documents the HELP sys- institutions in which you work or that
tem. Chapters discuss the use of the HELP you have seen. Which of the compo-
system in intensive care units, the use of nents that you named do those systems
APACHE and APACHE II on the HELP sys- have? Which are missing? How do the
tem, various clinical applications and inactive missing elements limit the value to the
or experimental HELP system modules. clinicians or patients?
Although the HELP system has now been 3. Discuss three ways in which a computer
retired from routine use, it remains an impor- system can facilitate information trans-
tant example of several key issues in EHR fer between hospitals and ambulatory
implementation and use that continue in the care facilities, thus enhancing continu-
commercial systems of today.. ity of care for previously hospitalized
Osheroff, J., Teich, J., Levick, D., et al. (2012). patients who have been discharged and
Improving outcomes with clinical decision sup- are now being followed up by their pri-
port: An implementers guide (2nd ed.). mary physicians.
Electronic Health Records
503 14
4. Much of medical care today is prac- laboratories traditionally provide sum-
ticed in teams, and coordinating the mary test results in flowsheet format,
care delivered by teams is a major chal- thus highlighting clinically important
lenge. Thinking in terms of the EHR changes over time. A medical record
functional components, describe four system that contains information for
ways that EHRs can facilitate care patients who have chronic diseases
coordination. Describe two ways in must present serial clinical observa-
which EHRs are likely to create addi- tions, history information, and medica-
tional challenges in care coordination. tions, as well as laboratory test results.
5. How does the health care financing envi- Suggest a suitable format for presenting
ronment affect the use, costs, and bene- the information collected during a
fits of an EHR? How has the financing series of ambulatory-care patient visits.
environment affected the functionality 12. The public demands that the
of information systems? How has it confidentiality of patient data must
affected the user population? be maintained in any patient record
6. Would a computer scan of a paper- system. Describe three protections
based record be an EHR? What are two and auditing methods that can be
advantages and two limitations of this applied to paper-based systems.
approach? Describe three technical and three
7. Among the key issues for designing an nontechnical measures you would
EHR are what information should be like to see applied to ensure the
captured and how can it be entered into confidentiality of patient data in
the system. Physicians may enter data an EHR. How do the risks of
directly or may record data on a paper privacy breaches differ for the two
worksheet (encounter form) for later systems?
transcription by a data-entry worker.
What are two advantages and two dis-
advantages of each method? Discuss References
the relative advantages and disadvan-
tages of entry of free text instead of Anand, V., McKee, S., Dugan, T. M., & Downs, S. M.
entry of fully coded information. (2017). Leveraging electronic tablets for general
pediatric care: A pilot study. Applied Clinical
Describe an intermediate or compro-
Informatics, 6(1), 1–15.
mise method. Anand, V., Carroll, A. E., Biondich, P. G., Dugan, T. M.,
8. EHR data may be used in clinical & Downs, S. M. (2018). Pediatric decision support
research, quality improvement, and using adapted Arden Syntax. Artificial Intelligence
monitoring the health of populations. in Medicine, 92, 15–23.
Arsoniadis, E. G., Skube, S. J., Bjerke, T. M., Jarabek,
Describe three ways that the design of
B., & Melton, G. B. (2017). Assessing provider-
the EHR may affect how the data may generated free-text quality in EHR-integrated
be used for other purposes. handoff notes. Studies in Health Technology and
9. Identify four locations where clinicians Informatics, 245, 999–1003.
need access to the information con- Barnett, G. O. (1984). The application of computer-
based medical-record systems in ambulatory prac-
tained in an EHR. What are the major
tice. The New England Journal of Medicine, 310(25),
costs or risks of providing access from 1643–1650.
each of these locations? Barnett, G. O., Justice, N. S., Somand, M. E., et al.
10. What are three important reasons to (1979). COSTAR – a computer-based medical infor-
have physicians enter orders directly mation system for ambulatory care. Proceedings of
the Institute of Electrical and Electronics Engineers
into an EHR? What are three chal-
(IEEE), 67, 1226–1237.
lenges in implementing such a system? Bates, D. W., Spell, N., Cullen, D. J., et al. (1997). The
11. Consider the task of creating a sum- costs of adverse drug events in hospitalized patients.
mary report for clinical data collected Journal of the American Medical Association,
over time and stored in an EHR. Clinical 277(4), 307–311.
504 G. B. Melton et al.
Bates, D. W., Evans, R. S., Murff, H., et al. (2003). ogy to reduce inappropriate medication use and
Detecting adverse events using information technol- polypharmacy in older people: A review of cur-
ogy. Journal of the American Medical Informatics rent evidence. Clinical Geriatric Medicine, 28(2),
Association: JAMIA, 10(2), 115–128. 301–322.
Bauer, D. T., Guerlain, S., & Brown, P. J. (2010). Coffey, R. (1979). How medical information systems
The design and evaluation of a graphical display affect costs: The El Camino experience. Washington,
for laboratory data. Journal of the American DC: Public Health Service. Publication DHEW-
Medical Informatics Association: JAMIA, 17(4), PHS-80-3626.
416–424. Collen, M. F. (1969). Value of multiphasic health check-
Bleich, H. L., Beckley, R. F., Horowitz, G. L., Jackson, ups. The New England Journal of Medicine, 280(19),
J. D., Moody, E. S., Franklin, C., Goodman, S. R., 1072–1073.
McKay, M. W., Pope, R. A., Walden, T., et al. (1985). Collen, M. F. (1995). A history of medical informatics in
Clinical computing in a teaching hospital. The New the United States, 1950 to 1990. Bethesda: American
England Journal of Medicine, 312(12), 756–764. Medical Informatics Association.
‘Brain’ to Store Medical Data. (1956, October 24). The Covell, D., Uman, G., & Manning, P. (1985). Information
New York Times. needs in office practice: Are they being met? Annals
Brown, S. H., Lincoln, M. J., Groen, P. J., et al. (2003). of Internal Medicine, 103, 596–599.
VistA–U.S. Department of Veterans Affairs Cusack, C. M., Hripcsak, G., Bloomrosen, M.,
national-scale HIS. International Journal of Medical Rosenbloom, T., Weaver, C., Wright, A., Vawdrey,
Informatics, 69(2–3), 135–156. D., Walker, J., & Mamykina, L. (2013). The future
Brown, S. H., Fischietti, L. F., Graham, G., Bates, J., state of clinical data capture and documentation: A
et al. (2007). Use of electronic health records in report from AMIA’s 2011 Policy Meeting. Journal
disaster response: The experience of department of of the American Medical Informatics Association:
veterans affairs after hurricane Katrina. American JAMIA, 20, 134–140.
Journal of Public Health, 97, S136–S141. Damiani, G., Pinnarelli, L., Colosimo, S. C., Almiento,
Brownstein, J. S., Sordo, M., Kohane, I. S., & Mandl, R., Sicuro, L., Galasso, R., Sommello, L., &
K. D. (2007). The tell-tale heart: Population-based Ricciardi, W. (2010). The effectiveness of comput-
surveillance reveals an association of rofecoxib and erized clinical guidelines in the process of care: A
celecoxib with myocardial infarction. PLoS One, systematic review. BMC Health Services Research,
2(9), e840. 10, 2.
Buchanan, J. (2017). Accelerating the benefits of the Damush, T. M., Weinberger, M., Clark, D. O., Tierney,
problem oriented medical record. Applied Clinical W. M., Rao, J. K., Perkins, S. M., & Verel, K. (2002).
Informatics, 8(1), 180. Acute low back pain self management intervention
Buntin, M. B., Burke, M. F., Hoaglin, M. C., & for urban primary care patients: Rationale, design,
Blumenthal, D. (2011). The benefits of health infor- and predictors of participation. Arthritis and
mation technology: A review of the recent literature Rheumatism, 47, 372–379.
shows predominantly positive results. Health Affairs Davis, L. S., Collen, M. F., Rubin, F. L., & Van Brunt,
(Millwood), 30(3), 464–471. E. E. (1968). Computer stored medical record.
14 Burykin, A., Peck, T., & Buchman, T.G. (2011). Using Computer Biomedical Research, 1, 452–469.
“off-the-shelf ” tools for terabyte-scale waveform Del Fiol, G., Huser, V., Strasberg, H. R., et al. (2012).
recording in intensive care: computer system Implementations of the HL7 context-aware knowl-
design, database description and lessons learned. edge retrieval (“infobutton”) standard: Challenges,
In Computational Methods Programs Biomed. strengths, limitations, and uptake. Journal of
103(3) (pp. 151–60). https://doi.org/10.1016/j. Biomedical Informatics, 45(4), 726–735.
cmpb.2010.10.004. Delvaux, N., Van Thienen, K., Heselmans, A., et al.
Chaudhry, B., Wang, J., Wu, S., Maglione, M., Mojica, (2017). The effects of computerized clinical decision
W., Roth, E., Morton, S. C., et al. (2006). Systematic support systems on laboratory test ordering: A sys-
review: Impact of health information technology on tematic review. Archives of Pathology & Laboratory
quality, efficiency, and costs of medical care. Annals Medicine, 141(4), 585–595.
of Internal Medicine, 144(10), 742–752. Dexter, P. R., Perkins, S., Overhage, J. M., et al. (2001).
Cheung, N. T., Fung, K. W., Wong, K. C., et al. (2001). A computerized reminder system to increase the
Medical informatics – the state of the art in the use of preventive care for hospitalized patients. The
hospital authority. International Journal of Medical New England Journal of Medicine, 345(13), 965–970.
Informatics, 62, 113–119. Dexter, P. R., Perkins, S. M., Maharry, K. S., Jones,
Classen, D. C., Pestotnik, S. L., Evans, R. S., Lloyd, J. F., K., & McDonald, C. J. (2004). Inpatient computer-
& Burke, J. P. (1997). Adverse drug events in hospi- based standing orders vs. physician reminders to
talized patients. Excess length of stay, extra costs, increase influenza and pneumococcal vaccination
and attributable mortality. Journal of the American rates. Journal of the American Medical Association,
Medical Association, 277(4), 301–306. 292(19), 2366–2371.
Clyne, B., Bradley, M. C., Hughes, C., et al. (2012). Downs, S. M., Biondich, P. G., Anand, V., Zore, M.,
Electronic prescribing and other forms of technol- & Carroll, A. E. (2006). Using Arden syntax and
Electronic Health Records
505 14
adaptive turnaround documents to evaluate clinical Informatics Association annual symposium proceed-
guidelines. In AMIA annual symposium proceedings, ings, 2008 (pp. 212–216).
2006 (pp. 214–218). Frisse, M. E., Johnson, K., Nian, H., et al. (2011). The
Duke, J. D., Morea, J., Mamlin, B., Martin, D. K., financial impact of health information exchange
Simonaitis, L., Takesue, B. Y., Dixon, B. E., & on emergency department care. Journal of the
Dexter, P. R. (2014). Regenstrief Institute’s medical American Medical Informatics Association: JAMIA,
gopher: A next-generation homegrown electronic 19(3), 328–333.
medical record system. International Journal of Fry, E., & Schulte, F. (2019, March 18). Death by a
Medical Informatic, 83(3), 170–179. thousand clicks: Where electronic health records
Duncan, R. G., Saperia, D., Dulbandzhyan, R., et al. went wrong. Fortune Magazine.
(2001). Integrated web-based viewing and secure Fung, K. W., Xu, J., Rosenbloom, S. T., et al. (2011).
remote access to a clinical data repository and Testing three problem list terminologies in a simu-
diverse clinical systems. In Proceedings of the AMIA lated data entry environment. In American Medical
fall symposium, 2001 (pp. 149–153). Informatics Association annual symposium proceeds,
Earle, K. (2011). In people with poorly controlled hyper- 2011 (pp. 445–454).
tension, self-management including telemonitoring Gardner, R. L., Cooper, E., Haskell, J., et al. (2019).
is more effective than usual care for reducing sys- Physician stress and burnout: The impact of health
tolic blood pressure at 6 and 12 months. Evidence- information technology. JAMIA, 26(2), 106–114.
Based Medicine, 16(1), 17–18. Garg, A. X., Adhikari, N. K. J., McDonald, H., et al.
Edgar, E. P. (2009). Physician retention in Army Medical (2005). Effects of computerized clinical decision
Department. Strategic Research Project, U.S. Army support systems on practitioner performance and
War College, Carlisle Barracks PA 17013–5050. patient outcomes: A systematic review. Journal of the
Available at: http://handle.dtic.mil/100.2/ American Medical Association, 293(10), 1223–1238.
ADA499087 Giuse, D. A., & Mickish, A. (1996). Increasing the
Evans, R. S., Pestotnik, S. L., Classen, D. C., Clemmer, availability of the computerized patient record. In
T. P., Weaver, L. K., Orme, J. F., Lloyd, J. F., & Proceedings of the AMIA annual fall symposium,
Burke, J. P. (1998). A computer-assisted manage- 1996 (pp. 633–637).
ment program for antibiotics and other antinfec- Gottschalk, A., & Flocke, S. A. (2005). Time spent
tive agents. The New England Journal of Medicine, in face-to-face patient care and work outside the
338(4), 232–238. examination room. Annals of Family Medicine, 3(6),
Fafchamps, D., Young, C. Y., & Tang, P. C. (1991). 488–493.
Modelling work practices: Input to the design of a Green, B. B., Cook, A. J., Ralston, J. D., et al. (2008).
physician’s workstation. In Proceedings of the 15th Effectiveness of home blood pressure monitor-
annual symposium on computer applications in medi- ing, web communication, and pharmacist care on
cal care (pp. 788–792). Washington, DC: CAMC. hypertension control: A randomized controlled
Ferranti, J. M., Musser, R. C., Kawamoto, K., & trial. Journal of the American Medical Association,
Hammond, W. E. (2006). The clinical document 299(24), 2857–2867.
architecture and the continuity of care record: A Grossman, J. H., Barnett, G. O., Koepsell, T. D., Nesson,
critical analysis. Journal of the American Medical H. R., Dorsey, J. L., & Phillips, R. R. (1973). An
Informatics Association: JAMIA, 13(3), 245–252. automated medical record system. Journal of the
Fitzmaurice, J. M., Adams, K., & Eisenberg, J. M. (2002). American Medical Association, 224(12), 1616–1621.
Three decades of research on computer applications Gulacti, U., & Lok, U. (2017). Comparison of secure mes-
in health care: Medical informatics support at the saging application (WhatsApp) and standard tele-
agency for healthcare research and quality. Journal phone usage for consultations on Length of Stay in
of the American Medical Informatics Association: the ED. A prospective randomized controlled study.
JAMIA, 9(2), 144–160. Applied Clinical Informatics, 8(3), 742–753.
Flexner, A. (1910). Medical education in the united Hagen, P. T., Turner, D., Daniels, L., & Joyce, D. (1998).
states and Canada: A report to the Carnegie foun- Very large-scale distributed scanning solution for
dation for the advancement of teaching. New York: automated entry of patient information. TEPR
The Carnegie Foundation for the Advancement of Proceedings (Toward an ElectronicPatient Record),
Teaching. OCLC 9795002. 1, 228–232.
Fried, T. R., Niehoff, K. M., Street, R. L., et al. (2017). Halamka, J. D., & Safran, C. (1998). CareWeb: A web-
Effect of the tool to reduce inappropriate medica- based medical record for an integrated healthcare
tions on medication communication and depre- delivery system. In Proceedings of Medinfo 1998,
scribing. Journal of the American Geriatrics Society, Part 1 (pp. 36–39).
65(10), 2265–2271. Haynes, R. B. (Ed.). (2011). Computerized clini-
Fries, J. F. (1974). Alternatives in medical record for- cal decision support systems: How effective
mats. Medical Care, 12(10), 871–881. are they? [article collection] Implementation
Frisse, M. E., King, J. K., Rice, W. B., et al. (2008). A Science, 6, 87–108. Articles available at: http://
regional health information exchange: Architecture www.i mplementationscience.c om/series/CCDSS.
and implementation. In American Medical Accessed 2 Jan 2012.
506 G. B. Melton et al.
Health IT Standards Committee. (2011). Koshy, S., Feustel, P. J., Hong, M., & Kogan, B. A.
Recommendations to ONC on the assignment (2010). Scribes in an ambulatory urology prac-
of code sets to clinical concepts [data elements] tice: Patient and physician satisfaction. Journal of
for use in quality measures. [Letter]. Retrieved Urology, 184(1), 258–262.
from http://www.healthit.gov/sites/default/files/ Kuhn, I. M., Wiederhold, G., Rodnick, J.E., et al. (1984).
standards-certification/HITSC_CQMWG_VTF_ Automated ambulatory medical record systems in
Transmit_090911.pdf. Accessed 3 Jan 2012. the U.S. In Blum, B. (Ed.), Information systems
Hirsch, J. S., Tanenbaum, J. S., Lipsky Gorman, S., Liu, for patient care (pp.199–217), New York: Springer.
C., Schmitz, E., Hashorva, D., Ervits, A., Vawdrey, Original 1982 report available at: http://infolab.
D., Sturm, M., & Elhadad, N. (2015). HARVEST, stanford.edu/TR/CS-TR-82-928.html. Accessed 2
a longitudinal patient record summarizer. Journal Jan 2012.
of the American Medical Informatics Association: Lamb, A. (1955). The Presbyterian Hospital and the
JAMIA, 22, 263–274. Columbia-Presbyterian Medical Center, 1868–1943:
Hollingsworth, J. C., Chisholm, C. D., Giles, B. K., A history of a great medical adventure. New York:
Cordell, W. H., & Nelson, D. R. (1998). How do Columbia University Press.
physicians and nurses spend their time in the emer- Larsen, R. A., Evans, R. S., Burke, J. P., Pestotnik,
gency department? Annals of Emergency Medicine, S. L., Gardner, R. M., & Classen, D. C. (1989).
31, 87–91. Improved perioperative antibiotic use and reduced
Hripcsak, G., Cimino, J. J., & Sengupta, S. (1999). surgical wound infections through use of computer
WebCIS: Large scale deployment of a web-based decision analysis. Infection Control and Hospital
clinical information system. In Proceedings of the Epidemiology, 10(7), 316–320.
annual AMIA symposium, 1999 (pp. 804–808). Lau, F., Kuziemsky, C., Price, M., & Gardner, J. (2010).
Hunt, D. L., Haynes, R. B., et al. (1998). Effects of A review on systematic reviews of health informa-
computer- based clinical decision support systems tion system studies. Journal of the American Medical
on physician performance and patient outcomes: Informatics Association: JAMIA, 17(6), 637–645.
A systematic review. JAMA : The Journal of the Longhurst, C. A., Harrington, R. A., & Shah, N. H. (2014).
American Medical Association, 280(15), 1339–1346. A ‘green button’ for using aggregate patient data at the
Institute of Medicine Committee on Improving the point of care. Health Affairs, 33(7), 1229–1235.
Patient Record. (2001). Crossing the quality Major, K., Shabot, M. M., & Cunneen, S. (2002).
chasm: A new health system for the 21st century. Wireless clinical alerts and patient outcomes in the
Washington, DC: National Academy Press. Available surgical intensive care unit. American Surgery, 68,
at: http://www.iom.edu/Reports/2001/Crossing-the- 1057–1060.
Quality-Chasm-A-New-Health-System-for-the- Mamlin BW, Overhage JM, Tierney WM, Dexter PR,
21st-Century.aspx. Accessed 2 Jan 2012. McDonald CJ. Clinical Decision Support Within
Janamanchi, B., Katsamakas, E., Raghupathi, W., & the Regenstrief Medical Record System. Book
Gao, W. (2009). The state and profile of open source chapter - Clinical Decision Support Systems -Theory
software projects in health and medical informat- and Practice - Series: Health Informatics, Berner,
ics. International Journal of Medical Informatics, Eta S. (Ed.) 2nd ed., 2007, 190–21.
14 78(2009), 457–472. Mandel, J. C., Kreda, D. A., Mandl, K. D., Kohane,
Johnson, S. B., Friedman, C., Cimino, J. J., Clark, T., I. S., & Ramoni, R. B. (2016). SMART on FHIR:
Hripcsak, G., & Clayton, P. D. (1991). Conceptual A standards-based, interoperable apps platform for
data model for a central patient database. In electronic health records. Journal of the American
Proceedings of the symposium on computer applica- Medical Informatics Association, 23(5), 899–908.
tions in medical care (pp. 381–385). Mandl, K. D., Gottlieb, D., & Ellis, A. (2019). Beyond
Just, B. H., Marc, D., Munns, M., et al. (2016). Why one-off integrations: A commercial, substitutable,
patient matching is a challenge: Research on mas- reusable, standards-based, electronic health record–
ter patient index (MPI) data discrepancies in key connected app. Journal of Medical Internet Research,
identifying fields. Perspectives in Health Information 21(2), e12902. https://doi.org/10.2196/12902.
Management, 13(Spring), 1e. McDonald, C. J. (1973, November) The clinical automa-
Khajouei, R., & Jaspers, M. W. (2010). The impact of ton—An ambulatory care record with control capa-
CPOE medication systems’ design aspects on usability, bilities. In Proceedings of the IEEE Conf Cyber Soc.
workflow and medication orders: A systematic review. (pp. 27–31).
Methods of Information in Medicine, 49(1), 3–19. McDonald, C. J. (1976). Protocol-based computer
Kho, A., Zafar, A., & Tierney, W. (2007). Information reminders. The quality of care and the non-
technology in PBRNs: The Indiana University perfectibility of man. The New England Journal
Medical Group Research Network (IUMG ResNet) of Medicine, 295(24), 1351–1355. https://doi.
experience. The Journal of the American Board of org/10.1056/NEJM197612092952405.
Family Medicine, 20(2), 196–203. McDonald, C. J., Bhargava, B., & Jeris, D. W. (1975).
Kohane, I. S. (2011). Using electronic health records to A clinical information system (CIS) for ambulatory
drive discovery in disease genomics. Nature Reviews care. In Proceedings AFIPS national computing con-
Genetics, 12(6), 417–428. ference, Anaheim.
Electronic Health Records
507 14
McDonald, C. J., Overhage, J. M., Tierney, W. M., et al. Report and recommendations from the National
(1999). The regenstrief medical record system: A Committee on Vital and Health Statistics. Available
quarter century experience. International Journal of at http://www.ncvhs.hhs.gov/nhiilayo.pdf. Accessed
Medical Informatics, 54(3), 225–253. 17 Dec 2012.
McDonald, C. J., Huff, S. M., Suico, J. G., et al. (2003). Nebeker, J. R., Hoffman, J. M., Weir, C. R., Bennett,
LOINC, a universal standard for identifying C. L., & Hurdle, J. F. (2005). High rates of adverse
laboratory observations: A 5-year update. Clinical drug events in a highly computerized hospital.
Chemistry, 49(4), 624–633. Archives of Internal Medicine, 165(10), 1111–1116.
McDonald, C. J., Overhage, J. M., Barnes, M., Schadow, Openchowski, M. W. (1925). The effect of the unit
G., Blevins, L., Dexter, P. R., & Mamlin, B. W. (2005). record system and improved organization on hos-
The Indiana network for patient care: A working pital economy and efficiency. Archives of Surgery,
local health information infrastructure (LHII). 10(3), 925–934.
Health Affairs (Millwood), 24(5), 1214–1220. Overhage JM, Tierney WM, Zhou XH, McDonald
McDonald CJ, Callaghan FM, Weissman A, Goodwin CJ. A Randomized Trial of “Corollary Orders”
RM, Mundkur M, Kuhn T. Use of Internist’s to Prevent Errors of Omission. J Am Med Inform
Free Time by Ambulatory Care Electronic Assoc. 1997;4(5):364–75. PMID: 9292842.
Medical Record Systems. JAMA Intern Med. Ozdas, A., Miller, R. A., & Waitman, L. R. (2008). Care
2014;174(11):1860–1863. https://doi.org/10.1001/ Provider Order Entry (CPOE): A perspective on
jamainternmed.2014.4506. factors leading to success or to failure. Yearbook of
McPhee, S. J., Bird, J. A., Fordham, D., Rodnick, J. E., Medical Informatics, 2007, 128–137. Review. Erratum
& Osborn, E. H. (1991). Promoting cancer preven- in: Yearbook of Medical Informatics, 2008, 19.
tion activities by primary care physicians: Results Parshuram, C. S., Dryden-Palmer, K., Farrell, C., et al.
of a randomized, controlled trial. Journal of the (2018). Effect of a pediatric early warning system
American Medical Association, 266(4), 538–544. on all-cause mortality in hospitalized pediatric
Melton, L. J., 3rd. (1996). History of the Rochester epi- patients: The EPOCH randomized clinical trial.
demiology project. Mayo Clinic Proceedings, 71, JAMA, 319(10), 1002–1012.
266–274. Pestotnik, S. L. (2005). Expert clinical decision support
Miller, R. A., & Gardner, R. M. (1997). systems to enhance antimicrobial stewardship pro-
Recommendations for responsible monitoring and grams: Insights from the Society of Infectious Diseases
regulation of clinical software systems. Journal Pharmacists. Pharmacotherapy, 25(8), 1116–1125.
of the American Medical Informatics Association: Phansalkar, S., Desai, A. A., Bell, D., et al. (2012a).
JAMIA, 4, 442–457. High-priority drug-drug interactions for use in
Miller, R. A., Gardner, R. M., Johnson, K. B., & electronic health records. Journal of the American
Hripcsak, G. (2005a). Clinical decision support and Medical Informatics Association: JAMIA, 19(5),
electronic prescribing systems: A time for respon- 735–743.
sible thought and action. Journal of the American Phansalkar, S., van der Sijs, H., Tucker, A. D., et al.
Medical Informatics Association: JAMIA, 12(4), (2012b). Drug-drug interactions that should be non-
403–409. interruptive in order to reduce alert fatigue in elec-
Miller, R. A., Waitman, L. R., Chen, S., & Rosenbloom, tronic health records. Journal of the American Medical
S. T. (2005b). The anatomy of decision support dur- Informatics Association: JAMIA, 20(3), 489–493.
ing inpatient care provider order entry (CPOE): Poissant, L., Pereira, J., Tamblyn, R., & Kawasumi,
Empirical observations from a decade of CPOE Y. (2005, September 1). The impact of electronic
experience at Vanderbilt. Journal of Biomedical health records on time efficiency of physicians and
Informatics, 38(6), 469–485. nurses: A systematic review. Journal of the American
Misra-Hebert, A. D., et al. (2016, March). Medical Medical Informatics Association, 12(5), 505–516.
scribes: How do their notes stack up? Journal of Poon, E. G., Kuperman, G. J., Fiskio, J., & Bates, D. W.
Family Practice, 155+. Gale Academic Onefile, (2002). Real-time notification of laboratory data
Accessed 14 Oct 2019. requested by users through alphanumeric pag-
Muth, C., Ulhmann, L., Haefeli, W. E., et al. (2018). ers. Journal of the American Medical Informatics
Effectiveness of a complex intervention on Association: JAMIA, 9, 217–222.
Prioritising Multimedication in Multimorbidity Pryor, T.A. (1988) The HELP medical record system. In
(PRIMUM) in primary care: Results of a pragmat- MD Computing. 5(5). (pp. 22–33). PMID: 3231033.
ics cluster randomised controlled trial. BMJ Open, Przvbvlo, J. A., Wang, A., Loftus, P., et al. (2014).
8(2), e017740. Smarter hospital communication: Secure smart-
Nadkarni, P. M., Ohno-Machado, L., & Chapman, phone text messaging improves provider satisfac-
W. W. (2011). Natural language processing: An tion and perception of efficacy, workflow. Journal of
introduction. Journal of the American Medical Hospital Medicine, 9(9), 573–578.
Informatics Association: JAMIA, 18(5), 544–551. Ranta, A., Dovey, S., Weatherall, M., et al. (2015).
National Committee on Vital and Health Statistics. Cluster randomized controlled trial of TIA elec-
(2001). Information for health: A strategy for build- tronic decision support in primary care. Neurology,
ing the National Health Information Infrastructure. 84(15), 1545–1551.
508 G. B. Melton et al.
Reiser, S. (1991). The clinical record in medicine. Part 1: oriented record (STOR)—a progress report. In
Learning from cases. Annals of Internal Medicine, Proceedings of the annual symposium on computer
114(10), 902–907. application in Medical Care (p. 100).
Rief, J. J., Hamm, M. E., Zickmund, S. L., et al. (2017). Simonaitis, L., Belsito, A., Warvel, J., Hui, S., &
Using health information technology to Foster McDonald, C. J. (2006). Extensible stylesheet
engagement: Patients’ experiences with an active language formatting objects (XSL-FO): A tool
patient health record. Health Communications, to transform patient data into attractive clinical
32(3), 310–319. reports. In AMIA annual symposium proceedings,
Rind, D. M., Safran, C., Phillips, R. S., Wang, Q., 2006 (pp. 719–723).
Calkins, D. R., Delbanco, T. L., Bleich, H. L., & Sinsky, C., Colligan, L., Li, L., Prgomet, M., Reynolds,
Slack, W. V. (1994). Effect of computer-based alerts S., Goeders, L., Westbrook, J., Tutty, M., & Blike,
on the treatment and outcomes of hospitalized G. (2016). Allocation of physician time in ambula-
patients. Archives of Internal Medicine, 154(13), tory practice: A time and motion study in 4 special-
1511–1517. ties. Annals of Internal Medicine, 2165(11), 753–760.
Roden, D. M., Pulley, J. M., Basford, M. A., Bernard, Slack, W. V., & Bleich, H. L. (1999). The CCC sys-
G. R., Clayton, E. W., Balser, J. R., & Masys, D. R. tem in two teaching hospitals: A progress report.
(2008, September). Development of a large-scale International Journal of Medical Informatics, 54(3),
de-identified DNA biobank to enable personalized 183–196.
medicine. Clinical Pharmacology and Therapeutics, Starren, J., & Johnson, S. B. (2000). An object-oriented
84(3), 362–369. taxonomy of medical data presentations. Journal
Romano, M. J., & Stafford, R. S. (2011). Electronic of the American Medical Informatics Association:
health records and clinical decision support sys- JAMIA, 7(1), 1–20.
tems: Impact on national ambulatory care quality. Stead, W.W., Brame, R.G., Hammond, W.E., et. al.
Archives of Internal Medicine, 171(10), 897–903. (1977). A computerized obstetric medical record. In
Rosenbloom, S. T., Geissbuhler, A. J., Dupont, W. D., Obstetrics & Gynecology. 49(4). (pp. 502-9). PMID
Giuse, D. A., Talbert, D. A., Tierney, W. M., 854253.
Plummer, W. D., Stead, W. W., & Miller, R. A. Stead, W. W., & Hammond, W. E. (1988). Computer-
(2005). Effect of CPOE user interface design on based medical records: The centerpiece of TMR.
user-initiated access to educational and patient MD Computing, 5(5), 48–62.
information during clinical care. Journal of the Stein, D. M., Vawdrey, D. K., Stetson, P. D., & Bakken,
American Medical Informatics Association: JAMIA, S. (2010). An analysis of team checklists in physi-
12(4), 458–473. cian signout notes. In American Medical Informatics
Samantaray, R., Njoku, V. O., Brunner, J. W., Raghavan, Association annual symposium proceedings (pp. 767–
V., Kendall, M. L., & Shih, S. C. (2011). Promoting 771).
electronic health record adoption among small Strom, B. L., Schinnar, R., Aberra, F., Bilker, W.,
independent primary care practices. The American Hennessy, S., Leonard, C. E., & Pifer, E. (2010).
Journal of Managed Care, 17(5), 353–358. Unintended effects of a computerized physician
Schedlbauer, A., Prasad, V., Mulvaney, C., Phansalkar, order entry nearly hard-stop alert to prevent a drug
14 S., Stanton, W., Bates, D. W., & Avery, A. J. (2009). interaction: A randomized controlled trial. Archives
What evidence supports the use of computer- of Internal Medicine, 170(17), 1578–1583.
ized alerts and prompts to improve clinicians’ Syzdykkova, A., Malta, A., Zolfo, M., et al. (2017).
prescribing behavior? Journal of the American Open-source electronic health record systems for
Medical Informatics Association: JAMIA, 16(4), low-resource settings: Systematic review. JMIR
531–538. Medical Informatics, 5(4), e44.
Schultz, J. R., Cantrill, S. V., & Morgan, K. G. (1971). Tajmir, S., Raja, A. S., Ip, I. K., et al. (2017). Impact of
AFIPS Conference Proceedings, 38, 239–264. clinical decision support on radiography for acute
Schwartz, R. J., Weiss, K. M., & Buchanan, A. V. (1985). ankle injuries: A randomized trial. The Western
Error control in medical data. MD Computing, 2(2), Journal of Emergency Medicine, 18(3), 487–495.
19–25. Tang, P. C. (2003). Key capabilities of an electronic
Sherifali, D., Greb, J. L., Amirthavasar, G., Hunt, D., health record system (letter report). Committee on
Haynes, R. B., Harper, W., Holbrook, A., Capes, Data Standards for Patient Safety. Board on Health
S., Goeree, R., O’Reilly, D., Pullenayegum, E., Care Services, Institute of Medicine.
& Gerstein, H. C. (2011). Effect of computer- Tang, P. C., & Patel, V. L. (1994). Major issues in
generated tailored feedback on glycemic control in user interface design for health professional
people with diabetes in the community: A random- workstations: Summary and recommendations.
ized controlled trial. Diabetes Care, 34(8), 1794– International Journal of Biomedical Computing,
1798. 34(104), 130–148.
Simborg, D. W., & Whiting-O’Keefe, Q. E. III. (1981, Tang, P. C., Annevelink, J., Suermondt, H. J., & Young,
November 4). The Physician’s Office and Group C. Y. (1994a). Semantic integration in a physician’s
practice—ambulatory medical record systems: A workstation. International Journal of Bio-Medical
automated medical record systems: Summary time Computing, 35(1), 47–60.
Electronic Health Records
509 14
Tang, P. C., Fafchamps, D., & Shortliffe, E. H. (1994b). American Medical Informatics Association annual
Traditional medical records as a source of clinical symposium proceedings, 2010 (pp. 832–836).
data in the outpatient setting. In Proceedings of the Wang, A.Y., Sable, J.H., & Spackman, K.A. (2002).
annual symposium computer applications in Medical The SNOMED clinical terms development process:
Care (pp. 575–579). refinement and analysis of content. In American
Tang, P. C., Marquardt, W. C., Boggs, B., et al. (1999). Medical Informatics Association annual sympo-
NetReach: Building a clinical infrastructure for the sium proceedings, 2002 (pp. 845–9). PMCID:
enterprise. In J. M. Overhage (Ed.), Fourth annual PMC2244575.
proceedings of the Davies CPR recognition sympo- Warner, H. R. (1972). A computer-based patient infor-
sium (pp. 25–68). Chicago: McGraw-Hill. mation system for patient care. In G. A. Bekey &
Tang, P. C., Ralston, M., Arrigotti, M. F., Qureshi, L., M. D. Schwartz (Eds.), Hospital information systems
& Graham, J. (2007). Comparison of methodologies (pp. 293–332). New York: Marcel Dekker.
for calculating quality measures based on adminis- Weir, C. R., Hammond, K. W., Embi, P. J., et al. (2011).
trative data versus clinical data from an electronic An exploration of the impact of computerized
health record system: Implications for perfor- patient documentation on clinical collaboration.
mance measures. Journal of the American Medical International Journal of Medical Informatics, 80(8),
Informatics Association: JAMIA, 14, 10–15. e62–e71.
Teich, J. M., Glaser, J. P., Beckley, R. F., et al. (1999). Willett, D. L., Kannan, V., Chu, L., Buchanan, J. R.,
The Brigham integrated computing system (BICS): Velasco, F. T., Clark, J. D., Fish, J. S., Ortuzar,
Advanced clinical systems in an academic hospi- A. R., Youngblood, J. E., Bhat, D. G., & Basit,
tal environment. International Journal of Medical M. A. (2018). SNOMET CT concept hierarching
Informatics, 54(3), 197–208. for sharing definitions of clinical conditions using
Teich, J. M., Merchia, P. R., Schmiz, J. L., et al. (2000). electronic health record data. Applied Clinical
Effects of computerized physician order entry on Informatics, 9(3), 667.
prescribing practices. Archives of Internal Medicine, Wrenn, J. O., Stein, D. M., Bakken, S., & Stetson, P. D.
160(18), 2741–2747. (2010). Quantifying clinical narrative redundancy in
Tierney, W. M., Miller, M. E., Overhage, J. M., & an electronic health record. Journal of the American
McDonald, C. J. (1993). Physician inpatient order Medical Informatics Association: JAMIA, 17(1),
writing on microcomputer workstations: Effects 49–53.
on resource utilization. Journal of the American Zech, J., Husk, G., Moore, T., et al. (2016). Measuring
Medical Association, 269(3), 379–383. the degree of unmatched patient records in a health
Tufte, E. (2006). Beautiful evidence. Cheshire: Graphics information exchange using exact matching. Applied
Press. ISBN 978-0-9613921-7-8.. Clinical Informatics, 7(2), 330–340.
Vigoda, M. M., & Lubarsky, D. A. (2006). Failure to Zhang, R., Pakhomov, S. V., Lee, J. T., & Melton, G. B.
recognize loss of incoming data in an anesthesia (2014). Using language models to identify relevant
record-keeping system may have increased medical new information in inpatient clinical notes. In
liability. Anesthesia and Analgesia, 102, 1798–1802. American Medical Informatics Association annual
Vreeman, D. J., McDonald, C. J., & Huff, S. M. (2010). symposium proceedings (pp. 1268–1276).
Representing patient assessments in LOINC®. In
511 15
Health Information
Infrastructure
William A. Yasnoff
Contents
References – 539
(HIE) organizations have failed in the past blocking” in response to legitimate requests
few years, e.g., Washington (DC), Kansas, for patient records.
Tennessee, CalRHIO, and CareSpark
(Kingsport, TN).
Importantly, today no patients in the U.S. 15.2.2 Value in Patient Care
can be assured that wherever they seek care,
their comprehensive records from all sources The potential benefits of HII are both numer-
will be available to their provider. ous and substantial. Perhaps most important
The U.S. Congress has recognized the are error reduction and improved quality of
importance of comprehensive records and has care. Many studies have shown that the com-
mandated that the problem be solved in the plexity of present-day medical care results in
21st Century Cures Act, enacted in late 2016: very frequent errors of both omission and
55 “The Secretary shall use existing authorities commission (IOM 1999). The source of this
to encourage partnerships … with the goal problem was clearly articulated by Masys,
of offering patients access to their electronic who observed that current medical practice
health information in a single, longitudinal depends upon the “clinical decision-making
format that is easy to understand, secure, capacity and reliability of autonomous prac-
and may be updated automatically.” [… titioners for classes of problems that rou-
and …] tinely exceed the bounds of unaided human
55 “… promote policies that ensure that a cognition” (Masys 2002). Electronic health
patient’s electronic health information is information systems can contribute signifi-
accessible to that patient and the patient’s cantly to alleviating this problem by remind-
designees, in a manner that facilitates ing practitioners about recommended actions
communication with the patient’s health at the point of care. This can include both
care providers and other individuals, notifications of actions that may have been
including researchers, consistent with such missed and warnings about planned treat-
patient’s consent.”2 ments or procedures that may be harmful or
unnecessary. Literally dozens of research
The Final Rule implementing the 21st Century studies have shown that such reminders
Cures Act,3 reinforces these goals by requiring improve safety and reduce costs (Bates 2000).
application programming interfaces (APIs) In one such study, medication errors were
that allow all data elements of a patient’s elec- reduced by 55% (Bates et al. 1998). Another
tronic health record to be accessed, exchanged, study by the Rand Corporation showed that
and used without special effort and a more only 55% of U.S. adults were receiving rec-
15 granular approach to consent management. It ommended care (McGlynn et al. 2003). The
also requires that patients be allowed to access same techniques used to reduce medical
all of their electronic health record informa- errors with electronic health information sys-
tion at no cost and prohibits “information tems also contribute substantially to ensuring
that recommended care is provided. This is
becoming increasingly important as the pop-
2 21st Century Cures Act, H.R.34 — 114th Congress ulation ages and the prevalence of chronic
(2015–2016), Section 4006. Retrieval 29 Oct 2018: disease increases.
7 https://www.congress.gov/bill/114th-congress/
house-bill/34
Guidelines and reminders also can improve
3 21st Century Cures Act: Interoperability, the effectiveness of dissemination of new
Information Blocking and the ONC Health IT research results. Widespread dissemination of
Certification Program. Released May 1, 2020. new research to the clinical setting is very
Retrieval 23 May 2020: 7 https://www. slow; one study showed an average of 17 years
federalregister.gov/documents/2020/05/01/2020-
07419/21st-century-cures-act-interoperability-infor-
(Balas and Boren 2000). Patient-specific
mation-blocking-and-the-onc-health-it-certification reminders delivered at the point of care, high-
Health Information Infrastructure
515 15
lighting important new research results, could 7 Chap. 18). Not only would this be an
substantially accelerate this adoption rate. invaluable aid in early detection of bioter-
Another important contribution of HII to rorism, it would also serve to improve the
the research domain is improving the effi- detection of the much more common natu-
ciency of clinical trials. At present, most such rally occurring disease outbreaks. In fact,
trials require the creation of a unique infor- early results from a number of electronic
mation infrastructure to ensure protocol com- reporting demonstration projects show that
pliance and to collect essential research data. disease outbreaks can routinely be detected
With an effective HII, every practitioner sooner than was ever possible using the cur-
would have access to a fully functional and rent system (Overhage et al. 2001). While
comprehensive electronic health record (EHR) early detection has been shown to be a key
for each patient, so clinical trials could rou- factor in reducing morbidity and mortality
tinely be implemented through the dissemina- from bioterrorism (Kaufmann et al. 1997), it
tion of guidelines that specify the research will also be extremely helpful in reducing the
protocol. Data collection could occur auto- negative consequences from other disease
matically in the course of administering the outbreaks.
protocol, reducing time and costs. In addi- Although the U.S. Congress mandated the
tion, there would be substantial value in ana- creation of a national public health situational
lyzing de-identified aggregate data from awareness network in the Pandemic and All-
routine patient care to assess the outcomes of Hazards Preparedness Acts of both 2006 and
various treatments and monitor the health of 2013, the General Accounting Office has doc-
the population. umented that such a system has yet to be
Another critical function for HII is early deployed in two separate reports4,5 and has
detection of patterns of disease, particularly issued letters in both 2019 and 20206 high-
early detection of outbreaks from newly- lighting the failure of DHHS to implement
virulent microorganisms or possible bioter- the required capabilities. HII can in fact func-
rorism. Our current system of disease tion as an effective public health situational
surveillance, which primarily depends on alert awareness network by reporting and/or pro-
clinicians diagnosing and reporting unusual viding access to relevant disease events in near
conditions, is both slow and potentially unre- real time to public health authorities without
liable. These problems are illustrated by the necessity of creating a separate, duplica-
delayed detection of the anthrax attacks in tive, and expensive system solely for public
the Fall of 2001, when seven cases of cutane- health (Sittig and Singh 2020). For example,
ous anthrax in the New York City area 2
weeks before the so-called “index” case in
Florida went unreported (Lipton and Johnson 4 General Accounting Office Report 11-99 (2010)
2001). Since all the patients were seen by dif- Public Health information Technology: Additional
Strategic Planning Needed to Guide HHS’s Efforts
ferent clinicians, the pattern could not have
to Establish Situational Awareness Capabilities.
been evident to any of them even if the correct Retrieval 23 May 2020: 7 https://www.gao.gov/
diagnosis had immediately been made in every products/GAO-11-99
case. Wagner et al. described nine categories 5 General Accounting Office Report 17-377 (2017)
of requirements for surveillance systems for Public Health Information Technology: HHS Has
Made Little Progress toward Implementing
potential bioterrorism outbreaks—several
Enhanced Situational Awareness Network
categories must have immediate electronic Capabilities. Retrieval 23 May 2020: 7 https://
reporting to ensure early detection (Wagner www.gao.gov/products/GAO-17-377
et al. 2003). 6 General Accounting Office (2020) Priority Open
HII would allow immediate electronic Recommendations: Department of Health and
Human Services (April 23, 2020). Retrieval 23 May
reporting of both relevant clinical events and
2020: 7 https://www.gao.gov/assets/710/706568.
laboratory results to public health (see pdf
516 W. A. Yasnoff
Findings of a 2011 survey of HIEs “call into persistent concern among grantees.”11 The
question whether RHIOs in their current form latter challenge has since resulted in the shut-
can be self-sustaining and effective” (Adler- down of multiple projects.
Milstein et al. 2011). Although those few surviving HIEs have
Over the past few years, these concerns nearly all abandoned their original institution-
have been proven correct; HIE efforts have centric architectures in favor of patient-centric
largely floundered. In the 2013 “reboot” repositories (more on HII architecture in
report, a group of U.S. Senators criticized 7 Sect. 15.5 below), the predicted reductions
lack of a clear path to interoperability and in health care costs from the widespread use
sustainability, inadequate privacy and secu- of EHRs have not materialized primarily due
rity protections, and failure to achieve health to the incomplete nature of patient records
care cost reductions.8 That same year, DHHS available through current HIE systems. A
itself admitted that “[current policy] alone 2017 study observed “To date, only a small
will not be enough to achieve the widespread number of HIE studies have demonstrated
interoperability and electronic exchange of benefits to patients, providers, public health,
information necessary for delivery reform or payers” (Yeager et al. 2017).
where information will routinely follow the Overall, it is clear that three decades after
patient regardless of where they receive care” the 1991 IOM report urging universal adop-
in an RFI requesting ideas for “accelerating tion of EHRs, the U.S. still lacks a clear and
HIE progress.”9 At least one experienced for- feasible roadmap leading to the ultimate goal
mer DHHS official bemoaned the lack of an of widespread availability of comprehensive
HII architecture to guide efforts.10 Even the electronic patient information when and
mainstream media noticed the continuing where needed. Despite much progress, no one
lack of progress (Creswell 2014). in the U.S. as yet receives their medical care
A systematic review found that HIE use with the assured, immediate availability of all
through 2015 in the U.S. was “growing but their records across multiple providers and
still limited” (Devine et al. 2017). The official provider organizations.
evaluation of ONC’s state HIE program tried
to put a positive spin on these activities with
the statistically meaningless conclusion that 15.4 Requirements for HII
“about half the states [were] performing at or
above the national average,” a result that by As with any informatics system development
definition would always be the case. The project, it is critical at the outset to understand
report also observed that “sustainability was a the desired end result. In the case of a large,
15 extremely complex system such as HII, this is
especially important because there are many
8 Reboot: Re-examining the strategies needed to stakeholders with conflicting incentives and
successfully adopt health IT. Retrieval 29 Oct 2018:
agendas, as well as challenging policy and
7 https://www.thune.senate.gov/public/_cache/
files/0cf0490e-76af-4934-b534-83f5613c7370/ operational issues. The ultimate goal is the
C60F25439BE1CEC36DF9E3834942D908. “universal availability of comprehensive elec-
ehr-white-paper-april-15.pdf tronic patient records when and where needed.”
9 CMS. Advancing Interoperability and Health In transforming this goal into a design specifi-
Information Exchange (Request for Information).
cation, it is critical to understand the issues
Retrieval 29 Oct 2018: 7 https://www.
federalregister.gov/docu-
ments/2013/03/07/2013-05266/
advancing-interoperability-and-health-information- 11 Dullabh PM, Parashuram S, Hovey L, Ubri P,
exchange Fischer K. (2016) Evaluation of the State HIE
10 Loonsk JW. Where’s the plan for interoperability? Cooperative Agreement Program: Final Report.
Healthcare IT News (22 Sept 2014). Retrieval 29 Retrieval 29 Oct 2018: 7 https://www.healthit.gov/
Oct 2018: 7 https://www.healthcareitnews.com/ sites/default/files/reports/
news/wheres-plan-interoperability finalsummativereportmarch_2016.pdf
Health Information Infrastructure
519 15
and constraints that must be addressed. Then tronic health records. The general public
any proposed system design must first demon- understands that making electronic patient
strate on paper how the objectives will be records available for good and laudable pur-
achieved within those limitations. poses simultaneously makes them more avail-
able for evil and nefarious purposes, thereby
necessitating higher levels of protection to
15.4.1 Privacy and Trust avoid abuses. Assigning decision-making for
disclosure of personal medical records to any-
The most important and overriding require- one other than the patient or the patient’s rep-
ment of HII is privacy. Clearly, health records resentative inherently erodes trust. In essence,
are very sensitive – perhaps the most sensitive the patient is being told, “we are going to
personal information that exists. In addition decide for you where your medical records
to our natural desire to keep our medical should go because we know what’s in your
information private, improper disclosure can interest better than you do.” A patient may
lead to employment and other types of dis- wonder why, if a given disclosure is in their
crimination. Furthermore, failure to assure interest, their consent would not be sought.
the privacy of records will naturally result in Furthermore, failing to seek such consent
patients being unwilling to disclose important inevitably leads to suspicion that the disclo-
personal details to their providers – or even to sure is in fact not in the patient’s interest, but
avoid seeking care at all. In addition to the rather in the interest of the organization
contents of the records, the very existence of deciding that the records should be released.
certain records (e.g., a visit to psychiatric hos- The concern about privacy of medical
pital) is sensitive even if no details are avail- records is not at all theoretical or insignifi-
able. Therefore, extraordinary care must be cant. In at least two consumer surveys,
taken to ensure that information is protected 13–17% of consumers indicated that they
from unauthorized disclosure and use. already employ “information hiding” behav-
In general, U.S. Federal law (the HIPAA iors with respect to their medical records12
Privacy Rule as introduced in 7 Chap. 12) (Shortliffe 2011). This includes activities such
requires patient consent for disclosure and use as obtaining laboratory tests under an
of medical records. However, consent is not assumed name or seeking out-of-state treat-
required for record release for treatment, pay- ment to conceal an illness from their primary
ment, and healthcare operations. These “TPO” care provider. Even assuming that everyone
exceptions have, as a practical matter, allowed engaged in such behaviors was willing to
healthcare organizations to utilize medical admit to them in such a survey, this represents
records extensively while bypassing patient a substantial proportion of consumers who
consent. The organization that holds medical would, at a minimum, refuse to participate in
information has sole discretion to make the an electronic medical information system that
decision whether a proposed disclosure is or is did not provide them with control over their
not a TPO exception. Until recently, TPO dis- own records. Of even greater concern, such a
closures did not even need to be recorded, large percentage of consumers would likely
effectively preventing discovery of improper organize and use their political power to halt
disclosures. Even under the HITECH legisla- the deployment and operation of such a sys-
tion that requires records of TPO disclosures, tem. Indeed, it was a much smaller percentage
such records are not automatically available to of concerned citizens that, citing the threat to
the subjects of the disclosures. The net effect is privacy, convinced Congress to repeal the pro-
that individuals not only lack control over the
dissemination of their medical records, but are
not even informed when they are disclosed 12 California Health Care Foundation. (2005)
beyond where they were created. National Consumer Health Privacy Survey.
Retrieval 29 Oct 2018: 7 http://www.chcf.org/
It seems appropriate to question whether publications/2005/11/national-consumer-health-
this disclosure regime is adequate for elec- privacy-survey-2005
520 W. A. Yasnoff
visions in the original HIPAA legislation call- Indeed, only a handful of communities have
ing for a unique medical identifier for all U.S. succeeded in developing and maintaining an
residents (see 7 Chap. 12). organization that includes the active participa-
In view of this, there are those who argue tion of the majority of healthcare providers.
that all decisions about release of patient Even in these communities, the system could
records need to be entrusted to the patient be disrupted at any time by the arbitrary with-
(with rare exceptions, such as mental incom- drawal of one or more participants. The unfor-
petence). They also suggest that attention to tunate reality is that healthcare stakeholders
these concerns may be especially important are often quite reluctant to share patient
for enabling HII, because patients must trust records, fearing loss of competitive advantage.
that their records are not being misused in Therefore, some would argue that mandat-
such a system. Some argue that patients are ing healthcare stakeholder participation in a
not sufficiently informed to make such deci- system for sharing electronic patient records is
sions and may make mistakes that are harm- highly desirable, since it would result in con-
ful to them, whereas others believe that the sistently more comprehensive individual
negative consequences of delegating this records. Since imposing a new requirement on
decision- making to others than the patient healthcare stakeholders would be a daunting
could be much greater. Advocates of patient political challenge, such an approach would
control of medical information argue, by be most easily accomplished as part of an
analogy, that society has accepted that existing mandate. Proponents of this approach
individuals retain the right to make decisions have noted that one such mandate that could
about how their own money is spent, even be utilized is the HIPAA Privacy Rule itself,
though this can lead to adverse consequences which requires all providers to respond to
when those decisions prove to be unwise. In patient requests for their own records (U.S. 45
considering these issues, it should be noted CFR 164.524(a)). Furthermore, if patients
that prior to the 2002 HIPAA Privacy Rule request their records in electronic form, and
that established the TPO exceptions, both law they are available in electronic form, this regu-
and practice had always required patient con- lation also requires that they be delivered in
sent for all access to medical records. While electronic form. Although not well known,
acknowledging the need for consumer educa- this latter provision is included in the original
tion about decisions relating to release of HIPAA Privacy Rule (U.S. 45 CFR 164.524(c)
medical records, patient-control advocates (2)), and has been reinforced by HITECH. It
believe that the same freedom and personal is also being promoted by the “blue button”
responsibility that applies to an individual’s initiative that seeks to allow patients to
15 financial decisions should be applied to the retrieve their own records electronically13 and
medical records domain. These medical infor- the growing movement by patients advocating
mation privacy policy issues may be even guaranteed access to their own data.14
more urgent in the context of the enhanced Advocates argue that patient control, in
trust necessary when seeking to implement an addition to being an effective approach to pri-
effective and widely accepted HII.
15.4.2 Stakeholder Cooperation 13 Chopra A, Park T, Levin PL. (2010) ‘Blue Button’
Provides Access to Downloadable Personal Health
Data. Retrieval 29 Oct 2018: 7 http://www.
To ensure the availability of comprehensive whitehouse.gov/blog/2010/10/07/
patient records, all healthcare stakeholders blue-button-provides-access-downloadable-per-
that generate such records must consistently sonal-health-data
make them available. While it would be ideal if 14 Miliard M. (2014) Patients want online access to
records. Healthcare IT News (5 May 2014).
such cooperation were voluntary and univer- Retrieval 29 Oct 2018: 7 https://www.
sal, assuring long-term collaboration of com- healthcareitnews.com/news/
peting healthcare stakeholders is problematic. patients-want-online-access-records
Health Information Infrastructure
521 15
vacy, could also serve to ensure ongoing, con- ongoing cost/benefit ratio. Most of the bene-
sistent healthcare stakeholder participation. fits of EHRs in physician offices accrue not to
Of course, in order for this approach to be the physician, but to other stakeholders. In
practical, the rights of patients to electronic one study, 89% of the economic benefit was
copies of their records under HIPAA would attributed to other stakeholders (Hersh 2004).
need to be universally enforced. Such enforce- It is unreasonable to expect physicians to
ment has to date been inconsistent, and, until shoulder 100% of the cost of systems while
recently, exclusively dependent on the Office accruing only 11% of the benefits. Even as
of Civil Rights at DHHS (since patients do physician EHR adoption levels have increased,
not have a private right of action). Under there have been increasing complaints about
HITECH, state attorneys general may also the burden that EHRs impose on physicians
bring legal action, which provides another (Sinsky et al. 2016).
legal avenue for improving compliance. It is important to note that EHRs alone,
Another option to ensure that providers even if adopted by all healthcare providers,
make their electronic records available to are a necessary but not sufficient condition for
patients for use in compiling comprehensive achieving HII. Indeed, each EHR simply con-
records would be to create a linkage to reim- verts an existing paper “silo” of information
bursement for care. In this scenario, each pro- to electronic form. These provider-based sys-
vider would be required to offer to deposit the tems manage the provider information on the
new information generated from an encounter patient in question, but do not have all the
in a place of the patient’s choice. In cases information for each patient. To achieve the
where the patients designated a destination goal of availability of comprehensive patient
for their information, payment for the care information, there must also be an efficient
received would be contingent on the deposit and cost-effective mechanism to aggregate the
of the required data. While this may seem scattered records of each patient from all their
somewhat coercive and even radical, it is con- various providers. Such aggregation also
sistent with practices in other service indus- requires effective standards for encoding and
tries. Whether it be car repair, plumbing, or communicating EHR information. Major
legal services, payment is nearly always con- gains in quality and efficiency of care will be
tingent on the client receiving detailed justifi- attainable only through HII that ensures the
cations and descriptions of the services availability of every patient’s comprehensive
provided. While a growing number of health record when and where needed.
care providers are increasingly providing such
“visit summaries,” at least in paper form, this
remains the exception in the medical domain. 15.4.4 Financial Sustainability
other key infrastructures such as public utili- medical information for research – both to
ties and the Internet, although regulated, are find eligible subjects for clinical trials and to
funded through user fees rather than taxation. utilize the data itself for research queries.
Note, however, that at least two U.S. states While this latter application has the poten-
(Maryland and Vermont) are using this mech- tial to defray a substantial portion of the
anism to help fund their HII. costs of HII, it requires efficient mecha-
The most common approach suggested for nisms for both searching data and recording
long-term HII sustainability is leveraging and maintaining patient consent that have
anticipated healthcare cost savings. This is not generally been incorporated into HII
based on the substantial and growing body of systems.
evidence that the availability of more compre- Perhaps the most lucrative HII revenue
hensive electronic patient records to providers source lies in the development of innovative
results in higher quality and lower cost care applications that rely on the underlying infor-
(AHRQ 2006; Menachemi et al. 2018). Some mation to deliver compelling value to con-
of the best examples include large, mostly sumers and other healthcare stakeholders. For
closed healthcare systems such as Kaiser, example, HII allows the delivery of timely and
Group Health and the Veterans Administra- accurate reminders and alerts to patients for
tion, where the availability of more complete recommended preventive services, needed
patient records in electronic form over time medication refills, and other medically related
has been consistently associated with both cost events of immediate interest to patients and
savings and better care. While the case for HII their families. It also would allow deployment
reducing healthcare costs is compelling, the of applications that assist consumers auto-
distribution and timing of those savings is dif- matically with management of their chronic
ficult to predict. In addition, cost savings to diseases. Utilizing new value to finance HII
the healthcare system means revenue losses to avoids the prediction and allocation problems
one or more stakeholders – clearly an undesir- inherent in attempts to leverage expected
able result from their perspective. Finally, the healthcare cost savings, with the added incen-
allocation of savings for a given population of tive that any such savings would fully accrue
patients is unknown, with the result that orga- to whoever achieves them.
nizations are reluctant to make specific finan-
cial commitments that could be larger than
their own expected benefits. 15.4.5 Community Focus
The final but least frequently mentioned
path to financial sustainability of HII is uti- Most observers believe that successful HII
15 lizing the new value created by the availabil- must be focused on the community. An essen-
ity of comprehensive electronic information. tial element in HII is trust, which is inherently
While it is widely recognized that this infor- local. Furthermore, health care itself is pre-
mation will be extremely valuable for a wide dominantly local, since the vast majority of
variety of purposes, this option has medical care for residents of a given commu-
remained largely unexplored. One example nity is provided in that community. Indeed,
of such new value is the potential reduction people traveling away from home who are
in cost for delivering laboratory results to injured or become ill inevitably will return
ordering physicians. The expenses borne by home at their earliest opportunity if their con-
individual laboratories for their own infra- dition permits (and does not resolve quickly).
structure providing this essential service can Since medical care is predominantly local,
be greatly reduced by a single uniform com- creating a system that delivers comprehensive
munity infrastructure providing electronic electronic patient information in a community
delivery to physicians through one mecha- solves the overwhelming majority of informa-
nism. Another example is availability of tion needs in that community. While move-
Health Information Infrastructure
523 15
ment of health information over long zation ideally must be devoid of any biases or
distances has some value and ultimately must hidden agendas that would favor one category
be addressed to assure completeness of of healthcare stakeholders over another, or
records, its contribution to a total solution is favor specific stakeholders within a given cat-
marginal. egory.
The lack of any examples of working HII None of the existing healthcare stakehold-
in communities larger than about 10 million ers seem well suited to meet the trust require-
people provides additional evidence of the ment. Many argue that government cannot
need for local focus. Keeping the scope of operate an HII because it is inherently not
such projects relatively small also increases trusted with sensitive personal records, and
their likelihood of success by reducing com- furthermore needs to assume the role of pro-
plexity, thereby avoiding the huge increases in viding regulatory oversight for whatever orga-
failure rates of extremely large-scale IT proj- nization does take the HII responsibility.
ects. This rule of thumb is reinforced by the Similarly, it seems problematic for employers
relatively small populations of countries that to be responsible for the HII since one of the
have successfully implemented effective HII primary concerns of consumers is to avoid dis-
such as Finland (5.5 million) and Estonia (1.3 closing sensitive medical information to their
million). employers. Health plans and insurers are typi-
In thinking about HII, analogies are often cally not trusted by consumers because their
made to the international financial system incentives are not aligned – they have a finan-
that efficiently transfers and makes funds cial incentive to deny care, which is a natural
available to individuals anywhere in the concern to consumers. Hospitals are in com-
world. However, it is often forgotten that petition with each other and therefore are not
these financial institutions, that also are in a good position to cooperate in a long-term
heavily dependent on trust, began as “build- HII effort. Physicians are the most trusted
ing and loan funds” in very small communi- healthcare stakeholders, from a consumer per-
ties designed to share financial resources spective, but are not organized in a way to
among close neighbors. It took many decades facilitate the creation of HII. Furthermore,
of building trust before large-scale national they are also in competition with each other
and international financial institutions and, most importantly, do not generally have
emerged. the informatics capabilities necessary for such
a complex endeavor.
Therefore, many believe that an indepen-
15.4.6 Governance dent (perhaps entirely new) organization is
and Organizational Issues needed to operate HII in communities. This
organization would have no direct connec-
Trust is arguably the most important element tions to existing healthcare stakeholders and
in considering the appropriate governance for therefore would be unbiased. Its sole function
HII. Even in a system where patients exert full would be to protect and make available com-
control over their own records, the organiza- prehensive electronic patient records on
tion that operates the HII must earn the full behalf of consumers. Such an independent
faith and confidence of consumers for the organization would also ideally facilitate
security, integrity, and protection of the cooperation among all existing stakeholders,
records, as well as ensuring that records are who would know that the HII activity was
appropriately available only for purposes that completely neutral and designed primarily to
consumers specify. Furthermore, the organi- serve consumers.
524 W. A. Yasnoff
15.5 Architecture for HII ments, does not scale effectively, and is com-
plex and expensive to operate. The most
critical requirement that is not addressed by
15.5.1 Institution-Centric
this architecture is searching the data, e.g., to
Architecture find all patients with a cholesterol level above
300. To do such a search, the records of every
Initially, most developing HII systems chose patient must be assembled from their various
an institution-centric approach to data stor- locations and examined one at a time. Known
age, leaving patient records wherever they are as a sequential search, it has a very long com-
created (. Fig. 15.2). Although records are pletion time that increases linearly with the
not stored centrally in this model, there is a size of the population. For example, in a
need to maintain at least a central index of modest- sized HIE with 500,000 patients,
where information can be found for a particu- assuming retrieval and processing of each
lar patient; without such an index, finding patient’s records requires just 2 seconds (a
information about each patient would require very low estimate), each such search would
queries to every possible source of medical take at least 12 days (1 million seconds).
information worldwide -- clearly an impracti- Furthermore, every such search would require
cal approach. When a given patient’s record is that each provider record system connected to
requested, the index is used to generate que- the HIE retrieve and transmit all its informa-
ries to the locations where information is tion – a very substantial computing and com-
stored. The responses to those queries are munications burden (that also increases the
then aggregated (in real time) to produce the risk of interception of information). In stan-
patient’s complete record. After the patient dard database systems, impractical sequential
encounter, the new data is entered into the cli- search times are reduced by pre-indexing the
nician’s EHR system and another pointer (to contents of the records. However, such pre-
that system) is added to the index so it will be indexing would in essence create a central
queried (in addition to all the other prior repository of indices that could be used to
locations) next time that patient’s record is reconstruct most of the original data, and
requested. therefore is inconsistent with this architec-
While this architecture appeals to health- tural approach.
care stakeholders because they continue to It may be argued that the searches could
“control” the records they generate, one can themselves be distributed to the provider sys-
argue that it fails to meet several key require- tems, and then the results aggregated into a
15
HIE
Clinician 1 2
Index
EHR
5
Patient 4 Assembly Other
Encounter 3
EHRs
.. Fig. 15.2 Institution-centric HII architecture. (1) HIE must wait for all responses. (4) The returned
The clinician EHR requests prior patient records from records are assembled and sent to the clinician EHR;
the HIE; this clinician’s EHR is added to the index for any inconsistencies or incompatibilities between records
future queries for this patient (if not already present). must be resolved in real time. (5) After the care episode,
(2) Queries are sent to EHRs at all sites of prior care the new information is stored in the clinician EHR only.
recorded in the HIE Index. (3) EHRs at each prior site (Used with permission of Health Record Banking Alli-
of care return records for that patient to the HIE; the ance (HRBA))
Health Information Infrastructure
525 15
coherent result. However, this approach also record and made available to the requestor.
fails because individual patient records are While the query-response cycles can all be
incomplete in each system. Therefore, searches done in parallel, the final integration of results
that require multiple items of patient data must wait for the slowest response. As the
(e.g., patients with chest pain who have taken number of connected systems increases, so
a certain medication in the past year), will does the probability of a slow (or absent)
produce anomalous results unless all the response from one of them when queried for
instances of the relevant data for a given patient records. In addition, more systems
patient are in a single provider system (i.e., if mean more processing time to integrate mul-
one system finds a patient with chest pain, but tiple sources of information into one coherent
without any indication of the medication of record. Thus, the response time will become
interest [which is in another provider’s sys- slower as the number of connected systems
tem], that patient will not be reported as satis- increases (Lapsia et al. 2012).
fying the conditions) (Weber 2013). It is The institution-centric architecture also
possible to launch multiple searches each lim- introduces high levels of operational com-
ited to a single criterion and then combine the plexity. Since the completeness of retrieval of
results from each to generate a correct result. a given patient’s records is dependent on the
However, this would require multiples of the availability of all the systems that contain
completion time for a single criterion (e.g., information about that patient, ongoing real-
12 days × 2 = 24 days for the two criteria time monitoring of all connected information
example), making the retrieval times and pro- sources is essential. This translates into a
cessing burdens even more untenable. requirement for a 24 × 7 network operations
In addition to the scaling issues for this center (NOC), that constantly monitors the
architecture related to searching, there is also operational status of every medical informa-
a problem with response time for assembling a tion system and is staffed with senior IT per-
patient record. When a given patient record is sonnel who can immediately troubleshoot and
requested, the locations where the patient has correct any problems detected (. Fig. 15.3).
available records are found using the central Even with modest system failure rates (e.g.,
index. Then, a query-response cycle is required one per thousand), a community with thou-
for each location where patient records are sands of EHRs will typically have a handful
available. Following completion of the query- of systems that are unresponsive to queries
response cycles, all the information obtained for patient records and require immediate
must be integrated into a comprehensive expert attention to restore to full operation.
The cost of this around-the-clock monitoring dent organization that provides a secure elec-
is very substantial, since a staff of at least five tronic repository for storing and maintaining
full-time network engineers is required to an individual’s lifetime health and medical
assure that at least one person is always avail- records from multiple sources and assuring
able for every shift 7 days a week. that the individual always has complete con-
Adding to the cost of the NOC, every trol over who accesses their information”
EHR system in an institution-centric model (Detmer et al. 2008).
must incur additional expenses to always be Using a community HRB to provide
able to respond to queries in real-time. This patient information for medical care is
will be extremely problematic for physician straightforward (. Fig. 15.4). Prior to seek-
offices, since their EHR systems will need to ing care (or at the time of care in an emer-
operate 24 × 7 and include additional hard- gency), the patient gives permission for the
ware, software, and telecommunications caregiver to access his/her HRB account
capabilities to simultaneously support such records (either all or part) through a secure
queries while also serving its local users. Internet portal. The provider then accesses
Clearly, the transaction volumes generated (and optionally, downloads) the records
will be substantial, since each patient’s records through a similar secure web site. When the
will be queried whenever they receive care at care episode is completed, the caregiver then
any location. Contrast this to a central repos- transmits any new information generated to
itory model where the information from a the HRB to be added to the account-holder’s
care episode is transmitted once to the reposi- lifetime health record. The updated record is
tory and no further queries to the source sys- then immediately available for subsequent
tems are ever needed. This analysis has been care.
confirmed by a simulation study of the
institution-centric architecture demonstrat- zz History of HRBs
ing that both the transaction volume and The health record banking concept has been
probability of incomplete records (from miss- evolving for over two decades since it was ini-
ing data due to a malfunctioning network tially proposed (Szolovits et al. 1994). The
node) increase exponentially with the average
number of sites where each patient’s data is
HRB
located (Lapsia et al. 2012).
1
Clinician Patient
15.5.2 Patient-Centric Architecture EHR Records
15 (Health Record Banking) 2
16 Haislmaier EF. (2006) Health Care Information 20 Yaraghi N. (2016) You Should Control Your Own
Technology: Getting the Policy Right. Retrieval 29 Health Care Data. U.S. News (12 Feb 2016).
Oct 2018: 7 http://www.heritage.org/Research/ Retrieval 29 Oct 2018: 7 https://www.usnews.com/
Reports/2006/06/Health-Care-Information-Tech- opinion/blogs/policy-dose/articles/2016-02-12/
nology-Getting-the-Policy-Right to-protect-patients-privacy-give-them-control-of-
17 Health Record Banking Alliance. (2006) Retrieval their-health-data
29 Oct 2018: 7 http://www.healthbanking.org 21 Kushner J and Verma S. (2018) Giving patients
18 State of Washington Health Care Authority. (2006) control of their health information will help give
Washington State Health Information patients control of their health. Recode (15 Mar
Infrastructure: Final Report and Roadmap for State 2018). Retrieval 29 Oct 2018: 7 https://www.
Action. Retrieval 29 Oct 2018: 7 http:// recode.net/2018/3/15/17114684/
providersedge.com/ehdocs/ehr_articles/ health-care-digital-information-jared-kushner-
Washington_State_Health_Information_Infra- seema-verma-president-trump
structure-Final_Report_and_Roadmap_for_State_ 22 Yaraghi N. (2015) A Sustinable Busines Model for
Action.pdf Health Information Exchange Platforms: The
19 Castro D. (2007) Improving Health Care: Why a Solution to Interoperability in Health Care
Dose of IT May Be Just What the Doctor Ordered. IT. Brookings Institution (30 Jan 2015). Retrieval
Information Technology and Innovation 29 Oct 2018: 7 https://www.brookings.edu/
Foundation. Retrieval 29 Oct 2018: 7 http://www. research/a-sustainable-business-model-for-health-
itif.org/publications/improving-health-care-why- information-exchange-platforms-the-solution-to-
dose-it-may-be-just-what-doctor-ordered interoperability-in-health-care-it/
528 W. A. Yasnoff
learned from an early HRB startup (Yasnoff companies such as Project Hugo,30
and Shortliffe 2014). Endorsement of the CareDox,31 Patients Know Best,32
HRB concept (albeit referenced under various 33 34
Betterpath, and Ciitizen (as well as a num-
different terms) has come from a variety of ber of others) are pursuing the challenge of
sources including a prominent observer of the developing HRBs. Even the tech giant Apple
digital transformation of health care (Mikk appears to be moving towards HRB imple-
et al. 2017), a policy think tank,23 a former mentation with its Apple Health Kit serving
ONC Director,24 and a leading market as the infrastructure for a patient record
research firm.25 Even the current CMS repository controlled by individuals on their
Administrator has been openly advocating for own smartphones.35
lifetime, longitudinal patient records accessi-
ble to and controlled by patients.26 zz How Requirements Lead to HRB
Several countries have established suc- Architecture
cessful HRBs, including Finland,27 Estonia,28 . Figure 15.5 shows how the HII require-
and Brazil.29 In addition, a number of startup ments for complete records, low cost, and
high benefits lead directly to the HRB archi-
tecture. In order to ensure complete records,
information from each patient encounter
23 Kendall D and Quill E. (2015) A Lifetime
must be available. The only currently available
Electronic Health Record for Every American.
Third Way (28 May 2015). Retrieval 29 Oct 2018: mandate for this is for the patient to request
7 https://www.thirdway.org/ records (in digital form) from their provider,
report/a-lifetime-electronic-health-record-for- invoking their rights under the HIPAA pri-
every-american vacy rule. In order for patients to feel comfort-
24 Blumenthal D. (2016) The Biggest Obstacle to the
able doing that, they must trust the system.
Health-Care Revolution. Wall Street Journal (28
Jun 2016) Retrieval 29 Oct 2018: 7 https://blogs. The first element of trust is the architectural
wsj.com/experts/2016/06/28/ end point of security; the patients must know
how-to-make-health-care-records-as-mobile-as- that the records being sent from their provider
patients/ will be protected from improper use. This also
25 Frost & Sullivan. (2015) Moving Beyond the
leads to the architectural end point of patient
Limitations of Fragmented Solutions: Empowering
Patients with Integrated, Mobile On-Demand control of all access to the records; how can it
Access to the Health Information Continuum. be justified to patients for some other entity to
Retrieval 29 Oct 2018: 7 http://coranetsolutions.
com/wp-content/uploads/Moving%20Beyond%20
the%20Limitations%20of%20Fragmented%20
15 Solutions%20Whitepaper.pdf
26 Coombs B. (2018) Medicare chief says it’s time
the Kingdom of the Netherlands. Retrieval 29 Oct
2018: 7 https://www.rvo.nl/sites/default/
health care caught up to other industries to benefit files/2017/01/Brazil%20Healthcare%20-%20
consumers. CNBC (30 Apr 2018). Retrieval 29 Oct Guilherme%20Hummel.pdf
2018: 7 https://www.cnbc.com/2018/04/30/ 30 Retrieval 29 Oct 2018: 7 http://hugophr.com
cms-verma-says-its-time-health-care-caught-up-to- 31 Retrieval 29 Oct 2018: 7 https://caredox.com
other-industries.html 32 Retrieval 29 Oct 2018: 7 https://www.
27 Kunnamo I. (2018) National health record bank patientsknowbest.com/patients.html
(the eArchive) in Finland. Retrieval 29 Oct 2018: 33 Retrieval 29 Oct 2018: 7 https://www.betterpath.
7 https://www.dropbox.com/s/yfpck5hpzkd8lh0/ com
HRBA%20Kunnamo.pdf 34 Retrieval 29 Oct 2018: 7 https://www.ciitizen.com
28 e-Estonia Healthcare. Retrieval 29 Oct 2018: 35 Bell K. (2018) Apple wants to put medical records
7 https://e-estonia.com/solutions/ on your iPhone. Here’s how it works. Mashable (24
healthcare/e-health-record/ Jan 2018). Retrieval 29 Oct 2018: 7 https://
29 Hummel GS. (2016) Brazil eHealth – Overview, mashable.com/2018/01/24/
Trends, and Opportunities. Consulate General of apple-iphone-medical-records-how-to/
Health Information Infrastructure
529 15
.. Fig. 15.5 How
Requirement:
requirements lead to HRB Complete Records
architecture. This diagram
shows how the key
requirements of complete Mandated Record Deposit
records, low cost, and high
benefits lead directly to the HIPAA Patient Record Request
architectural specifications
of patient access control, a Patient Trust
repository for data storage, PATIENT ACCESS CONTROL
and trustworthy security
that are characteristic of REPOSITORY SECURITY
health record banks
Simple Architecture Searchability of records across population
decide which records to release to whom and zz Patient Control Ensures Privacy and
when? In order to assure feasible patient Stakeholder Cooperation
access control, the records must be stored in a In an HRB, everything is done with consumer
repository (the third and final architectural consent, with account-holders controlling
end point) so that patients have a single point their copy of all their records and deciding
for indicating and revising their record access who gets to see any or all of it. This protects
permissions. privacy (since each consumer sets their own
The requirement for low cost means that customized privacy policy), promotes trust,
the architecture must be simple. The most and ensures stakeholder cooperation since all
straightforward architecture for compiling holders of medical information must provide
and accessing complete records for each per- it when requested by the patient (Kendall
son is a repository – it is easy (and therefore 2009). Of course, the operations of an HRB
inexpensive) to implement and operate. must be open and transparent with indepen-
Finally, the requirement for high value dent auditing of privacy practices. World-
leads to the intermediate requirement to be class state-of-the-art computer security is
able to search records across the population. needed to protect the HRB, which will be a
It is those searching operations that provide natural target for hackers. At least one new
critical information of great value for public security approach is now available that abso-
health, medical research, quality improve- lutely prevents large-scale data loss from a
ment, and policy. Such searchability is much repository of medical records (Yasnoff 2016),
more easily achieved when the records are in a the key security requirement for an HRB (see
repository. subsection on HRB Security below).
Therefore, the architecture needed for a Natural concerns arise from the ability of
low cost, high benefit HII with complete the patient to suppress any or all of their
records is a secure repository with record HRB account information, which could lead
access controlled by patients. These are the to misdiagnosis and dangerous treatment.
exact characteristics of health record banks, This capability could be abused by patients
which were conceived and designed to specifi- who, for example, may seek multiple prescrip-
cally address these key HII requirements. tions for controlled substances for the pur-
530 W. A. Yasnoff
pose of diversion for illegal sale. With respect would help to offset the unfavorable cost/
to the possibility of medical errors resulting benefit of EHRs for office-based providers.
from incomplete information, the patient These incentives could be paid on a per-
would be clearly and unmistakably warned encounter or per-month basis. In addition,
about this when choosing not to disclose any those few physicians who do not currently
specific information (e.g., “Failure to disclose have EHRs could receive no-cost Internet-
any of your medical information may lead to accessible EHR systems (at HRB expense)
serious medical problems, including your with the understanding that information
death”). The expectation is that few people from patient encounters would be automati-
will choose to do this, particularly after such a cally transferred to the HRB. Another option
warning. However, as noted earlier, 13–17% is to link reimbursement for medical services
of patients already engage in this practice, to HRB deposits – i.e., providers would not
leading many observers to conclude that the be paid unless the medical record informa-
general public may not be comfortable with a tion generated from those services is trans-
system that provides easy access to their mitted to an HRB. This makes sense
records unless they are in control of such economically, as the value of medical ser-
access. This issue ultimately becomes one of vices is greatly limited if the information
public policy and may also be a subject of dis- about patients is not readily available for
cussion between the doctor and the patient their ongoing care.
(i.e., the doctor will want to be assured by the Incentives for HRB deposits also serve to
patient that all information is being provided). ensure compliance with data standards, both
Clearly, physicians should not be liable for the initially and on an ongoing basis. Clearly, any
consequences of the patient’s choice to with- EHR provided through the HRB would, by
hold information. definition, transmit information back to the
With respect to patients who use their HRB in a standard format (since the HRB
power to hide information as a way to facili- would only provide systems that can do so).
tate improper or illegal activity, there is clearly For physicians who already have EHRs, HRB
an overriding public policy concern. For reimbursements for deposits from those sys-
example, in the case of controlled substances, tems would naturally require complete and
it may be necessary to report to the physician fully encoded encounter data using estab-
(or, if legislatively mandated, to regulatory lished standards to be sent to the HRB. Over
authorities) whenever a patient suppresses time, higher levels of encoding of medical
any information about controlled substance information can be promoted through the
prescriptions. The information itself would gradual introduction of more stringent stan-
15 still be under the patient’s control, but the dards requirements (with plenty of lead time
physician would be alerted with a notice such to allow for system upgrades). Compliance
as “some controlled substance prescription with such changes in standards could also be
information has been withheld at the patient’s further assured through a direct relationship
request.” There may be other situations where to reimbursement.
such warnings are needed.
zz HRB Business Model
zz Assuring that Standardized, Encoded Health record banking has advantages on
Electronic Patient Information Is both the cost and revenue sides of the busi-
Consistently Deposited ness model; the cost is lower and the revenue
HRBs can provide ongoing incentives for opportunities greater. Because of the lower
deposits of EHR records by clinicians, which operating costs and additional functionality
Health Information Infrastructure
531 15
for searching records, one can envision a vari- and fringe benefits. Assuming an additional
ety of business models for HRBs that do not $500,000/year for hardware and software to
depend on public subsidies or attempt to cap- operate the institution-centric system (over
ture any healthcare savings, but are solely and above the data repository needed for an
funded through new value created for con- HRB) yields an annual cost of $1.5 million or
sumers and other stakeholders.36 $1.50/person/year. This would add nearly 20%
Due to the simplicity of HRB operations, for the institution-centric model to the esti-
the cost is substantially less than an equiva- mated $8/person/year needed to operate an
lent institution-centric architecture. For an HRB (Kaelber et al. 2008).
HRB, providing access at the point of care Beyond this, the additional costs imposed
only involves a single retrieval from the bank’s in the institution-centric model for each con-
repository of records. In an institution-centric nected EHR for additional hardware, soft-
model, the records for a given patient are ware, telecommunications capability, and
located at an arbitrary number of dispersed additional operational expenses to maintain
sites, and must be assembled in real-time and 24 × 7 system availability must also be
integrated into a comprehensive record before included. Even if such costs were only a very
they can be used for patient care. Not only is modest $1,000/year/system (less than $100/
this process of assembly complex, time- month), this would result in an additional
consuming, and prone to error, it necessitates, $1,000,000 or $1/person/year. Adding this to
as noted above, the creation of a fully staffed the $1.50/person/year for the NOC gives a
24 × 7 NOC to monitor the availability of all total estimated cost of $2.50/person/year,
information sources as well as troubleshoot resulting in over 30% higher costs for the
and correct those that are malfunctioning. institution-
centric model than a basic
The estimated cost for the NOC in an HRB. Added to this would be the costs and
institution-centric model is substantial. For complexity of establishing and maintaining
example, given a population of 1,000,000, at data sharing agreements among all the enti-
least 1,000 systems would need monitoring ties, which would be substantial.
(one for every 1,000 patients). Assuming a An HRB with comprehensive electronic
reasonable failure rate for fully functional medical records for individuals in a commu-
query connectivity to each system of once/ nity can generate substantial value (Yasnoff
year (representing a mean time between fail- and Shortliffe 2014). In addition to empower-
ures [MTBF] of over 8,700 hours), there ing physicians to provide safer, more effective,
would be an average of 2.73 failures/day or and more efficient care, the availability of
0.91 failures per 8-hour shift that would need such records readily enables many types of
troubleshooting attention. A minimum staff heretofore infeasible yet very desirable ser-
for the NOC would be one person 24 × 7; vices for patients, providers, and other stake-
given 21 shifts/week plus leeway for vacations holders.
and sick leave, this would require at least 5 Perhaps the most compelling example of a
full-time equivalent staff costing about service for patients is the “peace of mind”
$200,000 each including equipment, overhead alert that immediately notifies the patient’s
loved ones about a critical event, such as an
emergency room visit. As soon as the emer-
36 Health Record Banking Alliance. (2012) Health gency provider accesses the patient’s HRB
Record Banking: A Foundation for Myriad Health record, the alert, which is delivered electroni-
Information Sharing Business Models. White Paper
(12 Dec 2012). Retrieval 29 Oct 2018: 7 http://
cally in a manner chosen by the recipient, is
www.healthbanking.org/uploads/9/6/9/4/9694117/ delivered. Such a service is particularly valu-
hrba_white_paper_business_model_dec_2012.pdf able for children of seniors who may not be
532 W. A. Yasnoff
Requirements
Privacy Patient consent difficult to implement; Simple; patients in control of all access to
many complex data sharing agreements their own records; consent easy to
needed implement
Security Inherently weak because records Very strong using new security techniques
transmitted multiple times for each use that eliminate large-scale data loss from
repositories
Searchability Impractical to search population data Search feasible using parallel processing
in the cloud
Completeness Requires queries to all data sources each Comprehensive data available at all times
time a patient’s records are requested; all for each patient
must respond for completeness
Organizational issues
Cooperation Extensive; community-wide Unifying; HIPAA mandates records on
needed patient request
Organizational High; ongoing collaboration of multiple Low; HRB is neutral and independent of
complexity competing stakeholders necessary all stakeholders
Cost
Startup cost Substantial (due to high complexity) Moderate
Operational cost Inefficient/expensive Efficient/inexpensive
Business model Complex; no clear approach has emerged; Sustainable using new value of complete
typically requires ongoing subsidies from information; many options possible
health care stakeholders funded by patients/payers/purchasers
Operations
IT design Complex, based on “fetch and show”; Simple, based on “deposit once to
requires queries to multiple entities, account”; no secondary queries or
real-time reconciliation of inconsistencies, real-time reconciliation needed; NOC
and NOC unnecessary
15 Reliability Prone to error (record sources unavailable) Reliable; one operation to retrieve
individual patient data
Interoperability Compliance voluntary Compliance can be assured with financial
incentives
Incentives
Clinician Not included Easy to include
incentives
Clinician burden Extensive; incoming query each time Minimal; information deposited once in
current patients seen anywhere increases HRB; no incoming queries
EHR costs
Health Information Infrastructure
535 15
openly). However, this is an oversimplification EHR systems. The financial interests of health
of the complex incentives involved. plans and hospitals also appear to conflict
For insurers, their claims data provides the with the deployment of HRBs, since their
most comprehensive picture of individual adoption would likely reduce their fee-for-
health care service usage. While this informa- service incomes by eliminating unnecessary
tion is not sufficiently detailed to facilitate and duplicative care. Again, this opposition is
clinical management of patients, it gives the behind the scenes to avoid having to openly
insurers the “informational advantage” in advocate for higher institutional income over
negotiations with their customers, employers patient safety and effective, efficient care.
that purchase insurance for their employees, In a situation such as this, where progress
and their payees, physicians, hospitals, and that would benefit everyone (i.e., HRBs) is
health plans. HRBs would make much more opposed by specific groups, one would hope
detailed information about each patient that the government, representing the inter-
potentially available to all healthcare stake- ests of all the people, would help ensure that
holders, eliminating this informational advan- the good of everyone prevails. However, in the
tage. In addition, HRBs have the potential to U.S., the ability of the government to impose
reduce health care costs. While this would major changes is quite limited (due to the sys-
seem to be positive for insurers (making their tem of “checks and balances”), and is espe-
offerings less expensive to employers), it also cially so when the proposed changes are
would reduce their profitability, which is typi- opposed by many key stakeholders. It has
cally a percentage of those health care costs. therefore been relatively easy for major health-
For these reasons (and perhaps others), insur- care stakeholders to delay and/or redirect
ers have worked actively against HRB devel- HRB efforts by, for example, advocating for
opment, for example, by legislatively derailing other solutions, raising seemingly valid objec-
promising efforts to jump start HRB develop- tions to reasonable efforts to move forward,
ment in Washington State. Of course, since or creating organizations that appear to be
comprehensive patient-controlled electronic working towards a solution while in reality
records are so appealing to the public, the being dedicated to maintaining the status quo.
opposition of insurers has been behind the
scenes. The latter is a theme that is common to
all the stakeholder opposition, as it is difficult 15.6.2 actors Accelerating HRB
F
to openly justify opposing the safer, more Progress
effective, more efficient care likely to result
from HRBs. Perhaps the most important factor accelerat-
Health plans and hospitals have somewhat ing HRB progress is the recent and ongoing
different reasons for opposing HRBs. Mostly, change in the reimbursement system for care.
this involves concerns about empowering their The move from “fee for service” to “pay for
competitors. Typically, the largest hospitals value” is changing the incentives for all health
and health plans in each region have more care stakeholders. Under “fee for service,” the
complete information than their smaller com- more efficient care enabled by HRBs would
petitors, and therefore see HRBs as weaken- directly translate to lower reimbursement, a
ing or eliminating that perceived market clearly undesirable outcome for providers. But
advantage. Sadly, this is true despite the fact a “pay for value” system financially rewards
that HRBs would allow all providers to deliver efficiency, thereby creating a strong incentive
better care. Another issue, particularly for for the more complete and timely patient
hospitals, is that they are concerned about the information that HRBs can make available.
idea of sharing their electronic patient infor- While the transition from “fee for service” to
mation after spending hundreds of millions “pay for value” is still in its relatively early
of dollars (or more) to install and operate stages, it is reasonable to expect that ongoing
536 W. A. Yasnoff
additional progress is highly likely over the building the National Health Information
next few years. Infrastructure. Report and recommendations
from the National Committee on Vital and
nnSuggested Readings Health Statistics. Retrieval 31 Oct 2018: https://
Castro, D. (2007). Improving health care: Why a aspe.hhs.gov/report/information-health-strat-
dose of IT may be just what the doctor ordered. egy-building-national-health-information-
Information Technology and Innovation infrastructure. This seminal work was the first
Foundation. Retrieval 31 Oct 2018: http:// to call for a national HII, coining the term. It
www.i tif.o rg/publications/improving-health- comprehensively describes the need for HII, the
care-why-dose-it-may-be-just-what-doctor- problems it would solve, and the necessity for
ordered. This is the first independent report government investment to incentivize its devel-
that endorsed patient-centric architecture opment.
(HRBs) as an effective approach to HII. It Yasnoff, W. A. (2016). A secure and efficiently
describes clearly the problems and challenges searchable health information architecture.
of HIEs. Journal of Biomedical Informatics, 61, 237–
Kendall, D., & Quill, E. (2015, May 28). A life- 246. This paper describes a new and innova-
time electronic health record for every tive personal grid architecture for digital
American. Third Way. Retrieval 29 Oct 2018: health records that absolutely prevents large-
h t t p s : / / w w w .t h i r d w a y .o r g / scale data loss, an essential capability to
report/a-lifetime-electronic-health-record-for- ensure user trust in large central repositories
every-american. This paper is a more recent of health records.
endorsement of the HRB concept by an inde- Yasnoff, W. A., & Shortliffe, E. H. (2014). Lessons
pendent think tank. learned from a health record bank start-up.
Krist, A. H., & Woolf, S. H. (2011) A vision for Methods of Information in Medicine, 53,
patient-centered health information systems. 66–72. This paper gives a detailed post-mor-
Journal of the American Medical Association, tem of a health record bank startup and
305(3):300–301. A vision of how fully func- includes specific recommendations for future
tional patient-centric electronic medical success.
record systems could be the basis for an effec- Yasnoff, W. A., Humphreys, B. L., Overhage,
tive HII. J. M., Detmer, D. E., Brennan, P. F., Morris,
Lapsia, V., Lamb, K., & Yasnoff, W. A. (2012). R. W., Middleton, B., Bates, D. W., & Fanning,
Where should electronic records for patients J. P. (2004). A consensus action agenda for
be stored? International Journal of Medical achieving the national health information
Informatics, 81(12), 821–827. This paper eluci- infrastructure. Journal of the American
15 dates clearly the advantages of patient-centric Medical Informatics Association, 11(4), 332–
architecture by comparing it via simulation to 338. This paper describes the results of the
an institution-centric approach. first national consensus conference on HII
Miller, R. H., & Miller, B. S. (2007). The Santa held in Washington, DC, in 2003. This was the
Barbara County Care Data Exchange: What meeting that led to the creation of ONC in
happened? Health Affairs, 26(5), w568–w580. 2004.
This paper describes the history of one of the Yasnoff, W. A., Sweeney, L., & Shortliffe, E. H.
earliest HIEs, including details about the fac- (2013). Putting health IT on the path to suc-
tors leading to its f ailure. cess. Journal of the American Medical
Mikk, K. A., Sleeper, H. A., & Topol, E. J. (2017). Association, 309(10), 989–990. This paper pro-
The pathway to patient data ownership and vides a concise overview of the problems with
better health. JAMA, 318(15), 1433–1434. HIE and the rationale for HRBs.
This paper describes how patient data owner-
ship and records stored in repositories can ??Questions for Discussion
result in an effective HII. 1. Make the case for and against investing
National Committee on Vital and Health Statistics. $billions in the HII. How successful
(2001). Information for health: A strategy for have the HITECH Meaningful Use
Health Information Infrastructure
539 15
incentives been in promoting HII ingful use. Annals of Internal Medicine, 154,
development? What could have been 666–671.
Agency for Healthcare Research and Quality. (2006).
done differently to make them more
Costs and benefits of health information technology.
effective? Evidence Report/Technology Assessment 132, pub-
2. What organizational options would you lication 06-E006. Retrieval 29 Oct 2018: https://
consider if you were beginning the www.ahrq.gov/downloads/pub/evidence/pdf/hitsys-
development of HII? What are the pros costs/hitsys.pdf
Associated Press. (2015, July 17). Online attacks at
and cons of each? How would you pro-
UCLA Health exposed 4.5 million. New York
ceed with making a decision about Times. Retrieval 29 Oct 2018: http://www.nytimes.
which one to use? com/2015/07/18/business/online-attacks-at-ucla-
3. Estimate the required bandwidth and health-exposed-4-5-million.html
transaction rate for patient-centric (HRB) Balas, E. A., & Boren, S. A. (2000). Managing clinical
knowledge for health care improvement. In Yearbook
vs. institution-centric HII architecture.
of medical informatics 2000: Patient-centered systems
4. Consider the policy implications of uni- (pp. 65–70). Schattauer: Stuttgart.
versal availability of comprehensive Ball, M., & Gold, J. (2006). Banking on health: Personal
electronic patient records. What are the records and information exchange. Journal of
risks and how could they be mitigated? Healthcare Information Management, 20(2), 71–83.
Bates, D. W. (2000). Using information technology to
5. Given the architectural and other advan-
reduce rates of medication errors in hospitals.
tages of HRBs, why have most commu- British Medical Journal, 320, 788–791.
nities adopted institution-centric Bates, D. W., Leape, L. L., Cullen, D. J., et al. (1998).
architectures up to now? What are some Effect of computerized physician order entry and a
steps that might be helpful in encourag- team intervention on prevention of serious medica-
tion errors. Journal of theAmerican Medical
ing communities to evaluate alternative
Association, 280(15), 1311–1316.
architectures such as HRBs? Creswell, J. (2014, September 30). Doctors find barriers
6. Show specifically the potential locations to sharing digital medical records. New York Times.
where patient consent functionality Retrieval 29 Oct 2018: https://www.nytimes.
could be added to the institution-centric com/2014/10/01/business/digital-medical-records-
become-common-but-sharing-remains-challenging.
and patient-centric HII architectures in
html
. Figs. 15.2 and 15.4 and describe the Detmer, D., Bloomrosen, M., Raymond, B., & Tang, P.
granularity of consent that would be (2008). Integrated personal health records:
possible at each proposed location. Transformative tools for consumer-centric care.
After eliminating any redundant BMC Medical Informatics and Decision Making, 8,
45. Retrieval 29 Oct 2018: https://www.ncbi.nlm.
functionality, compare and contrast the
nih.gov/pmc/articles/PMC2596104/
consent implementation issues for the Devine, E. B., Totten, A. M., Gorman, P., Eden, K. B.,
two alternative architectures, describing Kassakian, S., Woods, S., Daeges, M., Pappas, M.,
the advantages and disadvantages of McDonagh, M., & Hersh, W. R. (2017, December 7).
each. Which architecture more Health information exchange use (1990–2015): A sys-
tematic review. EGEMS (Washington, DC), 5(1), 27.
efficiently addresses the issue of patient
Retrieval 29 Oct 2018: https://egems.academyhealth.
consent? Why? org/articles/10.5334/egems.249/
Dodd, B. (1997, October). An independent “health
information bank” could solve health data security
References issues. British Journal of Healthcare Computing and
Information Management, 14(8), 2.
Dyson, E. (2005, September). Personal health informa-
Abelson, R., & Goldstein, M. (2015, May 2). Millions
tion: Data comes alive! Release, 1.0(24), 1.
of anthem customers targeted in cyberattack. New
Gold, J. D., & Ball, M. J. (2007). The health record
York Times. Retrieval 29 Oct 2018: http://www.
banking imperative: A conceptual model. IBM
ny t i m e s.c o m / 2 0 1 5 / 0 2 / 0 5 / bu s i n e s s / h a c ke r s -
Systems Journal, 46(1), 43–55.
breached-data-of-millions-insurer-says.html
Haun, K., & Topol, E. J. (2017, January 2). The health
Adler-Milstein, J., Bates, D. W., & Jha, A. K. (2011). A
data conundrum. New York Times. Retrieval 29 Oct
survey of health information exchange organiza-
2018: https://www.nytimes.com/2017/01/02/opinion/
tions in the United States: Implications for mean-
the-health-data-conundrum.html
540 W. A. Yasnoff
Management of Information
in Health Care Organizations
Lynn Harold Vogel and William C. Reed
Contents
References – 574
Management of Information in Health Care Organizations
545 16
nnLearning Objectives ness lines, how to partner with other organi-
After reading this chapter, you should know zations, and how to eliminate underutilized
the answers to these questions: services. Collectively, health care professionals
55 What are the primary information comprise a heterogeneous group with diverse
requirements of health care organiza- objectives and information requirements, and
tions (HCOs)? in the end, all are expected to be focused on
55 What are the clinical, financial, and the patients who are, after all, the reason for
administrative functions provided all of this.
by health care information systems The purpose of health care information
(HCISs), and what are the potential systems (HCISs) is to support the access, pro-
benefits of implementing such systems? cessing, and management of the information
55 How have changes in health care deliv- that health care professionals need to perform
ery models changed the scope and their jobs effectively and efficiently. HCISs
requirements of HCISs over time? facilitate communication, integrate informa-
55 How do differences among business tion, and coordinate action among multiple
strategies and organizational structures health care professionals—and increasingly,
influence information systems choices? patients. In addition, HCISs organize and
55 What are the major challenges to imple- store substantial amounts of data and they
menting and managing HCISs? support record-keeping and reporting func-
55 How are ongoing health care reforms, tions. Many of the clinical information
technological advances, and chang- functions of an HCIS were detailed in our
ing social norms likely to affect HCIS discussion of the computer-based patient
requirements in the future? record (CPR) in 7 Chap. 14; systems to sup-
port nurses and other care providers are dis-
cussed in 7 Chap. 17. Furthermore, HCISs
16.1 Overview are key elements that interface with the
health information infrastructure (HII), as
Health care organizations (HCOs), and discussed in 7 Chap. 15. An HCIS also sup-
Integrated Delivery Networks (IDNs) like ports the financial and administrative func-
many other business entities, are information- tions of a health organization and associated
intensive enterprises. Health care person- operating units, including the operations of
nel require sufficient data and information ancillary and other clinical-support depart-
management tools to make appropriate deci- ments. The evolving complexities of HCOs
sions. At the same time, they need to care for place great demands on an HCIS. Many
patients and manage and run the enterprise; HCOs are broadening their scope of activi-
they also need to document and communicate ties to cover the care continuum, partially in
plans and activities, and to meet the require- response to Accountable Care Organization
ments of numerous regulatory and accredit- (ACO), Value-based Purchasing and Bundled
ing organizations. Clinicians assess patient Payment initiatives from the federal govern-
status, plan patient care, administer appropri- ment. HCISs must organize, manage, and
ate treatments, and educate patients and fami- integrate large amounts of clinical and finan-
lies regarding clinical management of various cial data collected by diverse users in a vari-
conditions. They are also concerned about ety of organizational settings (from patient
evaluating the clinical outcomes, quality, homes to physicians’ offices to hospitals to
and increasingly, the cost of health services health care systems) and must provide health
provided. Administrators determine appro- care workers (and, increasingly, patients) with
priate staffing levels, manage inventories of timely access to complete, accurate, and up-
drugs and supplies, and negotiate payment to-date information presented in a useful for-
contracts for services. Governing boards make mat. The diversity and extent of the modern
decisions about whether to invest in new busi- IDN is illustrated in . Fig. 16.1.
546 L. H. Vogel and W. C. Reed
Physician
Academic Group Practice
Community Institution Clinics
Patients
Hospital
Suppliers
Physician
Group Internet
Practice
Community
Hospital
Reference
Lab
Payers
Long Firewall
Term Home
Care Clinics Care
.. Fig. 16.1 Major organizational components of an practices, etc.). Components within the same geographic
integrated delivery network (IDN). A typical IDN area may have direct data connections, but increasingly
might include several components of the same type the Internet is the preferred way to connect organiza-
(e.g., clinics, community hospitals, physician group tional components
A 3 A Interface B 6
Engine
3 6
B C C
A B
Interface
4 8
Engine
A B
C D
4 12
Number of Potential Interfaces = n x 2
C D
.. Fig. 16.3 The introduction of the Interface Engine
Number of Potential Interfaces = n(n-1) (IE) made system interfaces much more manageable,
particularly so with the implementation of HL7 data
.. Fig. 16.2 The challenge of moving data from one messaging standards. With an IE, each additional sys-
system to another becomes complicated with the addi- tem only added two additional interfaces to the mix, one
tion of each new system. Considering that even small to send data and one to acknowledge receipt of the data
size hospitals may have several hundred applications,
interfacing is a major challenge. While not all systems
need to have two-way interfaces to every other system, series of strategies for managing multiple sys-
this figure illustrates the challenges that even small num- tems. Many of the vendors who got their start
bers of systems bring in health care interfacing subsequently found
new markets in financial services as well as
increased the complexity further, since it often other industries.
meant the need for two-way interfaces to each Continued advancements in departmen-
of the original three systems, for a total of tal systems were made possible as comput-
12 (. Fig. 16.2). With the prospect of inter- ing power increased and corresponding costs
faces increasing exponentially as new systems decreased through the advent of microcom-
were added (represented by the formula, I = n puters. Even smaller ancillary departments
(n − 1) where I represents the number of inter- such as Respiratory Therapy, which previ-
faces needed and n represents the number of ously could not justify a major computer
systems), it was clear that a new solution was acquisition, could now purchase dedicated file
needed to address the complexity and cost of servers and workstations and participate in
interfacing. the HCIS environment. Health care providers
16 In response, an industry niche was born in nursing units or at the bedside, physicians
which focused on creating a hardware and in their offices or homes, and managers in the
software application combination designed administrative offices could eventually access
to manage the interfacing challenges among and analyze data locally using what were ini-
disparate systems in the HCIS environment. tially termed microcomputers (later known as
Instead of each system having to interface to desktop personal computers or PCs).
every other system independently, an interface
engine served as the central connecting point
for all interfaces (. Fig. 16.3). Each system 16.2.3 Integration Challenges
had only to connect to the interface engine;
the engine would then manage the sending of In the early 1980s, researchers at the
data to and from any other system that needed University of California, San Francisco
it. The interface engine concept, which origi- (UCSF) Hospital successfully implemented
nated in health care, has given rise to a whole one of the first Local Area Networks (LANs)
Management of Information in Health Care Organizations
549 16
to support communication among several data values in ways that are incompatible with
of the hospital’s standalone systems. Using the definitions chosen by other areas of the
technology developed at the Johns Hopkins organization. The promise of sharing among
University, they connected minicomputers independent departments, entities, and even
that supported patient registration, medical independent institutions has increased the
records, radiology, the clinical laboratory, importance of defining clinical data standards
and the outpatient pharmacy. Interestingly, (see 7 Chap. 7) As noted earlier, some HCOs
each of the five computer systems was differ- pursued a best of breed strategy in which they
ent from the others: the computers were made chose the best system, regardless of vendor
by different manufacturers and ran different and technology, then worked to integrate that
operating systems but were able to commu- system into their overall HCIS environment.
nicate with each other through standardized Some HCOs modified this strategy by choos-
communications protocols. ing suites of related applications, e.g., select-
By the late 1980s, HCISs based on evolv- ing all ancillary systems from a single vendor
ing network-communications standards were (also known as best of cluster), thereby reduc-
being developed and implemented in HCOs. ing the overall number of vendors they work
As distributed computer systems, connected with and, in theory, reducing the costs and dif-
through electronic networks, these HCISs ficulty of integration.
consisted of a federation of independent Commercial software vendors have sup-
systems that had been tailored for specific ported this strategy by broadening their offer-
application areas. The computers operated ings of application suites and managing the
autonomously and shared data (and some- integration at the suite level rather than at the
times programs and other resources, such as level of individual applications. For example,
printers) by exchanging information over a Epic, one the of major enterprise-wide HCIS
local area network (LAN) using standard vendors, started in the late 1960s with a focus
protocols such as TCP/IP and Health Level on physician billing systems, and over the sub-
7(HL7) for communication and in many cases sequent 40 years evolved into a fully integrated
utilizing the interface engine strategy we dis- product suite encompassing ambulatory,
cussed earlier in 7 Sect. 16.2.2. One advan- inpatient, ancillary and billing functionality.
tage of LAN-connected distributed systems Cerner, another major enterprise-wide HCIS
was that individual departments could have vendor, started with a suite of products for
greater flexibility in choosing hardware and ancillary systems such as clinical laboratories,
software that optimally suited their specific pharmacy and radiology, and over a similar
needs. On the downside, the distribution time period evolved into a full-service product
of information processing capabilities and suite including inpatient, outpatient and bill-
responsibility for data among diverse systems ing functionality.
made the tasks of data integration, communi- With the increasing availability of single
cation, and security more difficult—a fact that vendor systems, the need to develop and
continues to the present day. Development of implement applications that support the
industry-wide standard network and interface specific functionality has diminished signifi-
protocols such as TCP/IP and HL7 has eased cantly. PC-based universal workstations are
the technical problems of electronic commu- now the norm and some HCOs and IDNs
nication considerably. support thousands of PCs in enterprise-wide
Still, there are problems to overcome in networked environments. The requirement for
managing and controlling access to a patient direct access to independent ancillary systems
database that is fragmented over multiple has been largely eliminated not only by enter-
computers, each with its own file structure and prise data networks, but by vendors offering
method of file management. Furthermore, integrated product suites which include sup-
when no global architecture or vocabulary port for general clinical as well as ancillary
standards are imposed on the HCISs, indi- service functionality. Where specialized sys-
vidual departments and entities may encode tems remain, interfaces join such systems to
550 L. H. Vogel and W. C. Reed
a core clinical system or to a centralized clini- separate computers with separate databases,
cal data repository. These permit the ability thus minimizing conflicts about priorities in
to access patient databases (by clinicians), services and investment. In addition, infor-
human resources documents (by administra- mation systems to support hospital functions
tors and employees), financial information and ambulatory care historically have, due to
(by administrators) and basic information organizational boundaries, developed inde-
about facilities, departments, and staff (by the pendently. Many hospitals, for example, have
public) through a single enterprise-wide data rich databases for inpatient data but maintain
network (See . Fig. 16.1). less information for outpatients. As fee-for-
Historically, as more information systems service reimbursement models continue to be
were added to the HCIS environment, the challenged for their focus on activity-driven
challenge of moving data from one system to care, alternatives such as ACOs, bundled pay-
another became overwhelming. In response, ments for services, and pay for performance
beginning in the 1980s, two unique develop- proposals will stimulate efforts toward greater
ments occurred: (1) the interface engine; and data integration.
(2) Health Level Seven (HL7), a standard for By the late 1980s, clinical information
the structure of the data messages that were system (CIS) components of HISs offered
being sent from one information system to clinically oriented capabilities, such as order
another (see 7 Chap. 7). writing and results communications. During
The creation of HL7 was yet another the same period, ambulatory medical record
response to the challenge of moving data systems (AMRSs) and practice management
among disparate health care systems. HL7 is systems (PMSs) were being developed to sup-
a health care-based initiative focused on the port large outpatient clinics and physician
development of data messaging standards for offices, respectively. These systems performed
the sharing of data among the many individ- functions analogous to those of hospital sys-
ual systems that comprise an HCIS. The basic tems, but were generally less complex, reflect-
idea was to use messaging standards so that ing the complexity of patient care delivered
data could be sent back and forth using stan- in outpatient settings. However, there has his-
dard formats within the HCIS environment. torically been little or no systems integration
Most of the departmental systems that were between hospital and ambulatory settings.
introduced at this time were the products of The historical lack of integration of data
companies focused on specific niche markets, from diverse sources creates a host of prob-
including laboratories, pharmacies and radi- lems. If clinical and administrative data are
ology departments. Consequently, there was stored on separate systems, then data needed
strong support for both the interface engine by both must either be entered separately
and the HL7 efforts as mechanisms to per- into each system, be copied from one system
16 mit smaller vendors to compete successfully
in the marketplace. In recent years, many of
to another, or data from both sources trans-
ferred to yet another location to be analyzed.
these pioneering vendors have been purchased In addition to the expense of redundant data
and their products included as components of entry and data maintenance incurred by these
larger single vendor product families. approaches (see also the related discussion
In hospitals, clinical and administrative for the health information infrastructure in
personnel have traditionally had distinct areas 7 Chap. 15), the consistency of information
of responsibility and performed many of their tends to be poor because data may be updated
functions separately. Thus, it is not surpris- in one place and not in the other, or informa-
ing that administrative and clinical data have tion may be copied incorrectly from one place
often been managed separately—administra- to another. In the extreme example, the same
tive data in business offices and clinical data in data may be represented differently in differ-
medical-records departments. When comput- ent settings. As we noted earlier within the
ers were first introduced, the hospital’s infor- hospital setting, many of these issues have
mation processing was often performed on been addressed through the development of
Management of Information in Health Care Organizations
551 16
automated interfaces to transfer demographic ing existing systems to allow data sharing.
data, orders, results, and charges between The capital investment required to pursue a
clinical systems and billing systems. Even with strategy of system-wide data integration can
an interface engine managing data among dis- be significant, and with ongoing challenges to
parate systems, however, an organization still reimbursement rates for both hospitals and
must solve the thorny issues of synchroniza- physicians, the funding to pursue this strat-
tion of data and comparability of similar data egy is often limited either due to competing
types. investment requirements (e.g., acquiring or
With the development of IDNs and other maintaining buildings and equipment) or the
complex HCOs, the sharing of data elements continued downward trend in reimbursement
among operating units becomes both more for services.
critical and more problematic. Data integra-
tion issues are further compounded in IDNs
by the acquisition of previously independent 16.2.4 Evolution to Enterprise-
organizations that have clinical and admin- Wide Health Care System
istrative information systems incompatible Information Systems
with those of the rest of the IDN. It is still
not unusual to encounter minimal automated If an HCO or IDN is to manage patient care
information exchange among organizations effectively, project a focused market identity,
even within an IDN. Patients register and and control its operating costs, it must perform
reregister at the physician’s office, diagnos- in a unified and consistent manner. For these
tic imaging center, ambulatory surgery facil- reasons, information technologies to support
ity, and acute-care hospital—and sometimes data and process integration are recognized
face multiple registrations even within a single as critical to an IDN’s or HCO’s operations.
facility. Each facility may continue to keep its From an organizational perspective, informa-
own clinical records, and shadow files may tion should be available when and where it is
be established at multiple locations with cop- needed; users must have an integrated view,
ies of critical information such as operative regardless of system or geographic boundar-
reports and hospital discharge summaries. ies; data must have a consistent interpreta-
Inconsistencies in these multiple electronic tion; and adequate security must be in place
and manual databases can result in inappro- to ensure access only by authorized personnel
priate patient management and inappropriate and only for appropriate uses. Unfortunately,
resource allocation. For example, medica- these criteria are much easier to describe than
tions that are first given to a patient while to meet.
she is a hospital inpatient may inadvertently Over time, changes in the health care
be discontinued when she is transported to a economic and regulatory environments have
rehabilitation hospital or nursing home. Also, radically transformed the structure, strategic
information about a patient’s known allergies goals, and operational processes of health
and medication history may be unavailable to care organizations through a gradual shift-
physicians treating an unconscious patient in ing of financial risk from third party payers
an emergency department. (e.g., traditional insurance companies such
The objectives of coordinated, high- as Blue Cross and Blue Shield, Medicare and
quality, and cost-effective health care cannot Medicaid programs that emerged in the 1960s
be completely satisfied if an organization’s and 1970s) to the providers themselves and
multiple computer systems operate in isola- in many cases directly to the patient through
tion. Unfortunately, free-standing systems higher deductibles, co-pays, benefit caps, etc.
within HCOs are still common, although This shifting of risk initially brought about
HCOs and IDNs are increasingly investing in a consolidation of health care providers into
the implementation of new more consistent integrated delivery networks (IDNs) in the
systems across all their facilities or in integrat- 1990s.
552 L. H. Vogel and W. C. Reed
ture. For example, the routing of paper of patient care, and they must be able to
medical records, a cumbersome process show that HCOs meet the safety require-
even within a single hospital, is an impos- ments for infectious disease management,
sibility for a regional network of providers buildings, and equipment. Employer and
trying to act in coordination. Similarly, it consumer groups are also joining the list
is neither timely nor cost effective to copy of external monitors.
and distribute hard copy documents to
all participants in a regionally distributed
organization. An HCO’s technology infra- 16.2.6 Process Integration
structure can enable information exchange
via web-based access to shared databases To be truly effective, information systems
and documents, collaborative platforms, must mesh smoothly with the people who
electronic mail, document-management use them and with the specific operational
systems, and on-line calendaring systems, workflows of the organization. But process
as well as providing and controlling access integration poses a significant challenge for
for authorized users at the place and time HCOs and for the HCIS’s as well. Today’s
that information is required. health care-delivery models represent a radi-
55 Documentation and reporting requirements. cal departure from historical models of care
The need to maintain records for future delivery. Changes in reimbursement and
reference or analysis and reporting makes documentation requirements often lead, for
up the fourth category of informational example, to changes in the responsibilities and
requirements. Some requirements are work patterns of physicians, nurses, and other
internally imposed. For example, a com- care providers; the development of entirely
plete record of each patient’s health sta- new job categories (such as care managers
tus and treatment history is necessary to who coordinate a patient’s care across facili-
ensure continuity of care across multiple ties or between encounters); and the more
providers and over time. External require- active participation of patients in their own
ments create a large demand for data col- personal health management (. Table 16.1).
lection and record keeping in HCOs (as Process integration is further complicated in
with mandated reporting of vaccination that component entities typically have evolved
records to public health agencies). As dis- different operational policies and procedures,
cussed in 7 Chap. 14, the medical record which can reflect different historical and lead-
is a legal document. If necessary, the ership experiences from one office to another,
courts can refer to the record to determine or in the extreme example, from one floor to
whether a patient received proper care. another within a single hospital.
Insurance companies require itemized The most progressive HCOs are develop-
16 billing statements, and medical records
substantiate the clinical justification of
ing new enterprise-wide processes for provid-
ing easy and uniform access to health services,
services provided and the charges submit- for deploying consistent clinical guidelines,
ted to them. The Joint Commission (JC), and for coordinating and managing patient
which certifies the qualifications and per- care across multiple care settings through-
formance of many health care organiza- out the organization. Integrated information
tions, has specific requirements concerning technologies are essential to supporting such
the content and quality of medical records, enterprise-wide processes. Mechanisms for
as well as requirements for organization- information management aimed at integrat-
wide information-management processes. ing operations across entities must address
Furthermore, to qualify for participation not only the migration from legacy systems
in the Medicare and Medicaid programs, but also the migration from legacy work pro-
the JC requires that hospitals follow cesses to new, more consistent and more stan-
standardized procedures for auditing the dardized policies and processes within and
medical staff and monitoring the quality across entities.
Management of Information in Health Care Organizations
555 16
.. Table 16.1 The changing health care environment and its implications for an IDN’s core competencies.
Columns 1 and 2 are used with permission with CSC. Column 3 is the authors’ addition
Characteristics Old care model Twentieth Century care Twenty-first Century care model
model
The introduction of new HCISs changes electronic record simultaneously). More funda-
the workplace. Research has shown that in mental business transformation is also possible
most cases the real value from an investment in with new technologies; for example, direct entry
information systems comes only when underly- of medication orders by physicians, linked with
ing work processes are changed to take advan- a decision-support system, allows immediate
tage of the new information technology (see checking for proper dosing and potential drug
. Figs. 16.4 and 16.5). At times, these changes interactions, and the ability to recommend less
can be substantial. The implementation of a new expensive drug substitutes (Vogel 2003).
system offers an opportunity to rethink and rede- Few health care organizations today have
fine existing work processes to take advantage of the time or resources to develop entirely new
the new information- management capabilities, information systems or redesign processes on
thereby reducing costs, increasing productivity, their own; therefore, most opt to purchase
or improving service levels. For example, pro- commercial software products and to use con-
viding electronic access to information that was sultants to assist them in the implementation
previously accessible only on paper can shorten of industry “best practices”. Although these
the overall time required to complete a multi- commercial systems allow some degree of
step activity by enabling conversion of serial custom tailoring, they also reflect an under-
processes (completed by multiple workers using lying model of work processes that may have
the same record sequentially) to concurrent pro- evolved through development in other health
cesses (completed by the workers accessing an care organizations with different underlying
556 L. H. Vogel and W. C. Reed
.. Fig. 16.4 The process of managing the manual creation of orders requesting services on behalf of patients in a
hospital involves numerous tasks performed not only by the ordering physician, but by nursing and clerical staff
.. Fig. 16.5 The implementation of an electronic phy- such a system will only be successful if other “comple-
sician order entry system reduces the number of tasks mentary” changes are made to both the workflow of
that a physician needs to perform to enter an order, but staff and the responsibilities of the IS Department
HCIS functionality needed by hospitals. This tained commitment of resources to use that
has led to the demise of the “best of breed” technology well; dedicated leadership with
strategy we mentioned earlier, as hospitals the willingness to make difficult, sometimes
focused more on single vendor solutions. The unpopular decisions; education; and possi-
market for ambulatory systems, on the other bly new performance incentives to overcome
hand, has remained highly fragmented, with cultural inertia and politics. Government
as many as 700 vendors competing for market incentives to stimulate HCOs toward the
share. Meaningful Use of information technology,
To meet the continually evolving financial which emerged from the 2010 Health care
and quality documentation requirements of Reform legislation are a recent example of
today’s health care environment, HCOs must attempts to bring process integration and data
continually evolve as well—and the analogy integration together.
between changing an HCO and turning an
aircraft carrier seems apt. Although an HCO’s
business plans and information-systems strat- 16.2.7 Security and Confidentiality
egies may be reasonable and necessary, chang- Requirements
ing ingrained organizational behavior can
be much more complex than changing the HCISs are one of the most frequent targets
underlying information systems. Technology of hackers. They seek to gain access to health
capabilities often exceed an HCO’s ability to care data, which contains more personalized
use them effectively and efficiently. Successful data (and hence is considered more valu-
process integration requires not only success- able) than what is typically collected in other
ful deployment of the technology but also sus- industries. Breeches of health care data stored
558 L. H. Vogel and W. C. Reed
in both provider organizations and insur- Computer systems can be designed to pro-
ance companies have increased significantly vide security, but only people can promote the
over the past several years—often exceeding trust necessary to protect the confidentiality
breeches of financial organization data found of patients’ clinical information. In fact, most
in organizations such as banks. In addition, breeches and inappropriate disclosures stem
the task of breaking into and stealing health from human actions rather than from com-
care data has been superseded by ransom- puter system failures. To achieve the goal of
ware--hackers blocking access to the data and delivering coordinated and cost-effective care,
demanding payment (often in bitcoin, which clinicians need to access information on spe-
tends to be less traceable than cash payments). cific patients from many different locations.
Using ransomware is much easier for hackers However, it is difficult to predict in advance
since data does not need to be moved and then which clinicians will need access to which
offered for sale; payments come directly from patient data and from which locations. And
the organization that was hacked in order to unfortunately, there is an inverse relationship
obtain the code needed to unblock access. between ease of access and security robust-
The protection of health information ness. Therefore, an HCIS must strike a bal-
from unwanted or inappropriate use is gov- ance between restricting information access,
erned not only by the trust of patients in their enabling health care workers to do their jobs,
health providers but also by law. In accor- and ensuring the accountability of the users
dance with the Health Insurance Portability of patient information.
and Accountability Act (HIPAA) of 1996 With federal requirements to provide
(7 Chap. 12), the Secretary of Health and patients with access to their data through
Human Services recommended that “Congress portals, the risk of inappropriate access only
enact national standards that provide funda- increases. In addition, more and more devices
mental privacy rights for patients and define (e.g., ventilators, monitoring machines,
responsibilities for those who serve them.” IV pumps) are connected to HCO’s inter-
This law and subsequent federal regulations nal communications networks, making the
now mandate standardized data transactions Internet of Things (IoT)-- a reality in health
for sending data to payer organizations, the care. To build trust with its patients and meet
development and adherence to formal policies HIPAA requirements, an HCO should adopt
for securing and maintaining access to patient a three-pronged approach to securing infor-
data, and under privacy provisions, prohibit mation. First, the HCO needs to designate
disclosure of patient-identifiable information a security officer (and typically a privacy
by most providers and health plans, except as officer as well) and develop uniform security
authorized by the patient or explicitly permit- and confidentiality policies, including speci-
ted by legislation. Subsequent updates to the fication of sanctions, and to enforce these
16 HIPAA regulations have strengthened consid-
erably the requirements for security and pri-
policies rigorously. Second, the HCO needs
to train employees so they understand the
vacy protections and have also given patients appropriate uses of patient-identifiable infor-
the right to pursue actions against both orga- mation and the consequences of violations.
nizations and individuals when they feel that Third, the HCO must use electronic tools
their personal information has been compro- such as intrusion detection, access controls
mised. HIPAA also provides consumers with and audit trails not only to discourage misuse
significant rights to be informed about how of information, but also to inform employees
and by whom their health information will and patients that people who access confi-
be used, and to inspect and sometimes amend dential information without proper authori-
their health information. Stiff criminal pen- zation or a “need to know”, can be tracked
alties including fines and possible imprison- and will be held accountable.
ment are associated with noncompliance or
the knowing misuse of patient-identifiable
information.
Management of Information in Health Care Organizations
559 16
16.2.8 he Impact of Health Care
T costs of storing, retrieving, and transport-
Information Systems ing charts in the medical-records depart-
ment.
On average, health care workers in adminis- 55 Productivity enhancements. A second area
trative departments spend about three-fourths of benefit from an HCIS comes in the form
of their time handling information; work- of improved productivity of clinicians and
ers in nursing units spent about one-fourth other staff. With continuing (and at times
of their time on these tasks. The fact is that increasing) constraints on reimbursements,
information management in health care orga- HCOs are continually faced with the chal-
nizations, even with significant computeriza- lenge of doing more with less. Providing
tion, is a costly activity in terms of both time information systems support to staff can
and money. The collection, storage, retrieval, in many cases enable them to manage a
analysis, and dissemination of the clinical and larger variety of tasks and data than would
administrative information necessary to sup- otherwise be possible using strictly manual
port the organization’s daily operations, to processes. Interestingly, in some cases hos-
meet external and internal requirements for pital investments in an HCIS support the
documentation and reporting, and to support productivity improvement of staff that are
short-term and strategic planning remain not employed by the hospital, namely the
important and time-consuming aspects of the physicians, and can even extend to payers
jobs of health-care workers. by lowering their costs. One of the major
Today, the justifications for implement- challenges with introducing a new HCIS is
ing HCISs include cost reduction, productiv- that the productivity of users may actually
ity enhancements, and quality and service decrease in the initial months of the imple-
improvement, as well as strategic consider- mentation. With complex clinical applica-
ations related to competitive advantage, patient tions, learning new ways of working can
expectations, and regulatory compliance: lead to high levels of user dissatisfaction in
55 Cost reduction. Much of the historical addition to lowered productivity.
impetus for implementing HCISs was their 55 Quality and service improvement. As HCISs
potential to reduce the costs of informa- have broadened in scope to encompass
tion management in hospitals and other support for clinical processes, the ability
facilities largely by reducing the number of to improve the quality of care has become
employees. HCOs continue to make tacti- an additional benefit. Qualitative benefits
cal investments in information systems to of HCISs include improved accuracy and
streamline administrative processes and completeness of documentation, reduc-
departmental workflows. Primary benefits tions in the time clinicians spend docu-
that may offset some information-systems menting (and associated increases in time
costs include reductions in labor require- spent with patients), fewer drug errors
ments, reduced waste (e.g., dated surgical and quicker response to adverse events,
supplies that are ordered but unused or and improved provider-to-provider com-
food trays that are delivered to the wrong munication. Through telemedicine and
destination and therefore are wasted), and remote linkages (see 7 Chap. 20), HCOs
more efficient management of supplies can expand their geographical reach and
and other inventories. Large savings can improve delivery of specialist care to rural
be gained through efficient scheduling of and outlying areas. Once patient data are
expensive resources such as surgical suites converted from a purely transaction for-
and imaging equipment. In addition, mat to a format better suited for analytic
HCISs can help to eliminate inadvertent work, the use of clinical decision-support
ordering of duplicate tests and procedures. systems in conjunction with a clinically
Once significant patient data are available focused HCIS can produce impressive ben-
online, information systems can reduce the efits, namely improving the quality of care
560 L. H. Vogel and W. C. Reed
while reducing costs (7 Chap. 24) (Bates economy. State and federal regulatory agen-
and Gawande 2003; James and Savitz cies perform a variety of oversight activities,
2011; Goldzweig, et al. 2009; Himmelstein and these require increasingly sophisticated
and Woolhandler 2010; McCullough et al. and responsive HCISs to provide the nec-
2010; Castaneda et al. 2015). essary reports. For example, the Food and
55 Competitive advantage. Information tech- Drug Administration now mandates the use
nologies must be deployed appropriately of barcodes on all drugs. Similarly, HIPAA
and effectively; however, with respect to rules specify how personal health informa-
HCISs, the question is no longer whether tion must be managed as well as the required
to invest, but rather how much and what content and format for certain electronic
to buy. Although some organizations still data transactions for those HCOs that
attempt to cost justify all information- exchange data electronically. OSHA, the
systems investments, many HCOs have Department of Labor, the Environmental
recognized that HCISs are “enabling Protection Agency, the Nuclear Regulatory
technologies” which means that the value Commission, and a host of other agencies
comes not from the system itself but from all have an interest in seeing that the health
what it “enables” the organization to do care provided by HCOs is consistent with
differently and better (Vogel 2003). If standards of safety and fairness.
workflow and processes are not changed 55 Consumerism and patient expectations.
to take advantage of the technology, the As medical breakthroughs promise bet-
value of the investment will largely go ter outcomes for patients, the patients as
unrealized. And it is not just the ratio of consumers are becoming more involved.
financial benefits to costs that is important; Federal mandates have driven the cre-
access to clinical information is necessary ation of Patient Portals, essentially dedi-
not only to carry out patient manage- cated web sites for patients to access and
ment, but also to attract and retain the loy- in some cases contribute to their own
alty of physicians who care for (and thus medical records. While there are limita-
control much of the HCO’s access to) the tions to patient involvement (e.g., to date
patients. The long-term benefits of clini- the number of patients accessing and
cal systems include the ability to influence contributing through portals has not met
clinical practices by reducing large unnec- expectations, and older, sicker patients
essary variations in medical practices, to often have limited abilities to access and/
improve patient outcomes, and to reduce or understand their own data)—it is likely
costs—although these costs might be more that trends toward more patient involve-
broadly economic and societal than related ment will c ontinue.
to specific reductions for the hospital itself
16 (Leatherman et al. 2003; James and Savitz
2011; Gottlieb et al. 2015). Physicians ulti-
mately control the great majority of the
16.2.9 Managing Information
resource-utilization decisions in health Systems in a Changing
care through their choices in prescrib- Health Care Environment
ing drugs, ordering diagnostic tests, and
referring patients for specialty care. Thus, Despite the importance of integrated informa-
providing physicians with access to infor- tion systems, implementation of HCISs has
mation on “best practices” based on the proved to be a daunting task, often requiring
latest available clinical evidence, as well a multiyear capital investment of hundreds of
as giving them other clinical and financial millions of dollars and forcing fundamental
data to make appropriate decisions, is an changes in the types and ways that health care
essential HCIS capability. professionals perform their jobs. To achieve
55 Regulatory compliance. Health care is among the potential benefits, HCOs must plan care-
the most heavily regulated industries in our fully and invest wisely. The grand challenge
Management of Information in Health Care Organizations
561 16
for an HCO is to implement an HCIS that is 16.2.11 Changing Culture
sufficiently flexible and adaptable to meet the
changing needs of the organization. Given In the current health care environment, physi-
the rapidly changing environment and the cians are confronted with significant obstacles
multiyear effort involved, people must be to the practice of medicine as they have his-
careful to avoid implementing a system that torically performed it. Physicians’ long his-
is obsolete functionally or technologically tory of entrepreneurial practice is changing
before it becomes operational. Success in as more and more physicians work directly as
implementing an HCIS entails consistent and employees of HCOs and fewer and fewer own
courageous handling of numerous technical, their own practices. As a result, physicians
organizational, and political challenges. face significant adjustments as they are con-
fronted by pressures to practice in accordance
with institutional standards aimed at reduc-
16.2.10 Changing Technologies ing variation in care, and to focus on the costs
of care regardless of whether those costs are
As we discussed in 7 Chap. 6, dramatic borne by HCOs or by third party payers. They
changes in computing and networking technol- are expected to assume responsibility not sim-
ogies are continuing to occur. These advances ply for healing the sick, but for the wellness
are important in that they allow quicker and of people who come to them not as patients
easier information access, less expensive com- but as members of health plans and health
putational power and data storage, greater maintenance organizations. In addition, they
flexibility, and other performance advantages. must often work as members of collaborative
A major challenge for many HCOs is how patient-care teams. The average patient length
to decide whether to support a best of breed of stay in a hospital is decreasing; at the same
strategy, with its requirement either to upgrade time, the complexity of the care provided
individual systems and interfaces to newer both during and after discharge is increasing.
products or to migrate from their patchwork The time allotted for an individual patient
of legacy systems to a more integrated systems visit in an ambulatory setting is decreasing
environment. Such migration requires integra- as individual clinicians face economic incen-
tion and selective replacement of diverse sys- tives to increase the number of patients for
tems that are often implemented with closed whom they care each day. Some HCOs, aided
or nonstandard technologies and medical by federal funding incentives, are now insti-
vocabularies. Unfortunately, the trade-off tuting pay-for performance incentives to
between migrating from best of breed to more reward desired work practices. At the same
integrated systems is that vendors offering time, it is well known that the amount of
more integrated approaches seldom match the knowledge about disease diagnosis and treat-
functionality of historical best of breed envi- ment increases significantly each year, with
ronments, although with each new software whole new areas of medicine being added
version the gap lessens. As a result, best of from major breakthroughs in areas such as
breed strategies are becoming less of an option genomic and imaging research. To cope with
since commercial vendors are broadening the increasing workload, greater complexity
and deepening the scope of their application of care, extraordinary amounts of new medi-
suites to minimize the challenges of building cal knowledge, new skills requirements, and
and managing interfaces and to protect their the wider availability of medical knowledge
market share. In a sense, it is the information to consumers through the Internet, both cli-
content of the systems and the ability to imple- nicians and health executives must become
ment them that is much more important than more effective information managers, and the
the underlying technology—as long as the supporting HCISs must meet their ever more
data are accessible, the choice of specific tech- complex workflow and information require-
nology is less critical. ments. As the health care culture and the roles
562 L. H. Vogel and W. C. Reed
Clinical
Financial Systems
Laboratory
General Ledger
Pharmacy
Cost Accounting
Radiology
Accounts payable
Payroll/Personnel
Patient Accounting
ADT
Mainframe
.. Fig. 16.6 An example of an information systems mix of information systems that pose integration and
environment for a small integrated delivery network information management challenges for the organiza-
(IDN). Even this relatively simply IDN has a complex tion
sions is often more important than what spe- patient satisfaction. As described in 7 Sect.
cific decisions are made (Weill and Ross 2004; 16.1, the HCISs support a variety of functions,
Haddad et al. 2018). Because of the dynamic ranging from the delivery and management
nature of both health care business strategies of patient care to the administration of the
and the supporting technologies, many HCOs HCO. From a functional perspective, HCISs
16 have seen the timeframes of their strategic typically consist of components that support
information-management thinking shrink five distinct purposes: (1) patient management
from 5 years to three, and then be changed yet and billing, (2) ancillary services, (3) care
again through annual updates. delivery and clinical documentation, (4) clini-
cal decision support, (5) institutional financial
and resource management (. Fig. 16.7).
16.3 Functions and Components
of a Health Care Information
System 16.3.1 Patient Management
and Billing
Carefully designed computer-based informa-
tion systems can increase the effectiveness and Systems that support patient management
productivity of health professionals, improve functions perform the basic HCO operations
the quality and reduce the costs of health related to patients, such as registration, sched-
services, and improve levels of service and of uling, admission, discharge, transfer among
Management of Information in Health Care Organizations
565 16
Population Health,
Departmental Clinical Systems Enhanced Clinical
Systems (Labs, (Order Entry, Decision Support,
ADT, Billing Radiology, Clinical Artficial Intelligence,
Systems Pharmacy) Documentation) Robotic Surgery, IoT
.. Fig. 16.7 The evolution of computing systems in tals has followed a path from financial systems to
hospitals has followed a path that parallels the evolution departmental systems to systems designed specifically to
of computing systems in general. From mainframes to enhance the productivity and raise the quality of health
minicomputers to desktops, and more recently mobile care services
devices, the purpose and function of systems in hospi-
locations, and billing. Historically within systems provide a common reference base for
HCOs, maintenance of the hospital census the basic patient demographic data needed
and a patient billing system were the first tasks by these systems. Without access to the cen-
to be automated—largely because a patient’s tralized database of patient financial, demo-
location determined not only the daily room/ graphic, registration and location data, these
bed charges (since an ICU bed was more subsystems would have to maintain duplicate
expensive than a regular medical/surgical bed) patient records. In addition, the transmission
but where medications were to be delivered, of registration data can trigger other activi-
and where clinical results were to be posted. ties, such as notification of hospital house-
Today, virtually all hospitals and ambula- keeping when a bed becomes available after a
tory centers and many physician offices use a patient is discharged. The billing function in
computer-based master patient index (MPI) these systems serves as a collection point for
to store patient-identification information all the chargeable patient activity that occurs
that is acquired during the patient-registration in a facility, including room/bed charges,
process, and link to simple encounter-level ancillary service charges, and supplies used
information such as dates and locations where during a patient’s stay.
services were provided. The MPI can also be Scheduling in a health care organization is
integrated within the registration module of complicated because patient load and resource
an ambulatory care or physician-practice sys- utilization can vary by day, week, or season or
tem or even elevated to an enterprise master even through the course of a single day sim-
patient index (EMPI) across several facilities. ply due to chance, emergencies that arise, or
Within the hospital setting, the census is main- to patterns of patient and physician behavior.
tained by the admission–discharge–transfer Effective resource management requires that
(ADT) module, which is updated whenever a the appropriate resources be on hand to meet
patient is admitted to the hospital, discharged such fluctuations in demand. At the same
from the hospital, or transferred from one bed time, resources should not remain unneces-
to another. sarily idle since that would result in their inef-
Registration and patient census data serve ficient use. The most sophisticated scheduling
as a reference base for the financial programs systems have been developed for the operat-
that perform billing functions. When an HCIS ing rooms and radiology departments, where
is extended to other patient-care settings— scheduling challenges include matching the
e.g., to the laboratory, pharmacy, and other patient not only with the providers but also
ancillary departments—patient-management with special equipment and support staff
566 L. H. Vogel and W. C. Reed
such as technicians. Patient-tracking applica- system). Second, the ancillary systems contrib-
tions monitor patient movement in multistep ute major data components to online patient
processes; for example, they can monitor, esti- records, including laboratory-test results and
mate, and manage patient wait times in the pathology reports, medication profiles, digi-
emergency department. tal images (see 7 Chap. 22), records of blood
Within a multi-facility HCO, the basic tasks orders and usage, and various transcribed
of patient management are compounded by the reports including history and physical exami-
need to manage patient care across multiple set- nations, operating room and radiology reports.
tings, some of which may be supported by inde- HCOs that consolidate ancillary functions
pendent information systems. Is the Patricia outside hospitals to gain economies of scale—
C. Brown who was admitted last month to for example, creating outpatient diagnostic
Mountainside Hospital the same Patsy Brown imaging centers and reference laboratories—
who is registering for her appointment at the increase the complexity of integrated patient
Seaview Clinic? Integrated delivery networks management, financial, and billing processes.
ensure unique patient identification either
through conversion to common registration
systems or, more frequently, through implemen- 16.3.3 are Delivery and Clinical
C
tation of an EMPI that links patient identifiers Documentation
and data from multiple registration systems.
Electronic medical record (EMR) systems that
support care delivery and clinical documenta-
16.3.2 Ancillary Services tion are discussed at length in 7 Chap. 14.
Although comprehensive EMRs are the ulti-
Ancillary departmental systems support mate goal of most HCOs, many organizations
the information needs of individual clinical today are still building more basic clinical-
departments within an HCO. From a sys- management capabilities. Automated order
tems perspective, those areas most commonly entry and results reporting are two important
automated are the laboratory, pharmacy, radi- functions provided by the clinical components
ology, blood-bank, operating rooms, and med- of an HCIS. Health professionals can use the
ical-records departments, but can also include HCIS to communicate with ancillary depart-
specialized systems to support cardiology (for ments electronically, eliminating the easily
EKGs), respiratory therapy and social work. misplaced paper slips or the transcription
Such systems serve a dual purpose within an errors often associated with translating hand-
HCO. First, ancillary systems perform many written notes into typed requisitions, thus
dedicated tasks required for specific depart- minimizing delays in conveying orders. The
mental operations. Such tasks include gener- information then is available online, where it
16 ating specimen-collection lists and capturing
results from automated laboratory instru-
is easily accessible by any authorized health
professional that needs to review a patient’s
ments in the clinical laboratory, printing medication profile or previous laboratory-test
medication labels and managing inventory in results. Ancillary departmental data represent
the pharmacy, and scheduling examinations an important subset of a patient’s clinical
and supporting the transcription of image record. A comprehensive clinical record, how-
interpretations in the radiology department. ever, also includes various data that clinicians
In addition, information technology coupled have collected by questioning and observing
with robotics can have a dramatic impact on the patient, including the history and physi-
the operation of an HCO’s ancillary depart- cal report, progress notes and problem lists.
ments, particularly in pharmacies (to sort and In the hospital, an HCIS can help health per-
fill medication carts) and in clinical laborato- sonnel perform an initial assessment when a
ries (where in some cases the only remaining patient is admitted to a unit, maintain patient-
manual task is the collection of the specimen specific care plans, chart vital signs, maintain
and its transport to the laboratory’s robotic medication- administration records, record
Management of Information in Health Care Organizations
567 16
diagnostic and therapeutic information, docu- developed, clinical decision-support systems
ment patient and family teaching, and plan for can use the information stored there to moni-
discharge (also see 7 Chap. 17). Many orga- tor patients and issue alerts, to make diagnos-
nizations have developed diagnosis-specific tic suggestions, to provide limited therapeutic
clinical pathways that identify clinical goals, guidance, and to provide information on med-
interventions, and expected outcomes by time ication costs. These capabilities are particu-
period. Using clinical pathways, case man- larly useful when they are integrated with
agers or care providers can document actual other information-management functions. For
versus expected outcomes and are alerted example, a useful adjunct to computer-based
to intervene when a significant unexpected physician order-entry (CPOE) is a decision-
event occurs. More hospitals are now imple- support program that alerts physicians to
menting systems to support what are called patient food or drug allergies; helps physi-
closed loop medication management systems cians to calculate patient-specific drug-dosing
in which every task from the initial order for regimens; performs advanced order logic, such
medication to its administration to the patient as recommending an order for prophylactic
is recorded in an HCIS—one outcome of antibiotics before certain surgical procedures;
increased attention to patient safety issues. automatically discontinues drugs when appro-
With the shift toward delivering more care priate or prompts the physician to reorder
in outpatient settings, clinical systems have them; suggests more cost-effective drugs with
become more common in ambulatory clinics the same therapeutic effect; or activates and
and physician practices. Numerous vendors displays applicable clinical-practice guidelines
have introduced software compatible with (see 7 Chap. 24). Clinical-event monitors inte-
smart phones, tablets, and other mobile devices grated with results-reporting applications can
designed specifically for physicians in ambula- alert clinicians to abnormal results and drug
tory settings, so that they can access appropriate interactions by electronic mail, text message
information even as they move from one exam or page. In the outpatient setting, these event
room to another. Such systems allow clinicians monitors may produce reminders to provide
to record problems and diagnoses, symptoms preventive services such as screening mammo-
and physical examinations, medical and social grams and routine immunizations. The same
history, review of systems, functional status, event monitors might trigger access to the
active and past prescriptions, provide access HCO’s approved formulary, displaying infor-
to therapeutic and medication guidelines, etc. mation that includes costs, indications, contra-
The most successful systems are integrated indications, approved clinical guidelines, and
with a practice management system, providing relevant online medical literature (Kaushal
additional support for physician workflow and et al. 2003). Unfortunately, an overabundance
typical clinic functions, for example, by docu- of alerts has also caused care givers to experi-
menting telephone follow-up calls or printing ence alarm fatigue resulting in them ignoring
prescriptions. In addition, specialized clinical potentially critical warnings.
information systems have been developed to
meet the specific requirements of intensive-care
units (see 7 Chap. 21), long-term care facilities, 16.3.5 Financial and Resource
home- health organizations, and specialized Management
departments such as cardiology and oncology.
Financial and administrative systems assist
with the traditional business functions of an
16.3.4 Clinical Decision Support HCO, including management of the payroll,
human resources, general ledger, accounts pay-
Clinical decision-support systems (7 Chap. able, and materials purchasing and inventory.
24) directly assist clinical personnel in data Most of these data-processing tasks are well
interpretation and decision-making. Once the structured and have been historically labor
basic clinical components of an HCIS are well intensive and repetitious—ideal opportunities
568 L. H. Vogel and W. C. Reed
for substitution with computers. Furthermore, adjusted outcomes of that provider’s patients
with the exception of patient-billing functions, such as their rate of hospital readmission and
the basic financial tasks of an HCO do not dif- mortality by diagnosis. Such systems are also
fer substantially from those of organizations being used by government bodies and con-
in other industries. Not surprisingly, financial sumer advocate organizations as they publi-
and administrative applications have typically cize their findings, often through the Internet.
been among the first systems to be standard- Contract-management systems have capa-
ized and centralized in IDNs. bilities for estimating the costs and payments
Conceptually, the tasks of creating a associated with potential managed care con-
patient bill and tracking payments are straight- tracts and comparing actual with expected
forward, and financial transactions such as payments based on the terms of the contracts.
claims submission and electronic funds trans- More advanced managed-care information
fer have been standardized to allow electronic systems handle patient triage and medical
data interchange (EDI) among providers management functions, helping the HCOs to
and payers. In operation, however, patient direct patients to appropriate health services
accounting requirements are complicated by and to proactively manage the care of chroni-
the myriad and oft-changing reimbursement cally ill and high-risk patients. Health plans,
requirements of government and third-party and IDNs that incorporate a health plan,
payers. These requirements vary substantially also must support payer and insurance func-
by payer, by insurance plan, by type of facil- tions such as claims administration, premium
ity where service was provided, and often by billing, marketing, and member services.
state. As the burden of financial risk for care As HCOs continue to seek ways to reduce
has shifted from third party payers to provid- their expenses, they are introducing both more
ers (through per diem or diagnosis-based reim- automation and more integration into “back
bursements), these systems have become even office” systems. Enterprise Resource Planning
more critical to the operation of a successful (ERP) systems integrate human resources
HCO. As another example, managed care con- functions with payroll, accounts receivable
tracts add even more complexity, necessitating and payment systems and overall supply man-
processes and information systems to check a agement. ERP systems can be effectively inte-
patient’s health-plan enrollment and eligibility grated with other components of the HCIS,
for services, to manage referrals and preautho- e.g., nurse staffing systems and surgical sched-
rization for care, to price claims based on nego- uling systems, in an attempt to use informa-
tiated contracts, and to create documentation tion to optimize operations and resource
required to substantiate the services provided. utilization. Implementing these systems can
As HCOs increasingly go “at risk” for bring challenges similar to clinical systems,
delivery of health services by negotiating per albeit with different types of employees. Long
16 diem, diagnosis-based, bundled and capitated
payments, their incentives need to focus not
term employees often develop their own styles
of managing information based on historical
only on reducing the cost per unit service patterns and preferences, much of which must
but also on maintaining the health of mem- change when an ERP system is implemented.
bers while using health resources effectively
and efficiently. Similarly, the HCO’s scope of
accountability broadens from a relatively small 16.4 Forces That Will Shape
population of sick patients to a much larger
population of plan members (such as might be the Future of Health Care
found in ACOs), most of whom are still well. Information Systems
Provider-profiling systems support uti- Management
lization management by tracking each pro-
vider’s resource utilization (costs of drugs As we have discussed throughout this chap-
prescribed, diagnostic tests and procedures ter, the changing landscape of the health-care
ordered, and so on) compared with severity- industry and the strategic and operational
Management of Information in Health Care Organizations
569 16
requirements of HCOs and IDNs have accel- of change—IDNs reorganize, merge, uncouple,
erated the acquisition and implementation acquire, sell off, and strategically align services
of HCISs. The acquisition and implementa- and organizational units in a matter of weeks.
tion of Electronic Medical Records (EMRs) While information technology is itself chang-
have been a particular focus, especially with ing with accelerating frequency, today’s state-
the availability of federal stimulus fund- of-the-art systems (computer systems and
ing through the provisions of the Health people processes) typically require months or
Information Technology for Economic and years to build and refine.
Clinical Health (HITECH) Act under the The continuing pressures of the market
American Recovery and Reinvestment Act place, including the downward trend in reim-
of 2009 (ARRA). Although there are many bursements and the evolving efforts by the
obstacles to implementation and acceptance Federal government to change the payment
of smoothly functioning, fully integrated structure from Procedure-based Payments
HCISs, few people today would debate the to Value-Based Payments has led to merger
critical role that information technology plays and acquisition activity across the health care
in an HCO’s success or in an IDN’s efforts at industry. Although the outright merger or
clinical and operational management. acquisition of one HCO by another HCO is
We have emphasized the dynamic nature of still relatively infrequent, HCO’s and IDNs
today’s health care environment and the asso- have increasingly targeted physician prac-
ciated implications for HCISs. A host of new tices for acquisition in order both to control
requirements loom that will challenge today’s the flow of patients to inpatient beds and to
available solutions. We anticipate additional increase the standardization of care proto-
expectations and requirements associated with cols across physician groups. In addition,
the changing organizational landscape, techno- in situations in which an outright merger or
logical advances, and broader societal changes. acquisition seems likely to be unsuccessful,
affiliations between HCOs are an increasingly
common strategy to ensure better coordina-
tion across historically competing boundaries.
16.4.1 Changing Organizational All too frequently, business deals are cut with
Landscape insufficient regard to the cost and time required
to create the supporting information infrastruc-
Although the concepts underlying HCOs and ture. For IDNs even in the best of circumstances,
IDNs are no longer new, the underlying organi- the cultural and organizational challenges of
zational forms and business strategies of these linking diverse users and care-delivery settings
complex organizations continue to evolve. will tax their ability to change their information
The success of individual HCOs varies widely. systems environments quickly enough. These
Some, serving target patient populations such issues will increase in acuity as operational bud-
as those with heart disease or cancer or age- gets continue to shrink—today’s HCOs and
defined groups such as children, have been IDNs are spending significant portions of their
relatively more successful financially than those capital budgets on information-systems invest-
attempting to serve patients across a wide range ments. In turn, these new investments translate
of illnesses or those attempting to combine into increased annual operating costs (costs of
diverse missions of clinical care, teaching and regular system upgrades, maintenance, user
research. IDNs, on the other hand, have by and support, and staffing). Still most health care
large failed to achieve the operational improve- organizations devote at most 3–4% of their total
ments and cost reductions they were designed revenues to their information systems operating
to deliver. It is possible that entirely new forms budgets; in other information-intensive indus-
of HCOs and IDNs will emerge in the coming tries (e.g., financial services, air transportation),
years. Key to understanding the magnitude of the percentage of operating budgets devoted to
the information systems challenge for IDNs in information technology investment can be three
particular is recognizing the extraordinary pace to four times higher (Weil 2001).
570 L. H. Vogel and W. C. Reed
16.4.2 Changes Within the HCIS Health Care (Institute of Medicine 1991) an
Organization entire new era of information technology began
to emerge with the goal of capturing all clinical
Information technology in the health care activity electronically, giving rise to Electronic
industry became a major focus for invest- Medical Records. Followed a little more than
ment starting in the late 1980s and early a decade later by an executive order from then
1990s, as developments in both hardware and President George Bush with the goal of cre-
software technology led both to increased ating an electronic medical record within a
affordability and enhanced functionality. decade, and then the financial incentives in the
We noted earlier the evolution from main- 2009 HITECH legislation to induce hospitals
frames to minicomputers to networked PCs, and physician offices to implement EMRs, the
as well as the transformation of application complexity of the health care computing envi-
suites from enterprise-wide financial and bill- ronment increased d ramatically. In addition,
ing systems to more departmentally-focused with the new emphasis on clinical computing,
information technology investment deci- physicians as the primary users of clinical sys-
sions in provider organizations. This evolu- tems became much more invested not only in
tion led to an ever-more complex computing what functionality was being purchased, but
environment encompassing a diversity of who was leading the purchasing process.
hardware and software and networking capa- During the 1990s and into the 2000s, an
bilities to enable everything to work together. entire generation—including clinicians--be-
Organizational capabilities to manage these came more sophisticated users of computer
changes evolved as well, from a small group technology. As clinical systems came to be
of technicians managing a mainframe envi- many organizations’ largest information
ronment to a growing number not only of technology investment, clinicians sought to
technicians, but groups of dedicated applica- exercise more leadership in system selection
tion support staff, networking engineers and and implementation. The position of Chief
specialists in desktop support. Information Medical Information (or Informatics) Officer
Services Departments began to replace “Data (CMIO) was created, at times reporting to
Processing Departments” and their reach and the Chief Information Officer (CIO), but at
importance to the organization. Leaders were other times reporting outside of the historical
no longer department directors but “Chief information technology chain of command.
Information Officers” (CIOs), charged with Additional recognition for computer-savvy
responsibilities as diverse as keeping the sys- clinicians came with the creation in 2011
tems running on a daily basis, overseeing the of a formal board certification in Clinical
seemingly endless upgrades to both hardware Informatics. In addition, with the increasing
and software, aligning information technol- importance of clinical computing capabili-
16 ogy investments with business strategy, and ties, provider organizations began looking to
recognize formally the physician informat-
keeping the personal data of patients secure
from unauthorized access. As “CIOs” they ics leaders by appointing them to the CIO
were expected to lead the procurement of position. More traditional CIOs with back-
new hardware and software in conjunction grounds in business or computing became in
with other “C-Suite” executives or physician some organizations relegated to operational
leaders of the various departments seeking to roles—keeping the data center and the net-
introduce computers into their work environ- work running on the 24-hour basis that was
ments. Most often these were systems sup- essential in a health care environment. Some
porting administrative and financial processes organizations created separate roles for
or ancillary departments like the laboratory Chief Technology Officers (CTOs) or Chief
or the radiology or pharmacy departments. Innovation Officers that had little or no opera-
However, with the 1991 publication of The tional responsibilities. Additionally, as HCO’s
Institute of Medicine’s The Computer-Based began to recognize the impact of social media
Patient Record: An Essential Technology for
Management of Information in Health Care Organizations
571 16
on their marketing efforts, the role of Chief as Microsoft’s .NET should eventually yield
Digital Officer also started to emerge. more flexible information technology systems.
These organizational changes can be One of the most significant technologi-
expected to continue as responses to not cal challenges facing HCOs and IDNs today
only more complex clinical and computing occurs because, while much of the health care
environments, but also to the overwhelm- delivered today continues to be within the
ing importance of clinical activities overall four walls of a physician’s office or a hospi-
to the success of any provider organization. tal, new venues such as retail clinics are grow-
The roles and responsibilities of these various ing in number and geographical distribution.
information CxO positions can be expected to Further, as the population ages, patients may
continue to evolve as both the overall orga- seek care from a multitude of sources, includ-
nizational landscape and the capabilities of ing primary and specialty practices, multiple
computer systems evolve as well. hospital visits (and even visits to multiple hos-
pitals) and may increasingly be monitored in
their homes. Health care information technol-
16.4.3 Technological Changes ogies (and clinical systems in particular) have
Affecting Health Care focused historically on what happens within a
Organizations physician’s office or within a hospital, and not
across physicians’ office nor between the physi-
Future changes in technology are hard to pre- cians’ office and the hospital nor in the home
dict. For example, although we have heard of the patient. As technology developments
for over two decades that seamless voice-to- increasingly permit care to be provided outside
text systems are 5 years away from practi- physician offices and hospitals, HCISs will be
cal use, with the introduction of controlled challenged to incorporate if not the data, then
vocabularies in areas such as radiology and at least the access to these new sources of data.
pathology, we are beginning to see commer- In general, EMR products on the market
cial products that can “understand” dictated today started with a single purpose: to auto-
speech and represent it as text that can then mate the workflow of clinicians within a par-
be structured for further analysis. Both the ticular organizational setting. Among other
emergence of increasingly powerful processor features, EMRs focus on making data from
and memory chips, and the decreasing cost of previous encounters or activities easier to
storage media will continue to be a factor in access and assuring that orders for tests and
future health-systems design—although the x-rays have the correct information, or that
tsunami of data coming from imaging modal- the next shift knows what went on previously.
ities and from genomic medicine sequencing Despite visible successes and failures for all
and analysis will continue to be a significant manner of products, EMRs in general can
challenge (see 7 Chaps. 2, 22, and 26). The facilitate the automation of a complex work-
ever expanding availability of Internet access, flow—of automating intra-organizational
the increasing integration of voice, video, and clinical processes as well as those that cross
data, and platforms which permit the inte- organizational boundaries.
gration of these various technologies as well Architectures that focus on what happens
as the availability of ever smaller platforms within organizational boundaries do not eas-
like tablets and smart phones, will challenge ily facilitate access to data across organiza-
HCOs and IDNs—and HCISs--to have com- tional boundaries. This is the challenge of
munications capacity not only within their interoperability. Recognizing that patients
traditional domain but also to an extended often receive care in a variety of organiza-
enterprise that may include patients’ homes, tional settings—hospitals, physicians offices,
schools, and workplaces. The design of mod- rehabilitation facilities, pharmacies, retail
ern software based on the replicability of clinics, their homes, etc.—the challenge is
code, coding standards such as XML, C++, to extend the internal workflow beyond the
C#, Python, and Java and frameworks such boundaries of individual organizations so
572 L. H. Vogel and W. C. Reed
information could lead to inadequate Castaneda, C., Nalley, K., Mannion, C., Bhattacharyya,
patient care, loss of revenue, or inappro- P., Blake, P., Pecora, A., Goy, A., & Suh, K. S.
(2015). Clinical decision support systems for
priate administrative decisions. Identify
improving diagnostic accuracy and achieving preci-
and discuss the challenges and limita- sion medicine. Journal of Clinical Bioinformatics,
tions of two methods for improving data 5(1), 1.
integration. Goldzweig, C. L., Towfigh, A., Maglione, M., &
3. Describe three situations in which lack Shekelle, P. G. (2009). Costs and benefits of health
information technology: New trends from the litera-
of integration of information systems
ture. Health Affairs (Millwood), 28(2), w282–w293.
with clinicians’ workflow can lead to Gottlieb, L. M., Tirozzi, K. J., Manchanda, R., Burns,
inadequate patient care, reduced physi- A. R., & Sandel, M. T. (2015). Moving electronic
cian productivity, or poor patient satis- medical records upstream: Incorporating social
faction with an HCO’s services. Identify determinants of health. American Journal of
Preventive Medicine, 48(2), 215–218.
and discuss the challenges and limita-
Haddad, P., McConchie, S., Schaffer, J. L., &
tions of two methods for improving pro- Wickramasinghe, N. (2018). IS/IT governance in
cess integration. health care: An integrative model. In
4. Describe the trade-off between func- N. Wickramasinghe & J. Schaffer (Eds.), Theories to
tionality and integration. Discuss three inform superior health informatics research and prac-
tice. Healthcare delivery in the information age
strategies currently used by HCOs to
(pp. 37–54). Cham, Switzerland: Springer.
minimize this tradeoff. Himmelstein, D. U., & Woolhandler, S. (2010).
5. Assume that you are the chief informa- Obama’s reform: No cure for what ails us. BMJ,
tion officer of multi-facility HCO. You 340, c1778.
have just been charged with planning a James, B. C., & Savitz, L. A. (2011). How intermountain
trimmed health care costs through robust quality
new clinical HCIS to support a large
improvement efforts. Health Affairs (Millwood),
tertiary care medical center, two smaller 30(6), 1185–1191.
community hospitals, a nursing home, Kastor, J. (2001). Mergers of teaching hospitals in Boston,
and a 40-physician group practice. Each New York, and Northern California. Ann Arbor:
organization currently operates its own University of Michigan Press.
Kaushal, R., Shojania, K. G., & Bates, D. W. (2003).
set of integrated and standalone tech-
Effects of computerized physician order entry and
nologies and applications. What techni- clinical decision support systems on medication
cal and organizational factors must you safety: A systematic review. Archives of Internal
consider? What are the three largest Medicine, 163(12), 1409–1416.
challenges you will face over the next Leatherman, S., Berwick, D., Iles, D., et al. (2003). The
business case for quality: Case studies and an analy-
24 months?
sis. Health Affairs, 22(2), 17–30.
6. How do you think the implementation McCullough, J. S., Casey, M., Moscovice, I., & Prasad,
of clinical HCISs will affect the quality S. (2010). The effect of health information technol-
of relationships between patients and ogy on quality in U.S. Hospitals. Health Affairs
(Millwood), 29(4), 647–654.
16 providers? Discuss at least three
potential positive and three potential
Shortell, S. M., Gillies, R. R., Anderson, D. A., et al.
(2000). Remaking healthcare in America: The evolu-
negative effects. What steps would you tion of organized delivery (2nd ed.). San Francisco:
take to maximize the positive value of Jossey-Bass.
these systems? Vogel, L. H. (2003). Finding value from information
technology investments: Exploring the elusive ROI
in healhcare. Journal for Health Information
Management, 17(4), 20–28.
References Vogel, L. H. (2006, September 1) Everyone gets to play.
CIO.
Bates, D. W., & Gawande, A. A. (2003). Improving Weil, T. P. (2001). Health networks: Can they be the solu-
safety with information technology. The New tion? Ann Arbor: University of Michigan Press.
England Journal of Medicine, 348(25), 2526–2534. Weill, P., & Ross, J. W. (2004). IT governance: How top
Blackstone, E. A., & Fuhr, J. P., Jr. (2003). Failed hospi- performers manage IT decision rights for superior per-
tal mergers. Journal of Health Law, 36(2), 301–324. formance. Boston: Harvard Business School Press.
575 17
Patient-Centered Care
Systems
Suzanne Bakken, Patricia C. Dykes, Sarah Collins Rossetti,
and Judy G. Ozbolt
Contents
References – 606
Is care
No previously Yes Evaluating
given to be care given
evaluated?
Analyzing
Formulating objectives causes of
successes
and failures
Establishing priorities for objectives
No Is nursing Yes
terminated?
Exit
5. Occupational therapist: assess abilities and to go out. The nurse reports this to the physi-
limitations for performing activities of cian and the other clinicians so that they can
17 daily living; prescribe exercises to improve understand why the patient is not carrying
strength and flexibility of hands and arms; out the prescribed regime. The physician then
teach adaptive techniques and provide changes the strategy for treating hypertension
assistive devices as needed while initiating treatment for urinary inconti-
nence. The nurse helps the patient to under-
In a collaborative, interdisciplinary, patient- stand the interaction of the various treatment
centered practice, the nurse discovers that the regimes, provides practical advice and assis-
patient is not taking walks each day as pre- tance in dealing with incontinence, and helps
scribed because her urinary incontinence is the patient to find personally acceptable ways
exacerbated by the diuretic prescribed to treat to follow the prescribed treatments. The nutri-
hypertension, and the patient is embarrassed tionist works with the patient on the timing
Patient-Centered Care Systems
579 17
of meals and fluid intake so that the patient Care Organizations seek to integrate provid-
can exercise and sleep with less risk of urinary ers and services to generate value for defined
incontinence. The physical and occupational populations. Substantial federal investment in
therapists adjust their recommendations to health information technology (HIT) through
accommodate the patient’s personal needs the HITECH and 21st Century Cures Acts
and preferences while moving toward the ther- dramatically increased the adoption and use
apeutic goals. Finally, the patient, rather than of EHRs (Adler-Milstein & Jha 2017). While
being assailed with the sometimes conflicting HIT spending increased markedly, care coor-
demands of multiple clinicians, is supported dination technologies have not been a focus.
by an ensemble of services that meets shared HIT tools dedicated to coordination pro-
therapeutic goals in ways consistent with her cesses could improve care of complex patients
preferences and values. through increased access to data, facilitated
This kind of patient-centered collabora- communication, timely shared decisions,
tion requires exquisite communication and and greater engagement of patients and their
feedback. The potential for information sys- families as partners in their care plans. Well-
tems to support or sabotage patient-centered designed information systems with patient
care is obvious. facing-technologies (e.g., personal health
records and patient portals) enable care coor-
dination as they ensure that patients and
17.1.3 Coordination providers have immediate access to accurate
of Patient-Centered Care health information at home and across care
settings (Ahern et al. 2011; Collins, Bavuso
When patients receive services from mul- et al. 2017; Collins, Klinkenberg-Ramirez
tiple clinicians, patient-centeredness requires et al. 2017; Collins, Rozemblum et al. 2017).
coordinating those services. Coordination
includes seeing that patients receive all the
services they need in logical sequence with- 17.1.4 Patient-Centered Care
out scheduling conflicts and ensuring that Across Multiple Patients
each clinician communicates as needed with
the others. Sometimes, a case manager or care Delivering and managing interdisciplinary
coordinator is designated to do this coordi- patient-centered care for an individual is
nation. In other situations, a physician or a challenging enough, but patient care has yet
nurse assumes the role by default. Sometimes, another level of complexity. Each clinician is
coordination is left to chance, and both the responsible for the care of multiple patients. In
processes and the outcomes of care are put planning and executing the work of patient-
at risk. In recognition of this, the Institute of centered care, each professional must consider
Medicine designated coordination of care as the competing demands of all the patients for
1 of 14 priorities for national action to trans- whom she is responsible, as well as the exigen-
form health care quality (Adams & Corrigan cies of all the other professionals involved in
2003). The Health Information Technology for each patient’s care. Thus, the nurse on a post-
Economic and Clinical Health Act (HITECH operative unit must plan for scheduled treat-
Programs 20091) calls for patients to have a ments for each of her patients to occur near
medical home, a primary care practice that the optimal time for that patient. She must
will maintain a comprehensive problem list take into account that several patients may
to make fully informed decisions in coordi- require treatments at nearly the same time and
nating their care. In addition, Accountable that some of them may be receiving other ser-
vices, such as imaging or physician’s visits, at
the time when it might be most convenient for
1 7 http://healthit.hhs.gov/portal/server.pt/commu- the nurse to administer the treatment. When
nity/healthit_hhs_gov__hitech_programs/1487 unexpected needs arise, as they often do—an
(Accessed: 4/26/13).
580 S. Bakken et al.
.. Table 17.1 Framework for design characteristics of a patient-care information system with examples of
patient-specific data, agency-specific data, and domain information and knowledge for patient care
Source: Framework adapted with permission from Next Generation Nursing Information Systems, 1993,
American Nurses Association, Washington, DC. Reused with permission.
US E
W ERS OP
Po orld h SC
l
Res icym ealt
Law earc ake h offi
DATA/
W IDE
ma her rs cia INFORMATION RLD
ker s ls WO TA
s General health D A
status and
health-related needs
Po of individual nations
La licy
Res wma make
Ins earc kers rs ABSTRACTED, IDE
ure he SUMMARIZED, AGGREGATED NW
rs rs TIO
NA TA
Trends in incidence, prevalence, D A
outcomes, and costs by region, by
An diagnosis, by type of agency
Re alys
Qu searc ts /
Pu ality hers ABSTRACTED, ITY
blic m UN IDE
he anag
SUMMARIZED, AGGREGATED MM -W
alth em CO GION
offi ent Comparisons of treatments, outcomes, RE TA
cia and costs by locality and by agency. DA
ls
Ad Incidence and prevalence of diagnosis by region.
Re min
Ac searc istrat E
c
Qu red hers ors WID
alit itor
ABSTRACTED, SUMMARIZED, AGGREGATED CY-
ym s EN
ana AG TA
ge Costs of care by category of patient. DA
rs Number of patients admitted with specific diagnosis.
Ca Volume of tests, procedures, and interventions.
Ag regi Outcomes for patients grouped by diagnosis.
Ins ency vers L
Q.A urer dep ABSTRACTED, SUMMARIZED, AGGREGATED UA
. pe s art IVID
rso
nn
me
nts IND TIENT
el “Atomic-level’’ patient-specific data: e.g., assessments, diagnoses, interventions, PA TA
diagnostic test results, procedures, treatments, hours of care, outcomes. DA
Used to provide most appropriate care.
.. Fig. 17.2 Examples of uses for atomic-level patient © 1993 American Nurses Publishing, American Nurses
data collected once but used many times (Source: Foundation/American Nurses Association, Washing-
Reprinted with permission from Zielstorff et al. (1993). ton, DC. Reproduced with permission.)
access to knowledge resources at the point of policies and procedures), and domain infor-
care. The Infobutton, developed at NewYork- mation (guidelines), such systems can greatly
Presbyterian Hospital, is an HL7 standard aid the coordination of interdisciplinary ser-
for context-aware knowledge retrieval (Del vices for individual patients and the planning
Fiol et al. 2012). Incorporated into EHRs, and scheduling of each caregiver’s work activ-
Infobuttons, along with associated man- ities. Patient acuity is taken into account in
agement tools, can integrate data about the scheduling nursing personnel, but historically
patient and the clinical context to provide has most often been entered into a separate
immediate, point- of-
care access to relevant system rather than derived directly from care
knowledge resources (Cimino et al. 2013). requirements as recorded in the EHR. Fully
To support patient-centered care, informa- integrated, patient-centered systems—still an
tion systems must be geared to the needs of ideal today—would enhance our understand-
all the clinicians involved in care. The systems ing of each patient’s situation, needs, and
17 should acquire, store, process, and present values, improve decision-making, facilitate
each type of information (patient-, agency-, communications, aid coordination, and use
and domain-specific) where, when, and how clinical data to provide feedback for improv-
the information is needed by each clinician in ing clinical processes.
the context of his or her professional domain. Clearly, when other clinical information
Systems designed for patient-centered care systems designed to support patient-centered
have the potential to go beyond supporting care fulfill their potential, they will not merely
the collaborative, interdisciplinary care of replace oral and paper-based methods of
individual patients. Through appropriate use recording and communicating. They will be
of patient-specific information (care require- an integral and essential part of the transfor-
ments), agency-specific information (clini- mation of health care to apply evidence-based
cians and their responsibilities and agency interventions in accordance with patient needs
Patient-Centered Care Systems
583 17
and values. How far have we come toward the safety, and implementation of safe practices
ideal? What must we do to continue our prog- and systems within health care organizations.
ress? The follow-on report, Crossing the Quality
Chasm: A New Health System for the 21st
Century (Committee on Quality of Health
17.2 he Emergence of Patient-
T Care in America 2001), addressed the need
Centered Care Systems for fundamental change in the health care
delivery system. Shortcomings included inad-
Events in the first decade of the Twenty-first equate focus on quality and clinical infrastruc-
Century planted the seeds of transformative tures not sufficiently developed to provide the
change in patient care and clinical informat- full range of services needed by persons with
ics. Over 10 years, the shared ideal of health chronic conditions. Significantly, the report
care began to move from the Twentieth placed the blame not on individual health care
Century “doctor knows best” model toward a professionals, but on inadequate and broken
new vision of health care based on interdisci- systems of care.
plinary teams drawing on a variety of knowl- Crossing the Quality Chasm outlined a call
edge and information resources to collaborate for action by government, payers, providers,
with one another and with patients and fami- and the public to embrace a statement of pur-
lies to resolve or alleviate health problems and pose for the health care system as a whole—
to achieve health goals. Recursive and itera- to reduce illness and improve health and
tive developments grew from reports of the functioning—and to adopt a shared agenda
National Research Council and the Institute to achieve health care that would be safe,
of Medicine (now known as the National effective, patient-centered, timely, efficient,
Academy of Medicine); from government pol- and equitable. The report recommended the
icies and initiatives; from changes in organiza- redesign of care processes to achieve conti-
tional and financial structures for health care nuity in care relationships; customization in
delivery; and from advances in the informatics accordance with patient needs and values; the
methods and technologies that have become sharing of knowledge, information, and deci-
integral to the provision, management, reim- sion-making with patients; evidence- based
bursement, and improvement of health care. decision-making; safety as a system prop-
Much remains to be done to nurture continu- erty; transparency of information to facilitate
ing development, but with care and patience informed decision-making by patients and
we can begin to harvest the benefits of better families; anticipation of patient needs; con-
health care and better health for individuals tinuous decrease in waste; and cooperation
and populations. among clinicians. The report gave consid-
erable attention to informatics as an essen-
tial methodology to achieve these aims and
17.2.1 Publications of the National called for a renewed national commitment to
Academy of Sciences a national health information infrastructure,
with the elimination of most hand-written
With its seminal publication, To Err is Human: clinical information by 2010.
Building a Safer Health System (Kohn et al. In Patient Safety: Achieving a New Standard
2000), the Institute of Medicine startled the for Care (Committee on Data Standards for
world by estimating that clinical errors were Patient Safety 2004), the Committee high-
killing up to 98,000 hospitalized Americans lighted the fact that a national health infor-
each year. The report called for a national mation infrastructure—a foundation of
focus to advance knowledge about safety, systems, technology, applications, standards,
reporting efforts to identify and learn from and policies—is required for error preven-
errors, higher standards and expectations for tion and capture of data that facilitate local
584 S. Bakken et al.
and global learning from adverse events, near Meeting Series (Tcheng et al. 2017). Several
misses, and hazards. The report emphasized priorities for collaborative action from the
the need for data interchange standards as an last are of particular relevance to enable scal-
essential building block. able patient-centered care systems: create a
The National Academies followed these national Clinical Decision Support (CDS)
three reports with a number of others that repository, develop tools to assess CDS effi-
explored in greater depth aspects of the prob- cacy, publish performance evaluations, pro-
lems and recommendations described within mote financing and measurement to accelerate
them and made further recommendations CDS adoption, develop a multi-stakeholder
for public and private actions to improve CDS learning community to inform usability,
health care and its costs and outcomes. The and establish an investment program for CDS
National Research Council (Stead & Lin research.
2009) published Computational Technology
for Effective Health Care: Immediate Steps
and Strategic Directions. This report noted
that many health information technologies in 17.2.2 Federal Government
the current marketplace lacked the function- Initiatives
ality to achieve the goals of improving health
care. The central finding was that computer The Health Information Technology for
scientists, experts in health and biomedical Economic and Clinical Health (HITECH)
informatics, and clinicians would need to col- Act provided an unprecedented federal
laborate to create technologies that would pro- investment in HIT through a series of initia-
vide cognitive support to clinicians, patients, tives aimed at ensuring that all Americans
and family members as they sought to under- benefit from EHR-supported patient-cen-
stand, resolve, or alleviate health challenges. tered care. Administered by the Office of the
The report recommended that federal and National Coordinator for Health Information
state governments and clinicians join forces to Technology, the activities are designed:
require vendors to provide systems that offer 55 To support the health care workforce
such “meaningful” support. through Regional Extension Centers for
In its Learning Health System series, the technical assistance for implementation of
National Academy of Medicine’s Leadership EHRs and training initiatives to ensure
Consortium for a Value & Science-Driven meaningful use of EHRs
Health System synthesized the insights of a 55 To enable coordination and alignment
broad array of experts and explored the need within and among states (State Health
for transformational change in the funda- Information Exchange Cooperative
mental elements of health and health care. Agreement Program)
Multiple volumes reflect the fundamental role 55 To establish connectivity to the public
of informatics in a learning health system: health community in case of emergencies
Clinical Data as the Basic Staple of Health (Beacon Community Program)
17 Learning: Creating and Protecting a Public
Good (Institute of Medicine 2010), Digital In addition, two federal rules support mean-
Infrastructure for the Learning Health System: ingful use of EHRs. The Incentive Programs
The Foundation for Continuous Improvement for EHRs rule from the Centers for Medicare
in Health and Health Care – Workshop Series & Medicaid Services (CMS) defines mini-
Summary (Institute of Medicine 2011), mum requirements that hospitals and eligible
Digital Data Improvement Priorities for professionals must meet through their use
Continuous Learning in Health and Health of certified technology to qualify for incen-
Care – Workshop Summary (Institute of tive payments. Criteria related to providing
Medicine 2013), and Optimizing Strategies patients with an electronic copy of their own
for Clinical Decision Support: Summary of health information and ability to electronically
Patient-Centered Care Systems
585 17
exchange key clinical information are particu- 17.2.3 Financial and Organizational
larly important to patient-centered care. The Structures in Health Care
complementary Standards and Certification
Criteria for Electronic Health Records rule The historical evolution of information sys-
defines the criteria for certification of the tech- tems that support patient care, and even-
nology. Also relevant to patient-centered care tually patient-centered care, is not solely a
are NHIN Direct and NHIN CONNECT, reflection of the available technologies (e.g.
which support health information exchange Web 2.0, cloud computing). Societal forces—
to enable patient-centered care. including delivery-system structure, practice
The Agency for Healthcare Research model, payer model, and quality focus—have
and Quality (AHRQ) has also invested in influenced the design and implementation of
advancing patient-centered care through patient-care systems (. Table 17.2).
investments in HIT. A particular focus is
promoting systems engineering approaches zz Delivery-System Structure
to improve patient safety through evaluation Authors have noted the significant influence
of clinical processes resulting in improved of the organization and its people on the
work and information flows. This is reflected success or failure of informatics innovations
in the AHRQ Patient Safety Learning Labs (Massaro 1993; Campbell et al. 2006; Ash
grant portfolio. For example, the Brigham et al. 2007). Others have documented unin-
and Women’s Hospital project, Making tended consequences of implementation of
Acute Care More Patient-centered, focused HIT and called for applications of models
on developing tools and care processes to of processes, such as Iterative Sociotechnical
engage patient, family, and professional Analysis, that take into account health care
care team members in reliable identification, organizations’ workflows, social interactions,
assessment, and reduction of patient safety culture, etc. to further elucidate the relation-
threats in real time before they manifest into ship between organizations and technology
actual harm. Widespread adoption of EHR (Harrison et al. 2007; Koppel et al. 2005;
systems and web-based technologies makes it Ozbolt et al. 2012). As delivery systems shifted
possible to reuse EHR data to create visual from the predominant single-institution struc-
displays at the bedside to provide decision ture of the 1970s to the integrated delivery
support to the care team, patients, and fam- networks of the 1990s to the complex link-
ily. . Figure 17.3 is an example of a bedside ages of the Twenty-First Century, the infor-
screensaver display used at Brigham and mation needs changed, and the challenges of
Women’s Hospital. The icons are driven by meeting those information needs increased
patient-specific information in the EHR. The in complexity. The patient centered medi-
display provides information to patients cal home (PCMH)2 (also known as primary
about their safety plan and provides decision care medical home, advanced primary care,
support at the bedside related to patient-spe- and health care home) is a model of primary
cific needs to professional and paraprofes- care that delivers care designed to be patient-
sional care team members. centered, comprehensive, coordinated, acces-
Given these major investments in pro- sible, and continuously improved through a
moting EHR adoption and use for patient- systems-based approach to quality and safety
centered care and research, the vision of every (Patient Centered Medical Home Resource
American reaping the benefits of EHRs is Center 2011).3 AHRQ and others (Bates and
moving closer to reality. However, this will
continue to be heavily influenced by associ-
ated changes in health care financial and orga-
2 7 http://www.ncqa.org/ (Accessed: 7/1/19).
nizational structures.
3 7 https://pcmh.ahrq.gov/ (Accessed 7.8.19).
586 S. Bakken et al.
.. Fig. 17.3 A personalized screen saver used at team. (© Brigham and Women’s Hospital, Center for
Brigham and Women’s Hospital to display patient- Patient Safety, Research and Practice. Reused with per-
specific safety information to patients, family, and care mission.)
Bitton 2010) have noted the seminal role of zz Professional Practice Models
HIT (e.g., health information exchange, dis- Professional practice models have also
ease registries, alerts and reminders) to sup- evolved for nurses and physicians. In the
port tasks related to National Committee 1970s, team nursing was the typical practice
on Quality Assurance PCMH standards for model for the hospital, and the nursing care
enhancing access and continuity, identifying plan—a document for communicating the
and managing patient populations, planning plan of care among nursing team members—
and managing care, providing self-care and was most frequently the initial computer-
community support, tracking and coordinat- based application designed for use by nurses.
ing care, and measuring and improving per- The 1990s were characterized by a shift to
formance (Patient Centered Medical Home interdisciplinary-care approaches necessi-
2011). Accountable Care Organizations tating computer-based applications such as
(ACOs) focus on care of designated popula- critical paths to support case management of
tions. A recent systematic review (Kaufman aggregates of patients, usually with a com-
et al. 2019) reveals consistent associations mon medical diagnosis, across the contin-
between ACO implementation and outcomes uum of care. The Twenty-First Century sees
17 across payer types and reduced inpatient use, advanced practice nurses increasingly taking
reduced emergency department visits, and on functions previously provided by physi-
improved measures of preventive care and cians while maintaining a nursing perspective
chronic disease management. See 7 Chaps. on collaborative, interdisciplinary care. This
11, 15, and 18 for discussions of managing trend is likely to accelerate given the recom-
clinical information in consumer- provider mendations for facilitating full scope of prac-
partnerships in care, in the public health tice for nurses and advanced practice nurses
information infrastructure, and in integrated (e.g., certified nurse midwives, nurse prac-
delivery systems. titioners) in the 2010 Institute of Medicine
Patient-Centered Care Systems
587 17
.. Table 17.2 Societal forces that have influenced the design and implementation of patient-centered
systems
report on The Future of Nursing: Leading from other health professions, in particular
Change, Advancing Health (Committee on the nursing, due to the persistence of siloed EHR
Robert Wood Johnson Foundation Initiative terminology management (Collins, Bavuso
on the Future of Nursing at the Institute of et al. 2017; Collins, Klinkenberg-Ramirez
Medicine 2010). These changes broaden and et al. 2017; Collins, Rozemblum et al. 2017).
diversify the demands for decision support, In the 2010s the use of shared clinical
feedback about clinical effectiveness, and dashboards accelerated the operationaliza-
quality improvement as a team effort. tion of known patient-centered best practices,
such as interdisciplinary clinical rounding and
Physician practice models have shifted from safety checklists, to drive increased situational
single physician or small group offices to com- awareness and promote common ground in
plex constellations of provider organizations. relation to patient goals and preferences and
The structure of the model (e.g., staff model prevention of harm (Collins, Gazarian et al.
health-maintenance organization (HMO), 2014; Mlaver et al. 2017). These shared, inter-
captive-group model health- maintenance disciplinary, team-based dashboards are typi-
organization, or independent- practice asso- cally configured to a particular practice and
ciation) determines the types of relationships specialty area (e.g., cardiac ICU, emergency
among the physicians and the organizations. department) or a clinical quality improvement
These include issues— such as location of strategic initiative (e.g., optimizing length of
medical records, control of practice patterns stay, promoting effective and early discharge
of the physicians, and data-reporting require- planning) (Collins, Hurley et al. 2014). In the
ments—that have significant implications for ambulatory setting shared dashboards are
the design and implementation of patient-care also used and configured to drive population
17 systems. In addition, the interdisciplinary and health initiatives.
distributed care approaches of the 1990s and
the 2000s have given impetus to system-design zz Payer Models
strategies, such as the creation of a single Changes in payer models have been a sig-
patient problem list, around which the patient- nificant driving force for information-system
care record is organized, in place of a separate implementation in many organizations. With
list for each provider group (e.g., nurses, phy- the shift from fee for service to prospective
sicians, respiratory therapists). While a shared payment in the 1980s, and then toward capi-
single patient problem list remains a goal of tation in the 1990s, information about costs
patient- centered care, few implementations and quality of care has become an essential
have achieved successful integration of the commodity for rational decision-making in
medical problem list with the problem list the increasingly competitive health care mar-
Patient-Centered Care Systems
589 17
ketplace. Because private, third-party payers of care guides clinical decisions in real time,
often adopt federal standards for reporting and “dashboards” are used to display indica-
and regulation, health care providers and tors related to different dimensions of quality.
institutions have struggled in the early 2000s
to keep up with the movement toward data Data science approaches using clinical
and information system standards acceler- data with a quality focus greatly expanded in
ated by the Health Insurance Portability the 2010s (Bates et al. 2014; Hruby et al. 2016).
and Accountability Act (HIPAA)4 and the While much of this work has focused only on
initiatives to develop a National Health the subset of data within the EHR that are
Information Network. With the advent of consistently structured and coded, such as
pay for performance (P4P), CMS has elimi- medical diagnoses, allergies, and medication
nated reimbursement for preventable condi- orders, there are increasing efforts to process
tions (e.g., catheter-associated urinary tract and derive value from additional data sources,
infections) that occur during hospitalizations such as nursing assessment data and narrative
(“CMS P4P,”). In this decade, there is no notes (Klann et al. 2018) (see 7 Chap. 8). The
doubt that the implementation of the highly use of high volume clinical metadata, or data
controversial Affordable Care Act of 2010 about the clinical data, also provides value for
and evolving ACOs will profoundly impact quality purposes, such as developing health-
patient-centered care and the information sys- care process models of care delivery processes,
tems needed to support it. for a richer understanding of patient centered
workflows and evaluation of best practices
zz Quality Focus and gaps in care than previously feasible
Demands for information about quality of (Hripcsak & Albers 2013; Collins & Vawdrey
care have also influenced the design and 2012; Collins et al. 2013).
implementation of patient-care systems. Of note, while advances in natural language
The quality-assurance techniques of the processing continue, many of the comput-
1970s were primarily based on retrospective able processes outlined above, such as practice
chart audit. In the 1980s, continuous quality guidelines, alerts, and reminders, still require
improvement techniques became the modus structured data capture, which typically requires
operandi of most health care organizations. manual data entry by clinicians (Collins,
The quality-management techniques of the Couture et al. 2018). Evaluation of the value
1990s were much more focused on concur- and downstream uses of manually captured
rently influencing the care delivered than structured data is an important consideration
on retrospectively evaluating its quality. In when configuring clinical systems to decrease
the Twenty-first Century, patient-centered burden on the clinician and promote ‘top of
systems-based approaches—such as practice license’ practice. Data science approaches for
guidelines, alerts, and reminders tailored on continuous knowledge development are seen
patient clinical data and, in some instances, as an essential aspect of a Learning Health
genomic data (i.e., personalized medicine)— System. Innovations related to clinical data
are an essential component of quality man- capture, such as increased capabilities and
agement. In addition, institutions must have implementations of device-integrated data and
the capacity to capture data for benchmark- voice-recognition tools, are expected to increase
ing purposes and to report process and out- the volume, veracity, variety, and velocity of
comes data to regulatory and accreditation clinical data for data science and quality pur-
bodies, as well as to any voluntary reporting poses. In turn, automated data capture promises
programs to which they belong. Increasingly, to decrease clinician data entry burden and pro-
concurrent feedback about the effectiveness vide greater opportunity for clinical data con-
sumption, interpretation, and decision-making
tailored to patient preferences by the care team
4 7 http://hhs.gov/ocr/privacy/ (Accessed: 4/26/13). as part of a patient-centered system.
590 S. Bakken et al.
are responding by developing such systems ing the clinical decision making and docu-
themselves and by purchasing the rights to mentation process (Senathirajah, Bakken, &
patient care systems developed in academic Kaufman 2014; Senathirajah, Kaufman, &
medical centers that have demonstrated Bakken 2014).
reductions in errors and gains in quality of If patient-centered care systems are to
care and cost control. Closed loop medica- be effective in supporting better care, health
tion systems use technologies such as bar care professionals must possess the infor-
codes and decision support to guard against matics competencies to use the systems.
errors throughout the process of prescribing, Consequently, many are integrating informat-
dispensing, administering, and recording and ics competencies into health science education
have been identified as a key intervention to (See 7 Chap. 25). For example, the Quality
improve medication safety. In a before-and- and Safety Education for Nurses initiative has
after evaluation of the closed loop electronic named and described necessary competencies
medication administration system at Brigham and associated curriculum to support patient-
and Women’s Hospital, investigators found a centered care, including competencies related
significant reduction in the rates of transcrip- to quality, safety, team work, and collabora-
tion errors, medication errors, and potential tion (Cronenwett et al. 2009). Recently, infor-
adverse events (Poon et al. 2010). In other matics competencies for nursing leaders were
contexts, decision support systems offer clini- validated (Yen et al. 2017).
cal practice guidelines, protocols, and order To what degree do patient-care disciplines
sets as a starting point for planning indi- need to prepare their practitioners for roles
vidualized patient care; providing alerts and as informatics specialists? To the degree that
reminders; using knowledge bases and patient members of the discipline use information in
data bases to assess orders for potential con- ways unique to the discipline, the field needs
traindications; and offering point-and-click members prepared to translate the needs of cli-
access to knowledge summaries and full-text nicians to those who develop, implement, and
publications. See 7 Chap. 26 for more infor- make decisions about information systems. If
mation about these systems. the information needs are different from those
Many health care organizations have sub- of other disciplines, some practitioners should
stantial investment in legacy systems and be prepared as system developers.
cannot simply switch to more modern tech- The mere existence of information systems
nology. Finding ways to phase the transition does not improve the quality of patient care.
from older systems to newer and more func- The adoption and use of advanced features
tional ones is a major challenge to health (such as CDS) that are sensitive to both work-
informatics. To make the transition from a flow and human factors are needed to improve
patchwork of systems with self-contained the quality of care (Stead & Lin 2009; Zhou
functions to truly integrated systems with et al. 2009). Recent safety reports, public pol-
the capacity to meet emerging information icy, and reimbursement incentives raise aware-
needs is even more challenging (see 7 Chap. ness of the need for patient-centered care
17 16). Approaches to making this transition systems. Because traditional requirements for
are described in the Journal of the American EHRs were provider- centric, existing infor-
Medical Informatics Association (Stead et al. mation systems rarely provide the compre-
1996). More recently, some institutions have hensive suite of advanced features needed to
applied Web 2.0 approaches to create con- support patient-centered care. However, the
figurable user interfaces to legacy systems. ability of systems to support patient-centered
For example, MedWISE integrates a set of care is essential for achieving the vision of
features that supports custom displays, plot- health care reform. What are the requirements
ting of selected clinical data, visualization of for patient-centric information systems? How
temporal trends, and self-updating templates do these requirements drive the design of sys-
as mechanisms for facilitating cognition dur- tems that will support patient-centered care?
Patient-Centered Care Systems
593 17
17.3 Designing Systems 55 Developing innovative computer-based
for Patient-Centered Care systems, using these models, that deliver
information or knowledge to health care
In the second decade of the Twenty-first providers
Century, informaticians and clinicians increas- 55 Installing such systems and then making
ingly share a vision for systems that support them work reliably in functioning health
patient-centered care practices such as inter- care environments
disciplinary care planning, care coordination, 55 Studying the effects of these systems on
quality reporting, and patient engagement. the reasoning and behavior of health care
This evolution is fueled in part by meaningful providers, as well as on the organization
use requirements that aim to engage patients and delivery of health care
and families in their health care and to improve
care coordination and the overall quality of While the Friedman typology continues to
care provided. Traditional EHR functional- be useful more than 20 years after its incep-
ity must be expanded to support new features, tion, we propose extending the second and
functions, and care practices including seam- third categories in 7 Sects. 17.4.2 and 17.4.3
less communication, interdisciplinary collabo- to expand the focus from clinical informatics
ration, and patient access to information. To as a provider-centric discipline to a discipline
optimize human and organizational factors that enables and supports patient-centric care.
and the integration of systems and workflow, Following are examples of recent research
these features must be built into information with implications for patient-centered care.
systems as core requirements, rather than as an
afterthought.
The Principles to Guide Successful Use of 17.4.1 Formulation of Models
Health Care Information Technology described
by the National Research Council (Stead & For several decades, standards development
Lin 2009) provide a comprehensive frame- organizations (SDOs) and professional groups
work for defining a set of core requirements alike have focused on the formulation of mod-
that will support the design of systems for els that describe the patient care process and
patient-centered care. This framework defines the formal structures that support manage-
nine principles related to both evolutionary ment and documentation of patient care. The
(i.e., iterative, long-term improvements) and efforts of SDOs are summarized in 7 Chap. 8.
radical (i.e., revolutionary, new-age improve- Early SDO efforts focused primarily on rep-
ments) changes occurring in the United States’ resenting health care concepts such as pro-
health care system. The principles and associ- fessional diagnoses (e.g., medical diagnoses,
ated system design prerequisites are included nursing diagnoses) and actions (e.g., proce-
in . Table 17.3. dures, education, referrals). These efforts were
complemented by professional efforts such as
those of the Nursing Terminology Summit
(Ozbolt 2000). As a result of multi-national
17.4 Current Research Toward efforts, SNOMED CT became an interna-
Patient-Centered Care tional standard that provides a formal model
Systems for concepts that describe clinical conditions
and the actions of the multidisciplinary health
Friedman (1995) proposed a typology of the care team (International Health Terminology
science in medical informatics. His four cat- Standards Development Organization 2011).
egories build from fundamental conceptual- In addition, SNOMED CT subsets have been
ization to evaluation as follows: developed for specific domains such nurs-
55 Formulating models for acquisition, repre- ing problems (Matney et al. 2012). Toward
sentation, processing, display, or transmis- the goal of patient-centered care, attention
sion of biomedical information or knowledge has also been paid to approaches for formal
594 S. Bakken et al.
.. Table 17.3 Principles to guide successful use of health care information technology
Evolutionary 1. Focus on improvements in Gaps in patient-centered care are clearly defined and
change care—technology is secondary operationalized. Health care IT is employed to enable
the process changes needed to close gaps in
patient-centered care.
2. Seek incremental gain from An organization’s portfolio of health care IT projects
incremental effort has varying degrees of investment. Each project is
linked to measurable process changes to provide
ongoing visible success with closing gaps in
patient-centered care.
3. Record available data so they Health care IT systems support auto capture of data
can be used for care, process about people, processes, and outcomes at the point of
improvement, and research care. Data are used in the short term to support
incremental improvements in patient-centered care
processes. An expandable data collection infrastructure
is employed that is responsive to future needs that
cannot be anticipated today.
4. Design for human and Clear consideration is given to sociological,
organizational factors psychological, emotional, cultural, legal, economic, and
organizational factors that serve as barriers and
incentives to providing patient-centered care. Health
care IT should eliminate the barriers and enable the
incentives, making it easy to provide patient-centered
care.
5. Support the cognitive Health care IT systems include advanced clinical
functions of all caregivers, decision support for high-level decision-making that is
including health professionals, sensitive to both workflow and human factors.
patients, and their families
Radical 6. Architect information and Health care IT systems are designed using standard
change workflow systems to interconnection protocols that support the
accommodate disruptive patient-centered care processes and roles of today while
change accommodating rapidly changing requirements dictated
by new knowledge, care venues, policy, and increasing
patient engagement.
7. Archive data for subsequent Health care IT systems support archival of raw data to
re-interpretation enable ongoing review and analysis in the context of
advances in biomedical science and patient-centered
care practices.
systems relate directly to several of the daily schedules, and direct care team com-
Principles to Guide Successful Use of Health munication. Some inpatient portals support
Care Information Technology described by patient-generated content such as notes,
the National Research Council (Stead & Lin patient-provider messaging, and patient
2009) and outlined in 7 Sect. 17.3; specifi- feedback related to their care plan or dis-
cally principles 5, 8 and 9 (see . Table 17.3). charge plan. Research is needed on the use
Partners Health Care System in Boston, of acute care portals to explore their impact
MA, developed Patient Gateway, a secure on patients’ abilities to successfully navigate
patient portal serving over 65,000 patient information- rich acute care hospitalizations
users from primary and specialty care prac- and to examine the effects of portal use on
tices affiliated with the Dana Farber Cancer patient activation, engagement, and the over-
Institute, Brigham and Women’s Hospital, all quality of care (Grossman et al. 2018).
and Massachusetts General Hospital. Patient The involvement of users has been identi-
Gateway is a tethered personal health record fied as fundamental to well-designed systems
(see 7 Chap. 11) that provides functionality that are usable and useful in the context of
in line with the five levels described by Krist busy patient care workflows (Rahimi et al.
and Woolf. For example, tools for manage- 2009). Some examples of development activi-
ment of chronic illness are used by patients ties where user involvement is needed are con-
and providers to promote adherence with tent standardization, workflow modeling, and
evidence-based health maintenance guidelines usability testing.
and to improve collaboration on diabetes self- 55 Content standardization: Content stan-
management plans (Grant et al. 2006; Wald dardization includes identifying EHR con-
et al. 2009). Research on patient response and tent needed to support documentation of
satisfaction with the Patient Gateway suggests care provided and identification of data
that patients appreciate the ability to com- needed for reuse (e.g., decision support,
municate electronically with providers, they quality reporting, and research). Content
welcome greater access to their health infor- that is shared across disciplines and
mation including test results, and they believe patients is identified. Content is modeled
that Patient Gateway enables them to better using standards to ensure data reuse and
prepare for visits (Grant et al. 2006; Schnipper interoperability (Principle 3, . Table 17.3)
et al. 2008; Wald et al. 2009). Evaluations (Chen et al. 2008; Dykes et al. 2010; Kim
of patient satisfaction with personal health et al. 2011).
records with similar levels of functional- 55 Workflow modeling: Sound modeling of
ity at other sites, including Geisinger Health the clinical workflow that underlies an
System (Hassol et al. 2004), Group Health electronic system is essential to designing
Cooperative (Ralston et al. 2007), and Virginia systems that are usable by care team mem-
Commonwealth University (Krist et al. 2010), bers (Peute et al. 2009). Workflow models
are consistent with the results reported at are based on observations of current state
Partners Health Care System. clinical workflows including interactions
17 While traditionally patient portals have with patients, staff, equipment, and sup-
been associated with ambulatory care, some plies. Understanding of workflow interac-
health care systems are providing modules tions, including current state inefficiencies,
within their patient portals to inform and informs effective and efficient future state
activate patients and family during an acute workflows, use-case development, and sys-
hospitalization and associated transitions tem prototypes (Rausch & Jackson 2007;
(Grossman et al. 2018). Early research indi- Mlaver et al. 2017). Workflow modeling of
cates that in addition to the features com- patient-centered systems includes clear
monly found in ambulatory patient portals evaluation of ways to use technology to
(e.g., medications, labs, educational content, identify and eliminate ineffective work
scheduling features), the acute care modules processes (Principle 8, . Table 17.3).
provide patient access to the plan of care, Design of new systems is an opportunity
Patient-Centered Care Systems
597 17
to provide ubiquitous access to informa- plan dashboard was developed that captures
tion and communication by all care team disparate data from the EHR and presents
members including patients. Applying a personalized display as the screensaver on
these principles in workflow modeling the bedside computer workstation. The dash-
assures that future state workflows are not board aligns all care team members, including
constrained by existing care venues or pro- patients and families, in the safety plan. The
vider-centric practice patterns. screensaver content includes icons that pro-
55 Usability testing: A key lesson learned vide actionable alerts related to patient-spe-
from Computerized Physician Order Entry cific safety concerns. These bedside displays
implementations is that electronic systems combine data from many sources to support
with poor usability interfere with clinical the integrated care of physicians, nurses, and
workflow. The unintended consequences family members.
of poorly designed systems are well At Partners Health Care System, system
known, and some widely disseminated developers are working with clinical teams
papers (Ash et al. 2004, 2009; Koppel et al. to identify system requirements, to iteratively
2005) have called into question the safety develop, and to test patient-centric systems
of using such systems with patients. that integrate decision support into the clini-
Examples of common usability problems cal workflow. For example, Dykes et al. (2010)
include overly cluttered screen design, developed a fall prevention toolkit that reuses
poor use of available screen space, and fall risk assessment data entered into the
inconsistencies in design. Involving end- clinical documentation system by nurses and
users in design and enforcing usability automatically generates a tailored set of tools
design standards when building clinical that provide decision support to all care team
systems prevents implementing systems members, including patients and their family
that are difficult to use and interfere with, members at the bedside (Dykes et al. 2009).
rather than support, patient-centered care The fall prevention toolkit logic was developed
(Principles 4, 5 and 8, . Table 17.3). from focus groups of professional and para-
professional caregivers (Dykes et al. 2009),
Innovative systems to support patient care and of patients and family members (Carroll
often take advantage of information entered et al. 2010). As nurses complete and file the
in one context for use in other contexts. routine fall risk assessment scale, the docu-
For example, the Brigham and Women’s mentation system automatically generates a
Hospital Patient Safety Learning Lab in tailored bed poster that alerts all team mem-
Boston developed a provider checklist and a bers about each patient’s fall risk status and
patient-centered toolkit that used informa- patient-appropriate interventions to mitigate
tion from the order entry, scheduling, flow- risk. In addition, a patient education handout
sheets (nursing documentation), and other is generated that identifies why each individual
systems to auto-populate a suite of tools used patient is at risk for falls and what the patient
by clinicians and patients to improve team and family members can do while in the hos-
communication and patient safety. The itera- pital to prevent a fall. The icons used in the
tive, participatory development process led to Fall TIPS poster and patient education hand-
tools that are used every day in the medical out have been developed and validated using
intensive care units and that demonstrated a participatory design process with clinicians
significant reductions in adverse events and and patients (Leung et al. 2017; Hurley et al.
improvement in patient and family satisfac- 2009). In a randomized control trial of over
tion (Mlaver et al. 2017). 10,000 patients, the toolkit was associated with
The principle of entering information a 25% reduction in falls (Dykes et al. 2010).
once for multiple uses also drove development The Fall TIPS toolkit reduced falls by leverag-
of the bedside displays for inpatients and the ing HIT to complete the three-step fall preven-
care team at Brigham and Women’s Hospital tion process: (1) conduct fall risk assessments,
(Duckworth et al. 2017). A patient safety (2) develop tailored fall prevention plans with
598 S. Bakken et al.
evidence-based interventions, and (3) consis- care and clinic settings to include home health,
tently implement the plan. We learned that specialist care, laboratory, pharmacy, popula-
Fall TIPS was most effective at reducing falls tion health, long-term care, and physical and
and related injuries when patients and family behavioral therapies (Payne et al. 2015).
were engaged in all three steps of the fall pre- Several studies have quantified documen-
vention process (Dykes et al. 2017). tation burden. In one setting, resident phy-
sicians spent 85 minutes per day authoring
and viewing notes (Hripcsak et al. 2011). In
17.4.3 Implementation of Systems another setting, on average, nurses perform
631–662 manual flowsheet data entries per
Much has been written about HIT failures and 12-hour shift (excluding device integrated
associated costs and consequences (Bloxham data), averaging to one data point every 0.82–
2008; Booth 2000; McManus & Wood-Harper 1.14 minutes in acute care (Collins, Couture
2007; Ornstein 2003; Rosencrance 2006). et al. 2018). Further, EHR log file analyses
Higgins and associates (see Rotman et al. indicate nurses spend 21.4–38.2 minutes per
1996) described the lessons learned from a day authoring notes, on average (Collins,
failed implementation of a computer-based Couture et al. 2018), yet fewer than 20% of
physician workstation that had been designed nursing notes were read by physicians, and
to facilitate and improve ordering of medica- only 38% were read by other nurses (Hripcsak
tions. Those lessons are not identical to, but et al. 2011). There is an overall lack of stan-
are consistent with, the recommendations of dardization and consistency of data defini-
Leiner and Haux (1996) in their protocol for tions within EHRs, leading to a proliferation
systematic planning and execution of projects of data elements that contribute to EHR
to develop and implement patient-care systems. burden and inhibit interoperability and auto-
While long term follow-up of a vendor EHR mated reporting (Zhou et al. 2016; Collins,
implementation with advanced CDS identified Bavuso et al. 2017; Collins, Klinkenberg-
lower prescribing error rates, achieving prior Ramirez et al. 2017; Collins, Rozemblum et
levels of perceived prescribing efficiency took al. 2017). Methods to increase consistency of
nearly 2 years (Abramson et al. 2013, 2016). data definitions have been published, but are
In response to evidence of unintended often minimally implemented due to project
consequences and clinicians voicing concerns timelines and limited resources (Collins et al.
after system implementations, the American 2015, Collins, Bavuso et al. 2017, Collins,
Medical Informatics Association (AMIA) Klinkenberg-Ramirez et al 2017, Collins,
EHR 2020 Task Force on the Status and Rozemblum et al 2017). Efforts by national
Future Directions of EHRs published a report organizations to improve consistency of data
in 2015 that outlined 10 recommendations definitions include the American Medical
that span five areas. These recommendations Association’s Integrated Health Model
are in response to current barriers to quality Initiative6 and collaborations between the
care delivery experienced at many organiza- Office of the National Coordinator for Health
17 tions after EHR system implementations. The Information Technology (ONC) and CMS.7
five areas addressed were: simplify and speed As these experiences demonstrate, the
documentation, refocus regulation, increase implementation of patient-care systems is
transparency and streamline certification, fos- far more complex than the replacement of
ter innovation, and support person-centered one technology with another. Such systems
care delivery. Specific recommendations dis-
cussed decreasing documentation burden,
improving the designs of interfaces so that 6 AMA. Integrated Health Model Initiative, 2018.
they support and build upon how people 7 https://ama-ihmi.org (Accessed 9/25/18).
7 ONC/CMS Reducing Clinician Burden Meeting.
think (i.e., cognitive-support design), and pro- February 22, 2018. 7 https://www.healthit.gov/
moting the integration of EHRs into the full news/events/onccms-reducing-clinician-burden-
social context of care, moving beyond acute meeting (Accessed 9/25/18).
Patient-Centered Care Systems
599 17
transform work and organizational relation- In 2014, the SAFER (Safety Assurance
ships. If the implementation is to succeed, Factors for Electronic Health Record
attention must be given to these transforma- Resilience) Guides were first published to
tions and to the disruptions that they entail. facilitate proactive risk assessments of EHR
Southon et al. (1997) provided an excellent safety and usability related policies, processes,
case study of the role of organizational procedures, and configurations at healthcare
factors in the failed implementation of a organizations (Ash et al. 2016). These guides
patient-care system that had been successful are endorsed by the ONC and available at
in another site. 7 HealthIT.gov. The SAFER Guides include
To realize the promise of informatics for nine guides organized into three broad groups:
health and clinical management, people who foundational guides, infrastructure guides,
develop and promote the use of applications and clinical process guides. Recent evalua-
must anticipate, evaluate, and accommodate tions indicate that health organizations would
the full range of consequences. In early 2003, benefit from broader implementation of these
these issues came to the attention of the public guides and principles of safety and usability
when a large academic medical center decided (Ash et al. 2016; Sittig et al. 2018).
to temporarily halt implementation of its For the purposes of promoting successful
CPOE system due to mixed acceptance by the implementation of patient-centered systems,
physician staff (Chin 2003; Ornstein 2003). the third category of the Friedman typology is
A case series study by Doolan et al. (2003) expanded to provide access to information to
identified five key factors associated with suc- all team members including patients and their
cessful implementation: (1) having organiza- families or caregivers outside of traditional
tional leadership, commitment, and vision; health care settings as follows: Installing such
(2) improving clinical processes and patient systems and then making them work reliably in
care; (3) involving clinicians in the design and functioning health care environments and other
modification of the system; (4) maintaining settings where information is needed to promote
or improving clinical productivity; and (5) health and wellbeing. The majority of self-
building momentum and support amongst management occurs outside traditional health
clinicians. A collaboration of ten AMIA care settings. As noted in . Table 17.3, a pre-
working groups and the International Medical requisite for patient-centered systems is that
Informatics Association Working Group on they support efficient workflows with ubiqui-
Organizational and Social Issues cospon- tous access to information and communication
sored a workshop to review factors that lead and that the systems are not constrained by
to implementation failure. These include poor existing care venues or provider-centric practice
communication, complex workflows, and fail- patterns (Principle #8). Clinical workflows are
ure to engage end-users in clearly defining highly complex and data-rich, requiring formal
system requirements. Recognizing that the analysis and evaluation before and after system
problems encountered in failed implementa- implementation. For example, a time-motion
tions tend to be more administrative than study found that, on average, nurses engage in
technical, they recommended the following set 31 communications and 52 hands-on tasks per
of managerial strategies to overcome imple- hour, and multi-task 18.63% of the time (Yen
mentation barriers (1) provide incentives for et al. 2016). Strategies to involve users in sys-
adoption and remove disincentives; (2) iden- tem design or selection and customization will
tify and mitigate social, IT, and leadership support successful implementation of systems
risks; (3) allow adequate resources and time that meet user expectations (Burley et al. 2009;
for training before and after implementation, Rahimi et al. 2009; Saleem, Russ, Justice et al.
including ongoing support; and (4) learn from 2009; Saleem, Russ, Sanderson et al. 2009).
the past and from others about implementa- User involvement in defining future workflows
tion successes and failures and about how contributes to a shared understanding about
failing situations were turned around (Kaplan the impact of information systems on clinical
and Harris-Salamone 2009). tasks and workflows (Leu et al. 2008).
600 S. Bakken et al.
Careful attention to the Principles to Guide puter (Campbell et al. 2006; Ash et al. 2007).
Successful Use of Health Care Information These and other unintended consequences of
Technology during system design will support EHRs and CPOE systems are the subject of
successful implementation. For example, the ongoing research. Detecting and finding ways
principles related to evolutionary health care to prevent or mitigate the adverse, unintended
changes keep the focus on designing and imple- consequences of these systems will be critical
menting usable systems that enable patient- for supporting patient-centered care.
centered care practices. Principles related to A number of unintended consequences
radical change focus on development of flex- stem from the incompatibility of system
ible, adaptable systems that are architected to design with the clinician’s cognitive workflow.
accommodate disruptive change and iterative For example, systems that make it difficult
development based on end-user feedback. to find and retrieve information can inter-
fere with patient-centered care. In a hospital
preparing to implement a commercial CPOE
system, investigators compared the efficiency,
17.4.4 Effects of Clinical
usability, and safety of information retrieval
Information Systems using the vendor’s system, the current paper
on the Potential form, and a prototype CPOE developed on
for Patient-Centered Care principles of User Centered Design. They
found the prototype system to be similar to
Electronic health records and CPOE systems the paper form and both to be significantly
are intended to support safe, evidence-based, superior to the vendor’s system in efficiency,
patient-centered care by examining patient- usability, and safety (Chan et al. 2011).
specific information, agency-specific infor- Other unintended consequences arise
mation, and domain-specific information in from over-reliance on the information system
the clinical context and proposing appropri- because of limited understanding of its design
ate courses of action or alerting clinicians and capacities. To date, many CDSS are some-
to potential dangers. Many current systems, what limited in their ability to incorporate
however, fail to follow design principles that patient-specific data into their decision algo-
take into account the real contingency-driven, rithms. A synthesis of 17 systematic reviews
non-linear, highly interrupted, collaborative, conducted with sound methodology found
cognitive, and operational workflow of clini- that CDSS often improved providers’ perfor-
cal practice (Ash et al. 2004). These flaws can mance, especially in medication orders and
lead to errors in entering and retrieving infor- preventive care. The reviewers noted, “These
mation, cognitive overload, fragmentation of outcomes may be explained by the fact that
the clinical overview of the patient’s situation, these types of CDSS require a minimum of
lack of essential operational flexibility, and patient data that are largely available before
breakdown of communication. Physicians, in the advice is (to be) generated: at the time
particular, have found themselves chagrined clinicians make the decisions” (Jaspers et al.
17 by changes in the power structure as they have 2011, p. 327).
devoted more time to entering information On the other hand, many systems offer
and orders while other members of the health functionalities that support patient-centered
team have gained greater access to information care. An important component of patient-
and the concomitant capacity to make certain centered care is the application of evidence in
decisions without consulting the physician. a plan of care tailored to the patient’s needs.
Clinicians across the range of professions To increase the use of evidence-based order
have expressed concern about the decrease sets, investigators at Sinai-Grace Hospital in
in face-to-face communication with its ver- Detroit, Michigan embedded into the general
bal and non-verbal richness, negotiation, and admission order set specific, evidence-based
redundant safety checks as more and more order sets for the most common primary and
clinical information is exchanged via the com- secondary diagnoses for patients admitted to
Patient-Centered Care Systems
601 17
their medical service. The result was a fivefold The quality of documentation tools can
increase in the use of evidence-based order have a profound effect on whether informa-
sets in the 16-month period following imple- tion, even if communicated face-to-face, is
mentation (Munasinghe et al. 2011). acted upon in clinical care (Collins et al. 2011).
As in the examples above, most systems In a neurovascular intensive care unit, thera-
to support the cognitive workload have been peutic goals for patients were stated during
directed toward physicians. Patient-centered daily interdisciplinary rounds. In this setting,
care requires a broader perspective. A study the interdisciplinary team treated the attend-
of the information needs of case managers ing physician’s note as a common patient-
for PLWH found that the most frequent needs focused source of information. Although the
were for patient education resources (33%), attending physician’s note contained 81% of
patient data (23%), and referral resources the stated ventilator weaning goals, it included
(22%) (Schnall, Cimino et al. 2011). The inves- only 49% of the stated sedation weaning
tigators recommended that targeted resources goals. Overall, nearly a quarter of stated goals
to meet these information needs be provided in were not documented in the note. If a goal
EHRs and continuity of care records through was not documented, it was 60% less likely
mechanisms such as the Infobutton Manager. to have a related action documented. Nurses’
Key to patient-centered care is communi- documentation rarely mentioned the goals,
cation among all the health care professionals even if actions recorded were consistent with
on the patient’s team. A study at one academic the goals as stated during rounds. Notably,
medical center (Hripcsak et al. 2011) reviewed the nurses’ structured documentation sys-
EHRs of hospitalized patients, along with tem did not support sedation-related goals,
usage logs, to make inferences about time even though sedation weaning was a nurs-
spent writing and viewing clinical notes and ing responsibility in this setting. The authors
patterns of communication among team mem- noted that the frequent omission of sedation
bers. In this setting, the core team for each goals from the attending physician’s note
patient consisted of one or more attending might be because this nursing function was
physicians, residents, and nurses, with social not a billable goal or act. They also expressed
workers, dieticians, and various therapists concern that the omission from the EHR of
joining the team later. Results showed that evidence of important clinical judgments
clinical notes were more likely to be reviewed nurses make could impair patient safety, qual-
within the same professional group, with ity improvement, and development of nursing
attending physicians and residents viewing knowledge. Thus, in this example, although
notes from nurses or social workers less than the interdisciplinary team was collaborating
a third of the time. The investigators proposed in setting and reaching therapeutic goals, defi-
that it might be useful to develop ways for ciencies in their processes and in the nurses’
EHRs “to summarize information and make documentation system limited their achieve-
it readily available, perhaps with the ability ment of patient-centered care.
of the author to highlight information that A study at Vanderbilt University Medical
may be critical and that has a high priority for Center also demonstrated both strengths and
communication” (p. 116) (Hirsch et al. 2014). shortcomings in the ability of a clinical infor-
They also noted that their study was limited to mation system to support patient-centered
communications within the EHR and did not care. Attending physicians in the Trauma
take into account face-to-face or telephone and Surgical ICUs established protocols for
communications that might have occurred, Intensive Insulin Therapy that were built into a
especially in urgent situations. They suggested CDSS to advise nurses on insulin doses based
further research involving direct observation on a patient’s blood glucose and insulin resis-
of clinicians, time-motion analyses, and think- tance trends. In 94.4% of studied instances,
aloud methods to develop deeper knowledge nurses administered the recommended dose.
of how clinicians communicate about patient When nurses overrode the recommended dose,
care, especially across the professions. they overwhelmingly administered less insu-
602 S. Bakken et al.
lin than the recommended dose, leading to a health care professionals recognize the poten-
higher incidence of hyperglycemia than when tial of social networking sites for the peer sup-
the recommended dose was administered. port that patients gain from participation, but
Nurses appeared more concerned about hypo- they have concerns about the accuracy of the
glycemia than hyperglycemia and to consider content that is offered on these sites. Available
the patient’s blood glucose but not the insulin research indicates great variability in the accu-
resistance trend. They also noted that their racy and effectiveness of the clinical content
workflow was impeded by the need to record offered on social networking sites (Greene
information about the blood glucose, insu- et al. 2011; Mogi et al. 2017). One study found
lin dose, and primary dextrose source in two that approximately one-third of the posts on
places—the CPOE that included the CDSS and health-related social networking sites could be
the separate nurse charting system. The inves- classified as an advertisement of a non-FDA-
tigators’ recommendations included display- approved remedy or cure (Greene et al. 2011).
ing information about insulin resistance trends It was also noted that requests for personal
on screen (provided that this did not produce information were not uncommon, potentially
clutter and confusion) and developing clinical making participants vulnerable to solicita-
information systems that do not require dou- tions from product manufacturers or vendors.
ble documentation. Strengths of this example Clinicians should be aware of the social net-
for supporting patient-centered care include working sites related to their area of expertise.
the collaboration of physicians and nurses in Discussing social networking options with
maintaining blood glucose in the desired range patients, including the advantages and disad-
based on patient data. Shortcomings include vantages and the potential benefits and prob-
the failure to present nurses with information lems, can help patients to be more judicious
about the patient’s insulin resistance trend to consumers of social networking sites.
aid their decision-making and the requirement In patient-centered care, personal health
that they record the same data in two places, records (see 7 Chap. 13) are often viewed as a
thereby reducing time for direct patient care means of communication between patients and
(Campion Jr et al. 2011). providers and as a method of engaging patients
Patient-centered care systems not only in understanding and acting in the interests of
support the cognitive work and communica- their own health. A 2016 international study
tions of clinicians; they also take into account to understand perspectives on sharing data
the resources patients and families use to with patients categorized countries that are
manage their health concerns. Increasingly, focused on encouraging patients to receive
patients and family members engaged in pro- access to their clinical data, within the follow-
moting their own health turn to social net- ing stages of maturity: Established, Emerging,
working sites on the Internet. While research and Limited. Countries with “Established”
on the impact of social networking sites on levels of maturity for sharing patient data
patient-related outcomes is in its infancy, early had been focused on this work since the early
research indicates that social networking sites 2000s and included Israel, England, Canada,
17 are used by patients and family members to Australia, and the United States (Prey et al.
get informational and social support for day- 2016). Expanded efforts to share patient
to-day management of chronic illnesses such data led to “Emerging” status for Austria,
as diabetes and heart failure (Mogi et al. 2017; Argentina, Brazil, the Netherlands, Portugal,
Partridge et al. 2018). Several studies indicate South Korea, Switzerland, and Uruguay.
that the primary objectives for using social Iran, Japan, and Kenya were described as
networking sites were to request information, focused on EHR implementation with “lim-
to provide information to others with similar ited” patient engagement and the potential
conditions, to express emotion about one’s for increased focus on patient engagement in
own condition, to provide emotional support the future. In 2013, ONC announced the Blue
to others, and to promote a specific product Button initiative, promoting patients’ legal
(Greene et al. 2011; Mogi et al. 2017). Many rights to receive their personal health informa-
Patient-Centered Care Systems
603 17
tion and recognizing sites that enable consum- needs for full information about the patient
ers to download their health records. A 2017 and for appropriate opportunities for ethical
ONC Data Brief reported that 52 percent of disclosure of information to patients. Over the
individuals have been offered online access to following decade throughout the 2010s, core
their medical record in the United States and functionality for portals continued to expand
over half of those individuals viewed their and mature. Common features now include
record within the past year, the equivalent of secure messaging, prescription renewals and
28 percent of individuals nationwide (Patel & refills, appointment requests online, online bill
Johnson 2018). pay, referral requests, medication reminders,
Most of the clinical data being shared links to reference materials, use of low health
through portals is structured, coded informa- literacy terms, and increasing access to several
tion such as problem lists, allergies, medication types of clinical data. Clinical data released in
lists, appointments, and laboratory results. portals include routine and sensitive labora-
Sharing of narrative notes is not widespread, tory results, genetic test results, medications,
but, as noted previously, the OpenNotes proj- encounter information, allergies, immuniza-
ect has demonstrated the successful sharing tions, radiology and other diagnostic reports,
of notes with patients. To date, over 28 mil- problem lists, discharge summaries, and clini-
lion patients have online access to notes, and cal notes. In many organizations embargo
the network of providers participating in this periods for releasing results have lessened or
project continues to grow.8 been eliminated, although variability remains.
Patient portals, which evolved as a mecha- As portal functionalities expanded and
nism to access personal health data and popu- were optimized, acute care patient portals
late personal health record platforms, began (portal access tailored to the hospital setting)
to take hold in the 2000s. Several pilots during began to emerge. Although some clinicians
this time, such as the Military Health System’s feared that patient portal access to data dur-
MiCare portal in 2008, resulted in critical ing a hospitalization would burden them with
learnings and informed optimization of por- excessive patient inquiries, their fears were
tal functionality over the following decade. no more realized than were those of clini-
Lessons learned include the following: (1) cians wary of the OpenNotes project (Collins,
transfer of data upon specific patient request Bavuso et al. 2017; Collins, Klinkenberg-
is more efficient than automatic transfer; (2) Ramirez et al. 2017; Collins, Rozemblum et al.
patient representatives prefer instant access 2017). User-centered design with patients and
to all their data, while many providers prefer families and other key clinician and adminis-
an embargo time, particularly for release of trative stakeholders specified that acute care
sensitive results data; (3) inefficient provider portals could provide value by humanizing
access to personal health records and siloed the patient-clinician connection, facilitating
repositories with incomplete information the maintenance and sharing of verbal com-
might pose the danger of ill-informed clinical munication, and promoting ubiquitous and
decisions; and (4) giving patients the power equitable access to information. Key features
to determine what medical information to specified for portals in the acute care setting
share with the provider could similarly lead to are the provision of clinical data, messaging
clinical decisions made in the absence of vital with clinicians, glossary of clinical/hospital
information, with resulting harm to patients. terms, patient education resources, patient
The MiCare pilot concluded that while there diary, patient notepad for reminders, resources
is broad agreement on desired functional- that support family involvement, and tiered
ities for portals, challenging tensions remain displays for information-dense clinical data.
between patients’ desire for access to and New clinical workflows are required to inte-
control of health information and providers’ grate portals within the acute care setting, and
these are facilitated by active clinician engage-
ment and demonstration of improved patient
8 7 https://www.opennotes.org/ (Accessed 9/25/18). outcomes and satisfaction (Collins, Bavuso et
604 S. Bakken et al.
al. 2017; Collins, Klinkenberg-Ramirez et al. Electronic health records and other
2017; Collins, Rozemblum et al. 2017). Patient computer- based information resources can
portals are increasing, but not yet in routine influence the provision of patient-centered
use, in the acute and post-acute setting. care even when the patient and the provider
In 2018, 7 key focus areas were identified are in the same room. A study of computer
for sociotechnical and evaluation research use during acute pediatric outpatient vis-
related to patient engagement and portal its found that female physicians were more
use in the acute and post-acute care settings. likely than males to be communicating with
These identified research areas were (1) patients and families while using the com-
standards for interoperability, functional- puter (Fiks et al. 2011). A recent study in
ity, and patient-driven use case models; (2) ambulatory care revealed that although pro-
appropriate access and policies for privacy viders reported improvements attributable
and security; (3) user-centered design; (4) to EHRs (e.g., communication between pro-
implementations that integrate with work- viders, review of results with patients, and
flows for sustainable adoption; (5) data and review of follow-up to testing results with
content management and visualizations patients), they perceived a negative effect
with inclusion of novel data sources (e.g., on patient-provider connection (Sandoval
multimedia tools, safety reporting plat- et al. 2016). An observational study involv-
forms, social determinants of health [SDOH] ing 20 primary care physicians and 141 of
data); (6) CDS for patients and care part- their adult patients showed how the inclu-
ners; and (7) systematic evaluation of pro- sion of the computer in the clinical consul-
cess, balance, and outcomes measurement tation can help patients shift the balance of
(Collins, Dykes et al. 2018). power and authority toward shared decision-
Importantly, patients expect that patient making and patient-centered care (Pearce
portals across ambulatory and acute care set- et al. 2011). This Australian study found
tings are seamless and integrated for ease of that about one-third of the patients actively
access within the patient’s control. Seamless included the computer as a party to the con-
data access within the patient’s control is not sultation, drawing the physician’s attention
yet broadly realized, but several technical to it as a source of information or author-
and policy initiatives offer promising paths ity. They concluded, “In the future, computers
that were not possible previously. The federal will have greater agency, not less, and patients
21st Century Cures Act (Cures Act) includes will involve themselves in the three-way con-
provisions to improve patients’ access to their sultation in more creative ways—for example,
health data and simplification of the patient’s through online communication, or through the
ability to electronically share their informa- plugging into computers of their own electronic
tion. Aligned with those provisions, there are records, creating a situation where they co-own
ongoing interoperability efforts with a pri- the information in the computer …. By democ-
mary focus on connecting mobile health apps ratizing and commoditizing information flows
and devices to EHRs using open application and authority in the consultation, we may in
17 programming interfaces (APIs) to allow indi- fact create truly patient- centered medicine,
viduals to collect, manage, and share their with the patient directing the action” (p. 142).
health information. ONC, aligned with the As these examples illustrate, the com-
Cures Act, promotes policy choices that will plexity of collaborative, interdisciplinary,
give consumers, clinicians, and innovators patient-centered care poses serious chal-
more options for getting to and using health lenges to the design of clinical informa-
information with specific HIT certification tion systems. Many systems fall short in
criteria that call for the development of mod- supporting cognitive work, even from a
ern APIs that do not require “special effort” clinician-centric perspective. Supporting
to access and use (Rucker 2018). ONC pro- communications among clinicians, between
motes the use of the FHIR standard for rep- clinicians and patients, and among patient
resenting clinical data with APIs. and family support groups presents myr-
Patient-Centered Care Systems
605 17
iad technical and ethical problems. Still, Chimowitz, H., Gerard, M., Fossa, A., Bourgeois,
researchers and clinicians increasingly share F., & Bell, S. K. (2018). Empowering Informal
a vision of patient-centered care that drives Caregivers with Health Information:
them to push the frontiers and develop sup- OpenNotes as a Safety Strategy. Joint
port for this emerging model of care. Commission journal on quality and patient
safety, 44(3), 130–136.
Collins, S., Hurley, A., Chang, F., Illa, A., Benoit,
17.5 Outlook for the Future A., Laperle, S., & Dykes, P. (2014). Content
and functional specifications for a standards-
Social and political forces have begun to based multidisciplinary rounding tool to
transform health care in the United States, maintain continuity across acute and critical
and HIT is advancing to support the care. Journal of the American Medical
changes. The transformation is rapid, dis- Informatics Association: JAMIA, 21(3), 438–
ruptive, and not always smooth, but man- 447. This article describes the content and
dates and incentives are aligning with social functional specifications for a standards-based
and economic imperatives to maintain prog- multidisciplinary rounding tool.
ress. Collins, S., Dykes, P., Bates, D. W., Couture, B.,
To meet demands for patient-centered Rozenblum, R., Prey, J., et al. (2018). An
care, changes must occur in clinician prac- informatics research agenda to support patient
tice patterns and processes, in the organiza- and family empowerment and engagement in
tion and management of health services, and care and recovery during and after hospital-
in the education of health care professionals ization. Journal of the American Medical
and the public. To support patient-centered Informatics Association: JAMIA, 25(2):206–
care, clinicians, informatics professionals, and 209. Based on a national workshop, this paper
computer scientists must develop health infor- presents an informatics research agenda to
mation and communication technologies that support patient and family empowerment and
support collaboration; cognitive processes and engagement in care.
operational workflow; communication and Dykes, P. C., Carroll, D. L., Hurley, A., Lipsitz,
shared decision making between and among S., Benoit, A., Chang, F., et al. (2010). Fall
clinicians, patients, and family members; and prevention in acute care hospitals: A random-
trustworthy tools for the management of per- ized trial. Journal of the American Medical
sonal and family health. Association: JAMIA, 304(17), 1912–1918.
Transformational change is daunting, and This article describes d
evelopment and testing
resistance is inevitable. Still, the chances for of an electronic fall prevention toolkit inte-
success have never been better. The vision grating decision support into clinical work-
of health care articulated by the National flow at the bedside and intended for use by all
Academies of Sciences is guiding policy, team members including patients and family.
research, and practical action by govern- Grossman, L. V., Choi, S. W., Collins, S., Dykes, P.
ment agencies, health care providers, and the C., O’Leary, K. J., Rizer, M., Strong, P., Yen,
public. P. Y., & Vawdrey, D. K. (2018). Implementation
of acute care patient portals: recommenda-
nnSuggested Readings tions on utility and use from six early adopt-
Ahern, D. K., Woods, S. S., Lightowler, M. C., ers. Journal of the American Medical
Finley, S. W., & Houston, T. K. (2011). Informatics Association: JAMIA, 25(4), 370–
Promise of and potential for patient-facing 379.
technologies to enable meaningful use. IOM (Institute of Medicine). (2013). Digital data
American Journal of Preventive Medicine, improvement priorities for continuous learn-
40(5 Suppl 2), S162–S172. This article ing in health and health care: Workshop sum-
describes specific technologies that patients mary. Washington, DC: The National
can use in the interests of their health and that Academies Press. This workshop summary
support patient-centered care. describes digital data improvement priorities
606 S. Bakken et al.
for continuous learning in health and health 5. Over the past decade, many of the
care. patient-care information systems
Matney, S. A., Warren, J. J., Evans, J. L., Kim, T. Y., designed in the 1970s have been replaced
Coenen, A., & Auld, V. A. (2012). Development by vendor-based systems. What role
of the nursing problem list subset of SNOMED have public policy and payment models
CT®. Journal of Biomedical informatics, 45(4), had in driving this change? How do the
683–688. practice models, payer models, and
quality focus of today differ from those
??Questions for Discussion of the past? What differences do these
1. What is the utility of a linear model of changes require in information systems?
patient care as the basis for a decision- What are two advantages and two disad-
support system? What are two primary vantages of the older, internally devel-
limitations? Discuss two challenges oped legacy systems versus
that a nonlinear model poses for vendor-provided systems?
representing and supporting the care 6. What challenges exist in modeling
process in an information system. information for patient-centered care?
2. Compare and contrast additive, What considerations are important in
clinician-
centered versus coordinated, designing patient-facing health
patient-centered models of interdisci- information and communication
plinary patient care. What are the advan- technologies?
tages and disadvantages of each model
as a mode of care delivery? What are the
broad implications for design of infor- References
mation systems to support clinician-
centered versus patient-centered models Abramson, E. L., Malhotra, S., Osorio, S. N., Edwards,
of care? A., Cheriff, A., Cole, C., & Kaushal, R. (2013). A
3. Imagine a patient-care information sys- long-term follow-up evaluation of electronic health
record prescribing safety. Journal of the American
tem that assists in planning the care of Medical Informatics Association, 20, e52–e58.
each patient independently of all the Abramson, E., Patel, V., Pfoh, E., & Kaushal, R. (2016).
other patients in a service center or How physician perspectives on e-prescribing evolve
patient-care unit. What are three advan- over time. Applied Clinical Informatics, 7, 994–1006.
tages to the developer in choosing such Adams, K., & Corrigan, J. (Eds.). (2003). Priority areas
for national action: Transforming health care quality.
an information architecture? What A report of the Institute of Medicine. Washington,
would be the likely result in the real DC: National Academy Press.
world of practice? Does it make a differ- Adler-Milstein, J., & Jha, A. K. (2017). HITECH Act
ence whether the practice setting is hos- drove large gains in hospital electronic health record
pital, ambulatory care, or home care? adoption. Health Affairs (Millwood), 36(8), 1416–
1422.
What would be the simplest information Agency for Healthcare Research and Quality (AHRQ).
architecture that would be sufficiently (2014). Care coordination measures atlas update:
17 complex to handle real-world demands? Table 4. Meaningful use measures that assess care
Explain. coordination*. Available from: https://www.ahrq.
4. Zielstorff et al. (1993) proposed that gov/professionals/prevention-chronic care/improve/
coordination/atlas2014/chapter4tab4.html
data routinely recorded during the pro- Ahern, D. K., Woods, S. S., Lightowler, M. C., Finley,
cess of patient care could be abstracted, S. W., & Houston, T. K. (2011). Promise of and
aggregated, and analyzed for manage- potential for patient-facing technologies to enable
ment reports, policy decisions, and meaningful use. American Journal of Preventative
knowledge development. What are three Medicine, 40(5 Suppl 2), S162–S172.
Ash, J. S., Berg, M., & Coiera, E. (2004). Some unin-
advantages of using patient care data in tended consequences of information technology in
this way? What are three significant limi- health care: The nature of patient care information
tations? system-related errors. Journal of the American
Patient-Centered Care Systems
607 17
Medical Informatics Association: JAMIA, 11(2), Chan, J., Shojania, K. G., Easty, A. C., & Etchells, E. E.
104–112. (2011). Does user-centred design affect the effi-
Ash, J. S., Sittig, D. F., Poon, E. G., Guappone, K., ciency, usability and safety of CPOE order sets?
Campbell, E., & Dykstra, R. H. (2007). The extent Journal of the American Medical Informatics
and importance of unintended consequences related Association: JAMIA, 18(3), 276–281.
to computerized provider order entry. Journal of the Chen, E. S., Zhou, L., Kashyap, V., Schaeffer, M., Dykes,
American Medical Informatics Association: JAMIA, P. C., & Goldberg, H. S. (2008). Early experiences in
14(4), 415–423. evolving an enterprise-wide information model for
Ash, J. S., Sittig, D. F., Dykstra, R. H., Campbell, E., & laboratory and clinical observations. In AMIA
Guappone, K. (2009). The unintended consequences annual symposium proceedings, 2008 (pp. 106–110).
of computerized provider order entry: Results of a Chin, T. (2003, February 17). American Medical News.
mixed methods exploration. International Journal of Cimino, J. J., Overby, C. L., Devine, E. B., Hulse, N. C.,
Medical Informatics, 78(Supplement 1), S69–S76. Jing, X., Maviglia, S. M., & Del Fiol, G. (2013).
Ash, J., Singh, H., & Sittig, D. (2016). Safer guides. Practical choices for infobutton customization:
HealthIT.gov. Available at: https://www.healthit. Experience from four sites. In Proceedings of the
gov/topic/safety/safer-guides AMIA annual symposium (pp. 236–245).
Bates, D. W., & Bitton, A. (2010). The future of health Collen, M. F. (1995). A history of medical informatics in
information technology in the patient-centered the United States, 1950 to 1990. Bethesda: American
medical home. Health Affairs (Millwood), 29(4), Medical Informatics Association.
614–621. Collins, S. A., & Vawdrey, D. (2012). “Reading between
Bates, D. W., Saria, S., Ohno-Machado, L., Shah, A., & the lines” of flowsheet data: Nurses’ optional docu-
Escobar, G. (2014). Big data in health care: Using mentation associated with cardiac arrest outcomes.
analytics to identify and manage high-risk and high- Applied Nursing Research, 25(4), 251–257.
cost patients. Health Affairs (Millwood) [Internet], Collins, S. A., Bakken, S., Vawdrey, D. K., Coiera, E., &
33(7), 1123–1131. Currie, L. M. (2011). Agreement between common
Bickel, R. G. (1979). The TRIMIS concept. In goals discussed and documented in the ICU. Journal
Proceedings of the 1979 annual symposium on com- of the American Medical Informatics Association:
puter applications in medical care (pp. 839–842). JAMIA, 18(1), 45–50.
Washington, DC. Collins, S., Cato, K., Albers, D., Scott, K., Stetson, P.,
Bloxham, A. (2008, August 10). £13 Billion NHS com- Bakken, S., & Vawdrey, D. (2013). Relationship
puter system failures affecting patient care. The between nursing documentation and patients’ mor-
Telegraph. Retrieved April 27, 2020, from https:// tality. American Journal of Critical Care, 22(4), 306–
www.telegraph.co.uk/news/uknews/2535099/13bn- 313.
NHS-computer-system-failures-affecting-patient- Collins, S., Gazarian, P., Stade, D., McNally, K.,
care.html Morrison, C., Ohashi, K., Lehmann, L., Dalal, A.,
Booth, R. (2000). Project failures costly, TechRepublic/ Bates, D., & Dykes, P. (2014). Clinical workflow
Gartner study finds. Retrieved April 27, 2020, from observations to identify opportunities for nurse,
http://www.t echrepublic.c om/article/it-project- physicians and patients to share a patient-centered
f a i l u r e s -c o s t ly - t e c h r e p u b l i c g a r t n e r - s t u dy - plan of care. In Proceedings of the AMIA annual
finds/1062043 symposium (pp. 414–423).
Burley, L., Scheepers, H., & Owen, L. (2009). User Collins, S., Hurley, A., Chang, F., Illa, A., Benoit, A.,
involvement in the design and appropriation of a Laperle, S., & Dykes, P. (2014). Content and func-
mobile clinical information system: Reflections on tional specifications for a standards-based multidis-
organisational learning. New York: Springer. ciplinary rounding tool to maintain continuity
Campbell, E. M., Sittig, D. F., Ash, J. S., Guappone, across acute and critical care. Journal of the
K. P., & Dykstra, R. H. (2006). Types of unintended American Medical Informatics Association: JAMIA,
consequences related to computerized provider 21(3), 438–447.
order entry. Journal of the American Medical Collins, S. A., Gesner, E., Morgan, S., Mar, P., Maviglia,
Informatics Association: JAMIA, 13(5), 547–556. S., Colburn, D., et al. (2015). A practical approach
Campion, T. R., Jr., May, A. K., Waitman, L. R., Ozdas, to governance and optimization of structured data
A., Lorenzi, N. M., & Gadd, C. S. (2011). elements. Studies in Health Technology and
Characteristics and effects of nurse dosing over- Informatics, 216, 7–11.
rides on computer-based intensive insulin therapy Collins, S. A., Rozenblum, R., Leung, W. Y., Morrison,
protocol performance. Journal of the American C. R. C., Stade, D. L., McNally, K., et al. (2017).
Medical Informatics Association: JAMIA, 18(3), Acute care patient portals: A qualitative study of
251–258. stakeholder perspectives on current practices.
Carroll, D. L., Dykes, P. C., & Hurley, A. C. (2010). Journal of the American Medical Informatics
Patients’ perspectives of falling while in an acute Association: JAMIA, 24(e1), e9–e17.
care hospital and suggestions for prevention. Collins, S., Bavuso, K., Swenson, M., Suchecki, C., Mar,
Applied Nursing Research, 23(4), 238–241. P., & Rocha, R. (2017). Evolution of an
608 S. Bakken et al.
implementation-ready interprofessional pain assess- ized safety plan screensaver. Journal of Gerontology
ment reference model. In AMIA annual symposium Nursing, 43(4), 15–22.
proceedings (p. 10). Dykes, P. C., Carroll, D. L., Hurley, A. C., Benoit, A., &
Collins, S., Klinkenberg-Ramirez, S., Tsivkin, K., Mar, Middleton, B. (2009). Why do patients in acute care
P. L., Iskhakova, D., Nandigam, H., Samal, L., & hospitals fall? Can falls be prevented? The Journal
Rocha, R. A. (2017). Next generation terminology of Nursing Administration, 39(6), 299–304.
infrastructure to support interprofessional care Dykes, P. C., Carroll, D. L., Hurley, A., Lipsitz, S.,
planning. Journal of Biomedical Informatics, 75, Benoit, A., Chang, F., et al. (2010). Fall prevention
22–34. in acute care hospitals: A randomized trial. Journal
Collins, S., Couture, B., Kang, M. J., Dykes, P., Schnock, of the American Medical Association, 304(17), 1912–
K., Knaplund, C., Chang, F., & Cato, K. (2018). 1918.
Quantifying and visualizing nursing flowsheet docu- Dykes, P. C., Samal, L., Donahue, M., Greenberg, J. O.,
mentation burden in acute and critical care. In Hurley, A. C., Hasan, O., O’Malley, T. A.,
Proceedings of the AMIA annual symposium. Venkatesh, A. I., Volk, L. A., & Bates, D. W. (2014).
Collins, S., Dykes, P., Bates, D. W., Couture, B., A patient-centered longitudinal care plan: Vison
Rozenblum, R., Prey, J., et al. (2018). An informat- versus reality. Journal of the American Medical
ics research agenda to support patient and family Informatics Association: JAMIA, 21(6), 1082–1090.
empowerment and engagement in care and recovery Dykes, P. C., Duckworth, M., Cunningham, S., Dubois,
during and after hospitalization. Journal of the S., Driscoll, M., Feliciano, Z., Ferrazzi, M., Fevrin,
American Medical Informatics Association: JAMIA, F. E., Lyons, S., Lindros, M. E., Monahan, A.,
25(2), 206–209. Paley, M. M., Jean-Pierre, S., & Scanlan, M. (2017).
Committee on Data Standards for Patient Safety. (2004). Pilot testing fall TIPS (tailoring interventions for
Patient safety: Achieving a new standard for care. A patient safety): A patient-centered fall prevention
report of the Institute of Medicine. Washington, DC: toolkit. The Joint Commission Journal on Quality
The National Academies Press. and Patient Safety, 43(8), 403–413.
Committee on Quality of Health Care in America. Fiks, A. G., Alessandrini, E. A., Forrest, C. B., Khan, S.,
(2001). Crossing the quality chasm: A new health sys- Localio, A. R., Gerber, A., et al. (2011). Electronic
tem for the 21st century. A report of the Institute of medical record use in pediatric primary care. Journal
Medicine. Washington, DC: National Academy of the American Medical Informatics Association:
Press. JAMIA, 18(1), 38–44.
Committee on the Robert Wood Johnson Foundation Friedman, C. P. (1995). Where’s the science in medical
Initiative on the Future of Nursing at the Institute informatics? Journal of the American Medical
of Medicine. (2010). The future of nursing: Leading Informatics Association: JAMIA, 2(1), 65–67.
change, advancing health. Washington, DC: Institute Goodwin, J. O., & Edwards, B. S. (1975). Developing a
of Medicine of the National Academies, The computer program to assist the nursing process:
National Academies Press. Phase I – from systems analysis to an expandable
Cronenwett, L., Sherwood, G., & Gelmon, S. B. (2009). program. Nursing Research, 24, 299–305.
Improving quality and safety education: The QSEN Grant, R. W., Wald, J. S., Poon, E. G., Schnipper, J. L.,
Learning Collaborative. Nursing Outlook, 57(6), Gandhi, T. K., Volk, L. A., et al. (2006). Design and
304–312. implementation of a web-based patient portal
Del Fiol, G., Huser, V., Strasberg, H. R., Maviglia, linked to an ambulatory care electronic health
S. M., Curtis, C., & Cimino, J. J. (2012). record: Patient gateway for diabetes collaborative
Implementations of the HL7 Context-Aware care. Diabetes Technology & Therapeutics, 8(5), 576–
Knowledge Retrieval (“Infobutton”) Standard: 586.
Challenges, strengths, limitations, and uptake. Greene, J. A., Choudhry, N. K., Kilabuk, E., & Shrank,
Journal of Biomedical Informatics, 45, 726–735. W. H. (2011). Online social networking by patients
Leiner, F., & Haux, R. (1996). Systematic planning of tion: Diabetes-related communities in Google+.
clinical documentation. Methods of Information in Diabetology International, 8, 323–327.
Medicine, 35, 25–34. Munasinghe, R. L., Arsene, C., Abraham, T. K., Zidan,
Leu, M. G., Cheung, M., Webster, T. R., Curry, L., M., & Siddique, M. (2011). Improving the utiliza-
Bradley, E. H., Fifield, J., et al. (2008). Centers tion of admission order sets in a computerized phy-
speak up: The clinical context for health informa- sician order entry system by integrating modular
tion technology in the ambulatory care setting. disease specific order subsets into a general medi-
Journal of General Internal Medicine, 23(4), 372– cine admission order set. Journal of the American
378. Medical Informatics Association: JAMIA, 18(3),
Leung, W. Y., Adelman, J., Bates, D. W., Businger, A., 322–326.
Dykes, J. S., Ergai, A., Hurley, A., Katsulis, Z., Oniki, T. A., Zhuo, N., Beebe, C. E., Liu, H., Coyle,
Khorasani, S., Scanlan, M., Schenkel, L., Rai, A., & J. F., Parker, C. G., Solbrig, H. R., Marchant, K.,
Dykes, P. C. (2017, February 22). Validating fall pre- Kaggal, V. C., Chute, C. G., & Huff, S. M. (2016).
vention icons to support patient-centered educa- Clinical element models in the SHARPn consor-
tion. Journal of Patient Safety. [epub ahead of tium. Journal of the American Medical Informatics
print]. Association: JAMIA, 23(2), 248–256.
Lindberg, D. A. B. (1965). Collection, evaluation, and Ornstein, C. (2003, January 22). Hospital heeds doctors,
transmission of hospital laboratory data. Presented suspends use of software. Los Angeles Times.
at the seventh IBM medical symposium, Ozbolt, J. (2000). Terminology standards for nursing:
Poughkeepsie, NY. Collaboration at the summit. Journal of the
Mandel, J. C., Kreda, D. A., Mandl, K. D., Kohane, American Medical Informatics Association: JAMIA,
I. S., & Ramoni, R. B. (2016). SMART on FHIR: A 7, 517–522.
standards-based, interoperable apps platform for Ozbolt, J. G., Schultz, S., Swain, M. A. P., Abraham,
electronic health record. Journal of the American I. L., & Stein, K. F. (1984). Developing expert sys-
Medical Informatics Association: JAMIA, 23, 899– tems for nursing practice. In G. S. Cohen (Ed.),
908. Proceedings of the eighth annual symposium on
Massaro, T. A. (1993). Introducing physician order Computer Applications in Medical Care (pp. 654–
entry at a major academic medical center. Academic 657). Silver Spring: IEEE Computer Society.
Medicine, 68, 20–30. Ozbolt, J. G., et al. (1985). A proposed expert system for
Matney, S. A., Warren, J. J., Evans, J. L., Kim, T. Y., nursing practice. Journal of Medical Systems, 9,
Coenen, A., & Auld, V. A. (2016). Development of 57–68.
the nursing problem list subset SNOMED CT®. Ozbolt, J. Sands, D. Z., Dykes, P. Pratt, W., Rosas,
Journal of Biomedical Informatics, 45, 683–688. A. G., & Walker, J. M. (2012). Building Better
Matney, S. A., Dolin, G., Buhl, L., & Sheide, A. (2016). Consumer eHealth. A report commissioned by the
Communicating nursing care using the health level Office of the National Coordinator for Health
seven consolidated clinical document architecture Information Technology, under contract to Westat,
release 2 care plan. Computers Informatics Nursing, Inc. Available at http://www.Healthit.gov/sites/
34(3), 128–136. default/files/final_report_building_better_con-
McDonald, C. J. (1976). Protocol-based computer sumer_ehealth.pdf.
reminders, the quality of care and the nonperfect- Partridge, S. R., Gallagher, P., Freeman, B., & Gallagher,
ibility of man. The New England Journal of R. (2018). Facebook groups for the management of
Medicine, 295(24), 1351–1355. chronic diseases. Journal of Medical Internet
McDonald, K. M., Schultz, E., Chapman, T., et al. Research, 20(1), e21.
(2012). Prospects for care coordination measurement Patel, V., & Johnson, C. (2018, April). Individuals’ use of
using electronic data sources (Prepared by Stanford online medical records and technology for health
University under subcontract to Battelle on Contract needs. ONC Data Brief, No 40. https://www.healthit.
Stead, W. W., Borden, R., Bourne, J., Giuse, D., Giuse, Understanding and visualizing multitasking and
N., Harris, T. R., Miller, R. A., & Olsen, A. J. (1996). task switching activities: A time motion study to
The Vanderbilt University fast track to IAIMS: capture nursing workflow. In AMIA annual sympo-
From planning to implementation. Journal of the sium proceedings, 2016 (pp. 1264–1273).
American Medical Informatics Association: JAMIA, Yen, P., Phillips, A., Kennedy, M., & Collins, S. (2017).
3, 308–317. Nursing Informatics Competency Assessment for
Tcheng, J. E., Bakken, S., Bates, D. W., Bonner, H., III, the Nurse Leader (NICA-NL): Instrument refine-
Ghandi, T. K., Josephs, M., Kawamoto, K., ment, validation, and psychometric analysis. Journal
Lomoton, E. A., MacKay, E., Middleton, B., Teich, of Nursing Administration, 47(5), 271–277.
J. M., Weingarten, S., & Hamilton-Lopez, M. (Eds.). Zhou, L., Soran, C. S., Jenter, C. A., Volk, L. A., Orav,
(2017). Optimizing strategies for clinical decision E. J., Bates, D. W., et al. (2009). The relationship
Support: Summary of meeting series. Washington, between electronic health record use and quality of
DC: National Academies Press. care over time. Journal of the American Medical
Vawdrey, D., Wilcox, L., Collins, S., Bakken, S., Feiner, Informatics Association: JAMIA, 16(4), 457–464.
S., & Boyer, A. (2011). A tablet computer applica- Zhou L, Collins S, Morgan S, Zafar N, Gesner E,
tion for patients to participate in their hospital care. Fehrenbach M, Rocha R. (2016, November 12–16).
In Proceedings of the 2011 AMIA annual fall sympo- A decade of experience in creating and maintaining
sium (pp. 1428–1435). data elements for structured clinical documentation
Wald, J. S., Grant, R. W., Schnipper, J. L., Gandhi, in EHRs. In Proceeding of the American Medical
T. K., Poon, E. G., Businger, A. C., et al. (2009). Informatics Association annual fall symposium
Survey analysis of patient experience using a (pp. 1293–1302). Chicago, IL. PMID: 28269927.
practice-linked PHR for type 2 diabetes mellitus. In Zielstorff, R. D., Hudgings, C. L., Grobe, S. J., & The
Proceedings of the 2009 AMIA annual symposium National Commission on Nursing Implementation
(pp. 678–682). Project Task Force on Nursing Information Systems.
Weed, L. L. (1975). Your health care and how you can (1993). Next-generation nursing information sys-
manage it. Burlington: PROMIS Laboratory, tems: Essential characteristics for professional prac-
University of Vermont. tice. Washington, DC: American Nurses Publishing,
Yen, P. Y., Kelley, M., Lopetegui, M., Rosado, A. L., American Nurses Foundation/American Nurses
Migliore, E. M., Chipps, E. M., & Buck, J. (2016). Association.
17
613 18
Contents
References – 635
organizations to implement innovative strate- entire textbook in this series (Magnuson and
gies to improve health and reduce costs, such Dixon 2020).
as providing housing, air conditioners, and/or The Centers for Disease Control and
transportation. Prevention (CDC) has characterized public
In the context of public health, a popula- health informatics as developing and deploy-
tion perspective has always existed. However, ing methods for achieving a public health
now there are new opportunities to leverage goal faster, better, or at lower cost by lever-
data sources not traditionally used by pub- aging computer science, information science,
lic health (e.g., social media) and to use the or technology (Savel and Foldy, 2012). Public
information in EHRs to meet public health health systems are implemented across a wide
goals. For example, an EHR-based clinical range of settings (large and small, urban
decision support system (CDSS) and qual- and rural) with variable infrastructure and
ity monitoring can be used to promote and capabilities, and for a workforce with a wide
monitor public health goals (e.g., improved range of informatics experience and skills and
cancer screening and immunization coverage, access to technical resources and support.
or early detection of health threats such as Given this complex context, we define public
lead in water or an individual exposed to an health informatics as:
infectious disease) among persons accessing
clinical care. »» The systematic application of informatics
methods and tools to support public health
goals and outcomes, regardless of the setting.
.. Table 18.2 Sample of information systems used by the Minnesota State Department of Health,
classified by primary function and key features
Monitoring systems for surveillance of knowledge, attitudes, or health events that may impact health
Vital Statistics System tracking births and Data collection may occur (a) in response to a triggered
deathsa eventa, or (b) using a pre-planned sampling strategyb
After a triggering event occurs, a set of data is transferred
Immunization Information Systema
to a public health agency or an organization acting on the
Cancer Surveillance Systema behalf of public health
Triggered data:
Birth Defects Information Systema Often originates in clinical systems
Sudden Unexplained Infant Death/Sudden Death May be individual events or summarized reports
in Youth Programa May be reported at intervals ranging from near
real-time to hourly, daily, monthly, etc.
Traumatic Brain and Spinal Cord Injury Systema Often, systems must be able to associate the data with
other potentially-related events.
Communicable disease surveillance systemsa:
Infectious Disease Surveillance
Sexually Transmitted Infections (STD/SDI)
Surveillance
AIDS/HIV Surveillance
Drug Overdose and Substance Abuse
Surveillance Systema
Blood Lead Information Systema
All-payer claims databasea
Injury surveillancea
Antibiotic Resistance and Stewardshipa
Animal health surveillancea
Syndromic surveillancea
Breast/Cervical Cancer Screening quality
monitoringb
Environmental monitoring of air & waterb
Surveysb:
Behavioral Risk Factor surveillance system
(BRFSS)
National Health and Nutrition examination
survey (NHANES)
Process Control systems to manage workflow
Public Health Laboratory Information System Internal systems with ‘industry-standard’ functionality
18 Women, Infants and Children (WIC)
May interact with external systems to receive requests and
report out information
Food-service inspection system
Vector control operations management system
Vaccine distribution system
Medical Cannabis Information System
Population and Public Health Informatics
623 18
agencies may or may not have a regulatory zz Informatics Challenges and Opportunities
role, but they illustrate the diversity and com- U.S. Public Health Information Infrastructure
plexity of public health at the federal level. The inadequate national infrastructure to
They all require some level of cross-state detect and respond to public health threats
aggregation of information and typically do and support the management of local, state
not gather person-specific data. and national prevention and control pro-
In the U.S., public health data is processed grams are amplified by the COVID-19 pan-
via distributed information systems with min- demic with its devastating morbidity and
imal aggregation at the federal level. In fact, financial impact. The current infrastructure
it is only reportable conditions such as infec- in the U.S. has a patchwork of diverse infor-
tious diseases, births, and deaths that are uni- mation systems due, in part, to siloed and
formly and relatively completely reported on short-term funding and the U.S. Constitution
a national basis by the CDC. The U.S. lacks a which is silent on the responsibility for pro-
unifying public health information infrastruc- tecting the public’s health, leaving state law
ture. Thus, the U.S. relies on state and local to govern public health actions. Federal
health departments to develop and use inde- leadership can provide overarching structure
pendent information systems to support their and resources, consolidate information, and
public health needs. This presents significant present the rationale for a unified approach.
challenges, as described in the next section. However, each state functions independently
In contrast, France, Great Britain, Denmark, to deal with the multiplicity of response, man-
Norway and Sweden have comprehensive agement and recovery decisions presented by
systems in selected areas, such as occupa-
health threats.
tional injuries, infectious diseases, and cancer. The grand challenge for public health
No country, however, has complete reporting is to create and maintain a unified public
for every problem, but many are often able health information infrastructure based on
to deliver timely answers to important public informatics principles that supports multi-
health questions by having information on the jurisdictional (local, state, federal) and cross-
entire population. jurisdictional needs, and seamlessly interacts
624 M. LaVenture et al.
with other relevant information systems such allowing for 2-way communication between
as those containing clinical, laboratory or public health and clinical systems. Closing the
environmental data (see 7 Chap. 15). information loop with providers by enabling
High level informatics characteristics and data aggregated by public health to be used
considerations for such a public health infor- for clinical decision making is a long-time
mation infrastructure include the ability to: goal that is becoming more achievable over
55 Support both routine and emergent public time.
health functions using the same The development of sophisticated knowl-
infrastructure, avoiding the development edge management and decision support meth-
of systems that compete for resources in ods represents a growing opportunity to more
the context of an outbreak. effectively use public health resources and
55 Rapidly scale existing surveillance systems EHR data. For example, there are an increas-
to make local, statewide, and national data ing number of public health guidelines that,
actionable in a timely manner to meet if structured and encoded as standardized
demands at each level in a crisis. digital algorithms, can be widely distributed
55 Provide data and information in sufficient to provide CDS at the point of care through
granularity to meet the needs of local, EHRs (e.g., laboratory testing criteria for
state and national public health needs. coronavirus). Similarly, there are advances in
55 Rapidly add new information system harmonizing and standardizing case defini-
capability (e.g., contact tracing for COVID- tions, reporting logic, and patient summaries
19 control) in the context of an outbreak. that are relevant for identifying conditions of
55 Routinely onboard, validate, and use new interest to be reported from an EHR to a pub-
electronic data sources lic health agency (i.e., to support electronic
55 Ensure high quality, timely, complete and case reporting or death certificate records
accurate data. reporting). These knowledge resources are a
55 Incubate innovations in technology and necessary component for automating event
foster workforce capacity detection and the generation of electronic case
55 Operate with leadership and governance reports but are not sufficient. Ongoing efforts
that is independent from jurisdictional are focused on improving data and exchange
boundaries and with authority to define standards, testing and evaluating various
requirements and engage public and implementation strategies, and increasing col-
private partners. laboration between the multiple stakeholders
55 Educate leaders and the public regarding involved (e.g., the vendor, clinical, and public
the purpose of the infrastructure and the health communities). Eventually, successful
essential role of science to inform public implementation requires 2-way communica-
policy. tion with providers, which is particularly rel-
evant for case reporting where it is important
Emerging Innovations and Opportunities for providers to be promptly alerted about
Advances in informatics methods and unusual disease or risk factors in their local
tools, CDSS and health interoperability area, such as disease clusters, environmental
standards, and the increasing availability hazards, and antibiotic resistance patterns.
18 of clinical and other novel sources of data
(e.g., individual genomic sequences, patient-
generated health data from wearable devices), 18.4.2 Immunization Information
are all impacting the way business is done at Systems: A Public Health
public health agencies, leading to new oppor- Informatics Exemplar
tunities. Public health data collection systems
are a potential gold mine for applying novel Immunization Information Systems (IIS), also
analytic or visualization tools, and advances known as immunization registries, are confi-
in health data standards are improving oppor- dential, computerized, population-based sys-
tunities to exchange data and knowledge, tems that collect and consolidate vaccination
Population and Public Health Informatics
625 18
data from vaccine providers and offer tools and messaging standards, with online secure
for designing and sustaining effective immu- access to patient immunization records 24/7,
nization strategies at the provider and pro- (b) providing vaccine forecasting/decision
gram levels (CDC 2013). In the U.S., an IIS is support based on a patient’s consolidated
operating in almost every state and can share and de-duplicated immunization history, (c)
data with other IIS using national standards. supporting vaccine inventory management
IIS are a critical resource for rapidly identify- and vaccine ordering, producing official
ing and reporting gaps in immunization cov- immunization records for school and other
erage, such as the precipitous decline in child institutional enrollment, (d) generating immu-
vaccinations observed as the COVID-19 pan- nization coverage reports for an individual
demic emerged in spring 2020 and well-child provider, clinical practice or jurisdiction, and
visits transitioned to telemedicine video con- (e) supporting the national Vaccine Adverse
ferences. Administration of immunizations Event Reporting System (VAERS).
require an in-person visit. Using IIS data from
Michigan, public health authorities were able zz History, Context and Success of IIS
to report that “vaccination coverage declined Childhood immunizations are among the most
in all milestone age cohorts, except for birth- successful public health interventions, result-
dose hepatitis B coverage, which is typically ing in the near elimination of nine vaccine pre-
administered in the hospital setting. Among ventable diseases that historically extracted a
children aged 5 months, up-to-date status major human toll in terms of both morbidity
for all recommended vaccines declined from and mortality (IOM 2000). The need for IIS
approximately two thirds of children dur- stems from the challenge of assuring complete
ing 2016–2019 (66.6%, 67.4%, 67.3%, 67.9%, immunization protection for the approximately
respectively) to fewer than half (49.7%) in 10,388 children born each day in the U.S. in the
May 2020” (Bramer et al. 2020). context of three complicating factors: the scat-
The success of IIS in the U.S. is a com- tering of immunization records among mul-
pelling story that illustrates the principles tiple providers given that a complete history is
of public health informatics and describes essential to providing an accurate forecast of
the multi-decade efforts that were needed to vaccines needed at a visit; new vaccines, vac-
implement large-scale population-level IIS cine combinations. and antigen formulations
in each state and link them into a reliable regularly made available leading to a schedule
national network. The story highlights how that is increasingly complex as the number of
informatics capabilities in leadership, collab- vaccines has increased to over 25 doses rec-
oration, and collective problem solving have ommended by age 6; and the conundrum that
been critical to overall success. In addition to the very success of mass immunization has
their orientation to prevention, IIS can func- reduced the incidence of disease, lulling par-
tion properly only through continuing inter- ents and providers into a sense of complacency
action with the health care system; in fact, where potential vaccine side effects often get
they were designed to optimize connections more attention than the diseases.
for use in the clinical setting. Although IIS The IIS history from the 1990s is remark-
are among the largest and most complex pub- able for a number of key success factors includ-
lic health information systems, the successful ing the committed leadership and shared
implementations and extensive interoperabil- vision. It is a story of a long slow discovery
ity in 49 states show conclusively that it has of challenges and the successful collaboration
been possible over time to overcome the chal- of many stakeholders to overcome those chal-
lenging informatics problems they present. lenges over several decades. Understanding
The major functions of IIS include (a) the lessons learned is foundational to success-
the ability to accept immunization records ful implementation of similar large-scale pub-
electronically using a variety of file formats lic health information systems in the future.
626 M. LaVenture et al.
.. Fig. 18.2 Percentage point differences between National Immunization Survey (NIS) and IIS for combined
7-vaccine series* completion — IIS Annual Report, United States, 2017* (Source: CDC 2018)
630 M. LaVenture et al.
7 Taskforce for Global Health. (2018). The global tra- Public health informatics is the systematic
choma mapping project: determining prevalence to application of informatics methods and tools
help eliminate trachoma by 2020. Retrieval 10/20/18: to support public health goals and outcomes,
7 https://www.taskforce.org/case-study/global-tra- regardless of the setting. Effective public
choma-mapping-project-determining-prevalence-
health information systems and communi-
help-eliminate-trachoma-2020
Population and Public Health Informatics
633 18
cation between clinical and other systems mation to inform increasingly complex pub-
can help to assure prevention actions, timely lic health decisions, and the growing costs of
monitoring of disease patterns, and rapid operating aging public health information sys-
responses to epidemics, thereby saving lives tems (Brand et al. 2018a). Information inno-
and money. vation to address growing needs requires an
Public health information and the devel- agency-wide informatics-savvy organizational
opment of health information infrastructure approach (LaVenture et al. 2014c, 2014d;
(HII) (see 7 Chap. 15) are closely related. 2017c, 2017d) and an appreciation that there
Public health informatics supports the popu- are key stages of innovation to ensure the suc-
lation assessment, assurance and policy devel- cessful integration of research in the public
opment roles of public health. In contrast, health practice space (Xu et al. 2011).
HIIs focus on medical care to individuals while Public health systems frequently
also connecting providers and patients within involve non-health organizations such as
a population. Ideally, these two areas work law enforcement and parks and recreation
together supporting both community health departments. Thus, public health informati-
assessment and individual care. In the past, cians must adopt methodologies that bridge
public health and health care have not tradi- professional and organizational divides, such
tionally interacted as closely as they should. as the Public Health Informatics Institute’s
Both domains focus on the health of commu- Collaborative Requirements Methodology
nities—public health does this directly, while (PHII 2011).
the medical care system does it one patient Despite the focus of many current public
at a time. However, it is now clear that medi- health informatics activities on population-
cal care must also focus on the community based extensions of the medical care system
to integrate the effective delivery of services (leading to the orientation of this chapter),
across all care settings for all individuals (IOM applications beyond this scope are possible,
2011; Sittig and Singh 2020). An effective HII desirable, and many innovative strategies and
could allow many public health information applications are under development or in
needs currently met through independently use. Indeed, the phenomenal contributions
operated and maintained systems to be more to health made by the hygienic movement of
efficiently addressed via periodic HII queries the nineteenth and early twentieth centuries
(e.g., to assess relationships between various suggest the power of large-scale environmen-
diseases, conditions, treatments, and possible tal, legislative, and social changes to promote
risk factors) or through automatic real-time human health (Rosen 1993). The effective
reporting of relevant information from the application of informatics to populations
HII to public health (e.g., to support surveil- through public health is a key challenge of the
lance and control of COVID-19). twenty-first century. It is a challenge we must
Successful public health and population accept, understand, and overcome if we want
health informatics requires an informatics- to create an efficient and effective health care
savvy organization that has a clear vision, system as well as truly healthy communities
strategy, and governance for information for all.
management and use; a workforce skilled in
using information and information technolo- nnSuggested Readings
gies; and well-designed and effectively used American Immunization Registry Association
information systems (Baker et al. 2016). The (AIRA). (2018). Fundamentals of
information imperative is urgently driven by IIS. Retrieval Oct 24, 2018 http://repository.
the increasing digitization of data coming immregistries.org/resource/fundamentals-of-
into health departments from an increasing iis/from/major-iis-topics/best-practices-2/. A
number of sources, the need for timely infor- summary of fundamental understanding of
634 M. LaVenture et al.
three essential IIS topics. Staff, Data Quality, Koo, D., O’Carroll, P. W., & LaVenture, M. (2001).
Interoperability and HL7 basics.. Public health 101 for informaticians. Journal of
Brand, B., LaVenture, M., Lipshutz, J., Stevens, the American Medical Informatics Association,
W. F., & Baker, E. L. (2018). The information 8(6), 585–597. An accessible document that
imperative for public health: A call to action introduces public health thinking and
to become informatics-savvy. Journal of Public approach.
Health Management and Practice: JPHMP, 24 LaVenture, M., Brand, B., & Baker, E. L. (2014a).
(6), 586–589. A summary of reasons for the Developing an informatics-savvy health
urgent need for informatics and some strate- department: I: Vision and core strategies.
gies for moving organizations forward. Journal of puBlic Health Management and
Centers for Disease Control and Prevention Practice: JPHMP, 20 (6), 667–669.
(CDC). (2013). Progress in immunization LaVenture, M., Brand, B., & Baker, E. L. (2014b).
information systems – United States, 2011. Developing an informatics-savvy health
MMWR Morb Mortal Wkly Rep. 62(3), 48–51. department: II: Operations and tactics.
Centers for Disease Prevention and Control. Journal of Public Health Management and
(1999). Ten great public health Practice: JPHMP, 21(1), 96–99.
achievements – United States, 1900–1999. LaVenture, M., Brand, B., & Baker, E. L. (2017a).
MMWR; 48(12);241–243. Developing an informatics-savvy health
department: from discrete projects to a coordi-
Centers for Disease Control and Prevention.
nating program. Part I: Assessment and gover-
(2018). IIS functional standards, v4.0 Atlanta,
nance. Journal of Public Health Management
GA: Centers for Disease Control and
and Practice: JPHMP, 23(6), 325–327.
Prevention.
LaVenture, M., Brand, B., Ross, D. A., & Baker,
Centers for Disease Control and Prevention.
E. L. (2017b). Developing an informatics-
(2012). CDC’s Vision for Public Health
savvy health department: from discrete proj-
Surveillance in the 21st Century. MMWR
ects to a coordinating program. Part II:
2012;61(Suppl; July 27, 2012).
Creating a skilled workforce. Journal of Public
Department of Health and Human Services.
Health Management and Practice: JPHMP,
(1994). Essential public health functions.
23(6), 638–640.
Public Health in America. Washington DC:
Magnuson, J. A., & Dixon, B. E. (Eds.). (2020).
Department of Health and Human Services.
Public health informatics and informatics sys-
Hinman, A. R., & Ross, D. A. (2010). Immunization tems. New York: Springer. A comprehensive
registries can be building blocks for national textbook.
health information systems. Health Affairs, Overhage, M., Grannis, S., & McDonald, C. J.
29(4), 676–682. Describes the components of (2008). A comparison of the completeness
IIS and how they are also key elements in the and timeliness of automated electronic labo-
national health infrastructure. ratory reporting and spontaneous reporting
Infectious Diseases Society of America (ISDS). of notifiable conditions. American Journal of
Stories of Surveillance in Action. Retrieval Public Health, 98(2), 344–350.This study doc-
October, 25th, 2018 https://www. uments the improved quality and timeliness of
surveillancerepository.org/search/success-sto- electronic lab reporting of diseases to public
18 ries and https://www.healthsurveillance.org/ health.
default.aspx Contains many practical exam- Public Health Informatics Institute. (2006). The
ples on how public health surveillance is used Unique Records Portfolio. Decatur, GA:
to support clinical care and community health. Public Health Informatics Institute.
International Trachoma Initiative (ITI). (2012). Taskforce for Global Health. (2018). The global
Global Scientific Meeting on Trachomatous trachoma mapping project: determining prev-
Trichiasis (TT). DeCatur, GA: The Task Force alence to help eliminate trachoma by 2020.
for Global Health. Decatur, GA: Taskforce for Global Health.
Population and Public Health Informatics
635 18
Trust for America’s Health. (2020). Impact of interacts with the PHI application? For
Chronic Underfunding on America’s Public example, could the information technol-
Health System: Trends, Risks, and ogy underlying anti-lock braking sys-
Recommendations. Trust for America’s tems be considered a public health
Health. informatics application? Provide other
US Department of Health and Human Services. examples.
(2018). Healthy people 2030 framework. 6. How might cloud computing (shared
Washington, DC: US Department of Health configurable computing resources
and Human Services. Retrieved at https:// including networks, servers, storage,
www.healthypeople.gov/2020/About-Healthy- applications, and services), and mobile
People/Development-Healthy-People-2030/ technology transform public health
Framework (Accessed 23 Oct 2018). This is an informatics?
essential guide to the national public health
goals.
Yasnoff, W. A., O’Carroll, P. W., Koo, D., Linkins, References
R. W., & Kilbourne, E. M. (2000). Public
American Public Health Association (APHA) (2018).
health informatics: Improving and transform- What is Public Health. Washington DC: American
ing public health in the information age. Public Health Association.
Journal of Public Health Management and Baker, E. L., Brand, W., Davidson, A., LaVenture, M.,
Practice, 6(6), 67–75. A concise yet compre- Singletary, V., & Smith, P. (2016). Building the busi-
hensive introduction to the field. ness case for public health information systems.
Journal of Public Health Management and Practice,
22(6), 603–606.
??Questions for Discussion Boom, J. A., Dragsbaek, A. C., & Nelson, C. S. (2007).
1. How might the trend of widespread The success of an immunization information system
adoption of electronic health records in the wake of Hurricane Katrina. Pediatrics, 119(6),
and interest in population health affect 1213–1217.
Bramer, C. A., Kimmins, L. M., Swanson, R., et al.
public health informatics?
(2020). Decline in child vaccination coverage during
2. Compare and contrast the types of data the COVID-19 pandemic — Michigan care improve-
needed and functions required in an ment registry, May 2016–May 2020.
information system for clinical versus MMWR. Morbidity and Mortality Weekly Report,
public health information systems. 69, 630–631.
Brand, B., LaVenture, M., & Baker, E. L. (2018a).
Explain it from non-technical and tech-
Developing an informatics-savvy health depart-
nical perspectives. ment: From discrete projects to a coordinating pro-
3. How can the successful model of immu- gram. Part III: Ensuring well-designed and
nization registries be used in other effectively-used information systems. Journal of
domains of public health (be specific Public Health Management and Practice, 24(6), 181–
184.
about those domains)? How might it fail
Brand, B., LaVenture, M., Lipshutz, J., Stevens, W. F., &
in others? Why? Baker, E. L. (2018b). The information imperative
4. A significant and increasing percentage for public health: A call to action to become
of the U.S. GDP is spent on medical informatics-savvy. Journal of Public Health
care. How could population and public Management and Practice, 24(6), 586–589.
Bunker, J. P., Frazier, H. S., & Mosteller, F. (1994).
health informatics help to use those
Improving health: Measuring effects of medical
monies more efficiently? Or lower the care. Milbank Quarterly, 72, 225–258.
figure absolutely? Department of Health and Human Services. (1994).
5. If public health informatics (PHI) Essential Public Health Services: Public Health in
involves the application of information America. Washington DC: Department of Health
and Human Services.
technology in any manner that improves
Dwyer-Lindgren, L., Stubbs, R. W., Bertozzi-Villa, A.,
or promotes human health, does this Morozoff, C., Callender, C., Finegold, S. B., et al.
necessarily involve a human “user” that (2017). Variation in life expectancy and mortality by
636 M. LaVenture et al.
cause among neighbourhoods in King County, WA, Journal of Public Health Management and Practice,
USA, 1990–2014: A census tract-level analysis for 23(6), 638–640.
the Global Burden of Disease Study 2015. The Magnuson, J. A., & Dixon, B. E. (Eds.). (2020). Public
Lancet Public Health, 2(9), e400–e410. health informatics and informatics systems.
Friede, A., McDonald, M. C., & Blum, H. (1995). Public New York: Springer.
health informatics: How information-age technol- Overhage, J. M., Grannis, S., & McDonald, C. J. (2008).
ogy can strengthen public health. Annual Review of A comparison of the completeness and timeliness
Public Health, 16, 239–252. of automated electronic laboratory reporting and
Friedman, T. (2007). The world is flat 3.0: A brief history spontaneous reporting of notifiable conditions.
of the twenty-first century. New York: Picador/ American Journal of Public Health, 98(2), 344–350.
Farrar, Straus and Giroux. Public Health Informatics Institute. (2011). Collaborative
Guide to Community Preventive Services. (2015). requirements development methodology. Public
Community Preventive Services Task Force Health Informatics Institute.
Recommendation for Use of Immunization Rosen, G. (1993). History of public health. Baltimore:
Information Systems to Increase Vaccination Rates. Johns Hopkins University Press.
Journal of Public Health Management and Practice: Savel, T. G., & Foldy, S. (2012). The role of public health
May/June 2015 - Volume 21 - Issue 3 - p 249–252. informatics in enhancing public health surveillance.
Institute of Medicine. (1988). The future of public health. MMWR. Morbidity and Mortality Weekly Report,
Washington, DC: National Academy Press. Supplement, 61(3), 20–24.
Institute of Medicine. (2000). Calling the shots: Schroeder, S. A. (2007). We can do better – Improving
Immunization finance polices and practices. the health of the American people. New England
Washington, DC: National Academy Press. Journal of Medicine, 357, 1221–1228.
Institute of Medicine. (2011). Engineering a learning Seth, P., Scholl, L., Rudd, R. A., & Bacon, S. (2018).
healthcare system: A look at the future. Washington, Overdose deaths involving opioids, cocaine, and
DC: National Academies Press. psychostimulants United States, 2015–2016.
Kharrazi, H., Lasser, E. C., Yasnoff, W. A., et al. (2017). MMWR, 67, 349–358.
A proposed national research and development Sittig, D. F., & Singh, H. (2020). COVID-19 and the
agenda for population health informatics: Summary need for a national health information technology
recommendations from a national expert workshop. infrastructure. JAMA, 323(23), 2373–2374. https://
Journal of the American Medical Informatics doi.org/10.1001/jama.2020.7239.
Association, 24(1), 2–12. Stiefel, M., & Nolan, K. A. (2012). Guide to measuring
Koplan JP, Bond TC, Merson MH, Reddy KS, the triple aim: Population health, experience of care,
Rodriguez MH, Sewankambo NK, et al. Towards a and per capita cost. IHI innovation series white paper.
common definition of global health. Lancet. Cambridge: Institute for Healthcare Improvement.
2016;373:1993–5. Stoto, M. (2013). Population Health in the Affordable
Last, J. M. (Ed.). (2001). Dictionary of epidemiology (4th Care Act Era. Washington DC: Academy Health.
ed., p. 61). New York: Oxford University Press. Xu, W., Pettey, W., Livnat, Y., Gesteland, P., Rajeev, D.,
LaVenture, M., Brand, B., & Baker, E. L. (2014c). Reid, J., Samore, M., Evans, R. S., Rolfs, R. T., &
Developing an informatics-savvy health depart- Staes, C. (2011). Strengthening partnerships along
ment: I: Vision and core strategies. Journal of Public the informatics innovation stages and spaces:
Health Management and Practice, 20(6), 667–669. Research and practice collaboration in Utah. Online
LaVenture, M., Brand, B., & Baker, E. L. (2014d). Journal of Public Health Informatics, 3(3).
Developing an informatics-savvy health depart- Yasnoff, W. A., O’Carroll, P. W., Koo, D., Linkins,
ment: II: Operations and tactics. Journal of Public R. W., & Kilbourne, E. M. (2000). Public health
Health Management and Practice, 21(1), 96–99. informatics: Improving and transforming public
LaVenture, M., Brand, B., & Baker, E. L. (2017c). health in the information age. Journal of Public
Developing an informatics-savvy health depart- Health Management and Practice, 6(6), 67–75.
ment: From discrete projects to a coordinating pro- Yasnoff, W. A., & Miller, P. L. (2014). Decision support
gram. Part I: Assessment and governance. Journal and expert systems in public health. In J. A.
18 of Public Health Management and Practice, 23(6),
325–327.
Magnuson & P. C. Fu (Eds.), Public health informat-
ics and information systems (2nd ed., pp. 449–467).
LaVenture, M., Brand, B., Ross, D. A., & Baker, E. L. New York: Springer.
(2017d). Developing an informatics-savvy health
department: From discrete projects to a coordinat-
ing program. Part II: Creating a skilled workforce.
637 19
Contents
References – 661
3 Statista. 2019. Projected size of the global market 4 Khalaf S, Kesiraju L. March 2, 2017. U.S. consum-
for wearable devices in the healthcare sector from ers time-spent on mobile crosses 5 hours a day.
2015 to 2021 (in million U.S. dollars) [Data file]. Flurry Analytics Blog. Retrieval June 3, 2019:
Retrieval June 3, 2019: 7 https://www.statista.com/ 7 h t t p s : / / f l u r r y m o b i l e. t u m b l r. c o m /
statistics/607982/healthcare-wearable-device-reve- post/157921590345/us-consumers-time-spent-on-
nue-worldwide-projection/ mobile-crosses-5
640 E. K. Choe et al.
.. Table 19.1 Representative mHealth devices and their key technical specs and supported applications
Personal Palm Pilot Palm OS 2.0, 16 MHz CPU, 512 KB Note taking (patient
Digital Personal (1997) Memory, 160 × 160 pixel monochrome progress notes), calculation,
Assistant display, touchscreen LCD, speaker, serial to-do list, calendar, reference
(PDA) hotsync port, desktop cradle materials (ePocrates,
5-Minute Clinical Consult)
Cellular Nokia 5500 Symbian OS 9.1, 235 MHz CPU, 8 MB Sports (stopwatch, steps,
Phone Sport Memory, 1.7″ (208 × 208 pixel) display, calories burn) tracking,
(2006) camera, microphone, speaker, GSM multimedia messaging,
network, Bluetooth connectivity, email, text to speech
accelerometer,
Smartphone Samsung Android OS 9.0, Octa-core CPU, 8GB Multiple health and fitness
Galaxy S10 Memory, 128GB Storage, applications via Google
(2019) 6.1″ Quad HD+ (3040 × 1440 pixel) Play Store
display, triple camera, GSM+CDMA/4G
LTE network, Wi-Fi, Bluetooth
connectivity,
capacitive multi-touch,
stereo speakers, microphone,
accelerometer, barometer, gyro sensor,
geomagnetic sensor, RGB light sensor,
proximity sensor
Tablet Apple iPad 7th 10.2″ Retina display, LED-backlit with Multiple health and fitness
Generation Multi-Touch and IPS technology, applications via App Store
(2019) 2160 × 1620 pixel display, compatible
with Apple Pencil and Keyboard,
Cellular data network, Wi-Fi, Bluetooth
connectivity, cameras, two speaker audio,
fringerprint identity sensor, gyro sensor,
accelerometer, ambient light sensor,
barometer
Smartwatch Fitbit Versa 2 1.34″ OLED display, Wi-Fi, Bluetooth Floors climbed, activity
(2019) connectivity, microphone, Accelerometer, (e.g., walk, run, swim)
optical heart rate monitor, altimeter, tracking, sleep tracking with
vibration motor, SpO2 sensor, NFC, light, deep, and REM,
ambient light sensor personalized reminders,
guided breathing, 24/7 heart
rate tracking, resting heart
rate, calories burn
were designed to assist everyday tasks such as ued in 1998. The Pilot 1000, introduced in
19 note-taking, calculation, and to-do list man- 1996 by Palm Computing, was the first widely
agement. The Newton was one of the first accepted PDA; it offered a 160 × 160 pixels
devices to feature pen-based handwriting rec- monochrome touch-sensitive display, a sepa-
ognition, which eventually became popular in rate area for pen text input (using a special
other handheld devices. Due to the technical script called Graffiti), and sufficient memory
limitations, however, the Newton was not to store reference materials. Over the next
widely adopted and was eventually discontin- decade, the Pilot 1000 was followed by a large
mHealth and Applications
641 19
a b
.. Fig. 19.2 a The Apple Newton MessagePad 2100, php?curid=7039806. b Figure PalmPilot with stylus. By
running Newton OS, alongside the original iPhone Rama & Musée Bolo - Own work, CC BY-SA 2.0 fr,
running iOS. By Blake Patterson from Alexandria, VA, 7 h t t p s : / / c o m m o n s.w i k i m e d i a .o r g / w / i n d e x .
USA - Newton and iPhone: ARM and ARM, CC BY 2.0, php?curid=36959631
7 h t t p s : / / c o m m o n s.w i k i m e d i a .o r g / w / i n d e x .
number of more advanced Palm devices (e.g., and medical education, the authors reported
the PalmPilot Personal, . Fig. 19.2b), as well that 60–70% of medical students and resi-
as Windows Mobile devices (e.g., the Pocket dents used PDAs for educational purposes or
PC), which incorporated color displays, patient care (e.g., patient tracking and docu-
higher resolution screens, and eventually, cel- mentation) (Kho et al. 2006).
lular connectivity. Many medical apps and For researchers, PDAs were a good proto-
mHealth research prototypes were designed typing platform to test mHealth applications.
for these devices. Intille, Kukla, Farzanfar, and Bakr (2003)
The new modes of data entry using pen- used a standard PDA with a barcode scanner
based gestures, paging navigation, and menu plug-in to create an application that helps
palettes were shown to be effective in captur- people compare food items at the time of
ing structured data in healthcare settings. making food purchasing decisions. The appli-
These interface characteristics were effective cation delivered tailored, motivational infor-
in overcoming some of the limitations of mation with an aim to help people make
paper-based data capture, such as high cap- “just-in-time” incremental changes to their
ture burden and low compliance (Lane et al. diet (Intille et al. 2003). Siek et al. (2006) lev-
2006). As a means to ease the data capture eraged PDAs’ voice recording feature, as well
burden, PDA applications were designed for as a barcode scanner, to create a food moni-
doctors to write patient progress notes (Poon toring tool for patients with chronic kidney
et al. 1996), for patients to capture symptoms disease.
(Stratton et al. 1998), and for researchers to Traditionally, PDAs did not support
collect sensitive health data in resource-poor phone services. However, cellular phones,
field sites (e.g., Jaspan et al. 2007). which became popular around the same time
PDAs also served as medical reference as the early PDAs, did by connecting to a
tools for medical students and doctors (e.g., wireless communications network through
with applications such as ePocrates Rx, radio waves or satellite transmissions. Early
5-Minute Clinical Consult, MedMath, cellular phones were equipped with voice and
MedCalc). In a 2006 review paper on PDAs text communication capabilities, allowing
642 E. K. Choe et al.
researchers to design health interventions that runs on a wide range of devices manufactured
deliver health messages (e.g., a text messaging by a variety of companies, whereas Apple’s
smoking cessation program) (Free et al. 2011). iOS runs on Apple’s iPhone devices only.
As with PDAs, cellular phones became more Due to the broad user base and conve-
powerful over time, adding more advanced nient marketplaces (platforms) to distribute
features, such as multimedia messaging (e.g., apps, the iPhone and Android smartphones
sending and receiving images), text to speech, have led to a rapid growth in digital health in
and motion sensors. These features were the past few years. As of 2017, more than
promptly incorporated into health applica- 325,000 health and fitness apps were avail-
tions. For example, the Nokia 5500 Sport was able in the major mobile app stores, with 3.6
the first mobile phone that had a built-in 3D billion downloads.7 As platforms evolve, so
accelerometer, and it came with an applica- do the mHealth apps. For example, Epocrates,
tion that automatically detected running and a medical reference app, has changed over
walking. The phone also included a diary for time from its earlier version for Palm OS
planning and tracking workouts, as well as devices to its recent versions for iOS/Android
enabling users to add workouts that the phone devices.
did not detect automatically. Although 73% of mHealth apps in 2015
were designed for supporting general well-
19.1.2.2 Smartphones and Tablets ness—for example, apps for tracking exercise
IBM and BellSouth’s Simon is largely consid- and diet, and for managing stress, the
ered the first “smartphone” because it fea- mHealth market is shifting toward the sup-
tured a phone with a touchscreen, email and port for managing specific health conditions.
many other capabilities that became available Such applications constituted 40% of
to the public in 1994.5 Simon did not last long, mHealth apps in 2017.8 Some of the key cat-
but it presaged other smartphones, such as the egories of mHealth apps include: symptom
BlackBerry and Windows Mobile.6 Smart checkers, healthcare professional finders,
phones gained popularity in 2007 when Apple managing clinical and financial records,
announced the first generation of the iPhone, health-condition education and manage-
which featured a large capacitive touchscreen, ment, self-monitoring, remote patient moni-
multi-touch interactions (e.g., pinching for toring, rehabilitation programs, and
zooming), and thin slate-like form factor. In prescription filling and compliance (or adher-
the following year, Apple opened the App ence). Based on a review of commercial
Store, a marketplace to distribute applica- mHealth apps, the most prevalent conditions
tions, which became popularly known as that mHealth apps are targeting are diabetes,
“apps,” for the iPhone (and subsequently depression, migraine, asthma, low vision,
other iOS devices). Soon after the iPhone was and hearing loss (Martínez-Pérez et al. 2013).
launched, in 2008, Google released a new In addition, apps for women’s health and
mobile OS platform, Android. The choice of pregnancy, and medication reminders are
the OS determines which phone to choose and now common.
which apps the user can run. Google’s Android
ture in the ubiquitous computing (Amini et al. Android device, and 44.5% were using an Apple
2017; Gouveia et al. 2016) and visualization device.11 The divide in the market share and the
(Blascheck et al. 2019 (see . Fig. 19.4); idiosyncratic characteristics of individual
Brehmer et al. 2018) communities that have phones pose problems for developers and
begun to examine ways to create effective researchers. To create a native app (i.e., an app
feedback on a small screen. More cross-disci- that has been developed for use on a particular
plinary collaboration on this front is needed platform), developers must redouble their
to design and build effective intervention efforts to create and maintain versions for each
components for wearable devices (Lee et al. platform. Researchers, who often lack skills
2018). and resources to build native mobile apps for
Wearables have been popular subjects of multiple mobile platforms, must pick either of
research in healthcare domains due to their the two platforms instead of supporting both,
potential to serve as an intervention in a which can introduce a selection bias in recruit-
variety of contexts. See a systematic map- ment.12 To address such concerns, efforts have
ping study of how “Internet of Things,” been devoted to developing frameworks and
including wearables have been deployed and
evaluated in the medical field (Sadoughi
et al. 2020).
11 Statista. 2019. Subscriber share held by smartphone
operating systems in the United States from 2012 to
2019 [Data file]. Retrieval June 13, 2019: 7 https://
19.1.3 Current Platforms www.statista.com/statistics/266572/market-share-
held-by-smartphone-platforms-in-the-united-states/
19 The smartphone user base is sharply divided 12 The U.S. Mobile App Report. 2014. Retrieval Janu-
between Google’s Android platform and ary 15, 2020: 7 https://www.comscore.com/
Insights/Infographics/iPhone-Users-Earn-Higher-
Apple’s iOS. As of June 2018, 54.1% percent of Income-Engage-More-on-Apps-than-Android-
U.S. smartphone users were using a Google Users?cs_edgescape_cc=US
mHealth and Applications
645 19
platforms to handle device and operating sys- them closely (Chung et al. 2015; Kim et al.
tem compatibility. For example, Apache 2017; West et al. 2016; Zhu et al. 2016).
Cordova13 enables developers to build cross- People can access their data collected
platform mobile apps with HTML5, CSS, and from mHealth apps through downloading a
JavaScript, with extensions that provide access file (e.g., CSV, FIT, GPX, KML, and TCX)
to some hardware features such as camera, from an app, website, or a health platform
GPS, and Bluetooth. Titanium14 is a mobile (e.g., Apple Health), and using application
development environment that allows the cre- programming interfaces (APIs). However, it
ation of native apps across different mobile is typically difficult for lay people to down-
devices and operating systems, while reusing a load and repurpose the data, even if they are
large part of the codes across apps. Such cross- the ones who contributed to collecting them
platform support enables developers of (Kim et al. 2019). Although recent regula-
mHealth technologies to reach potential users tions—such as the European General Data
of either Android or Apple devices. Protection Regulation (GDPR)19—have the
potential to enhance personal data accessibil-
ity, the industry lags behind the reforms.
19.1.4 ata Access & Data
D Data standards are defined as “docu-
Standards mented agreements on representations, for-
mats, and definitions of common data”
Accessing mHealth data could be valuable for (Fenton et al. 2013).20 They support interop-
many stakeholders, including self-trackers erability among heterogeneous systems, and
who want to learn insights about themselves are critical in leveraging patient-generated
(Choe et al. 2014), researchers who want to data in the current mHealth ecosystem.
incorporate mHealth data in their research Among a few existing health platforms in the
(e.g., Althoff et al. 2016; Jakicic et al. 2016; market (e.g., Apple Health, Google Fit, and
Fitabase15), and app developers who want to Samsung Health), Apple Health and its
integrate multiple data sources in a new ser- frameworks (HealthKit and ResearchKit) are
vice (e.g., Exist.io,16 Gyroscope,17 and widely being adopted, and have become the
Instant18). For some patients, accessing their standard interface to fitness and medical
personal health data is a matter of life and devices.21 Apple’s HealthKit supports data
death, as in the case of Type 1 diabetes integration from iOS apps by allowing third
patients, who have been struggling for a long party apps to transfer data to and from the
time to have direct access to their continuous HealthKit. However, data loss could happen
glucose monitor (CGM) data (Kaziunas et al. during the transfer, and HealthKit is not com-
2018). In clinical contexts, doctors can use the patible with Android apps, failing to accom-
patients’ data collected outside the hospital to modate more than half of the smartphone
diagnose patients accurately and monitor users. As in the case of the Open mHealth ini-
13 Apache Cordova. 2015. The Apache Software 19 European Union General Data Protection Regula-
Foundation. Retrieval June 13, 2019: 7 https://cor- tion. Retrieval September 9, 2019: 7 https://
dova.apache.org/ ec.europa.eu/commission/priorities/justice-and-fun-
14 Titanium Mobile Development Environment. 2017. damental-rights/data-protection/2018-reform-
Axway. Retrieval June 13, 2019: 7 https://www. eu-data-protection-rules_en
appcelerator.com/Titanium/ 20 Data Standards. 2019. Public Health Data Stan-
15 Fitabase. 2019. Retrieval June 11, 2019: 7 https:// dards Consortium. Retrieval June 13, 2019:
www.fitabase.com/ 7 h t t p : / / w w w. p h d s c . o rg / s t a n d a rd s / h e a l t h -
16 Exist. 2019. Hello Code. Retrieval June 11, 2019: information/d_standards.asp
7 https://exist.io/ 21 Apple is going after the healthcare industry, starting
17 Gyroscope. 2019. Gyroscope Innovations, Inc. with personal health data. January 8, 2019. CB
Retrieval June 11, 2019: 7 https://gyrosco.pe/ Insights. Retrieval June 13, 2019: 7 https://www.
18 Instant. 2015. Retrieval June 11, 2019: 7 https:// cbinsights.com/research/apple-healthcare-strategy-
instantapp.today apps/
646 E. K. Choe et al.
tiative22 and Human API,23 some efforts have enabling collection of rich, longitudinal data
been made in creating standard for personal that can be used for assessment and treat-
health data schema. However, the uptake of ment. When self-report is needed, mobile
the Open mHealth initiative and Human API tools offer ways to collect self-report data in
has been slow, and no incentives encourage context and with low burden, greatly increas-
companies to follow the standardization just ing the ecological validity of user-provided
yet. Health platform companies and health information.
app developers need to make a concerted
effort to establish standards for personal data
schemas to support interoperability and to 19.2.1 Sensing to Collect Data
prevent data loss.
On the clinical informatics side, app devel- A key feature of many mHealth applications
opers have been building 3rd party clinical is that they make use of information gener-
apps—some of which are mHealth apps— ated via sensors contained in mobile phones
that can access patients data directly from or worn on the body. Modern mobile phones
EHR by leveraging SMART on FHIR,24 an contain a large number of sensors, including
open, standards-based platform for medical GPS, accelerometers, gyroscopes, cameras,
apps. Through SMART on FHIR, healthcare and microphones. In recent years, the use of
organizations can plug third-party medical on-body sensors that continuously monitor
apps into their EHR that use those standard individuals’ activities and states has exploded.
data types, which poses immense opportuni- Examples include wrist-worn fitness trackers
ties for clinicians and patients to use EHR as well as a large variety of specialty sensors
data for diverse purposes (e.g., patient engage- such as those that detect galvanic skin
ment, disease management, and research) on response, oxygen saturation, blood glucose,
mobile devices. heart rate variability, core body temperature,
and blood pressure.
The data from such on-body sensors are
19.2 ey Features of mHealth
K typically transmitted to a mobile phone,
Technologies where, along with data collected on the phone
itself, it is used to provide feedback to users or
A great deal of utility of mobile technology is made available to third-party applications.
for health applications comes from the ways Sensor data are used for three basic purposes:
in which these tools can collect data about (1) for assessing physiological processes, such
individuals’ health status, behavior, and con- as resting heart rate or blood glucose; (2) for
text. Unique to mobile technology is its ability inferring individual’s activities, such as physi-
to collect data passively via various types of cal activity, eating, and sleep; and (3) for infer-
sensors embedded in mobile phones and ring context, such as location and social
wearable devices. With the number of mobile environment.
sensors and the quality of the inference
19.2.1.1 Assessing Physiological
increasing at a rapid rate, mobile health tools
are able to detect a broad range of behaviors
Processes
and states continuously and in the back- An important use of sensing in mHealth tech-
ground, with minimal user interaction, nologies is to assess physiological processes
and states that are important for supporting
19 health management. Such sensing is usually
done through devices that users wear on their
22 Open mHealth. 2015. The Tides Center. Retrieval bodies (e.g., bands worn on the wrist, or
June 13, 2019: 7 http://www.openmhealth.org/ instrumented adhesive patches worn on the
23 Human API. 2019. Retrieval June 13, 2019:
7 https://www.humanapi.co/
torso) or even through their phone. Regardless
24 SMART Health IT. Retrieval January 15, 2020: of the sensor form factor, physiological data
7 https://smarthealthit.org/ obtained through mobile sensing are typically
mHealth and Applications
647 19
transmitted to a mobile phone via a low- metrics beyond heart rate—including respira-
power radio (e.g., Bluetooth), where it can be tion, body temperature, and electrodermal
used to drive intervention delivery through an activity—enabling, among other things, more
mHealth app or uploaded to remote servers robust assessment of stress (e.g., Moodmetric
for monitoring by the healthcare team. and Spire).
The most common type of sensors used to In addition to consumer-oriented com-
monitor physiological processes are wrist- mercial devices, physiological sensing using
worn wearables. The exact set of sensors— wearable sensors is a thriving research area.
and, thus, what they are able to detect—varies The current crop of commercial devices
by device, but the main physiological sensor described above are based on decades of sens-
found in such devices is the optical heart-rate ing research, with the research on optical
sensor, which uses green and orange light pulse oximetry dating back to the 1980s (see
emitting diodes (LEDs) and a photodetector Alexander et al. 1989 for a brief review), and
to detect the pulse waveform (Alexander et al. stress detection drawing on ten years of work
1989). Recently, heart rate sensors have begun on multimodal sensing (Ertin et al. 2011;
to add pulse oximetry functionality as well, by Hovsepian et al. 2015). More recently, two key
incorporating a red LED, and, as of 2018, topics in mobile physiological sensing research
these more advanced heart rate sensors are have been cuff-less, continuous measurement
beginning to show up even in devices in the of blood pressure and noninvasive measure-
$150 range, such as Fitbit Charge 3 and ment of blood glucose.
Garmin Vivosmart 4, which represent a large Over the last few years, there has been a
segment of the wearables market. quickly growing body of research aimed at
The chief physiological metrics sensed by developing unobtrusive methods for blood
heart rate sensors are momentary heart rate pressure (BP) measurement. Much of this
and resting heart rate. Combining momen- work attempts to repurpose optical heart rate
tary heart rate data with physical activity data sensors already present in the wrist-based
(sensed via an accelerometer), wrist-worn activity trackers to estimate blood pressure
devices also try to estimate energy expendi- from photoplethysmography (PPG) measure-
ture, although the quality of these calculations ments. Recent work (Zhang and Feng 2017;
is variable (see Consolvo et al. 2014 for exam- Patil et al. 2017) has shown that machine
ples of design implications of this variability learning techniques can be used to estimate
in inference). Certain wristbands—notably, both systolic and diastolic BP from PPG sig-
recent devices made by Garmin—attempt to nals with around 90% accuracy. Similarly,
characterize users’ stress levels, which can be Carek et al. (2017) have used the accelerome-
estimated from the inter-beat interval data ter and optical heart rate sensor in a smart
obtained from optical heart rate sensors watch to estimate BP from the pulse transit
(Hovsepian et al. 2015). Finally, wrist-based time, with similar accuracy. Although these
devices with pulse oximeters are able to detect accuracy rates are still too low for widespread
blood oxygen saturation, although the current clinical applications, this line of work is
generation of devices do not provide continu- quickly developing, and accurate continuous
ous monitoring of this metric. BP monitoring is likely not too distant.
Although wristbands are currently the most The other key target for physiological
common form factor for commercial wearable sensing in the context of mHealth has been
sensors, the range of physiological data that blood glucose measurement. Traditionally,
can be sensed unobtrusively at the wrist is lim- blood glucose could only be measured from
ited. As such, several other form factors exist, blood samples, typically obtained from a fin-
including smart rings (e.g., Moodmedtric and ger prick or, in the case of continuous glucose
Motiv), instrumented clothing (e.g., Hexoskin, monitors, from a needle semi-permanently
OMsignal, and Skiin), and adhesive tags that inserted into the subcutaneous tissue. In recent
can be attached to clothing (Spire). Such years, both the finger prick-based glucose
devices are able to detect other physiological monitors and CMGs have started to incorpo-
648 E. K. Choe et al.
emergency rooms), and to provide interven- can be used to characterize that person’s
tion content that is appropriate to the user’s health-related needs and provide timely sup-
current context. For instance, HeartSteps, an port to address those needs. However, not
mHealth physical activity intervention, sends everything can be sensed, and a range of
user’s activity suggestions that are tailored to important metrics needed to monitor patients’
their location, weather, time of day, and day health (e.g., pain levels and medication side
of the week (Klasnja et al. 2018). effects) and support self-management require
Similarly, MyBehavior (Rabbi et al. 2015), self-report data. For these cases, mobile
another physical activity intervention, tracks devices enable self-report that is precise,
users’ daily movements and then, based on the timely, and minimally burdensome. Collecting
current location, provides recommendations and leveraging in-situ self-report data using
for walking routes a user can take to increase mobile devices has recently gained tremen-
their steps, balancing the user’s step goal with dous interest among researchers in multiple
the suggested activity’s feasibility (e.g., how fields, such as ubiquitous computing, behav-
much time it would take). In the addiction ioral sciences, psychology, public health, and
space, location is often used to determine machine learning.
high-risk situations (e.g., proximity to a bar)
to provide just-in-time support, such as cop- 19.2.2.1 I n-Situ Data Collection
ing strategies (e.g., see Gustafson et al. 2014). Methods
Such just-in-time adaptive interventions Research methods such as diary study and
(JITAIs), enabled by knowledge of the indi- ecological momentary assessment (EMA)
vidual’s current context and activity, hold (Shiffman et al. 2008; Stone et al. 2007) help
great promise for achieving goals of precision researchers understand details about a per-
behavioral health by providing support only son’s context, intentions, and actions that tra-
when it’s most likely to be effective and when ditional research methods (e.g., interview,
individuals are receptive to it (Nahum-Shani survey, and system log analysis) cannot reveal.
et al. 2015; Spruijt-Metz and Nilsen 2014). Diary study and EMA share the same pur-
Other contextual variables that can be pas- pose of collecting data in people’s natural
sively sensed by current smartphones include environment to maximize ecological validity.
weather (by lookup via the user’s location), In diary study, people capture self-report
the user’s calendar (e.g., free/busy blocks), data—usually once a day—using either pen
ambient noise levels, and proximity to other and paper, mobile diary apps, or online sur-
people. A number of research efforts (Kanhere vey. Some diaries are structured (e.g., sleep
2011; Devarakonda et al. 2013) have exam- diary for Cognitive Behavioral Therapy for
ined incorporation of chemical sensors into Insomnia) whereas others are less structured
the phone that would allow sensing of pollu- (e.g., free-form note). Although diary study
tion levels. Although such pollution levels can helps researchers collect in-situ data relatively
currently be approximated by looking up pol- easily, it is subject to low adherence (if par-
lution data based on the user’s location, sens- ticipants forget to fill it out) and recall bias (if
ing on the phone could enable more timely and there is a large time interval between when an
potentially more effective just-in-time support event happened and when the event was cap-
for individuals with asthma and other respira- tured). More recently, researchers increasingly
tory problems. use EMA, which refers to frequent, brief col-
19 lection of self-report data about the person’s
current situation and experiences. As a data
19.2.2 Collecting Self-Reported collection method, EMA is intended to
Data decrease recall biases and memory limitations
inherent in retrospective self-report, while—
Across physiological processes, user activities, by collecting self-report frequently (often 5 to
and context, mobile devices are increasingly 8 times a day)—enabling researchers to under-
able to detect information about a person that stand how behavior, psychological processes,
mHealth and Applications
651 19
and individuals’ experiences change over time current offering is PACO,25 a cross-platform
and are influenced by time-varying factors (Android and iOS) system built by the Google
such as the person’s physical environment. engineer Bob Evans and his colleagues. Like
MyExperience, PACO supports construction
19.2.2.2 obile Research Platforms
M of multiple surveys and schedules, a broad
for In-Situ Data Collection range of question types, and sophisticated
Mobile devices are particularly well suited for questionnaire triggering. All configuration of
supporting diary and EMA studies in multi- PACO questionnaires can be done through a
ple ways. Researchers and commercial com- web interface without any programming, and
panies have developed a number of software its website provides study management func-
platforms for collecting in-situ data that tionality, including enrolling participants,
enable researchers to create data-collection monitoring their adherence, and downloading
schedules with no or minimal programming. response data in standard formats for
These tools typically support to the collection statistical analyses. To further extend people’s
of self-report data, passive logging of smart- data capture capability, OmniTrack enables
phone sensors (e.g., GPS, bluetooth, device people to create an Android native tracking
status, and activity), or some combination of app without programming (Kim et al. 2017).
both (semi-automated tracking) (Choe et al. By combining various data fields and
2017). integrating external data services (e.g., Fitbit)
One of the early projects of this kind, into a single tracking app, people can create a
MyExperience (Froehlich et al. 2007) ran on customized tracking app and configure the
Windows Mobile device, and it enabled app to be used as a general diary app or an
researchers to construct sophisticated EMA EMA tool. Researchers can design and deploy
surveys and schedules by writing a single con- an OmniTrack app using OmniTrack for
figuration file in XML. MyExperience surveys Research (see . Fig. 19.5), which handles app
could collect traditional questionnaire-style deployment, updates, and participant
data (i.e., multiple-choice questions, text monitoring.26 The ability to construct data
responses, etc.), as well as use the phone’s cam- capture instruments and conduct studies from
era and microphone to collect rich multimedia the web with no programming greatly increases
data. They also allowed complex survey sched- opportunities for researchers outside of
ules where different questionnaires were deliv- technical disciplines to integrate rich in-situ
ered to participants at different times and data into their research projects.
based on different triggering conditions. Many of the EMA platforms support not
Many similar systems have since been just traditional time-based prompting where
developed for different mobile platforms, and questionnaires are triggered based on time,
researchers can now choose among both but also event-based prompting, where ques-
commercial solutions where survey tionnaires are triggered when the phone
configuration is done by a commercial detects that a particular kind of event has
company that has developed a proprietary occurred. To do so, EMA systems use sensors
EMA platform (e.g., Life Data Corp and in the phone and connected devices to monitor
Ilumivu) and open-source platforms that users’ state and behavior, and then trigger a
researchers can configure and deploy questionnaire when a particular set of
themselves (e.g., Momento (Carter et al. conditions are met. The sensors and types of
2007), MyExperience (Froehlich et al. 2007),
AWARE (Ferreira et al. 2015), Jeeves (Rough
and Quigley 2015), PACO (Evans 2016),
Sensus (Xiong et al. 2016), Extrasensory App 25 PACO: The Personal Analytics Companion.
Retrieval June 13, 2019: 7 https://pacoapp.com/
(Vaizman et al. 2018), OmniTrack (Kim et al. 26 Kim YH, Lee B, Choe EK, Seo J. 2019. OmniTrack
2017), and TEMPEST (Batalas et al. 2018)). for Research. GitHub. Retrieval June 13, 2019:
Among the latter group, the most mature 7 https://omnitrack.github.io/research/
652 E. K. Choe et al.
.. Fig. 19.5 OmniTrack for Research. Researchers can use the graphical user interfaces to create mobile data col-
lection apps equipped with both manual and automated data collection functionalities
events that can be used for triggering vary contact with a recovery coach. The ability of
from platform to platform, but it is common EMA systems to trigger a request for self-
to be able to define events based on location report at times of high risk (e.g., when a per-
(e.g., a questionnaire can trigger when the per- son recovering from alcoholism is near a bar)
son gets home or stops moving), phone activ- and immediately respond to provided answers
ity (a questionnaire can trigger after the enables the kind of tailored, just-in-time sup-
person finishes a phone call), or data from port that was impossible prior to the develop-
accelerometers (a questionnaire can trigger ment of modern mobile technology.
when the person finishes a physical activity).
Event-based prompting allows researchers to 19.2.2.3 Incorporating
collect data at precise times when an event of Self-Reporting Into
interest (e.g., physical activity) occurs, maxi- User Interactions
mizing their ability to get timely information In addition to diary and EMA studies, mobile
about the person’s experience and surround- technology can facilitate collection of self-
ing events and minimizing recall biases and report by incorporating self-reporting into
the risk of forgetting. interactions that users already perform on
Finally, some EMA platforms enable not their mobile devices. Particularly interesting
just collection of EMA data but also the use attempts to do this involved appropriating the
of the provided answers to trigger interven- unlock gesture on Android phones to enable a
19 tions. Such ecological momentary interven- user to provide a single piece of self-report
tions (EMIs; Heron and Smyth 2010) are data as part of the process of unlocking the
particularly useful in the substance use arena phone. The first project that took route, Slide
(e.g., Gustafson et al. 2014; Dennis et al. to X (Truong et al. 2014), found that
2015), where a person’s answers to EMA ques- individuals unlocked their phones between 20
tions can be used to calculate risk of relapse, and 100 times per day on average. To take
which, if it reaches a predefined threshold, advantage of this frequent interaction, Truong
can trigger coping interventions or facilitate et al. built three applications that turned the
mHealth and Applications
653 19
phone’s standard unlocking interface into a manner is a useful indicator of individuals’
data-collection tool. One of them, Slide To receptivity to intervention.27 In the new ver-
QuantifySelf, enabled users to answer a single sion of HeartSteps, these data are being used
question (e.g., “how happy are you right by a learning algorithm that personalizes
now?”) on a Likert scale in place of using per- intervention provision for each HeartSteps
forming a standard unlock gesture. Users user. Adding a single question to an applica-
could specify multiple questions with which to tion dashboard, or triggering a question by
be prompted, as well as when during the day observing the use of other applications (e.g.,
each question should be asked (e.g., asking when a user quits a social media app) are
about whether they ate breakfast mid- other ways when interactions already taking
morning), providing a low-burden way to col- place on the phone can be leveraged to collect
lect a rich self-report dataset that can help an self-report data without the need for addi-
individual better manage her health. A recent tional disruptive prompting of the user.
project called LogIn (Zhang et al. 2016)
expanded on this idea by developing gesture- 19.2.2.4 Easing Data Entry
based unlock interactions to track pleasure Finally, mobile devices offer a number of
and accomplishment, sleepiness, and mood. ways to ease data entry during self-report. As
Unlike Slide to QuantifySelf, which used a we mentioned, most EMA platforms enable
single Likert scale item to record self-report, users to take pictures and use the phone’s
LogIt used more sophisticated gesture-based microphone to speak their answers to the
interactions, such as measuring mood on questions. Such multimedia responses enable
Russell’s affect grid (Russell et al. 1989). collection of rich qualitative data that would
Self-report can be incorporated into appli- be too burdensome or even impossible to col-
cations as well. Here too the underlying idea is lect by requiring users to type their responses
to decrease the burden of providing self- into a text form. Multimedia capture enables
report by tying it to an action the user is additional types of data analyses that cannot
already performing. In HeartSteps (Klasnja be done on traditional questionnaire data.
et al. 2018), an mHealth physical activity For instance, PlateMate (Noronha et al. 2011)
intervention, users can receive prompts to go lets users track what they are eating by taking
for a brief walk or move to disrupt prolonged pictures of meals. The system then uploads
sitting. Like many mHealth interventions, these images to Mechanical Turk, where they
these prompts are provided as push notifica- are processed through a number of steps that
tions to the user’s phone. Other than dismiss- extract detailed nutritional information from
ing them through a simple OK button, the images, enabling logging of calories and
however, HeartSteps provides users with three micro-nutrients with much lower self-report
different buttons to dismiss the notification, burden than is involved in traditional
each intended to indicate how the user per- database-based nutritional logging. Similarly,
ceived that particular prompt. Users can press machine learning algorithms can be used to
a thumbs-up icon to indicate that they liked process audio recordings of data individuals
the activity suggestion and that it came at a enter by speaking into the phone not only to
good time, a thumbs-down icon to indicate extract content of those recordings, but also
that the suggestion was not helpful or came at to detect user’s mood or features such as
a bad time, and a button to turn off future latency or pitch that can be indicative of
prompts for a certain period of time, indicat- changes in the individual’s mental health,
ing that they will be busy or unavailable for such as an onset of a manic episode (e.g.,
the intervention. The data from a study of Gideon et al. 2016).
HeartSteps showed that participants were sig-
nificantly more active after the prompts they
marked thumbs-up than those to which they
responded in either of the two other ways, 27 Predrag Klasnja, personal communication,
suggesting that the self-report obtained in this 8/31/2019.
654 E. K. Choe et al.
Self-report can be made more convenient to clinic visits (Koshy et al. 2008; Leong
by placing self-reporting interfaces into easily et al. 2006)), while other times they are
accessible locations and minimizing how embedded into more complex interven-
much user interaction is required. Choe et al. tions, such as those for chronic disease
(2015) enabled low-burden self-monitoring of management.
sleep by placing a self-tracking widget on the 2. Support for behavior change. A large num-
phone’s lock screen, making it visible every ber of mHealth interventions are intended
time a person reached for her phone, and by to help individuals make health-promoting
recording sleep quality via a single tap. changes in their behavior. mHealth inter-
Similarly, Fitabase Engage, a new platform ventions exist for improving medication
from Fitabase, enables simple questions to adherence, increasing wellness behaviors
show up on individuals’ Fitbit smartwatches, like physical activity and healthy diet,
and the questions to be answered with a single helping with cessation of addictive behav-
tap. The speed of this interaction, and not iors like smoking and substance use, pre-
needing to reach for the phone, greatly venting relapse, and adhering to health
decreases the perceived burden on answering management practices like monitoring of
questions delivered via Fitabase Engage. glucose or blood pressure. Some of the
applications for the management of men-
tal health (e.g., depression and bipolar dis-
19.2.3 Providing Interventions order) fall into this category as well, as
in Individuals’ Daily Lives they often focus on helping individuals
enact therapeutic practices like behavioral
By facilitating passive data collection via sen- activation, or increase the regularity of
sors embedded in the phone and wearable sleep, social contact, and other behaviors
devices, and by providing ways to collect brief that support mental well-being.
self-report data at the right time and in the 3. Discovery of patterns. A related category
right context, mobile technology has greatly is mHealth interventions that support
enhanced collection of data that are needed to individuals in discovering patterns in their
understand health behaviors, to monitor behavioral or physiological responses. For
patients’ health, and to provide interventions. instance, TummyTrials (Karkar et al.
We now turn to how this information can be 2017) is a recent mHealth intervention
used by mHealth systems to provide interven- intended to help individuals detect food
tions at times when those interventions are triggers that aggravate irritable bowel syn-
most needed and when individuals are most drome. Other interventions, like Health
receptive to them. Mashups, (Bentley et al. 2013), try to help
Given the immense number of mHealth individuals understand what factors influ-
apps, their content can be categorized in a ence their physical activity, sleep, and
number of different ways (e.g., see Klasnja & other wellness behaviors. Although inter
Pratt, 2012 for one classification). For simplic- ventions in this category do not have
ity, many mHealth apps can be seen as falling explicit behavior-change features, such as
into one or more of the following five catego- goal-setting or planning, the self-
ries: experimentation they support is usually in
1. Reminders. One of the simplest functions service of making health-promoting
changes in one’s behavior, making this cat-
19 of mHealth interventions is to provide
egory a close relative of the behavior-
reminders. Reminders exist for many dif-
ferent health behaviors, including attend- change interventions.
ing scheduled clinic visits, taking 4. Detection or prediction of critical health
medications, and applying sunscreen on events. An important use of sensors and
sunny days. Sometimes reminders are a self-report data in the mHealth context is
stand-alone intervention (e.g., a text mes- to enable interventions that detect and/or
saging intervention to increase adherence predict critical health events that may
mHealth and Applications
655 19
require prompt medical attention. health-promoting behaviors, or when the sen-
mHealth solutions exist to detect a broad sors detect that there is a critical change in the
range of acute health events, including person’s health (e.g., high likelihood of a heart
falls, alarming levels of chemotherapy tox- attack) and she needs to receive immediate
icity, imminent risk of a heart attack, onset medical attention. The development of such
of a manic episode, and decompensation just-in-time interventions is made possible by
in chronic heart failure. Depending on the continuous, low-burden data collection,
severity of the detected event, such inter- always-on Internet connectivity, powerful
ventions either provide guidance to the phone-based information processing, and the
individual on how to manage the event or ability to prompt the person via a push notifi-
automatically contact the emergency ser- cation or a text message to deliver an interven-
vices to get the individual medical help as tion. A great deal of recent work on mHealth
quickly as possible. interventions has aimed to realize this prom-
5. Communication with the healthcare sys- ise of smart, timely intervention delivery.
tem. A growing number of mHealth inter- Although the promise of timely, in-context
ventions are intended to facilitate information has been acknowledged for many
communication with the health system years, it is only recently that the technical,
through the support of remote patient algorithmic, and methodological develop-
monitoring, secure messaging, prescrip- ments have enabled the development of inter-
tion refills, accessing labs and imaging, ventions that realize this goal. In recent
and scheduling appointments. Unlike the literature, such mHealth interventions have
categories reviewed above, mHealth apps been referred to as just-in-time adaptive inter-
in this category are almost always provided ventions (JITAIs; Nahum-Shani et al. 2015;
by and tied to a particular health system, Nahum-Shani et al. 2016; Spruijt-Metz and
pharmacy, or a clinic. To make such Nilsen 2014). JITAIs refer to mHealth sys-
mHealth interventions work, clinical tems that use decision rules—if-then rules or
workflows often have to be restructured to algorithms that specify when, where, and how
accommodate the use of mHealth tools by interventions are delivered to individuals—to
patients and the data that are generated as attempt to provide the right type of support at
part of that use. the right times and in the right contexts.
JITAIs use sensors and low-burden self-report
Although the above categories are not to continuously monitor individuals’ state,
intended to be exhaustive, they do cover many behavior, and the environment, and when they
of the common types of mHealth applica- detect that an individual is in a state of high
tions. Thus, they demonstrate the range of the risk or has an opportunity to engage in a
intervention work that has been done in the health-promoting behavior, they make a deci-
field of mHealth. sion about whether to intervene. How these
decisions are made varies. Simple JITAIs use
deterministic if-then rules that determine
19.2.4 Providing Just-in-Time what the system should do when a situation of
Adaptive Interventions risk or opportunity is detected. For instance, a
JITAI might send a person recovering from
From the earliest days of mHealth, a major alcohol use disorder a push notification with a
promise of mobile technology, given its con- coping strategy every time that person is
stant proximity to the person, has been its within a certain distance of a bar (Gustafson
ability to provide support for health manage- et al. 2014). Or the decision rules may be sto-
ment when that support is most needed (Intille chastic, where intervention is not provided
2004; Patrick et al. 2008; Nilsen et al. 2012): at every time a situation of risk or opportunity is
times when a person is at risk (e.g., of a sub- encountered but only with certain
stance use relapse) and needs help with cop- probability—usually to reduce user burden
ing, when there is an opportunity to engage in (e.g., Klasnja et al. 2018). Finally, the system
656 E. K. Choe et al.
may incorporate algorithms that evolve the sions (e.g., the number of steps a person walks
decision rules over time to maximize their on a day when the system sent a motivational
effectiveness for each individual (we will message in the morning)—or long-term out-
return to this point shortly). comes that prioritize how the system performs
What form the decision rules take depends over time (e.g., the user’s average daily step
on the nature of a JITAI’s intervention com- count over the course of a month). Much of
ponents, as well as the situations it is trying to the foundational research in RL has been
target. For situations that occur frequently— done in areas like robotics where both the def-
such as getting stressed or high likelihood of inition of success and the relevant state vari-
lighting up a cigarette—interventions have to ables are relatively unambiguous. As human
be spaced out to manage user burden and behavior is inherently more messy and the sys-
reduce the risk of system abandonment. tem’s knowledge of the person’s state and
Stochastic rules are well suited for meeting environment is far more noisy, it’s an open
these criteria, as they can reduce the overall research question whether mHealth systems
number of interventions while creating a sense would work better by employing algorithms
of unpredictability, which may increase the that focus on clearer but shorter-term out-
effectiveness of the interventions by reducing comes, or if the more sophisticated algorithms
habituation. On the other hand, for risk states that focus on the long-term can be made to
that occur rarely—or for which the conse- work in this setting.
quences of not intervening are severe (the risk The other approach to personalized JITAIs
of suicide being an extreme example)—simple draws on control systems engineering (Hekler
deterministic decision rules may be both appr et al. 2018; Phatak et al. 2018). Control sys-
opriate and adequate. tems engineering focuses on the development
As we mentioned above, decision rules can of systems that are capable of controlling com-
also evolve over time. Although standard plex processes, such as the flight of an airplane
deterministic and stochastic decision rules do or blood glucose metabolism. At the heart of
not change over the course of system use and this approach is the development of mathe-
are typically the same for all users of a system, matical models—called dynamical systems
mHealth systems with evolving rules aim to models—that encode what is known about the
personalize deliver of interventions for each influences on the process or behavior that
system user. The idea is that the system learns needs to be controlled. Those models are then
patterns in user behavior and intervention used by control systems such as the plane auto-
response over time, and it can then adjust how pilot or the artificial pancreas to make deci-
it provides interventions to maximize their sions about how the system should intervene to
effectiveness for each individual user and to maximize the likelihood that the process will
minimize user burden. behave in a desired way (the flight will get to
Two approaches to intervention personal- the destination city, the blood glucose will be in
ization are currently being investigated: rein- the healthy range, etc.). In the context of
forcement learning and control systems mHealth, the approach is being used to form
engineering. JITAIs based on reinforcement dynamical models of health behaviors (e.g., an
learning (RL) use algorithms to continually individual’s daily steps), and the models are
adjust the probability of intervening based on then used by an mHealth system’s “controller”
observations of the outcome of previously to decide when and how to intervene to move
delivered interventions (Sutton and Barto the behavior in the desired direction (e.g., to
19 1998). Observations of successful outcomes increase the daily step count). What is notable
lead the algorithm to increase the probability about this approach is that the controller is
of intervening in the similar context in the constantly updating its model based on
future; unsuccessful outcomes decrease that continual observation of the person’s behavior
probability. RL algorithms can focus either on and response to interventions, so that
short-term outcomes—the most immediate subsequent intervention decisions are always
outcomes of individual intervention provi- based on the updated model, one that
mHealth and Applications
657 19
progressively better describes the idiosyncra- 19.3 Broad Considerations
sies of each user’s behavior. Although control and Challenges
systems engineering has been used successfully
to manage complex processes for decades, its All health technologies have associated prag-
application to mobile health is still in its matic challenges and ethical issues regarding
infancy (see Hekler et al. 2018 for the state of privacy and security, impact on people’s work,
the field). the potential to increase health disparities,
and regulatory issues. In this section, we focus
on aspects of those issues that are unique to
19.2.5 Supporting or particularly problematic for mHealth
Self-Experimentation applications.
information on the screen or from gathering intervene immediately when a mobile sensor
stored data. For example, location logs could indicates a high likelihood that their client
be used to disclose businesses or locations a could harm themselves or others. Although
person has visited when and or for how long, such responsiveness could improve outcomes,
or to predict where a person will be when, these expectations could lead to further prob-
which could endanger someone who needs to lems with clinician burnout and challenges
conceal their whereabouts for safety reasons. with patient autonomy and confidentiality.
Although most mobile operating systems now Physicians are already experiencing a greater
attempt to make such location tracking increase in burnout and reduction in satisfac-
optional, many mHealth apps need such data tion with work-life balance than peer adults in
to function fully. For example, fitness trackers the U.S. (Shanafelt et al. 2015). However, a
need location data to accurately record users’ scoping review of physician well-being in the
activity. Thus, users often must choose mHealth context showed that these technolo-
between using an mHealth application and gies are playing important roles in the
keeping their data private. improvement of physicians’ well-being too
(Chen et al. 2018). Such advances are addi-
tionally changing both the nature and amount
19.3.2 Changes in Clinician Work of patient-clinician interactions. We have
much to learn about how clinicians’ use of
The proliferation of these mHealth apps these new mHealth technologies and their
brings the potential for substantive changes in response to patients’ use of the technologies
clinician work as well as for clinician-patient will affect clinicians’ work.
interactions. Because many people are now
using mHealth apps to generate huge amounts
of detailed health data, people could expect 19.3.3 Changes in Patient Work
clinicians to use those data to gain new
insights into a person’s health and positively This increasing prevalence of mHealth apps
influence their care. However, the volume and are changing the work and personal lives of
variety of data as well as applications that the people who use these tools to assist in
generated the data make it challenging for cli- their everyday health and well-being. With
nicians to incorporate the new data into their constant sensing of our health comes the
workflow. A literature review of empirical potential for unhealthy disruptions to daily
studies of self-tracking tools identified many life. Even simple apps that count steps for
clinician work-related concerns about infor- physical fitness typically interrupt people
mation quality and the lack of standards for when they have reached their step goal or
representing or viewing that data (West et al. remind people to get up and move during the
2016, 2017). Yet, recent research points to the day. Although these rewards and reminders
value of human-centered design approaches can help people meet their health goals, such
to addressing these concerns. For example, disruptions can come at inopportune times or
one study of DataMD showed that such an places and negatively impact people’s overall
approach could help clinicians develop a new well-being. Many apps allow people to disable
workflow that would allow them incorporate those disruptive features, but then they risk
this kind of data and improve their counseling missing important information or reminders
skills and support more in-depth conversa- that could be key to its successful use. Frequent
19 tions (Kim et al. 2017). tracking and constant reminders of one’s
Yet, irrespective of good design, the real- health can also lead to detrimental obsessions.
time sensing nature of the data creates other For example, one study examined the effect of
workflow concerns, particularly ethical and tracking technology on college students and
legal expectations that clinicians respond to found that those students who used fitness
the sensed data promptly. For example, men- trackers had higher levels of eating concerns
tal health providers could be expected to and symptoms of eating disorders (Simpson
mHealth and Applications
659 19
and Mazzeo 2017). Other studies have shown (Stowell et al. 2018). They identified familiar-
that such tracking can make healthy activities ity with the technology, use of engaging mul-
that people used to enjoy feel more like work timedia content, frequent delivery of content,
and lead to a decrease in those activities and personalization as facilitators in success-
and subjective well-being (Etkin 2016). ful interventions. However, costs and con-
Furthermore, people could become overly cerns about confidentiality and privacy
reliant on the objectively sensed but not neces- (particularly for those who hadn’t completed
sarily accurate data and discount their own or their immigration paperwork) were substan-
others’ subjective experiences.29 tial barriers. Although a slight majority of
Nonetheless, such widespread availability the reviewed studies showed significant
of mobile health apps also brings unprece- improvements in the evaluated health mea-
dented power to everyday people in their abil- sures, their meta-analysis failed to show that
ity to collect new forms of data about their the mHealth interventions successfully
own health and to interpret that data indepen- impacted health outcomes in vulnerable pop-
dently from clinicians. For example, HemaApp ulations. Researchers are beginning to work
allows people with anemia or pulmonary ill- together to identify opportunities for socio-
nesses to detect and monitor total hemoglo- technical interventions to reduce health dis-
bin in their blood using a smartphone’s parities (Siek et al. 2019), but much work
camera and flash, rather than requiring a visit remains.
to a clinician’s office for a blood draw and fol-
low up visit about their results (Wang et al.
2016). Many other apps, such as BiliScreen for 19.3.5 Regulatory Issues
jaundice detection and pancreatic cancer
screening (Mariakakis et al. 2017) and The staggering number of mHealth apps that
SpiroSmart for assessing lung function are available in the marketplace makes it chal-
(Larson et al. 2012), allow people to detect lenging for clinicians, patients, and the general
health problems or monitor existing problems public to discern which apps are effective and
that previously required a clinician visit. safe to adopt. Many worry about the prolif-
eration of these mHealth apps, particularly
the wide variation in quality, potential mis-
19.3.4 Health Disparities leading or unsubstantiated claims, and the
vulnerability of disclosure of personal health
Despite the increasing uptake of mobile information. The US Food and Drug
phones by people—regardless of race, age, or Administration (FDA) is the regulatory body
socioeconomic status—and the high preva- that could provide such safety and effective-
lence of smartphone usage (81% of all U.S. ness oversight of mHealth apps, but its role
adults),30 concerns still remain about whether and influence has been changing. In 2013,
increased reliance upon such mobile technol- 2015, and 2019, the FDA revised its guidance
ogies will exacerbate existing health dispari- about what it will regulate in the mHealth
ties. A recent systematic review examined the space and is now focusing only on apps that
research literature to investigate mHealth pose a great risk if they do not work as intend-
interventions for vulnerable populations ed.31 The regulations pertain to apps that
diagnose, treat, or prevent a health condition.
To help developers determine what laws and
29 Siek, K. Why fitness trackers may not give you all
the ‘credit’ you hoped for. January 15, 2020. The
Conversation. 7 https://theconversation.com/why-
fitness-trackers-may-not-give-you-all-the-credit- 31 FDA. Device Software Functions Including Mobile
you-hoped-for-128585 Retrieved January 15, 2020. Medical Applications. 7 https://www.fda.gov/medi-
30 Pew Research Center. June 12, 2019. Mobile Fact cal-devices/digital-health/device-software-func-
Sheet. 1–6. 7 http://www.pewinternet.org/factsheet/ tions-including-mobile-medical-applications
mobile/ Retrieval January 15, 2020. Retrieval January 15, 2020.
660 E. K. Choe et al.
regulations apply to their mHealth apps, the coming from many different countries.35
FDA created an interactive tool that poses Grand View Research predicts that the global
questions and summarizes the applicability of market for mHealth apps will reach 236 bil-
the laws based on the answers.32 In the private lion in US dollars by 2026 with fitness as the
sector, Xcertia serves as an mHealth app col- largest type of mHealth app.36
laborative effort of the American Medical Some see the future of mHealth through
Association (AMA), the American Heart the eyes of science fiction. In particular, many
Association (AHA), DHX Group and the have envisioned Star Trek’s® tricorder-like
Healthcare Information and Management technology of a small, hand-held device that
Systems Society (HIMSS) “to foster safe, could quickly diagnose a variety of medical
effective, and reputable health technologies.”33 conditions. Qualcomm incentivized this vision
They have developed a set of guidelines for with its $10 million XPRIZE competition to
assessing operability, privacy, security, con- develop a mHealth device that could accu-
tent, and usability. Clearly, the safety and reg- rately diagnose 13 medical conditions, capture
ulation of mHealth apps will remain a key 5 real-time health vital signs, and provide a
issue for the future. compelling consumer experience, without
input from a healthcare professional or facili-
ty.37 Although no one was able to meet all
19.4 Future Directions their criteria, Final Frontier Medical Devices
(now Basil Leaf Technologies38) received the
The rapidly changing nature of technology top prize of $2.5 million with their DxtER
makes writing a book chapter on any aspect device that employed non-invasive sensors to
of it challenging, but anticipating future collect vital signs, body chemistry, and bio-
directions is even more daunting. All aspects logical functions.39 Many others have alterna-
of this book face that challenge, but the field tive, grand views of the future of mHealth
of mHealth and the applications that charac- apps. One point is clear: mHealth applications
terize it are especially dynamic. Exact statis- will continue to influence all aspects of health-
tics are hard to find and rapidly out of date, care—from wellness and prevention through
but nonetheless paint a picture of mHealth’s
growing influence. A 2017 report from IQIA
reports that over 318,000 mHealth apps were
available worldwide and that more than 200 35 Sydow, L. Medical Apps Transform How Patients
Receive Medical Care. April 16, 2019. 7 https://
health apps are added each day.34 According www.appannie.com/en/insights/market-data/medi-
to App Annie’s State of Mobile 2019 Report, cal-apps-transform-patient-care/ Retrieved January
the global download of mHealth apps 15, 2020.
exceeded 400 million in 2018 with the growth 36 Grand View Research. mHealth Apps Market Size,
Share & Trends Analysis Report By Type (Fitness,
Lifestyle Management, Nutrition & Diet, Women’s
Health, Medication Adherence, Healthcare Provid-
ers/Payers), And Segment Forecasts, 2019–2026.
June 2019 7 https://www.grandviewresearch.com/
32 Developing a mobile health app? Find out which industry-analysis/mhealth-app-market Retrieved
federal laws you need to follow. 7 https://www.ftc. January 15, 2020
gov/tips-advice/business-center/guidance/mobile- 37 Qualcomm Tricorder XPRIZE: Empowering Per-
health-apps-interactive-tool Retrieval January 15, sonal Healthcare. 7 https://www.xprize.org/prizes/
19 2020 tricorder Retrieved January 15, 2020.
38 Basil Leaf Technologies. 7 http://www.basilleaf-
33 Xcertia: mHealth App Guidelines. 7 https://xcertia.
org/ Retrieved January 15, 2020. tech.com/ Retrieved January 15, 2020.
34 The Growing Value of Digital Health: Evidence and 39 Family-led team takes top prize in qualcomm tri-
Impact on Human Health and the Healthcare Sys- corder xprize competition for consumer medical
tem. Nov 07, 2017. 7 https://www.iqvia.com/ device inspired by Star Trek® April 13, 2017.
insights/the-iqvia-institute/reports/the-growing- 7 https://www.xprize.org/prizes/tricorder/articles/fam-
value-of-digital-health Retrieved January 15, 2020. ily-led-team-takes-top-prize-in-qualcomm-tricor
mHealth and Applications
661 19
patient-provider interaction and even sur- 55 What are just-in-time adaptive interventions
gery—and that influence will likely follow an and how do mHealth tools enable this type
unpredictable but pivotal path. of health intervention?
55 Most patient-centered mHealth tools are
nnSuggested Readings discretionary use technologies, in the sense
Klasnja, P., & Pratt, W. (2012). Healthcare in the that individuals can choose whether, how
pocket: Mapping the space of mobile-phone much, and for how long to use these
health interventions. Journal of Biomedical devices and applications. Yet, for these
Informatics, 45(1), 184–198. tools to have a hope of being effective,
Choe, E. K., Lee, N. B., Lee, B., Pratt, W., & they have to be used. Given what learned
Kientz, J. A. (2014). Understanding quanti- in this chapter, what aspects of mHealth
fied-selfers’ practices in collecting and explor- tools can facilitate and hinder engagement?
ing personal data. In Proceedings of the 32nd How can mHealth designers make these
annual ACM conference on human factors in tools more engaging so individuals can
computing systems (pp. 1143–1152). benefit from them?
New York: ACM.
Kim, Y. H., Jeon, J. H., Lee, B., Choe, E. K., &
Seo, J. (2017). OmniTrack: a flexible self- References
tracking approach leveraging semi-automated
tracking. Proceedings of the ACM conference Abbey, B., Alipour, A., Gilmour, L., Camp, C., Hofer, C.,
on interactive, mobile, wearable and ubiqui- Lederer, R., et al. (2012). A remotely programmable
tous technologies. 1(3):1:1–1:28. smart pillbox for enhancing medication adherence. In
2012 25th IEEE international symposium on computer-
Stowell, E., Lyson, M. C., Saksono, H., Wurth, based medical systems (CBMS) (pp. 1–4). IEEE.
R. C., Jimison, H., Pavel, M., & Parker, A. G. Alexander, C. M., Teller, L. E., & Gross, J. B. (1989). Principles
(2018). Designing and evaluating mHealth of pulse oximetry: Theoretical and practical consider-
interventions for vulnerable populations: A ations. Anesthesia & Analgesia, 68(3), 368–376.
systematic review. In Proceedings of the 2018 Ali, A. A., Hossain, S. M., Hovsepian, K., Rahman, M. M.,
Plarre, K., & Kumar, S. (2012). mPuff: Automated
CHI conference on human factors in comput- detection of cigarette smoking puffs from respiration
ing systems. Paper (Vol. 15, pp. 1–17). measurements. In Proceedings of the 11th interna-
tional conference on information processing in sensor
??Questions for Discussion networks (pp. 269–280). ACM.
55 What challenges does the rapid evolution of Alshurafa, N., Kalantarian, H., Pourhomayoun, M., Sarin, S.,
Liu, J. J., & Sarrafzadeh, M. (2014). Non-invasive moni-
mobile technologies and platforms (OS)
toring of eating behavior using spectrogram analysis
pose to the people who develop mHealth in a wearable necklace. In 2014 IEEE healthcare inno-
interventions? vation conference (HIC) (pp. 71–74). IEEE.
55 What are some of the ways in which mHealth Althoff, T., White, R. W., & Horvitz, E. (2016). Influence of
technologies are designed for health Pokémon Go on physical activity: Study and implica-
tions. Journal of Medical Internet Research., 18(12),
professionals? In designing such tools, what
1–23.
are some key design considerations? Amft, O., Junker, H., & Troster, G. (2005). Detection of eat-
55 How can lay individuals access their own ing and drinking arm gestures using inertial body-
data collected from mHealth technologies? worn sensors. In Ninth IEEE international symposium
In what cases such data access might be on wearable computers (ISWC'05) (pp. 160–163).
IEEE.
useful for individuals?
Amini, F., Hasan, K., Bunt, A., & Irani, P. (2017). Data repre-
55 In addition to creating new ways to deliver sentations for in-situ exploration of health and fit-
health interventions, mHealth has been seen ness data. In Proceedings of the 11th EAI international
as having the potential to greatly advance conference on pervasive computing technologies for
health research, including fields such as healthcare (pp. 163–172). New York: ACM.
Batalas, N., Khan, V. J., & Markopoulos, P. (2018). Using
epidemiology. How can mHealth tools
TEMPEST: End-user programming of web-based eco-
advance our understanding of factors that logical momentary assessment protocols.
shape individuals’ health? Proceedings of the ACM on Human-Computer
Interaction. 2(EICS), 3, 1–3:24.
662 E. K. Choe et al.
Bentley, F., Tollmar, K., Stephenson, P., Levy, L., Jones, B., drome and weight management. Journal of Medical
Robertson, S., et al. (2013). Health mashups. ACM Internet Research, 17(8), 1–25.
Transactions on Computer-Human Interaction, 20(5), Consolvo, S., Klasnja, P., McDonald, D. W., & Landay, J. A.
1–27. (2014). Designing for healthy lifestyles: Design con-
Blascheck, T., Besançon, L., Bezerianos, A., Lee, B., & siderations for mobile technologies to encourage
Isenberg, P. (2019). Glanceable visualization: Studies consumer health & wellness. Foundations and Trends
of data comparison performance on smartwatches. in Human-Computer Interaction, 6(3–4), 167–315.
IEEE Transactions on Visualization and Computer Dai, J., Bai, X., Yang, Z., Shen, Z., & Xuan, D. (2010). Mobile
Graphics, 25(1), 630–640. phone-based pervasive fall detection. Personal and
Brehmer, M., Lee, B., Isenberg, P., & Choe, E. K. (2018). Ubiquitous Computing, 14(7), 633–643.
Visualizing ranges over time on mobile phones: A Daskalova, N., Metaxa-Kakavouli, D., Tran, A., Nugent, N.,
task-based crowdsourced evaluation. IEEE Boergers, J., McGeary, J., & Huang, J. (2016).
Transactions on Visualization and Computer Graphics, SleepCoacher: A personalized automated self-exper-
25(1), 619–629. imentation system for sleep recommendations. In
Buettner, M., Prasad, R., Philipose, M., & Wetherall, D. Proceedings of the 29th annual symposium on user
(2009). Recognizing daily activities with RFID-based interface software and technology (pp. 347–358).
sensors. In Proceedings of the 11th international con- New York: ACM.
ference on ubiquitous computing (pp. 51–60). Dennis, M. L., Scott, C. K., Funk, R. R., & Nicholson, L.
New York: ACM. (2015). A pilot study to examine the feasibility and
Carter, S., Mankoff, J., & Heer, J. (2007). Momento: support potential effectiveness of using smartphones to pro-
for situated ubicomp experimentation. In Proceedings vide recovery support for adolescents. Substance
of the SIGCHI conference on human factors in comput- Abuse, 36(4), 486–492.
ing systems (pp. 125–134). New York: ACM. Devarakonda, S., Sevusu, P., Liu, H., Liu, R., Iftode, L., &
Carek, A. M., Conant, J., Joshi, A., Kang, H., & Inan, O. T. Nath, B. (2013). Real-time air quality monitoring
(2017). SeismoWatch: wearable cuffless blood pres- through mobile sensing in metropolitan areas. In
sure monitoring using pulse transit time. Proceedings Proceedings of the 2nd ACM SIGKDD international
of the ACM on Interactive, Mobile, Wearable and workshop on urban computing (p. 15). ACM.
Ubiquitous Technologies, 1(3), 40. Dey, A. K., Wac, K., Ferreira, D., Tassini, K., Hong, J. H., &
Chen, H. C., Jia, W., Yue, Y., Li, Z., Sun, Y. N., Fernstrom, J. D., Ramos, J. (2011). Getting closer: An empirical inves-
& Sun, M. (2013). Model-based measurement of food tigation of the proximity of user to their smart
portion size for image-based dietary assessment phones. In Proceedings of the 13th international con-
using 3D/2D registration. Measurement Science and ference on ubiquitous computing (pp. 163–172).
Technology, 24, 105701. New York: ACM.
Chen, S., Ju, X., Guo, X., & Tian, W. S. (2018). Factors influ- Eadie, M., & Steele, R. J. (2017). Non-invasive blood glu-
encing physician's well-being management within the cose monitoring and data analytics. In Proceedings of
mobile health context: A scoping review. Proceedings the international conference on compute and data
of healthcare informatics & health information tech- analysis (pp. 138–142). ACM.
nology (Sighealth) AMCIS 2018. Ertin, E., Stohs, N., Kumar, S., Raij, A., Al'Absi, M., & Shah, S.
Choe, E. K., Lee, N. B., Lee, B., Pratt, W., & Kientz, J. A. (2014). (2011). AutoSense: Unobtrusively wearable sensor
Understanding quantified-selfers' practices in col- suite for inferring the onset, causality, and conse-
lecting and exploring personal data. In Proceedings quences of stress in the field. In Proceedings of the 9th
of the 32nd annual ACM conference on human factors ACM conference on embedded networked sensor sys-
in computing systems (pp. 1143–1152). New York: tems (pp. 274–287). ACM.
ACM. Etkin, J. (2016). The hidden cost of personal quantifica-
Choe, E. K., Lee, B., Kay, M., Pratt, W., & Kientz, J. A. (2015). tion. Journal of Consumer Research, 42(6), 967–984.
SleepTight: Low-burden, self-monitoring technology Evans, B. (2016). PACO—Applying computational meth-
for capturing and reflecting on sleep behaviors. In ods to scale qualitative methods. Ethnographic Praxis
Proceedings of the 2015 ACM international joint confer- in Industry Conference Proceedings, 2016(1), 348–368.
ence on pervasive and ubiquitous computing (pp. 121– Evenson, K. R., Goto, M. M., & Furberg, R. D. (2015).
132). New York: ACM. Systematic review of the validity and reliability of
Choe, E. K., Abdullah, S., Rabbi, M., Thomaz, E., Epstein, consumer-wearable activity trackers. International
D. A., Cordeiro, F., et al. (2017). Semi-automated track- Journal of Behavioral Nutrition and Physical Activity,
19 ing: A balanced approach for self-monitoring appli-
cations. IEEE Pervasive Computing, 16(1), 74–84.
12(1), 159.
Fang, S. H., Liang, Y. C., & Chiu, K. M. (2012). Developing
Choudhury, T., Borriello, G., Consolvo, S., Haehnel, D., a mobile phone-based fall detection system on
Harrison, B., Hemigway, B., et al. (2008). The Mobile android platform. In 2012 computing, communica-
sensing platform: An embedded activity recognition tions and applications conference (pp. 143–146).
system. IEEE Pervasive Computing, 7(2), 32–41. IEEE.
Chung, C. F., Cook, J., Bales, E., Zia, J., & Munson, S. A. Fenton, S., Giannangelo, K., Kallem, C., & Scichilone, R.
(2015). More than telemonitoring: Health provider (2013). Data standards, data quality, and interoper-
use and nonuse of life-log data in irritable bowel syn- ability (updated). Journal of AHIMA, 84(11), 64–69.
mHealth and Applications
663 19
Ferreira, D., Kostakos, V., & Dey, A. K. (2015). AWARE: proceedings (p. 874). Bethesda, MD: American
Mobile context instrumentation framework. Frontiers Medical Informatics Association.
in ICT, 2(6), 1–6. Intille, S. S. (2004). Ubiquitous computing technology for
Free, C., Knight, R., Robertson, S., Whittaker, R., Edwards, just-in-time motivation of behavior change. Studies
P., Zhou, W., et al. (2011). Smoking cessation support in Health Technology and Informatics, 107(2), 1434–
delivered via mobile phone text messaging (txt- 1437.
2stop): A single-blind, randomised trial. The Lancet, Jakicic, J. M., Davis, K. K., Rogers, R. J., King, W. C., Marcus,
378(9785), 49–55. M. D., Helsel, D., et al. (2016). Effect of wearable tech-
Fritz, T., Huang, E. M., Murphy, G. C., & Zimmermann, T. nology combined with a lifestyle intervention on
(2014). Persuasive technology in the real world: A long-term weight loss: The IDEA randomized clinical
study of long-term use of activity sensing devices for trial. Journal of the American Medical Association,
fitness. In Proceedings of the SIGCHI conference on 316(11), 1161–1171.
human factors in computing systems (pp. 487–496). Jaspan, H. B., Flisher, A. J., Myer, L., Mathews, C., Seebregts,
New York: ACM. C., Berwick, J. R., et al. (2007). Brief report: Methods
Froehlich, J., Chen, M. Y., Consolvo, S., Harrison, B., & for collecting sexual behaviour information from
Landay, J. A. (2007). MyExperience: A system for in South African adolescents—a comparison of paper
situ tracing and capturing of user feedback on versus personal digital assistant questionnaires.
mobile phones. In Proceedings of the 5th interna- Journal of Adolescence, 30(2), 353–359.
tional conference on mobile systems, applications and Kaziunas, E., Lindtner, S., Ackerman, M. S., & Lee, J. M.
services (pp. 57–70). New York: ACM. (2018). Lived data: Tinkering with bodies, code, and
Gideon, J., Provost, E. M., & McInnis, M. (2016). Mood state care work. Human-Computer Interaction, 33(1), 49–92.
prediction from speech of varying acoustic quality Kang, J., Binda, J., Agarwal, P., Saconi, B., & Choe, E. K.
for individuals with bipolar disorder. In Proceedings (2017). Fostering user engagement: Improving sense
of IEEE international conference on acoustics, speech, of identity through cosmetic customization in wear-
and signal processing (ICASSP) (pp. 2359–2363). able trackers. In Proceedings of the 11th EAI interna-
Piscataway: IEEE. tional conference on pervasive computing technologies
Gouveia, R., Pereira, F., Karapanos, E., Munson, S. A., & for healthcare (pp. 11–20). New York: ACM.
Hassenzahl, M. (2016). Exploring the design space of Kanhere, S. S. (2011). Participatory sensing:
glanceable feedback for physical activity trackers. In Crowdsourcing data from mobile smartphones in
Proceedings of the 2016 ACM international joint confer- urban spaces. In 2011 IEEE 12th international confer-
ence on pervasive and ubiquitous computing (pp. 144– ence on mobile data management (Vol. 2, pp. 3–6).
155). New York: ACM. IEEE.
Gustafson, D. H., McTavish, F. M., Chih, M. Y., Atwood, A. K., Karkar, R., Schroeder, J., Epstein, D. A., Pina, L. R., Scofield,
Johnson, R. A., Boyle, M. G., et al. (2014). A smart- J., Fogarty, J., et al. (2017). Tummytrials: A feasibility
phone application to support recovery from alcohol- study of using self-experimentation to detect indi-
ism: A randomized clinical trial. JAMA Psychiatry, vidualized food triggers. In Proceedings of the 2017
71(5), 566–572. CHI conference on human factors in computing sys-
Hayes, T. L., Hunt, J. M., Adami, A., & Kaye, J. A. (2006). An tems (pp. 6850–6863). New York: ACM.
electronic pillbox for continuous monitoring of med- Kho, A., Henderson, L. E., Dressler, D. D., & Kripalani, S.
ication adherence. In 2006 international conference of (2006). Use of handheld computers in medical edu-
the IEEE engineering in medicine and biology society cation. Journal of General Internal Medicine, 21(5),
(pp. 6400–6403). IEEE. 531–537.
Hekler, E. B., Rivera, D. E., Martin, C. A., Phatak, S. S., Kim, Y., Heo, E., Lee, H., Ji, S., Choi, J., Kim, J. W., et al. (2017).
Freigoun, M. T., Korinek, E., et al. (2018). Tutorial for Prescribing 10,000 steps like aspirin. In Proceedings of
using control systems engineering to optimize adap- the 2017 CHI conference on human factors in computing
tive mobile health interventions. Journal of Medical systems (pp. 5787–5799). New York: ACM.
Internet Research, 20(6), e214. Kim, Y., Lee, B., & Choe, E. K. (2019). Investigating data
Heron, K. E., & Smyth, J. M. (2010). Ecological momentary accessibility of personal health apps. Journal of the
interventions: Incorporating mobile technology into American Medical Informatics Association, 26(5), 412–
psychosocial and health behaviour treatments. 419.
British Journal of Health Psychology, 15(1), 1–39. Kim, Y. H., Jeon, J. H., Lee, B., Choe, E. K., & Seo, J. (2017).
Hovsepian, K., al’Absi, M., Ertin, E., Kamarck, T., Nakajima, OmniTrack: a flexible self-tracking approach leverag-
M., & Kumar, S. (2015). cStress: Towards a gold stan- ing semi-automated tracking. Proceedings of the ACM
dard for continuous stress assessment in the mobile on Interactive, Mobile, Wearable and Ubiquitous
environment. In Proceedings of the 2015 ACM interna- Technologies. 1(3):1:1–1:28.
tional joint conference on pervasive and ubiquitous Kitamura, K., de Silva, C., Yamasaki, T., & Aizawa, K. (2010).
computing (pp. 493–504). New York: ACM. Image processing based approach to food balance
Intille, S. S., Kukla, C., Farzanfar, R., & Bakr, W. (2003). Just- analysis for personal food logging. In Multimedia and
in-time technology to encourage incremental, expo (ICME), 2010 IEEE international conference on
dietary behavior change. In AMIA annual symposium (pp. 625–630).
664 E. K. Choe et al.
Klasnja, P., Consolvo, S., & Pratt, W. (2011). How to evalu- Lillie, E. O., Patay, B., Diamant, J., Issell, B., Topol, E. J., &
ate technologies for health behavior change in HCI Schork, N. J. (2011). The n-of-1 clinical trial: The ulti-
research. In Proceedings of the SIGCHI conference on mate strategy for individualizing medicine?
human factors in computing systems (pp. 3063–3072). Personalized Medicine, 8(2), 161–173.
New York: ACM. Liu, H., Philipose, M., Pettersson, M., & Sun, M. T. (2014).
Klasnja, P., & Pratt, W. (2012). Healthcare in the pocket: Recognizing object manipulation activities using
Mapping the space of mobile-phone health inter- depth and visual cues. Journal of Visual Communication
ventions. Journal of Biomedical Informatics, 45(1), and Image Representation, 25(4), 719–726.
184–198. Lupton, D. (2014). Self-tracking cultures: Towards a sociol-
Klasnja, P., Smith, S., Seewald, N. J., Lee, A., Hall, K., Luers, ogy of personal informatics. In Proceedings of the
B., et al. (2018). Efficacy of contextually tailored sug- 26th Australian computer-human interaction confer-
gestions for physical activity: A micro-randomized ence on designing futures: The future of design
optimization trial of HeartSteps. Annals of Behavioral (pp. 77–86). New York: ACM.
Medicine, 53(6), 573–582. Makeyev, O., Lopez-Meyer, P., Schuckers, S., Besio, W., &
Koshy, E., Car, J., & Majeed, A. (2008). Effectiveness of Sazonov, E. (2012). Automatic food intake detection
mobile-phone short message service (SMS) remind- based on swallowing sounds. Biomedical Signal
ers for ophthalmology outpatient appointments: Processing and Control, 7(6), 649–656.
Observational study. BMC Ophthalmology, 8(9), 1–6. Mariakakis, A., Wang, E., Patel, S., & Wen, J. C. (2016). A
Kwon, M., Lee, J. Y., Won, W. Y., Park, J. W., Min, J. A., Hahn, smartphone-based system for assessing intraocular
C., et al. (2013). Development and validation of a pressure. In 2016 38th annual international confer-
smartphone addiction scale (SAS). PLoS One, 8(2), ence of the IEEE engineering in medicine and biology
e56936:1–e56936:7. society (pp. 4353–4356). Piscataway: IEEE.
Kyritsis, K., Diou, C., & Delopoulos, A. (2017). Food intake Mariakakis, A., Parsi, S., Patel, S. N., & Wobbrock, J. O.
detection from inertial sensors using lstm networks. (2018). Drunk user interfaces: Determining blood
In International conference on image analysis and pro- alcohol level through everyday smartphone tasks. In
cessing (pp. 411–418). Cham: Springer. Proceedings of the 2018 CHI conference on human fac-
Lane, S. J., Heddle, N. M., Arnold, E., & Walker, I. (2006). A tors in computing systems (p. 234:1-234:13). New York:
review of randomized controlled trials comparing ACM.
the effectiveness of hand held computers with paper Mariakakis, A., Banks, M. A., Phillipi, L., Yu, L., Taylor, J., &
methods for data collection. BMC Medical Informatics Patel, S. N. (2017). Biliscreen: smartphone-based scleral
and Decision Making, 6(23), 1–10. jaundice monitoring for liver and pancreatic disorders.
Larson, E. C., Goel, M., Boriello, G., Heltshe, S., Rosenfeld, Proceedings of the ACM conference on interactive,
M., & Patel, S. N. (2012). SpiroSmart: Using a micro- mobile, wearable and ubiquitous technologies. 1, 2,
phone to measure lung function on a mobile phone. article 20. 26 pages. https://doi.org/10.1145/3090085
In Proceedings of the 2012 ACM conference on ubiqui- Martínez-Pérez, B., De La Torre-Díez, I., & López-Coronado,
tous computing (pp. 280–289). New York: ACM. M. (2013). Mobile health applications for the most
Lazar, A., Koehler, C., Tanenbaum, J., & Nguyen, D. H. prevalent conditions by the World Health
(2015). Why we use and abandon smart devices. In Organization: Review and analysis. Journal of Medical
Proceedings of the 2015 ACM international joint confer- Internet Research, 15(6), 1–19.
ence on pervasive and ubiquitous computing (pp. 635– Michie, S., Abraham, C., Whittington, C., McAteer, J., &
646). New York: ACM. Gupta, S. (2009). Effective techniques in healthy eat-
Lee, B., Brehmer, M., Isenberg, P., Choe, E. K., Langer, R., & ing and physical activity interventions: A meta-
Dachselt, R. (2018). Data visualization on mobile regression. Health Psychology, 28(6), 690.
devices. In Extended abstracts of the 2019 CHI confer- Murphy, T. (2013). 40 years after the first cell phone call:
ence on human factors in computing systems (pp. 1–8). Who is inventing tomorrow’s future? IEEE Consumer
New York: ACM. Electronics Magazine, 2(4), 44–46.
Lee, Y. K., Chang, C. T., Lin, Y., & Cheng, Z. H. (2014). The Nahum-Shani, I., Hekler, E. B., & Spruijt-Metz, D. (2015).
dark side of smartphone usage: Psychological traits, Building health behavior models to guide the devel-
compulsive behavior and technostress. Computers in opment of just-in-time adaptive interventions: A
Human Behavior, 31, 373–383. pragmatic framework. Health Psychology, 34(S),
Leong, K. C., Chen, W. S., Leong, K. W., Mastura, I., Mimi, 1209–1219.
O., Sheikh, M. A., et al. (2006). The use of text mes- Nahum-Shani, I., Smith, S. N., Spring, B. J., Collins, L. M.,
19 saging to improve attendance in primary care: A ran-
domized controlled trial. Family Practice, 23(6),
Witkiewitz, K., Tewari, A., & Murphy, S. A. (2016). Just-
in-time adaptive interventions (JITAIs) in mobile
699–705. health: Key components and design principles for
LiKamWa, R., Liu, Y., Lane, N. D., & Zhong, L. (2013). ongoing health behavior support. Annals of
Moodscope: Building a mood sensor from smart- Behavioral Medicine, 52(6), 446–462.
phone usage patterns. In Proceedings of the 11th Nilsen, W., Kumar, S., Shar, A., Varoquiers, C., Wiley, T.,
annual international conference on mobile systems, Riley, W. T., Pavel, M., & Atienza, A. A. (2012).
applications, and services (pp. 389–402). New York: Advancing the science of mHealth. Journal of Health
ACM. Communication, 17(Sup1), 5–10.
mHealth and Applications
665 19
Noronha, J., Hysen, E., Zhang, H., & Gajos, K. Z. (2011). Siek, K. A., Connelly, K. H., Rogers, Y., Rohwer, P., Lambert,
PlateMate: Crowdsourcing nutritional a nalysis from D., & Welch, J. L. (2006). When do we eat? An evaluation
food photographs. In Proceedings of the 24th annual of food items input into an electronic food monitoring
ACM symposium on user Interface software and tech- application. In 2006 pervasive health conference and
nology (UIST ‘11) (pp. 1–12). New York: ACM. workshops (pp. 1–10). Piscataway: IEEE.
Parate, A., Chiu, M. C., Chadowitz, C., Ganesan, D., & Siek, K., Veinot, T., & Mynatt, B. (2019). Research opportu-
Kalogerakis, E. [2014]. Risq: Recognizing smoking nities in sociotechnical interventions for health dis-
gestures with inertial sensors on a wristband. parity reduction. Computing Community
Patil, O. R., Gao, Y., Li, B., & Jin, Z. (2017). CamBP: A camera- Consortium; 2019 Jun. Available at https://cra.org/
based, non-contact blood pressure monitor. In ccc/wp-content/uploads/sites/2/2018/01/17602-
Proceedings of the 2017 ACM international joint confer- CCC-Health-Disparities-ReportFinal.pdf
ence on pervasive and ubiquitous computing and pro- Shanafelt, T. D., Hasan, O., Dyrbye, L. N., Sinsky, C., Satele,
ceedings of the 2017 ACM international symposium on D., Sloan, J., & West, C. P. (2015). Changes in burnout
wearable computers (pp. 524–529). ACM. and satisfaction with work-life balance in physicians
Patrick, K., Griswold, W. G., Raab, F., & Intille, S. S. (2008). and the general US working population between
Health and the mobile phone. American Journal of 2011 and 2014. In Mayo Clinic proceedings (Vol. 90,
Preventive Medicine, 35(2), 177–181. No. 12, pp. 1600–1613). Elsevier.
Phatak, S. S., Freigoun, M. T., Martín, C. A., Rivera, D. E., Shiffman, S., Stone, A. A., & Hufford, M. R. (2008).
Korinek, E. V., Adams, M. A., et al. (2018). Modeling Ecological momentary assessment. Annual Review of
individual differences: A case study of the application Clinical Psychology, 4, 1–32.
of system identification for personalizing a physical Simpson, C. C., & Mazzeo, S. E. (2017). Calorie counting
activity intervention. Journal of Biomedical and fitness tracking technology: Associations with
Informatics, 79, 82–97. eating disorder symptomatology. Eating Behaviors,
Piercy, K. L., Troiano, R. P., Ballard, R. M., Carlson, S. A., 26, 89–92.
Fulton, J. E., Galuska, D. A., et al. (2018). The physical Spruijt-Metz, D., & Nilsen, W. (2014). Dynamic models of
activity guidelines for Americans. JAMA, 320(19), behavior for just-in-time adaptive interventions. IEEE
2020–2028. Pervasive Computing, 13(3), 13–17.
Poon, A. D., Fagan, L. M., & Shortliffe, E. H. (1996). The PEN- Stone, A., Shiffman, S., Atienza, A., & Nebeling, L. (2007).
ivory project: Exploring user-interface design for the The science of real-time data capture: Self-reports in
selection of items from large controlled vocabularies health research. Oxford: Oxford University Press.
of medicine. Journal of the American Medical Stowell, E., Lyson, M. C., Saksono, H., Wurth, R. C., Jimison,
Informatics Association, 3(2), 168–183. H., Pavel, M., & Parker, A. G. (2018). Designing and
Quinn, C. C., Clough, S. S., Minor, J. M., Lender, D., Okafor, evaluating mHealth interventions for vulnerable
M. C., & Gruber-Baldini, A. (2008). WellDoc™ mobile populations: A systematic review. In Proceedings of
diabetes management randomized controlled trial: the 2018 CHI conference on human factors in
Change in clinical and behavioral outcomes and computing systems. Paper (Vol. 15, pp. 1–17).
patient and physician satisfaction. Diabetes Stratton, R. J., Stubbs, R. J., Hughes, D., King, N., Blundell,
Technology & Therapeutics, 10(3), 160–168. J. E., & Elia, M. (1998). Comparison of the traditional
Rabbi, M., Aung, M. H., Zhang, M., & Choudhury, T. (2015). paper visual analogue scale questionnaire with an
MyBehavior: automatic personalized health feed- apple Newton electronic appetite rating system
back from user behaviors and preferences using (EARS) in free living subjects feeding ad libitum.
smartphones. In Proceedings of the 2015 ACM European Journal of Clinical Nutrition, 52(10), 737–741.
International Joint Conference on Pervasive and Sun, M., Burke, L. E., Mao, Z. H., Chen, Y., Chen, H. C., Bai, Y.,
Ubiquitous Computing (pp. 707–718). Li, Y., Li, C., & Jia, W. (2014). eButton: A wearable com-
Rooksby, J., Rost, M., Morrison, A., & Chalmers, M. C. puter for health monitoring and personal assistance.
(2014). Personal tracking as lived informatics. In In Proceedings of the 51st annual design automation
Proceedings of the 32nd annual ACM conference on conference. San Francisco: 1–5 June 2014.
human factors in computing systems (pp. 1163–1172). Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning:
New York: ACM. An introduction. Cambridge: MIT Press.
Rough, D., & Quigley, A. (2015). Jeeves—A visual pro- Truong, K. N., Shihipar, T., & Wigdor, D. J. (2014). Slide to x:
gramming environment for mobile experience sam- Unlocking the potential of smartphone unlocking. In
pling. In 2015 IEEE symposium on visual languages Proceedings of the SIGCHI conference on human factors
and human-centric computing (VL/HCC) (pp. 121– in computing systems (pp. 3635–3644). New York:
129). Piscataway: IEEE. ACM.
Russell, J. A., Weiss, A., & Mendelsohn, G. A. (1989). Affect Vaizman, Y., Ellis, K., Lanckriet, G., & Weibel, N. (2018).
grid: A single-item scale of pleasure and arousal. Journal Extrasensory app: Data collection in-the-wild with
of Personality and Social Psychology, 57(3), 493–502. rich user interface to self-report behavior. In
Sadoughi, F., Behmanesh, A., & Sayfouri, N. (2020). Proceedings of the 2018 CHI conference on human fac-
Internet of things in medicine: A systematic mapping tors in computing systems (pp. 554:1–554:12).
study. Journal of Biomedical Informatics, 103, 103383. New York: ACM.
666 E. K. Choe et al.
Wang, E. J., Li, W., Hawkins, D., Gernsheimer, T., Norby- Xiong, H., Huang, Y., Barnes, L. E., & Gerber, M. S. (2016).
Slycord, C., & Patel, S. N. (2016). HemaApp: Sensus: A cross-platform, general-purpose system for
Noninvasive blood screening of hemoglobin using mobile crowdsensing in human-subject studies. In
smartphone cameras. In Proceedings of the 2016 ACM Proceedings of the 2016 ACM international joint confer-
international joint conference on pervasive and ubiqui- ence on pervasive and ubiquitous computing (pp. 415–
tous computing (pp. 593–604). New York: ACM. 426). New York: ACM.
Webb, T., Joseph, J., Yardley, L., & Michie, S. (2010). Using Zhang, X., Pina, L. R., & Fogarty, J. (2016). Examining
the internet to promote health behavior change: A unlock journaling with diaries and reminders for in
systematic review and meta-analysis of the impact of situ self-report in health and wellness. In Proceedings
theoretical basis, use of behavior change techniques, of the 2016 CHI conference on human factors in com-
and mode of delivery on efficacy. Journal of Medical puting systems (pp. 5658–5664). New York: ACM.
Internet Research, 12(1), e4. Zhang, Y., & Feng, Z. (2017). A SVM method for continuous
West, P., Giordano, R., Van Kleek, M., & Shadbolt, N. (2016). blood pressure estimation from a PPG signal. In
The quantified patient in the doctor’s office. In Proceedings of the 9th International Conference on
Proceedings of the 2016 CHI conference on human fac- Machine Learning and Computing (pp. 128–132).
tors in computing systems (pp. 3066–3078). New York: Zhu, H., Colgan, J., Reddy, M., & Choe, E. K. (2016). Sharing
ACM. patient-generated data in clinical practices: An inter-
West, P., Van Kleek, M., Giordano, R., Weal, M., & Shadbolt, view study. In AMIA annual symposium proceedings
N. (2017). Information quality challenges of patient- (pp. 1303–1312). Bethesda, MD: American Medical
generated data in clinical practice. Frontiers in Public Informatics Association.
Health, 5, 284.
19
667 20
Contents
References – 690
and largely ignored the audio-only telehealth. national guidelines were developed (Kane and
This is paradoxical given that up to 25% of Sands 1998). However, e-mail has a number of
all primary care encounters occur via the disadvantages for health-related messaging:
telephone. These include triage, case manage- delivery is not guaranteed; privacy and secu-
ment, results review, consultation, medication rity are problematic; e-mail is transient (there
adjustment and logistical issues, like schedul- was no automatic logging or audit trail); and
ing. In part, this can be traced to the fact that the messages are completely unstructured.
telephone consultations are not reimbursed To address these limitations, a variety of
by most insurance carriers. Web- based messaging solutions, called per-
More recently, increased interest in cost sonal clinical electronic communications, have
control through case management has driven been developed (Sarkar and Starren 2002).
renewed interest in use of audio-only com- Because the messages never leave the Web site,
munication between patients and providers. many of the problems associated with conven-
Multiple articles have appeared on the value tional e-mail are avoided. Web-based messag-
of telephone follow-up for chronic conditions ing is a standard feature of patient portals see
(Downes et al. 2017; Jayakody et al. 2016). 7 Chap. 11) associated with many EHRs. The
Several managed care companies have set up inclusion of messaging as a Meaningful Use
large telephone triage centers. The National requirement for Certified EHRs significantly
Health Service in the UK is investing £123 mil- increased the use of web-based messaging to
lion per year in NHS Direct, a nation-wide provide telehealth (. Fig. 20.1).
telephone information and triage system that
handles 27,000 calls per day.
20.3.3 Remote Monitoring
.. Fig. 20.1 Connections. The figure shows different based sensors) or active monitoring (such as wearables
ways that electronic communications can be used to link or other active monitoring devices) in the patient’s home
patients with various health resources. Only connections or other community settings. Other resources, such as
directly involving the patient are shown (e.g., use of the remote surgery or imaging, would require the patient to
EHR by the clinician is not shown). Patient generated go to a telehealth-equipped clinical facility
data are created via passive monitoring (such as home-
underlying nearly all remote monitoring is ters that guide management. Any measurable
that clinically significant changes in patient parameter is a candidate for remote monitor-
condition occur between regularly scheduled ing. The collected data may include continu-
visits and that these changes can be detected ous data streams or, more commonly, discrete
by measuring physiologic parameters. measurements.
The care model presumes that, if these Another important feature of most remote
changes are detected and treated sooner, monitoring is that the measurement of the
the overall condition of the patient will be parameter and the transmission of the data
improved. An important distinction between are typically separate events. The measure-
remote monitoring and many conventional ment devices have a memory that can store
forms of telemedicine is that remote moni- multiple measurements. The patient will send
toring focuses on management, rather than the data to the caregiver in one of several
on diagnosis. Typically, remote monitoring ways. For many studies, the patient will log
involves patients who have already been diag- onto a server at the central site (either over the
nosed with a chronic disease or condition. Web or by direct dial-up) and then type in the
Remote monitoring is used to track parame- data. Alternately, the patient may connect the
674 M. F. Chiang et al.
measurement device to a personal computer who will review the data. Research studies
or specialized modem and transfer the read- have utilized specially trained nurses at cen-
ings electronically. tralized offices, but it is not clear that this will
More recently, a variety of monitoring scale up. Third is money—for most condi-
devices have been developed that either con- tions, remote monitoring is still not a reim-
nect directly to mobile telephones or transmit bursed activity.
the data to the mobile phone using Bluetooth
wireless. The mobile phone then transmits the
data to a provider for review. A major advan- 20.3.4 Remote Interpretation
tage of direct electronic transfer is that it elim-
inates problems stemming from manual entry, Although Samuel was diagnosed with Type
including falsification, number preference II diabetes over 20 years ago and realizes that
and transcription errors. The role of mobile visual loss can be a serious complication, he
telephones in providing health services has has only rarely received dilated eye exams for
grown so rapidly that the term mobile health retinopathy screening. There is no eye doc-
(or “mHealth”) has been coined. The term tor conveniently located near his home, and
appeared first, one time, in 2004 in PubMed. he feels that the appointments are always
See 7 Chap. 19 for more on mHealth sys- too long and that he has no problems such
tems. Additionally, the emergence of inter- as blurred vision. However, his primary care
connected sensors and devices referred to as doctor has recently implemented a new reti-
Internet of Things (IoT) described later in the nal screening machine in the office. During a
chapter, have the potential to contribute to routine medical examination, Samuel receives
remote monitoring systems. a retinal photograph from an office technician
Any condition that is evaluated by mea- that is then interpreted by a remote ophthal-
suring a physiologic parameter is a candidate mologist. Samuel is told that he has high-risk
for remote monitoring. The parameter most diabetic retinopathy that requires treatment to
measured in the remote setting is blood glu- prevent visual loss. He is emergently referred
cose for monitoring diabetes. A wide variety to an ophthalmologist, who performs a
of research projects and commercial systems successful laser procedure to treat the diabetic
have been developed to monitor patients with retinopathy.
diabetes. Patients with asthma can be moni- Remote interpretation is a category of
tored with peak-flow or full-loop spirometers. store-and-forward telehealth that involves
Patients with hypertension can be monitored the capture of images, or other data, at one
with automated blood pressure cuffs. Patients site and their transmission to another site for
with congestive heart failure (CHF) are moni- interpretation. This may include radiographs
tored by measuring daily weights to detect (teleradiology), photographs (teledermatol-
fluid gain. Remote monitoring of pacemaker ogy, teleophthalmology, telepathology), wave
function has been available for a number of forms such as ECGs (e.g. telecardiology), and
years and has recently been approved for text-based medical data.
reimbursement. Home coagulation meters The store-and-forward telehealth modali-
have been developed that allows the monitor- ties have benefited most from the develop-
ing of patients on chronic anticoagulation ment of the commodity Internet and the
therapy. See a discussion on remote intensive increasing availability of affordable high
care later in the chapter and also 7 Chap. 21 bandwidth connections that it provides. The
for more on patient monitoring systems. shared commodity Internet provides relatively
Several factors limit the widespread use high bandwidth, but the available bandwidth
20 of remote monitoring. First is the question is continuously varying. This makes it much
of efficacy. While these systems have proven better suited for the transfer of text-based
acceptable to patients and beneficial in small data and image files, rather than for streaming
studies, few large-scale controlled trials have data or video connections. Although image
been done. Second is the basic question of files are often tens or hundreds of megabytes
Telemedicine and Telehealth
675 20
in size, the files are typically transferred to the ble, or potentially even better, accuracy and
interpretation site and cached there for later efficiency compared to traditional film-based
interpretation. From a logistical perspec- radiological examination (Franken et al. 1992;
tive, multiple remote interpretations may be Mackinnon et al. 2008; Reiner et al. 2002).
batched and performed together, thereby pro-
viding important workflow and convenience zz Teleophthalmology
advantages over traditional medical examina- Another area of remote interpretation that
tions or real-time video telehealth paradigms. is growing rapidly is teleophthalmology,
particularly for retinal disease screening. As
zz Teleradiology one example, diabetic retinopathy (retinal
By far, teleradiology is the largest category disease) is a leading cause of blindness that
of remote interpretation, and probably the can be treated if detected early. However, it
largest category of telehealth. Teleradiology has been found that nearly 50% of diabetics
(along with telepathology) represents the are non- compliant with guidelines recom-
most mature clinical domain in telehealth. mending annual screening eye examinations
With the deployment of picture archiving and (Brechner et al. 1993). Systems have been
communications systems (PACS) that capture, developed that allow nurses or technicians
store, transmit and displays digital radiology in primary care offices to obtain high quality
images, the line between teleradiology and digital retinal photographs. These images are
conventional radiology is blurring. In fact, sent to regional centers for interpretation. If
routine medical care in radiology and pathol- diabetic retinopathy is identified or suspected,
ogy is increasingly being delivered primarily the patient is referred for full ophthalmologic
through “telehealth” strategies (Radiology examination.
image management is discussed in more detail Large-scale operational systems have
in 7 Chap. 22). been implemented by the Veterans Health
Many factors have contributed to the more Administration and by other institutions, par-
rapid adoption of telehealth in domains such ticularly in areas with limited accessibility to
as radiology and pathology. One important eye care specialists (Cavalleranno et al. 2005;
factor is the relationship between these spe- Cuadros and Bresnick 2009). In fact, remote
cialists and their patients. In both domains, interpretation of retinal images by certified
the professional role is often limited to the reading centers, when taken after dilation of
interpretation of images, and the special- the eyes using standard photographic proto-
ist rarely interacts directly with the patient. cols originally developed for clinical research
To patients, there is therefore little perceived trials, has been demonstrated to classify dia-
difference between a radiologist in the next betic retinopathy more accurately than tradi-
building and one in the next state. tional dilated eye examination. This is likely
An important factor driving the growth because retinal abnormalities found on pho-
of teleradiology is that it is reimbursable by tographs may be reviewed in more detail than
insurance payers. Because image interpre- what is generally feasible during traditional
tation does not involve direct patient con- eye examinations.
tact, few payers make any distinction about Another application of teleophthalmology
where the interpretation occurred. Rapid dis- is in retinopathy of prematurity (ROP), a lead-
semination of teleradiology systems has also ing cause of blindness in premature infants,
been supported by widespread adoption of whereby hospitalized infants are examined
vendor-neutral image storage and transmis- regularly to identify treatment-requiring dis-
sion standards such as Digital Imaging and ease. However, these examinations are logis-
Communication in Medicine (DICOM; dis- tically difficult and time consuming, and the
cussed in more detail in 7 Chaps. 7 and 22). number of ophthalmologists willing to per-
Finally, numerous evaluation studies have form them has decreased. As a result, systems
demonstrated that digital image interpreta- have been developed in which trained nurses
tion by through teleradiology has compara- capture retinal photographs and transmit
676 M. F. Chiang et al.
.. Fig. 20.2 Retinal images of diabetic retinopathy ally used by optometrists. (Source: Rajalakshmi et al.
obtained via a fundus on phone (FOP) smartphone sys- 2015) Creative Common Attribution
tem compared to a professional Zeiss camera tradition-
them to experts for remote interpretation usually an academic medical center, was con-
(Richter et al. 2009). The proliferation of nected to many spokes, usually rural clinics.
smartphones has introduced additional ways Many of the early telehealth consults
to promote teleopthamology, using a “fun- involved the patient and the primary care pro-
dus on phone” (FOP) camera to facilitate a vider at one site conferring with a specialist at
smartphone-based cost-effective retinal imag- another site. Most of the state-wide telehealth
ing system (. Fig. 20.2). networks operated on this model. This was so
engrained in the telehealth culture, that the
first legislation allowing Medicare reimburse-
20.3.5 Video-Based Telehealth ment of telehealth consults required a “pre-
senter” at the remote site.
To many people telehealth is videoconferenc- This requirement for a “presenter” exac-
ing. Whenever the words “telehealth” or “tele- erbated the scheduling problem. Because
medicine” are mentioned, most people have a synchronous video telehealth often uses spe-
mental image of a patient talking to a doctor cialized videoconferencing rooms, the tele-
over some type of synchronous video connec- visits need to be scheduled at a specific time.
tion. Indeed, most early telehealth research Getting the patient and both clinicians (expert
did focus on synchronous video connections. and presenter) at the right places at the right
20 For many of the early studies, the goal was to time has forced many telehealth programs
provide access to specialists in remote or rural to hire a full-time scheduler. The schedul-
areas. Nearly all of the early systems utilized ing problem, combined with the advent of
a hub-and-spoke topology where one hub, more user-friendly equipment, ultimately led
Telemedicine and Telehealth
677 20
Medicare to drop the presenter requirement. have also been placed in ambulances to pro-
Even so, scheduling is often the single biggest vide remote triage.
obstacle to greater use of synchronous video More recently, the growing popular-
consultations. ity of mobile devices is creating potential
A second obstacle has been the availabil- for new strategies involving real-time video
ity of relevant clinical information. Because communication between patients and health
of the inability to interface between various care providers. This is especially promising
EHRs, it was not unusual for staff to print out because mobile networks are low-cost and
results from the EHR at one site and then to widely-available for consumers, and because
fax those to the other site prior to a synchro- they are increasingly accessible even in devel-
nous video consultation. oping countries. However, health informa-
Unlike store-and-forward telehealth, syn- tion exchange using mobile networks raises
chronous video requires a stable data stream. concerns about privacy, security, and compli-
Although video connection can use conven- ance with Health Insurance Portability and
tional phone lines (commonly referred to as Accountability Act (HIPAA). With appropri-
plain old telephone service, or POTS) that ate encryption settings, wireless video com-
provide 64 bits-per-second (64 kbs) transmis- munication using mobile device applications
sion speed, diagnostic quality video typically may be HIPAA-compliant (e.g. FaceTime;
requires at least 128 Kbs and more commonly Apple Computer, Cupertino, CA). There are
384 Kbs. In order to guarantee stable data already various commercially available solu-
rates, synchronous video in clinically criti- tions that allow patients to download smart-
cal situations still relies heavily on dedicated phone apps to access clinicians. Some of these
circuits, either Integrated Service Digital apps use chatbot technology to screen symp-
Network (ISDN) connections or leased lines. toms before matching patients with clinicians
Within single organizations, or in consulta- who can communicate with text, images and
tive or educational settings, Internet Protocol videos and can e-prescribe to local pharma-
(IP) based video conferencing has become cies. In the future, these mobile technologies
the dominant modality. While POTS-based may provide additional opportunities for
telehealth systems were common in 1990s increased communication between patients
and even early 2000s, the diffusion of high- and providers.
speed Internet has led to a much wider Prior to the adoption of IP-based video-
adoption of IP based videoconferencing. conferencing, programs that begun with grant
The anticipated growth of 5G (fifth genera- funding ended soon after the grant funding
tion wireless systems) facilitating far higher ended. Even after the advent of IP-based con-
speeds and connections with massive capac- ferencing, many programs continued to strug-
ity and low latency for consumer devices, is gle. This was in spite of the fact that Medicare
expected to accelerate the use of telehealth had begun reimbursing for synchronous video
for a broad spectrum of applications and tar- under limited circumstances. The COVID-19
get populations. pandemic introduced short-term policy changes
Synchronous video telehealth has been and led to an accelerated growth of telehealth
used in almost every conceivable situation. In as we discuss later (see 7 Sect. 20.3.8).
addition to traditional consultations, the sys- Some rural health care providers, such
tems have been used to transmit grand rounds as the Marshfield Clinic in Wisconsin, have
and other educational presentations. Video integrated synchronous telehealth into their
cameras have been placed in operating rooms standard care model to provide routine spe-
at hub sites to transmit images of surger- cialist services to outlying location. Some
ies for educational purposes. Video cameras categories of synchronous video telehealth
have been placed in emergency departments have developed sustainable models: telepsy-
and operating rooms at spoke sites to allow chiatry, correctional telehealth; home tele-
experts to “telementor” less experienced phy- health, emergency telehealth, and remote
sicians in the remote location. Video cameras intensive care.
678 M. F. Chiang et al.
others, care of children with autism (Mazurek at the COVID-19 outbreak epicenter at that
et al. 2017), geriatric mental health (Fisher time, telemedicine visits increased by 135%
et al. 2017) and diabetes (Swigert et al. 2014). in urgent care and by 4345% increase in non-
The wide adoption of ECHO aims to utilize urgent care between March 2 and April 14,
telehealth to strengthen workforce capacity in 2020 (Mann et al. 2020). In addition to using
underserved areas and address health dispari- telehealth for the delivery of traditional health
ties. As of 2018 Project ECHO operated more care services, this platform also played a role
than 220 hubs for more than 100 conditions for decision making and self-triage in the con-
or diseases in 31 countries. In a recent study, text of the pandemic. One such example is a
a program informed by the original ECHO telehealth patient portal for self-triage and
model called SCAN-ECHO (Specialty Care scheduling that was created at the University
Access Network-Extension for Community of California San Francisco to enable asymp-
Healthcare Outcomes) was introduced by the tomatic patients to report exposure history
Veterans Health Administration (VHA) to and for symptomatic patients to be triaged
improve care for patients with liver disease and paired with appropriate levels of care
in rural and underserved areas where care to (Judson et al. 2020). The rapid expansion of
specialized care can be challenging. The study telehealth in times of this pandemic highlights
collected 5 years of clinical data from 62,237 the significance of investing in infrastructure
veterans with liver disease in the region. Only and training to better prepare health systems
513 of these veterans had a primary care in times of public health emergencies.
physician who participated in SCAN-ECHO
where they could discuss their patient cases
with specialists but they had a 54% higher sur- 20.4 Challenges and Future
vival rate compared to the rest of their cohort Directions
even when adjusted for other variables (Su
et al. 2018). As telehealth evolves from research novelty
to being a standard way that health care is
delivered, many challenges must be overcome.
20.3.8 he Emergence of Telehealth
T Some of these challenges arise because the one
during a Global Pandemic patient, one doctor model no longer applies.
Basic questions of identity and trust become
As telehealth bridges geographic distance, paramount. At the same time, the shifting
it enables continuity in delivery of services focus from treating illness to managing health
even at times when populations may not have and wellness requires that clinicians know not
access to travel or even be restricted by physi- only the history of the individuals they treat
cal distancing and quarantine. The COVID-19 but also information about the social and envi-
pandemic in 2020 highlighted this potential ronmental context within which those individ-
and led to rapidly accelerated growth of tele- uals reside. In the diabetes example, knowledge
health. Worldwide health systems quickly of the family history of risk factors, diseases,
adopted telehealth solutions. In the US, insur- and the appropriate diagnostic and interven-
ers expanded coverage to include all telemedi- tional protocols, aid the clinical staff in pro-
cine and telehealth visit types including home viding timely and appropriate treatment.
visits, and licensure requirements were relaxed
(Centers for Medicare and Medicaid Services
2020). The US Department of Health and 20.4.1 Challenges to Using
Human Services (2020) waived enforcement the Internet for Telehealth
20 of HIPAA regulations to allow the use of Applications
video-conferencing for telemedicine visits
including the use of widely available video- Because of the public, shared nature of the
conferencing solutions. In one large health Internet, its resources are widely accessible
system (NYU Langone Health) which was by citizens and health care organizations.
Telemedicine and Telehealth
685 20
This public nature also presents challenges for home and community-based care requires
to the security of data transmitted along the that clinical services be supported by appro-
Internet. The openness of the Internet leaves priate technical resources. The challenge
the transmitted data vulnerable to intercep- of the digital divide that highlights varying
tion and inappropriate access. In spite of sig- degrees of access for patients to infrastruc-
nificant improvements in the security of Web ture and tools necessary for telehealth, must
browsing several areas, including protection be addressed when designing and implement-
against viruses, authentication of individuals ing such systems. This consideration is neces-
and the security of email, remain problematic. sary to ensure that telehealth systems, meant
Ensuring every citizen access to the Internet to bridge geographic distance and increase
represents a second important challenge to access, do not end up further exacerbating
the ability to use it for public health purposes. inequities and raising additional barriers to
Access to the Internet presently requires com- high quality care. Additionally, consumer
puter equipment that may be out of reach education is necessary so that patients and
for persons with marginal income levels. families fully understand risks and benefits of
Majority-language literacy and the physical using telehealth software and hardware inte-
capability to type and read present additional grated into the care they receive. Educational
requirements for effective use of the Internet. initiatives need to address a wide spectrum of
Preventing inequalities in access to health consumers’ literacy and health literacy but
care resources delivered via the Internet will also data literacy, namely consumers’ ability
require that health care agencies work with to process, extract meaning and communicate
other social service and educational groups knowledge generated by data.
to make available the technology necessary to
capitalize on this electronic environment for
health care. A 2019 Report by the U.S. Federal 20.4.2 Licensure and Economics
Communications Commission indicated in Telehealth
that more than 20 million Americans lack
advanced broadband Internet access (defined Licensure is frequently cited as the single
as download speeds of at least 25 megabits biggest problem facing telemedicine involv-
per second with upload speeds of 3 Mbps) ing direct patient-provider interactions. This
highlighting that many rural settings depend is because medical licensure in the United
on satellite Internet for access at a higher cost. States is state-based, while telemedicine fre-
The COVID-19 pandemic intensified the dis- quently crosses state or national boundaries.
parities that emerge from this digital divide The debate revolves around the questions
and strengthened ongoing efforts to dissemi- of whether the patient “travels” through the
nate higher bandwidth to rural settings. wire to the clinician, or the clinician “travels”
As health care becomes increasingly reli- through the wire to the patient. Several states
ant on Internet-based telecommunications have passed legislation regulating the manner
technology, the industry faces challenges in in which clinicians may deliver care remotely
insuring the quality and integrity of many or across state lines. Some states have enacted
devices and network pathways. These chal- “full licensure models” that require practitio-
lenges differ from previous medical device ners to hold a full, unrestricted license in each
concerns, because the diversity and reliability state where a patient resides. Many of these
of household equipment is under the control laws have been enacted specifically to restrict
of the household, not the health care provid- the out-of-state practice of telemedicine. To
ers. There is an increased interdependency limit Web-based prescribing and other types
between the providers of health services, those of asynchronous interactions, several states
who manage telecommunication infrastruc- have enacted or are considering regulations
ture and the manufacturers of commercial that would require a face-to-face encoun-
electronics. Insuring effective use of telehealth ter before any electronically delivered care
686 M. F. Chiang et al.
would be allowed. In contrast, some states and Medicaid Services (CMS) created a new
are adopting regulations to facilitate tele- chronic care management (CCM) code that
health by exempting out-of-state physicians provides for non-face-to-face consultation
from in-state licensure requirements provided which introduces options for reimburse-
that electronic care is provided on an irregu- ment for asynchronous remote monitoring.
lar or episodic basis. Still other models would Furthermore, starting in 2018 CMS allowed
include states agreeing to either a mutual providers to get reimbursed separately for
exchange of privileges, or some type of “reg- time spent on the collection and interpreta-
istration” system whereby clinicians from out tion of health-related data that were gener-
of state would register their intent to practice ated remotely. As technologies advance and
via electronic medium. play a pervasive role in our health care sys-
At the same time, national organizations tem, we anticipate the incremental changes in
representing a variety of health care profes- telehealth legislation to accelerate. In 2017,
sions (including nurses, physicians and physi- 210 telehealth related bills were active across
cal therapists) have proposed a variety of thirty states and as new technological capa-
approaches to these issues. While the exist- bilities are introduced, we anticipate further
ing system is built around individual state legislative efforts. Over 25 states have laws
licensure, groups that favor telemedicine have mandating private insurers to reimburse for
proposed various interstate or national licen- telehealth services (with at least another 10
sure schemes. The Federated State Board of additional states have pending or proposed
Medical Examiners has proposed that physi- laws to do so). Numerous insurers provide
cians holding a full, unrestricted license in any reimbursement for electronic messaging and
state should be able to obtain a limited tele- online consultations. The Medicare Telehealth
medicine consultation license using a stream- Parity Act of 2015 has led to advancements
lined application process. The American in reimbursement for teleradiology and tele-
Medical Association is fighting to maintain dermatology including payments for store-
the current state-based licensure model while and-forward telehealth; however, restrictions
encouraging some reciprocity. The American still apply to types of technologies used, ser-
Telemedicine Association supports the posi- vices provided and populations covered. Few
tion that—since patients are “transported” groups have even considered reimbursement
via telemedicine to the clinician—the practi- for telehealth services that do not involve
tioner need only be licensed in his or her home patient-provider interaction. An expert sys-
state. The National Council of State Boards tem could provide triage services; tailored
of Nursing has promoted an Interstate Nurse on-line educational material, or customized
Licensure Compact (NLC) whereby licensed dosage calculations. Such systems are expen-
nurses in a given state are granted multi-state sive to build and maintain, but only services
licensure privileges and are authorized to provided directly by humans are currently
practice in any other state that has adopted reimbursed by insurance.
the compact. By 2015, 25 states had adopted Historically, patients have been perceived
the NLC; in 2017 the eNLC went into effect as as reluctant to pay directly for telehealth ser-
an enhanced compact that addressed the chal- vices, especially when face-to-face visits were
lenge of uniform criminal background checks. covered by insurance. This trend is changing
As of early 2020, 34 states had enacted the as consumers are more familiar with the use
eNLC. of various technologies to bridge geographic
The second factor limiting the growth of distance and embrace various innovative tools
telehealth is reimbursement. Prior to the mid- to bridge geographic distance. In a 2017 study
20 1990s there was virtually no reimbursement (Chang et al. 2017) estimating US house-
for telehealth outside of teleradiology. For holds’ willingness to pay for telehealth, the
many years Medicare routinely reimbursed representative household was willing to pay
for synchronous video only for rural patients. $4.39 per month for telehealth. This valuation
In January 2015, the Centers for Medicare increased for household with higher opportu-
Telemedicine and Telehealth
687 20
nity costs and even more for households living Software that displays clinical information
more than 20 miles away from their nearest required for remote management, and that
medical facility (to $6.22 per month). integrates into existing workflow patterns and
Finally, home telehealth monitoring may maximizes efficiency through good usability
reduce the health care costs associated with principles will be required. More specifically
unreimbursed hospital readmissions. For in the context of usability, when patients are
example, some insurance payers do not reim- asked to utilize telehealth equipment, several
burse for hospital readmissions that occur factors should be taken into consideration
within 30 days of discharge, and there are such as patients’ previous experience with
anecdotal reports of health systems paying for and comfort in using technology, potential
32 days of home monitoring post-discharge. functional or cognitive limitations, and the
Determining whether, and how much, to pay availability of family members or informal
for telehealth services will likely be a topic caregivers who may be able to assist. Consider
of debate for years to come. Starting in 2020 our example of Samuel who lives alone and
Accountable Care Organizations (ACOs) with has some visual limitations. The decision to
Medicare fee-for-service beneficiaries will use software and/or hardware for home-based
have the option to expand telehealth services monitoring should address Samuel’s living
to include the home as an eligible originat- arrangements and residential infrastructure,
ing site (without being subject to the cur- as well as his ability and willingness to oper-
rent Medicare geographic requirements for ate the telehealth system (and the potential
the telehealth originating site). Public health role his neighbor may play who is involved in
developments affect the legal landscape as Samuel’s care). Methods for providing added
the recent COVID-19 pandemic highlighted value from technology toward telehealth diag-
where several of the regulatory and licensing nostic systems through strategies such as links
barriers were temporarily lifted. The regula- to consumer health resources or computer-
tory landscape will evolve over time as both based diagnosis may be explored (Koreen
technology and medical knowledge advance et al. 2007). Finally, studies have suggested
and societal needs change. that patient satisfaction with telehealth sys-
tems is high (Lee et al. 2010). However, the
practitioner-patient relationship is fundamen-
20.4.3 Logistical Requirements tal to health care delivery, and mechanisms
for Implementation must be developed that this bond is not lost
of Telehealth Systems from telehealth.
of these are based on an asynchronous model. tronically monitoring many processes simulta-
Transport media range from standard broad- neously. Clinicians will no longer ask simply,
band in the urban areas, to satellite connec- “How is Mrs. X today?” They will also ask the
tions, to cellular data, to SMS messaging. The computer “Among my 2,000 patients, which
largest group of applications focuses on the ones need my attention today?” Neither clini-
provision of remote consultations for difficult cians, nor EHRs, are prepared for this change.
cases using computer-based systems, while gen- Many service industries such as the travel
eral health education and remote data collec- and transportation sectors have more recently
tion have been the primary applications using experienced dramatic changes due to the con-
cellular telephony applications. However, the cept of “shared economy” that promotes a
development of smart mobile telephones with shift from strictly regulated frameworks for
high-resolution cameras is rapidly blurring this transactions to decentralized approaches
distinction. Successful implementation of tele- where community networks promote identify-
health in various low resource settings is dem- ing and optimizing resources based on needs
onstrated in a series of projects captured by an identified by the community members. This
e-book by Wootton and Bonnardot (2015). paradigm shift has started to also permeate
the health care field introducing an expanded
perspective of telehealth whereby consumers
20.4.5 Future Directions use an app to arrange for on-demand home
visits or “virtual visits” enabled by videocon-
Telehealth validation studies across a range ferencing. One example is the Pager app that
of clinical domains have demonstrated good helps consumers find a doctor who will con-
diagnostic accuracy, reliability, and patient tact and visit within a guaranteed two-hour
satisfaction. Based on these results, numer- window. Similarly, various apps like Mend
ous real-world telehealth programs have been or HealApp arrange for on-demand video-
implemented throughout the world. In the consultations or visits. Other examples of
long term, successful large-scale expansion sharing economy apps utilize crowd sourcing
of these programs will require addressing the to support diagnosis processes. These trends
above challenges. introduce opportunities and challenges as
Beyond these practical factors, traditional we consider the extent to which regulatory
medical care uses a workflow model based on and quality safeguards may be necessary to
synchronous interactions between clinicians maximize benefits and reduce unintended
and individual patients. The workflow model consequences. Furthermore, the use of wear-
is also a sequential one in that the clinician ables and smart home technologies expands
may deal with multiple clinical problems or the traditional models of telehealth so that in
data trends but only within the context of the near future comprehensive telehealth sys-
treating a single patient at a time. Medical tems may integrate physiological, behavioral,
records, both paper and electronic, as well as social, cognitive, environmental and genomic
billing and administrative systems all rely on data sources to deliver “precision medicine”
this sequential paradigm, in which the funda- in a continuum of care. Artificial intelligence
mental unit is the “visit.” Advances in tele- and predictive analytics will play a key role in
health are disrupting this paradigm. Devices this expansion of the telehealth paradigm.
have been developed that allow remote elec- Perhaps the greatest long-term effect of
tronic monitoring of diabetes, hypertension, the information and communication revo-
asthma, congestive heart failure (CHF), and lution will be the breaking down of role,
chronic anticoagulation. As a result, clini- geographic, and social barriers. Medicine is
20 cians may become inundated by large vol- already benefiting from this effect. Traditional
umes of electronic results. This may mean “doctors and nurses” are collaborating with
that clinicians will no longer function in an public health professionals, and anyone with
assembly-line fashion, but will become more computer access can potentially communicate
like dispatchers or air-traffic controllers, elec- with patients or experts around the world. The
Telemedicine and Telehealth
689 20
challenge will be to facilitate productive col- C. V., Hess, D. C., Majersik, J. J., Nystrom,
laborations among patients, their caregivers, K. V., Reeves, M. J., Rosamond, W. D.,
biomedical scientists, and information tech- Switzer, J. A., & American Heart Association
nology experts to promote patient engage- Stroke Council; Council on Epidemiology and
ment and shared decision making. Prevention; Council on Quality of Care and
Outcomes Research. (2017). Telemedicine
nnSuggested Readings quality and outcomes in stroke: A scientific
Bashshur, R. L., Shannon, G. W., Krupinski, statement for healthcare professionals from
E. A., Grigsby, J., Kvedar, J. C., Weinstein, the American Heart Association/American
R. S., et al. (2009). National telemedicine ini- Stroke Association. Stroke, 48, e3–e25. This is
tiatives: Essential to healthcare reform. an updated systematic evidence-based review
Telemedicine and e-Health, 15, 600–610. This of scientific data examining the use of tele-
paper discusses cost-benefit tradeoffs associ- medicine for stroke care delivery. Published
ated with telemedicine within the context of studies are categorized according to their level
large-scale efforts promoting health care of certainty and class of evidence.
reform in the United States.
Chi, N. C., & Demiris, G. (2015). A systematic ??Questions for Discussion
review of telehealth tools and interventions to 1. Telehealth has evolved from systems
support family caregivers. Journal of designed primarily to support
Telemedicine and Telecare, 21, 37–44. This consultations between clinicians to
review focuses on telehealth applications tar- systems that provide direct patient
geting either solely the family caregiver of a care. This has required changes in
patient or the dyad (caregiver and patient) hardware, user interfaces, software,
examining the impact of telehealth on care- and processes. Discuss some of the
giver outcomes. Six categories of telehealth changes that must be made when a
interventions for caregivers were identified: system designed for use by health care
education, consultation (including decision professionals is modified to be used
support), psychosocial/cognitive behavioral directly by patients.
therapy, social support, data collection and 2. There are still some challenges regarding
monitoring, and clinical care delivery. Studies reimbursement for telemedicine services.
demonstrate caregiver satisfaction as well as Imagine that you are negotiating with an
reduction of caregiver anxiety and burden. insurance carrier to obtain reimburse-
Reed, M. E., Parikh, R., Huang, J., Ballard, ment for a store-and-forward telemedi-
D. W., Barr, I., & Wargon, C. (2018). Real- cine service that you have developed.
time patient-provider video telemedicine inte- The medical director of the second
grated with clinical care. New England Journal insurance payer states: “Telemedicine
of Medicine, 379, 1478–1479. Kaiser seems like ‘screening’ rather than a
Permanente Northern California began offer- mechanism for delivering health care.
ing telemedicine visits enabling patients to use This is because you are simply using
videoconferencing on a mobile phone, com- technology to identify patients who need
puter or tablet to communicate with their phy- to be referred to a real doctor, rather
sicians. In this study 210,383 video visits than providing true medical care.
conducted over three years (involving 2796 Therefore, we should only reimburse a
primary care providers and 152,809 patients) very small amount for these screening
were examined. The telemedicine visits services.” In your opinion, is this a legit-
extended established patient–physician rela- imate argument? Explain.
tionships and led to high levels of patient sat- 3. Using telehealth systems, patients can
isfaction (93% of surveyed patients responded now interact with multiple health care
that the video visit met their needs). stakeholders and monitor multiple
Wechsler, L. R., Demaerschalk, B. M., Schwamm, aspects of their health generating large
L. H., Adeoye, O. M., Audebert, H. J., Fanale, data sets. In order to inform timely and
690 M. F. Chiang et al.
Ricci, M. A., Caputo, M., Amour, J., et al. (2003). project improves survival for veterans with liver dis-
Telemedicine reduces discrepancies in rural trauma ease. Hepatology, 68(6), 2317–2324.
care. Telemedicine Journal and E-Health, 9, 3–11. Swigert, T. J., True, M. W., Sauerwein, T. J., & Dai, H.
Richter, G. M., Williams, S. L., Starren, J., Flynn, J. T., (2014). U.S. air force telehealth initiative to assist
& Chiang, M. F. (2009). Telemedicine for retinopa- primary care providers in the management of diabe-
thy of prematurity diagnosis: Evaluation and chal- tes. Clinical Diabetes, 32, 78–80.
lenges. Survey of Ophthalmology, 54, 671–685. Takahashi, P. Y., Pecina, J. L., Upatising, B., Chaudhry,
Rosenfeld, B. A., Dorman, T., Breslow, M. J., et al. R., Shah, N. D., Van Houten, H., Cha, S., Croghan,
(2000). Intensive care unit telemedicine: Alternate I., Naessens, J. M., & Hanson, G. J. (2012). A ran-
paradigm for providing continuous intensivist care. domized controlled trial of telemonitoring in older
Critical Care Medicine, 28, 3925–3931. adults with multiple health issues to prevent hospi-
Saffle, J. R., Edelman, L., Theurer, L., et al. (2009). talizations and emergency department visits.
Telemedicine evaluation of acute burns is accurate Archives of Internal Medicine, 172, 773–779.
and cost-effective. The Journal of Trauma, 67, 358– United States Department of Health & Human Services.
365. (2020). Notification of enforcement discretion for
Sarkar I.N., Starren J. (2002). Desiderata for personal telehealth remote communications during the
electronic communication in clinical systems. COVID-10 nationwide public health emergency.
J Am Med Inform Assoc, 9(3):2009–16. https://www.h hs.g ov/hipaa/for-professionals/spe-
Starr P. (1982). The Social Transformation of American cialtopics/emergency-preparedness/notification-
Medicine. New York: Basic Books. enforcement-discretiontelehealth/index.h tml.
Starren, J., Hripcsak, G., Sengupta, S., Abbruscato, Accessed 5 May 2020.
C. R., Knudson, P. E., Weinstock, R. S., & Shea, S. United States Federal Communications Commission.
(2002). Columbia University’s informatics for diabe- 2019. Broadband deployment report (2019). https://
tes education and telemedicine (IDEATel) project: docs.fcc.gov/public/attachments/FCC-19-44A1.pdf.
Technical implementation. Journal of the American Accessed 30 May 2020.
Medical Informatics Association: JAMIA, 9(1), Wechsler, L. R., Demaerschalk, B. M., Schwamm,
25–36. L. H., Adeoye, O. M., Audebert, H. J., Fanale, C. V.,
Steventon, A., Bardsley, M., Billings, J., Dixon, J., Doll, Hess, D. C., Majersik, J. J., Nystrom, K. V., Reeves,
H., Hirani, S., Cartwright, M., Rixon, L., Knapp, M. J., Rosamond, W. D., Switzer, J. A., & American
M., Henderson, C., Rogers, A., Fitzpatrick, R., Heart Association Stroke Council; Council on
Hendy, J., Newman, S., & Whole System Epidemiology and Prevention; Council on Quality
Demonstrator Evaluation Team. (2012). Effect of of Care and Outcomes Research. (2017).
telehealth on use of secondary care and mortality: Telemedicine quality and outcomes in stroke: A sci-
Findings from the whole system demonstrator clus- entific statement for healthcare professionals from
ter randomised trial. BMJ: British Medical Journal, the American Heart Association/American Stroke
344, e3874. Association. Stroke, 48, e3–e25.
Su, G. L., Glass, L., Tapper, E. B., Van, T., Waljee, Wootton, R., & Bonnardot, L. (2015). Telemedicine in
A. K., & Sales, A. E. (2018). Virtual consultations low-resource settings. Frontiers in Public Health, 3,
through the veterans administration SCAN-ECHO 3. https://doi.org/10.3389/fpubh.2015.00003.
20
693 21
Contents
This chapter is adapted from an earlier version in the third and fourth edition authored by Reed
M. Gardner and M. Michael Shabot.
References – 728
Patient Monitoring Systems
695 21
nnLearning Objectives physiologic data. Increasingly, such data are
After reading this chapter, you should be collected by noninvasive sensors connected to
able to answer these questions: patients in ICUs, neonatal ICUs, operating
1. What is patient monitoring and why is it rooms (ORs), labor and delivery suites, emer-
used? gency departments, and other hospital care
2. What patient parameters do bedside units where patient acuity is increased.
physiological monitors track? We often think of a patient monitor as
3. What are the major problems with ac- something that watches for, and warns about,
quisition and presentation of monitor- serious or life-threatening events in patients,
ing parameters? and provides guidance for care of the criti-
4. In addition to bedside physiologic pa- cally ill. Such systems must include continu-
rameters, what other information is ous observations of a patient’s physiologic
fundamental to the care of acutely ill measurements and the assessment of the
patients? function of attached life support equipment.
5. Why is real-time computerized deci- Such monitoring is important in detecting
sion support potentially more benefi- life-threatening conditions and guiding man-
cial than monthly or quarterly quality- agement decision making, including when to
of-care reporting? make therapeutic interventions and to assess
6. What technical and social factors must the effect of those interventions.
be considered when implementing real- In this chapter, we discuss the use of com-
time data acquisition and decision sup- puters in collecting, displaying, storing, and
port systems? interpreting clinical data, making therapeutic
recommendations, and alarming and alerting.
In the past, most monitoring data (called vital
21.1 What is Patient Monitoring? signs) were in the form of HR and respira-
tory rate, blood pressure (BP), and body tem-
Life, from the physiologic standpoint requires perature. However, today’s ICU monitoring
supplying oxygen to tissues to address the systems are able integrate data from bedside
metabolic needs for the purpose of fueling monitors and devices, as well as data from
mitochondrial respiration in cells. When this many sources outside the ICU. Although
cycle is broken humans become critically ill. the material presented here deals primarily
That physiologic cycle could be controlled with patients who are in ICUs, the general
through oxygenation and perfusion monitor- principles and techniques are also applicable
ing but there are no direct methods existing to other hospitalized patients and electronic
to measure mitochondrial respiration. All medical records (EMRs). Patient monitor-
modern monitoring methods are proxy meth- ing is performed extensively for diagnostic
ods for such processes. In hospitals, and espe- purposes in the emergency department or for
cially in the intensive care unit (ICU), patient therapeutic purposes in the OR. Techniques
monitoring becomes critical for control and that initially were only used in the ICU such
optimization of hemodynamic, ventilation, as bedside monitors are now used routinely on
temperature, nutrition, and metabolism of the general hospital wards and in some situations
human body. even by patients in their homes.
Measurement of patient physiologic
parameters such as heart rate (HR), heart
rhythm, arterial blood pressure (ABP), respi- 21.1.1 Case Report
ratory rate, and blood-oxygen saturation, have
become common during the care of the hos- This case report provides a perspective on the
pitalized and, especially, critically ill patients. problems faced by the team caring for a criti-
When accurate and prompt decision making cally ill patient.
is crucial for effective patient care, bedside A 27-year-old man is injured in an auto-
monitors are used to collect, display, and store mobile accident and has multiple chest and
696 V. Herasevich et al.
head injuries. His condition was stabilized at Unfortunately, a few days later, the patient
the scene of the accident by skilled paramed- is beset with a problem common to multiple
ics using a portable computer-based electro- trauma victims—he develops a major nosoco-
cardiogram (ECG) and pulse oximeter, and mial hospital-acquired infection, sepsis, and
he is quickly transported to a trauma center. acute respiratory distress syndrome (ARDS).
Once in the trauma center, he is connected Multiple organ failure follows. As a result,
via noninvasive sensors to a computer-based antibiotics and electrolytes are required for
bedside monitor that displays physiologic sig- treatment and are dispensed via intravenous
nals, including his HR and rhythm, arterial (IV) pumps. The quantity of information
oxygen saturation, and BP. Radiographic and required to care for the patient has increased
magnetic resonance imaging provide further dramatically; with monitor data excluded,
information for care. critically ill patients produce in average 10
Because of the severe chest injury, the times more clinical data than patients on the
patient has difficulty breathing, so he is typical general hospital unit (. Fig. 21.2)
connected to a computer-controlled venti- (Herasevich et al. 2012).
lator that has both therapeutic and moni- Decision-making for patients with com-
toring functions and he is transferred to the plex rapidly changing acute conditions
ICU. Because of the head injury a bolt is requires much data in addition to monitoring
placed in a hole drilled through his skull and data (. Fig. 21.3) (Bradshaw et al. 1984).
a fiber optic sensor is inserted to continuously Increased data flow, rapid changes in
measure intracranial pressure with another the patient’s state, and a multidisciplinary
computer-controlled monitor. Blood is drawn approach involving different care providers
and clinical chemistry and blood gas tests are required the next generation of data manage-
promptly performed by the hospital labora- ment clinical systems outside of distributed,
tory. Results of those tests are displayed to the often not connected, multiple patient moni-
ICU team as soon as they are available. With tors and data from hospital EMRs.
intensive treatment, the patient survives the In 7 Sect. 21.6 (“Monitoring and
initial threats to his life and he now begins the Advanced Information Management in
long recovery process. . Figure 21.1 shows a ICUs”) we describe two integrated patient
nurse at the patient’s bedside surrounded by a monitoring systems: the HELP system used
bedside monitor, infusion pumps, a ventilator, by Intermountain Healthcare’s Hospitals and
and other devices. the AWARE system developed at Mayo Clinic
Hospitals. Both of these systems integrate
diverse clinical data for complex decision
making.
.. Fig. 21.2 Average number of total clinical data points per patient hour, excluding vitals, before and after admis-
sion to the ICU (time zero). Bars in red indicate first hours in ICU. Y-axis indicated number of data points
Pulmonary artery
Echocardiography
V. Herasevich et al.
Pulse oximetry
Temperature Photopletismography
Capnography
Respiratory Rate
Speciality monitoring Respiratory monitoring
Diaphragmic function
Positive end-expiratory pressure
Ventilation
Peak inspiratory pressure
.. Fig. 21.4 Different types of patient physiologic monitoring. ECG indicates electrocardiogram, EEG electroencephalogram
Patient Monitoring Systems
699 21
Korotkoff, applied Riva-Rocci’s cuff with a
stethoscope developed by the French phy-
sician Rene Laennec, which allowed the
measurement of both systolic and diastolic
arterial pressure. Harvey Cushing, a preemi-
nent US neurosurgeon of the early 1900s,
predicted the need for and later insisted on
routine ABP monitoring in the OR. At the
same time, Cushing also raised the following
questions, which are still being asked today:
1. Are we collecting too much data?
2. Are the instruments used in clinical medi-
cine too accurate?
3. Would not approximated values be just as
good?
.. Fig. 21.6 Waveforms on typical bedside monitor. and central venous blood pressures and their derived
Displays from the monitor show the real-time beat-by measures. (Courtesy of Royal Philips, with permission)
ECG, pulse oximeter, and arterial, pulmonary artery,
introduced by the Nihon Kohden company the oscilloscope to determine if there was a
from Japan in 1967 and was as large as the cardiac arrest or other life-threatening cardiac
patient bed. rhythm. Soon after these analog systems were
As a result of the detailed ECG informa- developed, methods for generating high- and
tion provided by the new patient monitors, low-HR alarm thresholds were included. The
treatment for serious cardiac arrhythmias alarms were usually audible and very annoy-
(heart rhythm disturbances) and cardiac ing. Unfortunately, since the beginning of the
arrest (abrupt cessation of heartbeat)—major use of these alarms, the false-positive rate has
causes of death after myocardial infarctions far exceeded the true positive rate. As a result,
(heart attack)—became possible. Mortality many times alarm systems for bedside moni-
rates from 1960 to 1970 were about 35%, tors are ignored or turned off. The problem
dropped to about 23% between 1970 and 1980 of alert fatigue in hospitals is still considered
and to about 20% between 1980 and 1990. a challenging technological problem and con-
During the 1990s reperfusion of the coro- tributes to poor patient outcomes including
nary arteries became common and mortality deaths.
rate dropped to about 5% (Braunwald 1988; Teams from several cities in the United
Rogers et al. 2000). States in the 1960s introduced computers into
In the 1960s, bedside monitors were built the ICU to assist in physiologic monitoring,
using analog computer technologies. These beginning in Los Angles with Shubin and
systems amplified the ECG signal and dis- Weil (1966) followed by Warner et al. (1968)
played the results on an oscilloscope. Such sys- in Salt Lake City. These investigators had sev-
tems required nurses or technicians to watch eral objectives:
702 V. Herasevich et al.
dynamic response characteristics must be devices using the pulse-pressure method have
optimized. Optimization is typically done by appeared. The issue is still active, with such
doing a fast flush test (by pushing sterile saline publications as Chen et al. (2009), Sun et al.
through the tubing) to optimize the system by (2009), and Gardner and Beale (2009). Other
removing blood and very tiny air bubbles that estimates of stroke volume and cardiac output
can dramatically distort the ABP waveform have been made from determining the bioreac-
and result in erroneous measures of systolic tance – a measure of the degree of phase-shift
and diastolic pressure. in the electrical signal—across the chest. This
At least two types of artifacts in the ABP method shows promise of being a rather sim-
signal are commonly observed. If a patient ple, continuous and noninvasive method for
rapidly moves or a care giver bumps the tub- measuring cardiac output (Keren et al. 2007).
ing, a pressure artifact is generated and trans- Investigations have made assessments of
mitted to the transducer and displayed. In delta pulse pressure, which measures the vari-
addition, when the clinical staff draws arterial ability of the peak to peak arterial pressure
blood for laboratory tests, they typically turn pulse signal across the breathing cycle to make
off the stopcock connected to the transducer an estimate of a patient’s fluid balance. The
and draw blood through the tubing, causing supposition is that if there is larger variability
an immediate loss of the pulsatile ABP signal. in this delta pulse pressure marker the patient
The pressure sensed by the transducer then may require fluid administration (Deflandre
typically rises to that found in the pressurized et al. 2008).
flush solution. Thus, continuous vigilance It is clear that future methods that process
on the part of nurses and other care givers is available physiologic signals will be applied
needed for the arterial catheter and monitor- to enhance and improve the availability of
ing systems to be properly maintained. As a important measures of cardiac function, a
historical note, the continuous flush device key parameter for making treatment decisions
was developed over 50 years ago to prevent used by critical care caregivers.
arterial catheters from clotting and to allow
one of the pioneering computerized monitor-
ing systems to become more reliable (Gardner 21.5.3 ulse Oximeter Signal
P
et al. 1970). Since that time, investigators have Acquisition and Processing
developed computerized methods to minimize
these “human caused artifacts” (Li et al. 2009; One of the most common technologic devices
Gorges et al. 2009). Unfortunately, these used in hospitals today is the pulse oximeter;
strategies have seldom been implemented into continuous monitoring of oxygen saturation
commercially available bedside monitors. became a standard of practice in the early
Since the early 1900s, efforts have been 1990s (Brown et al. 1990). The pulse oximeter
made to estimate cardiac output from the pul- sensing device is usually placed on a finger and
satile pressure in the arterial system by multi- measures oxygen saturation and pulse rate, or
plying the HR with estimates of stroke volume HR (Clark et al. 2006). The modern device
(the volume of blood ejected from the heart works by shining red and infrared light gener-
during a single contraction) made from the ated by two light emitting diodes through the
pressure waveform. Warner and his colleagues tissue. With each arterial pulse there is a varia-
at Mayo Clinic published some early work in tion in the light as it passes through the tissue
1953 (Warner et al. 1953) on the topic and and is sensed by a light-sensitive photodiode
followed up again in 1983 further substanti- on the opposite side. The more oxygenated
ating the feasibility of the method. However, the blood is, the more red light is transmit-
Cundick and Gardner (1980) showed that the ted, with less infrared light passing through.
widely varying mean BPs found in critically By calibrating these devices, reasonably accu-
21 ill patients adversely affected the reliability rate estimates of oxygen saturation (SpO2) can
of the method. Since that early work, mul- be determined. Although the pulse oximeter
tiple publications and commercially available is convenient and easy to use, it has several
Patient Monitoring Systems
705 21
important limitations, including motion arti- segment of pulse oximetry monitoring after
fact, when the patient moves, and other physi- acquiring Covidien, the previous leader of
ologic considerations such as anemia, low pulse oximetry monitoring. With one of most
perfusion state and low peripheral skin tem- extensive product lines, Philips Healthcare is
perature. If the blood flow to the hand gets leading multiparameter vital sign m onitoring,
disturbed, by perhaps squeezing the arm dur- wireless telemetry, and fetal/neonatal moni-
ing BP with a sphygmomanometer, the blood toring markets.
flow to the hand is interrupted and the pul- The other major manufacturers in the
satile BP signal required for the pulse oxim- patient monitoring market are: GE Healthcare,
eter is no longer available. The pulse oximeter Masimo, Edwards Lifesciences, Mindray
is one of ICU monitoring devices with most Medical, Natus Medical, Welch Allyn,
false alarms (Malviya et al. 2000). Omron Healthcare, Honeywell Life Sciences,
Nihon Kohden, Spacelabs Healthcare, St.
Jude Medical, Nonin Medical and Boston
21.5.4 edside Data Display
B Scientific. Each vendor of bedside monitors
and Signal Integration has made a “best effort” at displaying the
variety of physiologic signals derived. In most
While colorful and dynamic, the displays cases this consists of three channels: ECG,
on the bedside monitor can be complex. ABP, and pulse oximetery. Additional impor-
. Figure 21.6 shows a typical bedside moni- tant physiologic parameters can be derived
tor display as one example. from these signals, as noted in . Table 21.1.
In 2018 Medtronic led the US patient mon- Today’s bedside monitors still present
itoring device market by capturing the largest both waveforms and derived parameters in
a single-sensor-single-indicator format. That Bus (MIB) is the simple term used to desig-
is, for each individual sensor attached to the nate CEN ISO/IEEE 11073. So, why has the
patient there is a single indicator, a waveform MIB been a commercial failure to this point?
with derived values presented on the screen There are multiple reasons; unfortunately,
(Drews et al. 2008). One of the simplistic con- the MIB standard was designed during the
sequences of this display strategy is that each time when serial communications via RS-232
indicator is treated as if it had come from a was the norm; there were no Universal Serial
different patient. For example, if ECG, ABP, Bus (USB) interfaces or convenient wireless
and pulse oximeter signals were displayed, devices (Wi-Fi or Bluetooth) at the bedside.
they would each have the capability of deter- Furthermore, each vendor of bedside devices
mining HR. Thus, three different HR mea- and ICU data management systems would
sures might be displayed. Although there are like to be the “data integrator” (for a price)
physiologic reasons for such differences, the and thus has little incentive to adhere to
most common situation is that the HR should standards that would allow other vendors to
be an integrated assessment of the three sig- compete for the integrator role. The business
nals since artifact is a far more common event model apparently has not worked (Kennelly
than the unusual conditions that would cause and Gardner 1997; Mathews and Pronovost
the differences in HR. Studies suggest that 2011).
there are better methods for designing hemo- In spite of the lack of interface standards,
dynamic monitoring displays (Doig et al. the group at Intermountain Healthcare, LDS
2011; Drews et al. 2008). Hospital (Salt Lake City, Utah) has been
A more important problem relates to the actively interfacing ventilators, IV pumps, and
integration of data from multiple bedside similar devices for over three decades (Dalto
devices. Two examples will illustrate the prob- et al. 1997; Vawdrey et al. 2007). Another
lem: effort is The Medical Device “Plug-and-
1. The patient’s pulse oximeter has shown a Play” Interoperability Program at Partners
recent increase of SpO2. However, the bed- HealthCare (Boston, Massachusetts) –
side monitor has no knowledge that the 7 http://www.mdpnp.org. The open source
respiratory therapist has increased the result is OpenICE, implementation frame-
FIO2 from 30% to 40% on the ventilator. work of an Integrated Clinical Environment.
2. The patient’s HR has recently increased
from a dangerously low value of 45 beats
per minute to 72 beats per minute. 21.5.5 Challenges of Bedside
Unfortunately, the bedside monitor has no Monitor Alarms
way of knowing that a nurse has increased
the drip rate of a cardioactive medication. Care of the critically ill is complex and chal-
lenging. Most of these patients have medical
Patients in today’s ICUs can have 50 or more problems or injuries that are life threatening.
electronic devices attached (Mathews and They might have heart problems that within
Pronovost 2011). Many of these electronic minutes could result in sudden death, or they
devices were developed by independent com- might have breathing problems that require
panies and do not easily interface or commu- mechanical ventilation to maintain life. As a
nicate with each other. However, even though consequence, each of these situations requires
the larger monitoring companies have pur- intense minute-by-minute observation with
chased several of the specialty monitoring real-time, continuous physiologic monitor-
companies, problems still exist although it was ing. For those conditions, the requirement for
understood more than three decades ago, and record keeping, monitoring, and alarming is
standards for bedside data interchange (CEN intense.
21 ISO/IEEE 11073) (Gardner et al. 1989, 1991) There are clear expectation that bedside
were developed. The Medical Information physiologic monitors, ventilators, IV pumps,
Patient Monitoring Systems
707 21
and similar devices attached to patients 2. Monitoring the monitors – beyond risk
should provide true and valid alarms and that management (Thompson and Mahajan
caregivers will be promptly notified and pro- 2006)
vide the needed care immediately for those 3. Alarms and human behavior: Implications
patients (Kowalczyk 2011). On the other for medical alarms (Edworthy and Hellier
hand, a report from the New England Journal 2006)
of Medicine outlines 24 electronic require- 4. Alarms in the intensive care unit: Too
ments for classification of a hospital as hav- much of a good thing is dangerous: Is it
ing a comprehensive electronic record system time to add some intelligence to alarms?
(Jha et al. 2009), yet recording of data from (Blum and Tremper 2010)
bedside physiologic monitoring systems with 5. Intensive care unit alarms – How many do
their alarming systems and data gathering we need? (Siebig et al. 2010)
from other bedside devices such as ventilators
and IV pumps were not even mentioned. . Figures 21.7 and 21.8 give examples of the
So, currently there is a curious and inex- complexity of determining whether an alarm is
plicable set of expectations being generated true or false based on two life-threatening con-
for care of the critically ill patients. As a con- ditions. Alarms for ventricular tachycardia are
sequence, there are niche vendors who have shown in . Fig. 21.7. . Figure 21.7a shows
built their own data gathering and recording a true ventricular tachycardia alarm condi-
systems and nurse charting systems; in some tion while . Fig. 21.7b shows a false ventric-
cases these systems include simple interfaces ular tachycardia condition. . Figure 21.7b
to allow them to acquire laboratory data and has only a few seconds of ECG artifact, which
perhaps data from the administrative admis- causes the bedside monitors’ alarm detection
sions process. They may even include bedside system to issues an alarm.
computers or displays to allow care givers to Arterial hypotension alarms are shown
have access to such things as radiographic in . Fig. 21.8. . Figure 21.8a shows a true
images, dictated reports, and others. However, arterial hypotension alarm condition while
these systems are stand-alone devices and do . Figure 21.8b shows a false condition. If
not typically provide interfaces to transmit the monitor or human observer only watches
their physiologic data to the hospital’s EMR. the ABP signal, the two conditions appear
In the past, the number of physiologic similar. However, by simultaneously follow-
signals that can and are being monitored has ing the ECG signal, the human observer will
grown. With each signal and derived param- note that for some unknown reason the ABP
eter that is added there is typically a high and signal displays a false representation of the
low alarm added to warn the clinical staff of patient’s pulsatile BP. The unknown reason is
actual or impending patient crisis. Alarms may likely related to the catheter and tubing parts
be highlighted on the bedside monitor’s screen of the arterial monitoring system. Alerting
by using a color change or flashing indicators. the clinical staff to examine the catheter and
Most alarms also generate a sound. transducer system is certainly appropriate.
Imhoff and Kuhls (2006) noted from 1.6 Biomedical informatics specialists, bio-
to 14.6 alarms for each ICU patient each medical engineers, and bedside monitor ven-
hour; up to 90% of those alarms were false. dors have recently renewed their efforts to
Alarm overload is clearly a significant issue reduce false alarms and improve the relevance
in ICU monitoring; from clinical informatics of existing alarms. Most of the false alarms
professionals working in the ICU is needed are caused by noise or artifacts in the primary
to minimize the number of false alarms. Just signals. To help minimize these problems, two
noting the titles of several editorials and arti- examples are used to illustrate the challenges
cles should be informative: and opportunities to improve bedside alarms.
1. Alarms in the intensive care unit: How can 1. After observing over 200 hours of alarms
the number of false alarms be reduced? from bedside monitors and ventilators in
(Chambrin 2001) an adult medical ICU, Gorges and his
708 V. Herasevich et al.
.. Fig. 21.7 Ventricular tachycardia alarm conditions. reduced. b A false alarm caused by artifact in the ECG
a A true alarm; note that the ventricle is still pumping signal; note the ABP waveform is stable during the same
but that the arterial pulse pressure is dramatically time interval. ABP indicates arterial blood Pressure,
ECG electrocardiogram, v-tach ventricular tachycardia
c olleagues (2009) used the data recorded to derive HR: ECG 1, ECG 2, ECG 3, ABP,
recommend a two-step process that would and pulse oximeter. Since the probability
dramatically reduce the number of false of all those signals having an artifact is
alarms. The first step was to add a 19 sec- smaller than any single physiologic signal,
ond delay into the alarming system. That smart alarm algorithms that are more
step by itself reduced the number of alarms robust should be possible. Two investiga-
by 67%. They then noted that by having tors have developed and tested such algo-
some method for automatically detecting rithms (Zong et al. 2004; Poon 2005). The
when a patient was being suctioned, repo- Zong pressure alarm algorithm reduced
sitioned, given oral care or being washed, false alarms from 26.8% to 0.5%. Poon
there would be a further 13% reduction of found that the usual HR and rhythm alarm
ineffective alarms. By using these just these system produced 65.4% false alarms, while
two methods, almost 80% of the false an algorithm that integrated multiple sig-
alarms could be eliminated. nals generated only 31.5% false alarms.
2. Using multiple signals to derive identical Two other findings from the Poon study
measures should be an effective method of were also encouraging. By merely delaying
reducing false alarms (Herasevich et al. the alarms by 10 seconds there was a 60%
21 2013). As will be noted in . Fig. 21.7, reduction in false alarms. In addition, he
there are five signals that can be used to found that default settings for high and
Patient Monitoring Systems
709 21
.. Fig. 21.8 Arterial hypotension alarm conditions. a alarm; note for some nonphysiologic reason the arterial
A true alarm; note the normal ventricular beats fol- pressure signal loses its pulsatile characteristics and then
lowed by ventricular fibrillation that renders the heart eventually it returns. ABP indicates arterial blood pres-
unable to generate an effective blood pressure. b A false sure, ECG electrocardiogram
The first is the simplest and still widely tors, especially if the false alarm rate can be
in use: nurses observe data presented on the minimized. In addition to acquiring 15-minute
bedside monitor screen and manually key-in median data, one may wish to detect bedside
the observations into an integrated EMR. As alarms and record data in the intervals just
simple as this may be to implement, such before and just after these alarms. Thus, there
manual data collection strategy is inefficient, is still opportunity for informaticians to make
error prone up to 15% when documented on major improvements in both data recording
paper and then typed into EMR (Wager et al. and bedside monitoring alarms.
2010), and does not collect representative data
gathered by the bedside monitor.
The second strategy used by ICU infor- 21.6 Monitoring and Advanced
mation systems is to acquire vital sign data Information Management
directly from the bedside monitoring system’s
in ICUs
network by using an HL7 feed (see 7 Chap.
7). The information is automatically gathered
21.6.1 arly Pioneering in ICU
E
by the ICU information system and nurses
have the option of either accepting or modify- Systems
ing the data. In typical clinical settings, nurses
perform the selection and transfer of bedside As electronic information gathered in ICUs
monitoring data from the ICU information and hospitals started growing, problems with
system to the EMR about once an hour. These bedside monitors, alarms, and data integra-
ICU information systems typically retain the tions become more apparent. The next step in
high frequency bedside monitoring data and information management began 50 years ago
can achieve near-real-time computerized deci- at LDS Hospital where a team of people devel-
sion support. In many cases, the nurse’s notes oped what was known as the HELP (Health
are also entered into the ICU information Evaluation Through Logical Processing)
system – generally once per shift – and some System (Pryor et al. 1983; Kuperman et al.
summary vital sign information may find its 1991; Gardner et al. 1999).
way into those notes. Physician progress notes HELP was the first hospital informa-
are also entered into ICU information sys- tion system to collect patient data needed
tems in a similar fashion. Unfortunately, data for clinical decision making and at the same
in the ICU information system may never find time incorporate a medical knowledge base
its way into the hospital’s EMR. For these sys- and inference engine to assist the clinician
tems, the ICU data are usually archived sepa- in making decisions. The HELP system has
rately. As a consequence, these data cannot been operational at LDS Hospital since 1967.
be used for real-time decision making by the The system initially supported a heart cath-
hospital’s EMR. eterization laboratory and a postoperative
The third strategy is to have the ICU infor- open heart ICU. Initially only physiologic
mation system or the hospital’s EMR system data were acquired from the bedside moni-
automatically transfer vital sign data from the tors. Nursing note charting promptly followed
bedside monitoring system to the EMR. Most with ability to chart medications ordered and
systems that automatically gather data with given, including IV drip rates. Soon, it became
this strategy take a median of the vital sign apparent that much of the data needed to care
data over a 15 minute time interval to smooth for these critically ill patients came from the
the data (Warner et al. 1968; Gardner et al. clinical laboratory and other sites such as
1991; Vawdrey et al. 2007). This strategy pro- radiology. As a consequence, multiple mod-
vides real-time data for computations and ules were added to the HELP system to sup-
computerized decision support for the hospi- port the ICUs.
Clinical decision making in the ICU is
21 tal’s EMR and is the preferred strategy.
complex. Physicians, nurses, respiratory
There are opportunities to improve the
automated data gathering from bedside moni- therapists, pharmacists, and others evaluate
Patient Monitoring Systems
711 21
AWARE
Dashboards/viewers
“Sniffers Multipatient Single patient Rounding tool
or smart alerts” viewer viewer (Checklist)
Communication tools
Administrative Task list and
Hand over Claim patient
dashboard whiteboard
.. Fig. 21.9 Essential elements and logical structure of the Ambient Warning and Response Evaluation System
(AWARE) clinical management system
facilitating real-time access to key informa- 21.6.3 cquiring the Data: Quality
A
tion needed for timely medical and interven- and Timeliness
tional decision making at the point of care.
2. Sniffers or rule-based smart alerts. A fundamental part of any computerized
Continuously survey both the patient con- decision support system, just as with any
dition and provider actions detecting human clinical decision support system, is
potential mismatches and preventing the acquisition of data. Clinicians develop
potential errors before they occur. observational, interpersonal, and technical
3. Communication tools. AWARE white- skills as they collect accurate patient data.
board, task-list, readiness for discharge and Likewise, a computerized decision support
claim patient functions facilitate communi- system depends on high-quality, timely data.
cation between team members and during In many ICUs today, much medical data still
transitions of care, thereby preventing com- continues to be entered into computerized
mon errors of communication omission. patient records as scanned PDF files or in a
4. Checklist/ rounding tool. Designed to structured and coded formats while others
assists providers in developing and execut- (such as the progress note) are be stored in a
ing a coordinated daily plan of care. The free text format (either handwritten or typed)
easy-to-use interface minimizes clerical (Celi et al. 2001; Pickering et al. 2010; Ahmed
burden while simultaneously assuring et al. 2011; Hripcsak et al. 2011). As noted
adherence to patient-centered best care in 7 Chap. 8, natural language processing
practices and regulatory requirements. of free text to obtain coded and structured
5. Administrative dashboard. Feedback and information has seen great improvement over
reporting tool enables easy access to quality the past decades; however, the process is still
improvement metrics and patient outcomes far from perfect and all processing is delayed
for administrators and oversight groups, until data is entered to EMR. Such delays is
which facilitates rapid-cycle management limiting factor to use such free text data in real
21 changes based on continuous feedback. time monitoring systems.
Patient Monitoring Systems
713 21
As designers of clinical monitoring sys- acquiring ICU data to EMR either automati-
tems look at acquiring and entering clinical cally or manually (Gardner et al. 1989, 1991;
data they must decide: Dalto et al. 1997; Nelson et al. 2005; Vawdrey
1. Who should enter the data: automated et al. 2007). Data from bedside monitors,
acquisition from electronic instruments ventilators and IV pumps should be acquired
(such as the bedside monitor) versus man- automatically with a real-time technology.
ual entry using a keyboard, bar code Data thus acquired is timely and by appro-
reader, touch screen, voice input, or some priate signal processing methods can be vali-
similar method. dated (Dalto et al. 1997; Vawdrey et al. 2007;
2. When to enter the data: accurate ICU deci- Ahmed et al. 2011; Lilly et al. 2011). Changes
sion making often requires data to be in ventilator settings such as FIO2 may only
acquired in a timely manner, sometimes be present for a few minutes, but blood-gas
within 1 minute of an event to make a measurements taken during that time interval
timely decision. will be misinterpreted if only manual elec-
3. Where to enter the data: this automated tronic charting is used. Similar interpretation
data will naturally be acquired from the errors were found to occur with IV pump drip
bedside monitor or instrument located at rate charting when manual charting methods
the bedside; manual data entry should were compared to automated acquisition.
optimally occur at the bedside as well. Gathering accurate, representative, and timely
4. How data should be collected: methods computerized ICU data requires attention to
should take into account the occurrence of detail and careful planning to assure its qual-
artifacts in the patient data; many EMR ity. Bedside charting systems have the ability
systems allow nurses to review and vali- to capture near real-time data from bedside
date bedside vital sign data minutes to devices, but the presentation layer usually
hours after they are collected, although showed normalized and averaged values.
this process does not meet the requirement
for real-time data collection and can lead
to “human” and computerized decision 21.6.4 Presentation of Data
support errors (Nelson et al. 2005; Vawdrey
et al. 2007) Once data have been collected, their qual-
5. How much data to collect: this is particu- ity verified, and the results stored, one must
larly an issue with systems such as bedside decide how the data should be presented.
monitors that can generate an HR, systolic Currently, most data are presented on a col-
and diastolic BP value for each heartbeat, orful screen. However, some care givers will
resulting in hundreds of thousands of val- still prefer a paper copy. Still others will prefer
ues per day; except for special situations to to view these reports on their smart phones
use in automatic decision support systems, or other mobile devices. For ICU patients,
the collection of such intensive data in it is clear that specialized reports must be
regular EMR is inappropriate. developed. The traditional method of seg-
mented reporting (separate reports for labo-
The process of developing and implementing ratory data, vital signs reports, medication
the systems for acquiring data involves not lists, etc.) has proven inadequate (Clemmer
only technology, but adapting that technol- 2004; Ahmed et al. 2011). The ICU group at
ogy to the human users; training those users Mayo Clinic has developed and tested an ICU
to properly use the new system is complex rounding tool (Pickering et al. 2010). Thus,
and difficult. Consequently, developers and one can see there is value in the integration
adopters of such systems should plan for and of and presentation of data. As of this writ-
be prepared for challenges that may take years ing, there is probably not a single ICU sum-
to implement and optimize system. mary report that will satisfy all ICU users.
Despite modern protocols like HL7 and Thus, such reports will require special effort
FHIR there are still major problems with for each institution and perhaps even each
714 V. Herasevich et al.
ICU within that institution. For example, the the ICU at Mayo Clinic, the team found that as
report generated for a thoracic ICU is unlikely they developed AWARE system, they required
to be identical to that required by the neona- similar data content (Pickering et al. 2010).
tal ICU. Accomplishing such tasks typically At LDS Hospital, the computerized nurse
requires 6 months or more, with continuous charting module allows nurses to enter patient
ongoing effort to update the report as new care tasks, qualitative and quantitative data,
data are acquired and caregivers needs evolve. and a patient’s response to therapy (Willson
1994; Willson et al. 1994; Nelson et al. 2005). In
addition, nurses interact with a pharmacy mod-
21.6.5 Establishing the Decision ule to chart all given medications including IV
Rules and Knowledge Base drip rates (Pryor 1989; Kuperman et al. 1991).
Soon after the nurse charting was imple-
Deciding on the decision rules that should mented at LDS Hospital, respiratory thera-
be installed in a computerized ICU decision pists chose to enter their qualitative and
support system is difficult. Health care is cur- quantitative ventilator data and care given to
rently driven by implementing evidence-based patients (Andrews et al. 1985; Gardner 2004).
protocols. However, few of these protocols The motivation for the online charting was to
have been computerized. The long-standing provide clinicians with access to timely and
work with the HELP system and some excit- accurate data to make patient care decisions.
ing work done at Mayo Clinic and at the In addition, these data could be used to imple-
University of Massachusetts are exceptions ment protocol-controlled ventilator weaning
(Clemmer 2004; Morris 2000; East et al. 1992; systems (East et al. 1992; Morris 2001).
Ahmed et al. 2011; Lilly et al. 2011). Using a To optimize the performance of routine
consensus process to develop treatment deci- care deemed essential for ICU patient recov-
sions is essential. However, generating a con- ery, computerized reminders were generated
sensus is a tedious, difficult, and slow process. (Oniki et al. 2003). For example, 1 of the
At the moment, the consensus process involv- goals of the reminders was to provide assis-
ing all the clinical caregivers in the ICU is the tance in determining the required level of
best approach, as rules developed by indi- sedation while avoiding oversedation. By pro-
viduals are often not widely accepted or used. viding the computerized reminders to nurses,
However, in some departments there may be charting deficiencies were reduced by 40%
trusted clinical leaders who become the “local and the number of deficiencies at the end of
expert.” Developing the rules for clinical deci- the shift was improved. To optimize care pro-
sion support is complex and those rules are vided by the reminders, real-time charting was
always subject to change. Development of required. However, during a quality improve-
appropriate rules can take up to 6 months ment process, it was determined that 29% of
and the rules will need to be continuously the medication errors that should have been
reviewed and updated (Gardner 2004; Ahmed prevented by online nurse charting were still
et al. 2011; Lilly et al. 2011). present. A careful evaluation revealed that the
actual nurse charting workflow was different
than that envisioned by the system planners.
Instead of charting the given medication using
21.6.6 linical Charting Systems:
C a bedside terminal, nurses administered the
Nurses, Pharmacists, medication and then at some later time, at the
Physicians, Therapists central nursing station, charted that the medi-
cation had been given. Consequently, errors
The major portion (43%) of the data used were occurring. After careful training and
at LDS Hospital for decision-making dur- feedback with the nursing staff, the real-time
21 ing ICU rounds came from clinical notes and charting rate increased from 40% to 75% and
data charted by nurses and other clinicians remained at that level a year later. This exam-
(Bradshaw et al. 1984). In a more recent study in ple shows that having computerized decision
Patient Monitoring Systems
715 21
support systems in place without having real- coded format so that real-time computerized
time data entry was ineffective. Conceptually, decision support can be used. . Figure 21.10
one could make the same logical observation shows a schematic of the HELP system at LDS
if the ICU were operating as a tele-ICU, as Hospital and . Fig. 21.11 shows AWARE
discussed later in this chapter. system at Mayo Clinic as examples of such
For generations, nurses and other caregiv- systems. Based on the data available to ICU
ers who have used conventional paper records management systems from these multiple
have had the notion that if their paper chart data sources, its computerized decision sup-
was up-to-date at the end of the shift then port system makes and displays suggestions
they had met their requirements for good for optimum care for the specific problems
patient care. Clearly, the above example shows such as sepsis and acute respiratory distress
that such a strategy is flawed. However, it is syndrome. The system provides audible and
interesting that even today reports are being visual alerts for life-threatening situations. In
made about charting and use of data for end- addition, the system organizes and reports the
of-shift nursing care exchanges and patient large amount of data so that the medical team
handovers, suggesting that the EMR still may can make prompt and reliable treatment deci-
not be real-time (Hripcsak et al. 2011; Collins sions. The patient’s physicians are automati-
2011). Collins (2011) found that clinicians cally alerted about life-threatening laboratory
preferred oral communications compared and other findings.
to EMR documentation and stated that the Much of the information required for
perceptions that the EMR was a shift behind ICU patient care comes from underlying
might have only been a manifestation of laboratories and devices that automatically
the lack of real-time charting by nurses and acquire data. In the upper right hand cor-
acquisition of real-time data from bedside ner of the HELP system diagram, data from
monitors in their ICU. the ventilator, IV pumps, and the bedside
An early survey of nurses and physicians monitor are noted. While most of the physi-
use of the HELP clinical expert system was ologic bedside monitor vendors now acquire
conducted in 1994 (Gardner and Lundsgaarde ECG, BP, and pulse oximetry data, they do
1994). The investigators were encouraged by a not provide access to data from ventilators
positive response from both nurse and physi- or information from IV pumps. As a con-
cian users who appreciated having the data sequence, data from these devices must be
available with interpretation and alerting fea- obtained by developing hardware and soft-
tures provided by the HELP system. At the ware interfaces (Gardner et al. 1991; Dalto
time the survey was conducted, ICU charting et al. 1997; Kennelly and Gardner 1997;
and decision support was a major feature of Vawdrey et al. 2007). Based on those studies,
the HELP system. It is exciting to note that it is clear that automatically collecting data
other institutions have begun to assess factors from all of these devices in real-time is more
related to acceptance of an EMR in critical timely and accurate than manually charted
care (Carayon et al. 2011). The Carayon study data collected by nurses or respiratory thera-
showed that ease of use, as well as data pre- pists. Although data from these devices can
sentation strategies, were major determinants contain artifacts, methods for minimizing
of acceptability of their system. those artifacts have been implemented in
operational systems.
Initially the MIB standard CEN ISO/
21.6.7 utomated Data Acquisition
A IEEE 11073 was designed to help gather data
From All Bedside Devices from bedside devices, but has not been widely
implemented (Mathews and Pronovost 2011).
Computer systems that support ICU patients Fortunately, battery power and wireless com-
are tightly integrated and data are auto- munications with IV pumps are now widely
matically gathered and stored, primarily in a available. By using wireless technology, inter-
716 V. Herasevich et al.
IV Pumps Physiological
DECISION Monitors
SUPPORT ECG
ENGINE Lab
Blood
ELECTRONIC Gas Lab
HEALTH Clinical & Microbiology
TIME DATA Laboratories
RECORD
DRIVER DRIVER
Infectious Diseases
X-Ray
Blood
Bank Financial System
Interpretations
Alerts
Suggestions Surgery Pathology
Computations Schedule
Information Review
Protocols Medical Physical
Guidelines Therapy
Admitting Records
Transcribed
Dictation
.. Fig. 21.10 Diagram of HELP the System Used by tion, Time Driver decisions are also made. Shown
Intermountain Healthcare’s Hospitals. At the center is schematically, in the upper right hand corner of the dia-
the database for the electronic medical record (EMR). gram are blocks representing ICU bedside devices
Data from a wide variety of clinical and administrative including the physiologic monitor, ventilator, IV pumps
sources flow into the EMR. As the data flows into the and barcode scanner. ECG indicates electrocardiogram,
EMR, the Data Driver capabilities of the HELP Deci- HELP Health Evaluation Through Logical Processing,
sion Support Engine (red circle) are activated. In addi- ICU intensive care unit, IV intravenous
faces with the IV pumps are fast, mobile, and 21.6.8 Rounding Process:
easy for nurses to implement and tangled wires Single Patient Viewer
are no longer an issue. In addition, communi-
cations with device can be carried throughout The care of critically ill patients in the ICU
the hospital – in the OR, while on transport, requires collaboration among a diverse team
and in the ICU. of very competent caregivers to achieve the
Although early studies of nurses and best care (Clemmer et al. 1998). The team-
therapists showed that computerized charting work and communications is required in this
took longer than manual charting, it is almost complex care process.
certain with automated acquisition available The rounds activity at LDS Hospital
today that charting takes less time and is more is an example of the collaborative process.
accurate. As a consequence, in institutions . Figure 21.12 shows the clinical care team
that have historically collected IV pump and during rounds. There are physicians, house
ventilator data automatically, there is a com- officers, advanced practice clinicians, nurses,
mitment to collect data from every bedside pharmacists, respiratory therapists, dieticians,
monitoring device. These include measures case managers, and others who gather each
of urine output, fluid drainage, and similar
21 measures.
day to assess each patient and make key care
Patient Monitoring Systems
717 21
.. Fig. 21.11 Diagram of AWARE system developed ment in clinical trials. On the top are schematically
and used at Mayo Clinic. The core component is middle- shown clinical applications or viewers that use data from
ware database that cached data from hospital EMR sys- middleware to generate user interfaces. ADT indicates;
tems (left). It utilized number of approaches such as DB API,; apps applications, AWARE Ambient Warning and
stored procedures and HL7. Synthesis is Mayo Clinic Response Evaluation System, EMR DB Electronic
home-grown API layer and DB. Data from discharged Medical Record Database, HL7 Health Level Seven,
patients archived and stored in Datamart repository. ICU intensive care unit, LIS Laboratory Information
Historical data is used for administrative reporting and System, MCHS Mayo Clinic Health System, OR oper-
cohort identification for research. The alerting module ating room, PACU Post Anesthesia Care Unit, SIRS
(right) is used for clinical sniffers and research enroll- Surgical Information System
decisions. The rounds leader is usually a phy- The information from the computer sys-
sician, but each team member is considered tem is organized to support the process. The
an equal partner, providing key information computerized record is the patient record.
(most of it stored in the computer record) and Information from other sources such as radio-
given the opportunity to discuss their inter- graphic images and free-text reports are also
pretation and make recommendations about readily available (Gurses and Xiao 2006).
the patient’s care. As a single patient viewer, AWARE
Over decades, the social process of con- extracts data relevant to the treatment of
ducting these rounds has created a very open ICU patients and presents it to the provider
and cooperative environment. The purpose of in a systems-based information package
rounds is to reduce errors from human fac- (. Fig. 21.13). AWARE content has been
tors, to give structure to the evaluation, and selected through the systematic observation
to make sure all sides of the decision process of frontline provider information needs and
are considered as each member considers the profiling of provider data utilization patterns.
decisions from their point of view. The user interface has been optimized to sin-
718 V. Herasevich et al.
.. Fig. 21.13 Overview of data on the AWARE single status (red, required urgent intervention; yellow, abnor-
patient viewer organized around human organs and sys- mality; white, normal physiology and investigations).
tems. Each block represents four elements. Reading Middle top is physiologic status of organ displays cur-
21 from left the key organizational elements are clinical
context pulled from the patient problem list, procedure,
rent values for key variables. Middle bottom is organ
supports, displays the critical care interventions which
medications, and consults lists prior to the current hos- are supporting the current physiologic status. Right part
pitalization. System identifying icons with color-coded of the block displays investigations
Patient Monitoring Systems
719 21
The popularization of checklists use in 21.6.9 ollaborative Process, ICU
C
the clinical medicine began with the land- Change-of-Shift,
mark publication by Dr. Atul Gawande team
and Handover Issues
(Haynes et al. 2009). It has been shown to
reduce errors and health care costs, increase
Cognitive scientists have taken an interest in,
compliance with evidence-based practice, and
and have studied, the dynamic and distrib-
ultimately improve outcomes in critically ill
uted work environment in critical care medi-
patients (Weiss et al. 2011).
cine (Patel and Cohen 2008; Patel et al. 2008;
The smart checklist is a component of
Ahmed et al. 2011). They have studied issues
AWARE (. Fig. 21.14). The system auto-
such as provider task load, errors of cogni-
matically detects patient characteristics
tion, and performance of clinicians involved
to configure a patient’s specific checklist.
in these complex tasks. The change-of-shift
For example, if a patient is not on the ven-
and handover times are especially critical and
tilator, no ventilator- related questions are
require complex exchanges of information
asked. In simulation, the study checklist sig-
that must occur rapidly and efficiently. These
nificantly reduced provider workload and
investigators have found that errors can occur
errors (Thongprayoon et al. 2016). Also, use
during this time because of corruption of
of the checklist in the ICU was associated
information and a failure to transfer crucial
with increased number of occupational ther-
care facts. Having the majority of the patient
apy/physical therapy consults in critically ill
record in electronic form and having that data
patients (Ali et al. 2017).
timely and accurate should allow optimiza-
.. Fig. 21.14 Overview of AWARE checklist. It is a clinical notes, and collect data for precise administrative
tool to aid during critical care rounds by helping apply and quality reports. AWARE indicates Ambient Warn-
best evidence care for every patient, generate meaningful ing and Response Evaluation System
720 V. Herasevich et al.
tion of computerized decision making tools floor in addition to problem list and current
and methods for sharing the patient data. The medications icons. The second row is alerting
Rounds Report developed at LDS Hospital area with a purple circle with a line across that
three decades ago and recent developments gives providers immediate knowledge that a
at Mayo Clinic provide laboratory models for patient’s code status is Do not Resuscitate/
better understanding the issues and improv- Do Not Intubate. The middle section of the
ing efficiency and eliminating medical errors box contains patient information such as the
for ICU patients during shift changes and patient’s name, medical record number, age,
patient handover times. The AWARE system number of days in the ICU, and teams car-
has incorporated a number of tools that sup- ing for the patient. The lower section of the
port ICU change of shift and handover pro- box contains the clinical information with the
cesses, including shared tasklist, claim the organ icons. These icons exist for the central
patient, and handover modules. Each tool nervous, cardiovascular, respiratory, gastro-
is designed to decrease the number of errors intestinal, renal, and hematologic systems.
including omission, as well as the cognitive There is also an icon representing infectious
load of providers. diseases with relevant data including white
One of the core components of AWARE blood cell counts and microbiologic specimen
is the multipatient viewer that is a population results. Organ icons are color coded the same
management tool (. Fig. 21.15). The multi- as on single patient viewer. Second from bot-
patient viewer shows the census and the geo- tom row has organs support icons. All icons
graphic layout of patient rooms a specific ICU were tested for recognition and recall by clini-
unit. By hovering over or clicking on the icons, cal providers (Litell et al. 2012). By clicking
clinical information can be accessed quickly. on a patient box it will be launched into the
Each patient box has four distinct areas Single Patient Viewer.
(. Fig. 21.16). Workflow or administrative The AWARE system was extensively
area is on the top and this includes the Task tested. In cluster randomized trials the time
list and Checklist icons as well as the room spent on preround data gathering decreased
number and readiness for discharge to the from 12 to 9 minutes per patient before and
21
.. Fig. 21.15 Overview of multipatient viewer. It is population management tool where boxes represent geograph-
ical view of patient rooms in the care unit
Patient Monitoring Systems
721 21
.. Fig. 21.16 Patient box
on the multipatient viewer
includes four groups of
icons described in the text
Just as with the false alarms generated by bed- tory therapist charting described earlier, it was
side monitors, alarm feedback from computer possible to develop and test computerized ven-
systems must present timely and accurate rec- tilator management protocols. Patient therapy
ommendations with a minimum number of was controlled by protocol 95% of the time
false alarms. and 90% of the protocol instructions were
followed by clinicians. Several of the comput-
erized instructions not followed were due to
21.7.1 Laboratory Alerts ventilator charting errors. Patients cared for
with the computerized protocol had required
During the developmental period of the less positive pressure in the ventilator system,
HELP system in the 1980s, it became appar- and physiologic measures were disturbed less.
ent that on occasion life-threatening labo- The investigators concluded that such proto-
ratory results were not being acted upon cols could make the ventilator weaning pro-
promptly. On acute care nursing floors, the cesses “less mystifying, simpler, and more
initial alert response time averaged from 5.1 systematic” (East et al. 1992). Since that early
to 58.2 hours (Bradshaw et al. 1989). By post- work, several other investigators have imple-
ing alerts on computer terminals on nursing mented similar ventilator weaning algorithms.
floors, the average response time was reduced In the process of implementing automated
to 3.6 hours. Then a flashing light, similar to charting of ventilator parameters at LDS
those found on road maintenance vehicles, Hospital (Vawdrey et al. 2007), it became
was installed on each nursing floor. The aver- clear that critical ventilator alarms were being
age response time then decreased to 6 min- missed. As discussed earlier, alarm sounds
utes but the light was very annoying to the emitted from ventilators were blended with
nursing staff (Bradshaw et al. 1989). When a bedside monitor alarm sounds. As a conse-
sophisticated nurse paging system was set up quence, when a patient became disconnected
that paged the particular nurse caring for the from a ventilator the alarms could be missed
patient with the laboratory alert and required (Evans et al. 2005). Once this situation was
nurses to acknowledge the alerts the new pager recognized, an enhanced notification system
system was equally effective and less annoying was implemented. . Figure 21.17 illustrates a
to other patients and staff (Tate et al. 1995). ventilator disconnect alarm presented on the
Similar work was done by Shabot (1995) at patient’s bedside display and on every other
Cedars-Sinai Hospital in Los Angeles using computer display in the same ICU. The efficacy
a Blackberry pager. Since that time, wireless and user acceptance of the new alarm system
communications technology has improved has enhanced patient safety and allowed docu-
dramatically and a variety of even better feed- mentation of this important clinical event.
back mechanisms are now available. However
in a study by Harrison et al. (2017), the alert
acknowledgement rate from the severe sepsis
alert system was significantly better with tra-
Ventilator Disconnect in Room
ditional paging system.
E 645
21.7.2 Ventilator Weaning
Management and Alarm
System
Weaning patients from ventilators was one
of the first applications of a computerized
21 expert system to routine patient care at LDS
.. Fig. 21.17 Ventilator disconnect alarm. This alert is
for the patient in Room E645 but it is displayed on every
Hospital. As a result of the nurse and respira- computer screen in the intensive care unit
Patient Monitoring Systems
723 21
21.7.3 dverse Drug Event
A 21.7.4 I V Pump and Medications
Detection and Prevention Monitoring
Detection and prevention of adverse drug IV medication administration occurs in 90%
events (ADEs) has been a long-term goal of of hospitalized patients; virtually every ICU
caregivers, the World Health Organization, patient is connected to an IV pump to receive
and the U.S. Food and Drug Administration fluids, nutrients, and medications. Although
(Classen et al. 1991). Physicians, pharmacists, so-called smart pumps have been developed
and informatics specialists at LDS Hospital to minimize errors, those pumps are not yet
developed a computer-based ADE monitor integrated with the EMR and, as a result, are
that detected a variety of triggers in the EMR not capable of helping to prevent IV admin-
that could indicate potential ADEs, such as istration errors. Evans and associates (2010)
sudden medication stop orders, medication at LDS Hospital have used cabled or wireless
antidote ordering, and specific abnormal lab- IV pumps integrated with the HELP system
oratory and physiologic results. Pharmacists to enhance notification of IV pump program-
followed up on each ADE alert and each was ming errors. The medication charting sys-
verified and categorized. During an 18-month tem can detect and provide real-time alerts
period, 36,653 hospitalized patients were whenever an initial or potential pump rate
monitored and 731 true ADEs occurred in programming error occurs. A set of 23 high-
648 patients, 701 were classified as moderate risk medications are monitored by the HELP
or severe. Only 92 of the ADEs were identi- system. Whenever IV pump flow rate for one
fied by traditional voluntary reporting meth- of these medications is outside the acceptable
ods. Using this knowledge, the investigators range, a visual alert such as that shown in
developed methods for preventing ADEs. An . Fig. 21.18 is presented on the bedside dis-
example is the nurse charting work of Nelson play and on all other computer displays in the
et al. (2005). same ICU. Over a 2-year period, they found
Classen and colleagues (1997) followed that there were alerts on 4% of the initial or
up their earlier surveillance system for ADEs. dose rate changes or about 1.4 alerts per day.
They found that the attributable length of
stay and costs of hospitalization for ADEs
were substantial. If a patient had an ADE
there was an increased length of stay of
1.74 days, an increased cost of $2,013, and
an increased risk of death of 1.88 (Classen
et al. 1997).
Even with the enhanced computerized
methods for detecting, preventing and moni- Pump Alert
toring ADEs, there is still room for improve- E601
ment (Petratos et al. 2010). In a studyover Pump #305
200,000 medication alerts in an electronic
prescribing system found more than 90% of
drug alerts were overridden by physicians
(Isaac et al. 2009). Critically ill patients are
particularly susceptible to ADEs due to their
unstable physiology, complex therapeutic
medications, and the large percentage of IV
medications (Hassan et al. 2010). Better sys- .. Fig. 21.18 Intravenous pump alert. This alert is for
tems must be developed and implemented to pump 305 located in Room E601, but it is displayed on
prevent ADEs. every computer screen in the intensive care unit
724 V. Herasevich et al.
Rule-based decision support systems in the 2. How would you decide whether to buy a
modern EMR era. Journal of Clinical standalone ICU patient monitoring
Monitoring Computing, 27(4), 443–448. system versus an integrated EMR sys-
PMID: 23456293. tem?
Kuperman, G. J., Gardner, R. M., & Pryor, T. A. 3. How do care providers impact the instal-
(1991). HELP: A dynamic hospital informa- lation and optimization of real-time
tion system. New York: Springer. data collection and real-time decision
Morris, A. H. (2001). Rational use of computer- support?
ized protocols in the intensive care unit. 4. Perhaps real-time data collection and
Critical Care, 5, 249–254. PMID 11737899. computerized decision support are not
Pickering, B. W., Litell, J. M., Herasevich, V., & necessary. How would you assess these
Gajic, O. (2012). Clinical review: The hospital issues? Is there sufficient literature to
of the future - building intelligent environments validate or disprove your supposition? If
to facilitate safe and effective acute care deliv- not, what is missing?
ery. Critical Care, 16(2), 220. PMID: 22546172. 5. How would you go about selecting the
Shabot, M. M., & Gardner, R. M. (Eds.). (1994). optimum data for monitoring and
Decision support systems in critical care. improving the care of a critically ill
Boston: Springer. patient?
6. How would you optimize a patient mon-
??Questions for Discussion itoring system that you were building or
1. Describe how the integration of buying to provide the most accurate,
information from multiple bedside timely, and helpful computerized deci-
monitoring signals, the pharmacy, and sion support capabilities? Be specific
clinical laboratory data can help and give literature references to support
improve alarm systems used in an ICU. your optimization plan.
728 V. Herasevich et al.
7. If you were the Chief Clinical Brown, L. T., Purcell, G. J., & Traugott, F. M. (1990).
Information Officer of a large hospital Hypoxaemia during postoperative recovery using
continuous pulse oximetry. Anaesthesia and Intensive
without data from the ICU integrated
Care, 18(4), 509–516. PMID: 2268017.
into your EMR system, what factors Burykin, A., Peck, T., & Buchman, T. G. (2011). Using
would you have to consider to “off-the-shelf ” tools for a terabyte-scale waveform
implement such a system AND to recording in intensive care: Computer system
apply computerized clinical decision design, database description and lessons learned.
Computer Methods and Programs in Biomedicine,
support to optimize such a system?
103, 151–160. PMID 21093093.
How long do you think it would take Carayon, P., Cartmill, R., Blosky, M. A., Brown, R.,
to implement such a system? Hackenberg, M., Hoonakker, P., Hundt, A. S.,
Norfolk, E., Wetterneck, T. B., & Walker, J. M.
(2011). ICU nurses’ acceptance of electronic health
records. Journal of the American Medical Informatics
References Association: JAMIA, 18(6), 812–819. PMID
21697291.
Aboukhalil, A., Nielsen, L., Saeed, M., Mark, R. G., & Celi, L. A., Hassan, E., Marquardt, C., Breslow, M., &
Clifford, G. D. (2008). Reducing false alarm rates Rosenfeld, B. (2001). The eICU: It’s not just tele-
for critical arrhythmias using the arterial blood medicine. Critical Care Medicine, 29(8 Suppl),
pressure waveform. Journal of Biomedical N183–N189. PMID 11496041.
Informatics, 41, 442–451. PMID 18440873. Chambrin, M. C. (2001). Alarms in the intensive care
Ahmed, A., Chandra, S., Herasevich, V., Gajic, O., & unit: How can the number of false alarms be
Pickering, B. W. (2011). The effect of two different reduced? Critical Care, 5, 184–188. PMID
electronic health record user interfaces on intensive 11511330.
care provider task load, errors of cognition, and Chaudhry, B., Wang, J., Wu, S., Maglione, M., Mojica,
performance. Critical Care Medicine, 39, 1626– W., Roth, E., Morton, S. C., et al. (2006). Systematic
1634. PMID 21478739. review: Impact of health information technology on
Ali, R., Cornelius, P. J., Herasevich, V., Gajic, O., & quality, efficiency, and costs of medical care. Annals
Kashyap, R. (2017). Effect of daily use of electronic of Internal Medicine, 144(10), 742–752.
checklist on physical rehabilitation consultations in Chen, T., Clifford, G. D., & Mark, R. G. (2009). The
critically ill patients. Journal of Critical Care, 38, effect of signal quality on six cardiac output estima-
357–361. PMID: 28043721. tors. Computers in Cardiology, 36, 197–200. PMC
Andrews, R. D., Gardner, R. M., Metcalf, S. M., & 2926988.
Simmons, D. (1985). Computer charting: An evalu- Clark, A. P., Giuliano, K., & Chen, H. M. (2006). Pulse
ation of a respiratory care computer system. oximetry revisited “but His O2 was normal”.
Respiratory Care, 30, 695–707. PMID 10315682. Clinical Nurssing Specialist CNS, 20, 268–272.
Blum, J. M., & Tremper, K. K. (2010). Alarms in the PMID 17149014.
intensive care unit: Too much of a good thing is dan- Clark, E. N., Sejersten, M., Clemmensen, P., &
gerous: Is it time to add some intelligence to alarms? MacFarland, P. W. (2010). Automated electrocar-
Critical Care Medicine, 33, 702–703. PMID diogram interpretation programs versus cardiolo-
20083933. gists’ triage decision making based on teletransmitted
Bogun, F., Anh, D., Kalahasty, G., Wissner, E., Cou data in patients with suspected acute coronary syn-
Serhal, C., Bazzi, R., Weaver, W. D., & Schuger, C. drome. The American Journal of Cardiology, 106,
(2004). Misdiagnosis of atrial fibrillation and its 1696–1702. PMID 21126612.
clinical consequences. American Journal of Classen, D. C., Pestotnik, S. L., Evans, R. S., & Burke,
Medicine, 117, 636–642. PMID 15501200. J. P. (1991). Computerized surveillance of adverse
Bradshaw, K. E., Gardner, R. M., Clemmer, T. P., Orme, drug events in hospital patients. JAMA, 266, 2847–
J. F., Thomas, F., & West, B. J. (1984). Physician 2851. PMID 194452.
decision-making: Evaluation of data used in a com- Classen, D. C., Pestotnik, S. L., Evans, R. S., Lloyd, J. F.,
puterized ICU. International Journal of Clinical & Burke, J. P. (1997). Adverse drug events in hospital-
Monitoring and Computing, 1(2), 81–91. ized patients. Excess length of stay, extra costs, and
Bradshaw, K. E., Sittig, D. F., Gardner, R. M., Pryor, attributable mortality. Journal of the American
T. A., & Budd, M. (1989). Computer-based data Medical Association, 277(4), 301–306.
entry for nurses in the ICU. MD Computing, 6(5), Clemmer, T. P. (2004). Computers in the ICU: Where we
274–280. PMID 2486506. started and where we are now. Journal of Critical
Braunwald, E. (1988). Evolution of the management of Care, 19, 204–207. PMID 15648035.
acute myocardial infarction: A 20th century saga. Clemmer, T. P., Spuhler, V. J., Berwick, D. M., & Nolan,
21 The Lancet, 352, 1771–1774. PMID 9848369. T. W. (1998). Cooperation: The foundation of
Breslow, M. J. (2007). Remote ICU care programs: improvement. Annals of Internal Medicine, 128(12
Current status. Journal of Critical Care, 22(1), 66–76. Pt 1), 1004–1009. PMID 9625663.
Patient Monitoring Systems
729 21
Collins, F. S. (2011). Reengineering translational sci- Technology and Informatics, 160(Pt 1), 734–738.
ence: The time is right. Science Translational PMID 20841783.
Medicine, 3(90), 90cm17. Evans, R. S., Kuttler, K. G., Simpson, K. J., Howe, S.,
Cox, J., Jr. (1972). Digital analysis of the electroenceph- Crossno, P. F., Johnson, K. V., Schreiner, M. N., Lloyd,
alogram, the blood pressure wave, and the electro- J. F., Tettelbach, W. H., Keddington, R. K., Tanner,
cardiogram. Proceedings of the IEEE, 60, 1137. A., Wilde, C., & Clemmer, T. P. (2015). Automated
Crossley, G. H., Poole, J. E., Rozner, M. A., et al. (2011). detection of physiologic deterioration in hospitalized
The Heart Rhythm Society (HRS)/American patients. Journal of the American Medical Informatics
Society of Anesthesiologists (ASA) expert consen- Association, 22(2), 350–360. PMID: 25164256.
sus statement on the perioperative management Funk, M., Winkler, C. G., May, J. L., Stephens, K.,
arrhythmia monitors. Heart Rhythm, 8, 1114–1154. Fennie, K. P., Rose, L. L., Turkman, Y. E., & Drew,
PMID 21722856. B. J. (2010). Unnecessary arrhythmia monitoring
Cundick, R. M., Jr., & Gardner, R. M. (1980). Clinical and underutilization of ischemia and QT interval
comparison of pressure-pulse and indicator-dilution monitoring in current clinical practice: Baseline
cardiac output determination. Circulation, 62, 371– results of the practice use of the latest standards for
376. PMID 6994922. electrocardiography trial. Journal of
Cushing, H. (1903). On routine determination of arte- Electrocardiology, 43, 542–547. PMID 20832819.
rial tension in operating room and clinic. Boston Gardner, R. M. (1996). Accuracy and reliability of dis-
Medical Surgical Journal, 148, 250. posable pressure transducers coupled with modern
Dalto, J. S., Johnson, K. V., Gardner, R. M., Spuhler, pressure monitors. Critical Care Medicine, 24, 879–
V. J., & Egbert, L. (1997). Medical information bus 882. PMID 8706469.
usage for automated IV pump data acquisition: Gardner, R. M. (2004). Computerized clinical decision-
Evaluation of use and nurse attitudes. International support in respiratory care. Respiratory Care, 49,
Journal of Clinical Monitoring and Computing, 14, 378–386. PMID 15030611.
151–154. PMID 9387004. Gardner, R. M. (2016). Clinical information systems –
Deflandre, E., Bonhomme, V., & Hans, P. (2008). Delta from yesterday to tomorrow. Yearbook of Medical
down compared with delta pulse pressure as an indi- Informatics, (Suppl 1), S62–S75. PMID: 27362589.
cator of volaemia during intracranial surgery. Gardner, R. M., & Beale, R. J. (2009). Pressure to per-
British Journal of Anaesthesia, 100, 245–250. PMID form: Is cardiac output estimation from arterial
18083787. waveforms good enough for routine use? Critical
Doig, A. K., Drews, F. A., & Keefe, M. R. (2011). Care Medicine, 37, 337–338. EDITORIAL PMID
Informing the design of hemodynamic monitoring 19112291.
displays. Computers, Informatics, Nursing, 29(12), Gardner, R. M., & Lundsgaarde, H. P. (1994).
706–713. PMID 21412150. Evaluation of user acceptance of a clinical expert
Drew, B. J., & Funk, M. (2006). Practice standards for system. Journal of the American Medical Informatics
ECG monitoring in hospital setting: Executive sum- Association: JAMIA, 1(6), 428–438.
mary and guide for implementation. Critical Care Gardner, R. M., Warner, H. R., Toronto, A. F., &
Nursing Clinics of North America, 18(2), 157–168. Gaisford, W. D. (1970). Cather-flush system for con-
PMID 16728301. tinuous monitoring of central arterial pulse wave-
Drews, F. A., Musters, A., & Samore, M. H. (2008). form. Journal of Applied Physiology, 29, 911–913.
Error producing conditions in intensive care unit. In PMID 5485368.
Advances in patient safety: New directions and alter- Gardner, R. M., Tariq, H., Hawley, W. L., & East, T. D.
native approaches (Performance and tools) (Vol. 3). (1989). Medical information bus: The key to future
Rockville: AHRQ. PMID 21249947. integrated monitoring. International Journal of
East, T. D., Bohm, S. H., Wallace, C. J., Clemmer, T. P., Clinical Monitoring and Computing, 6, 197–204.
Weaver, L. K., Orme, J. F., Jr., & Morris, A. H. PMID 2628508.
(1992). A successful computerized protocol for clin- Gardner, R. M., Hawley, W. H., East, T. D., Oniki, T., &
ical management of pressure control inverse ration Young, H. F. W. (1991). Real time data acquisition:
ventilation. Chest, 101(3), 697–710. Experience with the medical information bus (MIB).
Edworthy, J., & Hellier, E. (2006). Alarms and human Proceedings of the Annual Symposium on Computer
behavior: Implications for medical alarms. British Applications in Medical Care, 15, 813–817. PMID
Journal of Anaesthesia, 97, 12–17. PMID 16698858. 1807719.
Evans, R. S., Johnson, K. V., Flint, V. B., Kinder, T., Gardner, R. M., Pryor, T. A., & Warner, H. R. (1999).
Lyon, C. R., Hawley, W. L., Vawdrey, D. K., & The HELP system: Update 1998. International
Thomsen, G. E. (2005). Enhanced notification of Journal of Medical Informatics, 54, 169–182. PMID
critical ventilator events. Journal of the American 10405877.
Medical Informatics Association, 12, 589–595. Gorges, M., Markewitz, B. A., & Westenskow, D. R.
PMID 16049226. (2009). Improving alarm performance in the medi-
Evans, R. S., Carlson, R., Johnson, K. V., Palmer, B. K., cal intensive care unit using delays and clinical con-
& Lloyd, J. F. (2010). Enhanced notification of infu- text. Anesthesia and Analgesia, 108, 1546–1552.
sion pump programming errors. Studies in Health PMID 19372334.
730 V. Herasevich et al.
Gregg, R. E., Zhou, S. H., Lindauer, J. M., Helfendbein, tronic health record implementation. Journal of the
E. D., & Giuliano, K. K. (2008). What is inside the American Medical Informatics Association, 22(2),
electrocardiograph? Journal of Electrocardiology, 443–452. PMID: 25164255.
41, 8–14. PMID 1819652. Hripcsak, G., Vawdrey, D., Fred, M., & Bostwick, S.
Groves, R. H., Jr., Holcomb, B. W., Jr., & Smith, M. L. (2011). Use of electronic clinical documentation:
(2008). Intensive care telemedicine: Evaluating a Time spent and team interactions. Journal of the
model for proactive remote monitoring and inter- American Medical Informatics Association: JAMIA,
vention in the critical care setting. Studies in Health 18(1), 112–117.
Technology and Informatics, 131, 131–146. PMID Imhoff, M., & Kuhls, S. (2006). Alarm algorithms in
18305328. critical care monitoring. Anesthesia and Analgesia,
Guglin, M. E., & Thatai, D. (2006). Common errors in 102, 1525–1537. PMID 16632837.
computer electrocardiogram interpretation. Isaac, T., Weissman, J. S., Davis, R. B., Massagli, M.,
International Journal of Cardiology, 106, 232–237. Cyrulik, A., Sands, D. Z., & Weingart, S. N. (2009).
PMID 16321696. Overrides of medication alerts in ambulatory care.
Gurses, A. P., & Xiao, Y. (2006). A systematic review of Archives of Internal Medicine, 169(3), 305–311.
the literature on multidisciplinary rounds to design PMID: 19204222.
information systems. Journal of the American Jha, A. K., DesRoches, C. M., Campbell, E. G.,
Medical Informatics Association: JAMIA, 13, 267– Donelan, K., Rap, S. R., Ferris, T. G., Shields, A.,
278. PMID 16501176. Rosenbaum, S., & Blumenthal, D. (2009). Use of
Han, Y. Y., Carcillo, J. A., Venkataraman, S. T., Clark, electronic health records in U.S. Hospitals. The New
R. S., Watson, R. S., Nguyen, T. C., Bayir, H., & England Journal of Medicine, 360, 1628–1638.
Orr, R. A. (2005). Unexpected increased mortality PMID 19321858.
after implementation of a commercially sold com- Johnson, A. E. W., & Mark, R. G. (2018). Real-time
puterized physician order entry system. Pediatrics, mortality prediction in the Intensive Care Unit.
116(6), 1506–1512. PMID: 16322178. American Medical Informatics Association Annual
Hassan, E., Badawi, O., Weber, R. J., & Cohen, H. (2010). Symposium Proceedings, 2017, 994–1003. PMID:
Using technology to prevent adverse drug events in 29854167.
the intensive care unit. Critical Care Medicine, 38(6 Kahn, J. M. (2011a). The use and misuse of ICU tele-
Suppl), S97–S105. PMID 20502181. medicine. JAMA, 305, 2176–2183. PMID 21576623.
Haynes, A. B., Weiser, T. G., Berry, W. R., Lipsitz, S. R., Karsh, B.-T., Weinger, M. B., Abbott, P. A., & Wears,
Breizat, A. H., Dellinger, E. P., Herbosa, T., Joseph, R. L. (2010). Health information technology:
S., Kibatala, P. L., Lapitan, M. C., Merry, A. F., Fallacies and sober realities. Journal of the American
Moorthy, K., Reznick, R. K., Taylor, B., Gawande, Medical Informatics Association: JAMIA, 17(6),
A. A., & Safe Surgery Saves Lives Study Group. 617–623.
(2009). A surgical safety checklist to reduce morbid- Kelley, M. A., Angus, D., Chalfin, D. B., Crandall,
ity and mortality in a global population. The New E. D., Ingbar, D., Johanson, W., Medina, J., Sessler,
England Journal of Medicine, 360(5), 491–499. C. N., & Vender, J. S. (2004). The critical care crisis
PMID: 19144931. in the United States: A report from the profession.
Herasevich, V., Yilmaz, M., Khan, H., Hubmayr, R. D., Chest, 125, 1514–1517. PMID 15078767.
& Gajic, O. (2009). Validation of an electronic sur- Kennelly, R. J., & Gardner, R. M. (1997). Perspectives
veillance system for acute lung injury. Intensive Care on development of IEEE 1073: The medical infor-
Medicine, 35, 1018–1023. PMID 19280175. mation bus (MIB) standard. International Journal of
Herasevich, V., Pickering, B. W., Dong, Y., Peters, S. G., Clinical Monitoring and Computing, 14(3), 143–149.
& Gajic, O. (2010). Informatics infrastructure for Keren, H., Burkhoff, D., & Squara, P. (2007). Evaluation
syndromic surveillance, decision support, reporting, of a non-invasive continuous cardiac output monitor-
and modeling of critical illness. Mayo Clinic ing system based on thoracic bioreactance. American
Proceedings, 85, 247–254. PMID 21194152. Journal of Physiology - Heart and Circulatory
Herasevich, V., Afessa, B., & Pickering, B. W. (2011). Physiology, 293, H583–H589. PMID 17384132.
Sepsis in critically ill patients with trauma. Critical Kilickaya, O., Schmickl, C., Ahmed, A., Pulido, J.,
Care Medicine, 39, 876–879. PMID 21613830. Onigkeit, J., Kashani, K., Gajic, O., Herasevich, V.,
Herasevich, V., Litell, J., & Pickering, B. (2012). & Pickering, B. (2014). Customized reference ranges
Electronic medical records and mHealth anytime, for laboratory values decrease false positive alerts in
anywhere. Biomedical Instrumentation & Technology, intensive care unit patients. PLoS One, 9(9),
46(Suppl 2), 45–48. PMID: 23039776. e107930. PMID: 25233485.
Herasevich, V., Kor, D. J., Subramanian, A., & Pickering, Kohn, L. T., Corrigan, J., & Donaldson, M. S. (2000).
B. W. (2013). Connecting the dots: Rule-based deci- Institute of Medicine. To err is human. Building a
sion support systems in the modern EMR era. safer health system (p. xxi, 287). Washington, DC:
Journal of Clinical Monitoring and Computing, National Academy Press. PMID: 25077248.
21 27(4), 443–448. PMID: 23456293. Kouwenhoven, W. B., Jude, J. R., & Knickerbocker,
Howley, M. J., Chou, E. Y., Hansen, N., & Dalrymple, G. G. (1960). Closed-chest cardiac massage. JAMA,
P. W. (2015). The long-term financial impact of elec- 173, 1064–1067. PMID 14411374.
Patient Monitoring Systems
731 21
Kowalczyk, L. (2011, February 13). Patient alarms often tion, alerts, and documentation. In Proceedings/
unheard, unheeded. Boston Globe. AMIA annual symposium AMIA symposium,
Kuperman, G. J., Gardner, R. M., & Pryor, T. A. (1991). pp. 498–502.
HELP: A dynamic hospital information system. Olchanski, N., Dziadzko, M. A., Tiong, I. C., Daniels,
New York: Springer. C. E., Peters, S. G., O’Horo, J. C., & Gong, M. N.
Landrigan, C. P., Parry, G. J., Bones, C. B., Hackbarth, (2017). Can a novel ICU data display positively
A. D., Goldmann, D. A., & Sharek, P. J. (2010). affect patient outcomes and save lives? Journal of
Temporal trends in rates of patient harm resulting Medical Systems, 41(11), 171. PMID: 28921446.
from medical care. The New England Journal of Oniki, T. A., Clemmer, T. P., & Pryor, T. A. (2003). The
Medicine, 363(22), 2124–2134. PMID: 21105794. effect of computer-generated reminders on chart-
Lee, J., & Mark, R. G. (2010). An investigation of pat- ing deficiencies in the ICU. Journal of the American
terns in hemodynamic data indicative of impending Medical Informatics Association: JAMIA, 10, 177–
hypotension in intensive care. Biomedical 187.
Engineering, 9, 62. PMID 20973998. Patel, V. L., & Cohen, T. (2008). New perspectives on
Lee, J. Y., Du, Y. E., Coki, O., Flynn, J. T., Starren, J., & error in critical care. Current Opinion in Critical
Chiang, M. F. (2010). Parental perceptions toward Care, 14(4), 456–459.
digital imaging and telemedicine for retinopathy of Patel, V. L., Zhang, J., Yoskowitz, N. A., Green, R., &
prematurity management. Graefe’s Archive for Clinical Sayan, O. R. (2008). Translational cognition for
and Experimental Ophthalmology, 248, 141–147. decision support in critical care environments: A
Li, Q., Mark, R. G., & Clifford, G. D. (2009). Artificial arte- review. Journal of Biomedical Informatics, 41, 413–
rial blood pressure artifact models and an evaluation of 431.
a robust blood pressure and heart rate estimator. Petratos, G. N., Kim, Y., Evans, R. S., & Gardner, R. M.
Biomedical Engineering Online, 8, 13. PMID 19586547. (2010). Comparing the effectiveness of computer-
Lilly, C. M., Cody, S., Zhao, H., et al. (2011). Hospital ized adverse drug event monitoring systems to
mortality, length of stay, and preventable complica- enhance clinical decision support for hospitalized
tions among critically Ill patients before and after patients. Applied Clinical Informatics, 1, 293–303.
Tele-ICU reengineering of critical care processes. Pickering, B. W., Herasevich, V., & Gajic, A. A. (2010).
JAMA, 305, 2175–2183. Novel representation of clinical information in the
Litell, J. M., Suther, T. W., Ridgeway, C. J., Tiong, I. C., ICU – developing user interfaces which reduce
Pickering, B. W., & Herasevich, V. (2012). information overload. Applied Clinical Informatics,
Representation of organ system domains in a novel 1, 116–131. PMID: 23616831.
critical care EMR interface: Implications for effec- Pickering, B. W., Litell, J. M., Herasevich, V., & Gajic,
tive partnership between clinicians and design pro- O. (2012). Clinical review: The hospital of the
fessionals. In AMIA 2012 proceedings, p. 1840. future - building intelligent environments to facili-
Lown, B., Amarasingham, R., & Neuman, J. (1962). tate safe and effective acute care delivery. Critical
New method for terminating cardiac arrhythmias. Care, 16(2), 220. PMID: 22546172.
Use of synchronous capacitor discharge. JAMA, Pickering, B. W., Dong, Y., Ahmed, A., Giri, J.,
182, 548–555. PMID 12921298. Kilickaya, O., Gupta, A., Gajic, O., & Herasevich,
Malviya, S., Reynolds, P. I., Voepel-Lewis, T., Siewert, V. (2015). The implementation of clinician designed,
M., Watson, D., Tait, A. R., & Tremper, K. (2000). human-centered electronic medical record viewer in
False alarms and sensitivity of conventional pulse the intensive care unit: A pilot step-wedge cluster
oximetry versus the Masimo SET technology in the randomized trial. International Journal of Medical
pediatric postanesthesia care unit. Anesthesia and Informatics, 84(5), 299–307. PMID: 25683227.
Analgesia, 90(6), 1336–1340. PMID: 10825316. Poon, K. B. (2005). Fusing multiple heart rate signals to
Mathews, S. C., & Pronovost, P. J. (2011). The need for reduce alarms in adult intensive care unit. Master of
systems integration in health care. JAMA, 305, 934– Science Thesis from Department of Medical
935. PMID 21364143. Informatics, University of Utah.
Morris, A. H. (2000). Developing and implementing Pryor, T. A. (1989). Computerized nurse charting.
computerized protocols for standardization of clini- International Joural of Clinical Monitoring and
cal decisions. Annals of Internal Medicine, 132, 373– Computing, 6, 173–179. PMID 2592844.
383. PMID 10691588. Pryor, T. A., Gardner, R. M., Clayton, P. D., & Warner,
Morris, A. H. (2001). Rational use of computerized pro- H. R. (1983). The HELP system. Journal of Medical
tocols in the intensive care unit. Critical Care, 5, Informatics, 7(2), 87–102.
249–254. PMID 11737899. Rogers, W. J., Canto, J. G., Lambrew, C. T., Tiefenbrunn,
Nelson, N. C., Evans, R. S., Samore, M. H., & Gardner, A. J., Kinkaid, B., Shoultz, D. A., Frederick, P. D.,
R. M. (2005). Detection and prevention of medica- & Every, N. (2000). Temporal trends in the treat-
tion errors using real-time nurse charting. Journal of ment of over 1.5 million patients with myocardial
the American Medical Informatics Association: infarction in the US from 1990 thru 1999: The
JAMIA, 12, 390–397. PMID 15802486. National Registry of Myocardial Infarction 1, 2 and
Norris, P. R., & Dawant, B. M. (2001). Closing the loop 3. Journal of the American College of Cardiology, 36,
in ICU decision support: Physiologic event detec- 2056–2063. PMID 11127441.
732 V. Herasevich et al.
Rosenfeld, B. A., Dorman, T., Breslow, M. J., et al. data entry of ventilator settings. Journal of the
(2000). Intensive care unit telemedicine: Alternate American Medical Informatics Association: JAMIA,
paradigm for providing continuous intensivist care. 14, 295–303. PMID 17329731.
Critical Care Medicine, 28, 3925–3931. Wager, K. A., Schaffner, M. J., Foulois, B., Swanson
Saeed, M., Villarroel, M., Reisner, A. T., Clifford, G., Kazley, A., Parker, C., & Walo, H. (2010).
Lethman, L. W., Moody, G., Heldt, T., Kyaw, T. H., Comparison of the quality and timeliness of vital
Moody, B., & Mark, R. G. (2011). Multiparameter signs data using three different data-entry devices.
intelligent monitoring in intensive care II: A public- Computers, Informatics, Nursing, 28(4), 205–212.
access intensive care unit database. Critical Care PMID: 20571371.
Medicine, 39, 952–960. PMID 21283005. Warner, H. R., Swan, H. J., Connolly, D. C., Tompkins,
Sanctorius, S. (1626). Commentaria in primam Fen primi R. G., & Wood, E. H. (1953). Quantitation of beat-
libri canonis Avicennae (p. 22). Venice: Sarcina. to-beat changes in stroke volume from the aortic
Saxton, G. A., Jr., & Myers, G. (1957). A servomecha- pulse contour in man. Journal of Applied Physiology,
nism for automatic regulation of pulmonary venti- 5, 495–507. PMID 13034677.
lation. Journal of Applied Physiology, 11(2), Warner, H. R., Gardner, R. M., & Toronto, A. F. (1968).
326–328. PMID: 13475186. Computer-based monitoring of cardiovascular
Shabot, M. M. (1995). Computers in the intensive care function in postoperative patients. Circulation, 37(4
unit: Was pogo correct? Journal of Intensive Care Suppl), II68–II74.
Medicine, 10, 211–212. Weinfurt, P. T. (1990). Electrocardiographic monitoring:
Shubin, H., & Weil, M. H. (1966). Efficient monitoring An overview. Journal of Clinical Monitoring, 6(2),
with a digital computer of cardiovascular function 132–138.
in seriously ill patients. Annals of Internal Medicine, Weiss, C. H., Moazed, F., McEvoy, C. A., Singer, B. D.,
65(3), 453–460. Szleifer, I., Amaral, L. A., Kwasny, M., Watts,
Siebig, S., Kuhls, S., Imhoff, M., Gather, U., Scholmerich, C. M., Persell, S. D., Baker, D. W., Sznajder, J. I., &
J., & Wrede, C. E. (2010). Intensive care unit Wunderink, R. G. (2011). Prompting physicians to
alarms – how many do we need? Critical Care address a daily checklist and process of care and
Medicine, 38, 451–456. PMID 20016379. clinical outcomes: A single-site study. American
Sittig, D. F., Wright, A., Meltzer, S., Simonaitis, L., Journal of Respiratory and Critical Care Medicine,
Evans, R. S., Nichol, W. P., Ash, J. S., & Middleton, 184(6), 680–686. PMID: 21616996.
B. (2011). Comparison of clinical knowledge man- Willson, D. (1994). Survey of nurse perception regard-
agement capabilities of commercially-available and ing the utilization of bedside computers.
leading internally-developed electronic health Proceedings of the Annual Symposium on Computer
records. BMC Medical Informatics and Decision Applications in Medical Care, pp. 553–557. PMID
Making, 11, 13. PMID 21329520. 7949989.
Sun, J. X., Reisner, A. T., Saeed, M., Heldt, T., & Mark, Willson, D., Nelson, N. C., Rosebrock, B. J., Hujcs,
R. G. (2009). The cardiac output from pressure M. T., Wilner, D. G., & Buxton, R. B. (1994). Using
algorithms trial. Critical Care Medicine, 37, 72–80. an integrated point of care system: A nursing per-
PMID 19112280. spective. Topics in Health Information Management,
Tate, K. E., Gardner, R. M., & Scherting, K. (1995). 14, 24–29. PMID 19134757.
Nurses, pagers, and patient-specific criteria: Three Wright, A., Sittig, D. F., Ash, J. S., Feblowitz, J., et al.
keys to improved critical value reporting. Proceedings (2011). Development and evaluation of comprehen-
of the Annual Symposium on Computer Applications sive clinical decision support taxonomy:
in Medical Care, pp. 154–8. PMID 8563258. Comparison of front-end tools in commercial and
Thompson, J. P., & Mahajan, R. P. (2006). Monitoring internally developed electronic health records.
the monitors – beyond risk management. British Journal of the American Medical Informatics
Journal of Anaesthesia, 97, 1–3. PMID 16769701. Association: JAMIA, 18, 232–242. PMID 21415065.
Thompson, G., O’Horo, J. C., Pickering, B. W., & Zhang, Y., & Szolovitz, P. (2008). Patient-specific learn-
Herasevich, V. (2015). Impact of the electronic med- ing in real time for adaptive monitoring in critical
ical record on mortality, length of stay, and cost in care. Journal of Biomedical Informatics, 41, 452–460.
the hospital and ICU: A systematic review and PMID 18463000.
metaanalysis. Critical Care Medicine, 43(6), 1276– Zoll, P. M., Linethal, A. J., Gibson, W., Paul, M. H., &
1282. PMID: 25756413. Norman, L. R. (1956). Termination of ventricular
Thongprayoon, C., Harrison, A. M., O’Horo, J. C., fibrillation in man by externally applied counter-
Berrios, R. A. S., Pickering, B. W., & Herasevich, V. shock. The New England Journal of Medicine, 254,
(2016). The effect of an electronic checklist on criti- 727–732. PMID 13309666.
cal care provider workload, errors, and perfor- Zong, W., Moody, G. B., & Mark, R. G. (2004).
mance. Journal of Intensive Care Medicine, 31(3), Reduction of false arterial blood pressure alarms
205–212. PMID: 25392010. using signal quality assessment and relationship
21 Vawdrey, D. K., Gardner, R. M., Evans, R. S., Orme, between the electrocardiogram and arterial blood
J. F., Jr., Clemmer, T. P., Greenway, L., & Drews, pressure. Medical & Biological Engineering &
F. A. (2007). Assessing the data quality in manual Computing, 42, 698–706. PMID 15503972.
733 22
Imaging Systems
in Radiology
Bradley J. Erickson
Contents
References – 753
22
Imaging Systems in Radiology
735 22
tion can be effectively decoupled. Independent light spectrum is also responsible for producing
imaging centers in a community face some of images seen endoscopically, typically captured
the same issues and opportunities, although to as video images or sequences(movies). Sound
a lesser degree, so we focus primarily on the energy, in the form of echoes from internal
distributed medical center-based Radiology structures, is used to form images in ultra-
department in this chapter. sound, a modality used heavily in cardiac,
abdominal, pelvic, breast, thyroid, testes, and
obstetrical imaging. In addition, Doppler shifts
22.2 Basic Concepts and Issues of sound frequency can measure blood flow
velocity in both arteries and veins and newer
22.2.1 Roles for Imaging microvascular imaging methods can measure
in Biomedicine perfusion in capillary beds. X-ray energy pro-
duces radiographic and computed-tomography
Imaging is a central part of the healthcare pro- (CT) images of most parts of the body: the dif-
cess for diagnosis, treatment planning, image- ferential absorption of X-rays by various tis-
guided interventions, assessment of response sues produces the varying densities that enable
to treatment, and prediction of outcome. In radiographic images to portray normal and
addition, it plays important roles in medical abnormal structures. More recent techniques
communication and education, as well as in that separate the various energies of the X-ray
research. beam allow for more precise characterization of
tissue composition, a simple form being known
as dual-energy CT, and more advanced forms
22.2.1.1 Detection and Diagnosis known as spectral CT. Emission of radioactive
The primary uses of images are for the detec- particles by isotopes that are incorporated into
tion of medical abnormalities, for diagnosing various types of molecules are used to produce
the nature of those abnormalities, and for nuclear-medicine images, which reflect the dif-
planning and guiding therapeutic interven- ferential concentration of those molecules in
tions. Detection focuses on identifying the various tissues. Magnetic-resonance imaging
presence of an abnormality, but in the case (MRI) depicts energy fluctuations of certain
where the findings are not sufficiently spe- atomic nuclei—usually hydrogen—when they
cific to be characteristic of a particular dis- are aligned in a magnetic field and then per-
ease, other information is required for actual turbed by a radiofrequency pulse. Parameters
diagnosis. This is the case, for example, with such as proton density, the rate at which the
mammograms, which are often used to screen nuclei return to alignment (T1), the rate of loss
for breast cancer; once a suspicious lesion of phase coherence after the pulse (T2), diffu-
is detected, a biopsy procedure is usually sion of water, and even the concentration of
required for diagnosis. In other circumstances, certain chemicals (MR Spectroscopy) can be
the image finding is diagnostic: for example, measured. These quantities differ in various
the finding of focal stenosis or obstruction tissues under normal conditions, with more
of an artery during angiography. Most often variations due to disease, thus enabling MRI to
there is a continuum between detection and distinguish among them. . Figure 22.1 shows
diagnosis, with imaging detecting a lesion some example images. It is also possible to use
with some range of confidence, and suggest- MRI methods to accentuate and measure the
ing some possibilities, known as the differen- flow of fluids like blood or CSF, known as
tial diagnosis (see 7 Chap. 2). Magnetic Resonance Angiography (MRA).
Diagnosis and detection can be done with
a wide variety of imaging procedures. Images 22.2.1.2 Assessment and Planning
produced by visible light can be used by oph- In addition to being used for detection and
thalmologists for retinal photography, but also diagnosis, imaging is often used to assess a
by dermatologists to view skin lesions, or by patient’s health status in terms of progression
pathologists for light microscopy. The visible- of a disease process (such as determination of
736 B. J. Erickson
a b
e f
.. Fig. 22.1 Examples of the types of images discussed in the text. (a) is a microscopic image of tissue stained with
hematoxylin and eosin; (b) is an ultrasound image of the thyroid; (c) is a contrast-enhanced CT image of the abdo-
men; (d) is contrast-enhanced MRI of the brain; (e) is an MR angiogram of the cervical vessels; (f) is an FDG PET-
22 CT image of the upper body; (g) is a photgraph of an eye (Dermatopath, US, CDUS, CT, MRI, MRI-DWI)
Imaging Systems in Radiology
737 22
tumor stage), response to treatment, and esti-
mation of prognosis. One can analyze cardiac
status by assessing the heart’s size and motion
echocardiographically. Similarly, one can use
ultrasound to assess fetal size and growth.
Computed tomography is used frequently to
determine approaches for surgery or for radia-
tion therapy. In the latter case, precise calcula-
tions of radiation-beam configuration can be
optimized to maximize dose to the tumor while
minimizing absorption of radiation by sur-
rounding tissues. This calculation is often per-
formed by simulating multiple radiation-beam
configurations and iterating to a best treatment
plan. For surgical planning, three-dimensional
volumes of CT or MRI data can be constructed
and presented for viewing from different per-
spectives to facilitate determination of the most
.. Fig. 22.2 Example of a CT-guided biopsy of a
appropriate surgical approach. More recently,
lesion in the neck. High quality imaging allows precise
creation of 3D printed models using 3D print- targeting of small targets even near important structures
ers has become very popular for surgical plan- like the carotid artery
ning and patient education. In some cases, the
printed object is even used to guide incisions or
implanted as a scaffold for tissue ingrowth. still practical to do so only in limited settings.
Because the abnormality is viewed through a
22.2.1.3 Image-Guided Procedures video display, the image source can be physi-
Images can provide real-time guidance when cally remote, a technique called telepresence.
virtual-reality (VR) images are superim- Similarly, the manipulation of the endoscope
posed on a surgeon’s visual perspective on the itself can be controlled by a robotic device
appropriate image view in the projection that that reproduces the hand movements of a
demonstrates the abnormality, a technique remote operator, and can provide haptic feed-
known as augmented reality (AR). With back reproducing the sensations of tissue
endoscopic and minimally invasive surgery, textures, margins, and resistance. This tech-
this kind of imaging can provide a localiz- nology is not too different from the robotic
ing context for visualizing and orienting the surgery methods that have become quite com-
endoscopic findings, and can enable monitor- mon today, though the practical limits noted
ing of results of interventions such as focused above have limited its use.
ultrasound, cryosurgery, or thermal ablation.
It is also possible to use intra-operative imag- 22.2.1.4 Communication
ing to update the position and appearance of Medical decision-making, including diagno-
pre-operative imaging used for procedural sis and treatment planning, is often aided by
planning. . Figure 22.2 shows an example of allowing clinicians to visualize images concur-
a CT-guided biopsy of a lesion in the neck. rently with textual reports and discussions of
High quality imaging allows precise targeting interpretations. Thus, we consider imaging
of small targets such as diseased lymph nodes to be an important adjunct to communica-
with little risk of damaging important nearby tion and images to be a desirable component
structures like the aorta or carotid artery. of a multimedia electronic medical record.
Improvements in robotics technology and Because medical imaging is an essential ele-
wide-area network capability have enabled ment of the practice of medicine, support
minimally invasive procedures to be conducted for transmission and remote image viewing
at a distance (see 7 Chap. 20), although it is is also a critical component of telemedicine
738 B. J. Erickson
22
Imaging Systems in Radiology
739 22
.. Fig. 22.4 Example of functional mapping of a patient with a brain tumor, where functional mapping is used to
identify critical structures, and thus guide surgical therapy
Educate/train,
provide
feedback
Analyze &
Perform exam
interpret findings
final two are done in parallel and are ongoing image). This task actually involves inter-
and support the other five. related subtasks: (a) detection of the rele-
1. The process begins with an evaluation by a vant findings and (b) interpretation of
clinician of a clinical problem and deter- those findings in terms of clinical meaning
mination of the need for an imaging proce- and significance.
dure. Decision-support tools (7 Chap. 24) 5. The radiologist creates a report and may
are commonly used to help determine if, also directly communicate the results to
and what type of, testing should be per- the referring clinician, as well as making
formed. suggestions for further evaluation as
2. The procedure is requested and scheduled, needed. In the past, this was free text, but
the indication for the procedure is stated, there is increasing use of templates that
and relevant clinical history is made result in a consistent pattern in the report,
available. or structured reports in which the textual
3. The imaging procedure is carried out, and report also exists in a form with codes for
images are acquired. An important step all concepts in the textual report. The
for many types of examinations is the ‘pro- annotation and markup of images can be
tocoling’ in which the precise way that the very helpful in communicating locations
images are acquired are specified by the of findings and serves as helpful land-
radiologist. For example, whether oral or marks for subsequent exams, and for sur-
IV contrast are administered, the imaging gical or radiation procedures.
plane(s), slice thickness, and contrast 6. Quality control and monitoring are car-
properties are also specified. ried out throughout the process, with the
4. The radiologist reviews the images in the aim of improving the foregoing processes.
context of the clinical history and indica- Factors such as patient waiting times,
tions for the examination and may mea- workloads, numbers of exposures obtained
sure structures in the images, segment per procedure, quality of images (such as
components of the image (e.g. measure ones degraded by patient motion or incor-
volumes such as the left ventricle), or rect acquisition parameters), radiation
manipulate the images (e.g. create 3D ren- dose, yields of procedures, incidence of
derings or perform processing such as con- complications, and quality of reports are
version of a series of images into a new measured, reported and adjusted to opti-
parametric image like a blood volume mize individual and overall quality.
22
Imaging Systems in Radiology
741 22
7. Continuing education and training are car- served both to detect the X-ray photons, to
ried out through a variety of methods, record them in a permanent form, and also
including access to atlases, review materials, to display the data (with the aid of a light-
teaching-file cases, and feedback of subse- box). This integrated arrangement existed
quently confirmed diagnoses to interpreting for about a century. Today, most radiographs
radiologists. Peer review of previously are either (a) recorded in a latent form (i.e.,
reported cases is now a common expecta- they are not directly visible, but are acquired
tion or requirement in most radiology prac- as an electronic signal on a charged plate)
tices as well as medical board agencies like where a ‘reader’ then scans the plate to create
the American Board of Radiology. a digital image (known as computed radiog-
raphy or CR) or (b) the photons are directly
All these tasks are now, in a growing number of converted to digital images (known as digital
departments, computer-assisted or automated, radiography or DR). The digital image can
and most of them involve images in some way. then be transmitted and stored like any digital
In fact, radiology is one branch of medicine data, using conventional networks and stor-
in which even the basic data are usually pro- age systems. The matrix size of the images is
duced by computers and stored directly in variable, ranging from as low as 64 x 64 for
computer memory. Radiology has also contrib- some nuclear medicine images, up to 5000 x
uted strongly to advances in computer-aided 4000 picture elements (pixels) for mammo-
instruction (see 7 Chap. 25), in technology grams. The size of typical radiology images
assessment (see 7 Chap. 13), and in clinical and examinations is shown in . Table 22.1.
decision support (see 7 Chap. 24). Speech rec- CT, MR, US, NM, and PET all use computer
ognition is commonly used for report creation. to convert the acquired raw signal to a digital
image and thus they are also exist as direct or
nearly direct digital modalities.
22.2.3 Electronic Imaging Systems
22.2.3.2 DICOM
22.2.3.1 Image Acquisition The first medical devices to produce digital
The first radiographs used an integrated images routinely were CT scanners, and soon
detection, recording and display system— after, MRI scanners. The availability of digi-
that is, the glass plate (and later, plastic film) tal data that represented a three-dimensional
CR/DR 5000,000 3 29
CT 262,144 500 250
MRI 65,536 500 63
US 262,144 50 25
Mammography 20,000,000 4 153
Interventional/fluoro 1,048,576 50 100
Nuclear medicine 16,384 25 1
Note that there is variability in image size and images per examination, and these numbers should be viewed
as very rough estimates. Furthermore, there is a strong trend for both increased image resolution (increasing
image size) and more images per examination since the emergence of digital imaging
aCR computer radiography, DR digital radiography, CT computed tomography, MRI magnetic resonance
imaging, US ultrasound
742 B. J. Erickson
image stimulated the field of medical image local area network at RSNA 1992, followed
processing and 3D rendering. An early chal- by increasingly sophisticated versions over the
lenge to such investigations was that the medi- next 2 years. The RSNA also made that soft-
cal device vendors used half-inch tape media ware available for free public access as a model
for storing the data, but each vendor (and for understanding the standard and design
usually each model of scanner) had its own of utilities and tools by developers. During
format. Such formats were proprietary, and the mid 1990’s, the RSNA annual meetings
required each investigator to reverse engineer hosted a major digital image interoperabil-
the format of the tape just to gain access to ity demonstration that became progressively
the data. Although computer networks were more sophisticated and demanding. RSNA
used in hospitals at that time, few if any scan- and its meetings accordingly facilitated dem-
ners supported network connections. onstration of the interconnection of vendor
The need to write all data to tape, and then products through the Internet, promoted
read it into a different computer using soft- DICOM compatibility as a feature that could
ware unique to each scanner resulted in signifi- be visualized at participating vendor exhibits,
cant unnecessary effort. The need to exchange and created a model Request For Proposals
images efficiently demanded that they be (RFP) for radiology practices and hospitals
represented in a standard fashion. This need to use to craft a DICOM requirement as part
was recognized by the American College of of the procurement of imaging systems. Later
Radiology (ACR) and the National Electrical this interoperability testing was done separate
Manufacturer’s Association (NEMA), and from the RSNA annual meeting, and became
led to the development of the ACR/NEMA known as the ‘Connectathon’. These efforts
standard for medical images in 1985. As other turned out to be extremely successful in trans-
imaging devices started to produce digital forming the marketplace from one that was
images, and as the information about the dominated by proprietary formats to one that
images became richer, the second version was was standards-based and interoperable.
published in 1989. That standard described DICOM continues to be updated and
both a model that described the data, and pre- improved through an international committee
scribed a special connector for transferring process. While it is hard for any standard to
image data between devices. This was dem- be both widely accepted and perfectly up-to-
onstrated at the 1990 Radiological Society date, the DICOM governance has done a
of North America (RSNA) conference. Soon remarkable job of adapting to rapid advances
after, TCP/IP became a widely accepted net- in imaging technology. The governance con-
work standard, and while the ACR/NEMA tinues to reflect its roots of combining indus-
standard did not describe a method for trans- try and medical experts who are interested in
ferring data over TCP/IP, investigators fairly providing the best technology that can be put
quickly implemented this, and it worked well. into commercial products.
Continued improvements in the information
model, as well as extension to medical spe- 22.2.3.3 Image Transmission,
cialties other than radiology, and standards Storage, and Display
for storage on physical media like compact Digital image capture provides the opportu-
disks demanded further revisions. The addi- nity to display and store the images in digital
tion of non-radiology images also demanded form. In the early days, the size of the images
a name change, and thus ‘ACR/NEMA 3.0’ represented a challenge—the amount of data
was rebranded as DICOM, which stands for was quite large relative to the capacity of
Digital Image Communications in Medicine. storage devices. As a consequence, there was
To promote adoption, the RSNA com- intense interest in using compression methods
missioned the creation of Central Test Node that could reduce the amount of storage that
(CTN) software, for demonstrating use of was required–as well as increase the speed of
the standard for transmitting images over a network transmission of images. Even with
22
Imaging Systems in Radiology
743 22
compression, the amount of storage used for to be updated or migrated. Because the data
images is quite large relative to non-image data were not stored in a standard format, it was
stored in a hospital. A hospital must there- necessary to get the cooperation of the vendor
fore carefully consider how images are stored. to migrate the data to the new system (which
In the early days of Picture Archiving and might be from another vendor!). Because
Communications Systems (PACS), there was workstations rapidly change, but archive
little choice about how and where images were contents do not, there was perceived value in
stored, because the storage system was tightly separating these two functions (Erickson and
integrated with the display and transmission. Hangiandreou 1998). Today, several compa-
This was done because the high demands on nies sell vendor-neutral archives (VNAs) that
storage, transmission, and display all required leverage the DICOM standard. These allow a
special hardware. As computer technology wide variety of image-producing and image-
caught up with medical image sizes, there was consuming systems to access the archive in
less need for specialized versions of networks, a standard fashion. It also enables storing
archives, and displays. images from outside the radiology depart-
The military was an early adopter/driver of ment on the same infrastructure. In fact, the
PACS technology, and released an RFP in the use of a VNA is now the norm rather than
1990’s requiring that ‘any image be displayed the exception at any major hospital system
anywhere on the network within 2 seconds.’ because it allows sharing of resources across
To address this requirement, early PACS uti- departments.
lized either uncommon standard technologies Because the image datasets are quite large,
or proprietary networking methods to provide there is interest in finding ways to reduce stor-
high bandwidth transmission. An example of age requirements. Image compression does
proprietary transmission technology is the exactly this, in one of two ways. There are
PACS developed by LORAL, which leveraged lossless compression methods, which encode
technology developed for its defense applica- redundancy in the image in a way that allows
tions. Its network was a hybrid of (standard) the original to be exactly reproduced. Lossy
10 Mbps Ethernet, which provided control (or irreversible) compression produces an
signaling, and a (proprietary) unidirectional image that is visually similar to the original.
hub to spoke optical network that had lossless Exactly how similar depends on the algorithm
compression built into the network card. The and user-selectable settings that reflect the
optical network signaled at 100 Mbps, and trade-off between fidelity and compression
because it was unidirectional, it routinely real- ratio (the ratio of original size to compressed
ized its nominal speed. Other vendors utilized size). The major challenge is that one can-
FDDI, which also was an optical fiber network not select a given setting, reliably get images
that signaled at 100 Mbps. However, its han- that are not visibly altered, and also achieve
dling of contention was much less effective, a good compression ratio. Lossless compres-
and its performance suffered. Today, stan- sion methods typically achieve compression
dard Ethernet signaling at either 100 Mbps ratios of about 2.5, while lossy compres-
or 1 Gbps can provide adequate performance, sion can achieve as much as 40:1 compres-
as long as reasonable attention is paid to net- sion without a perceptible or diagnostic loss,
work layout and implementation. for certain types of images. While the size of
While the display component of PACS imaging examinations continues to increase,
drove networking advances in the 1990’s, the the decrease in storage cost is more rapid,
migration of PACS data from one system to lessening the demand for lossy compression.
another during an upgrade drove the next The use of lossy compression is more widely
major change—the Vendor Neutral Archive, accepted in non-radiology specialties, such as
or VNA. In the early days of PACS, updating cardiology and pathology, in part because of
the system to take advantage of new worksta- the greater uniformity of image characteris-
tion or network technology meant that the tics, allowing easy specification of acceptable
whole system, including the archive, needed ratios. A key goal of lossy compression is that
744 B. J. Erickson
it not have an adverse impact on diagnostic by the HIS or EHR system when available
value to the human or to computer aided diag- such as in a hospital or large outpatient prac-
nostic algorithms (Zheng et al. 2000). Thus, tice; they are usually performed by the RIS in
lossy methods are usually tuned to the spe- cases where there is no HIS or EHR (such as
cific diagnostic task so as not to have adverse many outpatient imaging centers).
impacts. In fact, monitoring the performance
of CAD as the ratio is changed is one method 22.2.4.2 Speech Recognition
to select the optimal compression ratio. In the past, the RIS would provide a means
Early PACS also required specialized dis- for a transcriptionist to type the text of the
play devices. At the time, standard computer report into the RIS as the radiologist dictated
displays were often 640 x 480. Medical images it (either live or via dictation system). Today,
were often more than 2048 pixels in each the vast majority of radiologists use speech
direction. Liquid crystal technology for large recognition to convert their speech into text.
displays was also not developed, meaning In some cases, the text is immediately reviewed
that the displays were large cathode ray tubes. by the radiologist and approved as final. This
These displays were large, heavy, produced model has the advantages of rapid turn-
much heat, and degraded rather rapidly. around time—the time from when the exami-
Nearly all were monochrome. Imaging also nation is ready to be reported to the time it
required a higher luminance for detection of has a final report available. In this model, a
subtle gray level distinctions than was avail- separate application (the speech recognition
able with consumer-grade displays. Today, flat system) converts the audio to an HL-7 mes-
panel technology that meets the demands of sage, which is sent to the RIS along with the
radiological interpretation is widely available final (or other appropriate status). In other
at reasonable prices. We note here that while cases, a ‘correctionist’ reviews the text created
consumer grade displays can be used, it is by the speech recognition system, and cor-
important that quality displays are used with rects it based on listening to the audio file. In
appropriate calibration. A DICOM com- this case, the radiologist must then review the
mittee has established display requirements text again to make it final, which will degrade
for medical purposes (ACR-NEMA 2006). turn-around-time.
Medical grade monitors typically have hard- There are two major advantages to using
ware built into the display to perform such speech recognition: First, it enables rapid
calibration in an automated way. turn-around time. Before speech recognition,
turn-around times of 1 week were common,
but now, turn-around times of less than 1 hour
22.2.4 Integration with Other is common (Hart et al. 2010; Krishnaral et al.
2010; Mattern et al. 1999). This improvement
Healthcare Information in turn-around time undoubtedly improves the
quality of care provided to patients. Second,
22.2.4.1 Radiology Information
it reduces staffing for radiology depart-
Systems (RIS) ments or hospitals by reducing the number
A Radiology Information System, or RIS, is of transcriptionists/correctionists needed. Of
responsible for much of the text information course, some decrease in productivity is com-
in a radiology department. Core functions of monly observed for radiologists at the time of
a RIS include capture of the interpretation implementation, which reduces the economic
for a given examination and records the status benefit (Langer 2002; Strahan and Schneider-
of an imaging examination. A RIS mayo do Kolsky 2010).
more, depending on what other systems are Over the years, many efforts have attempted
available and preferred for a given situation: to enable radiologists to generate templated
some of the functions that might be performed or structured reports (SR) from a selection
by a RIS include ordering, scheduling, and of choices in forms, and through use of drop-
22 billing. These functions are usually performed down entries in the text, macros that produce
Imaging Systems in Radiology
745 22
predetermined text phrases, and other tech- Another role for CAD is in assisting with
niques. Some of these are now used in specific diagnosis (the lesion is detected, but unsure if
situations, especially where reports have a it is cancer or infection). A common applica-
largely anticipated format and structure, e.g., tion is in determination the nature of lesions
mammography and obstetrical ultrasound, on high resolution CT images of the chest.
and macros are used in conjunction with . Figure 22.6 shows an image of the output
speech recognition approaches for certain of the experimental algorithm CALIPER,
“canned” sections or reports. Such reports rendered as a 3D image, to show the distribu-
enable efficient capture of the information for tion and change of different degrees of inter-
later data analysis, and there are also several stitial lung disease in a patient.
reports showing that most referring physi- More recently, a machine learning tech-
cians prefer templated reports because of the nique referred to as ‘Deep Learning’, has
consistency in where key information can be become popular. This technique uses neural
found. In fact, some ultrasound devices will networks and derives its ‘deep’ name because
send many of the key measurements to the it typically uses many (50+) layers in the net-
reporting system to ‘pre- populate’ a struc- work. In addition to having many more layers
tured report with many of the measurements than traditional neural networks, some forms
made during image acquisition. include several convolutional layers (hence
the name convolutional neural networks or
22.2.4.3 Computer-Aided Diagnosis CNNs) at the input that learn the features
(CAD) that produce the best output. This means that
We have described that the interpretation task it learns the best features to use, rather than
consists of detection, description, and diag- requiring a human to pre-compute them.
nosis. In some cases, the detection task can Deep Learning methods have proven very
be quite challenging, particularly for screen- effective for many of the traditional CAD
ing tasks involving mammography and chest tasks (mammography and chest CT lesion
X-rays because the incidence is rather low, and detection) and for classification of lesions.
the volume is high. Particularly at the end of They also excel at automated organ segmen-
a long shift, human observers probably have tation (U-Nets) that can be useful for many
decreased performance due to fatigue. For tasks, and this easy access to quantitative data
these cases, computer algorithms that high- will likely increase the quantitative content in
light suspicious regions of an image may be radiology reports.
useful to assure that important findings aren’t Perhaps more interesting is that CNNs
missed. Some have called this role ‘computer- have also proven effective at predicting
aided diligence’. molecular properties of tissues using routine
During the first decade of computer-aided images, even in cases where humans are not
diagnosis (CAD), the algorithms searched able to identify any differentiating features.
for specific features that radiologists thought Examples include the prediction of IDH-1
were important, and therefore the value was mutation, 1p19q chromosomal status, and
either minimal (diligence), or provides benefit MGMT methylation in brain tumors using
readers less familiar with the importance of routine T2-weighted MR.
various features (Gur and Sumkin 2006). In
the case of mammography, the lack of a clear 22.2.4.4 Advanced Visualization
benefit for experienced readers, combined CT and MR scanners provide images that can
with the reduction in productivity has resulted be thought of as 3D images, even if they are
in its near complete abandonment, and also not always truly acquired as 3D, but rather, as
its loss of reimbursement. This is a sobering a series or ‘stack’ of 2D images. Some imaging
example that computer assistance must be devices can acquire a 3D image directly and
implemented in ways that truly add value. repeatedly, thus producing a 4D image (time
746 B. J. Erickson
a b
.. Fig. 22.6 Image of the output of the experimental algorithm CALIPER, rendered as a 3D image, to show the
distribution and change of different degrees of interstitial lung disease in a patient. (a) shows the overall disease
burden of the lungs; (b) shows the disease category as a 3D rendering of the lungs; (c) shows the disease category on
a coronal section of the lungs; (d) shows the disease category labels
is the fourth dimension). In particular, cardiac volume element (voxel) is a part of the struc-
imaging benefits from 4D capability so that ture of interest or not. In the case of a CT
the beating heart can be examined throughout image of bone, segmentation is rather easy.
the cardiac cycle. Such 3D and 4D data sets If intravascular contrast is administered dur-
are large, and proper demonstration of the ing the examination, that can make it fairly
important findings requires visualization of straightforward to select vessels (arteries and/
the data. For instance, if one wishes to see a or veins depending on the timing). Soft tissue
skeletal finding using CT, one can set a thresh- organs like livers, kidneys, and muscles have
old to select bony structures, and then render been more challenging, but are now much
it using traditional computer rendering meth- more feasible with deep learning techniques.
ods. This can be done at multiple time points A description of the rendering algorithms and
to produce movies of moving structures. their trade-offs is provided in 7 Chap. 10.
The great challenge in medical visualiza- A recent advance in visualization tools is
22 tion is segmentation—deciding whether a to have the computation done on a central
Imaging Systems in Radiology
747 22
.. Fig. 22.7 Example advanced visualization of the abdominal aorta, distributed via a web client, which allows
vascular surgeons to better plan surgical options in their own office
server, with interactive segmentation and est in routinely collecting more quantitative
rendering viewed using web browsers. This information from images, because it appears
allows a much larger population of physi- that for an increasing number of diseases,
cians to have access, and can be valuable for quantitation is receiving increased attention
surgeons contemplating surgery, as well as for in clinical realms.
patient education. . Figure 22.7 provides an An SR is produced when the concepts of
example advanced visualization of the spine, a report are represented using coded termi-
distributed via a web client, which allows nology. There is a DICOM specification for
surgeons to better plan and review treatment SR, though adoption has been limited. This
options in their own office, with the patient. is because there are currently not efficient user
interfaces for creation of structured reports
22.2.4.5 Advanced Reporting in most areas of radiology. BIRADS is a
While textual reports have served medical prac- standardized way to report breast imaging,
tice well for the past century, there are oppor- with an accepted scale for findings. However,
tunities to improve reporting. Multimedia those findings are usually not also stored in
reports provide a richer representation of the an encoded format, but rather with highly
information present in the examination, and consistent text. Because BIRADS has truly
might include links from portions of the text enabled much better care of patients and
report to specific images and locations on the research, other areas of imaging are adopt-
images, moving images (‘video’), or audio files ing standardized reporting (e.g. TIRADS
such as the heart sounds. In some diseases, it for thyroid, LIRADS for liver, PIRADS for
can be important to have specific measure- prostate, just to name a few). The most widely
ments made, and possibly tracked over time. adopted example of SR is probably the send-
If these measurements are encoded in a spe- ing of specific measurements from ultrasound
cific way (using Structured Reporting or SR), scanners to reporting systems. Of course, this
it will be easier to extract and use that infor- is not the entire content of the report, and the
mation elsewhere in the medical record, and radiologist usually adds and may edit the SR
for other purposes like research. Lexicons, content. Another example where DICOM SR
such as RadLex, can be helpful in conveying may be used is for reporting radiation dose,
some of the information. There is great inter- but again, that is not even the main content
748 B. J. Erickson
of the radiological report. Some would also turn-around time, compliance with notifica-
argue that DICOM SR is more correctly tion requirements, and patient waiting times.
thought of as structured results rather than Most dashboards provide a mechanism
structure reports. to ‘drill down’ on a particular measurement.
For instance, if the patient waiting time moni-
22.2.4.6 Workflow Management tor goes ‘red’, clicking on that indicator light
(Including Dashboards) might show the waiting time by location
The ability to monitor and control events in (maybe just one facility is causing the prob-
an imaging department is critical to efficient lem), total patient volume (maybe the site is
and effective operation. Dashboards have been experiencing a spike in patient volume), or
applied in many business arenas to give quick examination time (maybe the complexity of
visual displays of Key Performance Indicators examinations is going up). Such informa-
(KPIs) for that business. Such dashboard tion is critical to enabling a timely and effec-
technology is now becoming widely used tive response to performance that is outside
within imaging departments for monitoring expected service levels.
such KPIs as report turn-around time, patient The popularity of deep-learning-based
waiting time, number of days out to schedule artificial intelligence (AI) tools for image
examinations, and revenue days-outstanding. interpretation has also driven the need for
The dashboards give people a quick view workflow orchestration: the application of
of what is happening, and can alert them to workflow management technology to collect
problem areas. . Figure 22.8 is an example data needed by an AI algorithm, assure its
of a radiology dashboard that shows impor- execution on the images and other data, and
tant departmental metrics, including report then collect and distribute the results. There
University of Informatica
Department of Radiology Dashboard
Turn-Around Time Patient Wait Time Appt Fill Rate Critical Results
Department Metrics
Personal Metrics
.. Fig. 22.8 Example of a radiology dashboard that shows important departmental metrics, including report turn-
22 around time, compliance with notification requirements, and patient waiting times
Imaging Systems in Radiology
749 22
are many reports showing AI can improve ing that they can receive DICOM images, and
efficiency (such as for quantitation of organ display them for clinicians.
or tumor size) and to improve the quality of
care in radiology departments. 22.2.4.9 Decision Support
Because medical imaging has been a major
22.2.4.7 Teleradiology component of increases in total healthcare
Teleradiology is the practice of interpreting costs, there has been much attention paid to
images at a location that is physically distant assuring that only necessary examinations
from the place where the images are collected. are performed. To help assure this, decision-
Initially, this referred to transmitting images support systems to guide proper ordering
from the hospital to the radiologist’s home in have been developed that have been shown
the middle of the night so that the physician to have an impact on utilization of imaging
did not need to drive in to see the image onsite. studies (Sistrom et al. 2009). Such systems
While this is still done, it is now common for a have been shown to decrease the total number
hospital to contract with a ‘nighthawk’ service of examinations performed, and in particular
that will provide these night-time interpreta- to decrease the number of examinations that
tions. A nighthawk service contracts with appear to be unnecessary. In addition to alert-
many hospitals—enough to keep a team of ing the user to a potentially improper order,
radiologists busy during the night. Having a educational materials are often provided
team continuously operating is usually more to help the ordering physician understand
efficient, and allows for specialization of when and what imaging examinations might
image interpretation. Teleradiology is now be appropriate for the given indication. In
also practiced during the day to balance clini- addition, such systems can provide manage-
cal workload and to provide specialized inter- ment reports to improve the understanding of
pretation on a routine basis. The technology to ordering practices.
rapidly transmit images across large distances
is widely available and inexpensive. The great-
est challenges to teleradiology are licensing/ 22.3 Imaging in Other
credentialing issues, especially if films are read Departments
across state lines or internationally (radiolo-
gists may be licensed to practice where they 22.3.1 Cardiology
review the films but not in the location where
the patients were located when the images Cardiac imaging has many similarities to
were acquired). Teleradiology is a term that is radiological imaging, and in many cases is
not heard very often mostly because transmis- performed either by radiology departments
sion of images to the best site for interpreta- or in conjunction with radiology. The pri-
tion has become routine, and implemented as mary imaging modalities for cardiology
a part of the PACS network. include echocardiography (ultrasound), cath-
eterization (interventional/vascular, involv-
22.2.4.8 Enterprise Integration ing fluoroscopy and angiography, i.e., vessel
(Including HL7, Decision visualization via contrast die administra-
Support) tion), MR, CT, and PET. The workflow can
Medicine is an information-rich business, be similar, but can be different in those cases
and providing access to the relevant informa- where the imaging is performed by the same
tion in a timely fashion is critical to success. department and even by the same individual
Integration of systems with the relevant pieces as the person who ordered it. In such cases,
of information is necessary, and in hospitals there can be less formal ordering, schedul-
this is generally done with HL-7 messages (see ing, and reporting. However, as the imaging
7 Chap. 7). Both RIS and PACS will typically is increasingly a part of the general enter-
be able to use HL7 messages, and an increasing prise, such informality will become a greater
number of EHRs are ‘image enabled’ mean- challenge.
750 B. J. Erickson
Cardiology has been aggressive in its use of images are valuable for documenting the
lossy compression. This is primarily because important findings (or lack thereof) during
the nature of cardiac imaging is much more a procedure. They are also useful for edu-
stereotyped (allowing better prediction of cational purposes, including informing the
appropriate compression). The primary focus patient of the findings and procedures carried
is the heart, whereas in radiology many dif- out. Some surgeons have suggested that medi-
ferent organs with very different appearances colegal demands will require routine capture
can challenge compression methods. The of entire surgical procedures.
echocardiographic and interventional images Such images can be more graphic and
are also much more like video—usually being revealing than radiological images, and in
motion-oriented rather than focused on static some cases have driven the expectation of
capture, and that redundancy in content need for an additional level of privacy pro-
enables effective compression. In fact, the tection. In one institution, for instance, all
major cardiovascular societies have published photographic images from the Plastic Surgery
and supported the use of specific compression department are protected from access—only
technologies and settings for cardiac imaging physician members of the Plastic Surgery
(Simon et al. 1994; ACC/ACR/NEMA Ad department can routinely view those images,
hoc Group 1995). with a process for granting temporary access
Because the focus is primarily on the heart to other care providers (Erickson et al. 2007).
and its function, cardiac imaging is more
advanced in structured reporting. The pri-
mary variables that are of interest in cardiac 22.3.4 Pathology and Dermatology
imaging (left ventricular volumes, stroke vol-
umes) are of interest in most cases, and are Pathology and dermatology have similar
routinely measured and reported. This has needs, except that dermatology includes pho-
driven the acceptance of structured report- tographic visible light images of skin lesions.
ing for the common measurements in cardiac For these purposes, consumer-grade pho-
imaging—particularly echocardiography. tographs can be sufficient, but transport-
ing those images (usually in JPEG form) to
a medical-grade imaging system will usually
22.3.2 Obstetrics and Gynecology require an import process. This process will
require confidence about the accuracy of
Obstetrical and gynecological imaging is patient and site location information. Often,
rather like cardiac imaging, except that it is the JPEG images are then ‘wrapped’ with
nearly always ultrasound imaging. Much like DICOM information to assure that the con-
cardiac imaging, there are a well-defined and nection between a photograph and a patient
accepted set of measurements and observa- exists at the file level, rather than via a link
tions expected for the routine obstetric exam, to a filename. Image viewers require color
and as such, structured reports for these find- capability, but otherwise are not substantially
ings are widely used. Estimates of fetal ges- different from what is provided in most radio-
tational age and development are typically logical image viewers. If images are converted
automatically computed, on the basis of mea- to DICOM, the archive system is usually able
surements using well-tested prediction models. to store them without difficulty.
Microscopic images represent a bigger
challenge. At this point, there are 2 strategies
22.3.3 Intraoperative/Endoscopic for the capture of microscope images: the first
Visible Light is whole slide digitization in which the entire
specimen is digitized; the other is capture of
It is now common to capture still images and specific views that are of interest. In both
video of endoscopic procedures as well as cases, though more so in whole-slide imaging,
22 traditional open surgical procedures. These there is a need for multi-resolution viewing.
Imaging Systems in Radiology
751 22
That is because the workflow is very different. it is challenging for most hospitals to use the
Whole-slide scanning is usually done prior to images on CDs. In some cases, the images
the pathologist reviewing the images, while the can be imported into the PACS, but that can
spot-capture is done by the pathologist at the cause confusion about where the study was
time of viewing. In the former case, the com- done, and can be challenging for the RIS to
puter performs the pan-zoom function, while represent this (who ordered the CD exam, and
in the latter case the optical microscope per- where the report is located). In addition, there
forms that function. In the former, the com- are important data integrity issues—as much
puter is a diagnostic device, while in the latter, as 0.1% of CDs have been shown to include
it is used mostly for documentation purposes. images for patients other than the intended
Whole-slide scanning is of greater interest, patient (Erickson 2011), leading to important
because it has greater possibilities for improv- health delivery and legal risks.
ing healthcare delivery by allowing the slides
to go to the pathologist. It also represents a
much greater challenge, because much larger 22.4.2 irect Network Image
D
amounts of data must be stored and the asso- Exchange
ciated computer-based viewing application
requires the ability to display low resolution The problems with CD image exchange noted
and high resolution images. Such images are above, as well as the time delays and costs, have
typically 1GB in size. Computing low resolu- driven many institutions to use internet trans-
tion images from high resolution images can fer mechanisms. In cases where there is a high
be computational expensive. volume and a high level of trust, one can estab-
Another important issue differentiating lish virtual private networks (VPNs) that allow
pathology imaging from radiological imag- secure transfer between two institutions. While
ing is that retrieval of old images is much less this allows rapid and low-cost transfer, it still
common in pathology. Follow-up of disease is requires confident patient identification, and
biopsy is much less common than with radio- a method for importing the images into some
logical imaging, so comparison with priors form of clinical viewer. It also requires a hospi-
is much less common. Recall would usually tal set up an extensive network of VPNs, which
only occur when there is a medicolegal issue, is something that can be difficult to secure.
or possibly in case of disease recurrence or This has led to the creation of the ‘image
metastasis, where there is a need to compare sharing’ industry. This industry has developed
the older sample with a newer one. internet tools that allow images to be securely
transferred from one hospital to another. The
transport mechanism is usually proprietary,
22.4 Cross-Enterprise Imaging so transfer between hospitals is most efficient
if both use the same vendor system. However,
22.4.1 CD Image Exchange nearly all provide a ‘Dropbox’-like method
where a weblink can be used to either send
When images were stored on film, sharing or receive images. There are also efforts to
images with another hospital required that the develop standard interchange methods that
films be either physically transferred or cop- are both efficient and secure.
ied. Copying a film was labor-intensive and
expensive. Therefore, it was standard practice
to ‘loan’ films to other facilities when needed. 22.5 uture Directions for Imaging
F
With digital images, it is much easier to Systems
copy the digital data onto media like a com-
pact disc (CD) and give that to the patient. The increasing capabilities of mobile devices
There are well-accepted standards (DICOM) and the increasing expectations of ready
for how to store the images on a CD. However, access to medical professionals have driven
752 B. J. Erickson
imaging onto mobile devices. At present, the Dreyer, K. J. (2006). PACS: A guide to the digital
FDA has limited the use of such devices for revolution. New York: Springer. This is also a
diagnosis. On the other hand, these devices book focused on the practical aspects of
can be extremely useful for consultation on implementing and maintaining a digital imag-
specific areas of an image, or when therapeu- ing department. Its format is that of a tradi-
tic options are being considered, or for patient tional textbook, and covers a broad range of
communication. As the bandwidth and dis- topics.
play qualities improve, these devices will likely Liu, Y., & Wang, J. (2011). PACS and Digital
play an increasing role in both diagnosis and Medicine. Boca Raton: CRC Press. This book
therapy planning. goes into greater detail of the technology of
Cloud technology is becoming an impor- PACS, and to a lesser degree RIS and
tant technology for image archival and trans- EMR. This is a very good resource for those
fer. The ability to leverage efficiencies of scale interested in more details of DICOM and how
is an important economic driver that is push- a PACS can be configured to address specific
ing many smaller imaging providers to use needs.
cloud-based storage. The use of cloud meth-
ods for image exchange was also described ??Questions for Discussion
above. The increasing use of computation 1. What are the Pros and Cons of a highly
intensive diagnostic aids has also driven the structured technology like DICOM?
use of cloud-based CAD tools. DICOM has been highly successful in
Phenome characterization (see 7 Chap. terms of adoption as a standard, and
26) is becoming an important aspect of the virtually all image communication
move to individualizing medicine sometimes utilizes it. This differs markedly from
referred to as ‘precision medicine’. Deep some other standards. What are factors
learning is a technology that will likely cause that have contributed to this success,
rapid advancement, as it allows both rapid and what lessons can be drawn from
automated segmentation of all major organs this in terms of how to promote
in an image, but also characterization of tis- adoption of standards in the future?
sue textures. The ability to predict genomic 2. If one were to design medical imaging
information from such images will likely pro- systems today, would the optimal design
duce a new generation of ‘precision imaging’ continue to have PACS and RIS as sepa-
applications. rate systems, or would they be combined
into one system? Should these be sepa-
nnSuggested Readings rate from the EHR?
Birkfelliner, W. (2010). Applied medical image 3. What are the ways in which radiology
processing: A basic course. CRC Press. As the reports of examination interpretations
title says, this is an introductory book, with can be generated, and what are the
many excellent explanations and example advantages and disadvantages of each
code (mostly MatLab). approach, in terms of ease and efficiency
Branstetter, B. F. (2009). Practical imaging infor- of report creation, timeliness of avail-
matics. New York: Springer. As its title implies, ability of report to clinicians, usefulness
this book is a practice-oriented book primarily for retrieval of cases for research and
aimed at those responsible for implementing education?
and maintaining a digital imaging practice. The 4. In these days of high bandwidth and low
format of the book is an outline with many storage costs, is there still a good reason
practical tips from a wide variety of experts. to use lossy compression in medical
Dougherty, G. (2009). Digital image processing imaging? What kinds of trends are likely
for medical applications. Cambridge to affect image growth, as part of the
University Press. This is an excellent, practical patient’s medical record?
book on concepts of image processing algo- 5. What are the arguments for maintaining
22 rithms used in medical imaging. raw rather than compressed data (not
Imaging Systems in Radiology
753 22
only for imaging data but for compres- Proc First Internat Conf on Image Management and
sion or summarization of other types of Communication in Patient Care: Implementation
and Impact (IMAC 89), Los Alamitos, CA.
data)?
Gur, D., & Sumkin, J. (2006). CAD in screening mam-
6. Describe a classification of ways in mography. AJR, 187, 1474. https://doi.org/10.2214/
which image data are used in medical AJR.06.1384.
decision making. Hart, J., McBride, A., Blunt, D., Gishen, P., &
7. What are the data management Strickland, N. (2010). Immediate and sustained ben-
efits of a “total” implementation of speech recogni-
implications of using a separate
tion reporting. The British Journal of Radiology,
advanced visualization system for 83(989), 424–427.
clinicians that is distinct from the Krishnaral, A., Lee, J., Laws, S., & Crawford, T. (2010).
PACS used by radiologists for Voice recognition software: Effect on radiology
interpretations? What if the report turnaround time at an academic medical cen-
ter. AJR. American Journal of Roentgenology,
radiologists use that system in addition
195(1), 194–197.
to the PACS? Langer, S. (2002). Impact of speech recognition on radi-
ologist productivity. Journal of Digital Imaging,
15(4), 203–209.
References Mattern, C., Erickson, B., King, B., & Okryznski, T.
(1999). Impact of electronic imaging on clinician
behavior in the urgent care setting. Journal of
ACC/ACR/NEMA Ad Hoc Group. (1995). American
Digital Imaging, 12(2 Suppl 1), 148–151.
College of Cardiology, American College of
Simon, R., Brennecke, R., Hess, O., Meier, B., Reiber,
Radiology and industry develop standard for digital
H., & Zeelenberg, C. (1994). Report of the ESC task
transfer of angiographic images. Journal of the
force on digital imaging in cardiology.
American College of Cardiology, 25, 800–802.
Recommendations for digital imaging in angiocar-
ACR-NEMA. (2006). Digital imaging and communica-
diography. European Heart Journal, 15, 1332–1334.
tions in medicine (DICOM) Part 14: Grayscale
Sistrom, C., Dang, P., Weilburg, J., Dreyer, K.,
Standard Display Function. Rosslyn: National
Rosenthal, D., & Thrall, J. (2009). Effect of comput-
Electrical Manufacturers Association.
erized order entry with integrated decision support
Erickson, B. (2011). Experience with importation of
on the growth of outpatient procedure volumes:
electronic images into the medical record from phys-
Seven-year time series analysis. Radiology, 251, 147–
ical media. Journal of Digital Imaging, 24(4), 694–
155.
699.
Strahan, R., & Schneider-Kolsky, M. (2010). Voice rec-
Erickson, B., & Hangiandreou, N. (1998). The evolution
ognition versus transcriptionist: Error rates and
of electronic imaging in the medical environment.
productivity in MRI reporting. Journal of Medical
Journal of Digital Imaging, 11(3 suppl 1), 71–74.
Imaging and Radiation Oncology, 54(5), 411–414.
Erickson, B., Ryan, W., Gehring, D., & Pynadath, A.
Zheng, B., Sumkin, J., Good, W., Maitz, G., Chang, Y.,
(2007). Top 10 features clinicians require in an
& Gur, D. (2000). Applying computer-assisted
image viewer. SIIM News (Summer 2007).
detection schemes to digitized mammograms after
Greenes, R. (1989, June). The radiologist as clinical activ-
JPEG data compression: An assessment. Academic
ist: A time to focus outward. Paper presented at the
Radiology, 7(8), 595–602.
755 23
Information Retrieval
William Hersh
Contents
References – 789
Information Retrieval
757 23
nnLearning Objectives does that mean? Although there are many
After reading this chapter, you should know ways to classify biomedical information and
the answers to these questions: the informatics applications that use them, in
55 What types of online knowledge-based this chapter we will broadly divide them into
information are available and useful to two categories. Patient-specific information
clinicians, biomedical researchers, and applies to individual patients. Its purpose is to
consumers? document and increasingly analyze for health
55 What are the major components of the care providers, administrators, and research-
information retrieval process? ers about the health and disease of a patient.
55 What are the major categories of avail- This information historically came from the
able knowledge-based information? patient’s medical record but now can come
55 How do techniques differ for indexing from many different sources, including mobile
various types of knowledge-based bio- and wearable devices. The second category of
medical information? biomedical information is knowledge-based
55 What are the major approaches to information. This is information that has been
retrieval of knowledge-based biomedi- derived and organized from observational or
cal information? experimental research. In the case of clinical
55 How effectively do searchers use infor- research, this information provides clinicians,
mation retrieval systems? administrators, and researchers with knowl-
55 What are the important research direc- edge derived from experiments and observa-
tions in information retrieval? tions, which can then be applied to individual
55 What are the major challenges to mak- patients. This information has historically
ing digital libraries effective for health been provided in books and journals but can
and biomedical users? take a wide variety of other forms, including
clinical practice guidelines, consumer health
literature, Web sites, and so forth. The distinc-
23.1 Introduction tion between these two types of information is
blurred by the growing amount of data that
Information retrieval (IR), sometimes called comes from people and is used to derive
search, is the field concerned with the acquisi- knowledge.
tion, organization, and searching of A basic overview of the IR process is
knowledge-based information (Hersh 2020). shown in . Fig. 23.1 and forms the basis for
Although biomedical IR has traditionally most of this chapter. The overall goal of IR or
concentrated on the retrieval of text from the search is to find content that meets a person’s
biomedical literature, the domain over which
IR can be effectively applied has broadened
considerably with the advent of multimedia Metadata
publishing and vast storehouses of images, Retrieval Indexing
video, chemical structures, gene and protein
sequences, and a wide range of other digital
Queries Content
material and artifacts of relevance to biomed-
ical education, research, and patient care.
With the proliferation of IR systems and
online content, the notion of the library has Search
changed substantially, and new digital librar- engine
ies have emerged (Lindberg and Humphreys
.. Fig. 23.1 Basic overview of the information retrieval
2005).
process. Retrieval is made possible via metadata, which
IR systems and digital libraries histori- is produced via indexing and applied in queries by users.
cally existed to store and disseminate The metadata is used by the search engine, which directs
knowledge-based information. What exactly the user to the content
758 W. Hersh
information needs. This is done by posing a index for the single best-matching subject
23 query to the IR system. A search engine heading and then be directed to citations of
matches the query to content items through published articles.
metadata, which is “data about data” that The printed Index Medicus served as the
describes the content items (Foulonneau and main biomedical IR source until 1971, when
Riley 2008). There are two intellectual pro- the National Library of Medicine (NLM)1
cesses of IR. Indexing is the process of assign- unveiled an electronic version, the Medical
ing metadata to content items, while retrieval Literature Analysis and Retrieval System
is the process of the user entering his or her (MEDLARS), which had been cataloging
query and retrieving content items. bibliographic records since 1966 (Miles 1982).
Because computing power and disk storage
were very limited, MEDLARS and its follow-
23.2 Evolution of Biomedical on MEDLARS Online (MEDLINE), stored
Information Retrieval only limited information for each article, such
as author name(s), article title, journal source,
As with many chapters in this volume, IR has and publication date. In addition, the NLM
changed substantially over the five editions of assigned to each article a number of terms
this book. In the first edition, this chapter was from its Medical Subject Headings (MeSH)
titled “Bibliographic-Retrieval Systems,” vocabulary. Searching was done by users hav-
reflecting the predominant type of knowledge ing to mail a paper search form to the NLM
that was accessible at the time. The second edi- and receiving results back a few weeks later.
tion saw the emergence of the World Wide Only librarians who had completed a special-
Web (WWW or Web) as a delivery mechanism ized course were allowed to submit searches.
for knowledge-based information. In the third As computing power grew and disk stor-
edition, “Digital Libraries” was added to the age became more plentiful in the 1980s, full-
chapter name, reflecting that the entire bio- text databases began to emerge. These new
medical library and beyond was now part of databases allowed searching of the entire text
available online knowledge. The fourth edition of medical documents. Although lacking
reflected the ubiquitous nature of information graphics, images, and tables from the original
on computers, smartphones, tablets, and other source, these databases made it possible to
devices. This fifth edition recognizes that digi- retrieve the full text of important documents
tal data, information, and knowledge have quickly from remote locations. Likewise, with
become their primary medium, i.e., even the growth of computer networks, end users
though the world still has plenty of paper, the were now allowed to search the databases
source of most information in modern times is directly, though at a substantial cost.
digital. Essentially all articles, books, patient In the early 1990s, the pace of change in
records, etc. are primarily in digital form and the IR field quickened. The advent of the
mainly printed for the convenience of reading. Web and the exponentially increasing power
Although this chapter focuses on the use of computers and networks brought a world
of computers to facilitate IR, methods for where vast quantities of medical information
finding and retrieving information from medi- from multiple sources with various media
cal sources have been in existence for nearly a extensions were now available over the global
century and a half. In 1879 Dr. John Shaw Internet (Berners-Lee et al. 1994). In the late
Billings created Index Medicus to help medi- 1990s, the NLM made all of its databases
cal professionals find relevant journal articles available to the entire world for free. Also
(DeBakey 1991). Journal article citations were during this time, the notion of digital librar-
indexed by author name(s) and subject ies developed, with the recognition that the
heading(s) and then aggregated in bound vol-
umes. A scientist or practitioner seeking an
article on a topic could manually search the 1 7 https://www.nlm.nih.gov/
Information Retrieval
759 23
entire array of knowledge-based information stored. Libraries actually perform a variety of
could be accessed using this technology functions, including the following:
(Borgman 1999). 55 Acquisition and maintenance of collec-
Into the twenty-first century, use of IR tions
systems and digital libraries has become ubiq- 55 Cataloging and classification of items in
uitous. Estimates vary, but among individuals collections to make them more accessible
who use the Internet in the United States, over to users
80% have used it to search for information 55 Serving as a place where individuals can
relevant to their own health or that of an get assistance with seeking information,
acquaintance (Fox 2011; Taylor 2010). including information on computers
Virtually all physicians use the Internet 55 Providing work or study space (particu-
(Anonymous 2012). Furthermore, access to larly in universities)
systems has gone beyond the traditional per-
sonal computer and extended to new devices, Digital libraries provide some of the same ser-
such as smartphones and tablet devices. vices, but their focus tends to be on the digital
aspects of content.
23.3 Knowledge-Based
Information in Health 23.3.1 Information Needs
and Biomedicine and Information Seeking
Knowledge-based information can be subdi- Different users of knowledge-based informa-
vided into two categories. Primary knowl- tion have differing needs based on the nature
edge–based information (also called primary of their information need and available
literature) is original research that appears in resources. The information needs and infor-
journals, books, reports, and other sources. mation seeking of physicians have been most
This type of information reports the initial extensively studied. Gorman defined four
discovery of health knowledge, usually with states of information need in the clinical con-
either original data or reanalysis of data (e.g., text (Gorman and Helfand 1995):
systematic reviews, sometimes with meta- 55 Unrecognized need—clinician unaware of
analysis). Secondary knowledge–based infor- information need or knowledge deficit
mation consists of the writing that reviews, 55 Recognized need—clinician aware of need
condenses, and/or synthesizes the primary lit- but may or may not pursue it
erature. The most common examples of this 55 Pursued need—information seeking
type of literature are books, monographs, and occurs but may or may not be successful
review articles. Secondary literature includes 55 Satisfied need—information seeking
the growing quality of patient/consumer- successful
oriented health information that is increas-
ingly available via the Web. It also encompasses There is a great deal of evidence that the
opinion-based writing (such as editorials and majority of information needs are not being
position or policy papers), clinical practice satisfied and that IR applications may help.
guidelines, narrative reviews, and health infor- Among the reasons that physicians do not
mation on Web pages. In addition, it includes adhere to the most up-to-date clinical prac-
the plethora of pocket-sized manuals that tices is that they often do not recognize that
were formerly a staple for practitioners in their knowledge is incomplete. While this is
many professional fields. As will be seen later, not the only reason for such practices, the evi-
secondary literature is the most common type dence is compelling. For example, physicians
of literature used by physicians. do not always provide patients with most up-
Libraries have been the historical place to-date care (McGlynn et al. 2003), do not
where knowledge-based information has been adhere to established guidelines (Diamond
760 W. Hersh
and Kaul 2008), and vary widely in how they to electronic publishing of journals have been
23 provide care (Wennberg 2010). overcome, such that virtually all scientific
Studies from the late twentieth century journals are published electronically now. A
found that when physicians recognize an modern Internet connection is sufficient to
information need, they were likely to pursue deliver most of the content of journals. When
only a minority of unanswered questions. available in electronic form, journal content is
These studies found that physicians in prac- easier and more convenient to access.
tice had unmet information needs on the Furthermore, since most scientists have the
desire for widespread dissemination of their
order of two questions for every three patients
seen and only pursued answers for about 30% work, they have incentive for their papers to
of these questions (Covell et al. 1985; Ely be available electronically.
et al. 1999; Gorman and Helfand 1995). When The technical challenges to electronic
answers to questions were actually pursued, scholarly publication have been replaced by
these studies showed that the most frequent economic and political ones (Hersh and
source for answers to questions was col- Rindfleisch 2000; Sox 2009). Printing and
leagues, followed by paper-based textbooks. mailing, tasks no longer needed in electronic
publishing, comprised a significant part of the
Therefore, it is not surprising that barriers to
“added value” from publishers of journals.
satisfying information needs remain (Ely et al.
2002). Physicians use electronic sources more There is still however value added by publish-
now than were measured in these earlier stud- ers, such as hiring and managing editorial
ies, with the widespread use of the electronicstaff to produce the journals, and managing
health record (EHR) as well as ubiquity of the peer review process. Even if publishing
portable smartphones and tablets, although companies, as they currently exist today, were
less research is undertaken in modern times to vanish, there would still be some cost to the
assessing their needs. Another approach to production of journals. Thus, while the cost
facilitating access to knowledge-based infor- of producing journals electronically is likely
mation has been to link it more directly with to be less, it is not zero, and even if journal
the context of the patient in the EHR (Cimino content is distributed “free,” someone has to
and delFiol 2007). pay the production costs. The economic issue
The information needs of other users have in electronic publishing, then, is who is going
been less well-studied. For consumers, surveysto pay for the production of journals (Sox
found about 80% of all Internet users searched2009). This introduces some political issues as
for personal health information (Fox and well. One of them centers around the concern
Duggan 2013). The most common type of that much research is publicly funded through
grants from federal agencies such as the
search focuses on a specific disease or medical
problem (66% of all who have searched), fol- National Institutes of Health (NIH) and the
lowed by a specific medical treatment or pro- National Science Foundation (NSF). In the
cedure (56%). Consumers also use the Web to current system, especially in the biomedical
sciences (and to a lesser extent in other sci-
search for physicians, health care institutions,
and health insurance. Even less studied have ences), researchers turn over the copyright of
been the information needs of researchers, buttheir publications to journal publishers. The
one recurrent finding is the idiosyncratic political concern is that the public funds the
nature of their use of IR and other systems research and the universities carry it out, but
(Bartlett and Toms 2005). individuals and libraries then must buy it back
from the publishers to whom they willingly
cede the copyright. This problem is exacer-
23.3.2 Changes in Publishing bated by the general decline in funding for
libraries.
The Internet and the Web have had a pro- One solution to this problem has been the
found impact on the publishing of knowledge- emergence of open-access (OA) publishing.
based information. The technical impediments The premise of the OA model is that the
Information Retrieval
761 23
c ontent is made freely available electronically, its grants to be submitted to PMC, either in
with the costs of production covered by other the form published by publishers or as a PDF
funding sources (Frank 2013). The most com- of the last manuscript prior to journal accep-
mon funding source is the research funder, tance.6 Publishers have expressed concern that
and most funders consider OA publishing copyrights give journals more control over the
costs to be an allowable expense on research integrity of the papers they publish (Drazen
grants. While some have expressed concern and Curfman 2004). An alternative approach,
that OA may give rise to financial incentives advocated by non-commercial (professional
for excess publishing, many OA journals have society) publishers is the Washington DC
over a decade of experience demonstrating Principles for Free Access to Science,7 which
traditional peer review is compatible with OA advocates:
publishing. Another concern has been the 55 Reinvestment of revenues in support of
ability of resource-poor scientists to afford science.
OA publishing, although most OA journals 55 Use of open archives such as PMC as
have “hardship” policies that allow waiver of allowed by business constraints.
the publication fee. 55 Commitment to some free publication,
There has been the emergence of two mod- access by low-income countries, and no
els of OA publishing (Frank 2013). One is OA charges to publish.
Gold, where the author (usually through the
research funding) pays the cost of production. One adverse outcome of OA publishing has
Publishing charges are typically only a tiny been the emergence of so-called predatory
fraction of the overall cost of research, esti- journals (Haug 2013). These journals exist
mated to be about 0.3% (Zerhouni 2004). Two mainly to make money, allowing authors to
early publishers operating under this model publish in impressive-sounding titles with lit-
were Biomed Central (BMC, now owned by tle or no peer review. Predatory journals offer
Springer)2 and the Public Library of Science inexpensive publishing and send massive
(PLoS).3 Many commercial publishers now emails to academic faculty members offering
offer authors the ability to publish under OA, to publish or even serve on editorial boards.
and some journals have developed “Open” sis- Some authors have exposed the process by
ter journals, such as JAMA, BMJ, and writing clearly fake papers (McCool 2017),
JAMIA. while others have issued calls for efforts to
A second model is OA Green, where stop it (Moher and Moher 2016).
authors are required to deposit the manu- Some consider OA to be part of larger
script, either the published manuscript or the “open science,” consisting of (Anonymous
last draft of the manuscript prior to typeset- 2018b):
ting, into public repositories such as PubMed 55 Open data – all data collected in research
Central (PMC).4 PMC is a repository of life 55 Open source – all software code developed
science research that provides free access while and used
allowing publishers to maintain copyright 55 Open methodology – clear and detailed
and even optionally keep the papers housed description; availability of all surveys and
on their own servers. A lag time of up to other tools
6 months is allowed so that journals can reap 55 Open peer review – all comments of peer
the revenue that comes with initial publica- reviewers
tion. The National Institutes of Health
(NIH)5 now requires all research funded by The move to predominant electronic publica-
tion of science has led many to advocate for
growing access to the underlying data from
2 7 https://www.biomedcentral.com/
3 7 https://www.plos.org/
4 7 https://www.ncbi.nlm.nih.gov/pmc/ 6 7 https://publicaccess.nih.gov/
5 7 https://www.nih.gov/ 7 7 http://www.dcprinciples.org/
762 W. Hersh
research studies. While this has been a stan- 23.3.3 Quality of Information
23 dard practice in genomics and related areas
for many years (i.e., depositing genome In the early days of the growth of the Internet
sequences in GenBank as a condition of pub- and the Web, another concern was the quality
lication), there have been concerns that have of information available. A large fraction of
impeded this approach in clinical studies. Web-based health information has always
The case for requiring publication of scien- been aimed at nonprofessional audiences.
tific data is strong. As most research is taxpayer- Many lauded this development as empower-
funded, it only seems fair that those who paid ing those most directly affected by health
are entitled to all the data for which they paid. care—those who consumed it (Eysenbach
Likewise, all of the subjects were real people et al. 1999). Others expressed concern about
who potentially took risks to participate in the patients misunderstanding or being purposely
research, and their data should be used for dis- misled by incorrect or inappropriately inter-
covery of knowledge to the fullest extent pos- preted information (Jadad 1999). Some clini-
sible. In addition, new discoveries may emerge cians also lamented the growing amount of
from re-analysis of data. For these reasons, the time required to go through stacks of print-
International Committee of Medical Journal outs downloaded by patients and brought to
Editors (ICMJE) called for de-identified data their offices. The Web was inherently demo-
from randomized controlled trials to be shared cratic, allowing anyone to post information.
as condition of publication (Taichman et al. However, this was potentially at odds with the
2017). Other have called for data access to be operation of a professional field, particularly
FAIR (finable, accessible, interoperable, and one like health care, where practitioners were
reusable) (Wilkinson et al. 2016). ethically bound and legally required to adhere
Some researchers, however, have pushed to the highest standard of care. Thus, a major
back on this notion. They argue that those concern with health information on the Web
who carry out the work of designing, imple- is the presence of inaccurate or out-of-date
menting, and evaluating experiments certainly information. An early systematic review of
have some exclusive rights to the data gener- studies assessing the quality of health infor-
ated by their work. Some also question mation found that 55 of 79 studies came to
whether the cost is a good expenditure of lim- the conclusion that quality of information
ited research dollars, especially since the was a problem (Eysenbach et al. 2002). More
demand for such data sets may be modest and recent studies continue to show the variable
the benefit is not clear. One group of 282 quality of health information on the Web
researchers in 33 countries, the International (Kitchens et al. 2014).
Consortium of Investigators for Fairness in A more recent problem, extending beyond
Trial Data Sharing, notes that there are risks, health care, has been the proliferation of
such as misleading or inaccurate analyses as “fake news” and “alternative facts,” often pro-
well as efforts aimed at discrediting or under- mulgated by those understanding how to
mining the original research (Anonymous manipulate search engines and social media
2016). They also express concern about the (Vosoughi et al. 2018; Wenzel 2017). A related
costs, given that there are over 27,000 RCTs area is “adversarial” IR, where the goal is to
performed each year. As such, this group calls not retrieve information the user does not
for an embargo on reuse of data for 2 years want to see or should not see (Castillo and
plus another half-year for each year of the Davison 2011). One major research organiza-
length of the RCT. Even those who support tion has lamented the larger societal impacts
data sharing point out the requirement for of such “truth decay,” lamenting “erosion of
proper curation, wide availability to all civil discourse, political paralysis, alienation
researchers, and appropriate credit to and and disengagement of individuals from politi-
involvement of those who originally obtained cal and civic institutions” (Kavanagh and
the data (Merson et al. 2016). Rich 2018).
Information Retrieval
763 23
The impact of poor-quality health infor- 23.3.4 Evidence-Based Medicine
mation is unclear. People were harmed by
incorrect and misleading health information The growing quantity of clinical information
long before the emergence of digital informa- available in IR systems and digital libraries
tion. One well-known self-help expert argued requires new approaches to select that which
that patients and consumers actually are savvy is best to use for clinical decisions. The phi-
enough to understand the limits of quality of losophy guiding this approach is evidence-
information on the Web. This view held that based medicine (EBM), which can be viewed a
patients and consumers should be trusted to set of tools to inform clinical decision-making.
discern quality using their own abilities to It allows clinical experience (“art”) to be inte-
consult different sources of information and grated with best clinical science (Guyatt et al.
to communicate with health care practitioners 2014, 2015). Also, EBM makes the medical
and with others who share their condition(s) literature more clinically applicable and rele-
(Ferguson 2002). Indeed, the ideal situation vant. In addition, it requires the user to be fac-
may be a partnership among patients and ile with computers and IR systems. The
their health care practitioners, as it has been process of EBM involves three general steps:
shown that patients desire that their practitio- 55 Phrasing a clinical question that is perti-
ners be the primary source of recommenda- nent and answerable.
tions for online information (Tang et al. 1997). 55 Identifying evidence (studies in articles)
This concern about quality of information that address the question.
led a number of individuals and organizations 55 Critically appraising the evidence to deter-
to develop guidelines for assessing the quality mine whether it applies to the patient.
of health information. One of the earliest and
most widely quoted set of criteria was pub- The phrasing of the clinical question is an
lished in JAMA (Silberg et al. 1997). These often-overlooked portion of the EBM pro-
criteria stated that Web pages should contain cess. There are two general types of clinical
the name, affiliation, and credentials of the question: background questions and fore-
author; references to the claims made; explicit ground questions (Guyatt et al. 2014, 2015).
listing of any perceived or real conflict of Background questions ask for general knowl-
interest; and date of most recent update. edge about a disorder, whereas foreground
Another early set of criteria was the Health on questions ask for knowledge about managing
the Net (HON)8 codes, a set of voluntary patients with a disorder. Background ques-
codes of conduct for health-related Web sites. tions are generally best answered with text-
Sites that adhere to the HON codes can dis- books and classical review articles, whereas
play the HON logo. Another approach to foreground questions are answered using
insuring Web site quality is accreditation by a EBM techniques. There are four major fore-
third party. URAC (formerly, the Utilization ground question categories:
Review Accreditation Commission) has a pro- 55 Therapy (or intervention)—benefit of
cess for such accreditation.9 The URAC stan- treatment or prevention.
dards cover six general issues: health content 55 Diagnosis—test diagnosing disease.
editorial process, disclosure of financial rela- 55 Harm—detrimental health effects of a dis-
tionships, linking to other Web sites, privacy ease, environmental exposure (natural or
and security, consumer complaint mecha- man-made), or medical intervention.
nisms, and internal processes required to 55 Prognosis—outcome of disease course.
maintain quality over time.
Identifying evidence involves selecting the
best evidence for a given type of question.
EBM proponents advocate, for example, that
8 7 https://www.hon.ch/en/
9 7 https://www.urac.org/programs/health-web-site-
randomized controlled trials or a systematic
accreditation review (with or without meta-analysis) that
764 W. Hersh
combines multiple trials provide the best evi- 23.4.1 Bibliographic Content
23 dence for or against particular health care
interventions. Likewise, diagnostic test accu- The first category consists of bibliographic
racy is best assessed with comparison to a content. It includes what was for decades the
known gold standard in an appropriate spec- mainstay of IR systems: literature reference
trum of patients to whom the test will be databases. Also called bibliographic data-
applied (see 7 Chap. 3). Questions of harm bases, this content consists of citations or
can be answered by randomized controlled pointers to the medical literature (i.e., journal
trials when it is ethical to do so; otherwise articles). The best-known and most widely
they are best answered with observational used biomedical bibliographic database is
case control or cohort studies. There are MEDLINE, which contains bibliographic ref-
checklists of attributes for these different erences to all of the biomedical articles, edito-
types of studies that allow their critical rials, and letters to the editors in approximately
appraisal and applicability to a given patient 5000 scientific journals. The journals are cho-
in the EBM resources described above. sen for inclusion by an advisory committee of
The original approach to EBM has evolved subject experts convened by NIH. At present,
over time, with less emphasis on critically over 800,000 references are added to
appraising original evidence and more on syn- MEDLINE yearly. It contained over 24 mil-
thesized evidence being made readily available lion references by the end of 2017.10
to clinicians, usually through electronic The current MEDLINE record contains
sources, including clinical decision support over 60 fields.11 A clinician may be interested
systems (see 7 Chap. 26) (DiCenso et al. in just a handful of these fields, such as the
2009; Hersh 1999). There have also been a title, abstract, and indexing terms. But other
number of criticisms of EBM, with argu- fields contain specific information that may be
ments that it may ignore clinical expertise of great importance to other audiences. The
(Haynes et al. 2002), patient values (Guyatt Supplementary Information (SI) field con-
et al. 2004), and other ways of “knowing” tains links to records in many different data
(Sim 2016). Others express concerns that its banks, from clinical trials registries to genom-
methods have been “distorted” (Greenhalgh ics and other “omics” databases.12 Even the
et al. 2014), “hijacked” by commercial forces clinician may, however, derive benefit from
and other self-interest (Ioannidis 2016a, some of the other fields. For example, the
2017), and subverted by the political process Publication Type (PT) field can help in the
(Patashnik et al. 2017). A final criticism is the application of EBM, such as when one is
proliferation of systematic reviews, not all of searching for a practice guideline or a ran-
which may be motivated by objective science domized controlled trial.
(Ioannidis 2016b). MEDLINE records are also assigned
other identifiers. The PubMed ID (PMID) is a
unique identifier for records in the database.
23.4 Content of Knowledge-Based Another identifier in MEDLINE is the
Information Resources PubMed Central ID (PMCID), assigned to
records whose articles have been deposited
The previous sections of this chapter have into PMC. MEDLINE also contains an
described some of the issues and concerns Author Identifier (AUID) field, which allows
surrounding the production and use of three possible identifiers, the most common of
knowledge-based information in biomedicine.
It is useful to classify the information to gain
a better understanding of its structure and 10 7 https://www.nlm.nih.gov/bsd/index_stats_comp.
function. In this section, we classify content html
into bibliographic, full-text, annotated, and 11 7 https://www.nlm.nih.gov/bsd/mms/medlineele-
ments.html
aggregated categories, although some content
12 7 https://www.nlm.nih.gov/bsd/medline_data-
does not neatly fit within them. bank_source.html
Information Retrieval
765 23
which is the ORCID identifier,13 a unique category; see 7 Sect. 23.4.4, below). In gen-
identifier for scientific authors (e.g., 0000- eral, the former contains only links to other
0002-4114-5148 for this author). A growing pages and sites, while the latter include actual
number of journals and other publications content that is highly integrated with other
make use of the ORCID. resources. Some well-known Web catalogs
MEDLINE can be accessed without include:
charge via the PubMed system,14 produced by 55 HON Select19—a European catalog of
the National Center for Biotechnology quality-filtered, clinician-oriented Web
Information (NCBI)15 of the NLM. Some content from the HON foundation.
other information vendors, such as Ovid 55 Translating Research into Practice
Technologies,16 license the content of (TRIP)20—a database of content deemed
MEDLINE and other databases and provide to meet high standards of EBM.
value-added services that can be accessed for a
fee by individuals and institutions. A specialized registry specific to healthcare
The NLM used to offer a number of other, was the National Guidelines Clearinghouse
more focused bibliographic databases, but (NGC). Produced by the Agency for
these have all been folded into MEDLINE. A Healthcare Research and Quality (AHRQ), it
number of other publishers offer biomedical contained exhaustive information about clini-
bibliographic databases. The major non-NLM cal practice guidelines. In 2018, AHRQ shut
database for the nursing field is the Cumulative down the National Guidelines Clearinghouse
Index to Nursing and Allied Health Literature (Munn and Qaseem 2018). The original con-
(CINAHL),17 which covers nursing and allied tractor developing the NGC was the non-
health literature, including physical therapy, profit research firm, ECRI, which developed a
occupational therapy, laboratory technology, new site, ECRI Guidelines Trust.21
health education, physician assistants, and A final kind of bibliographic-like content
medical records. Another well-known biblio- consists of RSS feeds (originally RDF Site
graphic database is EMBASE,18 which is pro- Summary, often dubbed “Really Simple
duced by the commercial publisher, Elsevier Syndication”), which are short summaries of
(Amsterdam, Netherlands). EMBASE con- Web content, typically news, journal articles,
tains over 28 million records and covers a blog postings, and other content. Users set up
superset of journals from MEDLINE. These an RSS aggregator, which can be though a
journals are often important for those carry- Web browser, email client, or standalone soft-
ing out systematic reviews and meta-analyses, ware, configured for the RSS feed desired,
which need access to all the studies published with an option to add a filter for specific con-
across the world. tent. There are two versions of RSS (1.0 and
A second, more modern type of biblio- 2.0) but both provide:
graphic content is the Web catalog. There are 55 Title—name of item
increasing numbers of such catalogs, which 55 Link—URL to content
consist of Web pages containing mainly links 55 Description—a brief description of the
to other Web pages and sites. It should be content
noted that there is a blurry distinction between
Web catalogs and aggregations (the fourth
23.4.2 Full-text Content
nally consisted of the online versions of books standalone reference is Online Mendelian
23 and periodicals. As already noted, just about Inheritance in Man (OMIM),24 which is con-
all traditionally paper-based medical content, tinually updated with new information about
from journals to textbooks, is now available the genomic causes of human disease.
electronically. The electronic versions usually Electronic textbooks offer additional fea-
have identical content to paper versions but tures beyond text from the print version.
may be enhanced by measures ranging from While many print textbooks do feature high-
the provision of supplemental data in a jour- quality images, electronic versions offer the
nal article to linkages and multimedia content ability to have more pictures and illustrations.
in a textbook. The final component of this They also have the ability to use sound and
category is the Web site. Admittedly, the diver- video, although few do at this time. As with
sity of information on Web sites is enormous, full-text journals, electronic textbooks can
and sites may include every other type of con- link to other resources, including journal ref-
tent described in this chapter. However, in the erences and the full articles. Many Web-based
context of this category, “Web site” refers to textbook sites also provide access to continu-
the vast number of static and dynamic Web ing education self-assessment questions and
pages at a discrete Web location. medical news. Finally, electronic textbooks let
One of the fields in MEDLINE is the uni- authors and publishers provide more frequent
form resource locator (URL) for the publish- updates of the information than is allowed by
er’s full text of the article, allowing linkage the usual cycle of print editions, where new
directly from the bibliographic database to the versions come out only every few years.
full text. This link is active when the PubMed As noted above, Web sites are another
record is displayed, but users may be met by a form of full-text information. Probably the
password screen if the article is not available most effective provider of Web-based health
for free. Many sites allow both access to sub- information is the U.S. government. Not only
scribers or a pay-per-view facility. Many aca- do they produce bibliographic databases, but
demic organizations now maintain large the NLM, AHRQ, the National Cancer
numbers of subscriptions to journals available Institute (NCI), Centers for Disease Control
to faculty, staff, and students. Other publish- (CDC), and others have also been innovative
ers, such as Ovid, provide access within their in providing comprehensive full-text informa-
own password-protected interfaces to articles tion for health care providers and consumers.
from journals that they have licensed for use One example is the popular CDC Travel site.25
in their systems. Some of these will be described later as aggre-
The most common secondary literature gations, since they provide many different
source is textbooks, almost all of which are types of resources.
available in electronic form. A common A large number of commercial biomedical
approach with textbooks is bundling them, and health Web sites have emerged in recent
sometimes with linkages across the bundled years. On the consumer side, they include
texts. An early bundler of textbooks was Stat!- more than just collections of text; they also
Ref22 that, like many, began as a CD-ROM include interaction with experts, online stores,
product and then moved to the Web. Most and catalogs of links to other sites. Some well-
other large publishers have now similarly known examples include Mayo Clinic26 and
aggregated their libraries of textbooks and WebMD.27 There are also Web sites, either
other content. Another collection of text- from medical professional societies or compa-
books is the NCBI Bookshelf,23 which con- nies, which provide information geared toward
tains many volumes on biomedical research
topics. Initially published by NCBI but now a
24 7 http://www.omim.org/
25 7 https://wwwnc.cdc.gov/travel
22 7 http://statref.com/ 26 7 https://www.mayoclinic.org
23 7 https://www.ncbi.nlm.nih.gov/books 27 7 https://www.webmd.com
Information Retrieval
767 23
health care providers, typically overviews of 55 Visible Human Project30 – collection of
diseases, their diagnosis, and treatment; medi- three-dimensional representations of nor-
cal news and other resources for providers are mal male and female bodies, consisting of
often offered as well. cross-sectional slices of cadavers, with sec-
Other sources of on-line health-related tions of 1 mm thickness in the male and
content include encyclopedias, the so-called 0.3 mm thickness in the female (Spitzer
body of knowledge (BOK; the complete set of et al. 1996). Also available from each cadaver
concepts, terms and activities that make up a are transverse computerized tomography
professional domain), and Weblogs or blogs. and magnetic resonance images.
A well-known online encyclopedia with a 55 Images from the History of Medicine31 –
great deal of health-related information is online access to images from the historical
Wikipedia,28 which features a distributed collections of the NLM.
authorship process whose content has been 55 Open-I32 – collection of images from
found to reliable (Giles 2005; Nicholson 2006) PubMed Central papers (Demner-
and frequently shows up near the top in Fushman et al. 2012).
health-related Web searches (Laurent and
Vickers 2009). A growing number of organi- Many genomics databases are available on the
zations have a body of knowledge, such as the Web. The first issue each year of the journal
American Health Information Management Nucleic Acids Research (NAR) catalogs and
Association (AHIMA)29 Blogs tend to carry a describes these databases, and is now available
stream of consciousness but often high- by open access means (Rigden and Fernández
quality information is posted within them. 2020). NAR also maintains an ongoing data-
base of such databases, the Molecular Biology
Database Collection.33 Among the most
23.4.3 Annotated Content important of these databases are those avail-
able from NCBI (Anonymous 2018a). All
The third category consists of annotated con- their databases are linked among themselves,
tent. These resources are usually not stored as along with PubMed and OMIM, and are
freestanding Web pages but instead are often searchable via the NCBI Search system.34
housed in database management systems. More details on the specific content of genom-
This content can be further subcategorized ics databases is provided in 7 Chap. 28.
into discrete information types: Citation databases provide linkages to
55 Image databases—collections of images articles that cite others across the scientific lit-
from radiology, pathology, and other areas erature. The earliest citation databases were
55 Genomics databases—information from the Science Citation Index (SCI) and Social
gene sequencing, protein characterization, Science Citation Index (SSCI), which are now
and other genomic research part of the larger Web of Science (Clarivate
55 Citation databases—bibliographic link- Analytics, Philadelphia, PA). Two well-known
ages of scientific literature bibliographic databases for biomedical and
55 EBM databases—highly structured collec- health topics that also have citation links
tions of clinical evidence include SCOPUS35 and Google Scholar.36 A
55 Other databases—miscellaneous other
collections
final citation database of note is CiteSeer,37 vide linkage for use of data sets from biomed-
23 which focuses on computer and information ical research (Ohno-Machado et al. 2017).
science, including biomedical informatics.
EBM databases are devoted to providing
annotated evidence-based information. Some 23.4.4 Aggregated Content
examples (all available with through subscrip-
tion fees) include: The final category consists of aggregations of
55 Cochrane Database of Systematic content from the first three categories. The
Reviews—one of the original collections distinction between this category and some of
of systematic reviews38 the highly-linked types of content described
55 BMJ Best Practice39 above is admittedly blurry, but aggregations
55 JAMA Evidence40 typically have a wide variety of different types
55 Up-to-Date—content centered around of information serving the diverse needs of
clinical questions41 users. Aggregated content has been developed
55 Essential Evidence Plus42 for all types of users from consumers to clini-
cians to scientists.
There is a growing market for a related type of Probably the largest aggregated consumer
evidence-based content in the form of clinical information resource is MedlinePlus48 from
decision support order sets, rules, and health/ the NLM. MedlinePlus includes all of the
disease management templates. Publishers types of content previously described, aggre-
include EHR vendors whose systems employ gated for easy access to a given topic.
this content as well as other vendors such as MedlinePlus contains health topics, drug
Zynx43 and Provation.44 information, medical dictionaries, directories,
There are a variety of other annotated and other resources. Each topic contains links
content. The ClinicalTrials.gov database45 to health information from the NIH and other
began as a database of clinical trials spon- sources deemed credible by its selectors. There
sored by NIH. After concerns about clinical are also links to current health news (updated
trials having their protocols altered after the daily), a medical encyclopedia, drug refer-
start of trials, ClinicalTrials.gov expanded its ences, and directories, along with a preformed
scope to be a registry of all clinical trials PubMed search related to the topic.
(DeAngelis et al. 2005; Zarin et al. 2017) and Another well-known group of aggrega-
to contain actual results of trials (Zarin et al. tions of content for genomics researchers is
2011). Another important database for the model organism databases. These data-
researchers is NIH RePORTER,46 which is a bases bring together bibliographic databases,
database of all grant awards funded by full text, and databases of sequences, struc-
NIH. An additional annotated resource is ture, and function for organisms whose
DataMed,47 which aims to catalog and pro- genomic data have been highly characterized.
One of the oldest and most developed model
organism databases is the Mouse Genome
Informatics resource.49 More details are pro-
37 7 http://citeseerx.ist.psu.edu vided in 7 Chap. 28.
38 7 https://www.cochranelibrary.com
39 7 https://bestpractice.bmj.com/info/evidence-infor-
mation/
40 7 https://jamaevidence.mhmedical.com 23.5 Indexing
41 7 https://www.uptodate.com/home
42 7 http://www.essentialevidenceplus.com As noted at the beginning of the chapter,
43 7 https://www.zynxhealth.com/ indexing is the process of assigning metadata
44 7 https://www.provationmedical.com/order-set-
management
45 7 https://clinicaltrials.gov/
46 7 http://projectreporter.nih.gov/reporter.cfm 48 7 https://medlineplus.gov/
47 7 https://datamed.org 49 7 http://www.informatics.jax.org/
Information Retrieval
769 23
to content to facilitate its retrieval. Most mod- contains relationships between terms, which
ern commercial content is indexed in two typically fall into three categories:
ways: 55 Hierarchical—terms that are broader or
1. Manual indexing—where human indexers, narrower. The hierarchical organization
usually using a controlled terminology, not only provides an overview of the struc-
assign indexing terms and attributes to ture of a thesaurus but also can be used to
documents, often following a specific pro- enhance searching (e.g., MeSH tree explo-
tocol. sions that add terms from an entire por-
2. Automated indexing—where computers tion of the hierarchy to augment a search).
make the indexing assignments, usually 55 Synonym—terms that are synonyms,
limited to breaking out each word in the allowing the indexer or searcher to express
document (or part of the document) as an a concept in different words.
indexing term. 55 Related—terms that are not synonymous
or hierarchical but are somehow otherwise
Manual indexing is done most commonly related. These usually remind the searcher
with bibliographic databases and annotated of different but related terms that may
content. In this age of proliferating electronic enhance a search.
content, such as online textbooks, practice
guidelines, and multimedia collections, man- The MeSH terminology is used to manually
ual indexing has become either too expensive index most of the databases produced by the
or outright unfeasible for the quantity and NLM (Coletti and Bleich 2001). The latest
diversity of material now available. Thus there version contains over 28,000 subject headings
are increasing numbers of databases that are (the word MeSH uses for the canonical repre-
indexed only by automated means. Before sentation of its concepts). It also contains
covering these types of indexing in detail, let over 90,000 synonyms to those terms, which
us first discuss controlled terminologies. in MeSH jargon are called entry terms. MeSH
also contains Supplementary Concept
Records, representing 230,000 additional
23.5.1 Controlled Terminologies chemicals, drugs, genes, organisms, etc. that
indexers encounter in indexing process and
A controlled terminology contains a set of map to MeSH headings.
terms that can be applied to a task, such as MeSH contains the three types of relation-
indexing. When the terminology defines the ships described previously:
terms, it is usually called a vocabulary. When 55 Hierarchical—MeSH is organized hierar-
it contains variants or synonyms of terms, it is chically into 16 trees, such as Diseases,
also called a thesaurus. Before discussing Organisms, and Chemicals and Drugs
actual terminologies, it is useful to define 55 Synonym—MeSH contains a vast number
some terms. A concept is an idea or object that of entry terms, which are synonyms of the
exists in the world, such as the condition headings
under which human blood pressure is ele- 55 Related—terms that may be useful for
vated. A term is the actual string of one or searchers to add to their searches when
more words that represent a concept, such as appropriate are suggested for many head-
“Hypertension” or “High Blood Pressure”. ings
One of these string forms is the preferred or
canonical form, such as “Hypertension” in the The MeSH terminology files, their associated
present example. When one or more terms can data, and their supporting documentation are
represent a concept, the different terms are available on the NLM’s MeSH Web site.50
called synonyms. There is also a browser that facilitates explo-
A controlled terminology usually contains
a list of terms that are the canonical represen-
tations of the concepts. If it is a thesaurus, it 50 7 http://www.nlm.nih.gov/mesh/
770 W. Hersh
ration of the terminology51 as well as a tool publication type Review or Review Literature.
for mapping text to MeSH terms, called Or, to find studies that provide the best evi-
MeSH on Demand.52 . Figure 23.2 shows a dence for a therapy, the publication type
slice through the MeSH hierarchy for Meta-Analysis, Randomized Controlled Trial,
“Hypertension” and related cardiovascular or Controlled Clinical Trial would be used.
diseases in the C. Diseases tree. MeSH is not the only thesaurus used for
There are features of MeSH designed to indexing biomedical documents. A number of
assist indexers in making documents more other thesauri are used to index non-NLM
retrievable. One of these is subheadings, which databases. CINAHL, for example, uses the
are qualifiers of subject headings that narrow CINAHL Subject Headings, which are based
the focus of a term. In Hypertension, for on MeSH but have additional domain-specific
example, the focus of an article may be on the terms added. EMBASE has a terminology
diagnosis, epidemiology, or treatment of the called EMTREE,53 which has many features
condition. Another feature of MeSH that similar to those of MeSH.
helps retrieval is check tags. These are MeSH One problem with controlled terminolo-
terms that represent certain facets of medical gies, not limited to IR systems, is their prolif-
studies, such as age, gender, human or nonhu- eration. As already described in 7 Chap. 8,
man, and type of grant support. Related to there is great need for linkage across these dif-
check tags are the geographical locations in ferent terminologies. This was the primary
one particular part of the MeSH hierarchy motivation for the Unified Medical Language
(called the “Z tree”, because their term codes System (UMLS) Project,54 which was under-
start with “Z”). Indexers must also include taken in the 1980s to address this problem
these, like check tags, since the location of a (Humphreys et al. 1998). There are three com-
study (e.g., Oregon) must be indicated. ponents of the UMLS Knowledge Sources:
Another feature gaining increasing impor- the Metathesaurus, the Semantic Network,
tance for EBM and other purposes is the pub- and the Specialist Lexicon. The Metathesaurus
lication type, which describes the type of component of the UMLS links parts or all of
publication or the type of study. A searcher over 100 terminologies (Bodenreider 2004).
who wants a review of a topic may choose the
53 7 https://www.elsevier.com/solutions/embase-bio-
51 7 http://www.nlm.nih.gov/mesh/MBrowser.html medical-research
52 7 https://meshb.nlm.nih.gov/MeSHonDemand 54 7 https://www.nlm.nih.gov/research/umls/
Information Retrieval
771 23
.. Fig. 23.3 Concepts, Atrial Fibrillation
terms, and strings for the Concept af
Metathesaurus concept
atrial fibrillation. Each
string may occur in more afib
Atrial Fibrillation
than one vocabulary, in
which case each would be Terms
a fib
an atom. (Courtesy of Auricular Fibrillation
National Library of AF-Atrial
Medicine, with Fibrillation
permission)
Str ings Auricular Fibrillation, Auricular
Fibrillation Auricular Fibrillations
In the Metathesaurus, all terms that are both terms are several strings that vary in
conceptually the same are linked together as a word order and case.
concept. Each concept may have one or more The Metathesaurus contains a wealth of
terms, each of which represents an expression additional information. In addition to the
of the concept from a source terminology that synonym relationships between concepts,
is not just a simple lexical variant (i.e., differs terms, and strings described earlier, there are
only in word ending or order). Each term may also non-synonym relationships between con-
consist of one or more strings that represent cepts. There are a great many attributes for the
all the lexical variants that are represented for concepts, terms, strings, and atoms, such as
that term in the source terminologies. One of definitions, lexical types, and occurrence in
each term’s strings is designated as the pre- various data sources. Also provided with the
ferred form, and the preferred string of the Metathesaurus is a word index that connects
preferred term is known as the canonical form each word to all the strings it occurs in, along
of the concept. There are rules of precedence with its concept, term, string, and atomic
for determining the canonical form, the main identifiers.
one being that the MeSH heading is used if
one of the source terminologies for the con-
cept is MeSH. 23.5.2 Manual Indexing
Each Metathesaurus concept has a single
concept unique identifier (CUI). Each term Manual indexing is most commonly done for
has one term unique identifier (LUI), all of bibliographic and annotated content,
which are linked to the one (or more) CUIs although it is sometimes for other types of
with which they are associated. Likewise, each content as well. Manual indexing is usually
string has one string unique identifier (SUI), done by means of a controlled terminology of
which is likewise linked to the LUIs in which terms and attributes. Most databases utilizing
they occur. In addition, each string has an human indexing usually have a detailed proto-
atomic unique identifier (AUI) that represents col for assignment of indexing terms from the
information from each instance of the string thesaurus. The MEDLINE database is no
in each vocabulary. . Figure 23.3 depicts the exception. The principles of MEDLINE
English-language concepts, terms, and strings indexing were laid out in the two-volume
for the Metathesaurus concept atrial fibrilla- MEDLARS Indexing Manual (Charen 1976,
tion. (Each string may occur in more than one 1983). Subsequent modifications have
vocabulary, in which case each would be an occurred with changes to MEDLINE, other
atom.) The canonical form of the concept and databases, and MeSH over the years. The
one of its terms is atrial fibrillation. Within major concepts of the article, usually from
772 W. Hersh
two to five headings, are designed as main International Organization for Standards
23 headings, and designated in the MEDLINE (ISO), ISO Standard 15836:2009.
record by an asterisk. The indexer is also There have been some medical adaptations
required to assign appropriate subheadings. of the DCMI. The most developed of these is
Finally, the indexer must also assign check the Catalogue et Index des Sites Médicaux
tags, geographical locations, and publication Francophones (CISMeF)56 (Darmoni et al.
types. Although MEDLINE indexing is still 2000). A catalog of French-language health
manual, indexers are aided by a variety of resources on the Web, CISMeF has used
electronic tools for selecting and assigning DCMI to catalog Web pages, including infor-
MeSH terms. mation resources (e.g., practice guidelines,
Few full-text resources are manually consensus development conferences), organi-
indexed. One type of indexing that commonly zations (e.g., hospitals, medical schools, phar-
takes place with full-text resources, especially maceutical companies), and databases. The
in the print world, is that performed for the Subject field uses the French translation of
index at the back of the book. However, this MeSH but also includes the English transla-
information is rarely used in IR systems; tion. For Type, a list of common Web
instead, most online textbooks rely on auto- resources has been enumerated.
mated indexing (see 7 Sect. 23.5.3, below). While Dublin Core Metadata was origi-
Manual indexing of Web content would nally envisioned to be included in Hypertext
likewise be challenging. With billions of pages Markup Language (HTML) Web pages, it
of content, manual indexing of more than a became apparent that many non-HTML
fraction of it is not feasible. On the other resources exist on the Web and that there are
hand, the lack of a coherent index makes reasons to store metadata external to Web
searching much more difficult, especially pages. For example, authors of Web pages
when specific resource types are being sought. might not be the best people to index pages or
A simple form of manual indexing of the Web other entities might wish to add value by their
takes place in the development of the Web own indexing of content. An emerging stan-
catalogs and aggregations as described earlier. dard for cataloging metadata is the Resource
These catalogs contain not only explicit index- Description Framework (RDF) (Akerkar
ing about subjects and other attributes, but 2009).
also implicit indexing about the quality of a A second approach to manually indexing
given resource by the decision of whether to content on the Web has been to create directo-
include it in the catalog. ries of content. The first major effort to create
Two major approaches to manual indexing these was for use in the Yahoo! search engine,57
have emerged on the Web that are often com- which created a subject hierarchy and assigned
plementary. The first approach, that of apply- Web sites to elements within it. When concern
ing metadata to Web pages and sites, is began to emerge that the Yahoo directory was
exemplified by the Dublin Core Metadata proprietary and not necessarily representative
Initiative (DCMI)55 (Weibel and Koch 2000). of the Web community at large, an alternative
The goal of the DCMI has been to develop a movement sprung up: the Open Directory
set of standard data elements that creators of Project (dmoz.org). Due to increasing growth
Web resources can use to apply metadata to of the Web, these projects were eventually dis-
their content. The specification has defined 15 banded.
elements, as shown in . Table 23.1. The Manual indexing has a number of limita-
DCMI was recently approved as a standard tions, the most significant of which is
by the National Information Standards inconsistency. Funk and Reid (Funk and Reid
Organization (NISO) with the designation 1983) evaluated indexing inconsistency in
Z39.85. It is also a standard with the
56 7 http://www.chu-rouen.fr/cismef/
55 7 http://dublincore.org/ 57 7 https://www.yahoo.com/
Information Retrieval
773 23
Element Definition
the standpoint of an IR system, words are A commonly used approach for term
23 descriptors of documents, just like human- weighting is TF*IDF weighting, which com-
assigned indexing terms. Most retrieval sys- bines the inverse document frequency (IDF)
tems actually use a hybrid of human and word and term frequency (TF). The IDF is the loga-
indexing, in that the human-assigned indexing rithm of the ratio of the total number of doc-
terms become part of the document, which uments to the number of documents in which
can then be searched by using the whole con- the term occurs. It is assigned once for each
trolled term or individual words within it. term in the database, and it correlates inversely
With the development of full-text resources in with the frequency of the term in the entire
the 1980s and 1990s, systems that allowed database. The usual formula used is:
only word indexing began to emerge. This
IDF term log
trend increased with the advent of the Web. number of documents in database
1
Word indexing is typically done by defin-
number of docum
ments with term
ing all consecutive alphanumeric sequences (23.1)
between white space (which consists of spaces, The TF is a measure of the frequency with
punctuation, carriage returns, and other non- which a term occurs in a given document and
alphanumeric characters) as words. Systems is assigned to each term in each document,
must take particular care to apply the same with the usual formula:
process to documents and the user’s query,
especially with characters such as hyphens TF term,document frequency of term in
and apostrophes. Many systems go beyond document
simple identification of words and attempt to (23.2)
assign weights to words that represent their
importance in the document (Salton 1991). In TF*IDF weighting, the two terms are com-
Many systems using word indexing employ bined to form the indexing weight, WEIGHT:
processes to remove common words or con-
WEIGHT term,document
flate words to common forms. The former
consists of filtering to remove stop words, TF term,document IDF term
which are common words that always occur (23.3)
with high frequency and are usually of little
value in searching. The stop word list, also Another automated indexing approach gener-
called a negative dictionary, varies in size from ating increased interest is the use of link-based
the seven words of the original MEDLARS methods, fueled by the success of the Google
stop list (and, an, by, from, of, the, with) to the search engine.59 This approach gives weight to
list of 250–500 words more typically used. pages based on how often they are cited by
Examples of the latter are the 250-word list of other pages. The PageRank (PR) algorithm is
van Rijsbergen (1979), the 471-word list of mathematically complex, but can be viewed as
Fox (1992), and the PubMed stop list.58 giving more weight to a Web page based on
Conflation of words to common forms is done the number of other pages that link to it (Brin
via stemming, the purpose of which is to and Page 1998). Thus, the home page of the
ensure words with plurals and common suf- NLM or a major medical journal is likely to
fixes (e.g., -ed, -ing, -er, -al) are always indexed have a very high PR, whereas a more obscure
by their stem form (Frakes 1992). For exam- page will have a lower PR.
ple, the words cough, coughs, and coughing General-purpose search engines such as
are all indexed via their stem cough. Both stop Google and Microsoft Bing use word-based
word remove and stemming reduce the size of approaches and variants of the PageRank
indexing files and lead to more efficient query algorithm for indexing. They amass the con-
processing. tent in their search systems by “crawling” the
Web, collecting and indexing every object they
58 7 https://www.ncbi.nlm.nih.gov/books/NBK3827/
table/pubmedhelp.T.stopwords/ 59 7 https://www.google.com/
Information Retrieval
775 23
find on the Web. This includes not only searching, on the other hand, recognizes the
HTML pages, but other files as well, including inexact nature of both indexing and retrieval,
Microsoft Word, Portable Document Format and instead attempts to return to the user con-
(PDF), and images. tent ranked by how close it comes to the user’s
Word indexing has a number of limita- query. After general explanations of these
tions, including: approaches, we will describe actual systems
55 Synonymy—different words may have the that access the different types of biomedical
same meaning, such as high and elevated. content.
This problem may extend to the level of
phrases with no words in common, such as
the synonyms hypertension and high blood 23.6.1 Exact-Match Retrieval
pressure.
55 Polysemy—the same word may have dif- In exact-match searching, the IR system gives
ferent meanings or senses. For example, the user all documents that exactly match the
the word lead can refer to an element or to criteria specified in the search statement(s).
a part of an electrocardiogram machine. Since the Boolean operators AND, OR, and
55 Content—words in a document may not NOT are usually required to create a manage-
reflect its focus. For example, an article able set of documents, this type of searching is
describing hypertension may make men- often called Boolean searching. Furthermore,
tion in passing to other concepts, such as since the user typically builds sets of docu-
congestive heart failure (CHF) that are not ments that are manipulated with the Boolean
the focus of the article. operators, this approach is also called set-
55 Context—words take on meaning based based searching. Most of the early operational
on other words around them. For example, IR systems in the 1950s through 1970s used
the relatively common words high, blood, the exact-match approach, even though Salton
and pressure, take on added meaning when was developing the partial-match approach in
occurring together in the phrase high research systems during that time (Salton and
blood pressure. McGill 1983). Currently, exact-match search-
55 Morphology—words can have suffixes ing tends to be associated with retrieval from
that do not change the underlying mean- bibliographic and annotated databases, while
ing, such as indicators of plurals, various the partial-match approach tends to be used
participles, adjectival forms of nouns, and with full-text searching.
nominalized forms of adjectives. Typically the first step in exact-match
55 Granularity—queries and documents may retrieval is to select terms to build sets. Other
describe concepts at different levels of a attributes, such as the author name, publica-
hierarchy. For example, a user might query tion type, or gene identifier (in the secondary
for antibiotics in the treatment of a spe- source identifier field of MEDLINE), may be
cific infection, but the documents might selected to build sets as well. Once the search
describe specific antibiotics themselves, term(s) and attribute(s) have been selected,
such as penicillin. they are combined with the Boolean opera-
tors. The Boolean AND operator is typically
7 Chapter 9 on Natural Language Processing used to narrow a retrieval set to contain only
(NLP) describes automated methods for documents with two or more concepts. The
addressing these limitations. Boolean OR operator is usually used when
there is more than one way to express a con-
cept. The Boolean NOT operator is often
23.6 Retrieval employed as a subtraction operator that is
applied to a pair of sets, with the result being
There are two broad approaches to retrieval. the documents found in the first set but not in
Exact-match searching allows the user precise the second set. Some systems more accurately
control over the items retrieved. Partial-match call this the ANDNOT operator.
776 W. Hersh
Some systems allow terms in searches to neered the approach in the 1960s (Salton and
23 be expanded by using the wild-card character, McGill 1983). Although partial-match
which adds all words to the search that begin searching does not exclude the use of non-
with the letters up until the wild-card charac- term attributes of documents, and for that
ter. This approach is also called truncation. matter does not even exclude the use of
Unfortunately, there is no standard approach Boolean operators (e.g., Salton et al. 1983),
to using wild-card characters, so syntax for the most common use of this type of search-
them varies from system to system. PubMed, ing is with a query of a small number of
for example, allows a single asterisk at the endwords, also known as a natural language
of a word to signify a wild-card character. query. Because Salton’s approach was based
Thus the query word can* will lead to the on vector mathematics, it is also referred to as
words cancer and Candida, among others, the vector-space model of IR. In the partial-
being added to the search. match approach, documents are typically
ranked by their closeness of fit to the query.
That is, documents containing more query
23.6.2 Partial-Match Retrieval terms will likely be ranked higher, since those
with more query terms will in general be more
Although partial-match searching was con- likely to be relevant to the user. As a result
ceptualized very early, it did not see wide- this process is called relevance ranking. The
spread use in IR systems until the advent of entire approach has also been called lexical–
Web search engines in the 1990s. This is most statistical retrieval.
likely because exact-match searching tends to The most common approach to document
be preferred by “power users” whereas partial- ranking in partial-match searching is to give
match searching is preferred by novice search- each a score based on the sum of the weights
ers. Whereas exact-match searching requires of terms common to the document and query.
an understanding of Boolean operators and Terms in documents typically derive their
(often) the underlying structure of databases weight from the TF*IDF calculation described
(e.g., the many fields in MEDLINE), partial- above. Terms in queries are typically given a
match searching allows a user to simply enter weight of one if the term is present and zero if
a few terms and start retrieving documents. it is absent. The following formula can then be
The development of partial-match search- used to calculate the document weight across
ing is usually attributed to Salton, who pio- all query terms:
This may be thought of as a giant OR of all tive for many diverse test collections
query terms, with sorting of the matching (Robertson and Walker 1994). Another
documents by weight. The usual approach is approach showing efficacy in research has
for the system to then perform the same stop been the use of language modeling techniques
word removal and stemming of the query that (Zhai and Lafferty 2004). More recently, the
was done in the indexing process. (The equiv- learning-to-rank approach has found value
alent stemming operations must be performed for machine learning approaches in the rele-
on documents and queries so that comple- vance ranking process (Li 2011). The large
mentary word stems will match). search engines such as Google make use of
A number of other ranking algorithms other information in their results, such as aim-
have been developed over the years. The ing to provide answers to queries that are
BM25 approach has been found to be effec- likely to be questions (e.g., defining words or
Information Retrieval
777 23
.. Fig. 23.4 Search results from PubMed, showing query, results, and ability to set limits (on the left side of the
window). (Courtesy of National Library of Medicine, with permission)
listing airplane flights) and using other fea- automatic term mapping, the system attempts
tures of Web pages, such as their geographic to map user input, in succession, to MeSH
location (e.g., a user querying for restaurants). terms, journals names, common phrases, and
authors. Remaining text that PubMed cannot
map is searched as text words (i.e., words that
23.6.3 Retrieval Systems occur in any of the MEDLINE fields). A
results screen for a search combining the dis-
This section describes searching systems used ease congestive heart failure (CHF) and the
to retrieve content from the four categories angiotensin converting enzyme (ACE) inhibi-
previously described in 7 Sect. 23.4. tor class of drugs is shown in . Fig. 23.4.
As noted above, PubMed is the system at PubMed allows the use of wild-card char-
NLM that searches MEDLINE and other acters. It also allows phrase searching whereby
bibliographic databases. Although presenting two or more words can be enclosed in quota-
the user with a simple text box, PubMed does tion marks to indicate they must occur adja-
a great deal of processing of the user’s input cent to each other. If the specified phrase is in
to identify MeSH terms, author names, com- PubMed’s phrase index, then it will be
mon phrases, and journal names (described in searched as a phrase. Otherwise the individual
the on-line help system of PubMed). In this words will be searched. PubMed allows speci-
778 W. Hersh
23
.. Fig. 23.5 PubMed Advanced Search Builder, showing the use of sets and application of Boolean operators as
well as limits. (Courtesy of National Library of Medicine, with permission)
fication of other indexing attributes via enter the search string congestive heart failure
“Limits.” These include publication types, and ACI inhibitors. . Figure 23.5 shows the
subsets, age ranges, and publication date PubMed Advanced Search Builder screen
ranges. These are accessed from the left-hand such a searcher might develop. This searcher
side of the results screen, with the most com- has limited the output (using some of the lim-
monly used ones shown and the others acces- its shown in . Fig. 23.4) with various publi-
sible by additional mouse clicks. cation types known to contain the best
As in most bibliographic systems, users evidence for this question. Also note that the
can also search PubMed by building search search does not require the “and,” as PubMed
sets and then combining them with Boolean determines the Boolean operator should be
operators to tailor the search. This is called placed there automatically.
the PubMed Advanced Search Builder. Most MEDLINE systems have ranked
Consider a user searching for studies assessing output sorted by reverse chronological order,
the reduction of mortality in patients with based on the notion that the most recent arti-
CHF through the use of ACE inhibitors. A cles have the mostly timely and complete
simple approach to such a search might be to information. PubMed has also featured
combine the terms ACE inhibitors and CHF relevance-ranked output and recently
with an AND. The easiest way to do this is to improved its algorithms (Fiorini et al. 2018).
Information Retrieval
779 23
.. Fig. 23.6 Search screen for NLM NCBI Search, showing the variety of different databases that can be search on
the NLM site. (Courtesy of National Library of Medicine, with permission)
PubMed has an additional approach to by the usual automatic term mapping and the
finding the best evidence, which is through the resulting output is limited (via AND) with the
use of its Clinical Queries function,60 where appropriate statement.
the subject terms are limited by search state- A growing number of search engines allow
ments designed to retrieve the best evidence searching over many resources. The general
based on principles of EBM. There are two search engines Google, Microsoft Bing, and
different approaches. The first uses strategies others allow retrieval of any types of docu-
for retrieving the best evidence for the four ments they have indexed via their Web crawl-
major types of clinical questions. These strat- ing activities. Other search engines allow
egies arise from research assessing the ability searching over aggregations of various
of MEDLINE search statements to identify sources, such as NCBI Search,61 which allows
the best studies for therapy, diagnosis, harm, searching over all NLM NCBI databases and
and prognosis (Haynes et al. 1994). The sec- other resources in one simple interface, as
ond approach to retrieving the best evidence shown in . Fig. 23.6.
aims to retrieve evidence-based resources that
are syntheses and synopses, in particular
meta-analyses, systematic reviews, and prac- 23.7 Evaluation
tice guidelines. The strategy derives in part
from research by Boynton et al. (Boynton There has been a great deal of research over
et al. 1998). When the clinical queries inter- the years devoted to evaluation of IR systems.
face is used, the search statement is processed As with many areas of research, there is con-
60 7 https://www.ncbi.nlm.nih.gov/pubmed/clinical 61 7 https://www.ncbi.nlm.nih.gov/search/
780 W. Hersh
troversy as to which approaches to evaluation be reported high. In the most recent update
23 best provide results that can assess searching of her ongoing survey of health-related
in the systems they are using. Many frame- searching, Fox found that 72% of US adult
works have been developed to put the results Internet users (59% of all US adults) have
in context. One of those frameworks orga- looked for health information in the last year
nized evaluation around six questions that (Fox and Duggan 2013). Earlier research
someone advocating the use of IR systems found that the most common types of
might ask (Hersh and Hickam 1998): searches done by these users was for a specific
1. Was the system used? disease or medical condition and for a certain
2. For what was the system used? medical treatment or procedure (Fox 2011).
3. How well did they use the system? Three focus groups convened by Mayo Clinic
4. Were the users satisfied? researchers asked consumers about their
5. What factors were associated with success- online searching use and needs, finding that
ful or unsuccessful use of the system? subjects reported searching, filtering, and
6. Did the system have an impact? comparing information retrieved, with the
process stopping due to saturation and fatigue
While early evaluation studies asked questions (Fiksdal et al. 2014).
about whether IR systems would be used if Most evaluation research has focused on
made available, in modern times their use is the third question from the above list, i.e.,
ubiquitous. A study by Google and Manhattan how well did search systems or their users
Research found that essentially all physicians perform? The rest of this section on evalua-
reported searching on digital devices daily, tion will focus on studies of that question,
with most searching resulting in action, such grouping approaches and studies into those
as changing treatment decisions or sharing that are system-oriented, i.e., the focus of the
with a colleague or patient (Anonymous 2012). evaluation is on the IR system, and those
Similarly, a recent study of internal medicine that are user-oriented, i.e., the focus is on the
residents at three sites found that nearly all user.
responding to the survey searched daily, with
the most common resource searched being
UpToDate (Duran-Nelson et al. 2013). The 23.7.1 System-Oriented Evaluation
next most frequent source of information was
consultation with attending faculty, followed There are many ways to evaluate the perfor-
by the Google search engine, the Epocrates mance of IR systems, the most widely used
drug reference, and various other “pocket” of which are the relevance-based measures
references. Another study of family medicine of recall and precision. These measures quan-
resident and attending physicians found uni- tify the number of relevant documents
versal use of smartphones and tablet devices retrieved by the user from the database and
daily in their practices (Yaman et al. 2016). in his or her search. Recall is the proportion
Patient and consumer searching of the of relevant documents retrieved from the
Web for health information also continues to database:
In other words, recall answers the question, of relevant documents for a query is known.
for a given search, what fraction of all the rel- For all but the smallest of databases, how-
evant documents have been obtained from the ever, it is unlikely, perhaps even impossible,
database? for one to succeed in identifying all relevant
One problem with Eq. (23.5) is that the documents in a database. Thus most studies
denominator implies that the total number use the measure of relative recall, where the
Information Retrieval
781 23
denominator is redefined to be the total num- Precision is the proportion of relevant
ber of unique, relevant documents identified documents retrieved in the search:
by one or more searches on the query topic.
This measure answers the question, for a which documents are relevant for each topic
search, what fraction of the retrieved docu- in the task, allowing different researchers to
ments is relevant? compare their systems with others on the
One problem that arises when one is com- same task and improve them. The longest
paring systems that use ranking versus those running and best-known challenge evaluation
that do not is that nonranking systems, typi- in IR is the Text REtrieval Conference
cally using Boolean searching, tend to retrieve (TREC),62 which is organized by the
a fixed set of documents and as a result have U.S. National Institute for Standards and
fixed points of recall and precision. Systems Technology (NIST).63 Started in 1992, TREC
with relevance ranking, on the other hand, has provided a testbed for evaluation and a
have different values of recall and precision forum for presentation of results. TREC is
depending on the size of the retrieval set the organized as an annual event at which the
system (or the user) has chosen to show. The tasks are specified and queries and documents
problem has been addressed by the develop- are provided to participants. Participating
ment of aggregate measures that combine groups submit “runs” of their systems to
recall and precision and that account for NIST, which calculates the appropriate
ranking. One of the most common measures performance measure(s). TREC is organized
used is mean average precision (MAP), which into “tracks” geared to specific interests. A
measures precision at each point that a rele- book summarizing the first decade of TREC
vant document is retrieved, and then provides provides more information on this important
a mean of all the average precision values IR initiative that is still ongoing (EM Voorhees
(Buckley and Voorhees 2005). and Harman 2005).
Another challenge for IR evaluation While TREC has been mostly focused on
occurs with large collections, where every pos- retrieval of general information sources (e.g.,
sible item retrieved cannot be judged for rele- newswire, government documents, Web pages,
vance. If there is concern that there are large etc.), there have been a number of tracks over
numbers of unjudged documents, the B-Pref the years devoted to biomedical IR. These
measure can be used, which only makes use of tracks tended to reflect areas of biomedicine
unjudged documents in its calculations that were emerging importance. The first
(Buckley and Voorhees 2004). An additional TREC track specific to the biomedical domain
measure increasingly used in normalized dis- was the Genomics Track, due to the emer-
tributed cumulative gain (NDCG), which gence at the time of the sequencing of human
allows differential value of retrieved docu- genome and the rise of the area of bioinfor-
ments, e.g., a value of 2 for highly relevant matics. A variety of literature retrieval tasks
and 1 for partially relevant documents were developing, focused on journal article
(Jarvelin and Kekalainen 2002). abstracts (from MEDLINE records) or full
A good deal of evaluation in IR is done text (Hersh and Bhupatiraju 2003; Hersh
via challenge evaluations, in which a common
IR task is defined and a test collection of doc-
uments, topics, and relevance judgments are 62 7 https://trec.nist.gov/
developed. The relevance judgments define 63 7 https://www.nist.gov/
782 W. Hersh
et al. 2004, 2005, 2006, 2007; Hersh and comes at a cost of low precision, so a question
23 Voorhees 2009; Roberts et al. 2009). is how to reduce the work of systematic
A second track from the biomedical reviews by increasing precision while mini-
domain aimed to leverage the growing interest mally impacting recall. Early work in this area
in processing medical records text around the was carried out by Cohen et al. (2009, 2015),
onset of the HITECH Act. The Medical demonstrating value for machine learning
Records Track used a collection of de- approaches. A systematic review task was
identified patient records for a task aiming to added to CLEF eHealth in 2017 (Kanoulas
retrieve patients who might be candidates for et al. 2017, 2018, 2019).
clinical studies (Voorhees 2013; Voorhees and A number of researchers have criticized or
Hersh 2012; Voorhees and Tong 2011). noted the limitations of relevance-based mea-
The next biomedical domain track was the sures. While no one denies that users want sys-
Clinical Decision Support (CDS) Track, tems to retrieve relevant articles, it is not clear
which aimed to retrieve full-text journal arti- that the quantity of relevant documents
cles from a snapshot of PubMed Central to retrieved is the complete measure of how well
identify knowledge relevant to diagnosis, test- a system performs (Harter 1992; Swanson
ing, or treatment (Roberts et al. 2015, 2016a, 1988). Hersh (1994) noted that clinical users
b; Simpson et al. 2014). The CDS Track was are unlikely to be concerned about these mea-
refined into the Precision Medicine Track, sures when they simply seek an answer to a
which aimed to retrieve information relevance clinical question and are able to do so no mat-
to the precision medicine paradigm (Roberts ter how many other relevant documents they
et al. 2017). miss (lowering recall) or how many nonrele-
Another annual challenge evaluation, vant ones they retrieve (lowering precision).
based in Europe, has been the Conference and This has led to more focus on user-oriented
Labs of the Evaluation Forum (CLEF, origi- evaluation.
nally known as the Cross-Language
Evaluation Forum). Since 2013, one focus of
CLEF has been eHealth, with tasks focused 23.7.2 User-Oriented Evaluation
not only in IR, but also information extrac-
tion and information management.64 The What alternatives to relevance-based mea-
track has had a patient-centered retrieval task sures can be used for determining perfor-
since 2013, a cross-language retrieval task mance of individual searches? Some
since 2014, and a systematic review task start- alternatives have focused on users being able
ing in 2017. An additional challenge evalua- to perform various information tasks with IR
tion emanating from CLEF and focused on systems, such as finding answers to questions
image retrieval, which has included a medical (Egan et al. 1989; Hersh and Hickam 1995;
image retrieval component, has been Hersh et al. 1996; Mynatt et al. 1992;
ImageCLEF.65 Some recent overviews of the Wildemuth et al. 1995). For several years,
state of the art of image retrieval have been TREC featured an Interactive Track that had
published (Li et al. 2018; Müller and Unay participants carry out user experiments with
2017). the same documents and queries (Hersh
Some system-oriented studies have focused 2001). A number of user-oriented evaluations
on specific use cases for IR systems. One area have been performed over the years looking at
gaining a good deal of attention has been users of biomedical information.
reducing the workload of performing system- When end-user retrieval systems first
atic reviews, which require high recall to appeared, a number of studies appeared aim-
retrieval all possibly relevant studies. This ing to measure search performance by clini-
cians. One of the original studies compared
the capabilities of librarian and clinician
64 7 https://sites.google.com/site/clefehealth/ searchers (Haynes et al. 1990). In this study,
65 7 https://www.imageclef.org 78 searches were randomly chosen for replica-
Information Retrieval
783 23
tion by both a clinician experienced in search- Another study compared Boolean and
ing and a medical librarian. The results natural language searching of MEDLINE
showed that the experienced clinicians and with two commercial products, CD Plus (now
librarians achieved comparable recall in the Ovid) and Knowledge Finder representing
range of 50%, although the librarians had Boolean and natural language searching
better precision. The novice clinician search- respectively (Hersh et al. 1996). Sixteen medi-
ers had lower recall and precision than either cal students were recruited and randomized
of the other groups. This study also assessed to one of the two systems and given three yes/
user satisfaction of the novice searchers, who no clinical questions to answer. The students
despite their recall and precision results said were able to use each system successfully,
that they were satisfied with their search out- answering 37.5% correctly before searching
comes. A follow-up study noted that different and 85.4% correctly after searching. There
searchers tended to use different strategies on were no significant differences between the
a given topic. The different approaches repli- systems in time taken, relevant articles
cated a finding known from other searching retrieved, or user satisfaction. This study
studies in the past, namely, the lack of overlap demonstrated that both types of systems
across searchers of overall retrieved citations could be used equally well with minimal
as well as relevant ones. Thus, even though the training.
novice searchers had lower recall, they did A more comprehensive study looked at
obtain a great many relevant citations not MEDLINE searching by medical and nurse
retrieved by the two expert searchers. practitioner (NP) students to answer clinical
Furthermore, fewer than 4% of all the rele- questions. A total of 66 medical and NP stu-
vant citations were retrieved by all three dents searched five questions each (Hersh
searchers. et al. 2002). This study used a multiple-choice
Recognizing the limitations of recall and format for answering questions that also
precision for evaluating clinical users of IR included a judgment about the evidence for
systems, subsequent studies assessed the abil- the answer. Subjects were asked to choose
ity of systems to help students and clinicians from one of three answers:
answer clinical questions. The rationale for 55 Yes, with adequate evidence.
these studies is that the usual goal of using an 55 Insufficient evidence to answer question.
IR system is to find an answer to a question. 55 No, with adequate evidence.
While the user must obviously find relevant
documents to answer that question, the quan- Both groups achieved a presearching correct-
tity of such documents is less important than ness on questions about equal to chance
whether the question is successfully answered. (32.3% for medical students and 31.7% for NP
In fact, recall and precision can be placed students). However, medical students
among the many factors that may be improved their correctness with searching (to
associated with ability to complete the task 51.6%), whereas NP students hardly did at all
successfully. (to 34.7%).
The first study using this task-oriented This study also attempted to measure what
approach compared Boolean versus natural factors might influence searching. A multi-
language searching in an online medical text- tude of factors, such as age, gender, computer
book (Hersh and Hickam 1995). There was experience, and time taken to search, were not
no difference in ability to answer questions associated with successful answering of ques-
with one interface or the other. Most answers tions. Successful answering was, however,
were found on the first search to the textbook. associated with answering the question cor-
For the questions that were incorrectly rectly before searching, spatial visualization
answered, the document with the correct ability (measured by a validated instrument),
answer was actually retrieved by the user two- searching experience, and EBM question type
thirds of the time and viewed more than half (prognosis questions easiest, harm questions
the time. most difficult). An analysis of recall and pre-
784 W. Hersh
cision for each question searched demon- answers (58–62% correct) or time taken (136–
23 strated a complete lack of association with 139 seconds). While those starting in the sum-
ability to answer these questions. mary resource mostly found answers in
Two studies extended this approach in dif- resources that were part of the summary sys-
ferent ways. Westbook et al. assessed use of an tem 93% of the time, those starting with
online evidence systems and found that physi- Google found answers in commercial medical
cians answered 37% of questions correctly portals (25.7%), hospital Web sites (12.6%),
before use of the system and 50% afterwards, Wikipedia (12.0%), US government Web sites
while nurse specialists answered 18% of ques- (9.4%), PubMed (9.4%), evidence-based sum-
tions correctly and also 50% afterwards mary resources (9.4%), and others (18%).
(Westbrook et al. 2005). Those who had cor- Another study looked at medical students’
rect answers before searching had higher con- short-term knowledge when randomized to
fidence in their answers, but those not initially answer questions in Wikipedia, UpToDate,
knowing the answer had no difference in con- and a digital textbook, finding the best short-
fidence whether their answer turned out to be term knowledge acquisition with Wikipedia
right or wrong. McKibbon and Fridsma per- (Scaffidi et al. 2017).
formed a comparable study of allowing physi- Koopman et al. assessed factors compris-
cians to seek answers to questions with ing effective queries and those making them
resources they normally use (McKibbon and (Koopman et al. 2017). They found that query
Fridsma 2006) employing the same questions formulation had more impact on retrieval
as Hersh et al. (2002). This study found no dif- effectiveness than the particular retrieval sys-
ference in answer correctness before or after tems used. The most effective queries were
using the search system. short, ad-hoc keyword queries and queriers
Pluye et al. (Pluye and Grad 2004) per- who inferred novel keywords most likely to
formed a qualitative study assessing impact of appear in relevant documents.
IR systems on physician practice. The study Other users of IR systems have been stud-
identified 4 themes mentioned by physicians: ied beyond clinicians. One study used the
55 Recall—of forgotten knowledge TREC Genomics Track 2004 collection to
55 Learning—new knowledge assess the value of MeSH terms for different
55 Confirmation—of existing knowledge types of searchers (Liu and Wacholder 2017).
55 Frustration—that system use not successful The researchers recruited four types of
searchers:
The researchers also noted two additional 55 Search Novice (SN) – undergraduates with
themes: no formal search training or advanced
55 Reassurance—that system is available knowledge in biomedicine
55 Practice improvement—of patient-55 Domain Expert (DE) – biomedical gradu-
physician relationship ate students
55 Search Expert (SE) – library and informa-
More recent studies have focused on searchers tion science graduate students
using well-known modern IR systems. Kim 55 Medical Librarian (ML)
et al. looked at the ability of internal medicine
interns to answer questions starting from The searchers used a digital library system to
Google versus an evidence- based summary search on 20 topics from the original test col-
resource developed by a local medical library lection. Searchers assigned to search with
(Kim et al. 2014). Ten questions were given to MeSH were provided access to a MeSH
each subject, with each participant random- browser. As with other studies, recall (0.15–
ized to start in either Google or the summary 0.23) and precision (0.29–0.40) were relatively
resource for half of questions. Answers were close across different groups. MeSH terms
found for 82% of the questions administered, had little impact upon recall in the four
with no difference between groups in correct groups, but they were found to substantially
Information Retrieval
785 23
increase precision in search novices (SN and and lay people (Palotti et al. 2016). This study
DE) and decrease it in search experts (SE and found that medical experts were more
ML) (recall and precision with MeSH; with- persistent in their interaction with the search
out MeSH). User characteristics that engine. They also noted that the main focus of
improved precision were number of under- users, both laypeople and professionals, was
graduate and graduate biology courses for SN on disease rather than symptoms.
and DE respectively. User characteristics
associated with improved recall included hav-
ing had online search courses and MeSH use 23.8 Research Directions
experience. Other factors having no associa-
tion with search results included gender, The above evaluation research shows that
native language, age, or experience or fre- there is still plenty of room for IR systems to
quency with database searching. improve their abilities. In addition, there will
Another group of searchers that have been be new challenges that arise from growing
studied are consumers. A study from Mayo amounts of information, new devices, and
Clinic analyzed search queries submitted other new technologies.
through general search engines but leading There are also other areas related to IR
users into a consumer health information por- where research is ongoing in the larger quest
tal from computers and mobile devices to help all involved in biomedicine and
(Jadhav et al. 2014). The most common types health—including patients, clinicians and
of searches were on symptoms (32–39%), researchers—to better apply knowledge to
causes of disease (19–20%), and treatments improve health. . Figure 23.7 shows this
and drugs (14–16%). Health queries tended to author’s “funnel” by which the user searches
be longer and more specific than general (non- all of the scientific literature using IR systems
health) queries. Health queries were somewhat to obtain a set of possibly relevant literature.
more likely to come from mobile devices. Most In the current state of the art, he/she reviews
searches used key words, although some were this literature by hand, selecting which articles
also phrased as questions (wh- or yes/no). are definitely relevant and may become
An additional study aimed to assess differ- “actionable knowledge” that can be acted
ences in searching between medical experts upon to make better decisions.
Our ability to carry out the activities in the
upper part of the funnel, i.e., IR, is much bet-
All literature ter than those in the lower part. These areas
include:
Possibly relevant Information
retrieval 55 Information extraction and text mining—
literature
usually through the use of natural lan-
Definitely relevant guage processing (NLP, see 7 Chap. 8) to
literature Information extract facts and knowledge from text
extraction, (K. Cohen and Demner-Fushman 2014).
Structured
text mining These techniques are often employed to
Knowledge
extract information from the EHR, with a
wide variety of accuracy as shown in a sys-
.. Fig. 23.7 Funnel of knowledge discovery, showing
tematic review (Stanfill et al. 2010).
how an information need starts with a search (informa-
tion retrieval) leading to a large possibly relevant set of 55 Summarization—providing automated
literature that is winnowed down to a smaller definitely extracts or abstracts summarizing the con-
relevant set (usually by human inspection but with tech- tent of longer documents (Fiszman et al.
niques like information extraction and text mining pos- 2004; Mani 2001). In recent years, these
sibly automating the process in the future). Ultimately
methods have been applied to text and
actionable knowledge is obtained that can be applied by
a human or fashioned into, for example, rules for a com- other data in the EHR (Pivovarov and
puter-based decision support system Elhadad 2015).
786 W. Hersh
55 Question-answering—going beyond the most part, quality control can be taken for
23 retrieval of documents to providing actual granted. Until recently, most published litera-
answers to questions, as exemplified by ture came from commercial publishers and
IBM’s Watson system (Ferrucci et al. specialty societies that had processes such as
2010). Watson has been evaluated for peer review, which, although imperfect,
answering questions on medical board allowed the library to devote minimal
exams (Ferrucci et al. 2012) as well as resources to assessing their quality. While
making cancer treatment recommenda- libraries can still cede the judgment of quality
tions (Somashekhar et al. 2018). to these information providers in the Internet
era, they cannot ignore the myriad of infor-
mation published only on the Internet, for
23.9 Digital Libraries which the quality cannot be presumed.
Other functions of libraries besides main-
Discussion of IR “systems” thus far has taining collections include cataloging and
focused on the provision of retrieval mecha- classification of items in those collections,
nisms to access online content. Even with the being a place (even virtual) where individuals
expansive coverage of some IR systems, such could go to get assistance with information
as Web search engines, they are often part of a seeking, and providing space for work or
larger collection of services or activities. An study, particularly in universities.
alternative perspective, especially when com- The paper-based nature of traditional
munities and/or proprietary collections are libraries carried a number of assumptions
involved, is the digital library. Digital librar- that are challenged in the digital era. For
ies share many characteristics with “brick and example, items were produced in multiple
mortar” libraries, but also take on some addi- copies, freeing the individual library from
tional challenges. Borgman (1999) noted that excessive worry that an item could not be
libraries of both types elicited different defini- replaced. In addition, items were fairly static,
tions of what they actually are, with research- simplifying their cataloging. With digital
ers tending to view libraries as content libraries, this status quo is challenged. There
collected for specific communities and librari- is a great deal of concern about archiving of
ans alternatively viewing them as institutions content and managing its change when fewer
or services. Lindberg and Humphreys (2005) “copies” of it exist on the file servers of pub-
laid out a vision in 2005 for libraries 10 years lishers and other organizations. A related
hence, noting that while collections would be problem for digital libraries is that they do not
virtual and accessed in many diverse ways, own the “artifact” of the paper journal, book,
other elements of science would stay intact, or other item. This is exacerbated by the fact
including journals and the peer review p rocess. that when a subscription to an electronic jour-
This section provides an overview of key nal is terminated, access to the entire journal
issues of digital libraries, with an orientation is lost; that is, the subscriber does not retain
toward biomedical libraries. accumulated back issues, as was taken for
granted with paper journals.
66 7 http://www.doi.org/ 67 7 http://www.wipo.int/portal/en/index.html
788 W. Hersh
ity for magnetic materials was the least, with 23.10 Future Directions for IR
23 the expected lifetime of magnetic tape being Systems and Digital
5 to 10 years. Optical storage has somewhat
Libraries
better longevity, with an expected lifetime of
30 to 100 years depending on the specific
There is no doubt that considerable progress
type. Ironically, paper has a life expectancy
has been made in IR and digital libraries.
well beyond all these digital media.
Seeking online information is now done rou-
Rothenberg noted that the Rosetta Stone,
tinely not only by clinicians and researchers,
which provided help in interpreting ancient
but also by patients and consumers. There are
Egyptian hieroglyphics and has survived
still considerable challenges to make this
over 20 centuries (Rothenberg 1999). He reit-
activity more fruitful to users. They include:
erated Lesk’s description of the reduced life-
55 How do we lower the effort it takes for cli-
time of digital media in comparison with
nicians to get to the information they need
traditional media, and to note another prob-
rapidly in the busy clinical setting?
lem familiar to most long-time users of com-
55 How can researchers extract new knowl-
puters, namely, data can become obsolete
edge from the vast quantity that is avail-
not only owing to the medium, but also as a
able to them?
result of data format. Both authors noted
55 How can consumers and patients find
that storage devices as well as computer
high-quality information that is appropri-
applications, such as word processors, have
ate to their understanding of health and
seen their formats change significantly over
disease?
the last couple of decades.
55 Can the value added by the publishing
The US Library of Congress has devoted
process be protected and remunerated
considerable effort to digital preservation,
while making information more available?
documenting its efforts on the Web site.68 An
55 How can the indexing process become
early digital preservation effort in the US was
more accurate and efficient?
National Digital Information Infrastructure
55 Can retrieval interfaces be made simpler
Preservation Program (NDIIPP) of the
without giving up flexibility and power?
Library of Congress, which has now become
55 Can we develop standards for digital
the National Digital Stewardship Alliance
libraries that will facilitate interoperability
(NDSA)69 and is housed by the Digital
but maintain ease of use, protection of
Library Federation (DLF), at the Council on
intellectual property, and long-term pres-
Library and Information Resources (CLIR).
ervation of the archive of science?
Other digital preservation efforts include
Portico,70 a collaboration of publishers,
libraries, and government agencies to preserve nnSuggested Readings
electronic scholarly content and LOCKSS Baeza-Yates, R., & Ribeiro-Neto, B. (2011).
(Lots of Copies Keep Stuff Safe),71 which Modern information retrieval: The concepts
provides libraries with digital preservation and technology behind search (2nd ed.).
tools and support. Reading: Addison-Wesley. A book surveying
most of the automated approaches to infor-
mation retrieval.
Croft, W., Metzler, D., & Strohman, T. (2009).
Search engines: Information retrieval in prac-
tice. Boston: Addison-Wesley. A book survey-
ing most of the automated approaches to
68 7 http://www.digitalpreservation.gov/
search engines.
69 7 https://ndsa.org
70 7 https://www.portico.org/ Hersh, W. (2020). Information retrieval: A bio-
71 7 https://www.lockss.org/ medical and health perspective (4th ed.).
Information Retrieval
789 23
New York: Springer. A textbook on informa- 8. Describe how you might devise a
tion retrieval systems in the health and bio- system that achieves a happy medium
medical domain that covers state-of-the-art as between protection of intellectual
well as research systems. property and barrier-free access to the
Miles, W. (1982). A history of the national library archive of science.
of medicine: The nation’s treasury of medical
knowledge. Bethesda: U.S. Department of
Health and Human Services. A comprehen- References
sive history of the National Library of
Medicine and its forerunners, covering the Akerkar, R. (2009). Foundations of the Semantic Web:
story of Dr. John Shaw Billings and his found- XML, RDF & Ontology. Oxford: Alpha Science
International Ltd.
ing of Index Medicus to the modern imple-
Anonymous. (2012). From screen to script: The doctor’s
mentation of MEDLINE. digital path to treatment. Retrieved from New York:
National Academies of Sciences, Engineering, https://www.thinkwithgoogle.com/research-studies/
and Medicine; Policy and Global Affairs; the-doctors-digital-path-to-treatment.html.
Board on Research Data and Information; Anonymous. (2016). Toward fairness in data sharing.
New England Journal of Medicine, 375, 405–407.
Committee on Toward an Open Science
Anonymous. (2018a). Database resources of the
Enterprise. (2018). Open science by design – National Center for Biotechnology Information.
realizing a vision for 21st century research. Nucleic Acids Research, 46(D1), D8–D13. Retrieved
Washington, DC: National Academies Press. from https://academic.oup.com/nar/article/46/D1/
A vision for open science from the National D8/4621330.
Anonymous. (2018b). Open science by design – realizing
Academies of Medicine, Science, and
a vision for 21st century research. Washington, DC:
Engineering. National Academies Press.
Bartlett, J., & Toms, E. (2005). Developing a protocol
??Questions for Discussion for bioinformatics analysis: An integrated informa-
1. With the advent of full-text searching, tion and behavior task analysis approach. Journal of
should the National Library of the American Society for Information Science and
Technology, 56, 469–482.
Medicine abandon human indexing of Berners-Lee, T., Cailliau, R., Luotonen, A., Nielsen, H.,
citations in MEDLINE? Why or why & Secret, A. (1994). The World-Wide Web.
not? Communications of the ACM, 37, 76–82.
2. Explain why you think open-access pub- Besser, H. (2002). The next stage: Moving from isolated
lishing will succeed or not. digital collections to interoperable digital libraries.
First Monday, 7, 6. Retrieved from http://www.
3. How would you aggregate the clinical firstmonday.dk/issues/issue7_6/besser/.
evidence-based resources described in Bodenreider, O. (2004). The Unified Medical Language
the chapter into the best digital library System (UMLS): Integrating biomedical terminol-
for clinicians? ogy. Nucleic Acids Research, 32, D267–D270.
4. Devise a curriculum for teaching clini- Borgman, C. (1999). What are digital libraries?
Competing visions. Information Processing and
cians and patients the most important Management, 35, 227–244.
points about searching for health-related Boynton, J., Glanville, J., McDaid, D., & Lefebvre, C.
information. (1998). Identifying systematic reviews in
5. Find a consumer-oriented Web page and MEDLINE: Developing an objective approach to
determine the quality of the information search strategy design. Journal of Information
Science, 24, 137–157.
on it. Brin, S., & Page, L. (1998). The anatomy of a large-
6. What are the limitations of recall and scale hypertextual Web search engine. Computer
precision as evaluation measures and Networks and ISDN Systems, 30, 107–117.
what alternatives would improve upon Retrieved from http://infolab.stanford.edu/pub/
them? papers/google.pdf.
Buckley, C., & Voorhees, E. (2004). Retrieval evaluation
7. Select a concept that appears in two or with incomplete information. Paper presented at the
more clinical terminologies and demon- proceedings of the 27th annual international ACM
strate how it would be combined into a SIGIR conference on research and development in
record in the UMLS Metathesaurus. information retrieval, Sheffield, England.
790 W. Hersh
Buckley, C., & Voorhees, E. (2005). Retrieval system evidence: Fine-tuning the 5S model into a 6S
23 evaluation. In E. Voorhees & D. Harman (Eds.),
TREC: Experiment and evaluation in information
model. Annals of Internal Medicine, 151(6), JC3–
JC2, JC3-3.
retrieval (pp. 53–75). Cambridge, MA: MIT Press. Drazen, J., & Curfman, G. (2004). Public access to bio-
Castillo, C., & Davison, B. (2011). Adversarial Web medical research. New England Journal of Medicine,
search. Delft: Now Publishers. 351, 1343.
Charen, T. (1976). MEDLARS indexing manual, part I: Duran-Nelson, A., Gladding, S., Beattie, J., & Nixon,
Bibliographic principles and descriptive indexing, L. (2013). Should we Google it? resource use by
1977. Springfield: National Technical Information internal medicine residents for point-of-care clini-
Service. cal decision making. Academic Medicine, 88, 788–
Charen, T. (1983). MEDLARS indexing manual, part II. 794.
Springfield: National Technical Information Egan, D., Remde, J., Gomez, L., Landauer, T.,
Service. Eberhardt, J., & Lochbaum, C. (1989). Formative
Cimino, J., & delFiol, G. (2007). Infobuttons and point design-evaluation of Superbook. ACM Transactions
of care access to knowledge. In R. Greenes (Ed.), on Information Systems, 7, 30–57.
Clinical decision support: The road ahead (pp. 345– Ely, J., Osheroff, J., Ebell, M., Bergus, G., Levy, B.,
371). Amsterdam: Elsevier. Chambliss, M., & Evans, E. (1999). Analysis of
Cohen, K., & Demner-Fushman, D. (2014). Biomedical questions asked by family doctors regarding patient
natural language processing. Amsterdam: John care. British Medical Journal, 319, 358–361.
Benjamins Publishing. Ely, J., Osheroff, J., Ebell, M., Chambliss, M., Vinson,
Cohen, A., Ambert, K., & McDonagh, M. (2009). Cross- D., Stevermer, J., & Pifer, E. (2002). Obstacles to
topic learning for work prioritization in systematic answering doctors’ questions about patient care
review creation and update. Journal of the American with evidence: Qualitative study. British Medical
Medical Informatics Association, 16, 690–704. Journal, 324, 710–713.
Cohen, A., Smalheiser, N., McDonagh, M., Yu, C., Eysenbach, G., Su, E., & Diepgen, T. (1999). Shopping
Adams, C., Davis, J., & Yu, P. (2015). Automated around the internet today and tomorrow: Towards
confidence ranked classification of randomized con- the millennium of cybermedicine. British Medical
trolled trial articles: An aid to evidence-based medi- Journal, 319, 1294–1298.
cine. Journal of the American Medical Informatics Eysenbach, G., Powell, J., Kuss, O., & Sa, E.-R. (2002).
Association, 22, 707–717. Empirical studies assessing the quality of health
Coletti, M., & Bleich, H. (2001). Medical subject head- information for consumers on the world wide web:
ings used to search the biomedical literature. Journal A systematic review. Journal of the American
of the American Medical Informatics Association, 8, Medical Association, 287, 2691–2700.
317–323. Ferguson, T. (2002). From patients to end users: Quality
Covell, D., Uman, G., & Manning, P. (1985). Information of online patient networks needs more attention
needs in office practice: Are they being met? Annals than quality of online health information. British
of Internal Medicine, 103, 596–599. Medical Journal, 324, 555–556.
Darmoni, S., Leroy, J., Baudic, F., Douyere, M., Piot, J., Ferrucci, D., Brown, E., Chu-Carroll, J., Fan, J.,
& Thirion, B. (2000). CISMeF: A structured health Gondek, D., Kalyanpur, A., et al. (2010). Building
resource guide. Methods of Information in Medicine, Watson: An overview of the DeepQA Project. AI
9, 30–35. Magazine, 31(3), 59–79. Retrieved from http://www.
DeAngelis, C., Drazen, J., Frizelle, F., Haug, C., Hoey, J., a a a i .o rg / o j s / i n d ex .p h p / a i m ag a z i n e / a r t i c l e /
Horton, R., et al. (2005). Is this clinical trial fully reg- view/2303.
istered? A statement from the International Ferrucci, D., Levas, A., Bagchi, S., Gondek, D., &
Committee of Medical Journal Editors. Journal of Mueller, E. (2012). Watson: Beyond Jeopardy!
the American Medical Association, 293, 2927–2929. Artificial Intelligence, 199-200, 93–105.
DeBakey, M. (1991). The National Library of Medicine: Fiksdal, A., Kumbamu, A., Jadhav, A., Cocos, C.,
Evolution of a premier information center. Journal Nelsen, L., Pathak, J., & McCormick, J. (2014).
of the American Medical Association, 266, 1252– Evaluating the process of online health information
1258. searching: A qualitative approach to exploring con-
Demner-Fushman, D., Antani, S., Simpson, M., & sumer perspectives. Journal of Medical Internet
Thoma, G. (2012). Design and development of a Research, 16(10), e224. Retrieved from http://www.
multimodal biomedical information retrieval sys- jmir.org/2014/10/e224/.
tem. Journal of Computing Science and Engineering, Fiorini, N., Canese, K., Starchenko, G., Kireev, E., Kim,
6, 168–177. W., Miller, V., et al. (2018). Best Match: New rele-
Diamond, G., & Kaul, S. (2008). The disconnect vance search for PubMed. PLoS Biology, 16(8),
between practice guidelines and clinical practice – e2005343. Retrieved from https://journals.plos.org/
stressed out. Journal of the American Medical p l o s b i o l o g y / a r t i c l e ? i d = 1 0 .1 3 7 1 / j o u r n a l .
Association, 300, 1817–1819. pbio.2005343.
DiCenso, A., Bayley, L., & Haynes, R. (2009). ACP Fiszman, M., Rindflesch, T., & Kilicoglu, H. (2004).
Journal Club. Editorial: Accessing preappraised Summarization of an online medical encyclopedia.
Information Retrieval
791 23
Paper presented at the MEDINFO 2004 – proceed- Haynes, R., Wilczynski, N., McKibbon, K., Walker, C.,
ings of the Eleventh World Congress on Medical & Sinclair, J. (1994). Developing optimal search
Informatics, San Francisco, CA. strategies for detecting clinically sound studies in
Foulonneau, M., & Riley, J. (2008). Metadata for digital MEDLINE. Journal of the American Medical
resources: Implementation, systems design and Informatics Association, 1, 447–458.
interoperability. New York: Neal-Schuman Haynes, R., Devereaux, P., & Guyatt, G. (2002). Clinical
Publishers. expertise in the era of evidence-based medicine and
Fox, C. (1992). Lexical analysis and stop lists. In patient choice. ACP Journal Club, 136(2), A11–A14.
W. Frakes & R. Baeza-Yates (Eds.), Information Hersh, W. (1994). Relevance and retrieval evaluation:
retrieval: Data structures and algorithms (pp. 102– Perspectives from medicine. Journal of the American
130). Englewood Cliffs: Prentice-Hall. Society for Information Science, 45, 201–206.
Fox, S. (2011). Health topics. Retrieved from Hersh, W. (1999). “A world of knowledge at your finger-
Washington, DC: http://www.pewinternet. tips”: The promise, reality, and future directions of
org/2011/02/01/health-topics-2/. on-line information retrieval. Academic Medicine,
Fox, S., & Duggan, M. (2013). Health online 2013. 74, 240–243.
Retrieved from Washington, DC: https://www. Hersh, W. (2001). Interactivity at the Text Retrieval
pewinternet.org/2013/01/15/health-online-2013/. Conference (TREC). Information Processing and
Frakes, W. (1992). Stemming algorithms. In W. Frankes Management, 37, 365–366.
& R. Baeza-Yates (Eds.), Information retrieval: Data Hersh, W. (2020). Information retrieval: A biomedical
structures and algorithms (pp. 131–160). Englewood and health perspective (4th ed.). New York: Springer.
Cliffs: Prentice-Hall. Hersh, W., & Bhupatiraju, R. (2003). TREC genomics
Frank, M. (2013). Open but not free – publishing in the track overview. Paper presented at the The Twelfth
21st century. New England Journal of Medicine, 368, Text Retrieval Conference (TREC 2003),
787–789. Gaithersburg, MD.
Funk, M., & Reid, C. (1983). Indexing consistency in Hersh, W., & Hickam, D. (1995). An evaluation of
MEDLINE. Bulletin of the Medical Library interactive Boolean and natural language search-
Association, 71, 176–183. ing with an on-line medical textbook. Journal of
Giles, J. (2005). Internet encyclopaedias go head to the American Society for Information Science, 46,
head. Nature, 438, 900–901. Retrieved from http:// 478–489.
www.n ature.c om/nature/journal/v438/n7070/ Hersh, W., & Hickam, D. (1998). How well do physi-
full/438900a.html. cians use electronic information retrieval systems? A
Gorman, P., & Helfand, M. (1995). Information seeking framework for investigation and review of the litera-
in primary care: How physicians choose which clini- ture. Journal of the American Medical Association,
cal questions to pursue and which to leave unan- 280, 1347–1352.
swered. Medical Decision Making, 15, 113–119. Hersh, W., & Rindfleisch, T. (2000). Electronic publish-
Greenhalgh, T., Howick, J., & Maskrey, N. (2014). ing of scholarly communication in the biomedical
Evidence based medicine: A movement in crisis? sciences. Journal of the American Medical
British Medical Journal, 348, g3725. Retrieved from Informatics Association, 7, 324–325.
http://www.bmj.com/content/348/bmj.g3725. Hersh, W., & Voorhees, E. (2009). TREC genomics spe-
Guyatt, G., Cook, D., & Haynes, R. (2004). Evidence cial issue overview. Information Retrieval, 12, 1–15.
based medicine has come a long way. British Medical Hersh, W., Pentecost, J., & Hickam, D. (1996). A task-
Journal, 329, 990–991. oriented approach to information retrieval evalua-
Guyatt, G., Rennie, D., Meade, M., & Cook, D. (Eds.). tion. Journal of the American Society for Information
(2014). Users’ guides to the medical literature: A Science, 47, 50–56.
manual for evidence-based clinical practice, 3E. Hersh, W., Crabtree, M., Hickam, D., Sacherek, L.,
New York: McGraw-Hill. Friedman, C., Tidmarsh, P., et al. (2002). Factors
Guyatt, G., Rennie, D., Meade, M., & Cook, D. (Eds.). associated with success for searching MEDLINE
(2015). Users’ guides to the medical literature: and applying evidence to answer clinical questions.
Essentials of evidence-based clinical practice (3rd Journal of the American Medical Informatics
ed.). New York: McGraw-Hill. Association, 9, 283–293.
Harter, S. (1992). Psychological relevance and informa- Hersh, W., Bhuptiraju, R., Ross, L., Johnson, P., Cohen,
tion science. Journal of the American Society for A., & Kraemer, D. (2004). TREC 2004 genomics
Information Science, 43, 602–615. track overview. Paper presented at the The Thirteenth
Haug, C. (2013). The downside of open-access publish- Text Retrieval Conference (TREC 2004),
ing. New England Journal of Medicine, 368, 791– Gaithersburg, MD.
793. Hersh, W., Cohen, A., Yang, J., Bhupatiraju, R.,
Haynes, R., McKibbon, K., Walker, C., Ryan, N., Roberts, P., & Hearst, M. (2005). TREC 2005
Fitzgerald, D., & Ramsden, M. (1990). Online genomics track overview. Paper presented at the The
access to MEDLINE in clinical settings. Annals of Fourteenth Text Retrieval Conference – TREC
Internal Medicine, 112, 78–84. 2005, Gaithersburg, MD.
792 W. Hersh
Hersh, W., Cohen, A., Roberts, P., & Rekapalli, H. based summary resources: A randomized controlled
23 (2006). TREC 2006 genomics track overview. Paper
presented at the The Fifteenth Text Retrieval
crossover study. Academic Medicine, 89, 940–943.
Kitchens, B., Harle, C., & Li, S. (2014). Quality of
Conference (TREC 2006), Gaithersburg, MD. health-related online search results. Decision
Hersh, W., Cohen, A., Ruslen, L., & Roberts, P. (2007). Support Systems, 57, 454–462.
TREC 2007 genomics track overview. Paper pre- Koopman, B., Zuccon, G., & Bruza, P. (2017). What
sented at the The Sixteenth Text Retrieval makes an effective clinical query and querier?
Conference (TREC 2007) Proceedings, Journal of the American Society for Information
Gaithersburg, MD. Science and Technology, 68, 2557–2571.
Humphreys, B., Lindberg, D., Schoolman, H., & Laurent, M., & Vickers, T. (2009). Seeking health infor-
Barnett, G. (1998). The Unified Medical Language mation online: Does Wikipedia matter? Journal of
System: An informatics research collaboration. the American Medical Informatics Association, 16,
Journal of the American Medical Informatics 471–479.
Association, 5, 1–11. Lesk, M. (2005). Understanding digital libraries (2nd
Ioannidis, J. (2016a). Evidence-based medicine has been ed.). San Francisco: Morgan Kaufmann.
hijacked: A report to David Sackett. Journal of Li, H. (2011). Learning to rank for information retrieval
Clinical Epidemiology, 73, 82–86. and natural language processing. San Rafael:
Ioannidis, J. (2016b). The mass production of redun- Morgan & Claypool Publishers.
dant, misleading, and conflicted systematic reviews Li, Z., Zhang, X., Müller, H., & Zhang, S. (2018). Large-
and meta-analyses. Milbank Quarterly, 94, 485–514. scale retrieval for medical image analytics: A com-
Ioannidis, J. (2017). Hijacked evidence-based medicine: prehensive review. Medical Image Analysis, 43,
Stay the course and throw the pirates overboard. 66–84.
Journal of Clinical Epidemiology, 84, 11–13. Lindberg, D., & Humphreys, B. (2005). 2015 – the future
Jadad, A. (1999). Promoting partnerships: Challenges of medical libraries. New England Journal of
for the Internet age. British Medical Journal, 319, Medicine, 352, 1067–1070.
761–764. Liu, Y., & Wacholder, N. (2017). Evaluating the impact
Jadhav, A., Andrews, D., Fiksdal, A., Kumbamu, A., of MeSH (Medical Subject Headings) terms on dif-
McCormick, J., Misitano, A., et al. (2014). ferent types of searchers. Information Processing &
Comparative analysis of online health queries origi- Management, 53, 851–870.
nating from personal computers and smart devices Mani, I. (2001). Automatic summarization. Amsterdam:
on a consumer health information portal. Journal of John Benjamins.
Medical Internet Research, 16(7), e160. Retrieved Marcetich, J., Rappaport, M., & Kotzin, S. (2004).
from http://www.jmir.org/2014/7/e160/. Indexing consistency in MEDLINE. Paper presented
Jarvelin, K., & Kekalainen, J. (2002). Cumulated gain- at the MLA 04 Abstracts, Washington, DC.
based evaluation of IR techniques. ACM McCool, J. (2017, April 6). Opinion: Why I published in
Transactions on Information Systems, 20, 422–446. a predatory journal. The Scientist. Retrieved from
Kanoulas, E., Li, D., Azzopardi, L., & Spijker, R. http://www.t he-scientist.c om/?articles.v iew/arti-
(2017). CLEF 2017 technologically assisted reviews c l e N o / 4 9 0 7 1 / t i t l e / O p i n i o n % 2 D % 2 DWhy - I -
in empirical medicine overview. Paper presented at Published-in-a-Predatory-Journal/.
the Working Notes of CLEF 2017 – Conference McGlynn, E., Asch, S., Adams, J., Keesey, J., Hicks, J.,
and Labs of the Evaluation Forum, Dublin, DeCristofaro, A., & Kerr, E. (2003). The quality of
Ireland. health care delivered to adults in the United States.
Kanoulas, E., Li, D., Azzopardi, L., & Spijker, R. New England Journal of Medicine, 348, 2635–2645.
(2018). CLEF 2018 technologically assisted reviews McKibbon, K., & Fridsma, D. (2006). Effectiveness of
in empirical medicine overview. Paper presented at clinician-selected electronic information resources
the Working Notes of CLEF 2018 – Conference for answering primary care physicians’ information
and Labs of the Evaluation Forum, Avignon, needs. Journal of the American Medical Informatics
France. Association, 13, 653–659.
Kanoulas, E., Li, D., Azzopardi, L., & Spijker, R. Merson, L., Gaye, O., & Guerin, P. (2016). Avoiding
(2019). CLEF 2019 technology assisted reviews in data dumpsters – toward equitable and useful data
empirical medicine overview. Paper presented at the sharing. New England Journal of Medicine, 374,
Working Notes of CLEF 2019 – Conference and 2414–2415.
Labs of the Evaluation Forum, Lugano, Miles, W. (1982). A history of the national library of med-
Switzerland. icine: The nation’s treasury of medical knowledge.
Kavanagh, J., & Rich, M. (2018). Truth decay – an initial Bethesda: U.S. Department of Health and Human
exploration of the diminishing role of facts and analy- Services.
sis in American public life. Santa Monica: RAND Moher, D., & Moher, E. (2016). Stop predatory publish-
Corporation. ers now: Act collaboratively. Annals of Internal
Kim, S., Noveck, H., Galt, J., Hogshire, L., Willett, L., Medicine, 164, 616–617.
& O’Rourke, K. (2014). Searching for answers to Mork, J., Aronson, A., & Demner-Fushman, D. (2017).
clinical questions using Google versus evidence- 12 years on – is the NLM medical text indexer still
Information Retrieval
793 23
useful and relevant? Journal of Biomedical Roberts, K., Demner-Fushman, D., Voorhees, E., &
Semantics, 2017(8), 8. Retrieved from https:// Hersh, W. (2016a). Overview of the TREC 2016 clin-
jbiomedsem.b iomedcentral.c om/articles/10.1 186/ ical decision support track. Paper presented at the
s13326-017-0113-5. The Twenty-Fifth Text REtrieval Conference
Müller, H., & Unay, D. (2017). Retrieval from and (TREC 2016) Proceedings, Gaithersburg, MD.
understanding of large-scale multi-modal medical Roberts, K., Simpson, M., Demner-Fushman, D.,
datasets: A review. IEEE Transactions on Voorhees, E., & Hersh, W. (2016b). State-of-the-art
Multimedia, 19(9), 17099710. Retrieved from in biomedical literature retrieval for clinical cases: A
https://ieeexplore.ieee.org/document/7984864/. survey of the TREC 2014 CDS track. Information
Munn, Z., & Qaseem, A. (2018). Disappearance of the Retrieval Journal, 19, 113–148.
National Guideline Clearinghouse: A huge loss for Roberts, K., Demner-Fushman, D., Voorhees, E.,
evidence-based health care. Annals of Internal Hersh, W., & Bedrick, S. (2017). Overview of the
Medicine, 169, 648–649. TREC 2017 precision medicine track. Paper
Mynatt, B., Leventhal, L., Instone, K., Farhat, J., & presented at the The Twenty-Sixth Text REtrieval
Rohlman, D. (1992). Hypertext or book: Which is Conference (TREC 2017) Proceedings,
better for answering questions? Paper presented at Gaithersburg, MD.
the Proceedings of Computer-Human Interface 92. Robertson, S., & Walker, S. (1994). Some simple effective
Nicholson, D. (2006). An evaluation of the quality of approximations to the 2-Poisson model for probabilis-
consumer health information on Wikipedia tic weighted retrieval. Paper presented at the
(Capstone). Oregon Health & Science University, Proceedings of the 17th Annual International ACM
Portland, OR. Retrieved from http://www.ohsu.edu/ SIGIR Conference on Research and Development
dmice/people/students/theses/2006/upload/ in Information Retrieval, Dublin, Ireland.
Nicholson_CapstoneFinal06.pdf. Rothenberg, J. (1999). Ensuring the longevity of digital
Ohno-Machado, L., Sansone, S., Alter, G., Fore, I., information. Retrieved from http://www.clir.org/
Grethe, J., Xu, H., et al. (2017). Finding useful data pubs/archives/ensuring.pdf.
across multiple biomedical data repositories using Salton, G. (1991). Developments in automatic text
DataMed. Nature Genetics, 49, 816–819. retrieval. Science, 253, 974–980.
Palotti, J., Hanbury, A., Muller, H., & Kahn, C. (2016). Salton, G., & McGill, M. (1983). Introduction to modern
How users search and what they search for in the information retrieval. New York: McGraw-Hill.
medical domain – understanding laypeople and Salton, G., Fox, E., & Wu, H. (1983). Extended Boolean
experts through query logs. Information Retrieval information retrieval. Communications of the ACM,
Journal, 19, 189–224. 26, 1022–1036.
Paskin, N. (2006). The DOI Handbook. Oxford: Scaffidi, M., Khan, R., Wang, C., Keren, D., Tsui, C.,
International DOI Foundation. Garg, A., et al. (2017). Comparison of the impact of
Patashnik, E., Gerber, A., & Dowling, C. (2017). Wikipedia, UpToDate, and a digital textbook on
Unhealthy politics: The battle over evidence-based short-term knowledge acquisition among medical
medicine. Princeton: Princeton University Press. students: Randomized controlled trial of three web-
Pivovarov, R., & Elhadad, N. (2015). Automated meth- based resources. JMIR Medical Education, 3(2), e20.
ods for the summarization of electronic health Retrieved from http://mededu.jmir.org/2017/2/e20/.
records. Journal of the American Medical Informatics Silberg, W., Lundberg, G., & Musacchio, R. (1997).
Association, 22, 938–947. Assessing, controlling, and assuring the quality of
Pluye, P., & Grad, R. (2004). How information retrieval medical information on the Internet: Caveat lector et
technology may impact on physician practice: An viewor – let the reader and viewer beware. Journal of
organizational case study in family medicine. the American Medical Association, 277, 1244–1245.
Journal of Evaluation in Clinical Practice, 10, 413– Sim, I. (2016). Two ways of knowing: Big data and
430. evidence-based medicine. Annals of Internal
Rigden, D., & Fernández, X. (2020). The 27th annual Medicine, 164, 562–563.
Nucleic Acids Research database issue and molecu- Simpson, M., Voorhees, E., & Hersh, W. (2014).
lar biology database collection. Nucleic Acids Overview of the TREC 2014 clinical decision support
Research, 48, D1–D8. Retrieved from https:// track. Paper presented at the The Twenty-Third Text
academic.oup.com/nar/article/48/D1/D1/5695332. REtrieval Conference Proceedings (TREC 2014),
Roberts, P., Cohen, A., & Hersh, W. (2009). Tasks, top- Gaithersburg, MD.
ics and relevance judging for the TREC Genomics Sollins, K., & Masinter, L. (1994). Functional require-
Track: Five years of experience evaluating biomedi- ments for uniform resource names. Retrieved from
cal text information retrieval systems. Information http://www.w3.org/Addressing/rfc1737.txt.
Retrieval, 12(1), 81–97. Somashekhar, S., Sepúlveda, M., Puglielli, S., Norden,
Roberts, K., Simpson, M., Voorhees, E., & Hersh, W. A., Shortliffe, E., RohitKumar, C., et al. (2018).
(2015). Overview of the TREC 2015 clinical decision Watson for Oncology and breast cancer treatment
support track. Paper presented at the The Twenty- recommendations: Agreement with an expert multi-
Fourth Text REtrieval Conference (TREC 2015) disciplinary tumor board. Annals of Oncology, 29,
Proceedings, Gaithersbug, MD. 418–423.
794 W. Hersh
Sox, H. (2009). Medical journal editing: Who shall pay? Vosoughi, S., Roy, D., & Aral, S. (2018). The spread of
23 Annals of Internal Medicine, 151, 68–69.
Spitzer, V., Ackerman, M., Scherzinger, A., & Whitlock,
true and false news online. Science, 359, 1146–1151.
Weibel, S., & Koch, T. (2000). The Dublin Core Metadata
D. (1996). The visible human male: A technical Initiative: Mission, current activities, and future
report. Journal of the American Medical Informatics directions. D-Lib Magazine, 6. Retrieved from http://
Association, 3, 118–130. www.dlib.org/dlib/december00/weibel/12weibel.html.
Stanfill, M., Williams, M., Fenton, S., Jenders, R., & Wennberg, J. (2010). Tracking medicine: A researcher’s
Hersh, W. (2010). A systematic literature review of quest to understand health care. Oxford: Oxford
automated clinical coding and classification sys- University Press.
tems. Journal of the American Medical Informatics Wenzel, R. (2017). Medical education in the era of alter-
Association, 17, 646–651. native facts. New England Journal of Medicine, 377,
Swanson, D. (1988). Historical note: Information 607–609.
retrieval and the future of an illusion. Journal of the Westbrook, J., Coiera, E., & Gosling, A. (2005). Do
American Society for Information Science, 39, 92–98. online information retrieval systems help experi-
Taichman, D., Sahni, P., Pinborg, A., Peiperl, L., Laine, enced clinicians answer clinical questions? Journal
C., James, A., et al. (2017). Data sharing statements of the American Medical Informatics Association,
for clinical trials: A requirement of the International 12, 315–321.
Committee of Medical Journal Editors. Annals of Wildemuth, B., deBliek, R., Friedman, C., & File, D.
Internal Medicine, 167, 63–65. (1995). Medical students’ personal knowledge,
Tang, P., Newcomb, C., Gorden, S., & Kreider, N. searching proficiency, and database use in problem
(1997). Meeting the information needs of patients: solving. Journal of the American Society for
Results from a patient focus group. Paper presented Information Science, 46, 590–607.
at the Proceedings of the 1997 AMIA Annual Fall Wilkinson, M., Dumontier, M., Aalbersberg, I.,
Symposium, Nashville, TN. Appleton, G., Axton, M., van der Lei, J., et al.
Taylor, H. (2010). “Cyberchondriacs” on the Rise? Those (2016). The FAIR Guiding Principles for scientific
who go online for healthcare information continues to data management and stewardship. Scientific Data,
increase. Retrieved from Rochester, NY: http://www. 3, 160018. Retrieved from https://www.nature.com/
harrisinteractive.c om/vault/HI-Harris-Poll- articles/sdata201618.
Cyberchondriacs-2010-08-04.pdf. Yaman, H., Yavuz, E., Er, A., Vural, R., Albayrak, Y.,
van Rijsbergen, C. (1979). Information retrieval. Yardimci, A., & Asilkan, O. (2016). The use of
London: Butterworth. mobile smart devices and medical apps in the family
Voorhees, E. (2013). The TREC Medical Records Track. practice setting. Journal of Evaluation in Clinical
Paper presented at the Proceedings of the Practice, 22, 290–296.
International Conference on Bioinformatics, Zarin, D., Tse, T., Williams, R., Califf, R., & Ide, N.
Computational Biology and Biomedical Informatics, (2011). The ClinicalTrials.gov results database--up-
Washington, DC. date and key issues. New England Journal of
Voorhees, E., & Harman, D. (Eds.). (2005). TREC: Medicine, 364, 852–860.
Experiment and evaluation in information retrieval. Zarin, D., Tse, T., Williams, R., & Rajakannan, T.
Cambridge, MA: MIT Press. (2017). Update on trial registration 11 years after
Voorhees, E., & Hersh, W. (2012). Overview of the TREC the ICMJE policy was established. New England
2012 Medical Records Track. Paper presented at the Journal of Medicine, 376, 383–391.
The Twenty-First Text REtrieval Conference Zerhouni, E. (2004). NIH public access policy. Science,
Proceedings (TREC 2012), Gaithersburg, MD. 306, 1895.
Voorhees, E., & Tong, R. (2011). Overview of the TREC Zhai, C., & Lafferty, J. (2004). A study of smoothing
2011 Medical Records Track. Paper presented at the methods for language models applied to informa-
The Twentieth Text REtrieval Conference tion retrieval. ACM Transactions on Information
Proceedings (TREC 2011), Gaithersburg, MD. Systems, 22, 179–214.
795 24
Clinical Decision-Support
Systems
Mark A. Musen, Blackford Middleton, and Robert A. Greenes
Contents
References – 836
Clinical Decision-Support Systems
797 24
nnLearning Objectives such as insulin pumps are exceptions to this
After reading this chapter, you should know model; decision support for these systems is
the answers to these questions: not covered in this Chapter.) Ideally, CDSSs
55 What are the key motivations for clini- can be described in terms of the five rights
cal decision support? that they aim to accomplish: “provide the
55 What are typical design considerations right information, to the right person, in the
when building a decision-support sys- right format, through the right channel, at
tem? the right point in workflow, to improve health
55 What are some ways in which develop- and health-care decisions and outcomes”
ers of decision-support systems encode (Osheroff et al. 2012).
and represent clinical knowledge? Systems that provide CDS may be con-
55 What are some current standards in the sidered in terms of three basic categories:
HIT industry that facilitate the con- (1) those that use information about the cur-
struction of decision-support applica- rent clinical context to retrieve highly rel-
tions? evant online documents, as with so-called
55 What are the current main areas of “Infobuttons” (introduced in 7 Chap. 14);
research and development in clinical (2) those that are intelligent systems that pro-
decision-support systems? vide patient-specific, situation-specific alerts,
advice, reminders, physician order sets, or
In this chapter, we discuss information tech- other recommendations for direct action; or
nology aimed at furnishing clinical decision (3) those that organize and present informa-
support (CDS) – the process that “provides tion in a way that facilitates problem solv-
clinicians, staff, patients, or other individu- ing and decision making, as in dashboards,
als with knowledge and person-specific infor- graphical displays, documentation templates,
mation, intelligently filtered or presented at structured reports, and order sets.
appropriate times, to enhance health and Systems that create order sets offer an
health care” (Osheroff et al. 2007). CDS ideal example of CDS, not only because they
systems (often referred to as CDSSs) com- can facilitate decision making by providing an
municate information that takes into consid- actionable set of recommendations such as a
eration the particular clinical context, offering combination of orders, but also because they
situation-specific information and recommen- provide a mnemonic function by gathering
dations. Generally, we think of such systems together items that should be associated in a
as reasoning about the clinical situation and particular setting. Order sets also can enhance
presenting their conclusions as recommenda- workflow by providing a means to select a
tions to the user. Information retrieval systems group of relevant activities quickly. Not all
that find relevant information from reposi- CDS systems have the ability to optimize
tories of documents with high relevance to a workflow, as we shall discuss. In fact, some
specific patient and clinical context can also CDS systems are subject to concern if they
serve in this role, but do not themselves make are poorly implemented, as they actually can
patient-specific recommendations for care. impede workflow or usability.
CDS systems do not directly perform clinical This chapter provides a review of computer-
decision making; they provide relevant knowl- based decision aids, emphasizing their role and
edge and analyses that enable the ultimate adoption within the current health-care milieu
decision makers—clinicians, patients, and of the United States while keeping an eye on
health-care organizations—to develop more global trends. It offers some thoughts on the
informed judgments; hence the importance nature of the decision-making process, then it
of the word “support” in the term “CDS”. provides a description of current implemen-
(Closed-loop systems such as in implantable tation strategies and challenges, and it closes
cardioverter defibrillators and other devices with a discussion of critical research questions
798 M. A. Musen et al.
that must be addressed to ensure optimal effec- perform, and assessing the value of the results
tiveness of CDS in clinical practice. that can be obtained in relation to associ-
ated risks or financial costs. Thus, diagnosis
24 involves deciding not only what is true about
24.1 he Nature of Clinical Decision
T a patient, but also what data are needed to
Making determine what is true. Even when the diag-
nosis is known, there often are challenging
If you ask lay people what the phrase “com- management decisions that test the physician’s
puters in medicine” means, they often describe knowledge and experience: Should I treat the
a computer program that helps physicians to patient or allow the process to resolve on its
make diagnoses. Although computers play own? If treatment is indicated, what should it
numerous important clinical roles, people be? How should I use the patient’s response to
have recognized, from the earliest days of therapy to guide me in determining whether
computing, that computers might support an alternate approach should be tried or, in
some cases, to question whether my initial
health-care workers by helping these people
diagnosis was incorrect after all? (In that
to sift through the vast collections of possible
sense, the response to treatment is also a type
diseases, findings, and treatments.
of diagnostic test.) Also, when a clinician and
We can view nearly all the contents of this
a patient are faced with alternative treatments,
book as addressing clinical data and clini-
and they seek help to choose among them, the
cal decision making. In 7 Chap. 2, we dis-
estimation of the chance for cure or the risk
cussed the central role of accurate, complete,
of death or of complications is an important
and relevant data in supporting the decisions
decision-making activity. Lastly, many disease
that confront clinicians and other health-
processes evolve over time, and evaluation and
care workers. In 7 Chap. 3, we described
change in treatment must evolve with them,
the nature of good decisions and the need
resulting in the need for guidelines for man-
for clinicians to understand the proper use agement that take into account the temporal
of information if they are to be effective and aspects of prior states and activities in order
efficient decision-makers. In 7 Chap. 4 we to decide what to do next. Decision making
introduced the cognitive issues that underlie also may involve integrating data from mul-
clinical decision-making and that influence tiple providers, treatments, and responses to
the design of systems for decision support. them, as well as increased use of personal sen-
Subsequent chapters have mentioned many sors and apps to provide additional data, so
real or potential uses of computers to assist that aspects of care coordination over time
with such decision-making. Medical practice play an important role. This is beyond our
is medical decision-making, so most applica- scope, but the topic is addressed in part in
tions of computers in health care are intended 7 Chaps. 21, 22 and 23.
to have a direct or indirect effect on the qual- Biomedicine is also replete with decision
ity of health-care decisions. In this chapter, we tasks that do not involve specific patients or
bring together these themes by concentrating their diseases. Consider, for example, the bio-
on methods and systems that have been devel- medical scientist who is using laboratory data
oped specifically to assist health workers in to help with the design of her next experi-
making decisions. ment or the hospital administrator who uses
By now, you are familiar with the range of management data to guide decisions about
clinical decisions. The classic problem of diag- resource allocation in his hospital. In addition,
nosis (analyzing available data to determine the new financial models for health-care payment
pathophysiologic explanation for a patient’s or reimbursement based on value (roughly,
symptoms) is only one of these. Equally chal- patient outcomes and cost of care), rather
lenging, as emphasized in 7 Chaps. 3 and 4, is than on fee-for-service calculations based on
the diagnostic process—deciding which ques- individual patient care activities, require that
tions to ask, tests to order, or procedures to decisions be made on the basis of aggregate
Clinical Decision-Support Systems
799 24
data on outcomes for groups of patients to data are collected, and decisions often have to
determine expected norms and to identify be made on an urgent basis.
outliers. Although we focus on systems to Access to data from all relevant sources
assist with clinical decisions in this chapter, is required but difficult to achieve in prac-
we emphasize that the concepts discussed tice. Patients may be seen in different venues,
generalize to many other problem areas as including a primary care office (perhaps via
well. In 7 Chap. 29, for example, we exam- a telecare visit), a specialist office, an emer-
ine the need for formal decision techniques gency room, a laboratory or imaging facility,
and tools in creating health policies. As we a hospital, an extended care facility, or they
develop databases that can identify patients may be monitoring themselves at home. Each
with specific diseases, with risks of compli- venue may be using different and incompat-
cations, or in need of specific interventions ible EHRs or data repositories, terminolo-
such as screening tests or immunizations (see gies, and data representation models. Access
7 Chap. 18), population management can be to these data and interoperability across data
used to provide a form of decision support for repositories may be limited. Typically, the
groups of patients. Some clinical decision sup- data available are only those within a health
port is also aimed directly at patients, in terms system and its EHR–which may include affili-
of alerts, reminders, or aids to interpretation ated office practices, clinics, emergency rooms,
of information, especially given increasing use and hospitals, or by interaction with an avail-
of smartphone apps and connected sensors able health information exchange (HIE) (see
and devices; techniques for assessing prog- 7 Chap. 15).
nosis and risk of alternative strategies should Equally important is the quality of the
involve shared decision making between pro- available data to input to a CDSS. In 7 Chap.
viders and patients, which is also an impor- 2, we discussed imprecision in terminology,
tant area of activity. illegibility and inaccessibility of records, and
In this chapter, we focus on decision aids other opportunities for misinterpretation of
for the provider in particular—the clinician data. Similarly, measurement instruments or
seeing the patient at the point of care. The recorded data may simply be erroneous; use
requirements for excellent decision-making of faulty data can have serious adverse effects
fall into three principal categories: (1) accu- on patient-care decisions. Thus, clinical data
rate data, (2) pertinent knowledge, and (3) often need to be validated.
appropriate problem-solving, or clinical rea- Even good data are useless if we do not
soning, skills. have the knowledge necessary to apply them
The data about a patient must be ade- properly. Decision-makers must have broad
quate—both accurate and sufficiently com- knowledge of medicine, in-depth familiarity
prehensive to include everything relevant for with their area of expertise, and access to per-
making an informed decision—but they must tinent additional information resources. Their
not be excessive (see 7 Chap. 4). Indeed, a knowledge must be accurate, with areas of
major challenge occurs when decision-makers controversy well understood and questions of
are bombarded with so much information personal choice well distinguished from those
that they cannot process and synthesize the where a more prescriptive approach is appro-
information intelligently and rapidly (see, for priate. Their knowledge must also be current;
example, 7 Chap. 21). Thus, it is important in the rapidly changing world of medicine,
to know when additional data will confuse facts decay just as certainly as dead tissue
rather than clarify and when it is imperative does.
to use tools (computational, visual, or other- Good data and an extensive factual knowl-
wise) that permit data to be summarized for edge base still do not guarantee a good deci-
easier cognitive management (see 7 Chap. 4). sion; good problem-solving skills are equally
Operating rooms and intensive-care units are important. Decision-makers must know how
classic settings in which this problem arises; to set appropriate goals for a task, how to rea-
patients are monitored extensively, numerous son about each goal, and how to make explicit
800 M. A. Musen et al.
the trade-offs between costs and benefits of of formal models of clinical reasoning. At
diagnostic procedures or therapeutic maneu- the same time, construction of such systems
vers. Skilled clinicians draw extensively on offers obvious societal benefits if the com-
24 personal experience, and new physicians soon puter programs can aid practitioners in their
realize that good clinical judgment is based as care of patients and can lead to better clinical
much on an the ability to reason effectively outcomes. Although the more academic con-
and appropriately about what to do as it is siderations have provided strong motivation
on formal knowledge of the field or access for work in the area of computer-based deci-
to high-quality patient data. Thus, clinicians sion aids over several decades, the recognition
must develop a strategic approach to the selec- of the importance of CDSSs as practical tools
tion and interpretation of diagnostic tests, has increased markedly in recent years as a
understand ideas of sensitivity and specificity, result of the inexorable growth in health-care
and be able to assess the urgency of a situa- complexity and cost, as well as the introduc-
tion. Similar issues relating to test or treat- tion of new health-care legislation, regulatory
ment selection, in terms of costs, risks, and initiatives, and payment incentives aimed at
benefits, must be understood. Awareness of addressing these trends—which have all made
biases and of the ways that they can creep into the development and broad adoption of CDS
problem-solving also are crucial (see 7 Chap. technology a priority.
3). Also, as noted above, patient preferences The twenty first century has seen changes
and concerns must be adequately addressed in health-care practices that make the devel-
as part of the decision-making process. Thus, opment of CDS technology particularly
good communication and interaction with necessary. Computer-based CDS has taken
the patient is essential. This brief review of on increasing urgency for four reasons: (1)
issues central to clinical decision-making increasing challenges related to knowledge
serves as a fitting introduction to the topic of and information management in clinical prac-
computer-assisted decision-making: Precisely tice, thus increasing physician information
the same topics are pertinent when we develop needs, (2) the ubiquity of electronic medical
a computational tool for CDS. The program records and the desire to enhance health care
must have access to good data, it must have through the communication and integration
extensive background knowledge encoded for of the relevant data, (3) the goal of deliver-
the clinical domain in question, and it must ing increasingly personalized health-care ser-
embody an intelligent approach to problem- vices—tailored to the patient’s preferences for
solving that is sensitive to requirements for care and to his or her individual genome, and
proper analysis, appropriate cost–benefit (4) the growing evidence that CDS can not
trade-offs, and efficiency. only improve the quality of care delivered but
also reduce costs of care. We consider these
four reasons in the sections that follow.
24.2 Motivation for We note another factor that is shaping
Computer-Based CDS new directions for CDS which we will explore
further at the end of this chapter—the trend
Since the 1960s, workers in biomedical infor- toward use of personal devices and apps, as
matics have been interested in CDS systems well as more portable, distributed laboratory
because of a desire both to improve health or analytic procedures, to provide a growing
care and to understand better the process of range of sources of data to incorporate into
medical decision-making. Building a com- decision making. These devices and apps also
puter system that attempts to process clini- provide an ability to track, monitor, and inter-
cal data to offer situation-specific advice can vene early whenever needed, both interacting
provide insight into the nature of medical with the user/patient directly, and also for those
problem solving and can enable the creation conditions warranting it, with the provider.
Clinical Decision-Support Systems
801 24
24.2.1 Physician Information expanding clinical knowledge base, and now
Needs and Clinical Data as more data sources are available and use
of EHRs is almost ubiquitous – are primary
Management
drivers for the adoption of CDS systems. (See
7 Chap. 23 for a deeper discussion of physi-
Modern health care is characterized by an
cian information needs.)
ever-expanding knowledge base of clinical
medicine, and by a growing clinical data base
describing every patient characteristic from
phenotype to genotype (Kohn et al. 2002). 24.2.2 EHR Adoption
Despite the growing amounts of data and and Integration of CDS
knowledge with which physicians need to
work, health-care practitioners have seen the The motivations for adoption of EHRs and
average time for a clinical encounter steadily CDS are affected by the way in which health
curtailed, particularly in the United States, care is financed and paid for, by the structure
where the pressures of the prevalent fee-for- and organization of the health-care system of
service reimbursement system and a concomi- a nation or a region, and by political forces that
tant rise in the amount of paperwork required can create constraints, regulations, and incen-
for administrative management and billing tives. In this section, we use the United States
continue to squeeze practitioners (Baron as an example to demonstrate these influences.
2010). Studies of information needs among Health-care safety and quality concerns,
physicians in clinical practice have long coupled with a seemingly inexorable rise in
revealed that unanswered clinical questions health-care costs, have led in the United States
are common in ambulatory clinical encoun- to a variety of cost-containment and quality-
ters, with as many as one or two unanswered improvement strategies in recent years.
clinical questions about diagnosis, therapy, Health-care delivery in the United States is
or administrative issues arising in every visit in the midst of a profound transformation,
(Covell et al. 1985). Prior to the broad adop- in part due to Federal public policy efforts
tion of EHRs, in as many as 81% of clini- to encourage the adoption and use of health
cal encounters in ambulatory care, clinicians information technology (HIT). The American
were found to be missing critical informa- Recovery and Reinvestment Act (ARRA) of
tion, with an average of four missing items 2009, and the HITECH regulations within
per case (Tang et al. 1994, 1996). Currently, it, created incentives for the widespread
even with an EHR, providers may face major adoption of health information technologies
challenges in accessing relevant information, (Blumenthal 2009; see 7 Chap. 29). These
acquiring a complete picture of the patient’s public policy efforts, while ultimately suffer-
clinical state and history, and knowing what ing some reversals for political reasons, are
further testing or therapeutic actions are best often viewed as a long-term adjunct to current
to take. Prior to broad EHR adoption, stud- health-care–payment reform efforts in the
ies suggested that as many as 18% of medical U.S., and a prelude to additional health-care–
errors might be due to inadequate availability delivery redesign, payment reform, and cost
of patient information (Leape 1994). Today, containment. As recently as 2012, only 34.8%
conversely, the overabundance of information of physicians in ambulatory practice in the
in the EHR can lead to ‘information chaos’ US used a basic or comprehensive electronic
and may make it difficult for the clinician to medical record (Decker et al. 2012), and only
find relevant information for clinical decision 26.6% of U.S. hospitals used health informa-
making (Melnick et al. 2019; Beasley et al. tion technologies in inpatient care-delivery set-
2011). The demands for increased informa- tings (DesRoches et al. 2012), although these
tion management – in the setting of an ever- numbers have had a rapid upward trajectory.
802 M. A. Musen et al.
The ARRA and HITECH policies, and the EHR data (Blumenthal and Tavenner 2010;
resulting technology adoption, changed the Adler-Milstein et al. 2017). More recently, the
practice of medicine and clinical care delivery Medicare Access and CHIP Reauthorization
24 in both beneficial and untoward ways (Sittig Act of 2015 (MACRA),1 the Twenty-first
and Singh 2011). Century Cures Act of 2016 in the United
To achieve meaningful and effective use of States,2 and other initiatives have promoted
HIT, the software must be viewed as one com- a wide array of additional enhancements and
ponent of a complex sociotechnical system, improvements to the use of health informa-
in which all elements must work effectively tion technology in clinical practice. Notably,
(Institute of Medicine 2011a). The movement the Twenty-first Century Cures Act removed
toward value-based reimbursement, a more from U.S. Food and Drug Administration
recent U.S. trend, encourages emphasis on (FDA) consideration as a medical device soft-
wellness, prevention, and early intervention in ware devoted to performing the functions of
disease processes, and requires a new level of an EHR, administrative tools, or providing
connectivity, continuity, and coordination of clinical decision support. Significant push-
care across multiple venues is gaining in accep- back to the burden on both EHR vendors
tance. A prominent example is the advent of and on health-care organizations to comply
Accountable Care Organizations (ACOs; with Meaningful Use Phase III resulted in
McClellan 2015), which require emphasis on relaxation of these constraints. Meaningful
not only integration of data from all sources Use regulations have been superseded in the
(clinical and financial) but also aggregate, United States by the Cures Act, promot-
population-based data on outcomes, costs, ing more sweeping goals for interoperability
and identification of outliers. These alterna- and access to quality measures (Sinsky and
tive payment models are distributing upside Privitera 2018; see 7 Chap. 29). More recently,
and downside financial risk based upon qual- political considerations regarding health-care
ity outcomes, patient experience, and costs, financing in the United States have made the
and thus call for CDSSs to support both pay- speed of adoption of such measures some-
ers and providers. what uncertain.
One of the principal motivations for EHR From a more worldwide perspective,
adoption is to provide an infrastructure with interoperability of data and of CDS knowl-
which to improve the quality, safety, and edge models are essential for wide adoption,
efficacy of health-care delivery. In the past as well as for improving the evidence base
decade, the U.S. government placed consider- from which knowledge is derived.
able emphasis on the adoption of quality mea-
sures and quality-reporting requirements as
part of meaningful use of HIT (Clancy et al. 24.2.3 Precision Medicine
2009; Institute of Medicine 2011a). Quality
measures, despite their ability to provide feed- The fundamental model for the practice of
back that stimulates improved performance medicine has undergone dramatic change in
by the clinician, are only part of the process the past century or so. The objectives of clini-
needed to make the desired improvements.
Prospective, proactive clinical decision sup-
port must also be in place. The U.S. govern-
1 MACRA (2015). The Medicare Access and CHIP
ment’s rules for meaningful use of HIT, which Reauthorization Act of 2015 (MACRA). Retrieval
became progressively more demanding over a February 19, 2020: 7 https://www.cms.gov/medi-
4–6 year period, required only minimal CDS care/quality-initiatives-patient-assessment-instru-
compliance in phase I and II, but Phase III ments/value-based-programs/macra-mips-and-
of the meaningful use regulations in 2016 was apms/macra-mips-and-apms.html
2 Twenty-First Century Cures Act. H.R. 34, 114th
intended to increase the mandate for CDS in Congress (2016). Retrieval February 19, 2020:
EHR systems substantially by requiring APIs 7 h t t p s : / / w w w. g p o. g ov / f d s y s / p k g / B I L L S -
(application program interfaces) for access to 114hr34enr/pdf/BILLS-114hr34enr.pdf
Clinical Decision-Support Systems
803 24
cal care have shifted radically from the archaic regarding their own care (see 7 Chap. 26).
goal of correcting putative imbalances of In this approach, clinical decision making is
bodily humors to the scientific understand- explicitly patient-centered in new ways, bring-
ing of pathophysiology, of mechanisms for ing the best evidence at the genetic level to
eliminating pathogens, and of remedying bear on many clinical scenarios, while incor-
biological aberrancies. The resulting view porating patient preferences for acquiring and
of medicine as the application of biologi- applying genetic information (Fargher et al.
cal principles was at the core of the report 2007; Marcial et al. 2018). The increasing use
produced by Abraham Flexner (1910) that of genomics in medicine (Chan and Ginsburg
upended medical education in the early twen- 2011) is generating data that outstrip the
tieth century and that had led to the reduc- information and knowledge-processing capa-
tionist biomedical model that prevailed for the bilities of practitioners, and many clinicians
rest of that century. More recently, however, feel threatened by the impending tsunami
George Engel’s biopsychosocial model (Engel of additional knowledge that they will need
1977) brought to the fore of clinical care the to master (Baars et al. 2005). As precision
need to address psychological and social fac- medicine becomes the norm, primary-care
tors in clinical treatment plans in addition to and specialist practitioners alike will need to
underlying biomedical problems. By the end manage their patients by interpreting genomic
of the twentieth century, it became increas- tests along with myriad other data at the point
ingly accepted that CDS requires not only the of care. It is hard to imagine how clinicians
rapid communication of appropriate scientific will manage to perform such activities with-
medical knowledge, but also the adaptation out substantial computer-based assistance.
of that knowledge to reflect the psychologi- Informatics is well suited to support a person-
cal and social situation that would temper the alized approach to clinical genomics (Ullman-
application of the knowledge. Added to this Cullere and Mathew 2011).
complexity is the aging of the population, As mentioned earlier, another related
owing in part to advances in health and health change is growing recognition of the impor-
care. The result has been a much higher bur- tance of promoting optimal health and
den of chronic diseases, multiple diseases, and wellness, not just by treating disease but by
multiple testing and treatment options, with encouraging healthy lifestyles, fostering com-
both their positive and negative consequences pliance with health and health-care regimens,
that must be balanced—all contributing to the and carrying out periodic health-risk assess-
increasing intricacy of care. Indeed, the view ments. Key to prescriptive medicine are tools
of care is so complex for some patients that to support prospective medicine (Langheier
a major role of CDS is to integrate models and Snyderman 2004)—assisting the acqui-
of patient state and context (provider, set- sition of a detailed family history, social
ting, specialization, and activity) to provide history, and environmental history, using per-
selective visualization, analysis, and decision- sonal apps and sensors, providing health-risk
making support for optimal care management assessments, and managing genomic informa-
(Greenes et al. 2018). tion (Hoffman and Williams 2011; Overby
As a further extension of these trends, the et al. 2010).
genomic era in which we now live has fur-
ther increased the need for clinical practice
to reflect precision medicine and the need to 24.2.4 Savings Potential
tailor care to individual factors in ways that with Health IT and CDS
never before were imaginable (Ginsburg and
Willard 2009). Precision medicine is charac- CDS has been shown to influence physician
terized by decision making that may take into behavior (Colombet et al. 2004; Lindgren
account patient personal history, family his- 2008; Schedlbauer et al. 2009), diagnostic
tory, social and environmental factors, along test ordering and other care processes (Bates
with genomic data and patient preferences and Gawande 2003; Blumenthal and Glaser
804 M. A. Musen et al.
2007), and the costs of care (Haynes et al. advent of legal, regulatory, and financial driv-
2010), and it may have a modest impact on ers, as well as the increasing importance of
clinical outcomes (Bright et al. 2012). While personalizing medical decision making on
24 there is enormous promise for HIT and CDS, the basis of genomic data, now make CDS an
their implementation is not without potential essential element of modern clinical practice.
peril: HIT poorly designed or implemented,
or misused, can generate unintended conse-
quences (Ash et al. 2007; Harrison et al. 2007; 24.3 Methods of CDS
Bloomrosen et al. 2011), and introduce new
types of medical errors (Institute of Medicine As we have already noted, CDS systems
2011a). (1) may use information about the current
Only a handful of studies have examined clinical context to retrieve pertinent online
the return on investment (ROI) for HIT, and documents; or (2) they may provide patient-
even fewer have investigated ROI for decision- specific, situation-specific alerts, reminders,
support specifically. The value of CDS in physician order sets, or other recommen-
terms of ROI is difficult to measure. Isolated dations for direct action; or (3) they may
studies of various hand-crafted systems in organize information in ways that facilitate
academic centers have shown value (Wang decision making and action. Category (2)
et al. 2003; Kaushal et al. 2006), but adoption largely consists of the various computer–
elsewhere has often been problematic. Broad based approaches (“classic” CDS based on
adoption has not occurred, for many reasons the application of intelligent systems) that
discussed later in this chapter, including the have been the substrate for work in informat-
proprietary nature of systems for CDS and ics since the advent of applied work in proba-
for representation of knowledge, the lack of bilistic reasoning and artificial intelligence in
interoperability of data and knowledge, the the 1960s and 1970s. Such systems provide
mismatch of CDS to workflow, and usability custom-tailored assessments or advice based
concerns. on sets of patient-specific data. They may fol-
Systematic reviews of the scientific lit- low simple logics (such as algorithms), they
erature, such as the one performed by Bright may be based on decision theory and cost–
et al. (2012), have not been able to demon- benefit analysis, or they may use probabilistic
strate an effect of CDS on patient outcomes approaches or derive their conclusions on the
except in the short term. This finding is not basis of machine learning from large amounts
surprising, however, because the time point at of data. Some diagnostic assistants (such as
which CDS occurs is often long before a final DXplain; Barnett et al. 1987) suggest differ-
outcome, and many intervening factors may ential diagnoses or indicate additional infor-
have a greater effect. In the case of CDS for mation that would help to narrow the range
many chronic diseases whose complications of etiologic possibilities. Other systems sug-
ensue over years or decades, it simply may gest a single best explanation for a patient’s
be impractical to continue longitudinal stud- symptomatology. Other systems interpret
ies of sufficiently long duration to be able to and summarize the patient’s record over time
measure meaningful differences in outcome. in a manner sensitive to the clinical context
Nevertheless, economic simulation studies of (Shahar and Musen 1996). Still other systems
the potential effect of CDS on chronic dis- provide therapy advice rather than diagnostic
eases have demonstrated benefit in the long assistance (Musen et al. 1996).
term (McClellan 2015; Bu et al. 2007; Adler- CDS systems can achieve their results
Milstein et al. 2007). using a wide variety of computational meth-
Historically, the adoption of CDS technol- ods. These approaches include Bayesian
ogy has been motivated by a virtuous desire to probabilistic reasoning (see 7 Chap. 3), the
enhance the performance of clinicians when use of machine learning to make predic-
dealing with complex situations. The recent tions based on large amounts of data (often
Clinical Decision-Support Systems
805 24
from the EHR), performing inference using and so on. Still, there is no controlled termi-
IF–THEN rules, or by the identification of nology that can capture all the nuances of a
relevant templates for the clinician to fill in, patient’s history of present illness or findings
such as knowledge-based groupings of phy- on physical examination. There is no cod-
sician orders (order sets), or some combina- ing system that can reflect all the details of
tion of these approaches (James et al. 2013). physicians’ or nurses’ progress notes. Given
CDS systems may acquire the data on which that much of the information in the medical
they base their recommendations interactively record that we would like to use to drive deci-
from users or directly from a health informa- sion support is not available in a structured,
tion system (or some combination of these machine-understandable form, there are clear
approaches). We now discuss the issues that limitations on the data that can be used to
drive CDS system design, and we highlight assist clinician decision-making. The prose of
how these issues are manifest in current clini- progress notes, consultation notes, procedure
cal decision aids. or operation reports, discharge summaries,
and other documents contains an enormous
amount of information that never makes it
24.3.1 Acquisition and Validation to the coded part of the EHR. Nevertheless,
of Patient Data even when computer-based patient records
store substantial information only as free-
As mentioned in the introduction to this text entries, the data that are also available
chapter, a prerequisite to any decision-making in coded form (typically, diagnosis codes and
process is having available all the data that are prescription data) can be used to significant
needed to perform the required actions. As advantage (van der Lei et al. 1991).
emphasized in 7 Chap. 2, few problems are The desire to access information from the
more challenging than the development of EHR that may be available only in text has
effective techniques for capturing patient data been a topic of great interest to the CDS
accurately, completely, and efficiently. You community. Some information systems pro-
have read in this book about a wide variety of vide options for structured data entry, ask-
techniques for data acquisition, ranging from ing clinicians to use fill-in-the-blanks forms
keyboard entry, to speech input, to methods or templates on the computer screen to enter
that separate the clinician from the computer patient-related information that otherwise
(such as scannable forms, real-time data mon- would be entered as part of a prose note. In
itoring, and intermediaries who transcribe general, providers have resisted such human–
written or dictated data for use by computers). computer interfaces, often finding it restrictive
The problems of data acquisition go and cumbersome to make selections from pre-
beyond entry or extraction from the EHR, defined menus when they would much rather
or from other repositories, of the data them- express themselves more freely in prose. In
selves, however. A primary obstacle is that fact, structured templates or methods for col-
we lack standardized ways of expressing lecting information about a particular prob-
most clinical situations in a form that com- lem or finding are themselves often regarded
puters can interpret. As discussed in detail as a form of CDS, in that they provide an
in 7 Chap. 7, there are several controlled organized framework and a mnemonic func-
medical terminologies that health-care work- tion. Fortunately, work in natural language
ers use to specify precise diagnostic evalua- processing has made major advances in recent
tions (e.g., the International Classification of years, making it increasingly possible to mine
Diseases and SNOMED CT), clinical proce- the textual notes of EHRs to identify infor-
dures (e.g., Current Procedural Terminology mation that might bear on the CDS process
and LOINC codes), drugs (e.g., RxNorm), (see 7 Chap. 8).
806 M. A. Musen et al.
3 1 2
Information Execution
Knowledge Base
model engine
4
Result
CDS module specification
5
Application
Host-specific Host-specific environment
Invoking information action
process mapping mapping
.. Fig. 24.1 A conceptual model of CDS components. recommendation to be provided; and (5) how the pro-
There are five aspects of CDS that work together to pro- cess interacts with the application environment, includ-
duce CDS capability. The aspects are: (1) the method of ing how it is invoked and how the data and
computation or inferencing; (2) the knowledge needed recommendations are communicated. (Adapted from
to carry out the computation; (3) the information model Greenes 2014)
for the data that drive decision making; (4) the type of
Clinical Decision-Support Systems
807 24
action, classification, relevant citations); and Indeed, such systems are now ubiquitous in
(5) how the process interacts with the applica- health care. (7 Chap. 25 provides a compre-
tion environment, including how it is invoked hensive discussion of information-retrieval
(e.g., a process launched by an event monitor, methods.)
by the user explicitly, or by being embedded in The simplest, and perhaps most com-
workflow) and how the data and recommen- mon, form of CDS uses contextual informa-
dations are communicated (e.g., provided via tion from an EHR to perform information
a FHIR API or delivered as a popup alert). In retrieval from a database of information
this section, we focus primarily on the meth- about online documents. A person view-
ods of computation required for CDS (Aspect ing data in an EHR may see selectable icons
1 of . Fig. 24.1), and to a lesser extent on (infobuttons) next to the names of drugs,
the knowledge needed during computation laboratory tests, patient problems, or other
(Aspect 2). We consider the other elements in elements of the patient record, or the items
subsequent sections. themselves may be hyperlinked to an informa-
Aspect 1 was central to much of the early tion retrieval engine. Clicking on an infobut-
work on CDS development, beginning in the ton causes the clinical information system to
1960s and largely led by academic centers, perform a query on the database, providing
which focused on developing and exploring the user with one or more immediately acces-
different computational models of CDS in sible resources that can offer more informa-
the absence of any health IT infrastructure in tion about the item in question. Alternatively,
which to embed the systems. In recent years, the system may automatically query one or
much of CDS has been built by vendors of more of those external resources and return
proprietary systems, giving rise to a range of the results of the queries for display (Cimino
different approaches, often embedded in those et al. 2002). Clicking on an infobutton next to
systems, and with methods that are less able a drug, for example, might allow the user to
to be inspected, formalized, and shared. Some access information about customary dosing,
of the trends in health-care delivery that we side effects, or alternative medications (see
cited in 7 Sect. 26.2 are hoped to give rise . Fig. 14.15 in 7 Chap. 14). The query that
to increased interoperability and sharing in retrieves the links to the documents is tailored
vendor- developed systems. Even if current based on whatever is next to the infobutton
commercial products may not manifest the icon on the screen. The query may also take
different components of . Fig. 26.1 dis- into account contextual information, such
tinctly, we adopt this conceptual focus in our as patient-related data, the activity in which
discussion of various strategies for CDS to the user is engaged, and the role of the user
clarify the underlying principles. in the health-care enterprise (physician, nurse,
patient, and so on).
24.3.2.1 Context-Specific An infobutton manager mediates the que-
Information Retrieval ries between the clinical information sys-
Early developers of CDS systems argued that tem and the available information resources.
decision support entails more than identify- The standards development organization
ing relevant information that can help a clini- Health Level Seven has created a standard
cian to solve a problem; it was believed that for “context-aware knowledge retrieval,” lead-
a CDS system has to suggest specifically how ing to infobutton managers that have been
the problem should be solved. Thus, simply adopted by many commercial EHR vendors.3
providing information for the user to read and
digest was not considered “real” decision sup-
port. Nevertheless, as information-retrieval 3 HL7 International (2014). HL7 Version 3 Standard:
systems have improved over the years—with Context Aware Knowledge Retrieval Application
better performance characteristics—it is (“Infobutton”), Knowledge Request, Release 2.
Retrieval February 19, 2020: 7 https://www.hl7.
often hard to maintain that such systems do o r g / i m p l e m e n t / s t a n d a r d s / p r o d u c t _ b r i e f.
not offer decision support in a genuine sense. cfm?product_id=208
808 M. A. Musen et al.
Infobutton managers need to anticipate how and formatting of reports). This capability
the clinical context might tailor the specific serves not only as a convenience to the user,
query performed by any given infobutton, so but also as a mnemonic function (i.e., offering
24 that the result of the query is highly precise a check list; Gawande 2009).
and relevant to the situation at hand. Detailing When patients are admitted to the hospital
specifically how contextual information might with a particular condition such as a suspected
alter the queries performed by each infobutton myocardial infarction or pneumonia, when
type can be tedious, and the process requires hospital staff must prepare them for diagnos-
developers to be adept at second-guessing all tic procedures or surgery, or when they need
the reasons that might cause a user to click on to be transferred to another care team or to
a particular infobutton. Current research con- be discharged home, there often are stereo-
centrates on the development of a Librarian typical groups of medical orders that physi-
Infobutton Tailoring Environment (LITE; Jing cians tend to request. For example, patients
et al. 2015) that promises to aid the authoring with possible pneumonia often require a chest
of infobutton queries via “wizards” and other x-ray examination, the recording of vital signs
user-interface conveniences. at certain intervals, the administration of sup-
Although infobuttons are unquestion- plementary oxygen, cultures of their sputum
ably important knowledge resources, many or blood, and administration of antibiotics.
people would argue that they are not true When patients are admitted to a hospital with
CDS systems. Infobuttons retrieve relevant the diagnosis of pneumonia, the EHR can
information for a user, but they do not explic- automatically suggest to the treating physi-
itly address particular decisions that the user cians that such a set of orders be considered.
needs to make. The possible reasons that a Systems that produce such order sets can
user might click on an infobutton are folded use the clinical situation to tailor the recom-
into the query specification at the time that mended orders (e.g., the computer may not
the infobutton is created; at runtime, of recommend a chest x-ray examination if the
course, there is no way for the system to know patient has just had one; knowledge that the
exactly why the user selected the infobut- patient has been placed on an artificial venti-
ton. Infobutton managers therefore require lator may trigger a separate set of associated
sophisticated query capabilities, but they do orders for consideration).
not need to reason from a clinical situation to Clinicians routinely face many other ste-
a particular recommendation. reotypical tasks, such as describing the results
When the goal is to generate a situation- of a diagnostic procedure such as an imag-
specific recommendation regarding diagnosis ing study or reporting the sequence of events
or therapy, developers need to turn to meth- that took place during a surgical operation.
ods that can perform some kind of inference. Automated systems that recognize the clinical
The sophistication of the required technique context can provide a tailored template for the
is a function of the kind of inference that is clinician to fill in, helping to ensure that infor-
necessary to render a result for the user. mation is provided accurately and completely.
Throughout their professional activi-
24.3.2.2 Organizing or Grouping ties, clinicians constantly must keep in mind
Information as a CDS large amounts of information when treating
Method patients with even routine medical problems
As we have noted earlier, a valuable kind of and reporting the results of their work. The
CDS is to organize information to provide a use of simple mnemonics such as check lists
ready collection of items that need to be con- can be remarkably effective in ensuring that
sidered together (e.g., order sets for particular care givers remember everything that they
clinical indications or settings, documentation need to do or to report in a given context
templates for particular purposes, or structure (Algaze et al. 2016; Alamri et al. 2016; Pageler
Clinical Decision-Support Systems
809 24
et al. 2014). Transforming such check lists other than for particular kinds of computa-
into groups of orders to consider, groups of tions such as drug dose calculations (e.g., for
features to note in a diagnostic evaluation, or pediatric patients). In addition, the advantage
groups of steps that may have been followed of their implementation on computers has not
when performing a surgical procedure can been clear; the use of simple printed copies of
help clinicians to remember important details the algorithms generally has proved adequate
and to improve both the quality of medical for clinical care (Komaroff et al. 1974). A
care and what clinicians may report about it. noteworthy exception that gained enormous
Computationally, systems that offer order attention in the early 1970s was a computer
sets or templates to clinicians typically assem- program deployed in Boston at what was then
ble predefined information from collections the Beth Israel Hospital (Bleich 1972); it used
of text strings or database entries. The HL7 detailed algorithmic logic to provide advice
organization has issued a standard for creat- regarding the diagnosis and management of
ing libraries of order sets. Systems that sug- acid–base and electrolyte disorders. More
gest specific order sets to users by making recently, such branching-logic and other infer-
selections from such a library are generally ence methods approaches have been widely
embedded within the EHRs that call on their adopted in the administrative information
services, however. Thus, the specific methods systems that third-party payers use to process
that these systems use tend to be proprietary. requests to pre-certify payment for expensive
Researchers have experimented with the services such as MRI studies and elective sur-
use of machine-learning techniques to create gery (the HL7 Da Vinci Project is a notable
order sets on the fly based on empirical data. body of work in this area).
For example, Wang et al. (2018) have built a Although representing clinical algorithms
system that infers groups of orders for par- simply as computer code offers a very direct
ticular situations by examining a data ware- approach to implementing a CDSS, there
house for the individual orders that physicians are obvious challenges that occur when it
have administered historically. The system becomes necessary to refine or update the pro-
then suggests that the groups of orders that it gram’s behavior. Every modification requires
discerns from the database may be reasonable reprogramming the system. It may not always
for clinicians to administer as an ensemble. be obvious how to reprogram the system to
render the desired behavior and making the
24.3.2.3 Hardcoding Clinical necessary changes in one part of the pro-
Algorithms gram may have unintended consequences
From a computational perspective, there is when other parts of the program execute.
nothing simpler than encoding an algorithm Thus, although it may seem appealing simply
directly in a computer program. In health care, to hardcode clinical algorithms, developers
there is generally nothing simpler than defin- of decision support systems generally seek
ing a decision process in terms of a flowchart. more flexible mechanisms to represent clinical
Numerous CDS systems thus have taken knowledge and to reason about it. We discuss
problem-specific flowcharts designed by clini- this matter further in 7 Sect. 24.4.5, and we
cians and encoded them for use by a computer. address future research directions in this area
Although such flowcharts have been useful for in the Conclusion.
the purpose of triaging patients in urgent-care
situations and as a didactic technique used in 24.3.2.4 Learning from Data
journals and books where an overview for a Considerable flexibility is achieved when the
problem’s management has been appropriate, CDSS is largely data-driven. The advent of
computable interactive flowcharts have been extraordinarily fast computers and the abil-
largely rejected by physicians as too simplistic ity to process enormous amounts of data has
or generic for routine use (Grimm et al. 1975), led to an explosion of interest in the use of
810 M. A. Musen et al.
large datasets to learn patterns in the data bility of seven possible explanations for acute
to support clinical decision making (James abdominal pain (appendicitis, diverticulitis,
et al. 2013). The success of such data-driven perforated ulcer, cholecystitis, small-bowel
24 methods has led to considerable excitement obstruction, pancreatitis, and nonspecific
about the use of “big data” in health care. It is abdominal pain). To keep the Bayesian com-
important to appreciate, however, that work- putations manageable, the program made the
ers in biomedical informatics benefited from “naïve” Bayesian assumptions of (1) condi-
such approaches long before computers could tional independence of the findings for the
manage the enormous datasets that they pro- various diagnoses, (2) mutual exclusivity, and
cess today. (3) exhaustiveness of the seven diagnoses (see
7 Chap. 3).
Probabilistic Systems In one system evaluation (de Dombal
Attempts to drive computer-based decision et al. 1972), physicians filled out data sheets
support from relationships inferred directly summarizing clinical and laboratory findings
from data began during the earliest days of for 304 patients who came to the emergency
research in biomedical informatics. A seminal department with abdominal pain of sudden
article in 1959 first introduced Bayes theorem onset. The data from these sheets provided
and also value theory (later known as util- the attributes that were analyzed using Bayes’
ity theory) into health-care decision making rule. Thus, the Bayesian formulation assumed
(Ledley and Lusted 1959). In the 1960s, work- that each patient had one of the seven condi-
ers in the field recognized that they could use tions and it selected the most likely one on the
computers to apply Bayes’ rule to determine basis of the recorded observations.
the posterior probability of diseases based on In contrast to the clinicians’ diagnoses,
observations of patient-specific parameters which were correct in only 65–80% of the 304
(see 7 Chap. 3). Such calculations were based cases (with accuracy depending on the indi-
on the determination of appropriate probabi- vidual clinician’s training and experience), the
listic relationships between findings and dis- program’s diagnoses were correct in 92% of
eases by analyzing available datasets. Large cases. Furthermore, in six of the seven disease
numbers of Bayesian diagnosis programs have categories, the computer was more likely to
been developed in the intervening years, many assign the patients to the correct disease cat-
of which have been shown to be accurate in egory than was the senior clinician in charge
selecting among competing explanations of a of the case.
patient’s disease state. De Dombal’s system began to achieve
Among the most significant of the early widespread use—from emergency depart-
experiments were those of F. T. de Dombal ments in other countries to the British subma-
and his associates (1972) in England, who rine fleet. Surprisingly, however, the system
focused on the diagnosis of acute abdomi- never obtained the same degree of diagnostic
nal pain. De Dombal’s group used a naïve accuracy in other settings that it did where
Bayesian model that assumed that there are it had initially been deployed—even when
no conditional dependencies among findings adjustments were made for differences in prior
(i.e., a model that makes the inappropriate probabilities of disease. There are several rea-
but convenient assumption that the presence sons possible for this discrepancy, which are
of a finding such as upper abdominal pain relevant for all Bayesian CDS systems. The
never affects the likelihood of the presence most likely explanation is that there may be
of a finding such as lower abdominal pain). considerable variation in the way that clini-
Using surgical or pathologic diagnoses as the cians interpret the data that must be entered
gold standard, de Dombal’s group used sensi- into the computer. For example, physicians
tivity, specificity, and disease-prevalence data with different training or from different cul-
for various signs, symptoms, and test results tures may not agree on the criteria for identi-
to calculate, using Bayes’ theorem, the proba- fication of certain patient findings on physical
Clinical Decision-Support Systems
811 24
examination, such as “rebound tenderness.”4 disease (Gorry and Barnett 1968) and has
Another possible explanation is that there are been used in many CDS systems since.
different probabilistic relationships between In recent years, the use of naïve Bayesian
findings and diagnoses in different patient models has been seriously challenged by the
populations. adoption of systems based on Bayesian belief
Although a naïve Bayesian model may networks (see below), which can take advan-
have limitations in accurately modeling a tage of efficient algorithms that overcome
diagnostic problem, a major strength of this the limiting assumptions of naïve Bayesian
approach is computational efficiency. When approaches—albeit at the cost involved in
the findings that bear on a hypothesis are creating a more complex (and more nuanced)
assumed to be conditionally independent, model of the underlying probabilistic rela-
then the order in which the findings are con- tionships. In addition, systems that adopt the
sidered in the Bayesian analysis does not sequential Bayes approach lack the ability to
matter. The computer starts by considering choose the next test to be applied in a man-
a given finding, the prior probability of each ner that can optimize reasoning. Bayesian
possible diagnosis under consideration (gen- systems that use more sophisticated reason-
erally the prevalence of each diagnosis in the ing strategies based on decision analysis (see
population), and the conditional probabilities 7 Chap. 3) can use utility theory to identify
of the finding (or the absence of the find- the test that will provide the most useful infor-
ing) given each diagnosis (or the absence of mation given the current state of reasoning.
the diagnosis)—the sensitivity and specific- Considerations of cost, discomfort to the
ity of the finding (see the discussion of these patient, and the availability of the test can
concepts in 7 Chap. 2). The computer then influence the utility of the particular choice.
applies Bayes’ rule to calculate the posterior
probability of each diagnosis given the value Machine Learning
of the finding. The computer now is poised to The availability of large biomedical datasets
update the probability of each diagnosis given and computers and algorithms that can pro-
the value of a second finding. The prior prob- cess huge amounts of data are revolutioniz-
ability for each diagnosis in this case is not ing health care and the life sciences. Machine
the prevalence of the diagnosis in the popu- learning is everywhere in health care, from
lation, however. Having applied Bayes’ rule interpreting radiographic images to predict-
once, we have more information than we had ing utilization of health care services to iden-
at the start. We can treat the posterior prob- tifying potential adverse drug events. It is not
ability of each diagnosis given the first finding surprising that decision-support based on
as the prior probability of the diagnosis when machine-learning models is becoming increas-
we apply Bayes’ rule a second time. When it is ingly common in clinical settings. Such sys-
time to consider a third finding, the posterior tems have been in place for decades, but now
probability for each diagnosis after processing they are assuming increasing prominence,
the second finding serves as the prior prob- given the new opportunities that the renais-
ability for the next application of Bayes rule. sance in machine learning is offering all of
The process continues until the value of each biomedicine in the era of “big data.”
finding has been considered. This sequential There are a host of supervised learning
Bayes approach was explored as early as the techniques that can determine how data are
1960s for the diagnosis of congenital heart associated with hypotheses (James et al. 2013),
and that consequently can be trained on EHR
data to infer conclusions based on some set of
input data. For example, the decision-support
4 Rebound tenderness is pain that is exacerbated when
the physician presses down on the abdomen and
capabilities of the patient monitoring systems
then suddenly releases, generating a “rebound” discussed in 7 Chap. 21 often apply statisti-
when the abdomen returns to its baseline position. cal methods to the current data stream to
812 M. A. Musen et al.
infer corresponding classifications to inform its final conclusion. There is currently intense
care providers of the patient’s current state. interest in being able to develop new technol-
Regression analysis or more sophisticated ogies powered by machine-learning methods
24 techniques, such as artificial neural networks that in some measure can explain the basis of
and support vector machines, when applied to their reasoning and that can allow users to
routinely collected patient data, have enabled assess the likelihood that particular recom-
investigators to develop venerable decision mendations are appropriate in their clinical
aids such as the APACHE system (Knaus setting (Ribeiro et al. 2016).
et al. 1991; Zimmerman et al. 2006), which
offers prediction models providing prognos- 24.3.2.5 Declarative Representation
tic information regarding patients in the ICU of Knowledge
(see 7 Chap. 21). In simple Bayesian systems and in those
Recent work demonstrates the value of CDSSs based on machine-learning algo-
applying data-driven techniques to a wide rithms, data are provided as input into the sys-
range of clinical problems for which clini- tem, and the output is a classification of the
cians may benefit from decision support, from data—often a diagnosis on which to act. Since
assessing newborns in the ICU (Saria et al. the 1970s, however, workers in biomedical
2010) to development of models that can sug- informatics have pursued the development of
gest which patients might benefit most from decision-support technologies that attempt to
palliative care (Avanti et al. 2018). Scores of encode in a more explicit way how the inputs
start-up companies have emerged in recent to the system relate to the outputs. The goal
years, each hoping that different large datasets is to encode models—models of reasoning,
and specialized machine-learning techniques models of pathophysiology, models of proba-
will lead to new insights about particular clin- bilistic relationships, models of the evidence
ical problems in an effort to enhance decision in support of alternative treatment options,
making. and other relevant models—in a way that
From the beginning, such machine- forms the basis for a system’s computation to
learning approaches have been criticized when derive an appropriate recommendation from
used as the basis for CDS, primarily because the input data. Such systems are often built
of the lack of transparency in how data-driven with the objective that the underlying models
methods reach their conclusions (Shortliffe be examinable and explainable. Often, there is
et al. 1979). Because the associations between a desire to make those models editable, so that
findings and diagnoses are inferred as the the models easily can be updated in light of
system is trained on the data and are not new discoveries and new understanding. The
readily available for inspection, such systems unifying idea in this approach is that CDSSs
cannot offer guidance as to why they might are built with a computer-based representation
reach particular conclusions. This inability to of the knowledge that drives system behavior.
explain the basis for their recommendations is There are many ways to represent knowledge
especially important when the recommenda- in computers (Musen 2014), and each strategy
tions of a system might be overly fitted to the has different strengths and weaknesses.
peculiarities of a dataset drawn from a patient
population different from the one to which Bayesian Belief Networks
the system is being applied (as may have been Much of the early interest in the naïve,
the case with de Dombal’s system). Because sequential Bayesian approach stemmed from
the output of a CDSS based on a machine- a conviction that it simply was impractical
learning algorithm must be accepted at face to construct Bayesian systems in which the
value, there is typically no way to know what assumption of conditional independence was
biases may exist in the data that trained the lifted: There would be too many probabilities
system or what clinically relevant intermedi- to assess when building the system, and the
ary states may have led the system to reach necessary computation could be intractable.
Clinical Decision-Support Systems
813 24
Work on the use of belief networks, however, ally have been prespecified—either as decision
has demonstrated that it actually is realistic trees that enumerate all possible decisions and
to develop more expressive Bayesian systems all possible ramifications of those decisions or
in which conditional dependencies are mod- as belief networks in which explicit decision
eled explicitly—often by taking advantage and utility nodes are added, called influence
of approximate algorithms for concluding diagrams (Shachter 1986).
the posterior probabilities that are compu- We say that belief networks and influence
tationally efficient in most cases. (Belief net- diagrams represent knowledge in a declarative
works are described in detail in 7 Chap. 3.) manner, because a belief network provides
Currently, many modern CDS systems that an inspectable, editable model of the proba-
make recommendations based on probabilis- bilistic relationships that are relevant to the
tic relationships use belief networks as their decision problem under consideration. If a
primary representation of the underlying developer wants to designate a new relation-
clinical situation, and then “solve” the belief ship between two entities in the network, then
network at runtime to calculate the posterior she needs only to augment the model by add-
probabilities of the conditions represented ing a new edge to the graph that encodes the
in the graph. The use of belief networks is given network. The network thus provides a
popular because the formalism makes proba- transparent mechanism to communicate what
bilistic relationships perspicuous, overcomes the system “knows” about the probabilistic
the assumption of conditional independence, relationships among the entities in the appli-
and enables the attendant probabilities to be cation domain, and changing the network
learned from analysis of appropriate data intrinsically changes the behavior of the sys-
sets (for example, EHR data). The approach tem when it reasons about those entities. This
has been demonstrated in numerous diagnos- is different from the case of a hardcoded algo-
tic systems, from belief networks that ascer- rithm or a system based on machine learning,
tain the status of newborns from data in the where the knowledge is not readily inspectable
neonatal ICU (Saria et al. 2010), systems or editable in a direct manner.
for differential diagnosis (Shwe et al. 1991;
Middleton et al. 1991), to belief networks that Rule-Based Approaches
offer interpretations of biomedical image data Although belief networks provide a conve-
(Kahn et al. 1997). nient mechanism to encode knowledge about
Because making most decisions in medi- the world in a declarative fashion, they are
cine requires weighing the costs and benefits only one of several alternative frameworks
of actions that could be taken in diagnosing that may be used to drive CDS based on
or managing a patient’s illness, researchers explicit models of the application area. Since
also have developed tools that draw on the the 1970s, workers in medical AI have been
methods of decision analysis. Decision anal- exploring the use of methods that emphasize
ysis adds to Bayesian reasoning the idea of the modeling of rules that describe conclusions
explicit decisions and of utilities associated that can be reached about the decision prob-
with the various outcomes that could occur in lem and the variables that may predicate those
response to those decisions (see 7 Chap. 3). conclusions. Often called knowledge-based
One class of programs for decision-analysis is systems, these programs reason about the
designed for use by the analysts themselves; clinical situation by examining a collection of
such programs are of little use to the aver- rules of the form, “If some set of conditions
age clinician or patient, however (Pauker and is true, then conclude that something else is
Kassirer 1981). A second class of programs true” (. Fig. 24.2). Although these rules
uses decision-analysis concepts within sys- may be created through machine-learning
tems designed to advise physicians who are approaches, they often are built by manu-
not trained in these techniques. In such pro- ally encoding relationships between clinical
grams, the underlying decision models gener- data and corresponding conclusions that are
814 M. A. Musen et al.
[i.e. WHY is it important to determine the number of cultures which were (system’s rephrasing of the WHY
positive for the organism?] question, and answer)
[2.0] This will help to determine a diagnosis for which the infection should
be treated.
It has already been established that
[2.1] the infection is not one of: primary bacteremia, meningitis
[2.2] for some current culture of the patient it is true that the site
of the culture is blood
Therefore, if
[2.3] there is significant disease associated with this occurrence of
the organism
Then
It is definite (1.0) that the diagnosis for which the infection should
be treated is secondary-bacteremia
[RULE103]
.. Fig. 24.3 Two examples of MYCIN’s explanation expands each [“WHY”] question (enclosed in square
capabilities. User input is shown in boldface capital let- brackets) to ensure that the user is aware of its interpre-
ters and follows the double asterisks. The system tation of the query
thus offered the kind of inspectability and The developers of MYCIN had to con-
editability that we discussed when considering struct their own syntax for encoding rules and
belief-network representations of knowledge. had to program their own inference engine
The developers evaluated MYCIN’s per- to evaluate the rules. However, now there
formance on therapy selection for patients are many open-source and proprietary rule
with blood-borne bacterial infections (Yu engines that provide custom-tailored editors
et al. 1979a), and for those with meningitis for writing rules and inference engines that
(Yu et al. 1979b). In the latter study, MYCIN can execute the rules at runtime. For exam-
gave advice that compared favorably with that ple, JESS is a popular Java-based rule engine
offered by experts in infectious diseases— that can be licensed from Sandia National
results that ushered in enormous excitement Laboratory and that currently is free for
about the potential of rule-based systems to academic use. Drools is an open-source rule
offer high-level clinical advice in real-world engine developed by the JBoss community
situations. that also has had substantial adoption.
816 M. A. Musen et al.
Developers use JESS, Drools, and propri- standard is any notion of the semantics of the
etary rule engines to create CDS systems that data on which the MLMs operate. When an
contain multiple rules that, as with MYCIN, MLM executes, the variables that are used
24 can chain together to generate conclusions in the logic of the rule are bound to values
based on a sequence of inference steps. that derive from the patient database of the
Decision support sometimes requires multiple information system in which the MLMs oper-
rules to execute at runtime, together generat- ate. Arden Syntax specifies that the individual
ing a final recommendation that derives from database queries needed to determine the val-
the consequences of the rules chaining off one ues of the variables should appear within the
another. “curly braces” of variable definitions in the
In most installed information systems, portion of the MLM known as the “data slot”
however, rule-based decision support is much (see . Fig. 24.4). What a developer should
simpler and also more limited. Most deployed include within the curly braces depends on
CDS systems have rules that generally do not the particular schema of the relevant patient
chain together, but that are triggered indi- database and mechanism for performing que-
vidually, each time that either there is a rel- ries. EHR information models and the way in
evant change to the data in a patient database which elements are coded differ from system
that should generate an alert, there is a time- to system. Thus, all system-specific aspects of
related event that should trigger a reminder, MLM integration need to be provided within
or an action is performed (e.g., by a user, when the curly braces. To adapt an MLM for use
the action is established in the workflow as a in a new environment, a programmer needs
triggering event). Each rule examines the state to consider the variables on which the MLM
of the database and generates a correspond- operates, determine whether those variables
ing action, alert, or a freminder that is usually have counterparts in the local patient data-
sent to a particular clinician or to members base, and write an appropriate query that will
of the health-care team. Such rules are of the execute at runtime.
general form: Event – Condition – Action The curly braces problem is compounded
(ON Event, IF Condition, THEN Perform because there may be assumptions regarding
Action) and they are commonly referred to as the semantics of the variables themselves that
ECA rules. may not be obvious to the local implementer:
For example, Arden Syntax became an If the MLM refers to serum potassium, should
international standard for ECA rules known the logic be executed if the original specimen
as Medical Logic Modules (MLMs), endorsed was grossly hemolyzed?5 If a serum potas-
by HL7 and ANSI in 1999 (. Fig. 24.4). sium value is not available in the database, but
Arden Syntax provides a standard mecha- there is a value for a whole-blood potassium,
nism for declaring the variables about whose should the MLM be executed using that value
values the system will perform its reasoning instead?6 If there is no serum potassium value
(values that derive from data in the clinical available for today, but there is one from last
information system); the conditions that, if night, should the logic execute using the most
true, would predicate specific actions; the recent value? Decision rules cannot simply be
actions that should be taken, and the kinds of dropped from one system into another and
events that would invoke or trigger the rule. be shared effortlessly; rather, considerable
The standard was created with the hope that thought, analysis, and computer skill needs to
the informatics community would develop go into writing the appropriate database que-
whole libraries of MLMs, all written in Arden
Syntax, that could operate in any clinical envi-
ronment where an information system could
interpret the standard format. 5 If the red blood cells in a specimen hemolyze (burst),
they release potassium, which can cause an inaccu-
A significant obstacle to the sharing of rate elevation in the measured potassium value.
MLMs, however, is that Arden Syntax is, in 6 The serum is the liquid that is left when the cells are
fact, just a syntax. What is missing from the removed from whole blood.
Clinical Decision-Support Systems
817 24
MAINTENANCE:
Title: Diabetic Foot Exam Reminder;;
Mlmname: Diabetic_Foot_Exam.mlm;;
Arden: Version 2.8;;
Version: 1.00;;
Institution: Intermountain Healthcare ;;
Author: Peter Haug (Peter.Haug@imail.org) ;;
Specialist: Peter Haug (Peter.Haug@imail.org) ;;
Date: 2011-11-28;;
Validation: testing;;
LIBRARY:
Purpose: Alert for Diabetic Foot Exam Yearly;;
Explanation: This MLM will send an alert if the patient is a diabetic (diabetes in problem list or discharge diagnoses)
and Foot Exam is recorded within the last 12 months.;;
Keywords: diabetes; Foot Exam;;
Citations: Boulton AJM, Armstrong DG, Albert SF, Frykberg RG,Richard Hellman, Kirkman MS, Lavery LA,
LeMaster JW, Mills JL, Mueller MJ, Sheehan P, Dane K. Wukich DK. Comprehensive Foot Examination
and Risk Assessment. Diabetes Care. 2008 August; 31(8): 1679–1685.;;
Links: http://en.wikipedia.org/wiki/Diabetic_foot_ulcer;;
KNOWLEDGE:
Type: data_driven;;
Data: Problem_List_Problem := object [Problem, Recorder];
Problem_List := read as Problem_List_Problem {select problem, recorded_by from Problem_List_Table};
Patient_Dx_Object := object [Dx];
Diabetic_Dx := read as Patient_Dx_Object {ICD_Discharge_Diagnoses};
Foot_Examination := object [Recorder, Observation];
Observation := object [Abnormatlity, Location, Size, Units];
Foot_Exam := read as Foot_Examination latest {select Recorder, Observation.Abnormatlity,
Observation.Location, Observation.Size, Observation.Units from PE_Table};
Registration_Event := event { registration of patient };
ICD_for_Diabetes := (250 , 250.0 , 250.1 , 250.2 , 250.3 , 250.4 , 250.5 , 250.6 , 250.7 ,
250.8 , 250.9 ); ;;
Evoke: Registration_Event;;
Logic: if (Diabetic_Dx.Dx is in ICD_for_Diabetes or (exist Problem_List and "Diabetes" is in
Problem_List.Problem)) then Diabetes_Present := true ;
endif;
if (Diabetes_Present and exist Foot_Exam and Foot_Exam occurred not within past 12 months) then
conclude true ;
endif;
conclude false ; ;;
Action: write "Patient is a diabetic with no Diabetic Foot Exam in last 12 months. Please order or perform one.";;
.. Fig. 24.4 This medical logic module (MLM), writ- defines the procedure to follow if the logic slot reaches a
ten in the Arden syntax, prints a warning for health-care positive conclusion. The data slot defines the variables
workers whenever a patient who has diabetes is regis- that are to be used by the MLM; the text between curly
tered for a clinic visit and has not had a documented braces must be translated into queries on the local
foot examination in the past year. The evoke slot defines patient database when the MLM is deployed locally.
a situation that causes the rule to be triggered; the logic (Courtesy of P. J. Haug, Intermountain Healthcare,
slot encodes the decision logic of the rule; the action slot with permission)
818 M. A. Musen et al.
ries that go within the curly braces to make logic. Achieving the right balance will always
such rules operational. remain an elusive target (see also 7 Sect.
In the case of Arden Syntax, developers 24.5.4).
24 write rules to deal with one clinical problem
More General Representations
at a time. There may be one MLM to deal
with the problem of administering a drug of Knowledge
like penicillin to a patient with a history of The variety of approaches for building CDS
penicillin allergy; another MLM may report systems that we have described so far suggests
that a patient has a dangerously low serum that each method has significant strengths
potassium value. Unlike the rules in MYCIN, and weaknesses. Hardcoded branching-logic
MLMs are generally not intended to interact systems can be very easy to build but difficult
with one another or to be chained together to update and maintain when the program
to generate complex inferences. MLMs may code becomes complicated. Belief networks
be coerced to chain together when one MLM and influence diagrams offer precision in
posts to the patient database a value that can probabilistic reasoning when the goal is to
trigger another MLM. This mechanism also make a classification of some clinical phe-
allows one MLM to set up information in the nomena, but they offer limited capabilities if
database that might invoke another MLM in the goal is to generate a plan for medical ther-
the case of some future event, thus enabling apy on the fly or to simulate some biomedi-
the recommendation of actions that unfold cal situation. Rule-based systems can help to
over time, as in the case of many clinical prac- decompose decision problems into tractable
tice guidelines for chronic diseases. Although IF–THEN chunks, but they typically are lim-
this approach allows developers to program ited in what these chunks can express about a
complex problem-solving behavior, the tech- clinical problem. All of these approaches are
nique has the same disadvantages that came constrained in their inability to reason about
to light with chaining rule-based systems such clinical abstractions (e.g., knowing that ele-
as MYCIN: When the rule base grows to a vated serum potassium is a kind of electrolyte
large size, interactions among rules may have abnormality) and by their inability to make
unanticipated side effects. Furthermore, when inferences about situations that entail nuance
rules are added to or deleted from a previously or complexity.
debugged knowledge base, there may be unex- In recent years, developers of CDS sys-
pected system behaviors that emerge as a result tems have become increasingly interested in
(Clancey 1983; Heckerman and Horvitz 1986). the use of more general representations of
For MLMs to work well in practice, more- clinical knowledge to provide more sophisti-
over, the rules need to be tailored to the par- cated capabilities for decision support. These
ticular clinical environment— triggered by interests have paralleled the surge of enthusi-
appropriate workflow events, interacting with asm in the World Wide Web community for
particular kinds of participants, customiz- the notion of developing knowledge graphs
ing logic to account for various business and that encode facts about the world in a lattice
workflow processes, and notifying the user in of nodes that represent the kinds of entities
setting-specific ways. To customize an MLM in the world and links between them that rep-
to account for such considerations requires resent relationships among those entity types
that it become less portable. Much of the (Noy et al. 2019). Unlike belief networks,
effort required to introduce CDS systems which typically contain dozens of nodes,
into the health-care enterprise involves pre- knowledge graphs often contain hundreds
cisely such adaptations. To accelerate porta- or thousands or even millions of nodes—or
bility, MLM developers must seek a balance more. Whereas belief networks store numeric,
between a generic specification of logic that probabilistic relationships among the entities
is widely agreed upon, and site-specific cus- in the graph, knowledge graphs store sym-
tomizations that will facilitate the use of that bolic, logical connections among the enti-
Clinical Decision-Support Systems
819 24
.. Fig. 24.5 Google depicts its knowledge graph artis- expand searches by including synonyms, and to collect
tically as a vast collection of nodes representing entities information about entities related to the subject of a
in the world, linked to other entities in a rich network. search for presentation to the user. (Source: 7 https://
The graph allows Google’s search engine to highlight searchengineland.c om/laymans-visual-guide-googles-
attributes of entities for which users might search, to knowledge-graph-search-api-241935)
ties in addition to various properties of each Many commercial CDS systems are
entity. There currently is no standard way of adopting knowledge-graph technology as a
creating a knowledge graph and there is not component of their software architecture.
even consensus on what features a knowledge Because there is no standard knowledge-
graph should support. Nevertheless, virtu- graph approach, these systems adopt different
ally all well-known e-commerce sites—from knowledge-graph formalisms and perform
Google to Facebook to Bing to eBay—use a different tasks using their graphs. “IBM
graph-based representation of knowledge to Watson,” for example, is a name that encom-
provide key functionality for users hoping to passes a family of CDS systems that have
perform a variety of tasks. been deployed for a range of clinical domains
Google, for example, has a knowledge in recent years. Although these IBM Watson
graph that comprises more than one billion systems have had a variety of capabilities and
classes of entities and instances of those enti- evolving computational architectures, a com-
ties, and more than 70 billion facts about those mon theme has been their use of knowledge
entries (. Fig. 24.5) (Noy et al. 2019). The graphs to implement precise information
graph makes it possible for Google to know retrieval and to assist with natural language
that, when a user searches for some abstrac- understanding. Thus, a knowledge graph can
tion, such as “medications for diabetes,” the allow an IBM Watson system to interpret a
user may be interested in different kinds of user’s natural language query and to locate
medications for diabetes, such as insulin and a document or a fact that is itself stored in
hypoglycemic drugs. Thus, the knowledge a knowledge graph to respond to that query.
graph causes the search engine to bring up Indeed, the use of knowledge graphs in gen-
links to related entities when a user performs eral search engines such as Google and Bing
a search, and the graph helps to disambigu- provides such technologies with capabili-
ate the user’s query when the objective of the ties that make them useful for decision sup-
search may not be clear. port. When reporting search results, both
820 M. A. Musen et al.
Google and Bing display the contents of their that the patient is experiencing). This distinc-
knowledge graphs for the indicated item in tion makes it possible for system builders to
a “knowledge box” at the upper right of the address different elements of the knowledge
24 page. These knowledge boxes often can be needed to be represented in the computer
very effective in offering specific information using tailored approaches and tools (Musen
to address user queries. 1998; deClerq et al. 2004).
The simplest form of knowledge graph The ATHENA-CDS system exemplifies
is one that encodes an enumeration of the this component-oriented approach (Goldstein
kinds of entities in an application area and et al. 2000). ATHENA-CDS is a computer
that places them in an abstraction hierarchy, system that is integrated with the HIS that has
indicating when elements of one entity form been used by the U.S. Department of Veterans
a subclass (or superclass) of another. This Affairs (VA), known as VistA.7 ATHENA-
type of data structure is often referred to as CDS has been installed at several VA medical
an ontology. Ontologies are like controlled centers and has remained in continuous use
terminologies (see 7 Chap. 7), in that they at the Palo Alto VA medical center since the
provide a standard means for referring to the 1990s. ATHENA-CDS offers advice regard-
types of entities that comprise a domain, but ing patients who have certain chronic dis-
they organize those entities using a graph that eases, whose physicians would like to treat
makes explicit the semantics of the relation- those patients in accordance with recognized
ships among the entities. SNOMED CT and evidence- based clinical practice guidelines
the NCI Thesaurus are examples of com- (. Fig. 24.6). ATHENA-CDS draws on sev-
monly used controlled terminologies that are eral electronic knowledge bases, each one con-
represented using knowledge graphs in a man- stituting a knowledge graph that encodes the
ner that also makes them ontologies. knowledge of a particular guideline (e.g., for
Ontologies are important in biomedical hypertension, for hyperlipidemia, for diabetes,
informatics for their explicit representation and so on). Each time that a patient with a rel-
of abstraction relationships, thus facilitating evant diagnosis (e.g., hypertension) is seen in
interpretation of high-throughput experi- the outpatient clinic, ATHENA-CDS takes as
ments, aiding information retrieval and input the corresponding guideline knowledge
natural language processing, and index- base and patient-specific data from the VistA
ing data (Bodenreider and Stevens 2006). EHR and generates as output suggestions to
CDS systems can use ontologies that encode the clinician for treating the patient to ensure
knowledge about different application areas that the treatment is consistent with the care
in a manner that aids reuse of that knowl- that the guideline would recommend. Because
edge in new settings and that makes it easy for the standard documents that define clinical
developers to update the knowledge as their practice guidelines can be long and compli-
understanding of the application area evolves. cated, it is extremely helpful for the computer
Such systems use ontologies (or knowledge to focus the clinician’s attention on precisely
graphs more generally) to encode clinical which interventions should be considered to
knowledge in a manner that may overcome guarantee that the patient’s care is consonant
some of the limitations of the more prevalent with the medical evidence captured by a given
CDS architectures. These systems make an guideline (. Fig. 24.7).
explicit distinction between the static knowl- ATHENA-CDS was engineered using
edge of the clinical domain (e.g., knowledge an approach that separates out static knowl-
of the specifications entailed by a clinical edge about the clinical application area from
practice guideline) and the problem-solving knowledge about problem solving (i.e., knowl-
knowledge needed to apply the static knowl- edge about generating a situation-specific clin-
edge to a particular patient (e.g., the means to
generate specific prescriptions for medications
based on the general guideline recommen- 7 The VA is in the process of replacing VistA with a
dations and the particular clinical situation commercial HIS offered by Cerner.
Clinical Decision-Support Systems
821 24
.. Fig. 24.6 An example of the ATHENA-CDS sys- the patient’s most recent blood pressure, and is offered
tem interface. ATHENA-CDS provides decision-sup- advice about possible alterations in therapy based on
port for the management of hypertension and several the relevant clinical-practice guideline. The screen image
other chronic diseases by using a declarative knowledge depicts only simulated patient data (Courtesy of M. K.
base created as an instantiation on a generic guideline Goldstein, VA Palo Alto Healthcare System)
ontology. In the screen capture, the provider has entered
Lifestyle Modifications
.. Fig. 24.7 Professional societies, health-care prac- documents. Here is a high-level, paper-based flowchart
tices, private foundations, and other organizations are all from the guideline developed by Joint National Commis-
working to capture “best practices” for managing sion on Hypertension. The flowchart summarizes
patients in accordance with scientific evidence in terms detailed recommendations that the guideline document
of clinical practice guidelines. Unfortunately, nearly all specifies in many pages of text
these guidelines are published initially as large paper
developed by the American Heart Association ating the ontology to specify the knowledge
and the American College of Cardiology. required for particular guidelines. Similarly,
The developers built a knowledge base for the different knowledge bases can be mapped
management of chronic pain, based on the to different problem-solving programs, such
guideline promoted by the VA and the DOD that each problem solver automates a differ-
(Trafton et al. 2010). Other knowledge bases ent task associated with guideline-based care
for guideline-based care of diabetes, hyperlip- (therapy planning, eligibility determination,
idemia, and chronic kidney disease were cre- and so on). The ability to “mix and match”
ated in a similar manner. knowledge bases and problem solvers offers
The ontology-driven approach makes it considerable flexibility, and it enables devel-
possible to start with a particular ontology (in opers to reuse elements of previous solutions
this case, one for clinical practice guidelines to address new CDS problems that require
for management of chronic disease) to create different domain knowledge or different
multiple knowledge bases, each one instanti- problem-solving procedures.
Clinical Decision-Support Systems
823 24
.. Fig. 24.8 A small portion of the ontology of clinical butes of the entity known as Mangement_Guideline. The
guidelines used by ATHENA-CDS as entered into the ontology entered into Protégé reflects concepts believed
Protégé ontology-editing system. The hierarchy of to be common to all guidelines, but does not include
entries on the left includes entities that constitute build- specifications for any guidelines in particular. The com-
ing blocks for constructing guideline descriptions. The plete domain model is used to generate automatically a
panel on the right shows the attributes of whatever graphical knowledge-acquisition tool, such as the one
entity is highlighted on the left. Here, goal, eligibility_ shown in . Fig. 24.7
criteria, and clinical_algorithm, for example, are attri-
24
.. Fig. 24.9 A screen from a Protégé-generated The entries into the tool specify the knowledge required
knowledge-acquisition tool for entry of clinical-practice to treat patients in accordance with the guideline for
guidelines. The tool is generated automatically from a chronic hypertension adopted by the Department of
domain ontology, part of which appears in . Fig. 24.7. Veterans Affairs
Newer work focuses on the develop- 2013). FHIR resources are accessed through
ment of the Quality Data Model (QDM) to what is known as a RESTful API, which can
represent clinical data and concepts used in be defined as one that uses HTTP requests to
24 specifying quality measures, and ultimately GET, PUT, POST and DELETE data. FHIR
clinical decision support logic. The QDM is frameworks are built around the concept of
an information model that describes the rela- resources—basic units of interoperability
tionships between patients and clinical con- and modular components that can be assem-
cepts in standardized formats. QDM allows bled into working systems to try to resolve
the definition of a clinical concept used in a clinical, administrative and infrastructural
specification to measure quality of patient problems in health care. This capability has
care via defined data elements, and it provides partially addressed the heterogeneity of
the vocabulary needed to relate concepts to different systems, and it has dramatically
each other. For example, QDM, at the highest improved interoperability between systems.
level of abstraction, defines categories such
as Medication, Procedure, Condition, and so
on. Within a category, the Datatype definition 24.4.2 Adoption of Standard
gives the context of the clinical care process Knowledge-Representation
being assessed such as “medication - active,” Models
or “medication – administered.” Further
Datatype details are defined with Attributes Although Arden Syntax has been an HL7 and
that can further define the Datatype, or define ANSI standard since 1999, only a few ven-
the expected source for the data. For example, dor systems manage their libraries of deci-
a diagnosis of “Diabetes Mellitus Type I” is sion rules using Arden Syntax. Even doing
an active diagnosis datatype, a “Metformin so, of course, the rules still need to be cus-
prescription” is a medication prescribed, and tomized for use with vendor-specific patient
a “Hemoglobin A1c value” is a laboratory databases on a tedious, rule-by-rule basis to
result. Each of these elements can be related to overcome the curly-braces problem described
terms in controlled terminologies via specifi- in 7 Sect. 24.3.2. HL7 has pursued the devel-
cation of corresponding code sets. By relating opment of more interoperable models not
attributes between data elements, the QDM only for data query but also for defining ele-
provides a method to construct complex clini- ments of ECA rules. A succession of efforts
cal representations both for electronic clinical led to a specification for a query language
quality measures and for clinical decision- called GELLO (Sordo et al. 2004), and then
support logic. Because of this ability to share the Health eDecisions rule formalism,8 and
common logic elements (expressions, value more recently the Clinical Query Language,
sets, terminology) between quality measure discussed below.
and CDS specifications, QDM has become
popular for specifying quality measures 24.4.2.1 Standards for Encoding
among a wide variety of measure developers Clinical Guideline Models
and CDS implementers (Pathak et al. 2013; Particularly challenging in the standards-
Hong et al. 2016). development world is creation of a stan-
More recently, the Fast Healthcare dardized, shared model for representing
Interoperability Resource (FHIR) standard clinical practice guidelines in a form suit-
from HL7 has emerged from a multi-ven- able for execution at run time. The Guideline
dor collaboration in the Argonaut Project Element Model (GEM; Shiffman et al. 2000)
(HealthMgt 2015). FHIR is designed specifi-
cally for the Web and provides resources and
foundations based on common methods and 8 HL7 International (2014). Health eDecisions.
technologies used in the Web (XML, JSON, Retrieval February 19, 2020: 7 https://wiki.hl7.org/
HTTP, and OAuth) (Bender and Sartipi index.php?title=Health_eDecisions
Clinical Decision-Support Systems
827 24
is an XML mark-up specification that is or in very specific procedures such as renal
an American National Standards Institute dialysis). ATHENA- CDS thus dispenses
(ANSI) standard, now in its third revision, with offering specific guideline-based recom-
that guideline authors can use to annotate mendations, and instead suggests to the clini-
their narrative guidelines to identify key ele- cians when certain treatment options might be
ments for both quality assessment and execu- “compellingly indicated” or “relatively con-
tion. GEM allows authors to demarcate the traindicated.” In highly regimented settings
text that identifies guideline actions or eligibil- such as the administration of chemotherapy
ity criteria, and thus can serve an intermediary for cancer, however, a CDS system generally
purpose in work to transform a prose guide- would need to be much more “prescriptive” in
line into a computable specification, but the offering recommendations to clinicians.
standard does not itself provide a mechanism
to translate a marked-up guideline document 24.4.2.2 Standards for Encoding
into a structure that a computer can interpret ECA Rules
and execute. Such systems may be criticized As we have noted, the ability to share decision
as not explicitly representing the underlying rules is hindered in the absence of standards
ontology of the guideline components to be to encode both the logic of the rules (how
used at run time. the IF and the THEN components are to be
Other efforts have focused on creation of evaluated) and the clinical data on which the
a guideline ontology, such as the one adopted rules depend. Recently, the Clinical Quality
by ATHENA-CDS (see . Fig. 24.8), Language (CQL) has emerged as an expres-
that can inform the creation of computer- sion language that addresses these challenges.
understandable knowledge bases that are able CQL is intended to characterize both quality-
to capture knowledge about specific guidelines. measure logic and decision-support logic and
Such knowledge bases then could allow a CDS the data that such logic processes (Odigie
system to use knowledge about the guideline, et al. 2019). For example, CQL expressions
data from the EHR, and information concern- define the expected input data model, library
ing patient preferences and available resources resources that may be called, parameters,
to offer situation-specific, guideline-directed value sets, and code sets used in definitions of
advice. As we have noted, an underlying infra- patient conditions, and additional concepts
structure known as EON (Musen et al. 1996) needed to specify and encode the clinical logic
drives the ATHENA-CDS system. Other and data types used in a clinical quality mea-
ontology-based approaches have appeared sure—or in a CDS rule (Jiang et al. 2015).
over the years, including GLIF, GUIDE, Expression languages are commonly used
PRODIGY, Asbru, and GLARE. Peleg and to represent the logic to be used at the presen-
colleagues (2003) compared many of these tation layer of an application, and the meth-
guideline models, and they showed signifi- ods that may be used to interact with standard
cant commonalities among them. Despite the data models. CQL grew out of prior efforts
large degree of agreement, however, work in such as GELLO and the Health eDecisions
this area has not yet led to anything near a framework, from work in the U.S. Office of
standard that is widely adopted. Part of the the National Coordinator’s Clinical Quality
problem is that there is wide variation in the Framework Initiative. It was focused on iden-
structure, granularity, and specificity of exist- tifying, defining, and harmonizing standards
ing clinical practice guidelines, making it diffi- and specifications that promote integration
cult to develop a single comprehensive and yet and reuse between clinical decision support
readily applicable guideline model. Analysis and clinical quality measurement (CQM)
of the use of guidelines also indicates that knowledge-representation formalisms. It
guidelines themselves are rarely “executed” strives to be more clinician friendly and acces-
without considerable adaptation or localiza- sible to subject-matter experts. However, the
tion, except in situations such as protocol- standards used for the electronic representa-
driven care (for example, in clinical trials tion of CDS and CQM artifacts have not been
828 M. A. Musen et al.
modifications that may be site-specific; how vate knowledge sources, who would have over-
the recommendation should be transmitted sight over it, how its knowledge would be peer
(e.g., via electronic mail, popup windows, or reviewed and quality-rated, and how it would
24 sidebar messages); whether the recommenda- be sustained are among the many questions
tion requires acknowledgment by the recipi- that have not yet been answered, but this is
ent; whether it can be overridden; whether an area of intense research and development.
the alert should be escalated to supervising While many health-care organizations con-
clinicians, and so on. Rules that have been tinue to perform this kind of knowledge-cura-
custom tailored in such ways by means of exe- tion work for their own constituencies, several
cutable code naturally are less sharable than initiatives show a clear pathway to becoming
are generic rules. Failure to capture the kinds viable alternatives for knowledge aggregation
of customizations that are needed, however, and dissemination.
makes it time consuming for individual sites
to adapt generic medical recommendations to
their particular requirements or to capitalize 24.5 Future Research
on the experiences of others. What is needed and Development for CDS
is a way to represent useful experience in
terms of SSF combinations that work, with- Workers in biomedical informatics have
out needing to do so at the level of detailed studied problems in assisting with complex
code that is difficult for users to visualize and decision making for more than half a cen-
modify. tury. It seems that it is only now, with the
very recent adoption of HIT on a wide-
spread basis, that the foundations are finally
24.4.5 Sharing of Best-Practice in place for the rapid advance of CDS tech-
Knowledge for CDS nology in clinical settings. Although con-
siderable logistical problems still must be
The methods described above for sharing surmounted as outlined in 7 Sect. 24.4, this
computable biomedical knowledge are now is an exciting time in which to study CDS
gaining momentum (viz., SMART, CQL, and its translation from the laboratory to
FHIR). Historically, it generally has fallen on the point of care.
each health-care organization, user group, or
other entity to undertake its own process of
identifying and managing the best-practice 24.5.1 Standards Harmonization
knowledge it wants to deploy in its CDS sys- for Knowledge Sharing
tems. Most institutions lack the expertise, or and Implementation
the resources, to accomplish this task. Even
having a national or international reposi- Many implementation challenges remain
tory of such knowledge would not preclude for the broad adoption and effective use
the need for customization, but it would of CDS in EHR systems. As mentioned
certainly make it easier for each health-care above, one of the most active areas of cur-
entity to start with a trusted source. Over the rent research focuses on development of
past decade, the U.S. Agency for Healthcare standard approaches to knowledge sharing
Quality and Research has funded efforts to for CDS. Knowledge sharing may take the
create such a public repository, known as form of human-readable artifacts, machine-
CDS Connect (Lomatan et al. 2019). In CDS interpretable artifacts, or executable Web
Connect’s repository, the goal is to archive services (Osheroff et al. 2007; Goldberg et al.
knowledge artifacts represented in the CQL 2014; Dixon et al. 2013; Kawamoto et al.
formalism to promote sharing and interop- 2013). A capability for CDS sharing, as well
erability. Where such a repository should be as CDS functionality itself, would be substan-
hosted, how it might integrate public and pri- tially facilitated by the continued develop-
Clinical Decision-Support Systems
831 24
ment and use of common standards designed Other ways in which context could be used
to serve CDS needs in health care. might be to create specialized views or filters
As noted, several standards currently exist of available data, based on CSA. For exam-
that are aimed at specific areas of CDS and ple, selection of data to be viewed could be
types of CDS artifacts, or that could be lever- based on a user’s role and domain expertise.
aged to benefit CDS. For example, the Clinical Associations (and, ideally, typed relations)
Decision Support Consortium, a large col- among data items could be highlighted—
laborative research and development group for example, among problems, findings, and
supported by the U.S. Agency for Healthcare actions. Such relations could also potentially
Research and Quality, adopted an enhanced be used to anticipate assessment and plan
version of the Continuity of Care Document entries in notes based on data items pres-
(CCD) to serve as the foundation for input ent or being focused on. Such approaches
data for multi-institutional trials of CDS undoubtedly will only be scalable in gener-
technology (Middleton 2009). When taking alized web- services implementations. For
advantage of the most current standards such example, infobutton managers already use
as CQL and FHIR, systems developers tend to context such as user type, app/function being
adopt not only the standards but also arrive at performed, and specific item being evaluated
more common implementation approaches or to identify retrieval keys for external knowl-
patterns, that promote knowledge sharing and edge resources. The idea behind the expansion
reuse, and which may decrease the implemen- of context as a mode for invocation of CDS
tation burden if EHR vendors accommodate more generally is to create a more detailed
the standardized API and interaction models, and continually updated model of context,
as well as the data model expectations of the state, and activity. Such efforts are just getting
standards. underway.
In addition to reducing unwarranted clini- flow is provided by CDS Hooks. CDS Hooks
cal variability in practice, future work needs (Dolin et al. 2018) is a method for enabling
to establish a canonical patient information an event in the host system to determine
24 model with a formal ontology, an event model when an app (such as provided by SMART-
for triggering conditions, an action model for on-FHIR) should be launched programmati-
CDS intervention recommendations, a work- cally, thus complementing the need to launch
flow model for appropriately inserting CDS apps directly by users. The EHR detects an
interventions into the routines of clinical event such as a physician beginning to write
practice, a knowledge-representation schema an order, and it then can invoke an external
with a standard regular expression language, decision-support service. That service can
and, ideally, a measurement standard to assess determine what task is being performed and
CDS performance in use. return information in the form of a “card” (a
phrase or text snippet containing an inference
or assessment, or suggested action) that will
24.5.4 Externalizing CDS be displayed within the EHR. CDS Hooks
may also provide a link to an external app.
Standardization of methods for externalized The main downside of these approaches to
CDS (CDS performed outside of the EHR, CDS is that, although they have formal meth-
in the cloud) is becoming more commonplace ods for launching apps, they basically have no
due to emerging standards described above constraints on what happens within the apps,
(CQL, CQM, QUICK, FHIR). Prior research or support for linking to the intrinsic func-
and development efforts have demonstrated tionality and workflows of a host EHR. Thus,
the feasibility of accessing clinical data from the methods can result in a proliferation of
the electronic health record via standards- SMART apps and CDS Hooks modules in
based (viz., FHIR) and proprietary RESTful an organization’s library, with a number of
APIs, and running executable knowledge arti- them that may be of limited utility. This situ-
facts – both quality measures, and clinical ation could cause significant challenges for an
decision support – on the data in the secure enterprise seeking to manage and update its
cloud environment and returning insights into CDS capabilities on a regular basis.
the app being used in the clinical workflow
(Wright et al. 2015; Dixon et al. 2013). Current
work is focused on representing all of the req- 24.5.5 Usability Research and CDS
uisite knowledge artifacts required to create
a computable practice guideline fully repre- The use of CDS within EHRs, and health IT in
sented in CQL, and on running those knowl- general, have been identified as double-edged
edge artifacts via FHIR data-access methods. swords: technology may provide benefit, but it
Use of services enables considerable flexibil- also may cause considerable harm. Clinician
ity and breadth of kinds of CDS that can be error when using information systems that
made available widely, and ongoing work with may result in untoward outcomes and unin-
FHIR resource specifications is improving the tended consequences (Karsh et al. 2010; Sittig
ability to gather the data needed and deliver and Singh 2009) may be an emerging property
it, and to integrate with clinical workflow pro- that is demonstrated only after system imple-
cesses more smoothly. mentation or widespread use. Medical errors
We have described the many efforts related to use of Health IT are problematic,
underway to standardize data models for not only for clinical and quality of care rea-
information exchange, CQL for knowledge sons, but technically, since they may represent
representation, and functional integration a mismatch between the user’s model of the
with EHR workflows using SMART, and task being performed and the model used in
FHIR. A deeper functional integration model a computation (National Research Council
between externalized services and EHR work- (U.S.) Committee on Engaging the Computer
Clinical Decision-Support Systems
833 24
Science Research Community in Health Care scale data-mining methods to provide CDS
Informatics et al. 2009), a mismatch between for population monitoring, public health sur-
the application’s intended functionality and veillance, and even to offer patient-specific
the resulting action or event (Harrison et al. recommendations based on cohort data when
2007), or a latent health IT-related error yet there is no specific evidence that could other-
to happen (Ash et al. 2007). Excessive alert wise guide therapy. With the increasing avail-
fatigue can undermine the efficacy of clinical ability of data from diverse sources relevant
decision support in CPOE (Isaac et al. 2009; to patient care, large data sets may be created
Strom et al. 2010), and in other IT functions and used for both discovery of previously
(Chused et al. 2008), and result in very high unknown associations, and novel clinical pre-
user override rates (Shah et al. 2006; van der dictions (Frankovich et al. 2011; Longhurst
Sijs et al. 2006; Weingart et al. 2003). Critical et al. 2014). Critical research questions here
research questions need to focus on the poten- will include how to define like cohorts of
tial mismatch between the user’s mental patients, how to structure and frame the index
model or intent and the application design, decision, what methods to use to assess the
use case, or workflow model (Zhang and likelihood of alternate prediction scenarios,
Walji 2011; Patel et al. 2010). Further atten- and how to model and elicit the patient’s
tion needs to be given to basic principles of preferences for each scenario. The Institute
human-factors engineering, such as the use of of Medicine (2011b) articulated a long-term
colors and layout within the application inter- vision for a Learning Health System, in which
face. Additional questions remain regarding clinical and administrative data of all kinds
the ideal design of methods and controls with will begin to inform and enhance clinical
which a user might interact to choose a medi- practice on a national level in a wide variety
cation from a long list, or identify and encode of ways.
patient problems. More advanced research
will enable visualization and decision mak-
ing by matching problems with care plans, 24.6 Conclusions
and facilitation of continuity and coordina-
tion of care based on underlying CDS rules The future of CDS systems inherently
and guideline-based workflows. Especially depends on progress in developing useful
challenging is addressing the need for struc- computer programs and in reducing logisti-
tured data to support clinical decision sup- cal barriers to implementation. Although
port and quality reporting, in a manner that ubiquitous computer-based decision aids that
routinely assist clinicians in most aspects of
is efficient for the end-user. This goal might be
achieved by combining structured documen- their practice are currently the stuff of science
tation during data entry, and natural language fiction, progress has been real and the poten-
processing for data abstraction from the clini-tial remains inspiring. Early predictions about
cal narrative. Most important, however, are the effects that such innovations would have
methods to direct CDS to the right user, at on medical education and practice have not
the right time, in the right workflow, with theyet come to pass (Schwartz 1970), but grow-
ing successes support an optimistic view of
right level of alerting or intervention, and the
right information (Osheroff et al. 2012). what technology will eventually do to assist
practitioners with the processing of complex
data and knowledge. The research challenges
24.5.6 Data-Driven CDS have been identified much more clearly, leg-
islative mandates are creating not only new
As we have discussed, a major area of research financial incentives but also the practical sub-
in informatics concerns methods for deriving strate of increased EHR adoption and con-
knowledge from large data sets using a variety vergence toward data interoperability, and
of techniques. With the adoption of health IT the implications for health-science education
broadly, investigators are drawing on large- are much better understood. The basic com-
834 M. A. Musen et al.
puter literacy of health professional students Adoption. New York: Elsevier This book
can be generally assumed, but health-science offers a comprehensive d iscussion of the
educators now must teach the conceptual nature of medical knowledge and of informa-
24 foundations of biomedical informatics if their tion technology to assist with medical decision
graduates are to be prepared for the techno- making. It provides detailed discussions of the
logically sophisticated world that lies ahead. computational, organizational, and strategic
Equally important, we have learned much challenges in the design, development, and
about what is not likely to happen. The more deployment of CDS systems.
that investigators understand the complex Institute of Medicine. (2011). Digital infrastruc-
and changing nature of medical knowledge, ture for the learning health system: The foun-
the clearer it becomes that trained practitio- dation for continuous improvement in health
ners of biomedical informatics will always be and healthcare. Workshop Series Summary.
required as participants in fostering a coop- Washington, DC: The National Academies
erative relationship between physicians and Press. This monograph summarizes the vision
computer-based decision tools. There is no for a national Learning Health System and
evidence that machine capabilities will ever offers the perspective of a wide range of
equal the human’s ability to deal with unex- thought leaders on the work required to
pected situations, to integrate visual and audi- achieve that vision.
tory data that reveal subtleties of a patient’s Ledley, R., & Lusted, L. (1959). Reasoning foun-
problem, to work with patients to incorpo- dations of medical diagnosis. Science, 130,
rate their values and priorities in care plans, 9–21 This is the paper that started it all. This
or to deal with social and ethical issues that classic article provided the first influential
are often key determinants of proper medical description of how computers might be used
decisions. Considerations such as these will to assist with the process of diagnosis. The
always be important to the humane practice flurry of activity applying Bayesian methods
of medicine, and practitioners will always to computer-assisted diagnosis in the 1960s
have access to information that is meaning- was largely inspired by this provocative paper.
less to the machine. Such observations argue Sittig, D. F., Wright, A., Osheroff, J. A.,
cogently for the discretion of health-care Middelton, B., Teich, J. M., Ash, J. A.,
workers in the proper use of decision-support Campbell, E., & Bates, D. W. (2008). Grand
tools. challenges in clinical decision support. Journal
of Biomedical Informatics, 41(2), 387–392 A
nnSuggested Readings rank-ordered list of some of the principal
Bright, T. J., Wong, A., Dhurjati, R., Bristow, E., challenges for CDS technology development
Bastian, L., Coeytaux, R. R., Samsa, G., and implementation, intended “to educate
Hasselblad, V., Williams, J. W., Musty, M. D., and inspire researchers, developers, funders,
Wing, L., Kendrick, A. S., Sanders, G. D., & and policy makers”.
Lobach, D. (2012). Effect of clinical decision-
support systems: A systematic review. Annals ??Questions for Discussion
of Internal Medicine, 157(1), 29–43 This thor- 1. Some researchers in medical AI have
ough analysis of studies of CDS systems dem- argued that CDS systems should rea-
onstrates that there is good evidence that CDS son from clinical data in a way that
technology can alter clinician behavior in pos- closely matches the reasoning strate-
itive ways, but that evidence that CDS systems gies of the very best clinical experts, as
can improve long-term patient outcomes is such experts are the most clever diag-
still inconclusive. The paper is also useful for nosticians and the most experienced
its comprehensive bibliography. treatment specialists that there are.
Greenes, R. A. (Ed.). (2014). Clinical Decision Other researchers maintain that expert
Support, 2nd Edition: The Road to Broad reasoning, no matter how excellent, is
Clinical Decision-Support Systems
835 24
at some level inherently flawed, and proper use of such systems? Would you
that CDS systems must be driven from be willing to visit a particular physician
the mining of large amounts of solid if you knew in advance that she made
data. How do you account for the decisions regarding treatment that were
apparent difference between these approved by expert colleagues less than
views? Which view is valid? Explain 80% of the time? If you would not, what
your answer. level of performance would you con-
2. Transitioning CDS systems from one sider adequate? Justify your answers.
clinical setting to another has always 4. A large international organization once
been problematic. The Leeds Abdominal proposed to establish an independent
Pain System was installed in several laboratory—much like Underwriters
major clinical settings, and yet the sys- Laboratory in the United States—that
tem never performed as well elsewhere would test CDS systems from all ven-
as it had done in Leeds. The Arden dors and research laboratories, certify-
Syntax, created expressly to facilitate ing the effectiveness and accuracy of
knowledge sharing across institutions, those systems before they might be put
failed to meet this goal to a significant into clinical use. What are the possible
degree. Why kinds of setting-specific dimensions along which such a labora-
factors make it difficult to transplant tory might evaluate decision-support
decision-support technology from one systems? What kinds of problems might
environment to another? What kinds of such a laboratory encounter in attempt-
research might lead to better methods ing to institute such a certification pro-
for knowledge sharing in the future? cess? In the absence of such a
3. In one early evaluation study, the credentialing system for CDS systems,
decision-support system ONCOCIN how can health-care workers feel confi-
provided advice concerning cancer ther- dent in using a clinical decision aid?
apy that was approved by experts in only 5. There is considerable untapped poten-
79% of cases (Hickam et al. 1985). In tial for CDS to help in managing
another study, the HyperCritic CDS sys- patients with multiple complex condi-
tem for the management of hyperten- tions. What are the challenges in dealing
sion offered the same comments that with such patients, and how can CDS be
were generated by a panel of experts in helpful? What are the features required
only 45% of cases (Van der Lei et al. of an algorithm that might integrate rec-
1991). Even today, such system perfor- ommendations from the multiple
mance is fairly typical for computer pro- clinical-practice guidelines that a CDS
grams that suggest patient therapy. Do system could apply?
you believe that this performance is ade- 6. CDS is often implemented poorly,
quate for a computational tool that is resulting in dissatisfaction, if not out-
designed to help physicians to make right annoyance. What are the human
decisions regarding patient care? What factors that need to be taken into con-
problems might CDS systems encounter sideration in implementing CDS effec-
as their developers attempt to make the tively? Discuss issues and approaches
systems more comprehensive in the to enhancing usability. What are situa-
advice that they offer? Why might it be tions in which graphics and visualiza-
more difficult for computer systems to tion might be used? How can CDS be
offer acceptable recommendations for used to enhance rather than to impede
patient therapy than seems to be the workflow? What are strategies to help
case for diagnosis? What safeguards, if avoid unintended consequences of
any, would you suggest to ensure the poorly implemented CDS?
836 M. A. Musen et al.
ers. In I. N. Sarkar et al. (Eds.), Proceedings of ing framework: synthesis of insights from leading
Medinfo 2015. IOS Press. https://doi. subject matter experts.” J Am Med Inform Assoc.
org/10.3233/978-1-61499-564-7-574. 20(1): 199–207.
Institute of Medicine. (2011a). Health IT and patient Knaus, W. A., Wagner, D. P., & Lynn, J. (1991). Short-
24 safety: Building safer systems for better care. term mortality predictions for critically ill hospital-
Washington, DC: The National Academies Press. ized adults: Science and ethics. Science, 254,
Institute of Medicine. (2011b). Digital infrastructure for 389–394.
the learning health system: The foundation for con- Kohn, L., Corrigan, J., & Donaldson, M. (Eds.). (2002).
tinuous improvement in health and healthcare. To err is human: Building a safer health system.
Workshop Series Summary. Washington, DC: The Institute of Medicine. Washington, DC: National
National Academies Press. Academy Press.
Isaac, T., Weissman, J., Davis, R., et al. (2009). Overrides Komaroff, A., Black, W., & Flatley, M. (1974). Protocols
of medication alerts in ambulatory care. Archives of for physician assistants: Management of diabetes
Internal Medicine, 169(3), 305. and hypertension. New England Journal of Medicine,
James, G., Witten, D., Hastie, T., & Tibshirani, R. 290, 370–312.
(2013). An introduction to statistical learning: With Langheier, J. M., & Snyderman, R. (2004). Prospective
applications in R, (Springer Texts in Statistics). medicine: The role for genomics in personalized
New York: Springer. health planning. Pharmacogenomics, 5(1), 1–8.
Jiang, G., Solbrig, H. R., Pathak, J., et al. (2015). https://doi.org/10.1517/phgs.5.1.1.25685.
Developing a standards-based information model Leape, L. L. (1994). Error in medicine. JAMA: The
for representing computable diagnostic criteria: A Journal of the American Medical Association,
feasibility study of the NQF Quality Data Model. 272(23), 1851–1857. https://doi.org/10.1001/
Studies in Health Technology and Informatics, 216, jama.1994.03520230061039.
1097. Ledley, R., & Lusted, L. (1959). Reasoning foundations
Jiang, G., Kiefer, R. C., Sharma, D. K., et al. (2017a). A of medical diagnosis. Science, 130, 9–21.
consensus-based approach for harmonizing the Lindgren, H. (2008). Decision support system support-
OHDSI common data model with HL7 FHIR. ing clinical reasoning process - an evaluation study
Studies in Health Technology and Informatics, 245, in dementia care. Studies in Health Technology and
887–891. Informatics, 136, 315–320.
Jiang, G., Kiefer, R. C., Prud’hommeaux, E., et al. Lomatan, E. A., Meadows, G., Michaels, M., et al.
(2017b). Building interoperable FHIR-based vocab- (2019). To share is human! CDS Connect: A grow-
ulary mapping services: A case study of OHDSI ing national repository of shareable, interoperable
vocabularies and mappings. Studies in Health clinical decision support. Proceedings of the AMIA
Technology and Informatics, 245, 1327. Spring Congress.
Jing, X., Cimino, J. J., & Del Fiol, G. (2015). Usability Longhurst, C. A., Harrington, R. A., & Shah, N. H.
and acceptance of the librarian infobutton tailoring (2014). A ‘green button’ for using aggregate patient
environment: An open access online knowledge cap- data at the point of care. Health Affairs, 33(7),
ture, management, and configuration tool for 1229–1235.
OpenInfobutton. Journal of Medical Internet Mandel, J. C., Kreda, D. A., Mandl, J., et al. (2016).
Research, 17(11), e272. SMART on FHIR: A standards-based, interopera-
Kahn, C. E., Roberts, L. M., Shaffer, K. A., et al. (1997). ble apps platform for electronic health records.
Construction of a Bayesian network for mammo- Journal of the American Medical Informatics
graphic diagnosis of breast cancer. Computers in Association, 23(5), 899–908.
Biology and Medicine, 27, 19–29. Mandl, K. D., & Kohane, I. S. (2012). Escaping the
Karsh, B.-T., Weinger, M. B., Abbott, P. A., et al. (2010). EHR trap — the future of health IT. The New
Health information technology: Fallacies and sober England Journal of Medicine, 366, 2240–2242.
realities. Journal of the American Medical Marcial, L. H., Richardson, J. E., Lasater, B., et al.
Informatics Association, 17(6), 617–623. https://doi. (2018). The imperative for patient-centered clinical
org/10.1136/jamia.2010.005637. decision support. eGEMs, 6(1), 12.
Kaushal, R., Jha, A., Franz, C., et al. (2006). Return on McClellan, M. (2015). Accountable care organizations
investment for a computerized physician order entry and evidence-based payment reform. JAMA: The
system. Journal of the American Medical Informatics Journal of the American Medical Association,
Association, 13(3), 261–266. 313(21), 2128–2123.
Kawamoto, K., Del Fiol, G., Strasberg, H. R., et al. Melnick, E. R., Dyrbye, L. N., Sinsky, C. A., et al.
(2010). Multi-national, multi-institutional analysis (2019). The association between perceived electronic
of clinical decision support data needs to inform health record usability and professional burnout
development of the HL7 Virtual Medical Record among US physicians. Mayo Clinic Proceedings.
standard. AMIA Annual Symposium Proceedings, https://doi.org/10.1016/j.mayocp.2019.09.024.
2010, 377–381. Middleton, B., Shwe, M. A., Heckerman, D. E., et al.
Kawamoto, K., et al. (2013). “Key principles for a (1991). Probabilistic diagnosis using a reformula-
national clinical decision support knowledge shar- tion of the INTERNIST-1/QMR knowledge base.
Clinical Decision-Support Systems
839 24
Part II: Evaluation of diagnostic performance. Overby, C. L., Tarczy-Hornoch, P., Hoath, J. I., et al.
Meth Inf Medicine, 4, 256–267. (2010). Feasibility of incorporating genomic knowl-
Middleton, B., Anderson, J., Fletcher, J., et al. (1998). edge into electronic medical records for pharma-
Use of the WWW for distributed knowledge engi- cogenomic clinical decision support. BMC
neering for an EMR: The KnowledgeBank concept. Bioinformatics, 11(Suppl 9), S10. https://doi.
Proceedings of the AMIA Annual Symposium, org/10.1186/1471-2105-11-S9-S10.
pp. 126–130. Pageler, N. M., Longhurst, C. A., Wood, M., et al.
Middleton, B. (2009). “The clinical decision support (2014). Use of electronic medical record-enhanced
consortium.” Stud Health Technol Inform. 150, checklist and electronic dashboard to decrease
26–30. CLABSIs. Pediatrics, 133(3), e738–e746.
Musen, M. A. (1998). Domain ontologies in software Patel, V. L., Arocha, J. F., & Zhang, J. (2010). Medical
engineering: Use of PROTÉGÉ with the EON reasoning and thinking. In K. J. Holyoak & R. G.
architecture. Methods of Information in Medicine, Morrison (Eds.), Oxford handbook of thinking and
37(4–5), 540–550. reasoning (pp. 736–754). Oxford: Oxford University
Musen, M. A. (2014). Knowledge representation. In Press.
I. N. Sarkar (Ed.), Methods in biomedical informat- Pathak, J., Bailey, K. R., Beebe, C. E., et al. (2013).
ics: A pragmatic approach. Waltham, MA: Academic Normalization and standardization of electronic
Press, Elsevier. health records for high-throughput phenotyping:
Musen, M. A. (2015). The Protégé project: A look back The SHARPn consortium. Journal of the American
and a look forward. AI Matters. Association of Medical Informatics Association, 20(e2), e341–e348.
Computing Machinery Special Interest Group in Pauker, S. G., & Kassirer, J. P. (1981). Clinical decision
Artificial Intelligence, 1(4), 4–12. https://doi. analysis by computer. Archives of Internal Medicine,
org/10.1145/2557001.25757003. 141(13), 1831–1837.
Musen, M. A., Tu, S. W., Das, A. K., et al. (1996). EON: Peleg, M., Tu, S., Bury, J., et al. (2003). Comparing
A component-based approach to automation of computer- interpretable guideline models: A case-
protocol-directed therapy. Journal of the American study approach. Journal of the American Medical
Medical Informatics Association, 3(6), 367–388. Informatics Association, 10, 52–68. https://doi.
National High Blood Pressure Education Program. org/10.1197/jamia.M1135.
(2004). The Seventh Report of the Joint National Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). “Why
Committee on prevention, detection, evaluation, and should I trust you?” Explaining the predictions of
treatment of high blood pressure. National Heart, any classifier. Proceedings of the Conference on
Lung, and Blood Institute, National Insitutes of Knowledge Discovery in Databases, San Francisco,
Health. CA.
National Research Council (US) Committee on Engaging Saria, S., Rajani, A. K., Gould, J., et al. (2010).
the Computer Science Research Community in Integration of early physiological responses predicts
Health Care Informatics, Stead, W. W., & Lin, H. S. later illness severity in preterm infants. Science
(2009). Computational technology for effective health Translational Medicine, 2(48), 48ra65.
care: Immediate steps and strategic directions (The Schedlbauer, A., Prasad, V., Mulvaney, C., et al. (2009).
National Academies Collection: Reports funded by What evidence supports the use of computerized
National Institutes of Health). Washington, DC: alerts and prompts to improve clinicians’ prescrib-
National Academies Press (US). ing behavior? Journal of the American Medical
Noy, N., Gao, Y., Jain, A., et al. (2019). Industry-scale Informatics Association, 16(4), 531–538. https://doi.
knowledge graphs: Lessons and challenges. org/10.1197/jamia.M2910.
Communications of the ACM, 62(8), 36–43. Schnipper, J. L., Linder, J. A., Palchuk, M. B., et al.
Odigie, E., Lacson, R., Raja, A., et al. (2019). Fast (2008). “Smart Forms” in an Electronic Medical
healthcare interoperability resources, clinical qual- Record: Documentation-based clinical decision
ity language, and systematized nomenclature of support to improve disease management. Journal of
medicine-clinical terms in representing clinical evi- the American Medical Informatics Association,
dence logic statements for the use of imaging proce- 15(4), 513–523. https://doi.org/10.1197/jamia.
dures: Descriptive study. JMIR Medical Informatics, M2501.
7(2), e13590. Schwartz, W. B. (1970). Medicine and the computer: The
Osheroff, J. A., Teich, J. M., Middleton, B., et al. (2007). promise and problems of change. New England
A roadmap for national action on clinical decision Journal of Medicine, 283(23), 1257–1264.
support. Journal of the American Medical Shachter, R. D. (1986). Evaluating influence diagrams.
Informatics Association, 14(2), 141–145. https://doi. Operations Research, 34(6), 871–882. https://doi.
org/10.1197/jamia.M2334. org/10.1287/opre.34.6.871.
Osheroff, J. A., Teich, J. A., Levick, D., et al. (2012). Shah, N. R., Seger, A. C., Seger, D. L., et al. (2006).
Improving outcomes with clinical decision support: Improving acceptance of computerized prescribing
An implementer’s guide. Chicago, IL: HIMSS: alerts in ambulatory care. Journal of the American
Healthcare Information Management and Systems Medical Informatics Association : JAMIA, 13(1),
Society. 5–11. https://doi.org/10.1197/jamia.M1868.
840 M. A. Musen et al.
Shahar, Y., & Musen, M. A. (1996). Knowledge-based Ullman-Cullere, M. H., & Mathew, J. P. (2011).
temporal abstractions in clinical domains. Artificial Emerging landscape of genomics in the Electronic
Intelligence in Medicine, 8(3), 267–298. Health Record for personalized medicine. Human
Shiffman, R. N., Karras, B. T., Agrawal, A., et al. (2000). Mutation, 32(5), 512–516. https://doi.org/10.1002/
24 GEM: A proposal for a more comprehensive guide- humu.21456.
line document model using XML. Journal of the van der Lei, J., Musen, M. A., van der Does, E., et al.
American Medical Informatics Association, 7(5), (1991). Comparison of computer-aided and human
488–498. review of general practitioners’ management of
Shortliffe, E. H. (1976). Computer-based medical consul- hypertension. Lancet, 338(8781), 1504–1508.
tations: MYCIN. New York: Elsevier/North van der Sijs, H., Aarts, J., Vulto, A., et al. (2006).
Holland. Overriding of drug safety alerts in computerized
Shortliffe, E. H., Buchanan, B. G., & Feigenbaum, E. physician order entry. Journal of the American
(1979). Knowledge engineering for medical deci- Medical Informatics Association, 13(2), 138–147.
sion making: A review of computer-based clinical https://doi.org/10.1197/jamia.M1809.
decision aids. Proceedings of the IEEE, 67, 1207– Wang, S. J., Middleton, B., Prosser, L. A., et al. (2003).
1224. A cost-benefit analysis of electronic medical records
Shwe, M. A., Middleton, B., Heckerman, D. E., et al. in primary care. The American Journal of Medicine,
(1991). Probabilistic diagnosis using a reformula- 114(5), 397–403.
tion of the INTERNIST-1/QMR knowledge base. Wang, J. K., Hom, J., Balasubramanian, S., et al. (2018).
I. The probabilistic model and inference algorithms. An evaluation of clinical order patterns machine-
Methods of Information in Medicine, 30(4), 241–255. learned from clinician cohorts stratified by patient
Sinsky, C. A., & Privitera, M. R. (2018). Creating a mortality outcomes. Journal of Biomedical
“manageable cockpit” for clinicians: A shared Informatics, 86, 109–119.
responsibility. JAMA Internal Medicine, 178(6), Weingart, S. N., Toth, M., Sands, D. Z., et al. (2003).
741–742. Physicians’ decisions to override computerized drug
Sittig, D., & Singh, H. (2009). Eight rights of safe elec- alerts in primary care. Archives of Internal Medicine,
tronic health record use. JAMA, 302(10), 1111–1113. 163(21), 2625–2631. https://doi.org/10.1001/
Sittig, D. F., & Singh, H. (2011). Defining health infor- archinte.163.21.2625.
mation technology-related errors: New develop- Wright, A., Sittig, D. F., Ash, J. S., et al. (2009). Clinical
ments since to err is human. Archives of Internal decision support capabilities of commercially-
Medicine, 171(14), 1281–1284. https://doi. available clinical information systems. Journal of the
org/10.1001/archinternmed.2011.327. American Medical Informatics Association, 16(5),
Sordo, M., Boxwala, A. A., Ogunyemi, O., et al. (2004). 637–644.
Description and status update on GELLO: A pro- Wright, A., Sittig, D. F., Ash, J. A., et al. (2015). Lessons
posed standardized object-oriented expression lan- learned from implementing service-oriented clinical
guage for clinical decision support. Studies in Health decision support at four sites: A qualitative study.
Technology and Informatics, 107, 164–168. International Journal of Medical Informatics, 84(11),
Strom, B. L., Schinnar, R., Aberra, F., et al. (2010). 901–911.
Unintended effects of a computerized physician Yu, V. L., Fagan, L. M., Wraith, S. M., et al. (1979a).
order entry nearly hard-stop alert to prevent a drug Antimicrobial selection by a computer. A blinded
interaction: A randomized controlled trial. Archives evaluation by infectious disease experts. Journal of
of Internal Medicine, 170(17), 1578–1583. https:// the American Medical Association, 242(12), 1279–
doi.org/10.1001/archinternmed.2010.324. 1282.
Tang, P. C., Fafchamps, D., & Shortliffe, E. H. (1994). Yu, V. L., Buchanan, B. G., Shortliffe, E. H., et al.
Traditional medical records as a source of clinical (1979b). Evaluating the performance of a computer-
data in the outpatient setting. Proceedings of the based consultant. Computer Programs in
Annual Symposium on Computer Applications in Biomedicine, 9(1), 95–102.
Medical Care, pp. 575–579. Zhang, J., & Walji, M. F. (2011). TURF: Toward a uni-
Tang, P. C., Jaworski, M. A., Fellencer, C. A., et al. fied framework of EHR usability. Journal of
(1996). Clinician information activities in diverse Biomedical Informatics, 44(6), 1056–1067.
ambulatory care practices. Proceedings of the AMIA Zimmerman, J. E., Kramer, A. A., McNair, D. S., et al.
Annual Fall Symposium. pp. 12–16. (2006). Acute Physiology and Chronic Health
Trafton, J., Martins, S., Michel, M., et al. (2010). Evaluation (APACHE) IV: Hospital mortality
Evaluation of the acceptability and usability of a assessment for today’s critically ill patients. Critical
decision support system to encourage safe and effec- Care Medicine, 34(5), 1297–1310. https://doi.
tive use of opioid therapy for chronic, noncancer org/10.1097/01.CCM.0000215112.84523.F0.
pain by primary care providers. Pain Medicine,
11(4), 575–585.
841 25
Contents
References – 864
Digital Technology in Health Science Education
843 25
nnLearning Objectives of learning, digital technologies for learning
After reading this chapter, you should know environments, and an overview of learner
the answers to these questions: audiences. We then transition to digital learn-
55 How can computers improve the deliv- ing systems, which includes learning man-
ery of in-class and self-learning, as well agement systems, learning content creation
as in-practice learning? systems, just-in-time learning systems and
55 How can different approaches to learn- performance support, usability and accessi-
ing be implemented using computers? bility, interoperability standards, digital con-
55 How can simulations supplement stu- tent and assessment of learning. Finally, we
dents’ exposure to clinical practice? discuss future directions and challenges for
55 What are the issues to be considered the application of digital technology in health
when developing computer-based edu- science education.
cational programs?
55 What are the significant barriers to
widespread integration of computer- 25.2 Approaches to Teaching
aided instruction into the medical cur- and Learning with Digital
riculum? Technology
The continual rapid increase in health sci-
25.1 Introduction ences knowledge requires a shift in learning
methods both by the health sciences student
The application of digital technology to health as well as the health professional. Decades
science education is a sub-field of biomedical earlier, memorization and recall of facts
informatics. It includes the application of all were a primary, and sufficient, learning goal.
aspects of information and computer technol- Current learning approaches require learning
ogy to the content and delivery of education, the basic concepts and methods of a discipline
as well as to research on the improvement and but, in addition, emphasize the ability to inte-
efficacy of education. Healthcare requires grate knowledge and to solve problems in the
constant learning, with its practice in a multi- context of everyday healthcare.
disciplinary team environment in an informa-
tion-rich world. Digital technology offers new
approaches to learning that: 25.2.1 Theories of Learning
55 increase engagement and retention of
knowledge, Understanding how digital technology can
55 allow personalization of knowledge deliv- support learning in the health sciences begins
ery, with an appreciation of how people learn.
55 enhance collaboration through connectiv- This understanding provides a foundation for
ity, thinking about the learning process and how
55 support learning any time and anywhere, it is shaped by context, purpose, goals, com-
55 make available the increasing volume of plexity, and the diversity of learners.
knowledge, At its most basic, learning involves a
55 support learning of evidence-based clini- change in how a learner perceives and under-
cal practice, and stands some part of their world. The term
55 enhance research through collection and schema is commonly used, in cognitive sci-
analysis of large volumes of learner data. ence, to describe the cognitive frameworks
the people use to organize the information
In this chapter, we first discuss approaches to they have and their beliefs about a particu-
teaching and learning with digital technology. lar concept, activity, or experience. Schemas
That section includes material about theories help us quickly assess a situation and act
844 P. Dev and T. Schleyer
appropriately. When individuals encounter the content, with each other, and in reflection
new information, they try to incorporate it on what and how they are learning. Research
into their existing schema to enhance their has shown that instructional approaches that
understanding. promote active learning consistently outper-
If new information contradicts the learn- form transmission-only approaches at a statis-
er’s existing knowledge or beliefs, the learner tically significant level (Freeman et al. 2014).
25 adjusts in one of two ways. They may accept Flipping classes is a relatively recent strat-
the new information as valid, and modify their egy for encouraging active learning. Instructors
schema accordingly. Alternatively, they judge flip the class when they provide the instruction,
the new information as invalid, unimport- traditionally delivered through in-class lecture,
ant, or irrelevant, and they do not adapt their online and the class time is devoted to active
schema to account for this new information. learning which replaces the majority of tra-
When strong existing schemas keep individu- ditional homework. The homework becomes
als from accounting for new, valid informa- doing what needs to be done to prepare for the
tion it can be very difficult for them to make in-person class. Flipped classes are similar to
changes such as altering unsafe procedures or hybrid or blended classes where the seat time
adopting new safety protocols. To encourage that would be used for lecture is focused on
such knowledge and behavioral change, it is active learning instead. Faculty engage with
important to explicitly address existing mis- learners through case discussions, problem
conceptions and support learner motivation solving, and deep dives to further understand
to change. the content learned outside the classroom.
Changes in schema presume that indi-
viduals actively construct their own meaning
through the interactions they have with infor- 25.2.2 Digital Technologies
mation, other people, and the environment for Learning Environments
around them. This constructivist approach
argues that individuals are not blank slates Much, if not most, of today’s learning con-
and bring their own history and experience to tent is delivered digitally. This is evident in
every learning situation. It is also important the tools that are used within and outside
to recognize that this construction is an inher- of the classroom, for on-the-job training, in
ently social and situated process. Often learn- specialized learning facilities and elsewhere.
ers are learning together in classes, groups, or In the classroom, learning content such as
teams. The conversations and shared expe- PowerPoint slides, Prezi presentations, web-
riences with others provide interpersonal sites, simulations, games and other digital
cognitive and affective context for the learn- media are often projected using digital pro-
ing process, and can shape the direction and jectors or delivered directly to the devices of
scope of knowledge construction. learners, such as smartphones, tablets and
Digital technology can support a variety of laptops. Audience interaction methods, such
approaches to learning. A common approach as Web-based surveys, shared digital white-
is didactic teaching, a one-way transfer of boards or student/audience response systems,
information through lectures, presented online can help learners interact with the learning
via technologies such as recorded digital video. content, the instructor(s) or fellow learners.
This approach has the advantage that new, as The classroom technologies and device
well as remedial, material can be made avail- ecology accessible to teachers and learners
able, with additional links to in-depth con- can extend the learning experience seamlessly
tent. A considerable amount of the available beyond the classroom. For instance, video-
digital content is didactic, though it is usually conferencing allows individuals to attend
enhanced with various activities for active lectures remotely, raise their hand and ask a
learning. Active learning approaches, on the question. Depending on the teaching style,
other hand, focus on engaging learners in the this remote participation may approximate
learning process by having them interact with face-to-face attendance fairly closely or fall
Digital Technology in Health Science Education
845 25
.. Fig. 25.1 A life-size
reconstruction of a digital
human is viewed in a horizontal
computer screen or “digital
table”. With finger gestures,
learners can identify structures,
remove layers of tissue to make
vessels, nerves and bones visible,
or rotate the body. Additional
functions at the edges of the
table allow further viewing and
measurement functions. Clinical
cases, with anatomy
reconstructed from radiologic
images, such as CT and MRI,
allow study of actual cases.
(Courtesy of Anatomage Inc.,
with permission)
far short of it. Collaborative technologies, transition, but only if they take the specific con-
such as instant messaging, group chats or text of learners and their goals into account.
collaborative editing tools, can help bridge We therefore discuss learner audiences and
physical separation, and enable efficient and their particular needs in the following sections.
effective virtual group work.
Such collaborative work can also happen
asynchronously and provide flexibility (within 25.3.1 Undergraduate
limits) to learner schedules. Discussion lists, and Graduate Health Care
messaging boards and collaboration sites Professions Students
enable episodic, time-independent contribu-
tions from and interactions with learners. Basic science programs in medical schools
Some time ago, using advanced technolo- were among the first to implement technology-
gies, such as simulations, virtual reality and supported learning. Visually rich content in
augmented reality, required a trip to a spe- anatomy, neuroanatomy and pathology was
cialized facility, such as a simulation center or much more accessible on the computer than
“virtual reality cave.” However, with devices through the microscope or via the cadaver
such as Oculus or HTC Vive, such experiences dissection room (. Fig. 25.1). Excellent 3D
are now available almost anyplace. learning programs for anatomy are available,
Last, social media such as Facebook, such as Netter 3D Anatomy, Primal Pictures,
Google Hangouts, Twitter, Snapchat and VH Dissector, Anatomage Table,1 and other
Instagram have found important uses in edu- products, providing ever more accurate visual-
cation, ranging from real-time updates on ization of the human in three dimensions. The
projects to sharing experiences on a field-trip. use of microscopes in fields such as histology
and pathology education has virtually disap-
peared. Interestingly, in many schools, the use
25.3 Overview of Learner of cadavers has seen a resurgence, both as an
Audiences important learning tool and as a rite of pas-
sage into the health care profession.
As discussed above, education in medicine,
nursing, pharmacy, dentistry and other health
professions is shifting from a focus on knowl-
1 Netter 3D Anatomy: 7 http://netter3danatomy.
edge acquisition to competency-based edu- com; Primal Pictures: 7 https://primalpictures.com;
cation (Englander et al. 2013). Educational VH Dissector: 7 https://www.toltech.net; Anatom-
technologies are well positioned to support this age: 7 https://www.anatomage.com
846 P. Dev and T. Schleyer
Nursing schools have moved quickly to group of students is given a clinical problem
expand their use of technology in education. and they engage in discussion to develop an
For instance, digitally-enhanced physical understanding of the problem, identify rel-
mannequins for simulation of realistic nurs- evant knowledge, seek required knowledge
ing scenarios are widely-used learning tools. using online and library research, discuss and
Many nursing schools that used to share a challenge each other’s interpretations, and set-
25 simulation center with a medical school have tle on a solution to the problem. PBL is widely
found their own demand high enough to used in undergraduate clinical learning, teach-
require building their own simulation centers. ing students self-directed learning, reflection,
Dental schools often share a part of their and teamwork. An interesting analysis of
curriculum with medical schools, and as a PBL by a student is available at Chang (2016).
result use the same or similar learning con- Computer-simulated patients allow a full
tent. However, they also need specialized ana- range of diseases to be presented and allow
tomical and simulation content for dental and the learner to follow the course of an illness
craniofacial topics. 3D software for dental over any appropriate time period. Faculty can
anatomy is used widely in pre-doctoral den- decide what clinical material must be seen and
tistry. Historically, simulation of dental pro- can use the computer to ensure that this core
cedures was practiced using physical objects curriculum is delivered. Moreover, with the
such as chalk or plastic teeth, a practice that use of an “indestructible patient,” the learner
is still widespread. More recently, high fidelity can take full responsibility for decision mak-
digital simulators have been developed. ing, without concern over harming an actual
For many years, teaching hospitals had patient by making mistakes. These simu-
patients with interesting diagnostic problems lated patients may be fully virtual, or may be
such as “unexplained weight loss” or “fever of computer-enhanced physical mannequins.
unknown origin”. This environment allowed
for thoughtful “visit rounds,” at which the
attending physician could tutor the students 25.3.2 racticing Health Care
P
and house staff, who could then go to the Providers
library to research the subject. A patient
stayed in the hospital for weeks as testing Health Sciences education does not stop after
was pursued and the illness evolved. In the the completion of formal training. The science
modern era of restricted insurance payments, of medicine advances at such a rapid rate that
managed care, and reduced length-of-stay, much of what is taught rapidly becomes obso-
this opportunity for learning in the hospital lete, and it has become obligatory for health
environment, has vanished for most junior professionals to be lifelong learners, both for
students. The typical patient in today’s teach- their own satisfaction and, increasingly, as a
ing hospital is multi-morbid, usually elderly, formal requirement to maintain their profes-
and commonly acutely ill. The emphasis is on sional certification. Therefore, online courses
short stays, with diagnostic problems handled and online certification examinations have
on an outpatient basis, and diseases evolving become increasingly common for maintenance
at home or in chronic care facilities. Thus, the of certification. An additional advantage of
medical student is faced with fewer diagnostic online certification is the automatic tracking
challenges suited to their level of knowledge, of learner performance, and the accompany-
and has little opportunity to see the evolution ing automatic generation of certificates and
of a patient’s illness over time. institutional compliance reports.
One response of medical educators has Health professionals are also required to
been to try to move teaching to the outpatient demonstrate clinical competence through
setting; another has been to use problem-based their performance in simulated clinical sce-
learning and computer-simulated virtual narios. Advanced Trauma Life Support and
patients. Problem-based learning (PBL) is Advanced Cardiac Life Support are some of
a pedagogical approach where each small the areas in which clinical competence is dem-
Digital Technology in Health Science Education
847 25
.. Fig. 25.2 A screenshot of the SimSTAT simulation such as the equipment in the room or the icons at the
used for Maintenance of Certification by the American bottom of the screen. Through these interactions, the
Society of Anesthesiologists. In a simulation of an oper- learner can control the level of sedation, give medica-
ating room, the anesthesiologist ventilates the patient tions, fluids, and gases, and monitor the patient’s physi-
who is lying on the operating table. The simulation is ologic status. The inset screen on the top left is a monitor
viewed by the learner on a computer screen while the for the patient’s vital signs. The inset screen on the top
learner plays the role of the anesthesiologist. The learner right is the display from the anesthesia machine. (Cour-
guides the on-screen anesthesiologist to care for the tesy of CAE Healthcare, Inc., with permission)
unconscious patient by clicking on desired interactions,
onstrated through actual participation in on- delivery are occurring slowly, and technology,
site scenarios. Online simulation of these and particularly simulation and role playing, will
other scenarios can be used as preparation for be part of this change (Zaharias 2018).
testing in a live, crisis scenario. Some special- Meanwhile, healthcare information is
ties, such as anesthesiology, have developed widely available to the general public, with the
sufficiently rich online simulations in their most reliable information being on web sites
specialty, with real time assessment, that they affiliated with federal library resources such
can reduce the requirement for use of live sce- as Medline Plus, academic organizations,
narios (. Fig. 25.2). professional societies or federal health agen-
cies. Online courses, both free and paid, are
also available, many from traditional universi-
ties or other online education organizations.
25.3.3 Patients, Caregivers, Interactive learning applications, however, are
and the Public not widely available to the public though they
are able to provide more engaging learning.2
For those outside the health science profes-
sions, there is no systematic way to learn how
to be a knowledgeable patient or home-based
caregiver, or how to effectively communicate 2 7 h t t p s : / / w w w. n i h . g ov / h e a l t h - i n fo r m at i o n ;
with a health care provider. For the provider 7 https://medlineplus.gov; 7 https://www.cdc.gov;
7 https://www.medscape.com; 7 http://www.diabe-
interacting with the patient, the need to under- tes.org; 7 https://www.heart.org/en/health-topics;
stand the patient accompanies the need to 7 https://my.clevelandclinic.org/health; 7 https://
problem-solve. These changes in health care www.mayoclinic.org
848 P. Dev and T. Schleyer
Improving the patient’s health literacy deployed to all learners without any need for
has become an important approach to pro- special integration programming.
viding higher quality health care. Failure to Collaborative content creation can be
comply with medication regimes, exercise another powerful approach to learning. As
plans, or hospital discharge instructions are opposed to structured content that is created
a major cause of return visits to the hospital by faculty or a similar creator, collaborative,
25 or clinic. Patient and family caregiver edu- learner-generated content is informal and cre-
cation, leading to better understanding of ated on the fly, for instance in a discussion
clinical instructions, could result in a more forum. Structured discussion groups which
effective partnership between the patient and encourage students to provide their thinking
the healthcare provider (Nelson 2016) (see on the discussion questions, and to comment
7 Chap. 11). Online learning resources are on content from other learners, are useful
one approach to alleviating the compliance tools to support learning through active par-
problem. ticipation, argument, and reflection. Video
In the next section, we take a close look conferencing tools, such as Zoom and Skype,
at how digital learning content is created and support real-time discussion and collabora-
delivered. tion, and can be content creation tools if there
is a repository of the content.
When the content to be created is
25.4 Digital Learning Systems sufficiently complicated, requires strict adher-
ence to organizational policy, or requires a
25.4.1 Learning Content range of skill sets and significant expense, it
Management Systems becomes necessary to approach it at the level
of the enterprise. For example, the American
A Learning Content Management System Heart Association has developed courses for
(LCMS) is a software platform that allows cardiac life support that are required training
learning content creators, such as faculty, in the United States and many other coun-
to create, manage, host and track changes in tries.3 These programs are created by teams,
with each person providing a different skill,
digital learning content. Prior to the develop-
such as graphic design, programming, or con-
ment of LCMSs, educational content creators
tent knowledge, rather than by an informal
had to assemble several separate and disparate
collaboration of learners with similar skills.
items to create rich, engaging learning content.
LCMSs, on the other hand, are one-stop plat-
forms that integrate a wide variety of tools for
content creation. They overlap with Learning 25.4.2 Learning Management
Management Systems (LMS, described below) Systems
in that both support content hosting and
delivery. However, LCMSs specialize in tools A Learning Management System (LMS) is a
to create, manage and update content. repository of learning content, an interface
Personally created course content may be for delivering courses and content to learn-
a web site or a blog, created by a faculty mem- ers, and a platform for the course creator or
ber on any of a range of web site or blog cre- administrator to track learner engagement
ation tools. Other personal content creation and performance. From the learner’s view-
tools include tools for quiz item development, point, the LMS provides a single login access
capturing video lectures or demonstrations, to all courses that they may need. Once within
and creation of interactive animations or
games. The only requirement is that the con-
tent files be compatible with the LMS, so that 3 American Heart Association’s ACLS, BLS, and
the content can be uploaded to the LMS and PALS courses: 7 https://elearning.heart.org/courses
Digital Technology in Health Science Education
849 25
a course, the learner can access content, such Examples of performance support or just-
as text, videos, quizzes, games, handouts and in-time learning tools include:
assignments. The LMS may include admin- 55 job aids, such as check lists, quick refer-
istration features for the learner to select ence cards, and handouts;
courses, register or join a wait list, pay for 55 protocols and templates, such as the SBAR
each course, and track their grades. It may (Situation, Background, Assessment,
also include resource sharing and collabo- Recommendation and Request) technique
ration between learners. From the faculty’s for communicating critical information;
point of view, the LMS allows uploading and 55 resource or policy documents;
modification of course content, as well as a 55 video and audio recordings, such as brief
dashboard for viewing the performance of demonstrations of care procedures, par-
learners, groups and classes as a whole. LMS ticularly helpful for home-based care pro-
features may include various statistics, such viders; and
as usage of course components or the perfor- 55 animations, simulations, and learning
mance of the class on individual test items. modules that include brief instructions,
Higher education institutions provide a rel- demonstrations, or explanations.
atively structured curriculum to well-defined
learner populations. Their needs typically
are served by educational LMS applications 25.4.4 Interoperability Standards
such as Blackboard Learn, D2L Brightspace,
Moodle, or Canvas (Dahlstrom et al. 2014). The education enterprise process includes
Medical centers and corporations often use many parts, such as content, curriculum,
LMSs that are more suited to corporate needs. LMSs, learner profiles, assessment, certi-
Typically, the most important requirement of fication, and others. These parts are often
such an LMS is tracking learner compliance supported by different tools and platforms
with required training, and export of reports that need to work together seamlessly. To
for regulatory and accreditation purposes. enable this seamless environment, tools and
There are numerous enterprise-oriented platforms need to be interoperable, without
LMSs. Some common ones are Captivate the need for custom programming for integra-
Prime, TalentLMS, and Totara, but market tion. In this chapter, we discuss some of the
leadership of LMSs changes continually. interoperability standards in education (see
7 Chap. 7).
Historically, the commonly used stan-
dard for such learning object interoperability
25.4.3 Just-in-Time Learning was the Shareable Content Object Reference
Systems and Performance Model (SCORM).4 It defines how a content
Support object (a course) should be packaged and
described, how it should be launched, and how
Learning also occurs outside of formal learn- data should be communicated between the
ing contexts. This “just-in-time learning” hap- LMS and the content package. A SCORM-
pens on demand, for instance when learners compatible content object can be published
need instant information at a critical moment, to and played back from any SCORM-
or something goes wrong and they need to compatible LMS. SCORM can report on
know what to do next. Instant information course completion and time spent. SCORM
can provide immediate help, or performance was last updated in 2009 and, as a standard,
support, but can also be considered a learn- has not kept up with changing technology.
ing opportunity. A particularly powerful However, it is still one of the most widely used
approach is to make these tools accessible standards for learning object interoperability.
when and where they are likely needed, for
instance in online help areas of electronic
medical records. 4 SCORM: 7 https://scorm.com/
850 P. Dev and T. Schleyer
The newer xAPI standard (aka TinCan There has also been an interest in the
API)5 is much more robust in terms of analyt- exchange of education components, specifi-
ics and mobile deployment. The drawback to cally learning content modules, between insti-
xAPI is that it does not integrate with older tutions. A number of content collections have
LMSs and tends to be costly to deploy, often been developed, with the most well-known
requiring professional development a ssistance. being MERLOT. Standardized descriptions
25 With the rapid growth in the use of of learning objects, known as Learning Object
learning management systems in higher and Metadata, were developed, but exchanging
continuing education and workplace learn- and repurposing individual learning objects
ing, it has become critical to have interoper- did not become commonplace. However, an
ability standards that provide integration interesting by-product was a standardized
of web-based learning objects and applica- way to describe items in a content collection,
tions. Learning Tools Interoperability (LTI)6 to manage a library of learning objects and
is a standard developed by the IMS Global to track learner use of those objects. Another
Learning Consortium to provide a means for example of a repository of shared learning
seamless and secure pass-through of student resources is the AAMC’s MedEdPORTAL8,
credentials and grades between the LMS and which is peer- reviewed and contains both
the external application. LTI tools are usually patient cases and clinical scenarios.
web-based applications written in a server-
side language which can serve a variety of
purposes. These include, but are not limited 25.4.5 Usability and Access
to, hosting and serving video with quizzing,
providing access to interactive learning mate- Usability and access are important consid-
rials from textbook publishers, allowing learn- erations in developing educational software
ers to create media, use specialized programs, (see 7 Chap. 5). About 1 in 50 adults have
and collaborate in integrated development some form of vision or hearing disability, and
environments (IDE), whiteboarding and mind need alternate or augmented access to digi-
mapping applications, or videoconferencing. tal learning content. Two standards, Section
Other interoperability standards address 508 of federal law (508)9 and the World Wide
various education services. For example, the Web’s Web Content Accessibility Guidelines
Medbiquitous7 organization has developed (WCAG),10 address use of digital content by
a Curriculum Inventory Data Exchange people with disabilities.
Standard that is being used by the Association Section 508 specifies that digital informa-
of American Medical Colleges to collect and tion provided by or to the government must
collate curriculum data from all its medical be accessible if there is “no undue burden”.
schools and map these curricula to compe- In practice, designing accessible online con-
tency requirements. Other Medbiquitous stan- tent requires use of techniques available in
dards include the Educational Achievement current web design, such as the “Alt Text” tag
Standard to document learner competency for graphics, and indications to make the user
used by numerous medical certification orga- interface more visible or audible. Adherence
nizations, and the Virtual Patient Standard to to Section 508 becomes more difficult in more
enable exchange of virtual patient simulations complicated applications, such as 3D immer-
across institutions. sive environments and dynamic simulations,
topics with a limited investment of time, and with a fixed series of images and lessons on
can demonstrate mastery. the brainstem, or it may allow a student to
Massively Online Open Courses select a brain structure of interest, such as a
(MOOCs) bring free-to-view, world-class tract, and to follow the structure up and down
university courses to a global audience. The the brainstem, moving from image to image,
first major MOOC, Introduction to Artificial observing how the location and size of the
25 Intelligence, launched with an astonishing structure changes.
160,000 subscribers. The structure of the Each of these approaches has advantages
first courses was similar to a typical univer- and disadvantages. Fixed path learning pro-
sity course, with lectures released at the same grams ensure that no important fact or con-
time as they would be taught to an in-person cept is missed, but they do not allow students
class on campus, along with assignments and to deviate from the prescribed course or to
final examinations that needed to be turned explore areas of special interest. Conversely,
in on time. Course support was provided programs that provide an exploratory envi-
by peer support through student discussion ronment and allow students to choose any
groups. Some MOOCs now require fees for actions in any order encourage experimenta-
certification of completion of courses. Private tion and self-discovery. Without structure or
companies have sprung up providing sup- guidance, however, students may waste time
port to students around selected MOOC, an following unproductive paths and may fail to
indication of the ecosystems that develop learn important material, resulting in ineffi-
around interesting technologies. EdX is a cient learning.
MOOC delivery platform launched by the An example is the Tooth Atlas, used in
Massachusetts Institute of Technology and dentistry. Understanding the three-dimen-
Stanford University. Coursera, Udacity sional structure of teeth is important for clini-
and FutureLearn are major private MOOC cal dentistry. The key instructional objective
platforms.12 of the program is to help students learn the
complex external and internal anatomy of
the variety of teeth in three dimensions. The
25.5.2 Interactive Content rich, interactive 3D visualizations show teeth
as they would be visually perceived as well
Teaching programs differ in the degree to as through very high resolution computed
which they impose structure on a teaching ses- tomography scans, radiographs and physical
sion. In general, drill-and-practice systems are cross-sections. The learners can rotate and
highly structured. The system’s responses to section the computed models, and can con-
students’ choices are specified in advance; stu- trol the transparency of each structure so as
dents cannot control the course of an inter- to study inter-relationships. While the visu-
action directly. In contrast, other programs alization is highly exploratory, the embedded
create an exploratory environment in which pedagogy is very structured, consisting of
students can experiment without guidance or detailed textual quizzes with multiple-choice
interference. For example, a neuroanatomy answers.
teaching program may provide a student
25.5.3 Games
12 Mulgan G, Joshi R. Clicks and mortarboards: how
can higher education make the most of digital tech- A learning game places learning content
nology? November 2016. 7 https://media.nesta. within a digital video game. The game play
org.uk/documents/higher_education_and_technol-
experience engages and entertains the learner
ogy_nov16_.pdf
Digital Technology in Health Science Education
853 25
while certain steps in the game instill desired 25.5.4 Cases, Scenarios
content knowledge. In a learning game, the and Problem-Based Learning
learning content is embedded in the game.
Gamification, on the other hand, has ele- The learner is presented with a story that
ments such as a score, achievement badges, includes a clinical problem. The presentation
or a “leader board”, to add excitement to an may be only in text, with text and graphics, in a
otherwise routine learning experience.13 near-realistic three-dimensional environment,
A game has the following components: a or even in an immersive virtual environment,
goal, such as finding the best treatment for a with correspondingly varying levels of inter-
patient; a setting, such as a three-dimensional activity. The learner’s role may be c onstrained
rendition of an emergency department; game such that the learner knows who they represent,
play, such as the information, tests, proce- what resources are available, and what prob-
dures, and medications available; and game lem must be solved. Alternatively, the learner
mechanics, such as accessing game play ele- may be required to investigate the situation
ments by drawing up medication in a syringe (examine the patient), define the problem, find
or selecting a medication dosage from a menu. any supporting resources (what imaging and
Successful resolution of a clinical problem laboratory tests are available or what learning
can give the same satisfaction as an enjoyable resources are at hand) and guide the scenario
game. However, to be considered a game, there to an end goal. As the learner proceeds, the
need to be challenges, such as conquering scenario evolves on the computer, based on the
“enemies” or accomplishing “quests”, evolv- actions taken and the progress of time.
ing clinical problems, or a restricted avail- Prognosis is a case-based program with
ability of supplies and personnel, that must over 500 cases covering most specialties
be overcome, as well as a clear criterion of (. Fig. 25.5). Each case begins with a brief
success. To be a learning game, actions during story of the clinical presentation. The learner
game play should result in learning, either by must choose among available tests, diagnoses
exposure of a nugget of information, by feed- and treatments, and then receives feedback on
back from a mentor embedded in the game, the choices made, as well as the preferred or
or by trying alternative medical approaches to optimal choices. The presentation and inter-
find an effective treatment. activity are very simple, and the cases brief,
Numerous learning games have been devel- but the engagement provided by the clini-
oped for all aspects of healthcare education cal puzzle has made this a popular program
but the evidence for their efficacy is not clear among medical students and residents.
(Gorbanev et al. 2018). Funding for efficacy An approach that combines the benefits
research is limited, and is one reason for the of exploration with the constraint of a linear
paucity of rigorous studies (Reed et al. 2007). path through the material is one that breaks
Game development that has a clear learning the evolving scenario into a series of short
goal, and has been informed by research dur- vignettes. A situation is presented, information
ing the design stage as well as during devel- and action options are available, and a decision
opment of the game play, has been shown must be made. Each decision triggers the pre-
to be both engaging and an effective learn- sentation of the next vignette. This could lead
ing tool (Kato et al. 2008) (. Box 25.1 and to a branching story line but, usually, the next
. Fig. 25.4a and b). vignette presents the result of the best actions
from the previous vignette. A scenario about
a virtual patient could have vignettes that lead
13 A leader board is a list of players with the highest
the learner through the steps of diagnosis, tests,
scores. Players compete to be among the high and the course of treatment. This approach is
scorers. commonly used in computer-based testing of
854 P. Dev and T. Schleyer
a b
.. Fig. 25.4 Screenshots from opening screens of vent cancer from escaping into the blood stream. b In
Re: Mission 2. a In “Nanobot’s Revenge,” players use “Nano Dropbot”, the player continues to kill cancer
an ever-increasing arsenal of powerful chemo attacks cells but also learns to recruit healthy cells in the
to crush the cancerous forces of the Nuclear Tyrant, fight
firing targeted treatments on a growing tumor to pre-
clinical knowledge where assessment of learner novices and experts on a set of immunological
performance would be extremely difficult if the cases. Using neural nets to process the track-
interactions were completely unconstrained. ing data, they detected consistent differences
The ability of the computer to track in the problem solving approach of novices
and store the learner’s actions allows post- compared to experts. In particular, novices
processing and analysis of the tracked data. exhibited considerably more searching and
An interesting analytic capability is one that lack of recognition of relevant information,
compares the performance of novice learn- while experts converged rapidly on a com-
ers and experts to detect features that define mon set of information items. The potential
expert information gathering or action of using such expert patterns of performance
sequences. Stevens et al. (1996) compared the to educate novice learners has not been widely
information gathering and the conclusions of explored in the health sciences.
Digital Technology in Health Science Education
855 25
.. Fig. 25.5 Screenshot from the case-based program, management plan and receives summary feedback on
Prognosis. The learner is presented with a summary of the success or failure of the plan. The case ends with a
the case. The simple graphic presents physical examina- review of the disease background and optimal manage-
tion information in a similar format for each case. The ment, including relevant references. (Courtesy of Medi-
following screen offers options for laboratory, imaging cal Joyworks, LLC, with permission)
and other investigation options. The learner selects the
.. Fig. 25.8 Three-dimensional computer-generated goals may include medical goals such as stabilization of
virtual medical environments are used to present clinical the patient, communication goals such as learning to
scenarios to a team of learners. Each learner controls a point out a problem to senior personnel, or team goals
character in the scenario and, through it, interacts with of leadership and delegation. (Courtesy of Innovation
devices, the patient, and the other characters. Learning in Learning, Inc. with permission)
858 P. Dev and T. Schleyer
being used to construct and deliver team train- Examples of simulations using virtual real-
ing in such virtual medical environments.16 ity have been demonstrated by many universities
and organizations. VR simulations of surgery
(https://ossovr.com) and patient examination
25.5.9 Virtual Reality (https://oxfordmedicalsimulation.com) are in
use in medical and nursing curricula. There are
The use of virtual reality glasses, along with a few studies examining the learning efficacy of
spatially accurate sound and virtual hands, VR simulations (Kyaw et al. 2019). It is likely
creates an immersive experience that surpasses that the realism of virtual reality, its “face valid-
the experience of a three-dimensional world ity”, will result in its use in education even if
as seen on a computer screen on the learner’s rigorous efficacy studies are not available.
desk. The learner feels truly “inside” the expe-
rience. The resulting immediacy is so real that
it manifests itself through physiologic changes 25.5.10 Augmented Reality
such as a speeding of the learner’s pulse and a
total lack of awareness of the actual physical Augmented reality (AR) differs from virtual
surroundings (. Fig. 25.9). reality in that a real world is seen through the
There are two types of virtual reality in use AR glasses while other information is over-
at present. One is reality as represented by a laid on the world by the glasses. Information
completely synthetic three-dimensional envi- can be textual, such as heart rate and blood
ronment, within which the learner navigates pressure data when looking at a physical man-
and acts. The other is represented as a 360° nequin. It can be graphic, such as an open
video of a real environment within which the wound seen overlaid on a person on a bed,
clinical action has been recorded. The video simulating an injured patient. The AR over-
virtual reality is useful for didactic training lay information changes depending on what is
about procedures, such as new surgical meth- being viewed, creating a world of information
ods, where the learner has a front row view on top of the real visible world.
as though they were actually present in the Learning possibilities using AR are end-
operating room. less. A new nurse can walk into an empty
operating room and “see” the contents of
closets and drawers, thus being trained on the
16 7 h t t p s : / / a n e s t h e s i o l o g y. d u ke. e d u / ? p a g e _
location of OR supplies. A nursing student
id=825623, 7 https://healthteamspaces.simtabs.com can see a “pressure sore” evolve on the heel
Digital Technology in Health Science Education
859 25
of a real person because of pressure on the as against the goals of the specific learning
heel in the bed. A medical student can “scroll” module. Without these goals, any assessment
through the electronic medical record as they is without direction and its purpose may be a
talk with a simulated patient. mystery to the learner.17
AR in medical education is in its infancy Assessment maybe formative (guiding
but its applications are expected to be wide- future learning and promoting reflection) or
ranging. summative (making a judgment about compe-
tence or qualification before being allowed to
advance to the next level of study). Assessment
25.5.11 3D Printed Physical Models can also be used by instructors and educa-
tion program designers to identify whether
Three dimensional (3D) printing is a novel the learning content can be improved, and
application of printing. Slice data from an brought closer to the identified learning goals
object, such as a CT image of a bone, is used to (Epstein 2007).
print a layer of solid material, such as p
lastic. Digital technology supports rich assess-
Subsequent image slices are used for printing ment both because of its ability to present
a cumulative stack of plastic slices until the many types of assessment tools, and because
entire object has been printed. The advan- of its ability to track learner actions in great
tage of sequential printing of slices, instead detail. Selected assessment methods are pre-
of carving the shape from a solid block of sented below.
plastic, is that hollow portions of the origi-
nal object can be printed as holes in the slice
data. A second advantage is that very complex 25.6.1 uizzes, Multiple Choice
Q
objects can be printed using this technology. Questions, Flash Cards, Polls
3D models are beginning to be used for
learning. For undergraduate students, cadaver Quizzes test the learner’s knowledge and,
dissection, plastinated specimens, and dried depending on the quiz format, the learner’s
bone have provided the physical specimens. ability to solve problems. Quizzes can be pre-
3D models add a new option. For healthcare sented as questions with single or multiple
practitioners, patient-specific 3D models help correct answers, or may require sorting and
in planning procedures, but they also help matching two sets of items. Digital technol-
in educating the patient about their upcom- ogy simplifies the process of preparing, pre-
ing surgery. In a recent systematic review on senting and scoring quizzes, and can make
surgical planning for congenital heart dis- them engaging and fun by adding imagery,
ease, the authors found that 25% of the stud- animations and game-like success states.
ies showed 3D printed models were useful in Flash cards that present the question on
medical education for healthcare profession- one side of the card and the answer on the
als, patients, caregivers, and medical students other can also be simulated using technol-
(Lau and Sun 2018). ogy. The learner types their answer. Through
simple word or phrase matching, the learner’s
answer is matched to the expected answer, and
25.6 Assessment of Learning scored based on the level of match achieved.
For more complex answers, some level of nat-
Assessment of student learning compares ural language processing is required.
educational performance with educational Polls are particularly useful for an instruc-
goals. Ideally, the content used for assessment tor to assess, in real time, the current status of
resides within a curriculum and an educational
program that has a clear set of educational
goals. Therefore, student learning is measured 17 AAHE, 7 https://www.oxy.edu/sites/default/files/
(assessed) against these overall goals as well assets/AAHE9Principles.pdf
860 P. Dev and T. Schleyer
learner understanding in the classroom. The simulations in that a large number of decision
poll question, and multiple answers, are dis- choices are available to the learner at every
played on the classroom screen. Each learner moment. Thus the simulation is a more real-
selects an answer on a smartphone or on a istic representation of a clinical situation but
polling device. The poll responses are imme- is also correspondingly more difficult to score
diately collated and presented as a bar graph. for assessment (Dillon et al. 2002).
25 If all or most of the learners select the cor-
rect answer, the instructor can assume that the
topic has been understood. If the responses 25.6.4 Intelligent Tutoring,
are distributed over two or more answers, then Guidance, Feedback
the instructor can pause to review the topic
and clear up learner misconceptions or lack Intelligent Tutoring Systems (ITS) differ from
of understanding. other technology-based learning systems in
that they offer individualization of the learn-
ing experience based on the learner’s per-
25.6.2 Branching Scenarios formance while using the system (VanLehn
2011). Because of their architecture, continu-
A branching scenario is a structured approach ous assessment of the learner is essential to
to assessment using simulation. A mini- the operation of ITSs, with guidance provided
scenario or a choice of data resources (such as as needed, placing ITSs in the domain of for-
laboratory tests) is presented at each branch mative assessment. Typically, an ITS is built to
point, and the learner chooses one out of a replicate one-on-one, personalized tutoring.
set of available decisions or responses. One or Modern ITSs use natural language for
more of the decisions may be correct. Based dialog between the learner and the tutor
on each consecutive decision, the learner (. Fig. 25.10). Conversational dialog is more
moves through a branched scenario and likely to uncover learner misconceptions or
achieves a final outcome to the scenario. The gaps in knowledge. As the student responds
learner can be assessed on each decision or to the tutor’s questions, the response is com-
on the final outcome. If the same material is pared to the expected response using statistical
presented in a learning mode, the learner may methods that compare the conceptual similar-
receive feedback about each decision. ity of the two pieces of text. An example of
a conversational tutoring system is Autotutor
(Graesser et al. 2004), which has been used for
25.6.3 Simulations learning domains ranging from physics and
mathematics to training nurses for mass casu-
Simulations for assessment may use stan- alty triage (Shubeck et al. 2016).
dardized patients (actors trained to represent
patients), realistic interactive mannequins,
on-screen simulations, or simulations pre- 25.6.5 Analytics
sented in virtual reality. In all cases, tech-
nology can be used to track learner actions Understanding and improving the process and
and to assess their performance (Ryall et al. outcome of education requires applying met-
2016). In all except standardized patients, this rics into many facets of the educational pro-
tracking is built into the simulation, and can cess. With digital technology, measurement
be extracted and analyzed for performance and resulting data availability has increased
reporting. These more complex, scenario- steadily. At the same time, educational institu-
based simulations, differ from branching tions and businesses are beginning to develop
Digital Technology in Health Science Education
861 25
.. Fig. 25.10 Screenshot
of a conversation with an
intelligent simulated tutor.
The learner, a first
responder, converses with
a tutor who uses natural
language to guide a
student to give detailed
answers using their own
words. (Courtesy of
Innovation in Learning,
Inc., with permission)
Becker 2017).18 The following are examples of 55 Real-time feedback. Significant portions
some of the challenges that we can expect to of pre-clinical training in healthcare
encounter. require use of simulators. With embedded
55 Digital content production and verification sensing and compute capability, and inter-
remains an ongoing challenge. Digital net access, these simulators will become
learning content can range from inexpen- capable of real-time monitoring of learner
25 sive recording and streaming the video of performance. Display of this data on a
a single lecture to very expensive and time- performance dashboard will allow both
consuming creation of a rich and dynamic learner and teacher to observe flaws in per-
simulation of a disease process. Effective formance and for the teacher to provide
curation and distribution of high-quality appropriate guidance at the time it is
content remains a challenge, with some needed. With the addition of intelligent
healthcare faculty being reluctant to use tutors that are built into the simulator, the
content developed at other universities. An learner can receive needed feedback by
emerging trend that may increase use of using the simulator at any time of the day.
existing content is to apply the methods of Similarly, we can challenge ourselves to
the “flipped classroom” to MOOC-based understand and implement intelligent,
online courses. In this method, selected real-time feedback into all aspects of
online content, from MOOCs or other healthcare learning.
sources, is assigned for study at home, and 55 Artificial intelligence and adaptive learning.
group time is used for instructor-led con- Understanding and engaging with each
tent discussion and problem solving. Such student’s success at the course level is the
approaches can combine the best of online domain of the individual faculty, and
content with the strengths of classroom remains a challenge for the application of
teaching by faculty, and it is possible that appropriate digital technology.
such hybrid or blended classes will become Implementation of adaptive learning, that
increasingly common. is, adapting the presentation of learning
55 Learning analytics is a direct outcome of content in response to continuous assess-
digital learning content and learning man- ment of learner performance, will be an
agement systems. An immediate challenge essential next step. We can expect student
is to utilize the available data to improve performance to be tracked, and personal-
the healthcare education process at the ized exercises and assessments presented,
level of the individual, the course, the cur- so that they can understand their strengths
ricula, and the institution, and to match and weaknesses, and can request digital or
this education process to the needs of in-person help they need for success.
today’s healthcare. A more far-reaching 55 Learning Management Systems will see sig-
challenge is to use data as evidence to nificant evolution. Currently they are nar-
understand what works and why. In par- rowly focused on the administration
ticular, we need to understand the best aspects of learning, ensuring that learners
approaches to blended online and face-to- are aware of courses needed for their pro-
face learning, the uses of collaborative and gram, delivering course material with the
project-based learning, and the role of appropriate sequence and timing, and
simulations and experiential learning. checking when these courses have been
completed. In the future, the challenge will
be for LMSs to go beyond administration,
18 Adams Becker S, Cummins, M, Davis A, Freeman and to support student learning. In partic-
A, Hall Giesinger C, and Ananthanarayanan V. ular, for healthcare education, LMSs will
(2017). NMC Horizon Report: 2017 Higher Educa-
tion Edition. Austin, Texas: The New Media Con-
be required to support mastery- and com-
sortium. 7 http://cdn.nmc.org/media/2017-nmc- petency-based education, with detailed
horizon-report-he-EN.pdf tracking of concept and skill acquisition.
Digital Technology in Health Science Education
863 25
The topics presented above are only a small This is an exciting time in digital learning
selection of the interesting challenges in capabilities. It is an even more exciting time to
future healthcare education. Journals such as solve the many challenges ahead so as to move
“Academic Medicine” and “Computers and towards high performance learning systems.
Education”, and websites such as Educause.
edu, periodically discuss these and other chal- nnSuggested Readings
lenges in more depth. Bligh, D. A. (2000). What’s the use of lectures?
San Francisco: Jossey-Bass. In this book, the
author analyzes the best use of the lecture as a
25.8 Conclusion teaching method, and what lectures fail to
teach.
Digital learning is widespread in healthcare Bransford, J. D., Brown, A. L., & Cocking, R. R.
education and has proven to be both effective (2000). How people learn: Brain, mind experi-
and engaging. Digital content ranges from ence and school. Washington, DC: The
basic web pages to highly immersive inter- National Academies Press. This National
active 3D virtual spaces. Digital support of Research Council book synthesizes many find-
learning uses learner tracking to assess per- ings on the science of learning, and explains
formance and to advise and guide the learner how these insights can be applied to actual
towards optimal learning outcomes. Artificial practice in teaching and learning.
intelligence and adaptive learning methods NMC Horizon Report: 2017 Higher Education
have the potential to personalize learning, and Edition. Available at: https://library.educause.
to provide the institution with detailed under- edu/resources/2018/8/2018-nmc-horizon-
standing of how to support each learner as report. This annual report highlights issues,
well as how to align educational approaches trends and technologies in education.
with institutional goals. Simulators, for hands- Talbot, T. B., Sagae, K., John, B., & Rizzo, A. A.
on procedures and for diagnosis and commu- (2012). Sorting out the virtual patient: How to
nication, will provide a learning environment exploit artificial intelligence, game technology
that parallels the student’s progress through and sound educational practices to create
the real clinical environment, providing safe, engaging role-playing simulations.
realistic practice before learners must use International Journal of Gaming and
that knowledge on real patients. Virtual and Computer-Mediated Simulations, 4(3), 1–19.
augmented reality will make these simulated This paper is a good overview and analysis of
environments and tools appear and feel real- the many methods of simulating a patient.
istic, while providing the content scaffolding
and mentoring that may not be available in ??Questions for Discussion
the real clinical environment. Next generation 1. In developing effective educational
learning management systems will provide the interventions, you are often faced with
administrative infrastructure to support the a choice of instructional methods.
student as they progress through their educa- Which of the instructional methods
tional program, deliver personalized learning listed below would best match the
to each student, and offer detailed dashboard instructional goals listed? Please justify
information to both faculty and institutional your selection. Note: For some
administration. instructional goals, more than one
To realize the full potential of digital instructional method might be
learning, there must be significant invest- appropriate.
ment in further development of digital learn- Instructional Goal
ing technology and content. There must also 1. Be able to intubate an unconscious
be effort to develop faculty and staff so that patient
they move beyond simply using technology to 2. Memorize the terminology used in
understanding how to make each technology neuroanatomy
elicit the desired learning outcome.
864 P. Dev and T. Schleyer
3. Recognize the symptoms of a patient check out several sharing sites for curric-
with probable mental illness ular material, such as MedEdPORTAL,
4. Describe the pathophysiological pro- to try to find relevant teaching materials.
cess of hypertension What kind of issues/problems would you
5. Detect histopathologic variations expect in integrating material from those
on histology slides sites in your course?
25 5. As Chief of Quality Improvement at
Instructional Method
the Veterans Administration, you are
1. Case-based scenarios that include
attempting to improve fairly poor out-
video
comes of patients with Post-traumatic
2. Physical simulation with computer-
Stress Disorder (PTSD). You would like
based feedback
to develop a computer-based educa-
3. Didactic material that includes text,
tional tool for patients and caregivers to
images and illustrations
help them cope with PTSD. Most of the
4. Intelligent tutoring system
patients and caregivers are quite unfa-
5. Drill-and-practice program
miliar with the disorder, and health
2. You are developing a software literacy varies widely in your target
application for interprofessional audience. In conceptualizing your
education to teach participants about approach, you are focused on the fol-
managing patients with advanced Type lowing questions:
2 Diabetes. Your audience includes
(a) What are the instructional goals of
students representative of the clinicians
the program?
who are typically involved in the care
(b) What kind of digital content should
of such patients: primary care
you use?
physicians, specialists such as
(c) How do you assess baseline
ophthalmologists and podiatrists, and
knowledge of patients and
nurses. Your software application is
caregivers about PTSD, and how
focused on the care of individual
do you measure knowledge gains
patients, and you have put together a
after they have used the program?
set of clinical case studies as a basis.
How could you leverage current
collaborative technologies to help the
team manage each case in a way that
References
resembles what they would do in real
Adams Becker, S., Cummins, M., Davis, A., Freeman,
life? A., Hall Giesinger, C., & Ananthanarayanan, V.
3. Select a topic in physiology with which (2017). NMC Horizon Report: 2017 Higher
you are familiar, such as arterial blood– Education Edition. Austin: The New Media
gas exchange or filtration in the kidney, Consortium.
Aebersold, M. (2018). Simulation-based learning: No
and construct a representation of the
longer a novelty in undergraduate education. OJIN:
domain in terms of the concepts and The Online Journal of Issues in Nursing, 23(2), 14.
sub-concepts that should be taught for https://doi.org/10.3912/OJIN.Vol23No02PPT39.
that topic. Using this representation, Chang, B. J. (2016). Problem-based learning in medical
design a teaching program using one of school: A student’s perspective. Annals of Medicine
and Surgery, 12, 88–89.
the following methods: (1) a didactic
Dahlstrom, E., Brooks, D. C., & Bichsel, J. (2014). The
approach, (2) a simulation approach, or current ecosystem of learning management systems in
(3) a game approach. higher education: Student, faculty, and IT perspec-
4. You are a junior faculty member at a tives. Research report. ECAR: Louisville. Available
major medical center and you just were from http://www.educause.edu/ecar.
Dillon, G. F., Clyman, S. G., Clauser, B. E., & Margolis,
appointed director for a course on clini-
M. J. (2002). The introduction of computer-based
cal patient examinations. You decide to
Digital Technology in Health Science Education
865 25
case simulations into the United States medical Masiello, I., Kononowicz, A. A., Zary, N., Tudor
licensing examination. Academic Medicine, 77(10 Car, L., (2019). Virtual reality for health professions
suppl), S94–S96. education: Systematic review and meta-analysis by
Englander, R., Cameron, T., Ballard, A. J., Dodge, J., the digital health education collaboration J Medical
Bull, J., & Aschenbrener, C. A. (2013). Toward a Internet Research, 21(1), e12959.
common taxonomy of competency domains for the Lau, I., & Sun, Z. (2018). Three-dimensional printing in
health professions and competencies for physicians. congenital heart disease: A systematic review.
Academic Medicine, 88, 1088–1094. Journal of Medical Radiation Sciences, 65(3), 226–
Epstein, R. M. (2007). Assessment in medical education. 236.
NEJM, 356(4), 387–396. Accessed at https://www. Nelson, R. (2016). Informatics: Empowering ePatients
nejm.org/doi/full/10.1056/NEJMra054784. to drive health care reform – part I. OJIN: The
Freeman, S., Eddy, S. L., McDonough, M., Smith, Online Journal of Issues in Nursing, 21(3), 5. https://
M. K., Okoroafor, N., Jordt, H., & Wenderoth, doi.org/10.3912/OJIN.Vol21No03InfoCol01.
M. P. (2014). Active learning increases student per- Reed, D. A., Cook, D. A., Beckman, T. J., Levine, R. B.,
formance in science, engineering, and mathematics. Kern, D. E., & Wright, S. M. (2007). Association
Proceedings of the National Academy of Sciences between funding and quality of published medical
USA, 111, 8410–8415. education research. Journal of the American Medical
Gaba, D. M., (2004). The future vision of simulation in Association, 298(9), 1002–1009.
health care. BMJ Quality & Safety, 13, i2-i10. Ryall, T., Judd, B. K., & Gordon, C. J. (2016).
Gorbanev, I., Agudelo-Londoño, S., González, R. A., Simulation-based assessments in health professional
et al. (2018). A systematic review of serious games education: A systematic review. Journal of
in medical education: Quality of evidence and peda- Multidisciplinary Healthcare, 9, 69–82. https://doi.
gogical strategy. Medical Education Online, 23(1), org/10.2147/JMDH.S92695.
1438718. Shubeck, K. T., Craig, S. D., & Hu, X. (2016). Live-action
Graesser, A. C., Lu, S., Jackson, G. T., Mitchell, H. H., mass-casualty training and virtual world training: A
Ventura, M., Olney, A., & Louwerse, M. M. (2004). comparison. Proceedings of the Human Factors and
AutoTutor: A tutor with dialogue in natural lan- Ergonomics Society, 60(1), 2103–2107. Available
guage. Behavior Research Methods, Instruments, & from: https://doi.org/10.1177/1541931213601476.
Computers, 36(2), 180–192. Accessed at https://link. Accessed at https://www.researchgate.net/publica-
springer.com/content/pdf/10.3758%2FBF03195563. tion/307944349_Live-action_mass-casualty_train-
pdf, on October 20, 2018. ing_and_virtual_world_training_A_comparison, on
Hayden, J. K., Smiley, R. A., Alexander, M., Carding- October 20, 2018.
Edgren, S., & Jeffries, P. R. (2014). The NCSBN Stevens, R. H., Lopo, A. C., & Wang, P. (1996). Artificial
national simulation study: A longitudinal, random- neural networks can distinguish novice and expert
ized, controlled study replacing clinical hours with strategies during complex problem solving. Journal of
simulation in prelicensure nursing education. the American Medical Informatics Association, 3(2),
Journal of Nursing, 5(Supplement 2), S3–S40. 131–138.
Huang, G., Reynolds, R., & Candler, C. (2007). Virtual Talbot, T. B., Sagae, K., John, B., & Rizzo, A. A. (2012).
patient simulation at U.S. and Canadian Medical Sorting out the virtual patient: How to exploit arti-
Schools. Academic Medicine, 82(5), 446–451. ficial intelligence, game technology and sound edu-
Jeffries, P.R., (2005). A framework for designing, imple- cational practices to create engaging role-playing
menting, and evaluating simulations used as teach- simulations. International Journal of Gaming and
ing strategies in nursing. Nurs Educ Perspect. Computer-Mediated Simulations, 4(3), 1–19.
26(2):96–103. VanLehn, K. (2011). The relative effectiveness of human
Kato, P. M., Cole, S. W., Bradlyn, A. S., & Pollock, B. H. tutoring, intelligent tutoring systems, and other
(2008). A video game improves behavioral outcomes tutoring systems. Educational Psychologist, 46(4),
in adolescents and young adults with cancer: A ran- 197–221.
domized trial. Pediatrics, 122(2), e305–e317. Zaharias, G. (2018). What is narrative-based medicine?:
Kyaw, B. M., Saxena, N., Posadzki, P., Vseteckova, J., Narrative-based medicine 1. Canadian Family
Nikolaou, C. K., George, P. P., Divakar, U., Physician, 64(3), 176–180.
867 26
Translational Bioinformatics
Jessica D. Tenenbaum, Nigam H. Shah, and Russ B. Altman
Contents
References – 901
Translational Bioinformatics
869 26
nnLearning Objectives T1 Translational Barrier
Bench Bedside
After reading this chapter, you should know
Data
the answers to these questions:
55 How does translational bioinformatics
Health informatics
differ from the more general field of
Bioinformatics
Translational Bioinformatics
bioinformatics? Storage
55 What do T1 and T2 refer to in the con- Analysis
text of translational research? Interpretation
55 What is a biomarker, and why is it
important in medicine?
55 What is precision medicine, and how Knowledge
does it differ from traditional medical
.. Fig. 26.1 TBI bridges the gap between bioinformat-
practice?
ics on the “bench” side of the T1 barrier and health
55 What is the difference between pharma- informatics on the “bedside” end of the spectrum. Novel
cokinetics and pharmacodynamics? methods for storage, analysis, and interpretation span
55 What is the difference between statisti- the spectrum from data to knowledge. (Adapted from
cal significance and clinical signifi- Sarkar et al. (2011). Creative Commons CC BY-ND
License)
cance?
55 How are genomic data being used today
in research, clinical care, and consumer ter, we describe key concepts and methods in
health? TBI, summarize TBI data-related resources,
55 What are some ethical issues surround- and introduce the concept of precision medi-
ing genomic medicine? cine, which is enabled by TBI and covered in
55 How are ontologies useful in transla- greater depth in 7 Chap. 28. We conclude
tional bioinformatics? with a discussion of challenges and future
directions for the field.
this would have on the field of medical infor- transforming life science research in the
matics was recognized immediately, reflected twenty-first century. Biomedical informat-
in the theme for the 2002 AMIA3 Annual ics plays a strong role across all three of the
Symposium: “Bio*Medical Informatics: One major Roadmap themes: New Pathways to
Discipline” (Tarczy-Hornoch 2007). Discovery, Research Teams of the Future,
and Reengineering the Clinical Research
Enterprise (Zerhouni 2006). As part of this
26.2.2 What Is Translational Roadmap, the NIH embarked on a major ini-
Research? tiative to break down translational barriers
through a new funding mechanism known as
In the early 2000s, there was growing the Clinical and Translational Science Award
acknowledgement that the population at (CTSA). This award was aimed at major aca-
large, and patients in particular, were not demic medical centers and their partners with
reaping the full benefits of the considerable the goal of improving translational research
amount of research money being devoted to to get treatments to patients quickly.
scientific discovery. It was recognized that It was in this context, with newfound
researchers do not do a good job translating attention to translational research, that Butte
their discoveries “from bench to bedside,” and Chen coined the term “translational
i.e. bridging biological discoveries in the bioinformatics” at the AMIA annual sym-
lab and clinical application of the findings posium in 2006 in a paper entitled “Finding
(Lenfant 2003). Two significant roadblocks Disease- Related Genomic Experiments
were initially identified (. Fig. 26.2)—one Within an International Repository: First
in translating discoveries into clinical care Steps in Translational Bioinformatics” (Butte
guidelines (dubbed T1 translation), and the and Chen 2006). AMIA added TBI as one
other in translating clinical care guidelines of its key supported domains and in 2008
into actual practice (T2 translation) (Sung held its first annual Summit on Translational
et al. 2003). Additional “T’s” have been Bioinformatics. Later that year, the Journal of
devised more recently, and definitions refined the American Medical Informatics Association
to be more granular, e.g. splitting out early (JAMIA) published a perspective on TBI’s
and late phase trials and knowledge dissemi- “Coming of Age” that enumerated several
nation vs. knowledge application (Waldman reasons why the time was right for TBI to
and Terzic 2010). In 2004, the National come into its own as a field (Butte 2008). In
Institutes of Health (NIH) launched the 2009, the editors of the Journal of Biomedical
Roadmap for Medical Research, aimed at Informatics published an explicit Editorial to
announce a change in the journal’s editorial
policy to “focus its bioinformatics attention
3 AMIA is the American Medical Informatics Asso- on innovations in the area of translational bio-
ciation, Bethesda, MD: 7 http://www.amia.org informatics” (Shortliffe et al. 2009).
872 J. D. Tenenbaum et al.
.. Fig. 26.3 A
1.0
comparison between two
Receiver Operator
Characteristic (ROC)
curves. The area under
0.8
the curve (AUC) or C
statistic is higher for Test
2 (gray) than for Test 1
(diagonal lines) to a
statistically significant
26
0.6
degree, but this increased Sensitivity
accuracy does not
necessarily imply clinical
relevance
0.4
Test 1
0.2
Test 2
0.0
Genes
measures absolute risk. That is, do the risk
values predicted by the test reflect the actual
risk observed in the population. Calibration
may differ across the population at different
levels of predicted risk, which may in turn
affect the test’s utility.
et al. 2009, 2014). Resources for genomic Sequence Project (PGRNSeq) (Bush et al.
data include: 2016).
thereby leading to several proteins, the RNA aggregators of this information. These create
transcript may be degraded before it is trans- a remarkably powerful network of associa-
lated, and the proteins may be modified after tions that can be used creatively to make new
they are created. There are many resources associations. For example, . Fig. 26.5 shows
on the web for gene information, and many the links on PharmGKB (Pharmacogenomics
26
.. Fig. 26.5 PharmGKB gene pages are organized by about human genes. (Copyright PharmGKB, used with
tabs and the “Downloads/LinkOuts” tab shown here permission from PharmGKB and Stanford University)
has links to many other sites with valuable information
Translational Bioinformatics
881 26
Knowledge Base) for the drug VKORC1 and 2004). Of course, other disease terminologies
includes links to: such as SNOMED are also useful (Spackman
55 Entrez Gene: summarizes the sequence, et al. 1997). For side effects, there are special-
variations, homologs across species ized terminologies, including the MedDRA
55 OMIM: provides information about rare terminology used by the FDA in adverse
genetic diseases involving this gene event reporting (MedDRA replaced a pre-
55 UniProt: provides mapping information to vious terminology called COSTART), and
relate this gene to its protein products the WHOART (World Health Organization
55 GeneCards: provides aggregated informa- Adverse Reactions Terminology) diction-
tion about function, tissue localization, ary for adverse reactions (Brown et al. 1999;
expression levels, literature references, and Alecu et al. 2006). The SIDER database
more (Kuhn et al. 2010) provides information
mined from drug package inserts about drug
indications and side effects. The Anatomic
zz Drugs and small molecules Therapeutic Chemical classification (ATC)
RxNorm is a terminology standard for speci- from the World Health Organization provides
fying drugs (Parrish et al. 2006). DrugBank a high level classification of drugs organized
provides information about drugs, their tar- hierarchically by the anatomical location of
gets, pharmacology, uses, and many other target, the therapeutic category, the pharma-
characteristics (Knox et al. 2011). There cological subgroup, chemical subgroup and
are only around 2000 approved drugs in the precise chemical substance (Miller and Britt
United States, and so this list is relatively short. 1995).
The list of small molecules that are not drugs
is much larger and includes the contents of zz Pharmacological properties of drugs
PubChem (Wang et al. 2009, 2010), an NIH- There are resources on the web that pro-
built resource with basic information about vide molecular level assay data related to
the structure, function and literature on small small molecules, including many drugs. The
molecules. The Zinc Database (Irwin and ChEMBLdb resource provides the ability
Shoichet 2005) lists 13 million commercially to find targets, binding affinities, inhibition
available compounds that can be purchased concentrations and information about other
for use in research. Much drug information drug-oriented assays (Overington 2009).
is contained within the “package insert” that BindingDB also provides binding affinities for
is included in most drug packaging. This is small molecules and proteins (Liu et al. 2007).
information created by the drug company, but In addition, RxNorm includes an RxClass
approved by the FDA. The FDA makes these API for accessing drug classes and members
available on a drug information site called of a given class (Bahr et al. 2017).
DailyMed. For patients, the National Library
of Medicine’s MedlinePlus resource provides zz Population-based data on drug effects
basic drug information as well. The FDA maintains information about all
reports of adverse events in the FDA Adverse
zz Drug indications and drug effects Event Reporting System (FDA AERS). These
Drugs are used to treat particular diseases, reports include demographic information,
and so controlled terminologies of drug indi- indications for treatment, drugs administered,
cations and drug effects are useful for com- side effects experienced and a summary of
putational efforts. At the organism level, clinical outcomes. They are freely available at
indications and effects are often diseases (dia- the FDA website. The Canadian equivalent
betes is an indication) or side effects (hyper- system, also making data freely available, is
glycemia is a side effect). The UMLS and available through the Health Canada web-
MeSH terminologies are often used to charac- site. These data are very noisy and have many
terize such disease phenotypes (Bodenreider confounding variables, but nonetheless can
882 J. D. Tenenbaum et al.
be useful for discovering “signals” suggesting the chance that an association is spurious. If
dangerous side effects or drug-drug interac- the result is real, then the SNP may be used to
tions (Tatonetti et al. 2011b). identify nearby genes in the region that may
be important for the drug response. For exam-
ple, Shuldiner and colleagues were interested
26.5.2 TBI Applications in the ability of the drug clopidogrel to pro-
in Pharmacogenomics tect patients from cardiovascular events. They
found that a polymorphism RS12777823
26 The network of data described above is a rich was associated with a high likelihood of hav-
potential source of hypotheses about how ing a cardiovascular event. They noted that
genes combine to create drug response, as well this SNP was very close to the metaboliz-
as for predicting the particular consequences ing enzyme CYP2C19, and in particular the
of genetic variation. This is still a new field, “risk” allele for this SNP co-occurred with the
and there remain many opportunities for CYP2C19*2 variant. Thus, they showed that
innovative use of these data. We highlight a CYP2C19 is important for the desired effect
few here to illustrate how integration of data of clopidogrel, and found a variation of this
can lead to novel discoveries. gene that predicted poor response to the drug
in affected patients (Shuldiner et al. 2009).
zz GWAS to Discover Drug Response Genes
The most straightforward way to associ- zz Mining the FDA AERS to Find Drug-Drug
ate genes with drug response is to perform Interactions
a genome-wide association study (GWAS) The FDA Adverse Events database associ-
in which two groups are compared. (See ates multiple drugs with multiple diseases as
7 Chap. 28 for more on GWAS.) One group indications as well as side effects. This data-
(cases) has a drug response of interest (e.g., base shows promise as a way to find new
an adverse event in response to the drug or associations between single drugs and their
a particularly good response to it) and the side effects, as well as multiple drugs and
other group (controls) does not have the drug their side effects. As mentioned above, the
response of interest. It is critical to ensure SIDER database is a “top down” database of
that the phenotype or response is carefully side effects derived from the package label of
defined and measured. With each group, drugs. Another approach to getting good lists
DNA is collected and typically 500,000 or of side effects is a more data-driven approach.
1000,000 SNPs (single nucleotide polymor- One way to do this is to look for patterns of
phisms) are measured using microarray tech- side effects associated with certain types of
nology. Then, for each SNP, an association is drugs using machine learning. For example,
measured between the genotypes in cases and one may analyze the side effects of drugs that
controls and the response of interest using a alter glucose in order to create a signature of
simple statistical test such as the chi-squared the “typical” profile of side effects associated
test. The SNPs that are most highly associated with a glucose-altering drug. Then, one can
may represent regions of the genome that are search a database of side effects (such as FDA
involved in the response. These must be care- AERS) for other drugs that match this profile.
fully vetted statistically, as there are many This was done by Tatonetti et al., who cre-
potential confounding variables. For example, ated a profile for glucose-altering drugs and
it is important that the cases and controls are found a set of 10 side effects either enriched
drawn from populations with similar ethnic or deficient (compared to background) in
origin, and that the significance remains after these drugs: hyperglycemia, diarrhea, hypo-
correcting for multiple testing. When one tests glycemia, and pain were higher than others,
500,000 or 1000,000 hypotheses, adjustments and paresthesia, nausea, pyrexia, abdominal
such as Bonferoni correction (see 7 Chap. 24) pain, and anorexia were less likely than o thers
must be made in order to take into account (Tatonetti et al. 2011a). Using this pattern,
Translational Bioinformatics
883 26
more than 93% of drugs that are known to olizes codeine” and “CYP2D metabolizes
alter glucose could be recovered. More inter- metoprolol” might be combined to infer that
estingly, however, this pattern could be applied codeine and metoprolol have a potential drug-
to patients on pairs of drugs to search for drug interaction. There are a large number of
pairs that altered glucose. A highly correlated similar inferences that could be drawn about
combination was the antidepressant parox- the relationships between genes, drugs and
etine and the cholesterol medication pravas- diseases given a high quality database of pair-
tatin (Tatonetti et al. 2011a). In subsequent wise interactions drawn from the published
validation in three independent EHR systems, literature. Of course, some pairwise interac-
large increases in glucose were observed in tions may be incorrect, and so evidence for
patients on these two drugs, and in mouse interactions should be combined from sev-
studies of these two drugs, glucose was sub- eral sources (including EMR validation, for
stantially increased. Thus, the adverse event example) and once predictions are made, they
patterns could be used to create patterns and should be embraced only with skepticism.
detect new signals, not specifically reported A comprehensive text-based search for rela-
in the database, but implied by the pattern of tionships between genes, drugs and diseases
other side effects observed. over all PubMED abstracts yielded a publicly
available database of more than 2 million
zz Mining the Literature to Build a Database high quality associations (Percha and Altman
of Gene-Drug Associations 2015, 2018).
Biomedical text can also be an important
source of high quality information about the zz Using Drug-Target Interactions to Predict
relationships among genes, drugs, and diseases New Ones
(Garten et al. 2010). High fidelity natural lan- Another way to find new uses for old drugs is
guage processing techniques (7 Chap. 7) can to predict interactions between drugs and new
be used to extract information about gene- potential targets. Many drugs are designed
drug interactions. In some cases, the associa- to interact with a single target based on a
tion between genes and drugs can be inferred detailed understanding of disease pathology.
simply by their co-occurrence in sentences Once the drugs are administered, however,
(Garten and Altman 2009). In these cases, they may not bind only the original target,
however, there can be many false positives but they may unexpectedly have effects based
due to sentences in which genes and drugs on their binding to other targets. Most com-
are mentioned, but are not actually interact- monly, these “off target” effects are consid-
ing. A more precise method is based on care- ered side effects and are avoided. In some
ful parsing of sentences to find subjects and cases, however, the “off target” effect may be
objects that are genes and drugs, and which beneficial in the setting of some other disease.
are related by verbs that connect them (e.g. Thus, both for explaining the molecular basis
“CYP2D metabolizes codeine” has CYP2D of side effects and for finding new molecu-
as the subject, codeine as the object, and the lar evidence for beneficial novel effects, it
verb “metabolizes” establishes their relation- is useful to connect drugs to proteins. One
ship) (Coulet et al. 2010). The rate of false way to do this is to build computational and
positives is reduced in this case because more visualization methods for docking a 3D rep-
strict criteria are applied before claiming a resentation of a small molecule into the 3D
relationship. These high quality interactions structure of a target protein. This can be very
can be chained together to infer new knowl- successful, and has led to the hypothesis that
edge. For example, drug-drug interactions a Parkinson’s disease drug may treat tubercu-
often occur because two drugs share a com- losis (Kinnings et al. 2009)! In that case, the
mon metabolizing gene and that gene becomes 3D structure of a tuberculosis protein had
saturated in the presence of both drugs, and a pocket that appeared to have high bind-
cannot adequately metabolize both of them. ing potential to a known Parkinson’s disease
Thus, the observations that “CYP2D metab- drug, and thus the hypothesis arose that the
884 J. D. Tenenbaum et al.
Parkinson’s drug might inhibit TB growth. based on the assumption that side effects
These structure-based methods are powerful arise from a few common mechanisms, and so
but limited because we have the 3D structure genes involved in this mechanism may be tar-
of only a subset of human proteins. Another geted by multiple drugs or drug classes. In one
approach, therefore, is based on looking for study, Campillos et al. showed that they could
similarities in the list of drugs that have been create 1018 drug-drug relationships based on
shown experimentally to bind a protein. In this shared side effects (Campillos et al. 2008). The
case, all that is needed are data from chemical side effects were taken from the SIDER data-
26 assays showing which drugs bind which pro- base, and the drugs came from a list of 746
teins. These are routinely collected in large marketed drugs. Twenty of these drug-drug
screening experiments, and are available at the relationships were tested experimentally, and
ChEMBL resource (Heikamp and Bajorath 13 of them were shown to bind common tar-
2011), for example. Given two proteins with gets. Thus, a relatively straightforward asso-
two lists of interacting drugs, we can compare ciation of drugs based on side effects allowed
the list of drugs to look for commonalities. If the definition of molecular targets. In related
there are many commonalities between pro- work, Hansen et al. showed that genes could
tein A and protein B, then one might conclude be ranked by their likelihood of interacting
that the drugs that bind protein A may also with a drug based on looking at the degree
bind protein B. This was the approach taken of similarity between chemical structure and
in the Similarity Ensemble Approach (SEA) indications-of-use between the query drugs,
where the list of drugs binding two proteins and small molecules known to interact with
are compared using a measure of chemi- the gene products and their close protein
cal similarity (Keiser et al. 2009). When the interaction neighbors (Hansen et al. 2009).
chemicals on the two lists are statistically sim- The pattern of predicted binding of a small
ilar (more than would be expected by chance), molecule to protein off-targets can also yield
then the SEA method predicts cross-binding information about the likely side effect profile
of ligands for the two structures. When this for that molecule (Liu and Altman 2015).
was applied to a large set of proteins, the The examples we have discussed have
authors found that the antidepressant fluox- several common features: they deal with the
etine (Prozac) had high potential binding to basic objects of diseases, drugs, and disease
the beta-adrenergic receptor, and this was or adverse-event phenotypes; they integrate
found experimentally to block the beta-1 at least two sources of data to establish new
receptor—demonstrating that Prozac is a type relationships between these basic objects; and
of beta-blocker! they connect clinical entities (drugs and dis-
eases or adverse events) to molecular entities.
zz Identifying Drug Targets Using Side-Effect Such examples represent only a small subset
Similarity of the types of questions that can be asked
A critical goal in pharmacogenomics is to with these valuable datasets. The key techni-
associate drugs with their target proteins (and cal challenges are typically (1) finding ade-
thus their coding genes) in order to know quate gold standards (7 Chap. 2) to evaluate
where to look for variation that may affect the success of methods before applying them
drug response. Determining drug targets can for novel discoveries; (2) understanding the
involve a difficult and lengthy experimental sources of error and bias so that predictions
program. Thus, it would be very useful to are as reliable as possible; (3) designing careful
have computational methods for determin- statistical tests to ensure that the scoring and
ing targets. One way to do this is to associate estimates of significance are accurate and use-
drugs to their side effects, and to look for side ful (minimizing false positives, in particular);
effect profiles that are similar across drugs. If and (4) identifying and engaging experimen-
one drug has a known target, and if another tal collaborators who can, when appropriate,
drug has a similar pattern of side effects, then test the predictions that are made in human or
the two drugs may share that target. This is model systems. Recently, it has become clear
Translational Bioinformatics
885 26
that despite their shortcomings, EHRs can zz Computational Methods to Leverage Stem
be extremely useful for initial validation of Cell-Based Model Systems and CRISPr
hypotheses about connections between drugs Assays
and adverse events (Tatonetti et al. 2011a). The rise in the use of stem cells will create
Gene-drug associations are typically tested in opportunities for combining direct measure-
model systems with genes altered in order to ments of cellular response to drugs with
reduce or eliminate their normal function, or systems models of response, whole genome
by looking for covariation in human subjects. variation, and epigenetic information. As we
perfect methods for creating induced pluripo-
tent stem cells and differentiating them into
26.5.3 Challenges the target tissues, we will be in a position to
for Pharmacogenomics measure the response to drugs directly on
these cells, with identical genetic and per-
zz Target Expansion: Molecules to Networks haps epigenetic backgrounds. Computational
The emerging field of systems pharmacology methods for analyzing these responses and
is abandoning the view of “one drug, one tar- relating them to the expected response in the
get” and moving instead toward a view that patients from whom these cells are derived
“the network is the target.” That is, the larger will be a major challenge in the years ahead.
network of interacting genes is targeted by a Similarly, the increased availability and ease
drug at several points, and thus the systemic of generating genome wide CRISPr libraries
effects of drugs need to be evaluated in order that allow genes to be knocked out alone and
to understand better the molecular under- in combination promises to usher in an era of
pinnings of drug response. The challenges unprecedented data about the effects of genes
to systems pharmacology are similar to the on drug response (Kweon and Kim 2018).
challenges to the more general systems biol-
ogy: defining the network topology and key
players, creating ways to measure parameters, 26.6 Ontologies for Translational
modeling nonlinear responses, and under- Research
standing how variation in the basic molecular
players impacts the resulting phenotype—in In order to apply computational methods for
this case drug response phenotypes. biomarker discovery, one needs a consistent
way to refer to genes, diseases, drugs, devices,
zz Rare Variants etc. Several ontologies exist in the biomedical
As whole genome sequencing increasingly domain, many under active development, that
provides data about rare variants, the para- provide the necessary terms for creating consis-
digm of looking for common genetic varia- tent annotations—preferably in an automated
tion that explains variation in drug response manner—for the various datasets that are at
will need to be modified. There may be cases the core of conducting research in TBI. One
when variation in drug response is explained primary need in TBI is to identify and refer
by multiple rare variants rather than one or unambiguously to diseases using one or more
a few common variants. This is particularly disease ontologies. We use the term disease
challenging because there will often be insuf- ontology to refer to artifacts—terminologies
ficient statistics to evaluate rare variants. In and vocabularies as well as true ontologies—
some cases, huge population-based studies that can provide a hierarchy of parent–child
may provide enough samples, but in other terms for disease conditions. Disease-specific
cases even these large cohorts will not have and other clinically-oriented ontologies are
sufficient examples of any rare variant to discussed in detail in 7 Chap. 7.
allow statistical validation. In those cases, The Ontology for Biomedical Investigations
we will have to rely on computational tech- (OBI) was developed as a collaboration among
niques to assess the significance of very rare a number of experimental communities
variations. around the world in order to represent com-
886 J. D. Tenenbaum et al.
mon aspects of biological and clinical inves- tual data relevant to a domain of interest and
tigations. It includes broadly applicable terms returns as output an ordered list of ontologies
such as assay, as well as more specific terms, that would be most appropriate for annotating
such as transcription profiling by array assay. It the corresponding text. By browsing ontolo-
is particularly useful for annotation of experi- gies on BioPortal and using tools such as the
mental metadata, for example to record that ontology recommender, a cancer biologist
a protein expression profiling assay was per- may find, for example, that although the Gene
formed on a blood specimen (Brinkman et al. Ontology offers some terms for annotating
26 2010). her experimental data related to cell division,
there are more precise terms in the NCIt. She
may discover that the Foundational Model
26.6.1 Ontology-Related Resources of Anatomy ontology provides terms for
for Translational Scientists consistently naming body parts from which
the experimental specimens were obtained,
The use of ontology-based analyses for TBI, or that the National Drug File – Reference
especially disease and drug ontologies as well Terminology (NDF-RT) provides the prop-
as analyses using multiple ontologies, is a erties of the drugs used in generating the
recent development and the adoption and use experimental data. BioPortal allows users to
of ontologies is likely to accelerate. Several navigate ontologies using a tree browser or
resources are available for researchers who visualize ontologies as a graph that offer cog-
wish to use ontologies in making sense of nitive support for understanding the complex-
large-scale datasets. The UMLS, or Unified ities of large ontologies (. Fig. 26.6).
Medical Language System (see 7 Chaps. 2 To provide the relationships between terms
and 7), is a set of files and software that brings in two different ontologies, BioPortal provides
together many health and biomedical vocabu- mappings between the terms (Ghazvinian
laries and standards to enable interoperabil- et al. 2009). The mappings can inform the user
ity among computer systems. The UMLS has that the term Melanoma in the NCI Thesaurus
many uses, including search engine retrieval, is related to the term Malignant, Melanoma
data mining, public health statistics report- in SNOMED-CT and to Melanoma in the
ing, and terminology research. In the field of Human Disease Ontology. These mappings
TBI, the UMLS is a relatively underutilized allow users to compare the use of related
resource, but that is changing with the increase terms in different ontologies and to analyze
in the variety of access options (Aronson how whole ontologies compare with one
2001; Bodenreider 2004; Aronson et al. 2008; another (Ghazvinian et al. 2011). In addition
Shah and Musen 2008; Aronson and Lang to curated mappings from the UMLS metath-
2010; Mork et al. 2010) and heightened dis- esaurus, BioPortal enables algorithmic and
semination efforts by the National Library of user-generated mappings as well.
Medicine.
The National Center for Biomedical
Ontology maintains a repository of biomedi- 26.6.2 Enrichment Analysis
cal ontologies called BioPortal (Musen, et al.
2011) which provides access through both Enrichment analysis is a statistical method
Web pages and Web Services to more than to determine whether, for a set of items, a
600 biomedical ontologies and controlled given concept or value is statistically over-
terminologies. Users go to the BioPortal Web represented compared to what one would
site to browse biomedical ontologies and to expect by chance. For example, informatics-
search for specific ontologies relevant to their related terms are over-represented in this
work. BioPortal also provides tools such as book compared to what one would expect to
the Ontology Recommender (Jonquet et al. find in a random sampling of words from all
2010), which takes as input representative tex- textbooks. The canonical example of enrich-
Translational Bioinformatics
887 26
.. Fig. 26.6 A portion of the National Cancer Insti- root node of ‘Disease, Disorder or Finding’. The navi-
tute’s thesaurus. The left pane shows a standard tree gation bar just above the graphical visualization pro-
view for the term ‘Melanoma’. The right pane shows a vides access to additional information, such as mappings
visualization that provides additional context by show- which provide hooks into other disease ontologies that
ing the parent classes of melanoma, all the way to the contain the concept Melanoma
ment analysis involves a list of genes dif- the annotating GO concepts for each gene in
ferentially expressed in some condition. To this list, and arrive at a profile of the biologi-
determine the biological meaning of such a cal processes or mechanisms affected by the
list, the usual solution is to perform enrich- condition under study. This approach does
ment analysis with the GO (Gene Ontology), have certain limitations, for example incom-
which provides terms for consistent nam- plete annotations for a number of genes,
ing of the cellular component (CC) of gene lack of conditional independence between
products, the molecular functions (MF) they annotations, sensitivity to GO version, and
carry out, and the biological processes (BP) in lack of a systematic mechanism to compen-
which they participate. Several curation proj- sate for differing levels of depths in different
ects use GO terms to annotate gene products branches of the ontology hierarchy (Khatri
from multiple organisms with terms from the and Draghici 2005; Rhee et al. 2008; Tomczak
three branches (CC, MF, BP) (Camon et al. et al. 2018). Despite this, such analysis is
2003). These annotations form the basis for widely popular in the bioinformatics commu-
enrichment analysis in which we can aggregate nity and has resulted in over 100 tools listed
888 J. D. Tenenbaum et al.
on the GO website7 and over 7000 citations tation analysis methods for detecting drug
to the landmark paper on the Gene Ontology safety signals using electronic medical records
(Ashburner et al. 2000). up to 2 years before a drug’s recall (LePendu
Disease and drug ontologies can be used et al. 2013).
to perform enrichment analysis in a manner Mining EHR data has also been pro-
similar to GO-based analyses for gene expres- posed as a solution to the challenge of the
sion data (Subramanian et al. 2005; LePendu large number of subjects that are needed for
et al. 2011a). Just as scientists can ask Which genome wide association studies (GWAS).
26 biological process is over-represented in my Patients are increasingly able to consent to, or
set of interesting genes or proteins, we can in some cases to opt out of, allowing excess
also ask Which disease (or class of diseases) biospecimens taken in the course of clinical
is over-represented in my set of interesting care to be used in a de-identified fashion for
genes or proteins? For example, by annotating genomic testing. Even for relatively strong
known protein mutations with disease terms, genetic effects, GWAS requires thousands
Mort et al. were able to identify a class of of individuals for sufficient statistical power
diseases—blood coagulation disorders—that (7 Chap. 11). For weaker effects, tens of
were associated with a lower than expected thousands of subjects are likely to be needed.
rate of amino acid substitutions at O-linked Although the cost of genotyping continues
glycosylation sites (Mort et al. 2010). to decrease, recruitment and sample collec-
tion for these large numbers is both costly and
labor- intensive. Leveraging the health care
26.7 atural Language Processing
N system and EHRs for research recruitment
for Information Extraction offers a potential approach to circumvent this
problem. Ritchie et al. demonstrated the feasi-
Ontologies are also useful in the context of bility of this approach by using EHR data and
extracting information from a body of text. an associated biobank to replicate a number
In-depth methods for natural language pro- of previously discovered genotype-phenotype
cessing are discussed in 7 Chap. 7. Here we associations (Ritchie et al. 2010).
describe some applications in the context of One major initiative in this area is the
translational research. eMERGE (Electronic Medical Records and
Genomics) Network, whose initial aim was to
demonstrate that data captured through rou-
26.7.1 ining Electronic Health
M tine clinical care are sufficient to identify cases
and controls accurately for GWAS (Thorisson
Records
et al. 2005). As of 2018, the eMERGE consor-
tium includes eleven institutions with DNA
Researchers have shown that is possible to
repositories and associated electronic medical
profile patient cohorts from EHRs using a
record systems. For each site, ontology-based
variety of ontologies including SNOMED
data extraction and natural language process-
CT, MedDRA, and RxNorm (LePendu et al.
ing algorithms are applied to the EHR in order
2011b). For example, LePendu et al. devel-
to determine phenotypes such as dementia,
oped methods to annotate clinical text and
cataracts, peripheral artery disease, type 2
methods for the mining of the resulting anno-
diabetes, and cardiac conduction defects. This
tations to compute the risk of having a myo-
analysis is performed in a high-throughput,
cardial infarction on taking Vioxx (rofecoxib)
scalable fashion with results compared to a
for Rheumatoid arthritis. Subsequently they
manually curated gold standard in order to
demonstrated that it is possible to apply anno-
determine positive and negative predictive val-
ues for cases and controls for the phenotypes
in question (Kho et al. 2011). The consortium
7 7 h t t p : / / w w w. g e n e o n t o l og y. o rg / G O. t o o l s. also looks at cross- institutional algorithm
shtml#alphabet (Accessed 12/3/2012). application, ethical, legal, and social issues
Translational Bioinformatics
889 26
around DNA biobanks, and the potential for has enabled novel analyses from already col-
future incorporation of GWAS findings into lected molecular and clinical data (Garber
clinical care (Liu et al. 2012; Rohrer Vitek et al. 2017; Sweeney et al. 2018a, b). Such
et al. 2017; Wan et al. 2017). annotation represents a large part of the work
These types of EMR-associated biobank required to address the ‘F’ in FAIR data,
resources enable a number of other approaches making data ‘findable.’
to data mining. For example, Denny et al. used The utility of consistent annotation of
BioVU at Vanderbilt University to perform research datasets is now widely accepted. As a
what they called a “PheWAS,” or a system- result, there are several initiatives to build tools
atic, high-throughput phenome-wide asso- for consistent meta-data assignment, as well
ciation scan (Denny et al. 2010). Instead of as indices of available datasets correspond-
measuring whole genomes across thousands ing to specific terms of interest. CeDAR, the
of patients in order to find a gene associated Center for Expanded Data Annotation and
with a phenotype in question, they measured Retrieval, was formed with a goal of devel-
only five alleles across thousands of patients oping information technology to facilitate
and performed enrichment analysis for vari- authoring and adoption of metadata (Musen,
ous diseases based on ICD9 codes. They then et al. 2015). BioCADDIE (Biomedical and
were able to reproduce known associations healthCare Data Discovery Index Ecosystem)
between those genes and certain diagnoses, is an international effort to promote biomedi-
and to generate new hypotheses for associa- cal data discovery through the creation of a
tions between these genes and other diagnoses data discovery index called DataMed (Chen
that were statistically enriched for a given gen- et al. 2018).
otype. The ability to connect, at a molecular
level, diseases that were not previously asso-
ciated can have implications for therapeutic 26.8 Network Analysis
intervention (Denny et al. 2016).
Biology lends itself in various ways to mod-
eling through networks or graphs. The term
26.7.2 Dataset Annotations “graph” simply refers to a set of nodes or
circles connected by a set of edges or lines.
In addition to EHRs, public repositories for In a molecular context, a node represents a
omics-scale datasets remain a valuable but molecular entity, and an edge represents some
underutilized resource for data mining. Upon form of relationship between those molecular
submission, these datasets typically con- entities. This relationship may be a physical
tain only free-text descriptions. Addressing interaction (e.g., binds to), an influence (e.g.,
the lack of annotations, researchers demon- activates), or a similarity (e.g., is co-expressed
strated that translational analyses are enabled with), among other possibilities. One fre-
by automatically annotating tissue and gene quently sees graphical models of gene regu-
microarray datasets with ontology terms latory networks, protein-protein interactions,
(Shah et al. 2009a; Doan et al. 2014). Butte and signaling cascades. The set of all of these
et al. employed a crowd-sourcing approach to sorts of physical interactions has been referred
annotate samples from the Gene Expression to as the interactome (Barabasi et al. 2011).
Omnibus (Hadley et al. 2017). Such auto- Studying this interactome and its properties
mated annotation approaches have been gen- from a graph theory perspective enables use-
eralized to create systems that process the free ful insights regarding gene modules and path-
text metadata of diverse database elements ways, and how these are disrupted in disease.
such as gene expression data sets, descriptions A number of researchers have attempted to
of radiology images, clinical-trial reports, and develop gene association networks using gene
PubMed article abstracts to annotate and expression data either alone or together with
index them with concepts from appropriate other sources of network data such as protein-
ontologies (Jonquet et al. 2011). Doing so protein interactions. The general idea is that
890 J. D. Tenenbaum et al.
DISEASOME
Lymphoma HEXB
Spinal muscular atrophy
KRAS AR
Androgen insensitivity
Breast cancer LMNA ATM
Prostate cancer Perineal hypospadias BRCA2
MSH2 BRIP1
Pancreatic cancer
Lymphoma PIK3CA BRCA1
Wilms tumor KRAS
Wilms tumor TP53 RAD54L
Breast cancer
Ovarian cancer Spinal muscular atrophy TP53
MAD1L1
Pancreatic cancer
Sandh off disease MAD1L1
Papillary serous carcinoma CHEK2
RAD54L
Fanconi anemia Lipodystrophy
T-cell lymphoblastic leukemia PIK3CA
VAPB
Charcot-Mare-Tooth disease
Ataxia-telangiectasia
CDH1 MSH2
CHEK2
Amyotrophic lateral sclerosis
BSCL2
Silver spastic paraplegia syndrome
Fanconi anemia B R IP 1
.. Fig. 26.8 The Human Disease Network. The middle The Disease Gene Network on the right depicts genes as
panel shows a small subset of the bi-partite gene-disease node, with connections indicating that they have been
network based on OMIM (Online Mendelian Inheri- implicated in one or more of the same disorders. (From
tance in Man) gene-disease relationships. The Human Goh et al. (2007), ©2007 National Academy of Sciences,
Disease Network on the left shows diseases as nodes, U.S.A. with permission)
with connections representing common related genes.
development (see . Fig. 26.9). Inclusion of the cause) to rational drug design (Yildirim
drugs still under investigation, i.e., not yet et al. 2007). More recently, Hu et al. used a
FDA approved at the time of analysis, dem- network-based approach to look at comor-
onstrated that the breadth of drug targets is bidities and the effects of environmental
expanding, suggesting a trend toward target perturbations (Hu et al. 2016). Using this
diversity. Incorporating the cellular compo- approach they were able to generate hypothe-
nent of target proteins showed that the distri- ses for molecular mechanisms of comorbidity
bution of cellular location for target proteins, which in turn can facilitate drug repurposing
previously nearly two-thirds membrane- and the development of targeted therapeutics.
associated, is becoming more diverse, better
matching the known distribution for disease
proteins. Finally, this group incorporated 26.9 Basepairs to Bedside
protein-protein interaction data to facilitate
the study of network properties of drug target Although the sequenced human genome has
gene products. They looked for the shortest not been a panacea for human disease, it has
path between drug target genes and known enabled the beginnings of a new approach
disease genes for the disorder that drug was to human health and to the practice of pre-
intended to treat and found that this number cision medicine (Collins and Varmus 2015).
appears to be decreasing over time, suggest- As the price of genomic sequencing falls and
ing that drugs are moving from a palliative as our knowledge regarding the meaning of
approach (i.e. treating the symptoms and not genomic variation increases, genotypic data
892 J. D. Tenenbaum et al.
METABOLISM
BLOOD
CARDIOVASCULAR
DERMATOLOGICAL
ADRA2A
GENITO-URINARY
SCN5A
HORMONES
SLC6A4
PGR ADRA1A ANTI-INFECTIVES
ESR1 SLC6A2 ANTINEOPLASTICS
MUSCULOSKELETAL
DRD1
NERVOUS SYSTEM
26
HTR2A
DRD2
ANTIPARASITIC
ADRB1 RESPIRATORY
ADRB2
SENSORY ORGANS
AR VARIOUS
CHRM2
HRH1 CHRM1 GABRA1
Membrane
Cytoplasm
Exterior
COX1 BZRP Organelles
OPRM1 Nucleus
COX2
Unknown
.. Fig. 26.9 Yildirim et al.’s Drug-Target network. their cellular location. Clusters of drugs associated with
Circles represent FDA-approved drugs and rectangles one target reflect the pharmaceutical industry’s ten-
represent target proteins. Diseases are color-coded by dency to develop ‘follow-on’ drugs. (Courtesy of Albert-
anatomical system and protein targets according to Laszlo Barabasi, MD, with permission)
is poised to become a standard component this way, whether because the SNPs them-
of a person’s medical record. In this section selves were of interest, or due to genetic link-
we describe the translational path of genom- age—the tendency for alleles located close to
ics, from sequencing in the lab to clinical rel- one another on a chromosome to be inherited
evance for individuals. together. However, more recent findings have
demonstrated that the concept of “common
disease-common variant” is flawed (Zhu et al.
26.9.1 Whole Genome Sequencing 2011). Indeed, there has been some disap-
pointment in the extent to which GWAS has
zz Technologic Advances been able to explain common diseases with
The DNA-probe approach to genotyping, known genetic components (Manolio et al.
described in 7 Chap. 9, may be compared 2009). Whole genome or, in some cases, whole
to looking for one’s car keys under the pro- exome sequencing allows researchers to iden-
verbial street lamp. That is, the technology tify rare variants (i.e., those with a minor allele
shines the light on a certain portion of the frequency of <1%) that account for genetic
genetic landscape, and that is where we look. disease. Advances in sequencing technolo-
Which SNPs are included on a chip is deter- gies (e.g., “nextgen” sequencing, and third-
mined in large part by which SNPs have been generation sequencing, see 7 Chap. 9) and the
detected in the past, for example through the corresponding decrease in cost, make genome-
HapMap project (Kang et al. 2006). A num- scale sequencing increasingly feasible in trans-
ber of new associations have been found in lational research and even in clinical care.
Translational Bioinformatics
893 26
zz Whole Genome Versus Exome genomic information about the individual in
Even with recent advances in genome question.) These actions prompted responses
sequencing technology, the cost to sequence a that ranged from “too little, too late” to “a
full genome at a rate of coverage that enables heavy-handed bureaucratic response to a
the identification of novel SNPs is still sig- practically minimal risk that will unnecessar-
nificant, on the order of one thousand dol- ily inhibit scientific research” (Church et al.
lars. However, recall that only about 1% of 2009). Current NIH policy allows investi-
the genome actually codes for proteins and gators to submit a data access request to be
85% of known disease-causing mutations reviewed by an NIH Data Access Committee.
with large effects occur in proteins (Choi Access to data is granted once a Data Use
et al. 2009). One way to further decrease the Certification is co-signed both by the inves-
time and cost of sequencing is to look at only tigator and the appropriate official[s] at the
those stretches that code for actual proteins. investigator’s affiliated institution.9
This can be justified because most variants The Global Alliance for Genomics &
known to underlie Mendelian disorders dis- Health is a policy and standards oriented
rupt protein-coding sequences. Of course, organization aimed at enabling responsible
this approach will miss causal variations if data sharing (Terry 2014). It was founded
they exist in the other 99% of the genome. in 2013 when 50 colleagues from 8 countries
Moreover, a recent cluster of publications met to discuss challenges and opportuni-
from the ENCODE (Encyclopedia of DNA ties in genomic research and medicine, and
Elements) Consortium asserts assignment of now comprises more than 500 organizational
biochemical function for 80% of the genome members from 71 different countries. A num-
(Dunham et al. 2012). Some additional com- ber of Work Streams and Driver Projects
ponents, such as regulatory regions or splice guide development efforts and serve to pilot
acceptor and donor sites may be included as the organization’s tools. In a similar vein, the
well to increase sensitivity without incurring RDA (Research Data Alliance) is an inter-
significant additional cost. Exome sequencing national organization meant to promote
in a small number of individuals has been used data sharing and data driven research, and
to identify the causal variant for rare diseases to develop and promote the technical infra-
such as Miller’s syndrome, a multiple malfor- structure to that end. The RDA’s scope goes
mation disorder (Ng et al. 2010), and Proteus beyond genomic and biomedical data, but the
syndrome, a disorder causing the overgrowth mission is highly aligned with GA4GH. One
of tissues and organs, thought to have afflicted of the RDA’s many Working Groups is
the nineteenth century Englishman known as called “FAIRSharing Registry WG: connect-
The Elephant Man (Lindhurst et al. 2011). ing (meta) data standards, repositories, and
policies.10”
zz Genomic Data Sharing Additional online genomic resources
In 2008, researchers demonstrated that the include TCGA (The Cancer Genome Atlas)
presence of a single genome within a complex and WTCCC (Wellcome Trust Case Control
mixture of DNA samples could be ascertained Consortium). TCGA is a joint effort of the
(Homer et al. 2008). This caused both NIH National Cancer Institute (NCI) and the
and the Wellcome Trust8 to limit access not National Human Genome Research Institute
only to individual genomes, but to aggregate (NHGRI) to accelerate understanding of the
genomic information as well. (Note that the molecular basis for cancer through appli-
ability to determine the presence or absence
of an individual’s DNA in a heterogeneous
sample presupposes the availability of detailed
9 7 http://grants.nih.gov/grants/guide/notice-files/
NOT-OD-07-088.html (Accessed 10/29/2018).
10 7 https://rd-alliance.org/group/fairsharing-regis-
8 7 http://www.wellcome.ac.uk/ (Accessed try-connecting-data-policies-standards-databases.
10/28/2018). html (Accessed 10/29/2018).
894 J. D. Tenenbaum et al.
tools, automation, and infrastructure, as well Statistics predict that any given patient will
as a whole new paradigm in genetic counsel- find out he is a carrier for some lethal autoso-
ing, are required to make incorporation of mal recessive disease. Illness aside, the average
genomic data into health care feasible for the global non-paternity rate has been estimated
population at large. to be as high as 10% (Olson 2007), though it
is likely closer to 1% (Larmuseau et al. 2016).
zz Ethical, Legal, and Social Issues (ELSI) Genetic information could also have impli-
Pursuit of genomic medicine raises a number cations for the patient’s children, present or
26 of ethical, legal, and social issues (see also future, and for other family members. Patients,
7 Chap. 12). Some worry that people are ill- this group concluded, must have access to
equipped to process the results of these tests. trained professionals to provide answers to
But it is not clear that a paternalistic approach their questions, where answers exist. This will
is a better alternative; there was a time when be difficult, lengthy, and expensive, but not to
it was considered acceptable for a doctor not do it would undermine the consent process
to disclose a cancer diagnosis to the patient (Ormond et al. 2010).
himself (Novack et al. 1979). In addition, new Although knowing the “parts list” for the
discoveries are being made all the time—what human genome is an important step, much
are the obligations to follow up if something remains to be understood about how genes
new (and dire? and actionable?) is discovered factor into human health and disease. For
about a given subject? Other questions include most diseases, the environment plays as much,
whether enough is known for the results to be if not more, of a role as a person’s DNA. Aside
of any practical use, whether the service should from some notable, deterministic exceptions
be provided outside of the context of a rela- such as Huntington’s disease, most known
tionship with a clinical caregiver, and whether risk alleles confer fairly low odds ratios unto
results could have detrimental effects on a per- themselves (see 7 Chap. 3), making an indi-
son’s ability to secure health insurance. Some vidual, for example, approximately 1.1 times
states have banned the services, others have as likely as the average individual to develop a
made stipulations requiring clinician involve- given condition. Even when ratios are as high
ment and CLIA certification for the labs that as, say, twofold, it is of dubious actual util-
handle the samples and process the results.16 ity to know that based on one’s genotype, the
odds of being diagnosed with Crohn’s disease
In a companion article to the Quake pro- went from 0.5 in 100 to 1 in 100.
file, it was asserted that consent for a process For certain disease markers, such as
in which the risks of knowledge gained are Alzheimer’s or BRCA1 and BRCA2, it was,
not wholly understood is more complex than and largely still is, unknown what impact
for simple genetic testing. People have trouble negative results might have on a customer’s
interpreting probabilities. Patients must be mental and emotion well-being. Some studies
advised that they may find out things they did have shown that while a person experiences
not want to know about. The eminent scientist negative emotions immediately in the wake
James Watson made a point of requesting that of learning the bad news, over a time period
his ApoE status be redacted from the release of months there is no significant difference
of his full genome because he did not want to in anxiety, depression, or test-related distress
know if he was at risk. His grandmother had (Green et al. 2009). In any case, DTC genetics
died of Alzheimer’s at 83, and he did not want companies’ websites must provide the ability
to worry that every subsequent memory lapse to view sensitive results while protecting the
marked the onset of dementia (Angrist 2010). customer from stumbling on these findings
unintentionally. 23andMe, as an example, has
spent considerable resources on the design
16 7 http://www.genomeweb.com/dxpgx/will-other-
of a user-friendly interface through which
states-follow-ny-calif-taking-dtc-genetic-testing- to present an individual’s “health reports,”
firms (Accessed 12/6/2012). or their individual genotype for markers that
Translational Bioinformatics
897 26
b F2
2–fold
Increased Risk
O
AB
Average Risk
F5
2–fold
Decreased Risk
.. Fig. 26.10 23andMe’s graphical representation of The individual’s relative risk for each of three reported
relative risk from their website before the FDA stepped markers: factor 5, factor 2, and ABO. (Specific values
in to regulate DTC genetic testing. a Colored figures were displayed on the website when the user hovered the
represent the number of people on average out of 100 mouse over the colored bars.) In a later version of the
who are likely to develop venous thromboembolism website, risk for hereditary thrombophilia was based
over the course of a lifetime. Green figures represent the only on factor 2 and factor 5; ABO was no longer
individual’s personal reported risk; blue figures repre- included. (© 23andMe, Inc. 2007–2012. All rights
sent the average risk for females of European descent. reserved; distributed pursuant to a Limited License
(Accompanying text has been shortened for clarity.) b from 23andMe)
have been characterized through reliable, . Figure 26.10 shows such a graphic for an
established research methods. Along with a individual’s risk of venous thromboembolism
text explanation, these health reports give a from a circa 2012 version of the website, before
graphical depiction of a person’s relative risk. the FDA began to regulate DTC genetic test-
898 J. D. Tenenbaum et al.
ing. The graphical representation has been Plans to sell the saliva collection container
modified in more recent versions of the site in brick-and-mortar stores were put on hold
to be less specific, more accurately reflect- until the regulatory issues could be resolved,
ing the underlying uncertainty. For sensitive or at least addressed.
results such as BRCA1 and 2, and markers Another high profile legal issue is the case
for Alzheimer’s and Parkinson’s disease, the of Assoc. for Molecular Pathology v. Myriad
information is initially “locked.” Users must Genetics, Inc., et al., regarding Myriad’s pat-
explicitly click through an additional screen ent on the BRCA1 and BRCA2 genes, which
26 to confirm that they truly want to know geno- were included in 23andMe’s offerings,17 and
type and relative risk for that trait. more generally whether genes should be pat-
entable at all. In 2011, a federal appeals court
zz Rulings and Regulations overturned a lower court in the case of and
From a regulatory perspective, it was not ini- found that genes can, in fact, be patented
tially clear whether these services qualify as (Pollack 2011). This ruling was upheld in a
medical devices as defined by the FDA, and court of appeals in 2012, however, in 2013,
are therefore subject to regulation by the the Supreme Court partially overturned that
Agency. In fall of 2013, the FDA sent a letter ruling and found that isolated genomic DNA
to 23andMe ordering it to “immediately dis- (gDNA) is not patent-eligible, but cDNA is.
continue marketing the PGS [Saliva Collection Disappointingly, this ruling did not do much
Kit and Personal Genome Service] until such to reduce ambiguity around these issues.
time as it receives FDA marketing authoriza-
tion for the device.” (Annas and Elias 2014) A
few weeks later, 23andMe announced that it 26.10 Challenges and Future
was complying with the FDA’s demands and Directions
suspending their health-related genetic tests.
23andMe continued to offer (and market) their TBI as a discipline continues to evolve in an
ancestry-related testing, and less than 2 years exciting and dynamic phase. Though chal-
later announced FDA approval for a carrier lenges remain, the field is poised to become
screening test for Bloom syndrome, with word- an increasingly crucial element of biomedi-
ing that left the option open for additional car- cal research and clinical practice in the era of
rier screening tests without premarket review precision medicine (Tenenbaum et al. 2016).
(Annas and Elias 2014, 2015). In October, We conclude this chapter with a discussion of
2017, the company announced that it would future directions and key challenges for this
offer genetic risk for ten medical conditions burgeoning discipline.
including Parkinson’s disease and late-onset
Alzheimer’s (Check Hayden 2017). The DTC
testing landscape is still rapidly evolving, and 26.10.1 Expansion of Data Types
will continue to do so for the foreseeable future.
Logistically, a prospective customer typi- Genomic data are already being used to guide
cally registers on the DTC company’s web- clinical care. Genomic data themselves are
site and a sample collection kit is sent in the relatively straightforward in that an individ-
mail, though 23andMe’s kit is also available ual’s genome is relatively static, and through
for purchase through 7 Amazon.com and the intrinsic physical properties of ribonu-
even on the shelves of local brick-and-mortar cleic acids and the transcriptional process,
pharmacies. In May 2010, Pathway Genomics DNA and RNA are relatively easy to capture,
and Walgreens announced a plan to sell these observe, and quantify. Proteins and metabolites
kits in Walgreens drugstores, but the FDA are more challenging in this regard. Proteomic
sent a letter to Pathway Genomics indicating
their belief that the company’s genomic report
qualified as medical device (Bradley et al. 17 7 https://www.23andme.com/health/BRCA-Can-
2011) and as such required FDA approval. cer/ (Accessed 12/19/12).
Translational Bioinformatics
899 26
and metabolomic methodologies have primar- ics, pharmacogenomics, statistics, and data
ily centered around isotopic labeling, but more standards will be increasingly important.
recent approaches enable unbiased label-free Expertise in these fields will also need to be
identification and even quantification (Du et al. supplemented by an expanded workforce of
2008; Wishart 2011). Identification of metabo- genetic counselors. Increasingly, therapies will
lites associated with disease has already enabled require accompanying diagnostic tests. As the
enzymatic drug targeting in diabetes, obesity, opportunities for use of genomic data in clini-
cardiovascular disease, and cancer, among cal care continue to advance, it will become
other conditions (Chan and Ginsburg 2011). increasingly important to incorporate this
We expect that as proteomics and metabolo- information into both the electronic health
mics standards and technologies continue to record and into machine readable clinical care
mature, they will play an increasingly signifi- guidelines for clinical decision support. This
cant role in translational research and practice. in turn will require new standards to capture
The role of epigenetics needs to be under- genomic findings, and new decision support
stood more fully. It is clear that the environ- tools to enable clinicians to incorporate this
ment can induce changes in the packaging ever-increasing amount of information into
and labeling of DNA. These environmental their therapeutic decision making processes
cues can include lifetime exposures to toxins, (Hoffman 2007). A number of standards
viruses, bacteria and nutritional compounds exist in this space; the key will be in educat-
as well as drug exposures. Understanding ing prospective users and enforcing adoption.
the ways in which these epigenetic modifica- This applies to the full translational spectrum,
tions affect phenotype is in its infancy, and from annotation of experimentally generated
so we must understand how to measure these datasets to a common format for the exchange
effects, and then compute with them. The of clinically relevant omic information
human microbiome is also an active area for between EHR systems. Most doctors have
translational research. Though one might only a basic level of training in genetics, and
expect associations between the gut microbi- are ill-equipped to answer in-depth questions
ome and various gastro-intestinal conditions, from patients who bring to an appointment
surprising correlations and even causal rela- printouts of their results from these services
tionships have been discovered with cancer, (Frueh and Gurwitz 2004). More knowledge
neurological, and even psychiatric disorders is required, in addition to training and tools,
(Mayer et al. 2014; Zitvogel et al. 2015; Zheng before family care providers, internists, and
et al. 2016; Clapp et al. 2017). even specialists, are prepared to incorporate
Finally, as standards are developed and genomic information into their clinical prac-
clinicians and researchers see the value tice (Ormond et al. 2010; Chan and Ginsburg
to be gained from structured data collec- 2011).
tion through studies such as The National
Children’s Study (Landrigan et al. 2006),
structured environmental data is likely to be 26.11 Conclusions
increasingly available to complete the picture
for gene-environment interactions (Schwartz As the cost of data generation and storage
and Collins 2007). continues to decrease, and the methods for
data analysis and interpretation continue to
advance, TBI is poised to be a key enabler of
26.10.2 Changes for Medical precision medicine (Tenenbaum et al. 2016).
Training, Practice, One can imagine a day when every newborn
and Support has his or her genome sequenced and this
information becomes a part of the medical
Clinicians will need enhanced training in record, much as blood type is recorded today.
genetics and other areas described above. The biggest challenges to achieving this vision
Curricular components relating to genet- are likely not to be technical ones, but rather
900 J. D. Tenenbaum et al.
ethical, legal, and economic in nature (Schadt zation and tools for interpretation of single
2012). Society must strike a balance between nucleotide variants.
privacy protection and facilitating progress Davies, K. (2010). This text, written by the editor
in biomedical research. Legal issues will need of BioIT World magazine, documents the
to be worked out around direct-to-consumer characters, events, and issues in the race to
genetic testing, gene patenting, preventing achieve the $1000 Genome. In The $1000
genetic discrimination, and many other such genome: The revolution in DNA sequencing
issues. Return on investment will need to be and the new era of personalized medicine.
26 established through economic analysis com- New York: Free Press.
bined with comparative effectiveness research Hastie, T., Tibshirani, R., & Friedman, J. H.
(see 7 Chaps. 11 and 26). Ultimately, some- (2009). The elements of statistical learning :
one will have to pay for these accompanying Data mining, i nference, and prediction.
diagnostic tests. Major change is unlikely until New York: Springer. A useful primer on the
an organization like the Center for Medicare statistical concepts underlying machine learn-
and Medicaid Services (CMS) changes its ing approaches to biomarker discovery.
policies. For example, CMS coverage for the Kann, M. G, & Lewitter, F. (Eds.). (2012).
genetic test to guide warfarin dosing is cur- Translational bioinformatics. PLOS
rently conditional upon it being ordered as Computational Biology Collections eBook.
part of a research protocol (Meckley and This eBook represents both the first “text-
Neumann 2010). TBI will continue to play a book” devoted entirely to TBI, and the first
key role in transforming these types of scien- online, open access textbook from PLOS. In
tific discoveries into improvements in human addition to many of the topics covered in this
health. chapter, the collection includes chapters on
related topics such as cancer genome analysis,
nnSuggested Readings micribiome analysis, structural variation, and
Altman, R. B., & Miller, K. S. (2011). 2010 trans- protein interactions in disease.
lational bioinformatics year in review. Journal Masys, D. R., Jarvik, G. P., Abernethy, N. F.,
of the American Medical Informatics Anderson, N. R., Papanicolaou, G. J., Paltoo,
Association, 18, 358–366. This article summa- D. N., Hoffman, M. A., Kohane, I. S., & Levy,
rizes Dr. Altman’s third annual “year in H. P. (2012). Technical desiderata for the inte-
review” presentation delivered at the 2010 gration of genomic data into electronic health
AMIA Joint Summits on Translational records. Journal of Biomedical Informatics,
Science in San Francisco. 45(3), 419–422. The authors describe the char-
Angrist, M. (2010). Here is a human being: At the acteristics of biomolecular data that differen-
dawn of personal genomics. New York: tiate it from other EHR data, enumerate a set
Harper. This text is written by one of the of technical desiderata for management of
Personal Genome Project’s first subjects, biomolecular data in clinical settings (e.g.,
describing the project, the cohort, and the separation of molecular data observations
experience. It also gives a good overview of from clinical interpretation, lossless data com-
the background of the project and a number pression, support for readability by both
of ethical, legal, and social issues that it raises. humans and machines), and propose a techni-
Capriotti, E., Nehrt, N. L., Kann, M. G., & cal approach to its representation.
Bromberg, Y. (2012, July). Bioinformatics for Sarkar, I. N., & Payne, P. R. O. (2011, December).
personal genome interpretation. Briefings in The joint summits on translational science:
Bioinformatics, 13(4), 495–512. The authors crossing the translational chasm. Journal of
of this review summarize key databases and Biomedical Informatics, 44(Suppl 1), S1–S2.
bioinformatics tools that have been developed This editorial discusses the spectrum of bio-
in recent years to aid in the interpretation of medical informatics, from biology to medicine,
genomic variance. Resources covered include in the context of the NIH Roadmap and the
databases of variants, genotype/phenotype Clinical and Translational Science Award pro-
annotation databases, tools for gene prioriti- gram. It gives the history of the AMIA Joint
Translational Bioinformatics
901 26
Summits on Translational Science, and explains SMART on FHIR genomics: Facilitating standard-
the emergence of TBI and CRI as disciplines ized clinico-genomic apps. Journal of the American
Medical Informatics Association, 22(6), 1173–1178.
unto themselves, intended to address the same
Altman, R. B. (2007). PharmGKB: A logical home for
issues that motivated those initiatives- namely knowledge relating genotype to drug response phe-
translating scientific discoveries into meaning- notype. Nature Genetics, 39(4), 426.
ful changes in health care delivery. Altshuler, D. M., Durbin, R. M., & 1000 Genomes
Sarkar, I. N., Butte, A. J., Lussier, Y. A., Tarczy- Project Consortium. (2010). A map of human
genome variation from population-scale sequenc-
Hornoch, P., & Ohno-Machado, L. (2011).
ing. Nature, 467(7319), 1061–1073.
Translational bioinformatics: Linking knowl- Angrist, M. (2010). Here is a human being : At the dawn
edge across biological and clinical realms. of personal genomics. New York: Harper.
Journal of the American Medical Informatics Annas, G. J., & Elias, S. (2014). 23andMe and the FDA.
Association, 18, 354–357. The authors pres- The New England Journal of Medicine, 370(11), 985–
988.
ent the field of TBI in the context of suc-
Aronson, A. R. (2001). Effective mapping of biomedical
cesses from bioinformatics and health text to the UMLS Metathesaurus: The MetaMap
informatics. program. In Proceedings of the AMIA symposium
(pp. 17–21).
??Questions for Discussion Aronson, A. R., & Lang, F. M. (2010). An overview of
1. Should DTC genetic testing for MetaMap: Historical perspective and recent
advances. Journal of the American Medical
health-related traits be regulated by Informatics Association, 17(3), 229–236.
the FDA? Aronson, A. R., Mork, J. G., Neveol, A., Shooshan,
2. Should genes be patentable? S. E., & Demner-Fushman, D. (2008). Methodology
3. Are there sufficient legal protections in for creating UMLS content views appropriate for
place to prevent discrimination based on biomedical natural language processing. In AMIA
annual symposium proceedings (pp. 21–25).
genomic information? If not, what regu- Ashburner, M., Ball, C. A., Blake, J. A., Botstein, D.,
lations are needed? Butler, H., Cherry, J. M., Davis, A. P., Dolinski, K.,
4. Are we headed toward full disclosure of Dwight, S. S., Eppig, J. T., Harris, M. A., Hill, D. P.,
genomic information? Issel-Tarver, L., Kasarskis, A., Lewis, S., Matese,
5. What are some reasons a researcher J. C., Richardson, J. E., Ringwald, M., Rubin,
G. M., & Sherlock, G. (2000). Gene ontology: Tool
might not want to share research data? for the unification of biology. The Gene Ontology
Should they be required to share? If so, Consortium. Nature Genetics, 25(1), 25–29.
under what circumstances (e.g., Ashley, E. A., Butte, A. J., Wheeler, M. T., Chen, R.,
6 months after first publication)? Klein, T. E., Dewey, F. E., Dudley, J. T., Ormond,
6. For novel analyses applied to complex, K. E., Pavlovic, A., Morgan, A. A., Pushkarev, D.,
Neff, N. F., Hudgins, L., Gong, L., Hodges, L. M.,
high-dimensional datasets, should there Berlin, D. S., Thorn, C. F., Sangkuhl, K., Hebert,
be new guidelines in place to prevent J. M., Woon, M., Sagreiya, H., Whaley, R., Knowles,
reporting erroneous results through user J. W., Chou, M. F., Thakuria, J. V., Rosenbaum,
error or data fraud? Why or why not? A. M., Zaranek, A. W., Church, G. M., Greely,
7. What are the major barriers to H. T., Quake, S. R., & Altman, R. B. (2010). Clinical
assessment incorporating a personal genome.
incorporating the benefits of Lancet, 375(9725), 1525–1535.
personalized medicine fully into Atkinson, A. J., Colburn, W. A., DeGruttola, V. G.,
standard practice? DeMets, D. L., Downing, G. J., Hoth, D. F., Oates,
J. A., Peck, C. C., Schooley, R. T., Spilker, B. A.,
Woodcock, J., & Zeger, S. L. (2001). Biomarkers and
surrogate endpoints: Preferred definitions and con-
References ceptual framework. Clinical Pharmacology and
Therapeutics, 69(3), 89–95.
Alecu, I., Bousquet, C., Mougin, F., & Jaulent, M. C. Auer, P. L., Reiner, A. P., Wang, G., Kang, H. M.,
(2006). Mapping of the WHO-ART terminology on Abecasis, G. R., Altshuler, D., Bamshad, M. J.,
Snomed CT to improve grouping of related adverse Nickerson, D. A., Tracy, R. P., Rich, S. S., NHLBI
drug reactions. Studies in Health Technology and GO Exome Sequencing Project, & Leal, S. M.
Informatics, 124, 833–838. (2016). Guidelines for large-scale sequence-based
Alterovitz, G., Warner, J., Zhang, P., Chen, Y., Ullman- complex trait association studies: Lessons learned
Cullere, M., Kreda, D., & Kohane, I. S. (2015). from the NHLBI exome sequencing project.
902 J. D. Tenenbaum et al.
American Journal of Human Genetics, 99(4), 791– network. Clinical Pharmacology and Therapeutics,
801. 100(2), 160–169.
Bahr, N. J., Nelson, S. D., Winnenburg, R., & Butte, A. J. (2008). Translational bioinformatics:
Bodenreider, O. (2017). Eliciting the intension of Coming of age. Journal of the American Medical
drug value sets – principles and quality assurance Informatics Association, 15(6), 709–714.
applications. Studies in Health Technology and Butte, A. J., & Chen, R. (2006). Finding disease-related
Informatics, 245, 843–847. genomic experiments within an international reposi-
Barabasi, A. L., Gulbahce, N., & Loscalzo, J. (2011). tory: First steps in translational bioinformatics. In
Network medicine: A network-based approach to AMIA annual symposium proceedings (pp. 106–110).
human disease. Nature Reviews. Genetics, 12(1), Cai, B., Li, B., Kiga, N., Thusberg, J., Bergquist, T.,
26 56–68. Chen, Y. C., Niknafs, N., Carter, H., Tokheim, C.,
Barrett, T., Wilhite, S. E., Ledoux, P., Evangelista, C., Beleva-Guthrie, V., Douville, C., Bhattacharya, R.,
Kim, I. F., Tomashevsky, M., Marshall, K. A., Yeo, H. T. G., Fan, J., Sengupta, S., Kim, D., Cline,
Phillippy, K. H., Sherman, P. M., Holko, M., M., Turner, T., Diekhans, M., Zaucha, J., Pal, L. R.,
Yefanov, A., Lee, H., Zhang, N., Robertson, C. L., Cao, C., Yu, C. H., Yin, Y., Carraro, M., Giollo, M.,
Serova, N., Davis, S., & Soboleva, A. (2013). NCBI Ferrari, C., Leonardi, E., Tosatto, S. C. E., Bobe, J.,
GEO: Archive for functional genomics data sets- Ball, M., Hoskins, R. A., Repo, S., Church, G.,
-update. Nucleic Acids Research, 41(Database issue), Brenner, S. E., Moult, J., Gough, J., Stanke, M.,
D991–D995. Karchin, R., & Mooney, S. D. (2017). Matching
Becker, K. G., Barnes, K. C., Bright, T. J., & Wang, S. A. phenotypes to whole genomes: Lessons learned
(2004). The genetic association database. Nature from four iterations of the personal genome project
Genetics, 36(5), 431–432. community challenges. Human Mutation, 38(9),
Bodenreider, O. (2004). The unified medical language 1266–1276.
system (UMLS): Integrating biomedical terminol- Camon, E., Magrane, M., Barrell, D., Binns, D.,
ogy. Nucleic Acids Research, 32(Database issue), Fleischmann, W., Kersey, P., Mulder, N., Oinn, T.,
D267–D270. Maslen, J., Cox, A., & Apweiler, R. (2003). The gene
Bradley, W. G., Golding, S. G., Herold, C. J., Hricak, H., ontology annotation (GOA) project: Implementation
Krestin, G. P., Lewin, J. S., Miller, J. C., Ringertz, of GO in SWISS-PROT, TrEMBL, and InterPro.
H. G., & Thrall, J. H. (2011). Globalization of P4 Genome Research, 13(4), 662–672.
medicine: Predictive, personalized, preemptive, and Campillos, M., Kuhn, M., Gavin, A. C., Jensen, L. J., &
participatory--summary of the proceedings of the Bork, P. (2008). Drug target identification using
eighth international symposium of the International side-effect similarity. Science, 321(5886), 263–266.
Society for Strategic Studies in Radiology, August Cedar, H. (1988). DNA methylation and gene activity.
27–29, 2009. Radiology, 258(2), 571–582. Cell, 53(1), 3–4.
Brinkman, R. R., Courtot, M., Derom, D., Fostel, J. M., Chan, I. S., & Ginsburg, G. S. (2011). Personalized med-
He, Y., Lord, P., Malone, J., Parkinson, H., Peters, icine: Progress and promise. Annual Review of
B., Rocca-Serra, P., Ruttenberg, A., Sansone, S. A., Genomics and Human Genetics, 12, 217–244.
Soldatova, L. N., Stoeckert, C. J., Jr., Turner, J. A., Check Hayden, E. (2017). The rise and fall and rise
& Zheng, J. (2010). Modeling biomedical experi- again of 23andMe. Nature, 550(7675), 174–177.
mental processes with OBI. Journal of Biomedical Chen, X., Gururaj, A. E., Ozyurt, B., Liu, R., Soysal,
Semantics, 1(Suppl 1), S7. E., Cohen, T., Tiryaki, F., Li, Y., Zong, N., Jiang,
Brown, E. G., Wood, L., & Wood, S. (1999). The medi- M., Rogith, D., Salimi, M., Kim, H. E., Rocca-
cal dictionary for regulatory activities (MedDRA). Serra, P., Gonzalez-Beltran, A., Farcas, C.,
Drug Safety, 20(2), 109–117. Johnson, T., Margolis, R., Alter, G., Sansone, S. A.,
Bush, W. S., Crosslin, D. R., Owusu-Obeng, A., Wallace, Fore, I. M., Ohno-Machado, L., Grethe, J. S., &
J., Almoguera, B., Basford, M. A., Bielinski, S. J., Xu, H. (2018). DataMed – an open source discov-
Carrell, D. S., Connolly, J. J., Crawford, D., Doheny, ery index for finding biomedical datasets. Journal of
K. F., Gallego, C. J., Gordon, A. S., Keating, B., the American Medical Informatics Association,
Kirby, J., Kitchner, T., Manzi, S., Mejia, A. R., Pan, 25(3), 300–308.
V., Perry, C. L., Peterson, J. F., Prows, C. A., Choi, M., Scholl, U. I., Ji, W., Liu, T., Tikhonova, I. R.,
Ralston, J., Scott, S. A., Scrol, A., Smith, M., Zumbo, P., Nayir, A., Bakkaloglu, A., Ozen, S.,
Stallings, S. C., Veldhuizen, T., Wolf, W., Volpi, S., Sanjad, S., Nelson-Williams, C., Farhi, A., Mane,
Wiley, K., Li, R., Manolio, T., Bottinger, E., S., & Lifton, R. P. (2009). Genetic diagnosis by
Brilliant, M. H., Carey, D., Chisholm, R. L., Chute, whole exome capture and massively parallel DNA
C. G., Haines, J. L., Hakonarson, H., Harley, J. B., sequencing. Proceedings of the National Academy of
Holm, I. A., Kullo, I. J., Jarvik, G. P., Larson, E. B., Sciences of the United States of America, 106(45),
McCarty, C. A., Williams, M. S., Denny, J. C., 19096–19101.
Rasmussen-Torvik, L. J., Roden, D. M., & Ritchie, Church, G., Heeney, C., Hawkins, N., de Vries, J.,
M. D. (2016). Genetic variation among 82 pharma- Boddington, P., Kaye, J., Bobrow, M., & Weir, B.
cogenes: The PGRNseq data from the eMERGE (2009). Public access to genome-wide data: Five
Translational Bioinformatics
903 26
views on balancing research with privacy and pro- Dunham, I., Kundaje, A., Aldred, S. F., Collins, P. J.,
tection. PLoS Genetics, 5(10), e1000665. Davis, C. A., Doyle, F., Epstein, C. B., Frietze, S.,
Cirulli, E. T., & Goldstein, D. B. (2010). Uncovering the Harrow, J., Kaul, R., Khatun, J., Lajoie, B. R.,
roles of rare variants in common disease through Landt, S. G., Lee, B. K., Pauli, F., Rosenbloom,
whole-genome sequencing. Nature Reviews. K. R., Sabo, P., Safi, A., Sanyal, A., Shoresh, N.,
Genetics, 11(6), 415–425. Simon, J. M., Song, L., Trinklein, N. D., Altshuler,
Clapp, M., Aurora, N., Herrera, L., Bhatia, M., Wilen, R. C., Birney, E., Brown, J. B., Cheng, C., Djebali,
E., & Wakefield, S. (2017). Gut microbiota’s effect S., Dong, X., Ernst, J., Furey, T. S., Gerstein, M.,
on mental health: The gut-brain axis. Clinics and Giardine, B., Greven, M., Hardison, R. C., Harris,
Practice, 7(4), 987. R. S., Herrero, J., Hoffman, M. M., Iyer, S., Kelllis,
Collins, F. S., & Varmus, H. (2015). A new initiative on M., Kheradpour, P., Lassman, T., Li, Q., Lin, X.,
precision medicine. The New England Journal of Marinov, G. K., Merkel, A., Mortazavi, A., Parker,
Medicine, 372(9), 793–795. S. C., Reddy, T. E., Rozowsky, J., Schlesinger, F.,
Cook, N. R. (2007). Use and misuse of the receiver Thurman, R. E., Wang, J., Ward, L. D., Whitfield,
operating characteristic curve in risk prediction. T. W., Wilder, S. P., Wu, W., Xi, H. S., Yip, K. Y.,
Circulation, 115(7), 928–935. Zhuang, J., Bernstein, B. E., Green, E. D., Gunter,
Coulet, A., Shah, N. H., Garten, Y., Musen, M., & C., Snyder, M., Pazin, M. J., Lowdon, R. F., Dillon,
Altman, R. B. (2010). Using text to build semantic L. A., Adams, L. B., Kelly, C. J., Zhang, J., Wexler,
networks for pharmacogenomics. Journal of J. R., Good, P. J., Feingold, E. A., Crawford, G. E.,
Biomedical Informatics, 43(6), 1009–1019. Dekker, J., Elinitski, L., Farnham, P. J., Giddings,
Crick, F. (1970). Central dogma of molecular biology. M. C., Gingeras, T. R., Guigo, R., Hubbard, T. J.,
Nature, 227(5258), 561–563. Kellis, M., Kent, W. J., Lieb, J. D., Margulies, E. H.,
Denny, J. C., Ritchie, M. D., Basford, M. A., Pulley, Myers, R. M., Starnatoyannopoulos, J. A.,
J. M., Bastarache, L., Brown-Gentry, K., Wang, D., Tennebaum, S. A., Weng, Z., White, K. P., Wold, B.,
Masys, D. R., Roden, D. M., & Crawford, D. C. Yu, Y., Wrobel, J., Risk, B. A., Gunawardena, H. P.,
(2010). PheWAS: Demonstrating the feasibility of a Kuiper, H. C., Maier, C. W., Xie, L., Chen, X.,
phenome-wide scan to discover gene-disease associ- Mikkelsen, T. S., Gillespie, S., Goren, A., Ram, O.,
ations. Bioinformatics, 26(9), 1205–1210. Zhang, X., Wang, L., Issner, R., Coyne, M. J.,
Denny, J. C., Bastarache, L., & Roden, D. M. (2016). Durham, T., Ku, M., Truong, T., Eaton, M. L.,
Phenome-wide association studies as a tool to Dobin, A., Lassmann, T., Tanzer, A., Lagarde, J.,
advance precision medicine. Annual Review of Lin, W., Xue, C., Williams, B. A., Zaleski, C.,
Genomics and Human Genetics, 17, 353–373. Roder, M., Kokocinski, F., Abdelhamid, R. F.,
Dewey, F. E., Chen, R., Cordero, S. P., Ormond, K. E., Alioto, T., Antoshechkin, I., Baer, M. T., Batut, P.,
Caleshu, C., Karczewski, K. J., Whirl-Carrillo, M., Bell, I., Bell, K., Chakrabortty, S., Chrast, J.,
Wheeler, M. T., Dudley, J. T., Byrnes, J. K., Cornejo, Curado, J., Derrien, T., Drenkow, J., Dumais, E.,
O. E., Knowles, J. W., Woon, M., Sangkuhl, K., Dumais, J., Duttagupta, R., Fastuca, M., Fejes-
Gong, L., Thorn, C. F., Hebert, J. M., Capriotti, E., Toth, K., Ferreira, P., Foissac, S., Fullwood, M. J.,
David, S. P., Pavlovic, A., West, A., Thakuria, J. V., Gao, H., Gonzalez, D., Gordon, A., Howald, C.,
Ball, M. P., Zaranek, A. W., Rehm, H. L., Church, Jha, S., Johnson, R., Kapranov, P., King, B.,
G. M., West, J. S., Bustamante, C. D., Snyder, M., Kingswood, C., Li, G., Luo, O. J., Park, E., Preall,
Altman, R. B., Klein, T. E., Butte, A. J., & Ashley, J. B., Presaud, K., Ribeca, P., Robyr, D., Ruan, X.,
E. A. (2011). Phased whole-genome genetic risk in a Sammeth, M., Sandu, K. S., Schaeffer, L., See,
family quartet using a major allele reference L. H., Shahab, A., Skancke, J., Suzuki, A. M.,
sequence. PLoS Genetics, 7(9), e1002280. Takahashi, H., Tilgner, H., Trout, D., Walters, N.,
Doan, S., Lin, K. W., Conway, M., Ohno-Machado, L., Wang, H., Hayashizaki, Y., Reymond, A.,
Hsieh, A., Feupe, S. F., Garland, A., Ross, M. K., Antonarakis, S. E., Hannon, G. J., Ruan, Y.,
Jiang, X., Farzaneh, S., Walker, R., Alipanah, N., Carninci, P., Sloan, C. A., Learned, K., Malladi,
Zhang, J., Xu, H., & Kim, H. E. (2014). PhenDisco: V. S., Wong, M. C., Barber, G. P., Cline, M. S.,
Phenotype discovery system for the database of Dreszer, T. R., Heitner, S. G., Karolchik, D.,
genotypes and phenotypes. Journal of the American Kirkup, V. M., Meyer, L. R., Long, J. C., Maddren,
Medical Informatics Association, 21(1), 31–36. M., Raney, B. J., Grasfeder, L. L., Giresi, P. G.,
Du, X., Callister, S. J., Manes, N. P., Adkins, J. N., Battenhouse, A., Sheffield, N. C., Showers, K. A.,
Alexandridis, R. A., Zeng, X., Roh, J. H., Smith, London, D., Bhinge, A. A., Shestak, C., Schaner,
W. E., Donohue, T. J., Kaplan, S., Smith, R. D., & M. R., Kim, S. K., Zhang, Z. Z., Mieczkowski,
Lipton, M. S. (2008). A computational strategy to P. A., Mieczkowska, J. O., Liu, Z., McDaniell,
analyze label-free temporal bottom-up proteomics R. M., Ni, Y., Rashid, N. U., Kim, M. J., Adar, S.,
data. Journal of Proteome Research, 7(7), 2595– Zhang, Z., Wang, T., Winter, D., Keefe, D., Iyer,
2604. V. R., Sandhu, K. S., Zheng, M., Wang, P., Gertz,
904 J. D. Tenenbaum et al.
J., Vielmetter, J., Partridge, E. C., Varley, K. E., N. P., Huang, H., Li, J. J., Noble, W. S., Bilmes,
Gasper, C., Bansal, A., Pepke, S., Jain, P., Amrhein, J. A., Buske, O. J., Sahu, A. O., Kharchenko, P. V.,
H., Bowling, K. M., Anaya, M., Cross, M. K., Park, P. J., Baker, D., Taylor, J., & Lochovsky, L.
Muratet, M. A., Newberry, K. M., McCue, K., (2012). An integrated encyclopedia of DNA ele-
Nesmith, A. S., Fisher-Aylor, K. I., Pusey, B., ments in the human genome. Nature, 489(7414),
DeSalvo, G., Parker, S. L., Balasubramanian, S., 57–74.
Davis, N. S., Meadows, S. K., Eggleston, T., Edgar, R., Domrachev, M., & Lash, A. E. (2002). Gene
Newberry, J. S., Levy, S. E., Absher, D. M., Wong, Expression Omnibus: NCBI gene expression and
W. H., Blow, M. J., Visel, A., Pennachio, L. A., hybridization array data repository. Nucleic Acids
Elnitski, L., Petrykowska, H. M., Abyzov, A., Research, 30(1), 207–210.
26 Aken, B., Barrell, D., Barson, G., Berry, A., Bignell, Fabregat, A., Jupe, S., Matthews, L., Sidiropoulos, K.,
A., Boychenko, V., Bussotti, G., Davidson, C., Gillespie, M., Garapati, P., Haw, R., Jassal, B.,
Despacio-Reyes, G., Diekhans, M., Ezkurdia, I., Korninger, F., May, B., Milacic, M., Roca, C. D.,
Frankish, A., Gilbert, J., Gonzalez, J. M., Griffiths, Rothfels, K., Sevilla, C., Shamovsky, V., Shorser, S.,
E., Harte, R., Hendrix, D. A., Hunt, T., Jungreis, I., Varusai, T., Viteri, G., Weiser, J., Wu, G., Stein, L.,
Kay, M., Khurana, E., Leng, J., Lin, M. F., Hermjakob, H., & D’Eustachio, P. (2018). The reac-
Loveland, J., Lu, Z., Manthravadi, D., Mariotti, tome pathway knowledgebase. Nucleic Acids
M., Mudge, J., Mukherjee, G., Notredame, C., Pei, Research, 46(D1), D649–D655.
B., Rodriguez, J. M., Saunders, G., Sboner, A., Frueh, F. W., & Gurwitz, D. (2004). From pharmacoge-
Searle, S., Sisu, C., Snow, C., Steward, C., Tapanan, netics to personalized medicine: A vital need for
E., Tress, M. L., van Baren, M. J., Washieti, S., educating health professionals and the community.
Wilming, L., Zadissa, A., Zhengdong, Z., Brent, Pharmacogenomics, 5(5), 571–579.
M., Haussler, D., Valencia, A., Raymond, A., Gamazon, E. R., Huang, R. S., Cox, N. J., & Dolan,
Addleman, N., Alexander, R. P., Auerbach, R. K., M. E. (2010). Chemotherapeutic drug susceptibility
Bettinger, K., Bhardwaj, N., Boyle, A. P., Cao, associated SNPs are enriched in expression quanti-
A. R., Cayting, P., Charos, A., Cheng, Y., Eastman, tative trait loci. Proceedings of the National Academy
C., Euskirchen, G., Fleming, J. D., Grubert, F., of Sciences of the United States of America, 107(20),
Habegger, L., Hariharan, M., Harmanci, A., 9287–9292.
Iyenger, S., Jin, V. X., Karczewski, K. J., Kasowski, Garber, M. E., Saldanha, A., Parker, J. S., Jones, W. D.,
M., Lacroute, P., Lam, H., Larnarre-Vincent, N., Kaukinen, K., Laurila, K., Lahdeaho, M. L.,
Lian, J., Lindahl-Allen, M., Min, R., Miotto, B., Khatri, P., Khosla, C., Adelman, D. C., & Maki, M.
Monahan, H., Moqtaderi, Z., Mu, X. J., O’Geen, (2017). A B-cell gene signature correlates with the
H., Ouyang, Z., Patacsil, D., Raha, D., Ramirez, L., extent of gluten-induced intestinal injury in celiac
Reed, B., Shi, M., Slifer, T., Witt, H., Wu, L., Xu, disease. Cellular and Molecular Gastroenterology
X., Yan, K. K., Yang, X., Struhl, K., Weissman, and Hepatology, 4(1), 1–17.
S. M., Tenebaum, S. A., Penalva, L. O., Karmakar, Garten, Y., & Altman, R. B. (2009). Pharmspresso: A
S., Bhanvadia, R. R., Choudhury, A., Domanus, text mining tool for extraction of pharmacogenomic
M., Ma, L., Moran, J., Victorsen, A., Auer, T., concepts and relationships from full text. BMC
Centarin, L., Eichenlaub, M., Gruhl, F., Heerman, Bioinformatics, 10(Suppl 2), S6.
S., Hoeckendorf, B., Inoue, D., Kellner, T., Garten, Y., Coulet, A., & Altman, R. B. (2010). Recent
Kirchmaier, S., Mueller, C., Reinhardt, R., Schertel, progress in automatically extracting information
L., Schneider, S., Sinn, R., Wittbrodt, B., Wittbrodt, from the pharmacogenomic literature.
J., Jain, G., Balasundaram, G., Bates, D. L., Byron, Pharmacogenomics, 11(10), 1467–1489.
R., Canfield, T. K., Diegel, M. J., Dunn, D., Ge, D., Ruzzo, E. K., Shianna, K. V., He, M., Pelak, K.,
Ebersol, A. K., Frum, T., Garg, K., Gist, E., Heinzen, E. L., Need, A. C., Cirulli, E. T., Maia,
Hansen, R. S., Boatman, L., Haugen, E., Humbert, J. M., Dickson, S. P., Zhu, M., Singh, A., Allen,
R., Johnson, A. K., Johnson, E. M., Kutyavin, A. S., & Goldstein, D. B. (2011). SVA: Software for
T. M., Lee, K., Lotakis, D., Maurano, M. T., Neph, annotating and visualizing sequenced human
S. J., Neri, F. V., Nguyen, E. D., Qu, H., Reynolds, genomes. Bioinformatics, 27(14), 1998–2000.
A. P., Roach, V., Rynes, E., Sanchez, M. E., Ghazvinian, A., Noy, N. F., & Musen, M. A. (2009).
Sandstrom, R. S., Shafer, A. O., Stergachis, A. B., Creating mappings for ontologies in biomedicine:
Thomas, S., Vernot, B., Vierstra, J., Vong, S., Simple methods work. In AMIA annual symposium
Weaver, M. A., Yan, Y., Zhang, M., Akey, J. A., proceedings 2009 (pp. 198–202).
Bender, M., Dorschner, M. O., Groudine, M., Ghazvinian, A., Noy, N. F., & Musen, M. A. (2011).
MacCoss, M. J., Navas, P., Stamatoyannopoulos, How orthogonal are the OBO Foundry ontologies?
G., Stamatoyannopoulos, J. A., Beal, K., Brazma, Journal of Biomedical Semantics, 2(Suppl 2), S2.
A., Flicek, P., Johnson, N., Lukk, M., Luscombe, Goh, K. I., Cusick, M. E., Valle, D., Childs, B., Vidal,
N. M., Sobral, D., Vaquerizas, J. M., Batzoglou, S., M., & Barabasi, A. L. (2007). The human disease
Sidow, A., Hussami, N., Kyriazopoulou- network. Proceedings of the National Academy of
Panagiotopoulou, S., Libbrecht, M. W., Schaub, Sciences of the United States of America, 104(21),
M. A., Miller, W., Bickel, P. J., Banfai, B., Boley, 8685–8690.
Translational Bioinformatics
905 26
Grando, A., & Schwab, R. (2013). Building and evaluat- Jonquet, C., Musen, M. A., & Shah, N. H. (2010).
ing an ontology-based tool for reasoning about con- Building a biomedical ontology recommender web
sent permission. In AMIA annual symposium service. Journal of Biomedical Semantics, 1(Suppl 1),
proceedings 2013 (pp. 514–523). S1.
Green, R. C., Roberts, J. S., Cupples, L. A., Relkin, Jonquet, C., LePendu, P., Falconer, S., Coulet, A., Noy,
N. R., Whitehouse, P. J., Brown, T., Eckert, S. L., N. F., Musen, M. A., & Shah, N. H. (2011). NCBO
Butson, M., Sadovnick, A. D., Quaid, K. A., Chen, Resource Index: Ontology-based search and mining
C., Cook-Deegan, R., & Farrer, L. A. (2009). of biomedical resources. Web Semantics: Science,
Disclosure of APOE genotype for risk of Services and Agents on the World Wide Web, 9(3),
Alzheimer’s disease. The New England Journal of 316–324.
Medicine, 361(3), 245–254. Joshi-Tope, G., Gillespie, M., Vastrik, I., D’Eustachio,
Gregory, S. (2011). P., Schmidt, E., de Bono, B., Jassal, B., Gopinath,
Hadley, D., Pan, J., El-Sayed, O., Aljabban, J., Aljabban, G. R., Wu, G. R., Matthews, L., Lewis, S., Birney,
I., Azad, T. D., Hadied, M. O., Raza, S., Rayikanti, E., & Stein, L. (2005). Reactome: A knowledgebase
B. A., Chen, B., Paik, H., Aran, D., Spatz, J., of biological pathways. Nucleic Acids Research,
Himmelstein, D., Panahiazar, M., Bhattacharya, S., 33(Database issue), D428–D432.
Sirota, M., Musen, M. A., & Butte, A. J. (2017). Kang, M. J., Lee, C. G., Cho, S. J., Homer, R. J., & Elias,
Precision annotation of digital samples in NCBI’s J. A. (2006). IFN-gamma-dependent DNA injury
gene expression omnibus. Scientific Data, 4, 170125. and/or apoptosis are critical in cigarette smoke-
Hansen, N. T., Brunak, S., & Altman, R. B. (2009). induced murine emphysema. Proceedings of the
Generating genome-scale candidate gene lists for American Thoracic Society, 3(6), 517–518.
pharmacogenomics. Clinical Pharmacology and Kaye, J., Heeney, C., Hawkins, N., de Vries, J., &
Therapeutics, 86(2), 183–189. Boddington, P. (2009). Data sharing in genomics-
Heikamp, K., & Bajorath, J. (2011). Large-scale similar- -re-shaping scientific practice. Nature Reviews.
ity search profiling of ChEMBL compound data Genetics, 10(5), 331–335.
sets. Journal of Chemical Information and Modeling, Kaye J, Hawkins N. (2014). Data sharing policy design
51(8), 1831–1839. for consortia: challenges for sustainability. Genome
Hoffman, M. A. (2007). The genome-enabled electronic Med. 29;6(1):4. https://doi.org/10.1186/gm523.
medical record. Journal of Biomedical Informatics, PMID: 24475754; PMCID: PMC3978924.
40(1), 44–46. Keiser, M. J., Setola, V., Irwin, J. J., Laggner, C., Abbas,
Homer N, Szelinger S, Redman M, Duggan D, Tembe A. I., Hufeisen, S. J., Jensen, N. H., Kuijer, M. B.,
W, Muehling J, Pearson JV, Stephan DA, Nelson SF, Matos, R. C., Tran, T. B., Whaley, R., Glennon,
Craig DW. (2008). Resolving individuals contribut- R. A., Hert, J., Thomas, K. L., Edwards, D. D.,
ing trace amounts of DNA to highly complex mix- Shoichet, B. K., & Roth, B. L. (2009). Predicting
tures using high-density SNP genotyping new molecular targets for known drugs. Nature,
microarrays. PLoS Genet. 29;4(8):e1000167. https:// 462(7270), 175–181.
doi.org/10.1371/journal.pgen.1000167. PMID: Khatri, P., & Draghici, S. (2005). Ontological analysis
18769715; PMCID: PMC2516199. of gene expression data: Current tools, limitations,
Homerova, D., Sprusansky, O., Kutejova, E., & and open problems. Bioinformatics, 21(18), 3587–
Kormanec, J. (2002). Some features of DNA- 3595.
binding proteins involved in the regulation of the Kho, A. N., Pacheco, J. A., Peissig, P. L., Rasmussen, L.,
Streptomyces aureofaciens gap gene, encoding Newton, K. M., Weston, N., Crane, P. K., Pathak,
glyceraldehyde-3-phosphate dehydrogenase. Folia J., Chute, C. G., Bielinski, S. J., Kullo, I. J., Li, R.,
Microbiologia (Praha), 47(4), 311–317. Manolio, T. A., Chisholm, R. L., & Denny, J. C.
Hsi-Yang Fritz, M., Leinonen, R., Cochrane, G., & (2011). Electronic medical records for genetic
Birney, E. (2011). Efficient storage of high through- research: results of the eMERGE consortium.
put DNA sequencing data using reference-based Science Translational Medicine, 3(79), 79re71.
compression. Genome Research, 21(5), 734–740. Kim, H., Bell, E., Kim, J., Sitapati, A., Ramsdell, J.,
Hu, J. X., Thomas, C. E., & Brunak, S. (2016). Network Farcas, C., Friedman, D., Feupe, S. F., & Ohno-
biology concepts in complex disease comorbidities. Machado, L. (2017). iCONCUR: Informed consent
Nature Reviews. Genetics, 17(10), 615–629. for clinical data and bio-sample use for research.
Hudson KL, Holohan MK, Collins FS. (2008). pace Journal of the American Medical Informatics
with the times--the Genetic Information Association, 24(2), 380–387.
Nondiscrimination Act of 2008. N Engl J Med. Kinnings, S. L., Liu, N., Buchmeier, N., Tonge, P. J., Xie,
19;358(25):2661-3. https://doi.org/10.1056/ L., & Bourne, P. E. (2009). Drug discovery using
NEJMp0803964. PMID: 18565857. chemical systems biology: Repositioning the safe
Irwin, J. J., & Shoichet, B. K. (2005). ZINC--a free data- medicine Comtan to treat multi-drug and exten-
base of commercially available compounds for vir- sively drug resistant tuberculosis. PLoS
tual screening. Journal of Chemical Information and Computational Biology, 5(7), e1000423.
Modeling, 45(1), 177–182. Knox, C., Law, V., Jewison, T., Liu, P., Ly, S., Frolkis,
A., Pon, A., Banco, K., Mak, C., Neveu, V.,
906 J. D. Tenenbaum et al.
Djoumbou, Y., Eisner, R., Guo, A. C., & Wishart, K. D., Miner, T. L., Delehaunty, A., Kramer, J. B.,
D. S. (2011). DrugBank 3.0: A comprehensive Cook, L. L., Fulton, R. S., Johnson, D. L., Minx,
resource for ‘omics’ research on drugs. Nucleic Acids P. J., Clifton, S. W., Hawkins, T., Branscomb, E.,
Research, 39(Suppl 1), D1035–D1041. Predki, P., Richardson, P., Wenning, S., Slezak, T.,
Kolesnikov, N., Hastings, E., Keays, M., Melnichuk, O., Doggett, N., Cheng, J. F., Olsen, A., Lucas, S.,
Tang, Y. A., Williams, E., Dylag, M., Kurbatova, Elkin, C., Uberbacher, E., Frazier, M., Gibbs, R. A.,
N., Brandizi, M., Burdett, T., Megy, K., Pilicheva, Muzny, D. M., Scherer, S. E., Bouck, J. B.,
E., Rustici, G., Tikhonov, A., Parkinson, H., Sodergren, E. J., Worley, K. C., Rives, C. M.,
Petryszak, R., Sarkans, U., & Brazma, A. (2015). Gorrell, J. H., Metzker, M. L., Naylor, S. L.,
ArrayExpress update--simplifying data submis- Kucherlapati, R. S., Nelson, D. L., Weinstock,
26 sions. Nucleic Acids Research, 43(Database issue), G. M., Sakaki, Y., Fujiyama, A., Hattori, M., Yada,
D1113–D1116. T., Toyoda, A., Itoh, T., Kawagoe, C., Watanabe, H.,
Kraus, V. B., Burnett, B., Coindreau, J., Cottrell, S., Totoki, Y., Taylor, T., Weissenbach, J., Heilig, R.,
Eyre, D., Gendreau, M., Gardiner, J., Garnero, P., Saurin, W., Artiguenave, F., Brottier, P., Bruls, T.,
Hardin, J., Henrotin, Y., Heinegard, D., Ko, A., Pelletier, E., Robert, C., Wincker, P., Smith, D. R.,
Lohmander, L. S., Matthews, G., Menetski, J., Doucette-Stamm, L., Rubenfield, M., Weinstock,
Moskowitz, R., Persiani, S., Poole, A. R., Rousseau, K., Lee, H. M., Dubois, J., Rosenthal, A., Platzer,
J. C., & Todman, M. (2011). Application of bio- M., Nyakatura, G., Taudien, S., Rump, A., Yang,
markers in the development of drugs intended for H., Yu, J., Wang, J., Huang, G., Gu, J., Hood, L.,
the treatment of osteoarthritis. Osteoarthritis Rowen, L., Madan, A., Qin, S., Davis, R. W.,
Cartilage, 19(5), 515–542. Federspiel, N. A., Abola, A. P., Proctor, M. J.,
Kuhn, M., Campillos, M., Letunic, I., Jensen, L. J., & Myers, R. M., Schmutz, J., Dickson, M., Grimwood,
Bork, P. (2010). A side effect resource to capture J., Cox, D. R., Olson, M. V., Kaul, R., Shimizu, N.,
phenotypic effects of drugs. Molecular Systems Kawasaki, K., Minoshima, S., Evans, G. A.,
Biology, 6, 343. Athanasiou, M., Schultz, R., Roe, B. A., Chen, F.,
Kweon, J., & Kim, Y. (2018). High-throughput genetic Pan, H., Ramser, J., Lehrach, H., Reinhardt, R.,
screens using CRISPR-Cas9 system. Archives of McCombie, W. R., de la Bastide, M., Dedhia, N.,
Pharmacal Research, 41(9), 875–884. Blocker, H., Hornischer, K., Nordsiek, G.,
Lamb, J. (2007). The Connectivity Map: A new tool for Agarwala, R., Aravind, L., Bailey, J. A., Bateman,
biomedical research. Nature Reviews. Cancer, 7(1), A., Batzoglou, S., Birney, E., Bork, P., Brown, D. G.,
54–60. Burge, C. B., Cerutti, L., Chen, H. C., Church, D.,
Lamb, J., Crawford, E. D., Peck, D., Modell, J. W., Blat, Clamp, M., Copley, R. R., Doerks, T., Eddy, S. R.,
I. C., Wrobel, M. J., Lerner, J., Brunet, J. P., Eichler, E. E., Furey, T. S., Galagan, J., Gilbert,
Subramanian, A., Ross, K. N., Reich, M., J. G., Harmon, C., Hayashizaki, Y., Haussler, D.,
Hieronymus, H., Wei, G., Armstrong, S. A., Hermjakob, H., Hokamp, K., Jang, W., Johnson,
Haggarty, S. J., Clemons, P. A., Wei, R., Carr, S. A., L. S., Jones, T. A., Kasif, S., Kaspryzk, A., Kennedy,
Lander, E. S., & Golub, T. R. (2006). The S., Kent, W. J., Kitts, P., Koonin, E. V., Korf, I.,
Connectivity Map: Using gene-expression signa- Kulp, D., Lancet, D., Lowe, T. M., McLysaght, A.,
tures to connect small molecules, genes, and disease. Mikkelsen, T., Moran, J. V., Mulder, N., Pollara,
Science, 313(5795), 1929–1935. V. J., Ponting, C. P., Schuler, G., Schultz, J., Slater,
Lander, E. S., Linton, L. M., Birren, B., Nusbaum, C., G., Smit, A. F., Stupka, E., Szustakowski, J.,
Zody, M. C., Baldwin, J., Devon, K., Dewar, K., Thierry-Mieg, D., Thierry-Mieg, J., Wagner, L.,
Doyle, M., FitzHugh, W., Funke, R., Gage, D., Wallis, J., Wheeler, R., Williams, A., Wolf, Y. I.,
Harris, K., Heaford, A., Howland, J., Kann, L., Wolfe, K. H., Yang, S. P., Yeh, R. F., Collins, F.,
Lehoczky, J., LeVine, R., McEwan, P., McKernan, Guyer, M. S., Peterson, J., Felsenfeld, A.,
K., Meldrim, J., Mesirov, J. P., Miranda, C., Morris, Wetterstrand, K. A., Patrinos, A., Morgan, M. J., de
W., Naylor, J., Raymond, C., Rosetti, M., Santos, Jong, P., Catanese, J. J., Osoegawa, K., Shizuya, H.,
R., Sheridan, A., Sougnez, C., Stange-Thomann, Choi, S., & Chen, Y. J. (2001). Initial sequencing and
N., Stojanovic, N., Subramanian, A., Wyman, D., analysis of the human genome. Nature, 409(6822),
Rogers, J., Sulston, J., Ainscough, R., Beck, S., 860–921.
Bentley, D., Burton, J., Clee, C., Carter, N., Coulson, Landrigan, P. J., Trasande, L., Thorpe, L. E., Gwynn,
A., Deadman, R., Deloukas, P., Dunham, A., C., Lioy, P. J., D’Alton, M. E., Lipkind, H. S.,
Dunham, I., Durbin, R., French, L., Grafham, D., Swanson, J., Wadhwa, P. D., Clark, E. B., Rauh,
Gregory, S., Hubbard, T., Humphray, S., Hunt, A., V. A., Perera, F. P., & Susser, E. (2006). The National
Jones, M., Lloyd, C., McMurray, A., Matthews, L., Children’s Study: A 21-year prospective study of
Mercer, S., Milne, S., Mullikin, J. C., Mungall, A., 100,000 American children. Pediatrics, 118(5),
Plumb, R., Ross, M., Shownkeen, R., Sims, S., 2173–2186.
Waterston, R. H., Wilson, R. K., Hillier, L. W., Landrum, M. J., Lee, J. M., Benson, M., Brown, G.,
McPherson, J. D., Marra, M. A., Mardis, E. R., Chao, C., Chitipiralla, S., Gu, B., Hart, J., Hoffman,
Fulton, L. A., Chinwalla, A. T., Pepin, K. H., Gish, D., Hoover, J., Jang, W., Katz, K., Ovetsky, M.,
W. R., Chissoe, S. L., Wendl, M. C., Delehaunty, Riley, G., Sethi, A., Tully, R., Villamarin-Salomon,
Translational Bioinformatics
907 26
R., Rubinstein, W., & Maglott, D. R. (2016). (2011). A mosaic activating mutation in AKT1 asso-
ClinVar: Public archive of interpretations of clini- ciated with the Proteus syndrome. The New England
cally relevant variants. Nucleic Acids Research, Journal of Medicine, 365(7), 611–619.
44(D1), D862–D868. Liu, T., & Altman, R. B. (2015). Relating essential pro-
Larmuseau, M. H. D., Matthijs, K., & Wenseleers, T. teins to drug side-effects using canonical compo-
(2016). Cuckolded fathers rare in human popula- nent analysis: A structure-based approach. Journal
tions. Trends in Ecology & Evolution, 31(5), 327–329. of Chemical Information and Modeling, 55(7), 1483–
Lek, M., Karczewski, K. J., Minikel, E. V., Samocha, 1494.
K. E., Banks, E., Fennell, T., O’Donnell-Luria, Liu, T., Lin, Y., Wen, X., Jorissen, R. N., & Gilson,
A. H., Ware, J. S., Hill, A. J., Cummings, B. B., M. K. (2007). BindingDB: A web-accessible data-
Tukiainen, T., Birnbaum, D. P., Kosmicki, J. A., base of experimentally determined protein-ligand
Duncan, L. E., Estrada, K., Zhao, F., Zou, J., binding affinities. Nucleic Acids Research,
Pierce-Hoffman, E., Berghout, J., Cooper, D. N., 35(Database issue), D198–D201.
Deflaux, N., DePristo, M., Do, R., Flannick, J., Liu, M., Shah, A., Jiang, M., Peterson, N. B., Dai, Q.,
Fromer, M., Gauthier, L., Goldstein, J., Gupta, N., Aldrich, M. C., Chen, Q., Bowton, E. A., Liu, H.,
Howrigan, D., Kiezun, A., Kurki, M. I., Moonshine, Denny, J. C., & Xu, H. (2012). A study of transport-
A. L., Natarajan, P., Orozco, L., Peloso, G. M., ability of an existing smoking status detection mod-
Poplin, R., Rivas, M. A., Ruano-Rubio, V., Rose, ule across institutions. In AMIA annual symposium
S. A., Ruderfer, D. M., Shakir, K., Stenson, P. D., proceedings (pp. 577–586).
Stevens, C., Thomas, B. P., Tiao, G., Tusie-Luna, Lunshof, J. E., Bobe, J., Aach, J., Angrist, M., Thakuria,
M. T., Weisburd, B., Won, H. H., Yu, D., Altshuler, J. V., Vorhaus, D. B., Hoehe, M. R., & Church,
D. M., Ardissino, D., Boehnke, M., Danesh, J., G. M. (2010). Personal genomes in progress: From
Donnelly, S., Elosua, R., Florez, J. C., Gabriel, S. B., the human genome project to the personal genome
Getz, G., Glatt, S. J., Hultman, C. M., Kathiresan, project. Dialogues in Clinical Neuroscience, 12(1),
S., Laakso, M., McCarroll, S., McCarthy, M. I., 47–60.
McGovern, D., McPherson, R., Neale, B. M., MacArthur, J., Bowler, E., Cerezo, M., Gil, L., Hall, P.,
Palotie, A., Purcell, S. M., Saleheen, D., Scharf, Hastings, E., Junkins, H., McMahon, A., Milano,
J. M., Sklar, P., Sullivan, P. F., Tuomilehto, J., A., Morales, J., Pendlington, Z. M., Welter, D.,
Tsuang, M. T., Watkins, H. C., Wilson, J. G., Daly, Burdett, T., Hindorff, L., Flicek, P., Cunningham,
M. J., MacArthur, D. G., & Exome Aggregation, C. F., & Parkinson, H. (2017). The new NHGRI-EBI
(2016). Analysis of protein-coding genetic variation Catalog of published genome-wide association
in 60,706 humans. Nature, 536(7616), 285–291. studies (GWAS Catalog). Nucleic Acids Research,
Lenfant, C. (2003). Shattuck lecture--clinical research to 45(D1), D896–D901.
clinical practice--lost in translation? The New Mailman, M. D., Feolo, M., Jin, Y., Kimura, M., Tryka,
England Journal of Medicine, 349(9), 868–874. K., Bagoutdinov, R., Hao, L., Kiang, A., Paschall,
LePendu, P., Musen, M. A., & Shah, N. H. (2011a). J., Phan, L., Popova, N., Pretel, S., Ziyabari, L., Lee,
Enabling enrichment analysis with the human dis- M., Shao, Y., Wang, Z. Y., Sirotkin, K., Ward, M.,
ease ontology. Journal of Biomedical Informatics, Kholodov, M., Zbicz, K., Beck, J., Kimelman, M.,
44(Suppl 1), S31–S38. Shevelev, S., Preuss, D., Yaschenko, E., Graeff, A.,
LePendu, P., Racunas, S. A., Iyer, S., Liu, Y., Fairon, C., Ostell, J., & Sherry, S. T. (2007). The NCBI dbGaP
& Shah, N. (2011b). Annotation analysis for testing database of genotypes and phenotypes. Nature
drug safety signals. In The 14th bio-ontologies SIG Genetics, 39(10), 1181–1186.
meeting at ISMB 2011. Vienna, Austria. Manolio TA, Collins FS, Cox NJ, Goldstein DB,
LePendu, P., Iyer, S. V., Bauer-Mehren, A., Harpaz, R., Hindorff LA, Hunter DJ, McCarthy MI, Ramos
Mortensen, J. M., Podchiyska, T., Ferris, T. A., & EM, Cardon LR, Chakravarti A, Cho JH,
Shah, N. H. (2013). Pharmacovigilance using clini- Guttmacher AE, Kong A, Kruglyak L, Mardis E,
cal notes. Clinical Pharmacology and Therapeutics, Rotimi CN, Slatkin M, Valle D, Whittemore AS,
93(6), 547–555. Boehnke M, Clark AG, Eichler EE, Gibson G,
Lindhurst, M. J., Sapp, J. C., Teer, J. K., Johnston, J. J., Haines JL, Mackay TF, McCarroll SA, Visscher
Finn, E. M., Peters, K., Turner, J., Cannons, J. L., PM. (2009) Finding the missing heritability of com-
Bick, D., Blakemore, L., Blumhorst, C., Brockmann, plex diseases. Nature. 8;461(7265):747–53. https://
K., Calder, P., Cherman, N., Deardorff, M. A., doi.org/10.1038/nature08494. PMID: 19812666;
Everman, D. B., Golas, G., Greenstein, R. M., PMCID: PMC2831613.
Kato, B. M., Keppler-Noreuil, K. M., Kuznetsov, Mayer, E. A., Knight, R., Mazmanian, S. K., Cryan,
S. A., Miyamoto, R. T., Newman, K., Ng, D., J. F., & Tillisch, K. (2014). Gut microbes and the
O’Brien, K., Rothenberg, S., Schwartzentruber, brain: Paradigm shift in neuroscience. The Journal
D. J., Singhal, V., Tirabosco, R., Upton, J., of Neuroscience, 34(46), 15490–15496.
Wientroub, S., Zackai, E. H., Hoag, K., Whitewood- Meckley, L. M., & Neumann, P. J. (2010). Personalized
Neal, T., Robey, P. G., Schwartzberg, P. L., Darling, medicine: Factors influencing reimbursement.
T. N., Tosi, L. L., Mullikin, J. C., & Biesecker, L. G. Health Policy, 94(2), 91–100.
908 J. D. Tenenbaum et al.
Miller, G. C., & Britt, H. (1995). A new drug classifica- Greely, H. T. (2010). Challenges in the clinical appli-
tion for computer systems: The ATC extension cation of whole-genome sequencing. Lancet,
code. International Journal of Bio-Medical 375(9727), 1749–1751.
Computing, 40(2), 121–124. Overington, J. (2009). ChEMBL. An interview with
Mork, J. G., Bodenreider, O., Demner-Fushman, D., John Overington, team leader, chemogenomics at
Dogan, R. I., Lang, F. M., Lu, Z., Neveol, A., the European Bioinformatics Institute Outstation
Peters, L., Shooshan, S. E., & Aronson, A. R. of the European Molecular Biology Laboratory
(2010). Extracting Rx information from clinical nar- (EMBL-EBI). Interview by Wendy A. Warr. Journal
rative. Journal of the American Medical Informatics of Computer-Aided Molecular Design, 23(4), 195–
Association, 17(5), 536–539. 198.
26 Mort, M., Evani, U. S., Krishnan, V. G., Kamati, K. K., Parrish, F., Do, N., Bouhaddou, O., & Warnekar, P.
Baenziger, P. H., Bagchi, A., Peters, B. J., Sathyesh, (2006). Implementation of RxNorm as a terminol-
R., Li, B., Sun, Y., Xue, B., Shah, N. H., Kann, ogy mediation standard for exchanging pharmacy
M. G., Cooper, D. N., Radivojac, P., & Mooney, medication between federal agencies. In AMIA
S. D. (2010). In silico functional profiling of human annual symposium proceedings (p. 1057).
disease-associated and polymorphic amino acid Pencina, M. J., D’Agostino, R. B., Sr., D’Agostino,
substitutions. Human Mutation, 31(3), 335–346. R. B., Jr., & Vasan, R. S. (2008). Evaluating the
Musen, M. A., Noy, N. F., Shah, N. H., Chute, C. G., added predictive ability of a new marker: From area
Store, M.-A., Smith, B., & NCBO team. (2011). The under the ROC curve to reclassification and beyond.
National Center for Biomedical Ontology. Journal Statistics in Medicine, 27(2), 157–172; discussion
of the American Medical Informatics Association, 207–112.
19(2), 190–195. Percha, B., & Altman, R. B. (2015). Learning the struc-
Musen, M. A., Bean, C. A., Cheung, K. H., Dumontier, ture of biomedical relationships from unstructured
M., Durante, K. A., Gevaert, O., Gonzalez-Beltran, text. PLoS Computational Biology, 11(7), e1004216.
A., Khatri, P., Kleinstein, S. H., O’Connor, M. J., Percha, B., & Altman, R. B. (2018). A global network of
Pouliot, Y., Rocca-Serra, P., Sansone, S. A., Wiser, biomedical relationships derived from text.
J. A., & CEDAR team. (2015). The center for expanded Bioinformatics, 34(15), 2614–2624.
data annotation and retrieval. Journal of the American Pollack, A. (2011). Ruling Upholds gene patent in can-
Medical Informatics Association, 22(6), 1148–1152. cer test. The New York Times. New York City, The
Ng, S. B., Buckingham, K. J., Lee, C., Bigham, A. W., New York Times Company.
Tabor, H. K., Dent, K. M., Huff, C. D., Shannon, Rehm, H. L., Berg, J. S., Brooks, L. D., Bustamante,
P. T., Jabs, E. W., Nickerson, D. A., Shendure, J., & C. D., Evans, J. P., Landrum, M. J., Ledbetter,
Bamshad, M. J. (2010). Exome sequencing identifies D. H., Maglott, D. R., Martin, C. L., Nussbaum,
the cause of a mendelian disorder. Nature Genetics, R. L., Plon, S. E., Ramos, E. M., Sherry, S. T.,
42(1), 30–35. Watson, M. S., & ClinGen. (2015). ClinGen--the
Nicolae, D. L., Gamazon, E., Zhang, W., Duan, S., clinical genome resource. The New England Journal
Dolan, M. E., & Cox, N. J. (2010). Trait-associated of Medicine, 372(23), 2235–2242.
SNPs are more likely to be eQTLs: Annotation to Rhee, S. Y., Wood, V., Dolinski, K., & Draghici, S.
enhance discovery from GWAS. PLoS Genetics, (2008). Use and misuse of the gene ontology anno-
6(4), e1000888. tations. Nature Reviews Genetics, 9(7), 509–515.
NIH broadens genomic data-sharing policy. (2014). Ritchie, M. D., Denny, J. C., Crawford, D. C., Ramirez,
Cancer Discovery, 4(12): OF4. https://pubmed.ncbi. A. H., Weiner, J. B., Pulley, J. M., Basford, M. A.,
nlm.nih.gov/25477121/ Brown-Gentry, K., Balser, J. R., Masys, D. R.,
Nikolsky, Y., Ekins, S., Nikolskaya, T., & Bugrim, A. Haines, J. L., & Roden, D. M. (2010). Robust repli-
(2005). A novel method for generation of signature cation of genotype-phenotype associations across
networks as biomarkers from complex high through- multiple diseases in an electronic medical record.
put data. Toxicology Letters, 158(1), 20–29. American Journal of Human Genetics, 86(4), 560–
Novack, D. H., Plumer, R., Smith, R. L., Ochitill, H., 572.
Morrow, G. R., & Bennett, J. M. (1979). Changes in Rocca-Serra, P., Brandizi, M., Maguire, E., Sklyar, N.,
physicians’ attitudes toward telling the cancer Taylor, C., Begley, K., Field, D., Harris, S., Hide,
patient. JAMA, 241(9), 897–900. W., Hofmann, O., Neumann, S., Sterk, P., Tong, W.,
Obeid, J., Gabriel, D., & Sanderson, I. (2010). A bio- & Sansone, S. A. (2010). ISA software suite:
medical research permissions ontology: cognitive Supporting standards-compliant experimental
and knowledge representation considerations. In annotation and enabling curation at the community
Proceedings of the 2010 workshop on governance of level. Bioinformatics, 26(18), 2354–2356.
technology, information and policies. Austin, Texas: Rohrer Vitek, C. R., Abul-Husn, N. S., Connolly, J. J.,
ACM. Hartzler, A. L., Kitchner, T., Peterson, J. F.,
Olson, S. (2007). Who’s your daddy? The Atlantic. July/ Rasmussen, L. V., Smith, M. E., Stallings, S.,
August. Williams, M. S., Wolf, W. A., & Prows, C. A. (2017).
Ormond, K. E., Wheeler, M. T., Hudgins, L., Klein, Healthcare provider education to support integra-
T. E., Butte, A. J., Altman, R. B., Ashley, E. A., & tion of pharmacogenomics in practice: The
Translational Bioinformatics
909 26
eMERGE Network experience. Pharmacogenomics, let effect and clinical efficacy of clopidogrel therapy.
18(10), 1013–1025. JAMA, 302(8), 849–857.
Ross, D. T., Scherf, U., Eisen, M. B., Perou, C. M., Singer, E. (2011). A family learns the secrets of its
Rees, C., Spellman, P., Iyer, V., Jeffrey, S. S., Van de genomes. MIT Technology Review (Sept).
Rijn, M., Waltham, M., Pergamenschikov, A., Lee, Sirota, M., Dudley, J. T., Kim, J., Chiang, A. P., Morgan,
J. C., Lashkari, D., Shalon, D., Myers, T. G., A. A., Sweet-Cordero, A., Sage, J., & Butte, A. J.
Weinstein, J. N., Botstein, D., & Brown, P. O. (2011). Discovery and preclinical validation of drug
(2000). Systematic variation in gene expression indications using compendia of public gene expres-
patterns in human cancer cell lines. Nature sion data. Science Translational Medicine, 3(96),
Genetics, 24(3), 227–235. 96ra77.
Rusk, N. (2010). Expanding HapMap. Nature Methods, Spackman, K. A., Campbell, K. E., & Cote, R. A.
7(10), 780–781. (1997). SNOMED RT: a reference terminology for
Sarkar, I. N., Butte, A. J., Lussier, Y. A., Tarczy- health care. In Proceedings of the AMIA annual fall
Hornoch, P., & Ohno-Machado, L. (2011). symposium (pp. 640–644).
Translational bioinformatics: Linking knowledge Stein, L. D. (2010). The case for cloud computing in
across biological and clinical realms. Journal of the genome informatics. Genome Biology, 11(5), 207.
American Medical Informatics Association, 18(4), Stenson, P. D., Mort, M., Ball, E. V., Howells, K.,
354–357. Phillips, A. D., Thomas, N. S., & Cooper, D. N.
Schadt, E. E. (2012). The changing privacy landscape in (2009). The human gene mutation database: 2008
the era of big data. Molecular Systems Biology, 8, update. Genome Medicine, 1(1), 13.
612. Stenson, P. D., Mort, M., Ball, E. V., Evans, K., Hayden,
Schaefer, C. F., Anthony, K., Krupa, S., Buchoff, J., M., Heywood, S., Hussain, M., Phillips, A. D., &
Day, M., Hannay, T., & Buetow, K. H. (2009). PID: Cooper, D. N. (2017). The Human Gene Mutation
The pathway interaction database. Nucleic Acids Database: Towards a comprehensive repository of
Research, 37(Database issue), D674–D679. inherited mutation data for medical research,
Schwartz, D., & Collins, F. (2007). Medicine. genetic diagnosis and next-generation sequencing
Environmental biology and human disease. Science, studies. Human Genetics, 136(6), 665–677.
316(5825), 695–696. Steyerberg, E. W., Pencina, M. J., Lingsma, H. F.,
Seal, R. L., Gordon, S. M., Lush, M. J., Wright, M. W., Kattan, M. W., Vickers, A. J., & Van Calster, B.
& Bruford, E. A. (2011). genenames.org: The (2011). Assessing the incremental value of diagnos-
HGNC resources in 2011. Nucleic Acids Research, tic and prognostic markers: A review and illustra-
39(Database issue), D514–D519. tion. European Journal of Clinical Investigation,
Shah, N. H., & Musen, M. A. (2008). UMLS-Query: a 42(2), 216–228.
perl module for querying the UMLS. In AMIA Subramanian, A., Tamayo, P., Mootha, V. K.,
annual symposium proceedings (pp. 652–656). Mukherjee, S., Ebert, B. L., Gillette, M. A.,
Shah, N. H., Jonquet, C., Chiang, A. P., Butte, A. J., Paulovich, A., Pomeroy, S. L., Golub, T. R., Lander,
Chen, R., & Musen, M. A. (2009a). Ontology- E. S., & Mesirov, J. P. (2005). Gene set enrichment
driven indexing of public datasets for translational analysis: A knowledge-based approach for inter-
bioinformatics. BMC Bioinformatics, 10(Suppl 2), preting genome-wide expression profiles.
S1. Proceedings of the National Academy of Sciences of
Shah, S. H., Crosslin, D. R., Nelson, S., Haynes, C., the United States of America, 102(43), 15545–15550.
Boling, C. L., Bain, J., Muehlbauer, M., Stevens, Sung, N. S., Crowley, W. F., Jr., Genel, M., Salber, P.,
R. D., Brantley, P. J., Stevens, V., Hollis, J. F., Appel, Sandy, L., Sherwood, L. M., Johnson, S. B.,
L. J., Lien, L., Newgard, C. B., & Svetkey, L. P. Catanese, V., Tilson, H., Getz, K., Larson, E. L.,
(2009b). Metabolomic profiles are associated with Scheinberg, D., Reece, E. A., Slavkin, H., Dobs, A.,
baseline insulin resistance and improvement in insu- Grebb, J., Martinez, R. A., Korn, A., & Rimoin, D.
lin resistance with weight loss in the WLM trial. (2003). Central challenges facing the national clini-
Ciculation, 1(20), S467. cal research enterprise. JAMA, 289(10), 1278–1287.
Sherry, S. T., Ward, M. H., Kholodov, M., Baker, J., Sweeney, T. E., Azad, T. D., Donato, M., Haynes, W. A.,
Phan, L., Smigielski, E. M., & Sirotkin, K. (2001). Perumal, T. M., Henao, R., Bermejo-Martin, J. F.,
dbSNP: The NCBI database of genetic variation. Almansa, R., Tamayo, E., Howrylak, J. A., Choi,
Nucleic Acids Research, 29(1), 308–311. A., Parnell, G. P., Tang, B., Nichols, M., Woods,
Shortliffe, E. H., Califano, A., & Hunter, L. (2009). New C. W., Ginsburg, G. S., Kingsmore, S. F., Omberg,
JBI emphasis on translational bioinformatics. Journal L., Mangravite, L. M., Wong, H. R., Tsalik, E. L.,
of Biomedical Informatics, 42(2), 199–200. Langley, R. J., & Khatri, P. (2018a). Unsupervised
Shuldiner, A. R., O’Connell, J. R., Bliden, K. P., Gandhi, analysis of transcriptomics in bacterial sepsis across
A., Ryan, K., Horenstein, R. B., Damcott, C. M., multiple datasets reveals three robust clusters.
Pakyz, R., Tantry, U. S., Gibson, Q., Pollin, T. I., Critical Care Medicine, 46(6), 915–925.
Post, W., Parsa, A., Mitchell, B. D., Faraday, N., Sweeney, T. E., Thomas, N. J., Howrylak, J. A., Wong,
Herzog, W., & Gurbel, P. A. (2009). Association of H. R., Rogers, A. J., & Khatri, P. (2018b).
cytochrome P450 2C19 genotype with the antiplate- Multicohort analysis of whole-blood gene expres-
910 J. D. Tenenbaum et al.
sion data does not form a robust diagnostic for Thorisson, G. A., Smith, A. V., Krishnan, L., & Stein,
acute respiratory distress syndrome. Critical Care L. D. (2005). The international HapMap project
Medicine, 46(2), 244–251. web site. Genome Research, 15(11), 1592–1593.
Tarczy-Hornoch, P. (2007). Bio*medical informatics Tomczak, A., Mortensen, J. M., Winnenburg, R., Liu,
and genomic medicine: Research and training. C., Alessi, D. T., Swamy, V., Vallania, F., Lofgren,
Journal of Biomedical Informatics, 40(1), 1–4. S., Haynes, W., Shah, N. H., Musen, M. A., &
Tatonetti, N. P., Denny, J. C., Murphy, S. N., Fernald, Khatri, P. (2018). Interpretation of biological exper-
G. H., Krishnan, G., Castro, V., Yue, P., Tsau, P. S., iments changes with evolution of the gene ontology
Kohane, I., Roden, D. M., & Altman, R. B. (2011a). and its annotations. Scientific Reports, 8(1), 5115.
Detecting drug interactions from adverse-event Venter, J. C., Adams, M. D., Myers, E. W., Li, P. W.,
26 reports: Interaction between paroxetine and pravas- Mural, R. J., Sutton, G. G., Smith, H. O., Yandell,
tatin increases blood glucose levels. Clinical M., Evans, C. A., Holt, R. A., Gocayne, J. D.,
Pharmacology and Therapeutics, 90(1), 133–142. Amanatides, P., Ballew, R. M., Huson, D. H.,
Tatonetti, N. P., Fernald, G. H., & Altman, R. B. Wortman, J. R., Zhang, Q., Kodira, C. D., Zheng,
(2011b). A novel signal detection algorithm for X. H., Chen, L., Skupski, M., Subramanian, G.,
identifying hidden drug-drug interactions in adverse Thomas, P. D., Zhang, J., Miklos, G. L. G., Nelson,
event reports. Journal of the American Medical C., Broder, S., Clark, A. G., Nadeau, J., McKusick,
Informatics Association, 19(1), 79–85. V. A., Zinder, N., Levine, A. J., Roberts, R. J.,
Taylor, C. F., Field, D., Sansone, S. A., Aerts, J., Simon, M., Slayman, C., Hunkapiller, M., Bolanos,
Apweiler, R., Ashburner, M., Ball, C. A., Binz, R., Delcher, A., Dew, I., Fasulo, D., Flanigan, M.,
P. A., Bogue, M., Booth, T., Brazma, A., Brinkman, Florea, L., Halpern, A., Hannenhalli, S., Kravitz,
R. R., Michael Clark, A., Deutsch, E. W., Fiehn, O., S., Levy, S., Mobarry, C., Reinert, K., Remington,
Fostel, J., Ghazal, P., Gibson, F., Gray, T., Grimes, K., Abu-Threideh, J., Beasley, E., Biddick, K.,
G., Hancock, J. M., Hardy, N. W., Hermjakob, H., Bonazzi, V., Brandon, R., Cargill, M.,
Julian, R. K., Jr., Kane, M., Kettner, C., Kinsinger, Chandramouliswaran, I., Charlab, R., Chaturvedi,
C., Kolker, E., Kuiper, M., Le Novere, N., Leebens- K., Deng, Z., Di Francesco, V., Dunn, P., Eilbeck,
Mack, J., Lewis, S. E., Lord, P., Mallon, A. M., K., Evangelista, C., Gabrielian, A. E., Gan, W., Ge,
Marthandan, N., Masuya, H., McNally, R., Mehrle, W., Gong, F., Gu, Z., Guan, P., Heiman, T. J.,
A., Morrison, N., Orchard, S., Quackenbush, J., Higgins, M. E., Ji, R. R., Ke, Z., Ketchum, K. A.,
Reecy, J. M., Robertson, D. G., Rocca-Serra, P., Lai, Z., Lei, Y., Li, Z., Li, J., Liang, Y., Lin, X., Lu,
Rodriguez, H., Rosenfelder, H., Santoyo-Lopez, J., F., Merkulov, G. V., Milshina, N., Moore, H. M.,
Scheuermann, R. H., Schober, D., Smith, B., Snape, Naik, A. K., Narayan, V. A., Neelam, B., Nusskern,
J., Stoeckert, C. J., Jr., Tipton, K., Sterk, P., D., Rusch, D. B., Salzberg, S., Shao, W., Shue, B.,
Untergasser, A., Vandesompele, J., & Wiemann, S. Sun, J., Wang, Z., Wang, A., Wang, X., Wang, J.,
(2008). Promoting coherent minimum reporting Wei, M., Wides, R., Xiao, C., Yan, C., Yao, A., Ye,
guidelines for biological and biomedical investiga- J., Zhan, M., Zhang, W., Zhang, H., Zhao, Q.,
tions: The MIBBI project. Nature Biotechnology, Zheng, L., Zhong, F., Zhong, W., Zhu, S., Zhao, S.,
26(8), 889–896. Gilbert, D., Baumhueter, S., Spier, G., Carter, C.,
Tenenbaum, J. D., & Goodman, K. W. (2017). Beyond Cravchik, A., Woodage, T., Ali, F., An, H., Awe, A.,
the Genetic Information Nondiscrimination Act: Baldwin, D., Baden, H., Barnstead, M., Barrow, I.,
Ethical and economic implications of the exclusion Beeson, K., Busam, D., Carver, A., Center, A.,
of disability, long-term care and life insurance. Cheng, M. L., Curry, L., Danaher, S., Davenport,
Personalized Medicine, 14(2), 153–157. L., Desilets, R., Dietz, S., Dodson, K., Doup, L.,
Tenenbaum, J., James, A., & Paulyson-Nuñez, K. Ferriera, S., Garg, N., Gluecksmann, A., Hart, B.,
(2012). An altered treatment plan based on direct to Haynes, J., Haynes, C., Heiner, C., Hladun, S.,
consumer (DTC) genetic testing: Personalized med- Hostin, D., Houck, J., Howland, T., Ibegwam, C.,
icine from the patient/pin-cushion perspective. Johnson, J., Kalush, F., Kline, L., Koduru, S., Love,
Journal of Personalized Medicine, 2(4), 192–200. A., Mann, F., May, D., McCawley, S., McIntosh, T.,
Tenenbaum, J. D., Avillach, P., Benham-Hutchins, M., McMullen, I., Moy, M., Moy, L., Murphy, B.,
Breitenstein, M. K., Crowgey, E. L., Hoffman, Nelson, K., Pfannkoch, C., Pratts, E., Puri, V.,
M. A., Jiang, X., Madhavan, S., Mattison, J. E., Qureshi, H., Reardon, M., Rodriguez, R., Rogers,
Nagarajan, R., Ray, B., Shin, D., Visweswaran, S., Y. H., Romblad, D., Ruhfel, B., Scott, R., Sitter, C.,
Zhao, Z., & Freimuth, R. R. (2016). An informatics Smallwood, M., Stewart, E., Strong, R., Suh, E.,
research agenda to support precision medicine: Thomas, R., Tint, N. N., Tse, S., Vech, C., Wang, G.,
Seven key areas. Journal of the American Medical Wetter, J., Williams, S., Williams, M., Windsor, S.,
Informatics Association, 23(4), 791–795. Winn-Deen, E., Wolfe, K., Zaveri, J., Zaveri, K.,
Terry, S. F. (2014). The global alliance for genomics & Abril, J. F., Guigo, R., Campbell, M. J., Sjolander,
health. Genetic Testing and Molecular Biomarkers, K. V., Karlak, B., Kejariwal, A., Mi, H., Lazareva,
18(6), 375–376. B., Hatton, T., Narechania, A., Diemer, K.,
Translational Bioinformatics
911 26
Muruganujan, A., Guo, N., Sato, S., Bafna, V., Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J.,
Istrail, S., Lippert, R., Schwartz, R., Walenz, B., Appleton, G., Axton, M., Baak, A., Blomberg, N.,
Yooseph, S., Allen, D., Basu, A., Baxendale, J., Boiten, J. W., da Silva Santos, L. B., Bourne, P. E.,
Blick, L., Caminha, M., Carnes-Stine, J., Caulk, P., Bouwman, J., Brookes, A. J., Clark, T., Crosas, M.,
Chiang, Y. H., Coyne, M., Dahlke, C., Mays, A., Dillo, I., Dumon, O., Edmunds, S., Evelo, C. T.,
Dombroski, M., Donnelly, M., Ely, D., Esparham, Finkers, R., Gonzalez-Beltran, A., Gray, A. J.,
S., Fosler, C., Gire, H., Glanowski, S., Glasser, K., Groth, P., Goble, C., Grethe, J. S., Heringa, J., ‘t
Glodek, A., Gorokhov, M., Graham, K., Gropman, Hoen, P. A., Hooft, R., Kuhn, T., Kok, R., Kok, J.,
B., Harris, M., Heil, J., Henderson, S., Hoover, J., Lusher, S. J., Martone, M. E., Mons, A., Packer,
Jennings, D., Jordan, C., Jordan, J., Kasha, J., A. L., Persson, B., Rocca-Serra, P., Roos, M., van
Kagan, L., Kraft, C., Levitsky, A., Lewis, M., Liu, Schaik, R., Sansone, S. A., Schultes, E., Sengstag,
X., Lopez, J., Ma, D., Majoros, W., McDaniel, J., T., Slater, T., Strawn, G., Swertz, M. A., Thompson,
Murphy, S., Newman, M., Nguyen, T., Nguyen, N., M., van der Lei, J., van Mulligen, E., Velterop, J.,
Nodell, M., Pan, S., Peck, J., Peterson, M., Rowe, Waagmeester, A., Wittenburg, P., Wolstencroft, K.,
W., Sanders, R., Scott, J., Simpson, M., Smith, T., Zhao, J., & Mons, B. (2016). The FAIR guiding
Sprague, A., Stockwell, T., Turner, R., Venter, E., principles for scientific data management and stew-
Wang, M., Wen, M., Wu, D., Wu, M., Xia, A., ardship. Scientific Data, 3, 160018.
Zandieh, A., & Zhu, X. (2001). The sequence of the Wishart, D. S. (2011). Advances in metabolite identifica-
human genome. Science, 291(5507), 1304–1351. tion. Bioanalysis, 3(15), 1769–1782.
Via, M., Gignoux, C., & Burchard, E. G. (2010). The WTCCC. (2007). Genome-wide association study of
1000 Genomes Project: New opportunities for 14,000 cases of seven common diseases and 3,000
research and social challenges. Genome Medicine, shared controls. Nature, 447(7145), 661–678.
2(1), 3. Yildirim, M. A., Goh, K. I., Cusick, M. E., Barabasi,
Waldman, S. A., & Terzic, A. (2010). Clinical and trans- A. L., & Vidal, M. (2007). Drug-target network.
lational science: From bench-bedside to global vil- Nature Biotechnology, 25(10), 1119–1126.
lage. Clinical and Translational Science, 3(5), Zerhouni, E. A. (2006). Clinical research at a cross-
254–257. roads: The NIH roadmap. Journal of Investigative
Wan, Z., Vorobeychik, Y., Xia, W., Clayton, E. W., Medicine, 54(4), 171–173.
Kantarcioglu, M., & Malin, B. (2017). Expanding Zheng, P., Zeng, B., Zhou, C., Liu, M., Fang, Z., Xu, X.,
access to large-scale genomic data while promoting Zeng, L., Chen, J., Fan, S., & Du, X. (2016). Gut
privacy: A game theoretic approach. American microbiome remodeling induces depressive-like
Journal of Human Genetics, 100(2), 316–322. behaviors through a pathway mediated by the host’s
Wang, Y., Xiao, J., Suzek, T. O., Zhang, J., Wang, J., & metabolism. Molecular Psychiatry, 21(6), 786.
Bryant, S. H. (2009). PubChem: A public informa- Zhu, J., Zhang, B., & Schadt, E. E. (2008). A systems
tion system for analyzing bioactivities of small mol- biology approach to drug discovery. Advances in
ecules. Nucleic Acids Research, 37(Web Server issue), Genetics, 60, 603–635.
W623–W633. Zhu, Q., Ge, D., Maia, J. M., Zhu, M., Petrovski, S.,
Wang, Y., Bolton, E., Dracheva, S., Karapetyan, K., Dickson, S. P., Heinzen, E. L., Shianna, K. V., &
Shoemaker, B. A., Suzek, T. O., Wang, J., Xiao, J., Goldstein, D. B. (2011). A genome-wide compari-
Zhang, J., & Bryant, S. H. (2010). An overview of son of the functional properties of rare and com-
the PubChem BioAssay resource. Nucleic Acids mon genetic variants in humans. American Journal
Research, 38(Database issue), D255–D266. of Human Genetics, 88(4), 458–468.
Weston, A. D., & Hood, L. (2004). Systems biology, pro- Zitvogel, L., Galluzzi, L., Viaud, S., Vetizou, M.,
teomics, and the future of health care: Toward pre- Daillere, R., Merad, M., & Kroemer, G. (2015).
dictive, preventative, and personalized medicine. Cancer and the gut microbiota: An unexpected link.
Journal of Proteome Research, 3(2), 179–196. Science Translational Medicine, 7(271), 271ps271.
913 27
Contents
References – 938
of representative clinical research use cases, with additional collection of data performed
introduce the role of major classes and types solely for the purposes of research, rather
of information system that enable or facili- than the normal process of patient care.
tate such activities, and conclude with a set Further along the spectrum are clinical tri-
analyses regarding the future directions of the als, in which research subjects participate in
field. The overall objective of this chapter is to some additional activity, or intervention, that
provide the reader with the ability to evaluate is intended either to induce a change in the
critically the current and anticipated roles of subject or to prevent the occurrence of some
biomedical informatics knowledge and prac- change that would otherwise be expected. The
tice as applied to clinical research. intervention might be as simple as administer-
27 ing a substance already found in the human
body (such as a vitamin) to measuring a
27.2 A Primer on Clinical Research change in that substance (such as the amount
of the vitamin found in the blood or urine).
In the following section, we will briefly intro- More complex studies involve interventions
duce the characteristics of the modern clinical that have an impact on human disease, such
research environment, including the design as the administration of a preventive vaccine,
and execution of an exemplary class of clini- the administration of a curative drug, or a
cal studies that are known as randomized surgical procedure to remove, insert, repair or
controlled trials (RCTs). This primer on clini- replace a structure or device in the subject’s
cal research will serve as the context for the body. As with passive studies, data collec-
remainder of the chapter, in which we will tion is critical to the proper performance of
introduce major information needs and their research and may become intense, with the
relationships to a variety of basic and applied collection of clinical information occurring
biomedical informatics practice areas and IT more frequently and involving data describing
tools/platforms. the intervention materials (such as the purity
of a drug or the performance of a device) in
addition to data related to the human subject
27.2.1 he Modern Clinical
T and their response to the intervention under
Research Environment study.
Although not an intrinsic requirement of
Clinical research comes in many forms and clinical research, the inclusion of comparison
may include a variety of specific activities. groups is usually considered an important
All forms, however, share a common set of part of rigorous and reproducible clinical
requirements related to the comprehensive research method. In some cases, historical
management of study data – specifically, the controls can be used for comparison with a
collection of data on human research sub- group of subjects under study. For example, if
jects – and analysis of those data. As clinical a disease is known to have a particular fatal-
research designs span the spectrum from pas- ity rate, subjects could be given a potentially
sive or observational studies to interventional life-saving treatment, and their fatality rate
trials, the acuity of activities and associated can be measured and compared to past expe-
data-management needs increase commensu- rience. In quasi-experiments, comparison sub-
rately. For example, in a retrospective study ject groups can also be selected based on some
subjects are selected based on the presence known characteristic that distinguishes the
or absence of a particular condition and ret- two groups, such as gender or race, or their
rospective or pre-existing data are obtained willingness to undergo a particular interven-
from historical records (such as EHRs, disease tion.
registries, and research-specific databases), A more rigorous method of establishing
whereas in natural history studies, subjects are comparison groups is through randomization,
recruited and followed in prospective manner, in which prospective subjects are assigned to
Clinical Research Informatics
917 27
different groups (often referred to as study practical, or the condition under study may
arms) and undergo different interventions as be so rare that only historical controls are
a result of the arm to which they are assigned. available.
Typically, randomization might account for While different study designs have unique
observable characteristics (such as gender, and differentiated data, information, and
ethnicity, and race) to create balanced groups, knowledge management needs, they usually
especially where the characteristics are known involve some form of systematic data man-
to have some influence on the effect of the agement, as noted previously. Such data man-
intended intervention. Randomization also agement activities usually include initial data
serves to distribute subjects based on unob- collection, aggregation, analysis, and results
served characteristics, for example, unknown dissemination, to name a few of many such
genetic traits, in order to reduce differences in tasks. As shown in . Fig. 27.1, different study
the groups that might bias the results of the methods introduce new issues as successively
study. In a randomized controlled trial (RCT), more complex interventions and study design
one subject group will often receive a control patterns are employed. For the remainder of
intervention (for example, the usual treatment this chapter, we will focus our discussion on
or treatments for a condition, or even no treat- RCTs as our prototypical study design, as they
ment) while one or more other groups receive tend to involve most if not all of the informat-
an experimental intervention. ics issues and information needs encountered
Although intended to reduce bias, the ran- in other study designs. Further information on
domization process itself must be carefully the design characteristics, data management
executed such that it does not introduce new needs, and associated best practices related to
sources of bias. For example, randomization various types of clinical trials can be found in
can include blinding, in which the subject,
the investigator, or both (as in double-blinded
studies), are kept unaware of group assign- Study Activity Output
ment until after all assessments have been Protocol Protocol
made. This might include the use of a placebo Development Documents
Preparatory
a number of excellent works concerning this of a new therapeutic intervention, like a new
subject (Bhatt and Mehta 2016; Hulley et al. drug, an individual research study is designed
2013; Prokscha 2011), and further discussion to address each phase in a line of research
is beyond the scope of this chapter. inquiry that will determine the efficacy and
effectiveness of such a therapy (Spilker 1984).
27.2.1.1 Phased Randomized In most cases, this adheres to the following
Controlled Trials model:
Most clinical studies begin with the identifica- 55 Phase I: Investigators evaluate the novel
tion of a set of driving or motivating hypothe- therapy in a small group of participants in
ses. The research questions that serve to define order to assess overall safety. This safety
27 such hypotheses might be raised through an assessment includes dosing levels in the
case of non-interventional therapeutic tri-
analysis of gaps in knowledge as found in the
published biomedical literature or be informed als, and potential side effects or adverse
by the results of a previous study. It is impor- effects of the therapy. Often, Phase I trials
tant to note that clinical research endeavors of non-interventional therapies involve the
exist on a spectrum of scientific activity that is use of normal volunteers who do not have
often referred to as clinical and translational the disease state targeted by the novel
research. A particular type of translational therapy.
research, often referred to as T1-type transla- 55 Phase II: Investigators evaluate the novel
tion, is a process by which basic science dis- therapy in a larger group of participants in
coveries are used to design novel therapies order to assess the efficacy of the treat-
(Sung et al. 2003). Such discoveries are then ment in the targeted disease state. During
evaluated during clinical research studies, this phase, assessment of overall safety is
first pre-clinical and subsequent clinical trial continued.
phases (Payne et al. 2005). A second type of 55 Phase III: Investigators evaluate the novel
translational research, often referred to as T2 therapy in an even larger group of partici-
translation, involves methods such as those pants and compare its performance to a
borrowed from implementation science and reference standard which is usually the
clinical informatics, and focus on translating current standard of care for the targeted
the findings of such clinical research stud- disease state. This phase typically employs
ies into common practice (Sung et al. 2003). an RCT design, and often a multi-center
A common colloquialism for this process of RCT given the numbers of variation of
translating a novel basic science discovery subjects that must be recruited to test the
through clinical research and into clinical hypothesis. In general, this is the final
practice is “bench to bedside” science. study phase to be performed before seek-
Individual and distinct RCTs are often ing regulatory approval for the novel ther-
conducted for different purposes, most often apy and broader use in standard-of-care
motivated by the need to fill fundamental environments.
knowledge gaps about a particular interven- 55 Phase IV: Investigators study the perfor-
tion under study. By combining such knowl- mance and safety of the novel therapy
edge gaps with the underlying biomedical after it has been approved and marketed.
mechanisms of physiology and disease, a moti- This type of study is performed in order to
vating hypothesis or collections of hypotheses detect long-term outcomes and effects of
are established as to why a given intervention the therapy. It is often called “post-market
might lead to a given result or finding. Such surveillance” and is, in fact, not an RCT at
hypotheses result in a natural sequence of all, but a less formal, observational study.
research questions that can be asked relative
to a novel intervention. Usually, an individ- The phase of an RCT has implications for
ual research study is designed to address one the kinds of questions being asked and the
specific research question and hypothesis. In kinds of processes carried out to answer them.
the case of the development and evaluation From an informatics perspective, however, the
Clinical Research Informatics
919 27
tasks are usually very similar. At a high level, are enrolled, and in the case of studies with
the conduct of a Phase I, II or III clinical trial multiple study groups or arms, randomized
can be thought of in an operational sense as into one of those arms.
consisting of three major stages: preparatory, The preceding activities lead to the initia-
active, and dissemination. During these three tion of the next step in the research process,
stages, a specific temporal series of processes is which we refer to as the active phase. During
executed. First, during the preparatory phase, the active phase, the participant receives the
a protocol document is generated as part of therapeutic intervention indicated by their
the project development process. The proto- study arm and is actively monitored to enable
col document usually contains background the collection of study-specific data. This ther-
information, scientific goals, aims, hypotheses apeutic intervention and active monitoring
and research questions to be addressed by process is often iterative, involving multiple
the trial. In addition, the protocol describes cycles of interventions and active monitoring.
policies, procedures, and data collection or Follow-up activities begin once a participant
analysis requirements. A critical aspect of the has completed the interventional stage of a
protocol document is the definition of a pro- study. During this stage, subjects are con-
tocol schema, which defines at a highly granu- tacted on a specified temporal basis in order
lar level the temporal sequence of tasks and to collect additional data of interest, such as
events required to both deliver the interven- long-term treatment effects, disease status or
tion under study and to ensure that data are survival status.
collected and managed in a systematic man- Finally, during the dissemination phase,
ner commensurate with the study hypotheses the results of the study are evaluated and for-
and aims. malized in publications or other knowledge
Once a protocol is deemed ready for dissemination media, for translation into the
execution, the feasibility of the study design next phase of an RCT or into clinical prac-
(e.g., addressing questions such as “are there tice. In some cases, such as is adaptive study
enough participants available in the targeted designs (Bhatt and Mehta 2016), this dissemi-
population to satisfy the study design defined nation phase feeds back into the planning and
in the protocol document?” is assessed either active phases. Such feedback cycles enable
quantitatively (e.g., using historical data) and/ rapid revision of a study design, iterative par-
or heuristically). Throughout the preparatory ticipant enrollment, and dynamic data collec-
phase, a concurrent process of seeking regula- tion in support of such revised hypotheses and
tory approval from local and national bodies designs. Of note, these types of adaptive trial
(e.g., local Institutional Review Boards, the designs are particularly helpful when conduct-
Food and Drug Administration) occurs. Once ing studies of conditions where large numbers
a protocol plan is complete, deemed feasible, of patients may not be present for recruitment
and regulatory approval has been received, purposes, so that patients are assigned to an
potential participants are recruited and intervention and/or monitored in a manner
screened to determine if they meet the inclu- that maximizes data collection that is most
sion and exclusion criteria for the study (e.g., likely to demonstrate the safety, efficacy, and
specific demographic and/or clinical param- comparative effectiveness of the diagnostic or
eters required for subjects to be eligible for the therapeutic approach being evaluated (Bhatt
study). Once a potential participant has been and Mehta 2016).
deemed eligible for the study, they are pro- The quality of data produced by a clini-
vided with an informed consent document, cal trial is assessed using multi-dimensional
which must be signed prior to proceeding metrics that account for the design, execu-
with the enrollment process. Enrollment in the tion, analysis and dissemination of the study
context of clinical trials means officially regis- results. The quality of a clinical trial is also
tering as a study subject, and the subsequent judged with respect to the significance or rel-
assignment of a study-specific identifier. Once evance of the reported study results within a
a person agrees to become a participant, they clinical context (Hulley et al. 2013; Prokscha
920 P. R. O. Payne et al.
2011; Spilker 1984). One key metric used to 27.2.2.1 Information Systems
assess clinical trial quality is validity, which Supporting Clinical
can be defined both internally and externally. Research Programs
Internal validity is defined as the minimiza- It is helpful to conceptualize the conduct of
tion of potential biases during the design and clinical research studies as a multiple-stage
execution of the trial, while external validity sequential model, as was introduced previ-
is the ability to generalize study results into ously and is expanded upon in this section
clinical care. It is important to note in a dis- (Payne et al. 2005). At each stage in such a
cussion of the role of biomedical informatics model, a combination of research-specific
relative to clinical research that such plat- and general technologies can be employed to
27 forms, interventions, and methods can play
a major role in reducing or mitigating such
support or address related information needs
(. Fig. 27.2).
sources of bias, thus enhancing the validity There are numerous examples of general-
and generalizability of study results. purpose and clinical systems that are able to
support the conduct of clinical research:
55 Publication (or bibliographic) databases
27.2.2 Information Needs and information retrieval (IR) tools such
and Systems in the Clinical as PubMed, Google Scholar, and OVID
Research Environment can be used to assist in conducting the
background research necessary for the
As can be inferred by the preceding intro- preparation of protocol documents.
duction to the definitional aspects of clini- 55 Electronic health records (EHRs) can be
cal research, such activities regularly involve used to collect clinical data on research
a variety of data, information, and knowl- participants in a structured form that can
edge sources, as well a complicated set of reduce redundant data entry.
complementary and overlapping workflows.
55 Data warehouses and associated data or
At the highest level, these characteristics text mining tools can be used in multiple
of the clinical research environment can be capacities, including: (1) determining if
related to a number of critical information participant cohorts who meet the study
needs, as summarized in . Table 27.1. This inclusion or exclusion criteria can be
representation of the information needs practically recruited given historical
inherent to clinical research is presented using trends, and (2) identifying specific partici-
the specific context of a prototypical RCT, but pants and related data within existing
the basic types of needs and example solu- databases.
tions provided can be extended to apply to 55 Clinical decision-support systems (CDSS)
the broader spectrum of research designs and can be used to alert providers at the point-
patterns introduced earlier. of-care that an individual may be eligible
Building upon this broad definition of for a clinical trial
the information needs inherent to clinical
research, in the following sub-sections we: (1) In addition to the preceding general technolo-
review the types of information systems that gies, a number of research-specific technolo-
can support the phases that comprise a clinical gies have been developed:
study, (2) explore the functional components 55 Feasibility analysis applications and data
that make up a clinical trials management sys- simulation and visualization tools can
tem, (3) identify current consortia that share streamline the pre-clinical research process
clinical and research data, and (4) discuss the (e.g., disease models) and assist in the
role of standards in enabling interoperability analysis of complex data sets in order to
between such information systems. assess the feasibility of a given study
design.
Clinical Research Informatics
921 27
.. Table 27.1 Overview of definitional information needs in the clinical research environment, contextualized
using the design of RCTs
Support for research Collaborative document and Study teams often involve geographically and
planning and knowledge management temporally distributed participants, who need to
conduct engage in iterative protocol development and approval
Data sources and tools for
processes. Such activities by necessity incorporate
feasibility analyses
document versioning, annotation, and associated
Regulatory approval metadata management tasks. Once a protocol has been
workflows developed, access to data sets for the purposes of
assessing the feasibility of a given study design is
critical, and often involves the use of de-identified data
sets drawn from a data warehouse or research registry.
Finally, the submission, tracking, and documentation
of regulatory approvals often necessitate the
coordination and management of complex,
document-oriented workflows and record keeping
tasks.
Facilitation of data Secondary-use of The ability to use primary clinical data from EHR or
management, access, EHR-derived data for equivalent platforms to support secondary use in a
and integration research purposes research program has the potential to reduce
redundancy and potential errors while increasing data
Research project specific
quality. However, using such data in a secondary
data capture, management,
capacity also requires that appropriately structured
and reporting
data be captured and codified in clinical systems, and
Distributed data then be made available to research teams and research
management (spanning data management systems in a timely and resource
traditional organizational efficient manner. In addition to such secondary use of
boundaries) clinical data, most clinical studies require the regular
capture and management of study-specific data
Syntactic and semantic elements, a task that is usually accomplished via the
interoperability use of Electronic Data Capture (EDC) or Clinical
Trial Management Systems (CTMS). Finally, given
the propensity to conduct studies that span traditional
organizational boundaries in order to realize
economies of scale and/or access sufficiently large
patient populations, it is often necessary to query,
integrate, and manage distributed data sets, and
ensure their syntactic and semantic interoperability.
Such a need is usually addressed through the use of
Service Oriented Architectures, Cloud Computing,
Data Warehousing, and Metadata Management
technologies.
Workforce training Dissemination of study, A central need when conducting clinical studies is the
and support methodological, and technical ability to ensure that individuals involved in the
training materials execution of a protocol share common methods, data
management practices, and workflows (thus reducing
Support for team
potential sources of study bias). Ensuring such shared
collaboration and
knowledge and practices, particularly in distributed or
knowledge sharing
multi-site settings, requires the use of distance
education and team-science tools and platforms to
enable knowledge sharing and distance learning
paradigms.
(continued)
922 P. R. O. Payne et al.
Management Support for research billing The business and management aspects of the conduct
information capture of clinical studies is complex, often requiring the
Operational instrumentation
and reporting disambiguation of standard-of-care and research
and reporting
specific charges as part of billing operations, as well
Regulatory monitoring as the tracking of key performance and data quality
metrics that may be required to satisfy contractual
Data quality assurance commitments to the entities funding such studies.
Furthermore, the monitoring of study data for critical
or sentinel events that should or must be reported for
27 regulatory purposes is both necessary and of extreme
importance. All of the aforementioned activities
require the application of a variety of management
information system, business intelligence, and
reporting tools, leveraging a broad variety of
enterprise, administrative, and study-specific data
sources.
Participant Cohort discovery The identification of participant cohorts that satisfy
recruitment tools key study design criteria, such as inclusion and
Eligibility determination
and methods exclusion criteria, is frequently a major barrier to the
and alerting
timely and efficient execution of clinical studies. A
Participant registration, variety of information needs, related to the
consent, and enrollment identification and engagement of such cohorts, to
execution and tracking point-of-care alerting regarding potential study
eligibility, to the management of registration, consent,
and enrollment records is inherent to this information
need. Such requirements are usually satisfied through
a multi-modal approach, leveraging both clinical and
research-specific information systems.
Data standards Standards for interoperability As has been noted relative to several of the preceding
between research systems information needs, there is a frequent and reoccurring
requirement for both syntactic and semantic
Standards for
interoperability between research-specific information
interoperability between
systems, as well as between research-specific and
research, enterprise (e.g.
clinical or administrative systems. Such a need
EHR), and administrative
necessitates the design, selection, and application of a
systems
variety of data standards, as well as the ability to map
and harmonize between shared information models to
support interactions between systems using a variety
of standards.
Workflow support Integration of tools for Much as was the case related to data standards, a
combined standard-of-care closely aligned information need exists relative to the
and research visits ability to support complex workflows between
information systems and actors involved in the
Data, information, and
conduct of clinical research. Such workflow support
knowledge transfer between
requires both computational and application-level
stakeholders, project phases,
workflow orchestration, as well as the ability to define
activities, and associated
and apply reusable data analytic “pipelines.”
information systems or
tools.
Clinical Research Informatics
923 27
Table 27.1 (continued)
Data, information Knowledge management for The ultimate objective of clinical research is to
and knowledge clinical evidence generated generate and apply new evidence in support of
dissemination during trials improvements in clinical care and human health. In
order to do so, it is necessary to disseminate the
Guidelines and CDSS
findings generated during such studies in a variety of
delivery mechanisms
formats, including reusable/actionable knowledge
Publication mechanisms resources, clinical guidelines, decision support rules,
and/or publications and reports. In addition, increasing
Data registries emphasis is being placed on the transparency and
reproducibility of study designs, which is often
accomplished through the creation of public registries
via which study data sets can be shared and made
available to the broader biomedical community.
Intervention and/or
EHR, Data Warehouses Raw Data
Data Collection
CTMS, EDC, Participant
Tracking or Calendaring
Tools
CDSS Monitoring and/or
Quality Assurance Monitored Data
.. Fig. 27.2 Overview of study activities, and related research-specific and general information technologies, as
well as targeted products or outputs associated with the sequential clinical research workflow paradigm
55 Protocol authoring tools can allow geo- 55 Electronic data capture (EDC) and Clinical
graphically distributed authors to collabo- Trial and/or Research Management Sys
rate on complex protocol documents. tems (CTMS/CRMS) can be used to col-
55 Automated screening tools and targeted lect research-specific data in a structured
alerts can assist in the identification and form and reduce the need for redundant
registration of research participants. and potentially error-prone paper-based
924 P. R. O. Payne et al.
Innovation Network (TIN),3 the Recruitment rized for all remote sites (Weber et al. 2009).
Innovation Center (RIC),4 and the Center for SHRINE networks have been established by
Data 2 Health (CD2H)5 which is charged with many research institutions to support, among
the coordination of network-wide informat- other functions, the estimation of available
ics activities. These centers and sub-networks cohort sizes for multi-institutional clinical
within the broader CTSA network engage in a trials.
range of activities, such as: (1) leveraging local
data repositories and federated data query
tools to enable the rapid assessment of fea- 27.3.4 Accrual to Clinical Trials
sibility when designing multi-site clinical tri- (ACT) Network
27 als (ACT); (2) providing expertise and shared
best practices as they relate to study designs Recently, NCATS has funded the Accrual
and regulatory frameworks for such multi-site to Clinical Trials (ACT) network to bring
clinical trials (TIN); (3) delivering novel tools together CTSA sites into a single SHINE net-
and methods to accelerate the recruitment work. Its intent is to allow clinical researchers
of participants into those trials (RIC); and to query the network in real time and to obtain
(4) helping the Biomedical Informatics com- aggregate counts of patients who meet clini-
ponents at CTSA hubs to collaborate, share cal trial inclusion and exclusion criteria from
knowledge and tools, and harmonize data sites across the United States (Visweswaran
assets (CD2H). et al. 2018). Currently, 31 CTSA sites are
fully operational in the ACT network, with
an additional 15 being “staged” for integra-
27.3.3 i2b2 and SHRINE tion.6 Although the ACT SHRINE network
currently returns only summary statistics for
One popular warehouse software platform each site, future plans include the ability to
that is frequently used in the context of obtain detailed patient data sets and, working
clinical research activities is Informatics for with researchers are member sites, being able
Integrating Biology and the Bedside (i2b2), to contact individual patients for potential
developed under a National Center for recruitment into clinical studies.
Biomedical Computing grant from NIH to
Partners HealthCare System and Harvard
University. i2b2 provides an information 27.3.5 PCORNet
system framework to allow clinical research-
ers to use existing clinical data for discovery Another clinical research data sharing con-
research (Murphy et al. 2010). Many of the sortium is PCORNet, which is funded by
over 60 institutions receiving CTSA grants the Patient-Centered Outcomes Research
have adopted i2b2 technologies to support Institute (PCORI), a non-profit corpora-
research and collaboration. tion established by the Patient Protection
A companion to i2b2 is the Shared Health and Affordable Care Act (see 7 Chap. 12) to
Research Information Network (SHRINE), support a national clinical research agenda
which is a version of i2b2 that can pass data (Fleurence et al. 2014). PCORNet is made of
queries entered by a local user off to other clinical data research networks (CDRNs) that
i2b2 instances to provide patient counts and each have access to EHR data on a million or
demographic information that is summa- more patients, patient-powered research net-
works (PPRNs) that are led by patients and
patient advocates with an interest in a particu-
3 7 https://trialinnovationnetwork.org/ (Accessed lar disease (common or rare), and Health Plan
January 27, 2019).
4 7 https://trialinnovationnetwork.org/recruitment-
innovation-center/ (Accessed January 27, 2019).
5 7 https://ctsa.ncats.nih.gov/cd2h/ (Accessed Janu- 6 7 http://www.actnetwork.us/national/get-to-know-
ary 27, 2019). 46EU-1128WI.html (Accessed January 1, 2019).
Clinical Research Informatics
927 27
Research Networks (HPRNs) that link EHR ing clinical quality metrics, and HOMER
data with insurance claims data (Fleurence (Health Outcomes and Medical Effectiveness
et al. 2014). There are currently 13 CDRNs, Research), a tool for risk identification and
20 PPRNs, and 2 HPRNS nation-wide.7 The comparative effectiveness studies.
intent of these networks is to provide data The OHDSI Research Network allows
that can be used directly to answer clinical researchers at member sites to query data
research questions based on the health data repositories to obtain high-quality observa-
of large patient cohorts. Clinical researchers tional data that can be used for study design,
can also use the networks to help identify and execution, and data analysis. An OHDSI
contact patients who might be suitable for research defines a “project” which is actual-
clinical studies. ized as a query for specific patient data. Data
owners are invited to participate in projects by
querying their own databases and conducting
27.3.6 bservational Health Data
O data analysis locally, with results sent back to
Sciences and Informatics the initial research team for compilation and
(OHDSI) analysis.
7 7 https://pcornet.org/participating-networks
(Accessed January 1, 2019).
8 7 https://www.ohdsi.org/who-we-are/collaborators 9 7 https://flatiron.com/about-us (Accessed January
(Accessed January 1, 2019). 1, 2019).
928 P. R. O. Payne et al.
Electronic Health Record (EHR) vendors primarily by 11 consortia across the country,
are also creating networks to support or enable involving 51 health care organizations.10
research data sharing among their clients. For Unlike other consortia described above,
example, Epic Corporation began working data and specimens are being consolidated
with a group of academic health center cus- centrally at a Data and Research Center
tomers in 2013 on such a data sharing network (DRC) and a biobank, respectively. Plans
and created an advisory group of CRI experts include the collection of data from all of the
from such sites to inform governance, infra- participants’ EHRs, not just those at recruit-
structure, and research data sharing processes. ment centers, and the sequencing of partici-
Similarly, Cerner corporation has created a pants full genomes.
27 centralized data sharing platform that enables
their clients to combine and access large vol-
The All of Us Research Program also
differs from other consortia in its patient-
umes of de-identified patient- level data for centered focus. All of Us asks participants
trial design and population health manage- for full access to their fully identified data for
ment purposes. use by researchers without prior approval of
Further, major computing and cloud specific studies. In return, patients are repaid
computing vendors, such as Amazon, Apple, through full transparency and partnership.
Facebook, Google, and Microsoft, have Participants have access to their information
launched initiatives to provide patient-centered and play a role in helping to identify impor-
data sharing capabilities, which empower tant research priorities for the program.
individuals to aggregate and share their own
health care data from a variety of sources
(“Large technology companies continue to 27.4 ata Standards in Clinical
D
ramp up healthcare forays,” 2018). These capa- Research
bilities introduce new data sharing scenarios
in which individual patients will have the abil- The use of standards to represent clinical
ity to “donate their data” to research projects research information provides the same chal-
without engaging their health care providers lenges and benefits found in other informat-
as intermediaries in such transactions. The ics application areas. Data may be captured
impact of all of these activities remains to be with standard terminologies or translated into
understood, given their relative immaturity at standards to support data reporting and shar-
the time this chapter is being written. ing which, in turn, require agreed-upon stan-
dard frameworks to support such exchanges.
Standards are even being developed for the
27.3.8 All of Us representation of clinical trial protocols them-
selves. . Figure 27.3 depicts how the various
The All of Us Research Program is the prin- kinds of standards fit into the overall schema
ciple component of the US government’s of clinical research, ranging from data mod-
Precision Medicine Initiative (PMI), estab- els that define how data are to be represented,
lished in 2016 to establish a cohort of one through standards for terminologies to actu-
million Americans who provide health status
data, blood, and urine specimens for clinical
and genetic analysis, and access to their com-
plete EHR data (Collins and Varmus 2015). 10 7 https://allofus.nih.gov/about/program-partners/
health-care-provider-organizations (Accessed Janu-
Recruitment is currently being carried out ary 1, 2019).
Clinical Research Informatics
929 27
Data Representation
CR-Focused Terminologies General Terminologies
and ontologies and ontologies
.. Fig. 27.3 Relationships among various general pur- data will map from the model to the messages used for
pose and CRI-specific standards that are relevant to the interchanging the data. The use of messages is deter-
design, conduct, and dissemination of clinical research mined by the requirements of regulatory agencies and
studies. Data modeling determines how terms from ter- collaborating research groups. See text for an explana-
minologies and ontologies will be recorded in clinical tion of acronyms used in this figure
research databases. Exchange standards determine how
ally represent the data and structures for as with SHRINE, but consolidation of the
exchanging them, out to standards for report- data may still require mapping of data to a
ing and sharing. The standards described here common terminology (although the ACT
are some of the current and most prevalent network requires using a shared ontology for
ones, but they continue to evolve and new representing data). Note that the data model
standards relevant to the CRI domain are used for storing the data in a database (i2b2,
constantly emerging. for example) may not be acceptable for data
exchange.
An alternative approach is the model-
27.4.1 ata Modeling Standards
D driven architecture, in which an underly-
to Support Clinical Research ing data model is created for the express
purpose of representing all aspects of an
Formats for data sharing typically include a information design, including data represen-
data model for the information to be shared, tation. Previously, the models used for clinical
leaving to individual contributors the later research management systems have been those
task of mapping local data into the exchange required to support system functionality. New
model. For example, i2b2 uses a standard efforts are underway to create standards for
data model internally that is based on a modeling the actual research protocols, to
generic entity-attribute-value model (EAV) enable a logical representation that includes
that essentially allows any data to be stored the semantic aspects of the protocol (for
using any desired controlled terminology example, the relationships between specific
(Klann et al. 2016). This allows queries to interventions and observations intended to
be conducted across multiple i2b2 databases, measure their effects). While use of such mod-
930 P. R. O. Payne et al.
els may make the research process somewhat ics, encounters, diagnoses, laboratory results,
more complicated, the mapping to standards procedures, and medications. PCORI uses the
used for exchanging data becomes greatly PCORNet Common Data Model (Belenkaya
simplified. et al. 2015) based on the data model used in
For example, Health-Level 7 (HL7; see the FDA’s Sentinel Initiative,12 while OHDSI
7 Chap. 7) is an open standards develop- uses the OMOP Common Data Model (Ryan
ment organization that develops consen- et al. 2009). The All of Us program has
sus standards for all manner of clinical and adopted the OMOP model as well.
administrative data, and is also working on
clinical research-specific standards, such as
27 the Regulated Clinical Research Information 27.4.2 Terminology Standards
Management (RCRIM) model in order to to Support Clinical Research
define messages, document structures, termi-
nology, and semantics related to the collection, As described previously, the design of clinical
storage, distribution, integration and analysis protocols includes rigorous attention to the
of research information (R. L. Richesson and types of data to be collected and the format
Krischer 2007). The main focus of the work is of those data. This often involves the use of
on data related to studies involving US Food controlled terminologies to capture categori-
and Drug Administration (FDA) regulated cal data. The terminology may be as small as
products (drugs and devices). “yes/no” or a ten-point pain scale for captur-
The Biomedical Research Integrated ing subjects’ symptoms, or it may be as vast as
Domain Group (BRIDG) Model11 is designed a list of all possible drugs or diseases in a sub-
to harmonize models from the HL7 RCRIM, ject’s medical history. In many cases, research-
the Clinical Data Interchange Standards ers will simply compose sets of terms that
Consortium (CDISC) a standards group meet their immediate needs and then require
motivated by the needs of the pharmaceuti- all investigators participating in the study to
cal and bio-technology industry entities that apply them consistently.
sponsor or otherwise support many clinical Because the terms used in clinical research
studies. CDISC provides a standard for sub- are often identical to those used in clini-
mitting regulatory information to the FDA cal care, standard multi-use terminologies
(Fridsma et al. 2008). (such as those described in 7 Chap. 7) are
In a similar manner, HL7 has created a often appropriate for use in capturing clini-
standard Clinical Document Architecture cal research data. However, there are some
(CDA; see 7 Chap. 7) that specifies the struc- aspects of clinical research that are not well
ture and semantics of “clinical documents” represented in mainstream terminologies; and
for the purpose of data exchange. A CDA can in these cases, terminologies and their richer
contain any type of clinical content, including forms, ontologies, that are more focused on
clinical notes typically found in EHRs but may clinical research, are required. In particular,
also include case report forms from research clinical research data and workflow models
studies. HL7 Fast Healthcare Interoperability require controlled terminologies and ontolo-
Resources (FHIR; see 7 Chap. 8) supports gies that define domain-specific concepts and
the exchange of CDA documents (Bender and standard common data elements (CDEs).
Sartipi 2013). Collections of standard terms for CDEs can
The consortia that share clinical data be found in the NIH’s Common Data Element
various consortia have each developed their server (Rubinstein and McInnes 2015).13 In a
own data models to cover the common ele-
ments found in EHRs: patient demograph-
12 7 https://www.sentinelinitiative.org (Accessed Jan-
uary 1, 2019).
11 7 https://www.cdisc.org/standards/domain-infor- 13 7 https://www.nlm.nih.gov/cde (Accessed January
mation-module/bridg (Accessed January 1, 2019). 1, 2019).
Clinical Research Informatics
931 27
similar manner, the Ontology for Biomedical in others. For example, the use of a standard
Investigations (OBI) (Bandrowski et al. 2016) scale for recording a subject’s pain will allow
has been developed by a consortium of rep- comparison of results from a study of one
resentatives from across the spectrum of treatment with those from a second study of
biomedical research, and includes terms to another treatment. The selection of an appro-
represent the design of protocols and data col- priate standard for a particular purpose is not
lection methods, as well as the types of data straightforward (for example, the NIH Pain
obtained and the analyses performed on them. Consortium lists six different scales14). The
There are several reasons for considering choice may be determined simply based on
the use of standard controlled terminologies in the emerging popularity of one terminology
the capture of clinical research data. One rea- over another in a wide community of those
son is to take advantage of clinical data that investigating similar problems. PCORNet,
are already being collected on research sub- OHDSI, and All of Us each specify the use of
jects for other purposes. A common example terminologies such as ICD, SNOMED, CPT,
is the use of data on morbidity and mortal- and LOINC (See 7 Chap. 7).
ity that are collected using one of the various A fourth use of standard terminologies
versions and derivatives of the International relates to reporting requirements. Government
Classification of Diseases (ICD; see 7 Chap. agencies sometimes require the reporting
7). In the US, for example, patient diagnoses of clinical research data and, when they do,
are reported for billing purposes using the often require certain data to be reported using
Clinical Modifications of the tent edition of a particular standard. For example, the FDA
ICD (ICD-10-CM). While such coded infor- requires the use of the Medical Dictionary for
mation is readily available, researchers repeat- Regulatory Activities (MedDRA) for report-
edly find that ICD-10-CM codes assigned ing all adverse events occurring in drug trials
to patient records have an undesired level of (Brown et al. 1999), while the Cancer Therapy
reliability or granularity, especially when com- Evaluation Program (CTEP) at the National
pared to with the actual content of the records Cancer Institute (NCI) requires the use of
(Topaz et al. 2013). Thus, the convenience of Common Terminology Criteria for Adverse
using such standard codes may be outweighed Events (CTCAE) (Colevas and Setser 2004).
by the imprecision, which can adversely affect In an analogous manner, at the international
study design and analytical results. level, the World Health Organization requires
A second reason for adopting a standard the use of the Adverse Reactions Terminology
controlled terminology is simply to avoid (WHO-ART).15 Faced with such reporting
“reinventing the wheel.” As is described in requirements, researchers sometimes choose
7 Chap. 7, a great deal of effort has been to record data in these terminologies as they
expended in the creation of domain-spe- are being captured. In those cases where the
cific terminologies that are comprehensive, clinical questions being answered require
unambiguous, and maintained over time. more detailed data, however, researchers must
Designating such terminologies for use in a resort to recording data with some other stan-
protocol design can relieve researchers of hav- dard (such as SNOMED; see 7 Chap. 7), or a
ing to worry about the quality of the termi- controlled terminology of their own creation,
nology. For example, a researcher is unlikely and then translating them to the terminol-
to encounter novel concepts when recording ogy or terminologies required for reporting
subjects’ demographic data, such as gender, purposes.
marital status, religion, and race. Specifying,
for example, that ISO standards should be
used for these data elements greatly simplifies 14 7 http://nationalpainreport.com/how-to-measure-
the protocol-design process. chronic-pain-8812496.html (Accessed January 1,
2019).
A third reason for choosing standard ter- 15 7 https://www.who-umc.org/vigibase/services/
minologies relates to the ability to compare learn-more-about-who-art (Accessed January 1,
data collected in one study with those collected 2019).
932 P. R. O. Payne et al.
16 7 https://ctep.cancer.gov/protocolDevelopment/
e l e c t ro n i c _ ap p l i c at i o n s / a dve r s e _ eve n t s. h t m 17 7 http://www.icmje.org/journals-following-the-
(Accessed January 1, 2019). icmje-recommendations (Accessed January 1, 2019).
Clinical Research Informatics
933 27
The value of telemedicine for bringing ticipants who are afflicted with COVID-19.
medical expertise to patients located in an CD2H has established the National Clinical
area where expertise is lacking is well demon- COVID Collaboratory (N3C) to create a cen-
strated. In a pandemic, such as the COVID- trally sponsored “data enclave”, composed of
19 outbreak, the patients are by definition de-identified COVID-19 patient records from
located almost everywhere, while expertise CTSA-sponsored institutions that can be used
in this new condition is limited to a relatively directly for data analysis (see . Fig. 27.4).20
small number of academic and government COVID-19 research has placed new
institutions. Perhaps as never before, telemed- demands for analytic support. In part, this is
icine is also needed to protect the clinicians due to the perceived urgency to find answers
and other caregivers who must minimize con- to epidemiologic, preventive, diagnostic, prog-
tact with contagious patients. The technolo- nostic and therapeutic questions. Another
gies of telemedicine are also being exploited issue is diversity of data sharing and harmo-
for “tele-research” where, again, potential nization efforts that exist at the regional and
research subjects are widely dispersed and national levels (described above). Researchers
hard to reach and at the same time pose a are overwhelmed with trying to understand
potential danger to the researchers. COVID- how to navigate various resources, their con-
19 researchers are drawing lessons on research tents, and associated regulatory constraints,
at a distance, including effective recruitment to find the best “fit” given a driving problem
and consent, from previous work in HIV or hypothesis (see . Fig. 27.5).
(Mgbako et al. 2020) and geriatric research
(Nicol et al. 2020).
The rapid expansion of data collection in 27.6 Future Directions for CRI
response to new domains and data types often
outstrips the ability of standards organizations As the preceding sections illustrate, significant
to keep up. This poses challenges for the shar- progress has been made to advance the state
ing, aggregation, and analysis of data for sur- of the CRI domain, and such advances have
veillance, understanding natural history of the already begun to enable significant improve-
disease, and patient recruitment. In COVID-19, ments in the quality and efficiency of clini-
new data types and domains have outstripped cal research. These advances can be viewed
the standards development organization’s abil- as having been achieved at the individual
ity to keep up. As a result, consortia such as investigator level (e.g., improvements in pro-
the the ACT network (see Sect. 29.3.4) and tocol development, study design, participant
CD2H consortium in the CTSA network (see recruitment, etc.), through approaches and
Sect. 29.3.2), both sponsored by NCATS, as resources developed and implemented at the
well as the NIH-sponsored All of Us network institutional level (e.g., development of meth-
(sees Sect. 29.3.8) have had to develop their ods and resources in data warehousing that
own criteria for inferring which patients in enable storage and retrieval of clinical data
EHR databases have COVID-19.18 For ACT, for research, development of novel clinical tri-
the criteria support cross-institutional queries als management systems, etc.), and through
to obtain summary results that can be used for mechanisms that have enabled and facilitated
patient enrollment.19 All of Us is ramping up the endeavors multi-center research consortia
data submissions from healthcare organiza- to drive team science (e.g., innovations that
tions to obtain more timely data on their par- enable data management and interchange for
multi-center studies) (Bourne et al. 2015; P. J.
Embi et al. 2019; Payne et al. 2018; Sanchez-
Pinto et al. 2017; Smoyer et al. 2016).
18 7https://allofus.nih.gov/news-events-and-media/
announcements/coronavirus-update-all-us (Last
accessed June 2, 2020).
19 7 https://ncats.nih.gov/pubs/features/ctsa-act (Last 20 7 https://ncats.nih.gov/n3c (Last accessed June 2,
accessed June 2, 2020). 2020).
934 P. R. O. Payne et al.
27
.. Fig. 27.4 The National COVID Cohort Collabora- monization, creation of a centralized data enclave to
tory is a project at the National Institute for the support collaborative analytics, and development of a
Advancement of Translational Science (NCATS) which synthetic data set based on actual patient data, that can
is pooling electronic data on COVID-19 patients in be downloaded for analysis but poses no risk of reiden-
order to provide researchers with access to data on tification. (See 7 https://covid.cd2h.org. Courtesy
patient sets that are larger than those available in a sin- of the U.S. National Library of Medicine)
gle institution. Efforts include data acquisition and har-
Clinical Research Informatics
935 27
COVID Data Sharing Efforts (MO Perspective)
Detailed Reporting & On-Demand Query &
High-level Reporting Surveillance Benchmarking Analyses
A-D C,D
National
Associations and
Registries
Health Information Public Health Ad-Hoc Data
Exchange(s) Departments Exchange
C EHR Vendors
Local
(dashboards w/relevant
comparators)
Key: (A) transactional electronic data interchange; (B) manual abstraction and reporting; (C) platform-specific and/or standards-based
interfaces or data aggregation workflows; (D) ETL of data from source systems to shared repositories or data models
.. Fig. 27.5 The emergence of the COVID-19 pan- severe disease, and therapeutic outcomes. Efforts include
demic has engendered many efforts to share clinical and local sharing in patient care networks as well as national
epidemiologic data in order to rapidly learn patterns of coalitions established to support both patient care and
natural history, predictions of risk for exposure and research
Looking towards the future of the field of incorporate data from beyond the clinic
CRI, a number of national- and international- and hospital settings. However, the tempo-
level trends are introducing new or evolved rality, granularity, and reliability of such
challenges and opportunities, including: data is vastly different from that collected
55 Nearly ubiquitous adoption of EHRs at via more “traditional” mechanisms as have
the national level, as well as the availability been introduced in this chapter. As a
of lightweight and scalable data-level and result, important questions are now being
platform-independent interoperability raised concerning how such emergent data
standards are making it possible to build sources can/should be integrated with
and leverage learning health care systems those “traditional” data types, and further,
at-scale, in which every patient encounter the methods that are appropriate when
is an opportunity to learn and improve seeking to identify meaningful “signals”
that patients care as well as the care deliv- within ensuing multi-scale and complex
ered to broader populations (Friedman data sets.
et al. 2010). However, while such systems 55 The ready availability of artificial intelli-
do enable the pragmatic collection and gence (AI) platforms and methods (e.g.,
integration of increasingly large volumes knowledge-based systems, cognitive com-
of clinical phenotype data, these “real puting, high-throughput machine learn-
world” data also exhibit new types of chal- ing, deep learning, etc.) are making it
lenges in terms of scope, completeness, possible to perform analyses across and
quality, and domain coverage. As such, between data scales, for example, identify-
new methods are needed to enable the ing meaningful patterns that incorporate
characterization and analysis of such data genomic, clinical, demographic, social,
in a rigorous and reproducible manner. and environmental measures. Such multi-
55 Mobile, wearable, and other sensor tech- scale reasoning is essential to the genera-
nologies, in conjunction with new mecha- tion of evidence in support of precision
nisms for collecting patient reported medicine and/or health paradigms, but
outcome (PRO) data, are allowing for the also introduce critical questions concern-
conduct of clinical research studies that ing how to design and execute such
936 P. R. O. Payne et al.
27
ties that make up the clinical and translational 4. How do the core functional components
research “lifecycle.”. of common clinical trial management
Tenenbaum, J. D., Avillach, P., Benham-Hutchins, systems (CTMS) overlap with or other-
M., Breitenstein, M. K., Crowgey, E. L., wise replicate the functionality of elec-
Hoffman, M. A., et al. (2016). An informatics tronic health record (EHR) systems? To
research agenda to support precision medi- what extent does this similarity or differ-
cine: Seven key areas. Journal of the American ence inform the need for syntactic and/
Medical Informatics Association, 23(4), 791– or semantic interoperability among such
795. This perspective introduces an agenda for systems?
informatics research and practice in the con- 5. In what situations is the use of clinical
27 text of precision medicine. The authors
describe multiple axes of how supporting the-
research-specific terminologies or ontol-
ogies appropriate? In such situations,
oretical frameworks and applied methods can what challenges exists relative to the
generate and deliver evidence in support of selection, use, and maintenance of
precision risk management, diagnosis, and appropriate standards?
treatment planning. 6. What is the role of data standards in
Weng, C., Shah, N., & Hripcsak, G. (2020). Deep enabling the dissemination and reuse of
phenotyping: Embracing complexity and tem- study-generated data sets? How can the
porality—Towards scalability, portability, and use of such standards enable the cross-
interoperability. Journal of Biomedical linkage or integrative analysis of data
Informatics. As clinical and translational sets derived from multiple but indepen-
research becomes increasingly multi- dent studies?
institutional and involves the sharing of deep 7. Compare and contrast the future
phenotypes across and between traditional directions of CRI with those of other
organizational boundaries, there is a need for BMI sub-disciplines and focus areas
common tools and methods to derive and rep- described in this book. To what extent
resent such constructs. This report outlines are they similar and different, and
the current state-of-the-art in terms of com- what are the implications of such
putational phenotyping methods and inter- findings relative to the role of common
change standards. informatics theories and methods and
their applicability to the clinical
??Questions for Discussion research domain?
1. How do the foundational information
needs of clinical research differ
depending on the type and phase of References
study being undertaken? Do study
phases have an impact on the primacy Bandrowski, A., Brinkman, R., Brochhausen, M.,
of such information needs? Brush, M. H., Bug, B., Chibucos, M. C., et al.
2. What is the role of biomedical informat- (2016). The ontology for biomedical investigations.
PLoS One, 11(4).
ics with regard to decreasing bias in Belenkaya, R., Mirhaji, P., Khayter, M., Torok, D.,
RCTs and thus enhancing the internal Khare, R., Ong, T., & Schilling, L. (2015).
validity, external validity, and generaliz- Establishing interoperability standards between
ability of study results? OMOP CDM v4, v5, and PCORnet CDM v1. Paper
3. How can clinical or general-purpose presented at the OHDSI workshop.
Bender, D., & Sartipi, K. (2013). HL7 FHIR: An Agile
information systems and research- and RESTful approach to healthcare information
specific tools be employed synergisti- exchange. Paper presented at the Proceedings of the
cally to address clinical research-specific 26th IEEE international symposium on computer-
information needs, such as participant based medical systems.
recruitment or the population of study- Bhatt, D. L., & Mehta, C. (2016). Adaptive designs for
clinical trials. New England Journal of Medicine,
specific data capture instruments? 375(1), 65–74.
Clinical Research Informatics
939 27
Bourne, P. E., Bonazzi, V., Dunn, M., Green, E. D., (OHDSI): Opportunities for observational research-
Guyer, M., Komatsoulis, G., et al. (2015). The NIH ers. Studies in Health Technology and Informatics,
big data to knowledge (BD2K) initiative. Journal of 216, 574.
the American Medical Informatics Association, Hulley, S. B., Cummings, S. R., Browner, W. S., Grady,
22(6), 1114–1114. D. G., & Newman, T. B. (2013). Designing clinical
Brightling, C. E. (2017). Clinical trial research in focus: research. Lippincott Williams & Wilkins.
Do trials prepare us to deliver precision medicine in Johnson, S. B., Farach, F. J., Pelphrey, K., & Rozenblit,
those with severe asthma? The Lancet Respiratory L. (2016). Data management in clinical research:
Medicine, 5(2), 92–95. Synthesizing stakeholder perspectives. Journal of
Brown, E. G., Wood, L., & Wood, S. (1999). The medi- Biomedical Informatics, 60, 286–293.
cal dictionary for regulatory activities (MedDRA). Klann, J. G., Abend, A., Raghavan, V. A., Mandl, K. D.,
Drug Safety, 20(2), 109–117. & Murphy, S. N. (2016). Data interchange using
Colevas, A., & Setser, A. (2004). The NCI Common i2b2. Journal of the American Medical Informatics
Terminology Criteria for Adverse Events (CTCAE) Association, 23(5), 909–915.
v 3.0 is the new standard for oncology clinical trials. Knosp, B., Barnett, W. K., & Embi, P. J. (2017). Maturity
Journal of Clinical Oncology, 22(14_suppl), 6098– models for research IT and informatics – Reports from
6098. the field. Paper presented at the AMIA Summits on
Collins, F. S., & Varmus, H. (2015). A new initiative on Translational Science, San Francisco, CA.
precision medicine. New England Journal of Large technology companies continue to ramp up
Medicine, 372(9), 793–795. healthcare forays. (2018, September 17). Health
Embi, P. (2013). Clinical research informatics: Survey of Data Management. Retrieved from https://www.
recent advances and trends in a maturing field. healthdatamanagement.com/articles/large-technol-
Yearbook of Medical Informatics, 8(1), 178–184. ogy-companies-continue-to-ramp-up-healthcare-
Embi, P. J., & Payne, P. R. (2009). Clinical research forays
informatics: Challenges, opportunities and defini- Liu, Y., Gu, Z., Xia, S., Shi, B., Zhou, X. N., Shi, Y., &
tion for an emerging domain. Journal of the Liu, J. (2020). What are the underlying transmission
American Medical Informatics Association, 16(3), patterns of COVID-19 outbreak? – An age-specific
316–327. social contact characterization. E Clinical Medicine,
Embi, P. J., & Payne, P. R. (2013). Evidence generating 18, 100354.
medicine: Redefining the research-practice relation- Meinert, E., Milne-Ives, M., Surodina, S., & Lam, C.
ship to complete the evidence cycle. Medical Care, (2020). Agile Requirements Engineering and
51, S87–S91. Software Planning for a Digital Health Platform to
Embi, P. J., & Payne, P. R. (2014). Advancing method- Engage the Effects of Isolation Caused by Social
ologies in Clinical Research Informatics (CRI). Distancing: Case Study. JMIR Public Health
Journal of Biomedical Informatics, 52(C), 1–3. Surveill, 6(2), e19297.
Embi, P. J., Richesson, R., Tenenbaum, J., Kannry, J., Mailman, M. D., Feolo, M., Jin, Y., Kimura, M., Tryka,
Friedman, C., Sarkar, I. N., & Smith, J. (2019). K., Bagoutdinov, R., et al. (2007). The NCBI
Reimagining the research-practice relationship: dbGaP database of genotypes and phenotypes.
Policy recommendations for informatics-enabled Nature Genetics, 39(10), 1181–1186.
evidence-generation across the US health system. Mgbako, O., Miller, E. H., Santoro, A. F., Remien,
JAMIA 2(1), 2–9. R. H., Shalev, N., Olender, S., Gordon, P., &
Fleurence, R. L., Beal, A. C., Sheridan, S. E., Johnson, Sobieszczyk, M. E. (2020). COVID-19, telemedi-
L. B., & Selby, J. V. (2014). Patient-powered research cine, and patient empowerment in HIV care and
networks aim to improve patient care and health research. AIDS Behav, 21, 1–4.
research. Health Affairs, 33(7), 1212–1219. Murphy, S. N., Weber, G., Mendis, M., Gainer, V.,
Fleurence, R. L., Curtis, L. H., Califf, R. M., Platt, R., Chueh, H. C., Churchill, S., & Kohane, I. (2010).
Selby, J. V., & Brown, J. S. (2014). Launching Serving the enterprise and beyond with informatics
PCORnet, a national patient-centered clinical for integrating biology and the bedside (i2b2).
research network. Journal of the American Medical Journal of the American Medical Informatics
Informatics Association, 21(4), 578–582. Association, 17(2), 124–130.
Fridsma, D. B., Evans, J., Hastak, S., & Mead, C. N. Nicol, G. E., Piccirillo, J. F., Mulsant, B. H., & Lenze,
(2008). The BRIDG project: A technical report. E. J. (2020). Action at a distance: Geriatric research
Journal of the American Medical Informatics during a pandemic. Journal of the American
Association, 15(2), 130–137. Geriatrics Society, 68, 922–925. https://doi.
Friedman, C. P., Wong, A. K., & Blumenthal, D. (2010). org/10.1111/jgs.16443.
Achieving a nationwide learning health system. Payne, P. R., Bernstam, E. V., & Starren, J. B. (2018).
Science Translational Medicine, 2(57), 57cm29– Biomedical informatics meets data science: Current
57cm29. state and future directions for interaction. JAMIA
Hripcsak, G., Duke, J. D., Shah, N. H., Reich, C. G., open, 1(2), 136–141.
Huser, V., Schuemie, M. J., et al. (2015). Payne, P. R., Embi, P. J., & Niland, J. (2010).
Observational Health Data Sciences and Informatics Foundational biomedical informatics research in
940 P. R. O. Payne et al.
the clinical and translational science era: A call to Smoyer, W. E., Embi, P. J., & Moffatt-Bruce, S. (2016).
action. Journal of the American Medical Informatics Creating local learning health systems: Think glob-
Association, 17(6), 615–616. ally, act locally. Journal of the American Medical
Payne, P. R., Johnson, S. B., Starren, J. B., Tilson, H. H., Association, 316(23), 2481–2482.
& Dowdy, D. (2005). Breaking the translational bar- Spilker, B. (1984). Guide to clinical studies and developing
riers: The value of integrating biomedical informat- protocols. Raven Press.
ics and translational research. Journal of Sung, N. S., Crowley, W. F., Jr., Genel, M., Salber, P.,
Investigative Medicine, 53(4), 192–201. Sandy, L., Sherwood, L. M., et al. (2003). Central
Payne, P. R., Pressler, T. R., Sarkar, I. N., & Lussier, Y. challenges facing the national clinical research
(2013). People, organizational, and leadership fac- enterprise. Journal of the American Medical
tors impacting informatics support for clinical and Association, 289(10), 1278–1287.
translational research. BMC Medical Informatics Tenenbaum, J. D., Avillach, P., Benham-Hutchins, M.,
Precision Medicine
and Informatics
Joshua C. Denny, Jessica D. Tenenbaum, and Matt Might
Contents
Joshua C. Denny completed this work while working at Vanderbilt University School of Medicine.
and other similar recent fields to all share as a 28.2 sing EHRs for Genomic
U
goal improvement of health for both individ- Discovery
ual patients and populations through use of
data. Indeed, in practice, most of these terms Electronic health records (EHRs) have been
are used fairly interchangeably amongst most an important part of clinical care for decades,
institutions focusing on discovery and imple- but over the last decade have become an
mentation of precision medicine approaches. increasing part of discovery to advance preci-
A key differentiator is that precision medicine sion medicine. While retrospective epidemio-
focuses primarily on its goals through logical research has been performed using
enhanced understandings and new taxono- claims data for decades, the more recent use
mies that lead to redefinition of the diseases of electronic health records for molecular
themselves. Precision medicine shares with research, especially genomics, over the last
these other terms the centuries-old goal of
28 individualizing care for individual patients,
decade has become transformational. EHRs
contain a wealth of dense patient data that are
following Sir William Osler’s maxim: “The valuable for discovery, and that would cost
good physician treats the disease; the great significant sums to reproduce in a research
physician treats the patient who has the dis- cohort (. Fig. 28.2). As such, they have
ease.” The primary difference between now become a valuable source of information for
and then is the deluge of new modalities and retrospective research (Robinson, Wei, Roden,
quantities of information truly allowing us to & Denny, 2018).
leverage big data to solve problems of indi- The first successful use of electronic health
vidual health in new ways. Such scale of huge records for genetic discovery was in 2010.
data sets is needed, for instance, to untangle Four papers published that year used EHR
the contribution of rare genomic variation to data as the sole phenotypic information to
the clinical impact on individual patients. For replicate known genetic effects (Denny,
example, out of 6.4 billion base pairs in the Ritchie, Basford, et al., 2010; Denny, Ritchie,
(diploid) human genome, an average patient Crawford, et al., 2010; Kullo, Ding, Jouni,
genome may have 4.1 million to 5 million Smith, & Chute, 2010; Ritchie et al., 2010).
genetic variants (1000 Genomes Project Ritchie et al. tested for the associations
Consortium et al., 2015). Distilling the impact between 5 diseases and 21 SNPs that were
of these variants in such a way that clinicians known to be associated from prior literature
can make use of this information is a substan- and replicated all associations for which the
tial challenge for both rare and common dis- study was adequately powered (Ritchie et al.,
orders. 2010). The other studies added replications
In this chapter, we will focus on the infor- with endophenotypes of cardiac conduction
matics resources for and implications of preci- and red blood cell traits. These studies
sion medicine. We will also provide a basic included the first EHR-based genome-wide
overview of precision medicine. Informatics is association studies (GWAS; . Fig. 28.3a) –
needed for the capture of data, transforma- studies which test for association between a
tion of data into information and knowledge, phenotype and hundreds of thousands to mil-
and – perhaps most importantly – implemen- lions of single nucleotide polymorphisms
tation of precision medicine – acting on that (SNPs). GWAS is discussed more in 7 Sect.
new knowledge to improve the health of real 28.4.1. Importantly, these studies suggested a
people. Electronic health record (EHRs) are process for developing phenotype algorithms
an important part of precision medicine both that identified both cases and controls as two
by providing a real world data source as well separate groups. Each algorithm used a vari-
as a key modality for its implementation. ety of types of EHR data to identify their
Relevant other chapters to this chapter include populations with high positive predictive val-
7 Chaps. 9 and 26. ues (analytics for phenotype algorithms are
Precision Medicine and Informatics
945 28
.. Fig. 28.2 Density of phenotypic data in an EHR data prior to their enrollment, allowing both cross-
linked biobank. All rows represent data from patients sectional and in silico longitudinal studies. Each point
that enrolled in BioVU, the Vanderbilt DNA biobank. represents real data transformed by square root divided
The first patiented enroll ed in 2007. This figure demon- by 20 of the actual count at that time period. (From
strates that individuals can have decades of extant EHR Robinson et al., 2018)
discussed in more detail in 7 Sect. 28.3 (eMERGE) network, which began in 2007.
below). Another important early study vali- eMERGE had the explicit goal of exploring
dated known SNPs associated with rheuma- the capabilities of EHRs for genomic discov-
toid arthritis using a validated EHR algorithm ery. The initial eMERGE network had five sites
(Kurreeman et al., 2011). This study demon- and has been renewed twice, now in its third
strated the effect sizes from the EHR algo- iteration, with 10 sites. In subsequent itera-
rithms and from prior research studies were tions, eMERGE has grown from a primary
similar. focus of discovery to also include implementa-
Another advance highlighted first in early tion of genomic medicine (Rasmussen-Torvik
EHR studies was the phenome-wide associa- et al., 2014). The first novel discovery out of
tion study (PheWAS; . Fig. 28.3b), which is eMERGE was in 2011 by identifying vari-
discussed in 7 Sect. 28.4.3 below. That the ants in FOXE1 associated with autoimmune
first systematic “PheWAS” was performed in hypothyroidism (some of the results from this
EHRs was enabled by the broad collection of study are shown in . Fig. 28.3) (Denny et al.,
phenotypes found in EHRs that is essentially 2011). Since then, eMERGE sites have inves-
unrelated to an a priori research hypothesis tigated nearly 100 different phenotypes, many
(Denny, Ritchie, Basford, et al., 2010). with novel discoveries (Crawford et al., 2014).
PheWAS approaches have been applied in
observational cohorts as well (Millard et al.,
2015; Sarah A Pendergrass et al., 2013). 28.3 Finding Research-Grade
An important development in the use of Phenotypes in EHRs
EHRs for precision medicine was the National
Human Genome Research Institute-funded A characteristic of EHR phenotypes is the
Electronic Medical Records and Genomics combination of multiple modalities of the
946 J. C. Denny et al.
28 b
.. Fig. 28.3 Example GWAS and PheWAS using EHR A. This PheWAS identified autoimmune thyroid disease
data. a Manhattan plot from a GWAS of autoimmune associated with this variant and highlighted other con-
hypothyroidism performed in the eMERGE Network. ditions, like atrial flutter, that are inversely associated
This GWAS identified variants in FOXE1 as a risk fac- with hypothyroidism. (With permission from Denny
tor for autoimmune hypothyroidisms, a novel finding at et al. 2011 © Elsevier)
the time. b PheWAS of the variant identified in panel
EHR information to identify high-quality Both of these were validated at more than one
research grade phenotypes. Most frequently, site and demonstrated successful genomic dis-
they include billing codes, medications, labo- covery or replication (Denny et al., 2011; Kho
ratory results, and some sort of text process- et al., 2012). Clearly, sufficient sample size
ing or natural language processing. Example (and consequently statistical power) is needed
phenotypes for autoimmune hypothyroidism to identify associations. Thus, the eMERGE
and type 2 diabetes are shown in . Fig. 28.4. network has found it necessary to run pheno-
Precision Medicine and Informatics
947 28
a
Case medications ICD-9 codes for hypothyroidism Abnormal lab values ICD-9 codes for secondary
levothyroxine, synthroid, 244, 244.8, 244.9, 245, 245.2, 245.8, 245.9 TSH > 5 OR FT4 < 0.5 causes of hypothyroidism
levoxyl unithroid, 244.0, 244.1, 244.2, 244.3
armour thyroid, desicated thyroid,
cytomel, triostat, liothyronine, ICD-9 codes for post surgical or
synthetic triiodothyronine, post radiation hypothyroidism
Case Definition
liotrix, thyrolar 193*, 242.0, 242.1, 242.2,
All three conditions required:
242.3, 242.9, 244.0, 244.1, 244.2,
• ICD-9 code for hypothyroidism OR abnormal TSH/FT4 244.3, 258*
Pregnancy exclusion ICD 9
• Thyroid replacement medication use
codes
Any pregnancy billing code or • Require at least 2 instances of either medication or lab with at CPT codes for post radiation
lab test if all Case Definition least 3 months between the first and last instance of medication hypothyroidism
codes, labs, or medications fall and lab 77261, 77262, 77263, 77280, 77285,
within 6 months before 77290, 77295, 77299, 77300, 77301,
pregnancy to one 1 year after 77305, 77310, etc.
pregnancy. Case Exclusions
V22.1, V22.2, 631, 633, 633.0, Exclude if the following information occurs at any time in the record: Exclusion keywords
633.00, 633.1, 633.10, 633.20, • Secondary causes of hypothyroidism multiple endocrine neoplasia. MEN
633.8, 633.80, 633.9, 633.90, • Post surgical or post radiation hypothyroidism I, MEN II, thyroid cancer, thyroid
645.1, 645.2, 646.8, etc. • Other thyroid diseases carcinoma
• Thyroid altering medication
Exclusion keywords Thyroid-altering medications
optiray, radiocontrast, iodine,
omnipaque, visipaque, hypaque, Case Exclusions Phenytoin, Dilantin, Infatabs,
ioversol, diatrizoate, iodixanol, Time dependent case exclusions: Dilantin Kapseals, Dilantin-125,
isovue, iopamidol, conray, Phenytek, Amiodarone Pacerone,
• Recent pregnancy TSH/FT4 Cordarone, Lithium, Eskalith,
iothalamate, renografin, sinografin, • Recent contrast exposure
cystografin, conray, iodipamide Lithobid, Methimazole, Tapazole,
Northyx, Propyithiouracil, PTU
Yes
Yes Rx Insulin? No
No Rx T2DM med
before insulin Rx? Abnormal* Yes
glucose or
Yes
HbA1c?
≥2 dates w/
clinician entered
T2DM Dx
Yes Yes T2DM case Yes
.. Fig. 28.4 EHR phenotype algorithms for Autoim- PheKB.org. Each algorithm was validated by manual
mune Hypothyroidism and Type 2 Diabetes. Details for chart review at multiple institutions. (Figures adapted
each of these algorithms are available on 7 http:// from: a Conway et al., (2011) and b Kho et al., (2012))
type algorithms across different sites to phenotypes across different sites given local
increase sample size. In doing so, eMERGE variability in EHR systems, billing practices,
found it helpful to collaboratively develop and institutional practices. Algorithms from
948 J. C. Denny et al.
<95%
Case & control
algrithm Manual review; ≥95% Deploy in Association
Common development and assess precision cohort tests
phenotype refinement
Simple agorithm
Rare Manual Association
to find possible
phenotype review/deploy tests
cases (controls
are easy)
.. Fig. 28.5 Overview of the general approach to finding a phenotype in the EHR
and those who do not match the algorithm in tions on 7 PheKB.org demonstrated that
the chart review to avoid anchoring bias. In out of 145 posted implementations (as of
this review, the order of the charts would be December 2018), 79 (54%) reported PPV and
randomized and the reviewers blinded to the 33 (23%) reported recall. The condition in
algorithm’s determination of case (or control) which sensitivity is more important is when
status. After review, the investigators can cal- the variable is being used as a covariate in an
culate the positive predictive value (PPV) as analysis.
the number of true positives divided by (the To facility EHR based discovery, huge data
number of true positives plus the number sets are needed. While the growth through the
of false positives). Additionally, researchers early 2000‘s and this decade have often been
may want to calculate a sensitivity (or recall). focused on single EHR systems, there is an
Calculating recall for common diseases may increasing need to combine data across sides
be relatively straightforward but very chal- for discovery. This is important both for the
lenging to assess for rare diseases. Thus, many needs of amassing the necessary size of this
reviewers often make simplifying assumptions size as well as representing diversity of geog-
about requirements to be a case. For example, raphy, demographics, environmental expo-
many researchers make the assumption that a sures, and practice habits, which can vary
case must have at least one relevant ICD code between institutions and geography. In short,
to be a case (Carroll et al., 2012; Liao et al., this effort has been facilitated through use of
2010). This approach provides a reasonable common data representations such as the Fast
estimate of sensitivity. Health Interoperability Resource (FHIR)
PPV is typically viewed as the most and common data models (CDMs). Most fre-
important metric performance for a case- quently used common data models have been
control study, since it is usually more impor- the Informatics for Integrative Biology and
tant to be sure to have high-quality cases and the Bedside (i2b2) data model, PCORNet,
controls for a given analysis. For instance, and the OMOP data model. These are dis-
review of phenotype algorithm implementa- cussed in more detail in 7 Chap. 25.
950 J. C. Denny et al.
28.4 Omic Discovery Approaches tified SNPs associated with 7 common dis-
eases (Wellcome Trust Case Control
Over the last three decades, the explosion Consortium, 2007). As mentioned above, the
of efficient and relatively inexpensive dense first GWAS using EHR information to define
molecular measures has led to the growth of cases and controls was performed in 2010
more data-driven molecular investigations (Denny, Ritchie, Crawford, et al., 2010; Kullo
of traits and diseases. The most commonly et al., 2010). Since then, the number of EHR-
investigated currently would be genomic based GWAS and large consortia using EHR
investigations. Some of these are also finding information has risen dramatically (Wei &
translation into clinical practice (discussed Denny, 2015). The current inclusion of rou-
in 7 Sect. 28.7). The research observation tine healthcare data in very large cohorts (see
of genetic pleiotropy (the condition in which 7 Sect. 28.6 below) has made reliance on
one gene or genetic variant impacts multi- healthcare data such as those from EHRs or
28 ple phenotypes) combined with more dense administrative claims data now common-
phenotypic assessments have led to similar place.
hypothesis-free tests of association of the Since 2009, published GWAS have been
phenome. Specific technologies are discussed curated and made available via the online
further in 7 Chaps. (9 and 26). GWAS Catalog, begun first by NHGRI and
now hosted by EMBL-EBI (7 https://www.
ebi.ac.uk/gwas/) (MacArthur et al., 2017). By
28.4.1 Genome-Wide Association 2010, more than 500 GWASs had been per-
Studies (GWAS) formed on a wide variety of traits in common
disease, and by the end of 2018, this had
A GWAS systematically surveys polymor- grown to 3675 publications reporting one of
phisms across the genome to find variants more GWASs, noting associations between
associated with a trait or disease. Variants for 87,081 SNPs and phenotypes (Green, Guyer,
a GWAS are typically weighted toward more & National Human Genome Research
common SNPs that can represent a broad Institute, 2011; MacArthur et al., 2017).
range of genomic variation based on linkage
disequilibrium, that is the nonrandom cluster-
ing of variation in the genome based on inher- 28.4.2 Genomic Sequencing
itance patterns. Thus, relatively small numbers
of the >3 billion base pairs in the (haploid) The rapidly falling cost of genomic sequencing
human genome can represent a large fraction (to several hundred for a research whole exome
of inherited variation in the genome. Most and less than $1000 for a whole genome as of
GWAS assay >500,000 mostly SNPs common the end of 2018) is leading to a dramatic growth
SNPs. More recent GWAS have incorporated in use of genetic sequencing. The primary
rare variants, such as functional genomic vari- added benefit of genomic sequencing to preci-
ants known to be associated with disease and sion medicine at the current time is a better and
pharmacogenomic variants. more detailed assessment of rare and very rare
The first GWASs (. Fig. 28.3a) was con- variants through a more comprehensive cover-
ducted in 2005 and 2006 and discovered age of the genome. Sequencing approaches
genetic variants associated with Age-related have enabled the discovery of novel variants for
Macular Degeneration of ~100k SNPs common disease and have been especially
(Dewan et al., 2006; Klein et al., 2005). The impactful for the uncovering of variants in rare
modern era of array-based GWAS approach disease. Sequencing is routinely used now clini-
with large case control populations identify- cally to aid cancer care or diagnose rare genetic
ing common variants influencing common diseases. In research, sequencing is rapidly
disease was arguably introduced in a large expanding our ability to discover associations
scale by the 2007 by the Wellcome Trust Case with rare conditions. The NIH’s Undiagnosed
Control Consortium, which successfully iden- Disease Network, for instance, routinely
Precision Medicine and Informatics
951 28
employs whole exome sequencing (WES) or ICD9 and ICD10 codes into >1800 pheno-
whole genome sequencing (WGS) to diagnose type case groups. A 2013 study shows that this
individuals. As a notable win for sequencing, approach was able to replicate 66% of ade-
the UDN has been able to diagnose 35% of quately-powered SNP- phenotype pairs, and
individuals referred into their network, 74% of also identified several new associations that
which were made with the addition of genomic were replicated (Denny et al., 2013). A catalog
sequencing to comprehensive clinical pheno- of some of the PheWAS associations found
typing (Splinter et al., 2018). In addition, they to date is available at 7 http://phewascatalog.
have defined 31 new syndromes through their org. A PheWAS of all phenotypes available
comprehensive clinical and molecular assess- in the UK Biobank has also been performed
ments of undiagnosed patients. (7 http://www.nealelab.is/uk-biobank).
PheWAS can essentially be performed on
any broad collection of phenotypes.
28.4.3 Phenome-Wide Association Researchers have used raw unaggregated ICD
Studies (PheWAS) codes, other aggregation systems of ICD
codes, or phenotypes collected from observa-
Growth of EHR based cohorts provided rich tional cohorts (Hebbring et al., 2013; Pathak,
and diverse phenotype data to complement Kiefer, Bielinski, & Chute, 2012; S A
biologic data. Whereas GWASs provided a Pendergrass et al., 2011). The disadvantage to
way to assess genomic accession associations using more granular ICD codes is the
in a hypothesis-free manner starting around increased number of hypotheses being tested,
2005, GWAS usually assesses only one phe- which hinders the statistical power to detect a
notype at a time. However, the growth of result. Lack of ICD code aggregation can also
GWAS quickly highlighted the occurrence of introduce variability in coding practices that
genetic pleiotropy – the condition in which decreases sample size for a given phenotype,
one gene influences multiple independent phe- such as the number of specific diagnostic
notypes. Thus, the rich collection of diverse codes available to represent common condi-
phenotype information in EHRs and other tions and their complications, such as diabetes
growing cohorts provided the ability to simul- mellitus subtypes (e.g., with specific codes for
taneously access phenotype associations in controlled or uncontrolled glucose status and
the same scanning hypothesis free manner as its resulting cardiovascular, renal, or neuro-
GWAS. The first PheWAS EHR-based aggre- logical complications) or gout (e.g., chronic or
gated billing codes into 744 PheWAS “cases” acute, with or without tophi, etc).
(Denny, Ritchie, Basford, et al., 2010). Each PheWAS can quickly highlight potential
case was linked to a control group. After identi- pleiotropy of a given genetic variant or other
fication of case and control groups, a PheWAS independent variable by analyzing for associa-
(. Fig. 28.3b) is essentially a pairwise test of tions with multiple phenotypes within a single
all phenotypes against an independent vari- population, one can test the independence of
able, such as a genetic variant or laboratory the potential pleiotropic findings with subse-
value. For a genetic variant, PheWAS is analo- quent conditioned analyses. Other advantages
gous to genetic association tests performed in of PheWAS is that they are quick to perform
a GWAS, with a typical approach employing and easily implemented through existing R
a logistic regression adjusted for demographic packages (7 https://github.com/PheWAS/
and genetic variables, such as genetic ances- PheWAS) or iteration through common sta-
try. The first PheWAS tested seven known tistical packages. A disadvantage of PheWAS
SNP-disease associations, replicating four is that its phenotypes can be coarse and can
and suggested a couple of new associations. have both lower sensitivity and PPV than cus-
Newer approaches to PheWAS have lever- tom phenotype algorithms as discussed in
aged increased density of phenotypes from 7 Sect. 28.3. Fortunately, these types of bias
the EHR, which current methods mapping typically biased towards the null. Associations
952 J. C. Denny et al.
found via PheWAS can require refinement 2015; Okada et al., 2014; Wood et al., 2014).
and subsequent validation. Collectively, these genetic variants can explain
a much larger percentage of the variance in
disease risk than the individual risk variants,
28.4.4 Other Omic Investigations even when the effect sizes of many of the indi-
vidual variants may be rather small (e.g., hav-
In addition to genomics, the growth of a num- ing odds ratios of ~1.01). As a tool, researchers
ber of other omic approaches are providing have aggregated genetic risk variants into a
greater insight into an individual’s environ- calculated score (called a “genetic risk score”,
ment, endophenotypes, and molecular mea- GRS, or “polygenic risk score”, PRS), typi-
sures. Some of these include the microbiome, cally as a sum of the presence of the variant
proteome, metabolome, and other bioassays. multiplied by a weight, often taken from a
Additional dense phenotypic and environmen- regression analysis. These risk scores need to
28 tal assessments include dense measures of the account linkage disequilibrium to find inde-
environment and personal sensor-based tech- pendent loci and may also produce a weighted
nologies, such as consumer activity monitors. model using penalized regression. A simple
Publicly available datasets providing detailed approach can be given as:
measures of pollution, the built environment, k
weather patterns, availability of quality food GRS wi N i
or greenspace, and sociodemographic factors i 1
are available for linkage via geolocation, linked
via smartphones and other devices that con- where wi is the weight for the variant (e.g., the
tinuously track geolocation. These devices can log odds ratio from a logistic regression) and
also measure activity and heart rate to provide Ni is the number of risk alleles for that variant
greater insight into a person’s habits and phys- (typically, 0,1, or 2). The clinical advantage of
iological factors. Today, the clinical impact a GRS is that it provides a way to evaluate the
of many of these measures is not yet known. aggregate risk of an individual having a given
However, there growing ubiquity through both disease that takes into account many typically
research and commercial interests are enabling small risk factors.
deeper investigation into their clinical impact. For instance, consider breast cancer
They are also being included in large research genetic testing. It has long been recognized
cohorts (see 7 Sect. 28.6). that variants in BRCA1 and BRCA2 confer
significant increased risk of breast cancer to
carriers of these mutations. While pathogenic
28.5 pproaches to Using Dense
A variants in these genes do confer a large risk
Genomic and Phenomic Data of breast cancer (lifetime risk of 45–65%), the
vast majority of breast cancer is not related to
for Discovery these variants, since they are present in <1%
of the general population (Torkamani,
28.5.1 Combining Genotypes Wineinger, & Topol, 2018). However, com-
and Phenotypes as Risk mon SNPs from a large breast cancer GWAS
Scores published in 2017 represents about 41% of the
familial risk of breast cancer (Michailidou
Most genetic variants discovered via GWAS et al., 2017). Studies with cardiovascular dis-
have had relatively mild effect sizes for their eases have found similar results and potential
phenotype of interest. However, the size of clinical utility for PRS. In a study looking at 5
modern GWAS, now involving hundreds of prospectively-followed cardiovascular cohorts
thousands of individuals for more common with genetic testing found that polygenic risk
traits, have allowed identification of many scores and lifestyle factors were independently
independent genetic loci, sometimes reaching associated with incident cardiovascular events
into the hundreds of distinct loci (Locke et al., (Khera et al., 2016). Moreover, their study
Precision Medicine and Informatics
953 28
identified that patients with high genetic risk 28.5.2 Mendelian Randomization
of cardiovascular disease but healthy lifestyles
were at similar risk to those with unhealthy Mendelian randomization (MR) is a technique
lifestyles but low genetic risk. Importantly, used to provide evidence for the causality of a
healthy lifestyles decreased cardiovascular biomarker on a disease state in conditions in
risk at any genetic risk threshold, suggesting which randomized controlled trials are diffi-
the importance of potential preventative life- cult or too expensive to pursue. For example,
style modifications and therapies in those low density lipoprotein (LDL) and high-
individuals at high genetic risk. density lipoprotein (HDL) levels have long
A more recently introduced approach is to been associated with myocardial infarctions in
do a similar process with phenotypes in a phe- observational cohorts, but it is unclear
notype risk score (PheRS) (Bastarache et al., whether they are markers or causal: perhaps
2018). In the initial demonstration of PheRS, these levels are a marker of diet, activity level,
ICD codes were mapped to phecodes and or other unknown factors. It is essentially
summed weighted based on the inverse log of impossible (and probably unethical) to per-
the frequency of the phecode in the EHR: form a randomized control trial that alters
m someone’s LDL or HDL levels in isolation.
PheRS w p xi , p However, a number of genetic variants have
p 1 been found that alter LDL and HDL levels.
Since alleles are randomly distributed to ova
where: or sperm during meiosis, studying the impact
xi,p = 1 if individuali has phenotypep or 0 of biomarker-influencing alleles provides a
otherwise naturally occurring randomization of the risk
factor. Genetic variants are generally not
N associated with behavioral, social, and some
w p = log
np physiological factors – reducing confounding.
Thus, by studying the impact on the clinical
where np is the number of individuals with outcome of the variants associated with the
phenotype p. By aggregating phenotypes in a biomarker, one can assert causality of the bio-
similar way as genotypes, a combined score marker to the outcome of the variant. MR
can increase the sensitivity to detect the phe- has proven a powerful tool in recent years
notypic impact of a genetic variant. For (. Fig. 28.6). MR studies have demonstrated
example, the PheRS for cystic fibrosis includes clear associations between LDL and triglycer-
component phenotypes such as bronchiecta- ide levels and cardiovascular disease while
sis, pneumonia, infertility, and asthma. The casting doubt on the role of HDL in protect-
disease code itself (“cystic fibrosis”) is not ing against cardiovascular disease (Holmes
part of the PheRS. Based on EHR weighting, et al., 2015; Voight et al., 2012). The latter is
bronchiectasis has a much higher weight than particularly interesting as cholesteryl ester
asthma since asthma is much more common. transfer protein (CETP) inhibitors, medica-
This approach was used in an initial demon- tions targeted to raise HDL, have so far not
stration exercise looking at ~1200 Mendelian been successful at reducing mortality
diseases that could be tested in an EHR popu- (Mohammadpour & Akhlaghi, 2013).
lation. In this study, PheRS was able to iden- By providing an approach to assess cau-
tify diagnosed genetic diseases in the EHR sality, MR can also provide an approach to
using the component phenotypes of disease investigate potential drug effects. An MR
and was also able to be used to identify novel approach demonstrated the the lipid-lowering
pathogenic variants for undiagnosed condi- agent ezetimibe would reduce cardiovascular
tions. disease (by studying the clinical impact of
954 J. C. Denny et al.
genetic variants mimicking its effect) prior to subtypes is unclear. Two examples of where
28 a randomized controlled trial demonstrated targeted treatment for disease is currently
this effect (Cannon et al., 2015; Myocardial being utilized in practice are pharmacoge-
Infarction Genetics Consortium Investigators nomics and oncology, which are discussed in
et al., 2014). Similarly, MR has been used more detail in 7 Sect. 28.7.
to show that diabetes is a potential concern
for PCSK9 inhibitors and combined with
PheWAS to highlight potential unanticipated 28.5.4 se of Machine Learning
U
side effects (Jerome et al., 2018; Schmidt et al., and Artificial Intelligence
2017). to Advance Precision
Medicine
28.5.3 sing Dense Data-Driven
U The focus of this section is largely on the
Measures to “Redefine” application of these methods to advance pre-
Disease cision medicine. Other discussions on machine
learning and artificial intelligence appear in
There is increasing enthusiasm that precision 7 Chap. 9.
medicine will lead to new ways of defining dis- Machine learning falls into two major
ease and selecting treatments that will identify classes of approaches: supervised and unsu-
more rational therapeutic choices, improve pervised, with the ability to apply many differ-
our understanding of prognosis, and result in ent algorithms. Supervised machine learning
more effective disease screening. One example approaches tasks use a gold standard set as
is cystic fibrosis pharmacotherapy, for which input to learn classifiers designed to optimally
medications have been developed to target mimic the training set. Unsupervised machine
defects corresponding with specific genetic learning learn patterns from the data without
variants (O’Reilly & Elphick, 2013). These labeled training sets. Machine learning has the
targeted therapies have dramatic influence on potential to augment any classification task,
the disease course. There is a hope that similar and has long been used with clinical data.
approaches could be found for many common Machine learning has been used for many
diseases aiding in drug selection and risk tasks in EHRs (such as in natural language
stratification for diseases such as depression, processing (Jiang et al., 2011; Y. Wu et al.,
diabetes, hypertension, heart disease, and 2017)) and in bioinformatics, such as aiding in
many other common diseases. Recent studies interpretation of genetic variants (Kircher
using clinical and molecular information have et al., 2014). Some of these use cases are refer-
suggested subtypes of type 2 diabetes, heart enced above, such as the learning of phenotype
failure, and autism (Ahmad et al., 2014; classifiers using EHR data and labeled cases or
Doshi-Velez, Ge, & Kohane, 2014; Li et al., controls (Carroll, Eyler, & Denny, 2011; Liao
2015); however, the clinical impact of such et al., 2010; Lin et al., 2015; Peissig et al., 2014).
Precision Medicine and Informatics
955 28
Recent areas of exploration in machine to cardiovascular disease risk. Its initial dis-
learning have seen a rapid rise in deep learn- coveries were derived from detailed longitudi-
ing approaches. These take massive datasets nal assessment of just under 5000 individuals.
and multi-layered neural networks to learn Most of these epidemiological cohorts have
patterns in data that have proven superior largely focused on answering exposure- or
to other machine learning techniques. They disease-focused questions. Two developments
typically require very large data sets that have beginning in the early 2000s have brought
previously been unavailable in healthcare or new data resources to advanced discovery in
biomedical research. However, the recent rapid more disease-neutral fashion. One has been
growth of available EHR, genomic, and imag- the growth of large national-scale cohorts
ing data sources is enabling a new potential for containing diverse phenotypic information
machine learning to be applied to these data connected with biosamples, and the other has
sets as well. Recent examples include training been the growth of incorporation of routinely
algorithms to identify malignant skin lesions collected healthcare data linked to biologi-
and diabetic retinopathy from retinal scan cal specimens. Example cohorts include the
(Esteva et al., 2017; Gulshan et al., 2016). UK Biobank, the Million Veteran Program,
These algorithms can present with perfor- the All of Us Research Program, and the
mance that equals that of trained physicians at China Kadoorie Biobank (. Table 28.2).
times. A challenge of these approaches is that Collectively, these cohorts will enroll millions
they required huge data sets: the Gulshan et al. of individuals across the world for longitudi-
algorithm which used nearly 130,000 labeled nal assessment of healthcare outcomes ana-
retinal digital images to train its diabetic reti- lyzed against molecular and environmental
nopathy algorithm (Gulshan et al., 2016). The exposures. Each of these cohorts includes
growing availability of large-scale public bio- both participant-generated survey data and
medical data through cohorts such as those healthcare-derived data linked with biospeci-
mentioned in 7 Sect. 28.6 represent an impor- mens. Several of these cohorts also include the
tant opportunity to accelerate such research. ability to recontact individuals. Collectively,
In addition, a number of for-profit companies, these multiple complementary avenues of
such as Alphabet, IBM, and many startups, phenotype assessment augment passive
have formed partnerships with diverse clini- phenotype collection (e.g., with EHR and
cal entities from individual healthcare systems claims-type data) with participant-provided
to the United Kingdom’s National Health information and the potential for reassess-
Service (Saria, Butte, & Sheikh, 2018). ment with deeper phenotyping along topics of
interest. Within All of Us, the participant pro-
vided survey information and the in-person
28.6 arge Cohorts to Advance
L research protocol physical measures are both
Precision Medicine Discovery being incorporated into the OMOP Common
Data Model to simplify comparison of these
Clinical care since the 1960s has been dra- data modalities. Digital engagement through
matically influenced by observational cohort email or websites or the collection of health-
studies. Studies such as the Framingham care information enable cost-efficient follow-
Heart Study, the Nurses Health Study, or up for healthcare outcomes over long periods
National Health and Nutrition Examination of time. Some of these larger resources are
Survey (NHANES) have produced dramatic also pioneering newer models of researcher
insights that have fundamentally changed our access that facilitate broad researcher commu-
understanding of modifiable risk factors for nities to access the environments. An impor-
many diseases. For instance, the Framingham tant aspect for these cohorts is the diversity
Heart Study taught us that blood pressure, of its participants, which is discussed in the
cholesterol, smoking, and activity contribute next section.
956 J. C. Denny et al.
Limited to cohorts exceeding 100,000 individuals with biosamples. Sizes reported are as of 11/2018
eMERGE The Electronic Medical Records and Genomics Network
28 Genomics England U.K. 7 www. WGS for rare disease and cancer for 100 k
genomicsengland.
co.uk/
Thailand SE Proactive genotyping for SJS/TEN risk alleles
Asia in carbamazepine-exposed patients
Sanford U.S. 7 imagenetics. PGx and AMV among primary care
sanfordhealth.org/ population
All of Us Research U.S. 7 joinallofus.org Stated goal of PGx and AMV for >1 million
Program
Geisinger MyCode U.S. 7 www.geisinger.org/ AMV; about 190 k enrolled
mycode
eMERGE The Electronic Medical Records and Genomics Network, IGNITE Implementing Genomics into
Practice
.. Fig. 28.7 Screenshot of clinical decision support advisor for Clopidogrel pharmacogenetic advice
960 J. C. Denny et al.
28.8 Sequencing Early in Life We offer one final example in which genome
sequencing was used as a last resort in a medical
One crucial complication in the search for odyssey to identify the cause of a mysterious
genomic explanations for any given disease or bowel condition in a 4-year-old boy named
phenotype is the impact of environmental inter- Nicholas Volker (Worthey et al., 2011). Having
actions. Over time, every person on earth is ruled out every diagnosis they could conceive
exposed to environmental factors that may dif- of, doctors resorted to exome sequencing, lead-
fer based not only on a factory that disposes of ing to the identification of 16,124 mutations, of
industrial waste near a drinking water supply or which 1527 were novel. A causal mutation was
the traffic on the street they grew up on, but also discovered in the gene XIAP. This gene was
by the foods they eat, the climates in which they already known to play a role in XLP, or X-linked
live, and the infections they have harbored. lymphoproliferative syndrome and retrospec-
Those external variables, hard to control for and tive review showed that colitis had been
28 sometimes even to know, can have major effects observed in 2 XLP patients in the past. Based
on the downstream products and activities of on these findings, a cord blood transplant was
one’s genomic fingerprint. Early in life, however, performed, and 2 years later, Nic’s intestinal
those effects are less pronounced. Of course, the issues had not returned. News coverage of this
impact of the in-utero environment on the well- story by the Milwaukee Journal Sentinel was
being of the developing fetus is well established. awarded a Pulitzer Prize for explanatory report-
But a genetic defect is much more likely to be ing (Journal Sentinel wins Pulitzer Prize for
the cause in a newborn with an unidentified dis- “One in a Billion” DNA series, n.d.).
ease than in an adult patient who has undergone
a lifetime of environmental insults. In this vein,
a number of initiatives have been established 28.9 Direct to Consumer Genetics
across the US to offer clinical sequencing ser-
vices for young patients, including programs at In the wake of the human genome project and
Children’s Hospital of Philadelphia, Duke the commoditization of genotypic data, a
University, Partners Healthcare, the Baylor number of companies were founded to pro-
College of Medicine, and the Medical College vide consumers with their own genetic infor-
of Wisconsin. More controversial on paper, and mation directly. These direct-to-consumer
not yet being performed in practice, is prenatal (DTC) genomic companies began making the
genome sequencing. Ethicists are exploring the services broadly available when deCODE
potential implications of this possible direction genetics launched the deCODEme service in
(Donley, Hull, & Berkman, 2012). November 2007, followed a few days later by
Addressing the time and resources needed 23andMe. Navigenics was launched the fol-
to perform genome interpretation, one striking lowing spring. These companies offered con-
success story was achieved at Children’s Mercy sumers the opportunity to provide a saliva
Hospitals and Clinics in Kansas City, MO specimen or buccal swab through the mail,
(Saunders et al., 2012). Investigators used an and in exchange to receive genotypic informa-
Illumina HiSeq 2500 machine and an internally- tion for a range of known genetic markers.
developed automated analysis pipeline to per- Different companies emphasized different
form whole-genome sequencing and make a aspects of genetic testing. Navigenics focused
differential diagnosis for genetic disorders in on known disease risk markers, while
under 50 h. The diagnoses in question are 23andMe was much broader, including dis-
among the ~3500 known monogenetic disor- ease markers but also ancestry information
ders that have been characterized. In this case, and “recreational” genetic information, for
WGS is not being used to identify novel, previ- example earwax type and the ability to smell a
ously unknown mutations. Rather, it is shorten- distinct odor in urine after eating asparagus.
ing the path to diagnosis to just over 2 days Navigenics offered free genetic counseling as
instead of the more traditional 4–6 weeks as a part of their service, while 23andMe and
battery of tests were performed sequentially. deCODEme provided referrals to genetic
Precision Medicine and Informatics
961 28
counselors. A study of concordance between implementation. The irony of the ability to
these three services found >99.6% agreement personalize care based on an individual’s
among them, but in some cases the predicted makeup is that it requires huge data sets of
relative risks differed in magnitude or even many individuals densley phenotyped to have
direction (Imai, Kricka, & Fortina, 2011). statistical power to make predictions for rare
This disagreement is likely due to differences variants, diseases, and outcomes. Thus, preci-
in the specific SNPs and the reference popula- sion medicine requires that we have large data
tion used to calculate risk. sets that are shareable and available for
From the companies’ perspectives, their research. We will also need to effectively enroll
customers offer a rich resource of genomic diverse populations and ensure that the data
data for potential research and data mining. includes both molecular data and social behav-
23andMe created a research initiative called ioral determinants of health. In addition, the
23andWe through which they enlist custom- ability to make accurate decisions for the indi-
ers “to collaborate with us on cutting-edge vidual patient requires implementation in the
genetic research.”(23andWe: The First Annual EHR, as the amount of data required to make
Update – 23andMe Blog, n.d.) They invite decisions is vast and changing quickly.
users to fill out questionnaires and then use the
phenotypic information to perform genome- nnSuggested Readings
wide analysis studies. This approach enabled Denny, J. C., Bastarache, L., & Roden, D. M.
researchers at the company to replicate a num- (2016). Phenome-Wide Association Studies as
ber of known associations, and to discover a a Tool to Advance Precision Medicine. Annual
number of novel associations, recreational Review of Genomics and Human Genetics,
though they may be, for curly hair, freckling, 17(1), 353–373. https://doi.org/10.1146/
sunlight-induced sneezing, and the ability to annurev-genom-090314-024956. Provides an
smell a metabolite in urine after eating aspara- overview and history of phenome-wide asso-
gus (Tung et al., 2011). deCODE, purchased ciation studies. Different approaches to
by Amgen in 2012, boasts a large number of PheWAS are described, along with the biases,
medically significant genetic discoveries to have advantages, and disadvantages of each.
come out of their volunteer registry of 160,000 Green, E. D., Guyer, M. S., & National Human
Icelanders, more than half of the adult popu- Genome Research Institute. (2011). Charting a
lation of that country (SCIENCE | deCODE course for genomic medicine from base pairs to
genetics, n.d.). Navigenics was purchased by bedside. Nature, 470(7333), 204–213. https://doi.
Life Technologies (now part of Thermo Fisher org/10.1038/nature09764. Provides an overview
Scientific Inc.) in 2012 and no longer offers of the NHGRI strategic plan through 2020,
their Health Compass genetic testing service. including the plan moving discovery in large
cohorts to implementation in clinical enterprises.
Kirby, J. C., Speltz, P., Rasmussen, L. V., Basford,
28.10 Conclusion M., Gottesman, O., Peissig, P. L., … Denny, J.
C. (2016). PheKB: a catalog and workflow for
Physicians have always sought to provide care creating electronic phenotype algorithms for
personalized to the individual. The current era transportability. Journal of the American
of large and deep data about individual Medical Informatics Association, 23(6), 1046–
patients is ushering in the promise of precision 1052. https://doi.org/10.1093/jamia/ocv202.
medicine that tailors care to the individual Introduces the Phenotype KnowledgeBase
based on factors not previously observable by website, which contains phenotype algorithms
the clinician, such as genomic data, predictive and related comments, plus implementation
patterns derived from mining clinical data, or and validation data, for finding cases and con-
dense sensors tracking activity and heart rate trols for genomic analysis from EHR data.
at density previously not possible. For preci- The paper includes some summary tables and
sion medicine to become a reality, we will need experiences from the first several years of
informatics, to enable both its discovery and uploaded EHR phenotype algorithms.
962 J. C. Denny et al.
Newton, K. M., Peissig, P. L., Kho, A. N., Bielinski, ??Questions for Discussion
S. J., Berg, R. L., Choudhary, V., … Denny, J. 1. Design a study to assess the genomic
C. (2013). Validation of electronic medical influences of a disease or drug response
record-based phenotyping algorithms: results phenotype using EHR data. Who
and lessons learned from the eMERGE net- would be your cases and controls? What
work. Journal of the American Medical features would define each case and
Informatics Association, 20(e1), e147–54. control, and how would you validate
https://doi.org/10.1136/amiajnl-2012-000896. that the algorithms you picked for cases
This paper provides best practices and lessons and controls were indeed finding the
learned from the Electronics Medical Records patients you wanted to find?
and Genomics (eMERGE) Network for how 2. Research studies traditionally have not
research-grade phenotypes are found from returned their research results to study
EHR data. This paper includes phenotype subjects. However, genetic studies are
28 algorithm design, creation, and validation pro- on the forefront of changing paradigms
cess, as well as some experiences regarding in this space. What do you think about
what worked well and what did not. the implications of returning results to
Pulley, J. M., Denny, J. C., Peterson, J. F., Bernard, patients? How would you feel if you
G. R., Vnencak-Jones, C. L., Ramirez, A. H., were a subject in a research study?
… Roden, D. M. (2012). Operational imple- Would you want results back or not?
mentation of prospective genotyping for per- 3. What are the implications of returning
sonalized medicine: the design of the results of actionable genetic variants
Vanderbilt PREDICT project. Clinical (such as those causing breast and
Pharmacology and Therapeutics, 92(1), 87–95. ovarian cancer) found incidentally
https://doi.org/10.1038/clpt.2011.371. This during research studies or clinical
paper describes one of the first prospective testing purposes?
implementations of pharmacogenomics. 4. What are some ways in which precision
Patients were selected based on their risk for medicine may improve health disparities
potentially needing a medication affected by between different populations? In what
pharmacogenes. They were tested on a multi- ways might precision medicine worsen
plexed platform, and then medication recom- them? How can researchers promote
mendations were provided through research that ameliorates this risk?
computer-based provider order entry decision 5. What are some requirements for a
support. The first implementation was health system or a physician in the
CYP2C19 and clopidogrel (an antiplatelet context of pharmacogenomic testing?
medication), but because the platform tested 6. Given that genomics do not generally
multiple pharmacovariants, drug-genome change over the lifetime, how can a
interactions could be added over time. patient take their genomic test results
Wellcome Trust Case Control Consortium. (2007). from one institution to another? What
Genome-wide association study of 14,000 technological and non-technological
cases of seven common diseases and 3,000 solutions could be employed to allow a
shared controls. Nature, 447(7145), 661–678. patient to take their genetic results with
https://doi.org/10.1038/nature05911. This was them?
one of the first large scale genome-wide asso- 7. Discuss the strengths and weaknesses
ciation studies, which found common genetic of EHRs for precision medicine studies
variants influencing seven common diseases. of diseases, drug responses, and
One interesting component, discovered loci exposures. What kinds of exposures
for type 2 diabetes, was in FTO, whose effect and health outcomes does an EHR
on diabetes risk is largely mediated through excel at capturing and where would
adiposity. This shows the importance of con- traditional survey or in-person
sidering phenotypes along the causal pathway assessment measures perform better?
when performing GWAS.
Precision Medicine and Informatics
963 28
Bibliography Delaney, J. T., Ramirez, A. H., Bowton, E., Pulley, J. M.,
Basford, M. A., Schildcrout, J. S., et al. (2012).
Predicting clopidogrel response using DNA samples
1000 Genomes Project Consortium, Auton, A., Brooks,
linked to an electronic health record. Clinical
L. D., Durbin, R. M., Garrison, E. P., Kang, H. M.,
Pharmacology and Therapeutics, 91(2), 257–263.
et al. (2015). A global reference for human genetic
https://doi.org/10.1038/clpt.2011.221.
variation. Nature, 526(7571), 68–74. https://doi.
Denny, J. C., Bastarache, L., Ritchie, M. D., Carroll,
org/10.1038/nature15393.
R. J., Zink, R., Mosley, J. D., et al. (2013). Systematic
23andWe: The First Annual Update – 23andMe Blog.
comparison of phenome-wide association study of
(n.d.). Retrieved from https://blog.23andme.
electronic medical record data and genome-wide
com/23andme-and-you/23andwe-the-first-annual-
association study data. Nature Biotechnology,
update/
31(12), 1102–1110. https://doi.org/10.1038/
Ahmad, T., Pencina, M. J., Schulte, P. J., O’Brien, E.,
nbt.2749.
Whellan, D. J., Piña, I. L., et al. (2014). Clinical
Denny, J. C., Crawford, D. C., Ritchie, M. D., Bielinski,
implications of chronic heart failure phenotypes
S. J., Basford, M. A., Bradford, Y., et al. (2011).
defined by cluster analysis. Journal of the American
Variants near FOXE1 are associated with hypothy-
College of Cardiology, 64(17), 1765–1774. https://
roidism and other thyroid conditions: Using elec-
doi.org/10.1016/j.jacc.2014.07.979.
tronic medical records for genome- and
Bastarache, L., Hughey, J. J., Hebbring, S., Marlo, J.,
phenome-wide studies. American Journal of Human
Zhao, W., Ho, W. T., et al. (2018). Phenotype risk
Genetics, 89(4), 529–542. https://doi.org/10.1016/j.
scores identify patients with unrecognized
ajhg.2011.09.008.
Mendelian disease patterns. Science, 359(6381),
Denny, J. C., Ritchie, M. D., Basford, M. A., Pulley,
1233–1239. https://doi.org/10.1126/science.aal4043.
J. M., Bastarache, L., Brown-Gentry, K., et al.
Bentley, A. R., Callier, S., & Rotimi, C. N. (2017).
(2010). PheWAS: Demonstrating the feasibility of a
Diversity and inclusion in genomic research: Why
phenome-wide scan to discover gene-disease associ-
the uneven progress? Journal of Community
ations. Bioinformatics, 26(9), 1205–1210. https://doi.
Genetics, 8(4), 255–266. https://doi.org/10.1007/
org/10.1093/bioinformatics/btq126.
s12687-017-0316-6.
Denny, J. C., Ritchie, M. D., Crawford, D. C.,
Cannon, C. P., Blazing, M. A., Giugliano, R. P.,
Schildcrout, J. S., Ramirez, A. H., Pulley, J. M.,
McCagg, A., White, J. A., Theroux, P., et al. (2015).
et al. (2010). Identification of genomic predictors of
Ezetimibe added to statin therapy after acute coro-
atrioventricular conduction: Using electronic medi-
nary syndromes. The New England Journal of
cal records as a tool for genome science. Circulation,
Medicine, 372(25), 2387–2397. https://doi.
122(20), 2016–2021. https://doi.org/10.1161/
org/10.1056/NEJMoa1410489.
CIRCULATIONAHA.110.948828.
Carroll, R. J., Eyler, A. E., & Denny, J. C. (2011). Naïve
Dewan, A., Liu, M., Hartman, S., Zhang, S. S.-M., Liu,
electronic health record phenotype identification for
D. T. L., Zhao, C., et al. (2006). HTRA1 promoter
rheumatoid arthritis. AMIA Annual Symposium
polymorphism in wet age-related macular degenera-
Proceedings, 2011, 189–196.
tion. Science, 314(5801), 989–992. https://doi.
Carroll, R. J., Thompson, W. K., Eyler, A. E.,
org/10.1126/science.1133807.
Mandelin, A. M., Cai, T., Zink, R. M., et al. (2012).
Donley, G., Hull, S. C., & Berkman, B. E. (2012).
Portability of an algorithm to identify rheumatoid
Prenatal whole genome sequencing: Just because we
arthritis in electronic health records. Journal of the
can, should we? The Hastings Center Report, 42(4),
American Medical Informatics Association, 19(e1),
28–40. https://doi.org/10.1002/hast.50.
e162–e169. https://doi.org/10.1136/amiajnl-2011-
Doshi-Velez, F., Ge, Y., & Kohane, I. (2014).
000583.
Comorbidity clusters in autism spectrum disorders:
Cohen, J. C., Boerwinkle, E., Mosley, T. H., & Hobbs,
An electronic health record time-series analysis.
H. H. (2006). Sequence variations in PCSK9,
Pediatrics, 133(1), e54–e63. https://doi.org/10.1542/
low LDL, and protection against coronary heart
peds.2013-0819.
disease. The New England Journal of Medicine,
Eadon, M. T., Desta, Z., Levy, K. D., Decker, B. S.,
354(12), 1264–1272. https://doi.org/10.1056/
Pierson, R. C., Pratt, V. M., et al. (2016).
NEJMoa054013.
Implementation of a pharmacogenomics consult
Conway, M., Berg, R. L., Carrell, D., Denny, J. C., Kho,
service to support the INGENIOUS trial. Clinical
A. N., Kullo, I. J., et al. (2011). Analyzing the het-
Pharmacology and Therapeutics, 100(1), 63–66.
erogeneity and complexity of Electronic Health
https://doi.org/10.1002/cpt.347.
Record oriented phenotyping algorithms. AMIA
Esteva, A., Kuprel, B., Novoa, R. A., Ko, J., Swetter,
Annual Symposium Proceedings, 2011, 274–283.
S. M., Blau, H. M., & Thrun, S. (2017).
Crawford, D. C., Crosslin, D. R., Tromp, G., Kullo, I. J.,
Dermatologist-level classification of skin cancer
Kuivaniemi, H., Hayes, M. G., et al. (2014). eMER-
with deep neural networks. Nature, 542(7639), 115–
GEing progress in genomics-the first seven years.
118. https://doi.org/10.1038/nature21056.
Frontiers in Genetics, 5, 184. https://doi.org/10.3389/
Green, E. D., Guyer, M. S., & National Human Genome
fgene.2014.00184.
Research Institute. (2011). Charting a course for
964 J. C. Denny et al.
genomic medicine from base pairs to bedside. et al. (2012). Use of diverse electronic medical
Nature, 470(7333), 204–213. https://doi.org/10.1038/ record systems to identify genetic risk for type 2 dia-
nature09764. betes within a genome-wide association study.
Gulshan, V., Peng, L., Coram, M., Stumpe, M. C., Wu, Journal of the American Medical Informatics
D., Narayanaswamy, A., et al. (2016). Development Association, 19(2), 212–218. https://doi.org/10.1136/
and validation of a deep learning algorithm for amiajnl-2011-000439.
detection of diabetic retinopathy in retinal fundus Kirby, J. C., Speltz, P., Rasmussen, L. V., Basford, M.,
photographs. The Journal of the American Medical Gottesman, O., Peissig, P. L., et al. (2016). PheKB:
Association, 316(22), 2402–2410. https://doi. A catalog and workflow for creating electronic phe-
org/10.1001/jama.2016.17216. notype algorithms for transportability. Journal of
Hebbring, S. J., Schrodi, S. J., Ye, Z., Zhou, Z., Page, the American Medical Informatics Association,
D., & Brilliant, M. H. (2013). A PheWAS approach 23(6), 1046–1052. https://doi.org/10.1093/jamia/
in studying HLA-DRB1*1501. Genes and ocv202.
Immunity, 14(3), 187–191. https://doi.org/10.1038/ Kircher, M., Witten, D. M., Jain, P., O’Roak, B. J.,
gene.2013.2. Cooper, G. M., & Shendure, J. (2014). A general
28 Holmes, M. V., Asselbergs, F. W., Palmer, T. M., Drenos,
F., Lanktree, M. B., Nelson, C. P., et al. (2015).
framework for estimating the relative pathogenicity
of human genetic variants. Nature Genetics, 46(3),
Mendelian randomization of blood lipids for coro- 310–315. https://doi.org/10.1038/ng.2892.
nary heart disease. European Heart Journal, 36(9), Klein, R. J., Zeiss, C., Chew, E. Y., Tsai, J. Y., Sackler,
539–550. https://doi.org/10.1093/eurheartj/eht571. R. S., Haynes, C., et al. (2005). Complement factor
Imai, K., Kricka, L. J., & Fortina, P. (2011). Concordance H polymorphism in age-related macular degenera-
study of 3 direct-to-consumer genetic-testing ser- tion. Science (New York, N.Y), 308(5720), 385–389.
vices. Clinical Chemistry, 57(3), 518–521. https:// Kullo, I. J., Ding, K., Jouni, H., Smith, C. Y., & Chute,
doi.org/10.1373/clinchem.2010.158220. C. G. (2010). A genome-wide association study of
Imran, T. F., Posner, D., Honerlaw, J., Vassy, J. L., Song, red blood cell traits using the electronic medical
R. J., Ho, Y.-L., et al. (2018). A phenotyping algo- record. PLoS One, 5(9). https://doi.org/10.1371/
rithm to identify acute ischemic stroke accurately journal.pone.0013011.
from a national biobank: The million veteran pro- Kurreeman, F., Liao, K., Chibnik, L., Hickey, B., Stahl,
gram. Clinical Epidemiology, 10, 1509–1521. https:// E., Gainer, V., et al. (2011). Genetic basis of autoan-
doi.org/10.2147/CLEP.S160764. tibody positive and negative rheumatoid arthritis
Jerome, R. N., Pulley, J. M., Roden, D. M., Shirey-Rice, risk in a multi-ethnic cohort derived from electronic
J. K., Bastarache, L. A., Bernard, G., et al. (2018). health records. American Journal of Human
Using human “experiments of nature” to predict Genetics, 88(1), 57–69. https://doi.org/10.1016/j.
drug safety issues: An example with PCSK9 inhibi- ajhg.2010.12.007.
tors. Drug Safety, 41(3), 303–311. https://doi. Li, L., Cheng, W.-Y., Glicksberg, B. S., Gottesman, O.,
org/10.1007/s40264-017-0616-0. Tamler, R., Chen, R., et al. (2015). Identification of
Jiang, M., Chen, Y., Liu, M., Rosenbloom, S. T., Mani, type 2 diabetes subgroups through topological anal-
S., Denny, J. C., & Xu, H. (2011). A study of ysis of patient similarity. Science Translational
machine-learning-based approaches to extract clini- Medicine, 7(311), 311ra174. https://doi.org/10.1126/
cal entities and their assertions from discharge sum- scitranslmed.aaa9364.
maries. Journal of the American Medical Informatics Liao, K. P., Cai, T., Gainer, V., Goryachev, S., Zeng-
Association, 18(5), 601–606. https://doi.org/10.1136/ treitler, Q., Raychaudhuri, S., et al. (2010).
amiajnl-2011-000163. Electronic medical records for discovery research in
Journal Sentinel wins Pulitzer Prize for “One in a rheumatoid arthritis. Arthritis Care & Research,
Billion” DNA series. (n.d.). Retrieved from http:// 62(8), 1120–1127. https://doi.org/10.1002/acr.20184.
archive.jsonline.com/news/milwaukee/120091754. Lin, C., Karlson, E. W., Dligach, D., Ramirez, M. P.,
html/ Miller, T. A., Mo, H., et al. (2015). Automatic iden-
Kaneko, A., Lum, J. K., Yaviong, L., Takahashi, N., tification of methotrexate-induced liver toxicity in
Ishizaki, T., Bertilsson, L., et al. (1999). High and patients with rheumatoid arthritis from the elec-
variable frequencies of CYP2C19 mutations: tronic medical record. Journal of the American
Medical consequences of poor drug metabolism in Medical Informatics Association, 22(e1), e151–e161.
Vanuatu and other Pacific islands. Pharmacogenetics, https://doi.org/10.1136/amiajnl-2014-002642.
9(5), 581–590. Locke, A. E., Kahali, B., Berndt, S. I., Justice, A. E.,
Khera, A. V., Emdin, C. A., Drake, I., Natarajan, P., Pers, T. H., Day, F. R., et al. (2015). Genetic studies
Bick, A. G., Cook, N. R., et al. (2016). Genetic risk, of body mass index yield new insights for obesity
adherence to a healthy lifestyle, and coronary dis- biology. Nature, 518(7538), 197–206. https://doi.
ease. The New England Journal of Medicine, 375(24), org/10.1038/nature14177.
2349–2358. https://doi.org/10.1056/ MacArthur, J., Bowler, E., Cerezo, M., Gil, L., Hall, P.,
NEJMoa1605086. Hastings, E., et al. (2017). The new NHGRI-EBI
Kho, A. N., Hayes, M. G., Rasmussen-Torvik, L., Catalog of published genome-wide association
Pacheco, J. A., Thompson, W. K., Armstrong, L. L., studies (GWAS Catalog). Nucleic Acids Research,
Precision Medicine and Informatics
965 28
45(D1), D896–D901. https://doi.org/10.1093/nar/ Pendergrass, S. A., Brown-Gentry, K., Dudek, S., Frase,
gkw1133. A., Torstenson, E. S., Goodloe, R., et al. (2013).
Michailidou, K., Lindström, S., Dennis, J., Beesley, J., Phenome-wide association study (PheWAS) for
Hui, S., Kar, S., et al. (2017). Association analysis detection of pleiotropy within the Population
identifies 65 new breast cancer risk loci. Nature, Architecture using Genomics and Epidemiology
551(7678), 92–94. https://doi.org/10.1038/ (PAGE) Network. PLoS Genetics, 9(1), e1003087.
nature24284. https://doi.org/10.1371/journal.pgen.1003087.
Millard, L. A. C., Davies, N. M., Timpson, N. J., Tilling, Pendergrass, S. A., Brown-Gentry, K., Dudek, S. M.,
K., Flach, P. A., & Davey Smith, G. (2015). Torstenson, E. S., Ambite, J. L., Avery, C. L., et al.
MR-PheWAS: Hypothesis prioritization among (2011). The use of phenome-wide association stud-
potential causal effects of body mass index on many ies (PheWAS) for exploration of novel genotype-
outcomes, using Mendelian randomization. phenotype relationships and pleiotropy discovery.
Scientific Reports, 5, 16645. https://doi.org/10.1038/ Genetic Epidemiology, 35(5), 410–422. https://doi.
srep16645. org/10.1002/gepi.20589.
Mohammadpour, A. H., & Akhlaghi, F. (2013). Future Perera, M. A., Cavallari, L. H., Limdi, N. A., Gamazon,
of cholesteryl ester transfer protein (CETP) inhibi- E. R., Konkashbaev, A., Daneshjou, R., et al.
tors: A pharmacological perspective. Clinical (2013). Genetic variants associated with warfarin
Pharmacokinetics, 52(8), 615–626. https://doi. dose in African-American individuals: A genome-
org/10.1007/s40262-013-0071-8. wide association study. The Lancet, 382(9894), 790–
Myocardial Infarction Genetics Consortium 796. https://doi.org/10.1016/S0140-6736(13)60681-9.
Investigators, Stitziel, N. O., Won, H.-H., Morrison, Peterson, J. F., Field, J. R., Unertl, K. M., Schildcrout,
A. C., Peloso, G. M., Do, R., et al. (2014). J. S., Johnson, D. C., Shi, Y., et al. (2016). Physician
Inactivating mutations in NPC1L1 and protection response to implementation of genotype-tailored
from coronary heart disease. The New England antiplatelet therapy. Clinical Pharmacology and
Journal of Medicine, 371(22), 2072–2082. https:// Therapeutics, 100(1), 67–74. https://doi.org/10.1002/
doi.org/10.1056/NEJMoa1405386. cpt.331.
Newton, K. M., Peissig, P. L., Kho, A. N., Bielinski, Phillips, E. J., Sukasem, C., Whirl-Carrillo, M., Müller,
S. J., Berg, R. L., Choudhary, V., et al. (2013). D. J., Dunnenberger, H. M., Chantratita, W., et al.
Validation of electronic medical record-based phe- (2018). Clinical pharmacogenetics implementation
notyping algorithms: Results and lessons learned consortium guideline for HLA genotype and use of
from the eMERGE network. Journal of the carbamazepine and oxcarbazepine: 2017 update.
American Medical Informatics Association, 20(e1), Clinical Pharmacology and Therapeutics, 103(4),
e147–e154. https://doi.org/10.1136/amia- 574–581. https://doi.org/10.1002/cpt.1004.
jnl-2012-000896. Popejoy, A. B., & Fullerton, S. M. (2016). Genomics is
Okada, Y., Wu, D., Trynka, G., Raj, T., Terao, C., failing on diversity. Nature, 538(7624), 161–164.
Ikari, K., et al. (2014). Genetics of rheumatoid https://doi.org/10.1038/538161a.
arthritis contributes to biology and drug discovery. Pulley, J. M., Denny, J. C., Peterson, J. F., Bernard,
Nature, 506(7488), 376–381. https://doi.org/ G. R., Vnencak-Jones, C. L., Ramirez, A. H., et al.
10.1038/nature12873. (2012). Operational implementation of prospective
O’Donnell, P. H., Bush, A., Spitz, J., Danahey, K., genotyping for personalized medicine: The design
Saner, D., Das, S., et al. (2012). The 1200 patients of the Vanderbilt PREDICT project. Clinical
project: Creating a new medical model system for Pharmacology and Therapeutics, 92(1), 87–95.
clinical implementation of pharmacogenomics. https://doi.org/10.1038/clpt.2011.371.
Clinical Pharmacology and Therapeutics, 92(4), 446– Ramirez, A. H., Shi, Y., Schildcrout, J. S., Delaney, J. T.,
449. https://doi.org/10.1038/clpt.2012.117. Xu, H., Oetjens, M. T., et al. (2012). Predicting war-
O’Reilly, R., & Elphick, H. E. (2013). Development, farin dosage in European-Americans and African-
clinical utility, and place of ivacaftor in the treat- Americans using DNA samples linked to an
ment of cystic fibrosis. Drug Design, Development electronic health record. Pharmacogenomics, 13(4),
and Therapy, 7, 929–937. https://doi.org/10.2147/ 407–418. https://doi.org/10.2217/pgs.11.164.
DDDT.S30345. Rasmussen-Torvik, L. J., Stallings, S. C., Gordon, A. S.,
Pathak, J., Kiefer, R. C., Bielinski, S. J., & Chute, C. G. Almoguera, B., Basford, M. A., Bielinski, S. J., et al.
(2012). Applying semantic web technologies for phe- (2014). Design and anticipated outcomes of the
nome-wide scan using an electronic health record eMERGE-PGx project: A multicenter pilot for pre-
linked biobank. Journal of Biomedical Semantics, emptive pharmacogenomics in electronic health
3(1), 10. https://doi.org/10.1186/2041-1480-3-10. record systems. Clinical Pharmacology and
Peissig, P. L., Santos Costa, V., Caldwell, M. D., Therapeutics, 96(4), 482–489. https://doi.
Rottscheit, C., Berg, R. L., Mendonca, E. A., & org/10.1038/clpt.2014.137.
Page, D. (2014). Relational machine learning for Relling, M. V., Schwab, M., Whirl-Carrillo, M., Suarez-
electronic health record-driven phenotyping. Kurtz, G., Pui, C.-H., Stein, C. M., et al. (2019).
Journal of Biomedical Informatics, 52, 260–270. Clinical Pharmacogenetics Implementation
https://doi.org/10.1016/j.jbi.2014.07.007. Consortium (CPIC) guideline for thiopurine dosing
966 J. C. Denny et al.
based on TPMT and NUDT15 genotypes: 2018 Tung, J. Y., Do, C. B., Hinds, D. A., Kiefer, A. K.,
update. Clinical Pharmacology and Therapeutics, Macpherson, J. M., Chowdry, A. B., et al. (2011).
105(5), 1095–1105. https://doi.org/10.1002/cpt.1304. Efficient replication of over 180 genetic associations
Ritchie, M. D., Denny, J. C., Crawford, D. C., Ramirez, with self-reported medical data. PLoS One, 6(8),
A. H., Weiner, J. B., Pulley, J. M., et al. (2010). e23473. https://doi.org/10.1371/journal.
Robust replication of genotype-phenotype associa- pone.0023473.
tions across multiple diseases in an electronic medi- Voight, B. F., Peloso, G. M., Orho-Melander, M.,
cal record. American Journal of Human Genetics, Frikke-Schmidt, R., Barbalic, M., Jensen, M. K.,
86(4), 560–572. https://doi.org/10.1016/j. et al. (2012). Plasma HDL cholesterol and risk of
ajhg.2010.03.003. myocardial infarction: A mendelian randomisation
Robinson, J. R., Wei, W.-Q., Roden, D. M., & Denny, study. The Lancet, 380(9841), 572–580. https://doi.
J. C. (2018). Defining phenotypes from clinical data org/10.1016/S0140-6736(12)60312-2.
to drive genomic research. Annual Review of Wei, W.-Q., & Denny, J. C. (2015). Extracting research-
Biomedical Data Science, 1(1), 69–92. https://doi. quality phenotypes from electronic health records to
org/10.1146/annurev-biodatasci-080917-013335. support precision medicine. Genome Medicine, 7(1),
28 Sabatine, M. S., Giugliano, R. P., Keech, A. C.,
Honarpour, N., Wiviott, S. D., Murphy, S. A., et al.
41. https://doi.org/10.1186/s13073-015-0166-y.
Wei, W.-Q., Teixeira, P. L., Mo, H., Cronin, R. M.,
(2017). Evolocumab and clinical outcomes in Warner, J. L., & Denny, J. C. (2016). Combining bill-
patients with cardiovascular disease. The New ing codes, clinical notes, and medications from elec-
England Journal of Medicine, 376(18), 1713–1722. tronic health records provides superior phenotyping
https://doi.org/10.1056/NEJMoa1615664. performance. Journal of the American Medical
Saria, S., Butte, A., & Sheikh, A. (2018). Better medicine Informatics Association, 23(e1), e20–e27. https://
through machine learning: What’s real, and what’s doi.org/10.1093/jamia/ocv130.
artificial? PLoS Medicine, 15(12), e1002721. https:// Wellcome Trust Case Control Consortium. (2007).
doi.org/10.1371/journal.pmed.1002721. Genome-wide association study of 14,000 cases of
Saunders, C. J., Miller, N. A., Soden, S. E., Dinwiddie, seven common diseases and 3,000 shared controls.
D. L., Noll, A., Alnadi, N. A., et al. (2012). Rapid Nature, 447(7145), 661–678. https://doi.org/10.1038/
whole-genome sequencing for genetic disease diag- nature05911.
nosis in neonatal intensive care units. Science White, K. D., Abe, R., Ardern-Jones, M., Beachkofsky,
Translational Medicine, 4(154), 154ra135. https:// T., Bouchard, C., Carleton, B., et al. (2018). SJS/
doi.org/10.1126/scitranslmed.3004041. TEN 2017: Building multidisciplinary networks to
Schmidt, A. F., Swerdlow, D. I., Holmes, M. V., Patel, drive science and translation. The Journal of Allergy
R. S., Fairhurst-Hunter, Z., Lyall, D. M., et al. and Clinical Immunology. In Practice, 6(1), 38–69.
(2017). PCSK9 genetic variants and risk of type 2 https://doi.org/10.1016/j.jaip.2017.11.023.
diabetes: A mendelian randomisation study. The Wood, A. R., Esko, T., Yang, J., Vedantam, S., Pers,
Lancet. Diabetes & Endocrinology, 5(2), 97–105. T. H., Gustafsson, S., et al. (2014). Defining the role
https://doi.org/10.1016/S2213-8587(16)30396-5. of common variation in the genomic and biological
SCIENCE | deCODE genetics. (n.d.). Retrieved from architecture of adult human height. Nature
https://www.decode.com/research/ Genetics, 46(11), 1173–1186. https://doi.
Scott, S. A., Sangkuhl, K., Stein, C. M., Hulot, J. S., org/10.1038/ng.3097.
Mega, J. L., Roden, D. M., et al. (2013). Clinical Worthey, E. A., Mayer, A. N., Syverson, G. D., Helbling,
Pharmacogenetics Implementation Consortium D., Bonacci, B. B., Decker, B., et al. (2011). Making
guidelines for CYP2C19 genotype and clopidogrel a definitive diagnosis: Successful clinical application
therapy: 2013 update. Clinical Pharmacology and of whole exome sequencing in a child with intracta-
Therapeutics, 94(3), 317–323. https://doi. ble inflammatory bowel disease. Genetics in
org/10.1038/clpt.2013.105. Medicine, 13(3), 255–262. https://doi.org/10.1097/
Splinter, K., Adams, D. R., Bacino, C. A., Bellen, H. J., GIM.0b013e3182088158.
Bernstein, J. A., Cheatle-Jarvela, A. M., et al. Wu, A. H., White, M. J., Oh, S., & Burchard, E. (2015).
(2018). Effect of genetic diagnosis on patients with The Hawaii clopidogrel lawsuit: The possible effect
previously undiagnosed disease. The New England on clinical laboratory testing. Personalized Medicine,
Journal of Medicine, 379(22), 2131–2139. https:// 12(3), 179–181. https://doi.org/10.2217/pme.15.4.
doi.org/10.1056/NEJMoa1714458. Wu, Y., Denny, J. C., Trent Rosenbloom, S., Miller,
Tannock, I. F., & Hickman, J. A. (2016). Limits to per- R. A., Giuse, D. A., Wang, L., et al. (2017). A long
sonalized cancer medicine. The New England Journal journey to short abbreviations: Developing an
of Medicine, 375(13), 1289–1294. https://doi. open- source framework for clinical abbreviation
org/10.1056/NEJMsb1607705. recognition and disambiguation (CARD). Journal
Torkamani, A., Wineinger, N. E., & Topol, E. J. (2018). of the American Medical Informatics Association,
The personal and clinical utility of polygenic risk 24(e1), e79–e86. https://doi.org/10.1093/jamia/
scores. Nature Reviews. Genetics, 19(9), 581–590. ocw109.
https://doi.org/10.1038/s41576-018-0018-x.
967 III
Biomedical
Informatics in the
Years Ahead
Contents
Health Information
Technology Policy
Robert S. Rudin, Paul C. Tang, and David W. Bates
Contents
References – 982
(. Table 29.1). Policies that promoted the use healthcare workflows to make greater use of
of health IT were informed by projections the technology, and developing and spreading
based on models showing large potential sav- best practices.
ings for many forms of health IT. One study
by the RAND Corporation estimated that
EHRs could save more than $81 billion per 29.2.3 sing Health IT to Measure
U
year (Hillestad et al. 2005). Another study Quality of Care
estimated that electronic clinical data
exchange has the potential to save $77.8 bil- All health care stakeholders agree that a
lion per year (Walker et al. 2005). Many of health care system should deliver high quality
these savings were expected to come from care. But how does one measure care quality?
reductions in redundant tests and use of Current methods of quality measurement rely
generic drugs, as well as reductions in adverse largely on administrative claims submitted by
drug events and other errors that EHRs might providers to insurers. These data may be use-
prevent (Bates et al. 1998; Wang et al. 2003). ful for certain quality measurements such as
29 Telehealth and PHRs were also projected to for assessing a primary care physician’s mam-
result in billions of dollars in savings (Kaelber mography screening rates, but they lack
and Pan 2008; Cusack et al. 2008). important clinical details, such as the results
One weakness of these projections is that of laboratory tests. They also do not represent
they relied on expert opinions for some point a comprehensive picture of the care that is
estimates because, other than several studies delivered, assess the appropriateness of most
showing that EHRs reduce costs by reducing medical procedures, or determine if a patient’s
medical errors, few studies have tried to exam- quality of life has improved after treatment.
ine empirically the effect of health IT on costs Also, most patients in the U.S. switch insur-
(Tierney et al. 1987, 1993). Also, some of the ance companies every few years, limiting the
projections have been criticized because they ability of any one insurer to measure quality
estimate potential savings rather than actual improvements over longer periods of time,
measured savings (Congressional Budget which is required to assess accurately the
Office 2008). However, the projections do not treatment of many medical conditions.
include several types of savings that may Increasingly, clinical data available
result from providing better preventive care through EHRs are used for quality measure-
and care coordination, which would reduce ment (Ancker et al. 2015). Clinical data are
the need for patients’ use of high cost proce- much more comprehensive than administra-
dures in hospitals and emergency rooms. They tive claims, and methods for measuring clini-
also do not include potential reductions in cal quality using these data are growing. In
costs that may result from decision aids for the U.S., there is growing policy interest in
patients, which may, for example, reduce the creating such measures as shown in the
number of unnecessary surgeries (O’Connor National Quality Strategy and other reports
et al. 2009). And they do not include other (AHRQ 2017). This approach has been used
innovations such as the impact of small in the United Kingdom (U.K.) where nearly
changes in EHR displays. For example, one 200 quality measures have regularly been
study found that when the fees associated with assessed, with up to 25% of payment for gen-
laboratory tests were shown to clinicians when eral practitioners based on performance on
they ordered the test, rates of test ordering these measures (Roland and Olesen 2016).
decreased by more than 8% (Feldman et al. While initially popular, U.K. physicians have
2013). The actual savings, therefore, may be become increasingly disenchanted with the
much greater than the projections suggest. As administrative requirements of the program.
described above, realizing these savings will There is growing support for developing
likely require more than simply adopting the patient-reported outcome measurements
technology – it will also require redesigning which may be integrated in PHRs, or obtained
Health Information Technology Policy
975 29
through other mechanisms and integrated 29.2.4 Holding Providers
with the patient’s clinical data (Lavallee et al. Accountable for Cost
2016).
and Quality
However, using electronic clinical data to
generate quality measures is also associated
Currently, in the U.S., most care is delivered
with problems. Studies have found that clinical
using a fee-for-service payment system, in
data in EHRs are often incomplete, inaccu-
which providers are paid for every procedure
rate, and may not be comparable across differ-
or patient visit. Under this payment method,
ent EHRs (Chan et al. 2010; Colin et al. 2018).
providers have incentives to provide more care
Existing measures also tend to focus more on
rather than less, which contributes to over-
adherence to care processes rather than
treatment (Lyu et al. 2017). It is therefore not
patient outcomes (Burstin et al. 2016). More
surprising to find that in the U.S., costs are
research is needed to develop and standardize
high and rising, nearly double those of many
meaningful quality measures that would be
other industrial nations, and quality of care is
worth the burden of reporting them.
mixed (Squires 2015). As . Fig. 29.1 shows,
the U.S. spends more money per capita on be held accountable to a substantial degree for
health care than any other country by a wide the care they delivery without a robust health
margin. Yet, many studies suggest that the IT infrastructure.
U.S. is far from the world’s leader in overall
care quality (Squires 2015). A seminal study
by McGlynn et al. in 2003 found that patients 29.2.5 Informatics Research
in the U.S. received recommended care only
about half of the time across a broad array of Although EHRs have become widespread,
quality measures (McGlynn et al. 2003). An many health IT capabilities are still emerging,
updated version in 2016 found that those or standards have not yet been defined. New
results had not changed much (Levine et al. applications will still require additional
2016). research and development. For example, we
Policymakers are trying to replace the are still in the early stages of understanding
fee-for-service payment method with other how to design applications for team care
methods that would hold providers account- (7 Chap. 17), remote patient monitoring
29 able for the care they deliver. These policies (7 Chaps. 20 and 21), online disease manage-
create incentives for healthcare providers to ment (7 Chaps. 11 and 19), clinical decision-
constrain costs and may therefore motivate making (7 Chap. 24), alerts and reminders
greater use of health IT tools to achieve this (7 Chap. 24), public health and disease sur-
goal. In the U.S., one of the proposed mech- veillance (7 Chap. 18), clinical trial recruiting
anisms for accomplishing this is through (7 Chap. 27), and evaluations of the impact
Accountable Care Organizations (ACOs). As of technologies on care and costs (7 Chap. 13).
specified in the Affordable Care Act of 2010,2 One concern is that most provider organiza-
an ACO is a group of providers who are held tions, and increasingly even academic medical
accountable, to some extent, for both the centers, are now using software applications
cost and the quality of a designated group of made by private vendors, and innovating with
patients (Berwick 2011; McClellan et al. them can be more challenging than with
2010). ACOs are still a work in progress, but homegrown products. Private vendors may
early indications suggest that they may not be investing enough resources in research
reduce some costs (McWilliams et al. 2018). to produce transformational innovations
The concept of ACOs depends on having an (Shortliffe 2012). It will be essential to iden-
electronic health information infrastructure tify “sandboxes” in which new and innovative
in place, including widespread use of EHRs, IT approaches can be developed and tested.
because health IT would enable ACOs to More interactions between industry and aca-
improve quality, reduce costs, and measure demia may be a good way to accelerate prog-
their performance. Without prior federal ress (Rudin et al. 2016).
incentives for health IT adoption, these poli- Federal funding plays a major role in sup-
cies to aim to change incentives may not have porting this kind of upstream informatics
been feasible. research to help to incubate these new tech-
Many other countries have experimented nologies but is decreasing in recent years.
with paying providers for quality and out- Because the benefits of such research will
comes, or holding providers responsible for accrue to everyone who uses the health care
costs, although few have done both at the system, the investment of public funds is justi-
same time to a high degree. Health IT systems fied. Few private companies have taken the
are critical for many of these efforts. Few pol- risk of doing this kind of experimental
icymakers or researchers believe providers can research to date in part because many health
IT companies have been relatively small and
were focused on adding the functionalities
2 7 http://www.healthcare.gov/law/index.html that are needed to meet federal certification
(Accessed 12/9/2012). requirements. More recently, some health IT
Health Information Technology Policy
977 29
companies have become larger but they have for Economic and Clinical Health (HITECH)
not sponsored much research. It is too early to Act.3 This legislation authorized $27 billion in
know what impact private companies will stimulus funds to be paid to health care pro-
have on health IT innovation, but historically, viders who demonstrate “meaningful use” of
most of the innovation in health informatics electronic health records as defined by specific
has occurred at universities and other criteria (Blumenthal 2010). Although there is
government- funded research organizations debate as to the extent to which HITECH
affiliated with academic medical centers. accelerated EHR adoption in ambulatory
clinics, EHR adoption increased dramatically
among hospitals and clinics in the U.S, after
29.3 eyond Adoption: Policy
B these incentives were put in place (Mennemeyer
et al. 2016). Today, over 90% of hospitals and
for Optimizing and Innovating clinics have adopted some form of EHR, but
with Health IT there is large variation in adoption of specific
EHR capabilities (HealthIT 2016).
Many governments around the world have Now that EHRs have become widely
previously implemented policies to accelerate adopted, policymakers in many countries are
the adoption of health IT. The U.K. achieved shifting focus toward optimizing the technol-
near universal adoption of EHRs because it ogy and fostering innovation to achieve
devoted substantial resources to the effort greater impact. In the U.S., policy efforts are
early on and has a national health care system now trying to improve interoperability and
which directly manages most of the health health information exchange among provid-
care providers in the country (Cresswell and ers and patients and facilitate innovation by
Sheikh 2009; Ashworth and Millett 2008). making health information accessible to third
Most other industrialized nations had party applications using application program-
achieved high levels of adoption in primary ming interfaces (APIs). U.S. policy has also
care by the early 2000s (Jha et al. 2008). incorporated many health IT efforts into a
Countries that achieved particularly high lev- larger program that affects how Center for
els of adoption in non-hospital settings Medicare and Medicare Services (CMS) pays
include Denmark, the Netherlands, Sweden, health providers for services. This section
Hong Kong, Singapore, Australia, and New describes some of these efforts.
Zealand. Similar to the U.K., these countries
devoted national resources for this effort.
Levels of adoption in hospitals, however,
lagged in many countries. 29.3.1 Health Information
In the U.S., after years of slow adoption of Exchange
health IT relative to other developed coun-
tries, the federal government began to address All countries have challenges sharing clinical
this issue in 2004 by establishing the Office of data among providers (see 7 Chap. 15). For
the National Coordinator for Health IT many years, U.S. policy promoted data
(ONC). This office is located within the exchange through the formation of regional
U.S. Department of Health and Human health information exchanges (HIEs). These
Services and tasked with “promoting develop- organizations provided a variety of services
ment of a nationwide Health IT infrastruc- including aggregating EHR data from local
ture that allows for electronic use and health care providers to create aggregate lon-
exchange of information.” The importance of gitudinal patient records, automating the
this office grew considerably in 2009 when
Congress passed legislation that is considered
3 7 https://www.healthit.gov/topic/laws-regulation-
a major landmark in the history of health IT
and-policy/health-it-legislation (Accessed
policy: the Health Information Technology 10/16/2018).
978 R. S. Rudin et al.
delivery of laboratory results, integrating with Some have proposed a different approach
pharmacies to facilitate e-prescribing, and to data exchange in which patients can aggre-
facilitating public health and quality report- gate and control access to their complete
ing. Although some HIEs are well-established, health records (Szolovits et al. 1994). The his-
the number of these organizations has been tory and details of this model are explained in
declining and many of the remaining ones 7 Chap. 15. There has been an increase in
may not be financially viable (Adler-Milstein interest in this approach recently. However, it
et al. 2016). is too early to tell if it will become widely
Why is it so difficult to establish an HIE? adopted.
Part of the problem is that EHR products did No country to date has completely solved
not always use the same technical data stan- the problem of clinical data exchange. In
dards and are not interoperable. Recently every country that attempts to foster data
developed technical and semantic standards exchange, the hardest issues appear to be
have made considerable progress in making socio-political rather than technical, and there
the standards robust (Health Level Seven is clear agreement that health IT policy is par-
29 International 2019). However, additional cus- ticularly important to address these problems,
tom programming is still required to integrate especially in establishing standards. The U.K.
EHRs with HIEs. HIEs face many other chal- has set up a “spine” which allows summary
lenges including: recruiting providers who are care documents to be widely exchanged
reluctant to share data with competing medi- (Greenhalgh et al. 2010). However, the overall
cal groups, privacy and security concerns, program has encountered major political dif-
legal issues, HIE-related fees, training clini- ficulties, and has been largely dismantled.
cians to use the HIE, and the lack of a busi- Canada has established a program called
ness case (7 Chap. 15). The business case Canada Health Infoway, which has empha-
problem is perhaps the most pressing – for a sized setting up an infrastructure for data
business to thrive, key stakeholders must be exchange (Rozenblum et al. 2011). While that
willing to pay for the product or service. In effort has been somewhat successful, relatively
HIE, the primary financial beneficiaries are little in the way of clinical data is being
employers and insurers, but they have been exchanged to date, in part because the adop-
reluctant to pay for the exchange services tion rate of electronic health records remains
(Walker et al. 2005). While regional HIEs low. In Scandinavia, there has been substan-
have faltered, EHR vendor-based networks tial concern about the privacy aspects of data
have emerged as an alternative, but the extent exchange, especially in Sweden, though data
to which they will succeed in the long term is exchange is taking place in Denmark and its
uncertain. These networks may be limited to use is growing.
one vendor or involve a consortium of ven-
dors. Currently, the most prominent vendor-
based networks are Epic CareEverywhere, 29.3.2 Patient Portals
CareQuality, and the CommonWell Health and Telehealth
Alliance.
Policymakers have recognized that for data Although EHRs have become widely adopted,
exchange to be comprehensive, these networks other forms of health IT are still lagging. To
as well as regional HIEs will need to interact encourage more patient-centric care, many
and share data. To address this concern, there countries are trying to foster the adoption of
are plans to establish a “trusted exchange Patient Portals and Telehealth (see 7 Chaps.
framework” that facilitates this interaction 11 and 20). In the U.S., federal incentives pro-
(HealthIT 2018). Policymakers have also moted patient portals, and adoption rates are
identified information blocking on the part of growing. To promote telehealth, policymakers
vendors and providers as a concern and plan are exploring the possibility of reimbursing
to issue regulations to prevent it. for telehealth care, which would probably
Health Information Technology Policy
979 29
improve adoption of this technology consid- ing complex systems, including poor usability,
erably (Mehrotra et al. 2016). Even though inadequate testing and quality assurance,
many states have passed “parity laws” that software flaws, poor implementation deci-
require commercial insurers to reimburse for sions, inattention to workflow design, or inad-
telehealth visits, most healthcare encounters equate training. Policymakers have funded
are still in-person. development of frameworks and guidelines to
help implement and use health IT in a way
that addresses safety concerns (Sittig and
29.3.3 Application Programming Singh 2012).
Interfaces
To accelerate innovation, policymakers in the
U.S. have begun to promote Application 29.4.1 hould Health IT
S
Programming Interfaces (APIs) for EHR Be Regulated as Medical
data. APIs are software mechanisms that Devices?
allow different applications to connect to one
another and share information. All modern One policy option for reducing the likelihood
software utilizes APIs for purposes ranging of health IT-related medical errors is to create
from communication with a computer’s oper- regulations that require health IT products to
ating system to querying a website for the lat- adhere to strict principles of safe design and
est news stories. In healthcare, one use of be tested and certified (see also 7 Chap. 12)
APIs is to allow patients to more easily down- (Shuren et al. 2018). This is how many medi-
load their latest medical data into an applica- cal devices are regulated by the U.S. Food and
tion of their choosing, such as an application Drug Administration.4 While this approach
on their smartphone that helps them organize may ensure some degree of patient safety, the
and understand their health data. Another regulatory burden will increase the price of
use of APIs is to allow providers to install health IT systems, raise barriers of entry for
third party applications for use within their new companies, and could stifle innovation.
EHRs. If patients and providers can pick and Also, even with regulations, health IT prod-
choose applications, a new market of innova- ucts might still have safety issues because soft-
tive applications may arise to take advantage ware products can be used in many ways,
of these data. Standardization of APIs across unlike other medical devices that have more
EHRs is critical because otherwise applica- limited utility.
tion developers will be required to spend effort There is an active debate about the appro-
customizing their product to integrate with priate types of regulation for medical apps for
every EHR vendor. In the U.S., new policies use by patients. Currently estimates have
will require EHR vendors to support APIs as found more than 150,000 health apps avail-
a condition for certification and for receiving able for download, but analysts have found
certain payments (Leventhal 2018). that few have demonstrated clinical utility
(Singh et al. 2016). The FDA does not regu-
late most apps but has recently begun a pilot
“precertification” program for digital health
29.4 olicies to Ensure Safety
P which will provide information about ven-
of Health IT dors’ software quality control processes but
does not involve evaluations of outcomes
As adoption of health IT accelerates and new (Bates et al. 2018; Lee and Kesselheim 2018).
innovations are developed, it is important to This is controversial, and some feel it does not
be vigilant about, and to reduce, the risk of go far enough (Bates et al. 2018).
unintended harmful side effects related to
health IT use. Harm could arise from deficien-
cies in many areas when designing and deploy- 4 7 http://www.fda.gov/ (Accessed 12/10/12).
980 R. S. Rudin et al.
Harmonizing State Privacy Law Collaborative. (2009). States. JAMA, 313(14), 1471–1473. https://doi.
Harmonizing state privacy law. Washington, DC: org/10.1001/jama.2015.2252.
Office of the National Coordinator of Health Infor- Lyu, H., Xu, T., Brotman, D., Mayer-Blackwell, B., Coo-
mation Technology. per, M., Daniel, M., et al. (2017). Overtreatment in
Haug, C. J. (2018). Turning the tables – the new Euro- the United States. PLoS One, 12(9), e0181970.
pean general data protection regulation. New Eng- https://doi.org/10.1371/journal.pone.0181970.
land Journal of Medicine, 379(3), 207–209. https:// Malani, P. (2018). Logging in: Using patient portals to
doi.org/10.1056/NEJMp1806637. access health information. Retrieved from https://
Health Level Seven International. (2019). Retrieved deepblue.lib.umich.edu/handle/2027.42/145683.
from https://www.hl7.org/. McClellan, M., McKethan, A. N., Lewis, J. L., Roski, J.,
HealthIT. (2016). Office-based physician electronic & Fisher, E. S. (2010). A national strategy to put
health record adoption. Retrieved from https:// accountable care into practice. Health Affairs (Mill-
dashboard.healthit.gov/quickstats/pages/physician- wood), 29(5), 982–990.
ehr-adoption-trends.php. McGlynn, E. A., Asch, S. M., Adams, J., Keesey, J.,
HealthIT. (2018). Draft trusted exchange network. Hicks, J., DeCristofaro, A., & Kerr, E. A. (2003).
Retrieved from https://www.healthit.gov/sites/ The quality of health care delivered to adults in the
default/files/draft-trusted-exchange-framework.pdf. United States. The New England Journal of Medi-
Hillestad, R., Bigelow, J., Bower, A., Girosi, F., Meili, cine, 348, 2635–2645.
29 R., Scoville, R., & Taylor, R. (2005). Can electronic
medical record systems transform health care?
McWilliams, J. M., Hatfield, L. A., Landon, B. E.,
Hamed, P., & Chernew, M. E. (2018). Medicare
Potential health benefits, savings, and costs. Health spending after 3 years of the Medicare Shared Sav-
Affairs (Millwood), 24, 1103–1117. ings Program. New England Journal of Medicine,
Jha, A. K., Doolan, D., Grandt, D., Scott, T., & Bates, 379(12), 1139–1149. https://doi.org/10.1056/
D. W. (2008). The use of health information tech- NEJMsa1803388.
nology in seven nations. International Journal of Mehrotra, A., Jena, A. B., Busch, A. B., Souza, J.,
Medical Informatics, 77(12), 848–854. Uscher-Pines, L., & Landon, B. E. (2016). Utiliza-
Jones, S. S., Heaton, P. S., Rudin, R. S., & Schneider, tion of telemedicine among rural medicare benefi-
E. C. (2012). Unraveling the IT productivity para- ciaries. JAMA, 315(18), 2015–2016. https://doi.
dox – lessons for health care. The New England Jour- org/10.1001/jama.2016.2186.
nal of Medicine, 366(24), 2243–2245. Menachemi, N., Rahurkar, S., Harle, C. A., & Vest, J. R.
Jones, S. S., Rudin, R. S., Perry, T., & Shekelle, P. G. (2018). The benefits of health information exchange:
(2014). Health information technology: An updated An updated systematic review. Journal of the Ameri-
systematic review with a focus on meaningful use. can Medical Informatics Association, 25(9), 1259–
Annals of Internal Medicine, 160(1), 48–54. https:// 1265. https://doi.org/10.1093/jamia/ocy035.
doi.org/10.7326/M13-1531. Mennemeyer, S. T., Menachemi, N., Rahurkar, S., &
Kaelber, D., & Pan, E. C. (2008). The value of personal Ford, E. W. (2016). Impact of the HITECH Act on
health record (PHR) systems. AMIA Annual Sym- physicians’ adoption of electronic health records.
posium Proceedings, 2008, 343–347. Journal of the American Medical Informatics Asso-
Lavallee, D. C., Chenok, K. E., Love, R. M., Petersen, ciation, 23(2), 375–379. https://doi.org/10.1093/
C., Holve, E., Segal, C. D., & Franklin, P. D. (2016). jamia/ocv103.
Incorporating patient-reported outcomes into Milani, R. V., Lavie, C. J., Bober, R. M., Milani, A. R.,
health care to engage patients and enhance care. & Ventura, H. O. (2017). Improving hypertension
Health Affairs (Millwood), 35(4), 575–582. https:// control and patient engagement using digital tools.
doi.org/10.1377/hlthaff.2015.1362. The American Journal of Medicine, 130(1), 14–20.
Lee, T. T., & Kesselheim, A. S. (2018). U.S. Food and https://doi.org/10.1016/j.amjmed.2016.07.029.
Drug Administration precertification pilot program Miller, A. R., & Tucker, C. (2009). Privacy protection
for digital health software: Weighing the benefits and technology diffusion: The case of electronic
and risks. Annals of Internal Medicine, 168(10), 730– medical records. Management Science, 55, 1077–
732. https://doi.org/10.7326/m17-2715. 1093.
Leventhal, R. (2018). BREAKING: CMS finalizes “pro- O’Connor, A. M., Bennett, C. L., Stacey, D., Barry, M.,
moting interoperability” rule. Retrieved from https:// Col, N. F., Eden, K. B., et al. (2009). Decision aids
www.h ealthcare-informatics.c om/article/value- for people facing health treatment or screening deci-
based-care/breaking-cms-finalizes-promoting- sions. Cochrane Database of Systematic Reviews,
interoperability-rule. (3), CD001431.
Levine, D. M., Linder, J. A., & Landon, B. E. (2016). Pew. (2018). Enhanced patient matching is critical to
The quality of outpatient care delivered to adults in achieving full promise of digital health records.
the United States, 2002 to 2013. JAMA Internal Retrieved from https://www.pewtrusts.org/-/media/
Medicine, 176(12), 1778–1790. https://doi. assets/2018/09/healthit_enhancedpatientmatching_
org/10.1001/jamainternmed.2016.6217. report_final.pdf.
Liu, V., Musen, M. A., & Chou, T. (2015). Data breaches Pham, H. H., Schrag, D., O’Malley, A. S., Wu, B., &
of protected health information in the United Bach, P. B. (2007). Care patterns in medicare and
Health Information Technology Policy
985 29
their implications for pay for performance. The Smith, P. C., Araya-Guerra, R., Bublitz, C., Parnes, B.,
New England Journal of Medicine, 356(11), 1130– Dickinson, L. M., Vorst, R. V., et al. (2005). Missing
1139. clinical information during primary care visits.
Roland, M., & Olesen, F. (2016). Can pay for perfor- JAMA, 293(5), 565–571.
mance improve the quality of primary care? BMJ, Squires, D. (2015). U.S. health care from a global per-
354, i4058. https://doi.org/10.1136/bmj.i4058. spective: Spending, use of services, prices, and health
Rozenblum, R., Jang, Y., Zimlichman, E., Salzberg, C., in 13 countries. Retrieved from https://www.
Tamblyn, M., Buckeridge, D., et al. (2011). A quali- c o m m o n we a l t h f u n d .o rg / p u bl i c at i o n s / i s s u e -
tative study of Canada’s experience with the imple- briefs/2015/oct/us-health-care-global-perspective.
mentation of electronic health information Szolovits, P., Doyle, J., Long, W. J., Kohane, I., & Pauker,
technology. CMAJ: Canadian Medical Association S. G. (1994). Guardian angel: Patient-centered health
Journal, 183(5), E281–E288. information systems. Retrieved from https://
Rudin, R. S., Salzberg, C. A., Szolovits, P., Volk, L. A., smarthealthit.org/1994/10/guardian-angel-patient-
Simon, S. R., & Bates, D. W. (2011). Care transitions centered-health-information-systems/.
as opportunities for clinicians to use data exchange Tang, P. C., Fafchamps, D., & Shortliffe, E. H. (1994).
services: How often do they occur? Journal of the Traditional medical records as a source of clinical
American Medical Informatics Association: JAMIA, data in the outpatient setting. Proceedings of the
18(6), 853–858. Annual Symposium Computer Applications in
Rudin, R. S., Bates, D. W., & MacRae, C. (2016). Accel- Medical Care, 575–579.
erating innovation in Health IT. New England Jour- Tang, P. C., Ash, J. S., Bates, D. W., Overhage, J. M., &
nal of Medicine, 375(9), 815–817. https://doi. Sands, D. Z. (2006). Personal health records: Defi-
org/10.1056/NEJMp1606884. nitions, benefits, and strategies for overcoming
Rudin, R. S., Hillestad, R., Ridgely, M. S., Qureshi, barriers to adoption. Journal of the American
N. S., II, Davis, J. S., & Fischer, S. H. (2018). Defin- Medical Informatics Association: JAMIA, 13(2),
ing and evaluating patient-empowered approaches to 121–126.
improving record matching. Santa Monica: The Tierney, W. M., McDonald, C. J., Martin, D. K., & Rog-
RAND Corporation. ers, M. P. (1987). Computerized display of past test
Shekelle, P. G., Pronovost, P. J., Wachter, R. S., Taylor, results. Effect on outpatient testing. Annals of Inter-
S. L., Dy, S. M., Foy, R., et al. (2011). Advancing the nal Medicine, 107(4), 569–574.
science of patient safety. Annals of Internal Medi- Tierney, W. M., Miller, M. E., Overhage, J. M., &
cine, 154(10), 693–696. McDonald, C. J. (1993). Physician inpatient order
Shortliffe, T. (2012). The future of biomedical informat- writing on microcomputer workstations. Effects on
ics: A perspective from academia. Keynote Presen- resource utilization. JAMA, 269(3), 379–383.
tation, Medical Informatics Europe 2012, Pisa, van Walraven, C., Taljaard, M., Bell, C. M., Etchells, E.,
Italy. Zarnke, K. B., Stiell, I. G., & Forster, A. J. (2008).
Shuren, J., Patel, B., & Gottlieb, S. (2018). FDA regula- Information exchange among physicians caring for
tion of mobile medical apps. JAMA, 320(4), 337– the same patient in the community. Canadian Medi-
338. https://doi.org/10.1001/jama.2018.8832. cal Association Journal, 179(10), 1013–1018.
Singh, K., Drouin, K., Newmark, L. P., Lee, J., Fax- Walker, J., Pan, E., Johnston, D., Adler-Milstein, J.,
vaag, A., Rozenblum, R., et al. (2016). Many mobile Bates, D. W., & Middleton, B. (2005). The value of
health apps target high-need, high-cost popula- health care information exchange and interoperabil-
tions, but gaps remain. Health Affairs (Millwood), ity. Health Affairs (Millwood), (Suppl Web Exclu-
35(12), 2310–2318. https://doi.org/10.1377/ sives), W5–10–W5–18.
hlthaff.2016.0578. Wang, S. J., Middleton, B., Prosser, L. A., Bardon,
Sinsky, C. A., Dyrbye, L. N., West, C. P., Satele, D., C. G., Spurr, C. D., Carchidi, P. J., et al. (2003). A
Tutty, M., & Shanafelt, T. D. (2017). Professional cost-benefit analysis of electronic medical records in
satisfaction and the career plans of US physicians. primary care. American Journal of Medicine, 114(5),
Mayo Clinic Proceedings, 92(11), 1625–1635. https:// 397–403.
doi.org/10.1016/j.mayocp.2017.08.017. Wells, S., Rozenblum, R., Park, A., Dunn, M., & Bates,
Sittig, D. F., & Singh, H. (2012). Electronic health D. W. (2014). Personal health records for patients
records and national patient-safety goals. New Eng- with chronic disease: A major opportunity. Applied
land Journal of Medicine, 367(19), 1854–1860. Clinical Informatics, 5(2), 416–429.
https://doi.org/10.1056/NEJMsb1205420. Yamin, C. K., Emani, S., Williams, D. H., Lipsitz, S. R.,
Sittig, D. F., Ash, J. S., & Singh, H. (2014). The SAFER Karson, A. S., Wald, J. S., & Bates, D. W. (2011).
guides: Empowering organizations to improve the The digital divide in adoption and use of a personal
safety and effectiveness of electronic health records. health record. Archives of Internal Medicine, 171(6),
The American Journal of Managed Care, 20(5), 418– 568–574.
423.
987 30
Contents
30.1 The Present and Its Evolution from the Past – 988
References – 1016
.. Table 30.1 Table of contents sections and chapters from all five editions of this book, aligned by subject
matter
Recurrent themes Recurrent themes Recurrent themes Recurrent themes Recurrent themes in
in medical in medical in biomedical in biomedical biomedical
informatics informatics informatics informatics informatics
1. The computer 1. The computer 1. The computer 1. Biomedical 1. Biomedical
meets medicine: meets medicine meets medicine and informatics: The informatics: The
Emergence of a and biology: biology: Emergence science and the science and the
discipline Emergence of a of a discipline pragmatics pragmatics
discipline
2. Medical data: 2. Medical data: 2. Biomedical data: 2. Biomedical data: 2. Biomedical data:
Their acquisition, Their acquisition, Their acquisition, Their acquisition, Their acquisition,
storage, and use storage, and use storage, and use storage, and use storage, and use
3. Medical 3. Medical 3. Biomedical 3. Biomedical 3. Biomedical
decision making: decision-making: decision making: decision making: decision making:
Probabilistic Probabilistic Probabilistic Probabilistic Probabilistic
medical medical reasoning clinical reasoning clinical reasoning clinical reasoning
reasoning
4. Essential 4. Essential 5. Essential
concepts for concepts for concepts for
medical medical computing biomedical
computing computing
The Future of Informatics in Biomedicine
989 30
(continued)
990 J. J. Cimino et al.
The next decade was revolutionary, how- facing systems. In addition, chapter author-
ever, and it had a profound effect on informat- ship evolved substantially as new topics were
ics. During the 1990s, the Human Genome introduced and authors from earlier editions
Project made it clear that much of what needed brought on coauthors whose expertise comple-
to be accomplished in human biology and genet- mented their own.
ics could not be achieved with the use of the The book title remained unchanged in the
computational methods available or introduced fourth edition in 2014 (and in this edition),
at that time. Many of the informatics techniques but changes to the chapter titles provide more
that had been developed in the clinical world detail to what was evolving (. Table 30.1). The
became relevant to genomics research, where fourth edition introduced several new topics,
investigators coined the term bioinformatics for including telehealth, translational bioinformat-
their computational explorations. Thus, the field ics, and clinical research informatics. And the
of informatics began to broaden to span both current edition has added new chapters in the
basic and applied clinical sciences. In an effort to areas of human-computer interaction,
acknowledge this evolution, the second edition mHealth, and precision medicine.
of this book was renamed, with “medical care” Thus, a review of the titles and tables of
giving way to “health care” (to acknowledge the contents of the five editions of this book, span-
field’s growing role in prevention and public ning 30 years with 20 chapters at the outset
30 health) and the addition of “biomedicine” (to evolving to 30 chapters now, provides a thumb-
embrace the role of informatics in human biol- nail view of the evolution of the field as a
ogy research) (. Table 30.1). In addition, a new whole. So, what evolution can we observe and
chapter on bioinformatics was added to the edi- what does it tell us about where we are headed?
tion when it appeared in 2000. Similarly, the sec- We see a field that started with a strong focus
ond edition took a broader view to consider on computer programming for clinical medi-
topics such as standards, ethics, integrated deliv- cine and ancillary services. The field grew to
ery networks, and public health. Bibliographic embrace research areas related to medicine and
retrieval expanded to be information retrieval, health, ranging from molecular to biologic sys-
and there were changes in emphasis in several tems to organisms, and beyond to populations.
other chapters as well. And, as with any emerging discipline, biomedi-
In an attempt to acknowledge and empha- cal informatics began to differentiate its activ-
size the shared methods that applied in both ities into practice (the lion’s share), research
the human life sciences and in clinical medicine (not just biomedical research informatics, but
and health, the academic discipline began to research on informatics in its own right), and
change its name from “medical informatics” to education. Connections among these three
“biomedical informatics”. Several departments types of activities and overlap across domains
were renamed or created with this new name were often scant, as shown in . Fig. 30.1. Four
for the field. Hence, when the third edition of trends, described throughout this book, have
this book appeared in 2006, it adopted the title blurred many of the distinctions (. Fig. 30.2).
Biomedical Informatics, discarding the more First, the broader field of biomedicine itself
limited “medical informatics” focus. Although has begun to blur the distinctions among its
several chapters were simply updated and some traditional areas of research domains. This is
were deleted, others were divided into two evident in the emergence of the “translational
components (e.g., the Imaging Systems chapter science” philosophy, with clear recognition that
from the second edition was divided into a each scientific endeavor builds on the discover-
methodologic chapter on imaging/structural ies made at some other, usually smaller scale.
informatics plus an application chapter on One culmination of this trend is precision medi-
Imaging Systems in Radiology) (. Table 30.1). cine (see 7 Chap. 28), in which discoveries at
Furthermore, totally new chapters were drawn the genomic level are translated into knowl-
from other fields, including cognitive science, edge that supports decisions for patients and
natural language processing, and consumer- populations.
The Future of Informatics in Biomedicine
993 30
Translational Clinical Clinical Public Health
Bioinformatics
Research Research Informatics Informatics
Informatics Informatics Informatics
Practice
Informatics
Research
Informatics
Education
Basic Science Translational Science Clinical Research Patient Care Public Health
.. Fig. 30.1 Prior state of biomedical informatics domains followed practice, depicted here as more or less semi-trans-
and activities. Clinical informatics was the earliest and con- parent connections. Education in informatics, both for
tinues to be the largest domain, as reflected in this book, research and practice, began in the clinical domain, espe-
with other domains following in time. The advent of the cially with nursing informatics, to be followed by nascent
Human Genome Project led to rapid expansion of the bioinformatics training programs. Connections between
application of bioinformatics. Research in each domain the education and research varied
Informatics
Practice
Informatics
Research
Informatics
Education
Basic Science Translational Science Clinical Research Patient Care Public Health
.. Fig. 30.2 Today, biomedical informatics is becoming a informatics applications in research and practice settings
continuum, with fewer distinctions among domains and are increasingly seen as research subjects in “living labo-
activities. This mirrors the continuum of biomedicine, with ratories” (3) for guiding the improvement of the tools they
its recognition of the translational nature ranging from basic use and learning new ways to apply informatics methods.
science public health (1), exemplified by precision medicine Finally, education and training in informatics is increas-
which draws on genomic knowledge to directly improve care ingly leaving the classroom and moving to practice sites
of the individual patient. Increased clinical informatics for observation and learning and as settings for experi-
activity has resulted in increased availability of clinical data, menting with new solutions (4). (7 https://systems.jhu.edu/
which informs research to produce better evidence to guide research/public-health/2019-ncov-map-faqs/. Copyright
practice, resulting in a “learning health system” (2). Users of 2020, Johns Hopkins University. All rights reserved)
Second, the learning health system (7 Chaps. oratory, the clinic, and the hospital are becom-
1 and 17) is making use of lare-scale data col- ing living informatics laboratories for testing
lected from patients and populations to frame informatics ideas and observing their impact.
research questions that are answered in the labo- Fourth, the connections between informat-
ratory. Knowledge discovered there is then ics education and informatics practice are being
returned to the point of care to support evi- strengthened. As electronic health records have
dence-based diagnostic, preventive and thera- become ubiquitous in clinical practice, comput-
peutic decisions. ing devices and information technologies have
Third, informatics research is moving from become virtually the only tools used by every
the computer lab out to where potential users health care provider. Therefore, the importance
of informatics tools are actually working. of rigorous training in the use of these tools
Harnessing tools from cognitive science and has increased. Informatics training programs
mixed methods evaluation, the biological lab- are now able to train their students with the
994 J. J. Cimino et al.
living laboratories by studying how research medical informatics is likely to influence the
and patient care systems are being used and can twenty-first century. As it happens, shortly after
be improved. The long tradition of formal these pieces were written, events have unfolded
informatics training in nursing programs is now to put these predictions to the test.
being adopted in clinical medicine, which has Writing in early 2020, we are referring of
recently added clinical informatics as a board- course of the COVID-19 pandemic. The chap-
certified subspecialty through the American ters in this book were largely written in the
Board of Medical Specialties (ABMS). preceding year or two. Some have been updat-
ed to discuss the current situation (e.g., see
7 Chap. 18), but no textbook can keep current
30.2 Looking to the Future with the rapidly unfolding events of this cur-
rent natural disaster. The level of public inter-
Given the general trends we have outlined, what est in biomedical information, ranging from
can we expect in terms of specific advances for virology and immunology, to pharmacology
the field? For that, we invited seven visionaries and epidemiology, has risen to unprecedented
to share their predictions for the directions we levels. Up-to-the-minute data are being pro-
will, or least should, be moving. We chose inno- vided with great volume, variety and velocity
vative thinkers who could provide insights on (the hallmarks of “big data – see 7 Chap. 13)
the future of biomedical informatics from a through government, academic and news
variety of perspectives: bioinformatics (Tarczy- media sources for popular consumption
Hornoch), industry (Horvitz), nursing (. Fig. 30.3).1 All of this requires rapid devel-
(Murphy), health policy (Blumenthal), aca- opment and delivery of informatics solutions
demic informatics (Frisse), clinical medicine on an unprecedented scale, along with
(Wachter), and federal government (Brennan). approaches for confirming the veracity of data.
Individually, they provide perspectives on gov-
ernment efforts, policy changes, research
advances, and clinical practice. Together, they 1 7 https://www.arcgis.com/apps/opsdashboard/
index.html#/bda7594740fd40299423467b48e9ecf6
weave a rich tapestry that presages how bio-
(accessed 2/13/2021).
The Future of Informatics in Biomedicine
995 30
The lessons and predictions discussed by extend beyond the protection of data privacy.
our guest visionaries can be applied directly to Other impacts on society may be both deep
the care of patients with suspected or con- and unanticipated, with the possibility that
firmed COVID-19. For example, Tarczy- unintended consequences may exacerbate dif-
Hornoch (7 Box 30.1) describes correlation ferences among how different people, with dif-
of genomic and functional data with clinical ference education, cultures, and financial
outcomes data. Thanks to adoption and means, may experience health care and manage
interoperability of electronic health records their own health. Unintended consequences of
(7 Chaps. 14 and 15), sufficient data are technology have been rampant in many fields
becoming available to provide an understand- (did anyone anticipate that television would
ing of risk factors for disease severity as well engender generations of “couch potatoes”?).
as benefits risks of putative treatments much Wachter, who is well known for his provoca-
more rapidly than could be achieved with for- tive book characterizing the “digital doctor”
mal human subject studies (Xu et al. 2020). who is already somewhat upon us (Wachter
Horvitz (7 Box 30.2) provides an inventory of 2017), focuses on the informatician (or infor-
the methods drawn from the field of artificial maticist) of the future and the impact that
intelligence (see 7 Chaps. 1 and 24) that stand such individuals will have in clinical settings
ready to use these data to infer answers to such (7 Box 30.6). He acknowledges the problems
pressing questions through machine learning. that are highlighted in his popular book, but
He also describes supplemental methods for envisions an ultimately positive future in
“machine teaching” that can be brought to which “the experience of being both patient
bear when some data remain sparse (Feijóo and healthcare professional will be far more
et al. 2020). satisfying”, due in part to the role that infor-
Murphy (7 Box 30.3) describes the advan- matics, and those who practice this new spe-
tages of tele-visits for improving access to cialty, will play in the clinical environment.
care; the social distancing required for helping Tarczy-Hornoch, Horvitz, Murphy, Blumen-
to control the pandemic provides additional thal, Frisse, and Wachter all describe ways in
incentive for care at a distance, even for those which clinicians’ workflow can be influenced
who otherwise have sufficient access to in- for the better through informatics, with par-
person care. Fortunately, the technology of ticular attention to their quality of life, which
tele-visits (7 Chap. 20) has progressed to the the pandemic has demonstrated can require as
point where healthcare institutions have been much attention, for some individuals, as do
able to make the necessary transition with an the lives of the patients they serve (Dewey
ease that would not have been possible 10 et al. 2020). And of course, all of these activi-
years earlier (Hong et al. 2020). ties are supported through access to data and
Blumenthal (7 Box 30.4) enumerates issues literature (including the works cited here),
related to safety of information systems and the made possible by the National Institutes of
government’s role in developing policies that Health, with the National Library of Medi-
address privacy concerns (7 Chaps. 12 and 29). cine at the fore (Zayas-Cabán et al. 2020), as
The immediate need for such policies relates to well as other governmental and nongovern-
the balance between individual rights and the mental organizations. As Brennan notes in
protection of the public, as software developers her perspective (7 Box 30.7), the federal gov-
race to create patient contact tracing applica- ernment must provide the resources for the
tions (Abeler et al. 2020) and patients begin to things that only it can do (such as gathering
collect their own intimate, detailed data through and consolidating epidemiologic data) and
the wearables mentioned by Frisse (see also collaborate with leaders in the private sector
7 Chap. 19 and Ding et al. 2020). that can provide additional breadth and depth
Frisse (7 Box 30.5) recognizes that with the of expertise. The COVID monitoring dash-
success of informatics, and its growing impact board shown in . Fig. 30.3 is just the tip of
on both science and health, the challenges and an iceberg of such cooperation (in this case
complexities involved with “doing it right” between the Centers for Disease Control &
996 J. J. Cimino et al.
Prevention and Johns Hopkins University), after the current challenges are overcome.
when one considers all the work that went Public recognition of the importance of infor-
into obtaining the underlying data. matics will lead to increased resources for
Today, biomedical informatics is moving out research, increased interest in education and
of the shadows. Instead of hearing “biomedi- training, and new opportunities for applica-
cal informatics? What’s that?”, we hear “How tions in biomedicine, in preparation for the
is biomedical informatics helping to solve this inevitable challenges we know to anticipate.
problem?” There are many answers and The intent of this book is to prepare those
although they may seem specific to the current who wish to understand, support and lead
pandemic, they will remain applicable long these changes.
Box 30.1 A Perspective on the Future practice and for the development of evidence-
of Translational Bioinformatics and Pre- based guidelines. The validation of genomic dis-
cision Medicine covery and demonstration of its suitability for
Peter Tarczy-Hornoch widespread adoption (T2/T3) is just beginning.
This evolution can be illustrated on the clinical
30 The first part of the twenty-first century saw the T2 front by the work of the American College of
establishment of the fields of translational bio- Medical Genetics, which monitors new genomic
informatics (TBI) and precision medicine (PM), discoveries to identify what secondary findings
accompanied by the movement of this research in genome and exome sequencing meet criteria
into the early T-phases [1] of translational for reporting [2]. Thus far only selected muta-
research (e.g. T0–T2 research focused on discov- tions of around 60 genes (out of over 20,000 in
ery and early application). The next decade will the genome) meet the ACMG’s rigorous criteria
see the fields move into the later T-phases of for clinical reporting. The clinical validation of
broader adoption and diffusion (T3) and into new discoveries facilitated by TBI is a key step in
evaluating the population impact in terms of the development of informatics tools that apply
health outcomes (T4). Due to shared core meth- this knowledge to practice (e.g. decision-support
odologies plus pressures on the health system, tools). As an example of informatics T2 work,
T3/T4 research will demonstrate a convergence researchers have begun to assess the cost/benefit
between TBI/PM (predicting individual out- of genomic decision-support tools in the elec-
comes) and integration with more population- tronic health record [3]. The research and appli-
based approaches such as comparative cation of PM informatics approaches for T1
effectiveness research and, more broadly, the discovery and T2 application parallel those of
concept of the learning healthcare system. TBI (oftentimes incorporating genomic ele-
The earliest work in TBI and PM focused on ments as part of the input data for the develop-
the identification of opportunities and new ment of predictive models).
approaches (T0), discovery to early health appli- In the coming decade the types and volume
cations (T1), and assessment of value (T2). In of data used for TBI and PM discovery and
the TBI area, T0 work focused on studies pilot- application will continue to expand and the dis-
ing the combination of both genomic data and tinctions between TBI and PM will blur even
electronic health record data for discovery (e.g. further. As the cost of genome sequencing con-
the early phase of the eMERGE project) and tinues to drop, increasing numbers of patients
proof-of-concept T1 translational applications will have genotypic information available to cor-
of genomic discoveries to clinical care (e.g. tar- relate with clinical and other information, which
geted pharmacogenomic decision-support sys- will enable both larger scale discovery and appli-
tems). We also see an emerging body of T2 cation. In the cancer domain, for example, new
translational research that is beginning to assess single-cell sequencing approaches will provide
the value of these new discoveries for health additional granular data on a specific patient’s
The Future of Informatics in Biomedicine
997 30
clonal mutational profiles. In the metabolomics file of the individual patient. The core methods
and proteomics areas, the cost of gathering these and approaches used for analysis and discovery,
data is dropping, both at the patient and more and the ones used for decision support, will be
targeted (e.g. organ) levels. The ability to begin fundamentally similar, whatever mix of input
to correlate these functional data with clinical variables is used across the spectrum of genetic,
outcome data and with response-to-therapy biologic, clinical, patient provided, or environ-
data will provide powerful new biological and mental data. In light of this, the distinction
clinical insights. These new sources of biological between TBI and PM will likely vanish.
process data will complement new sources of There are number of promising data ana-
phenotypic and environmental data. Text min- lytics methods currently under development
ing will enable free-text notes describing pheno- that are likely to be useful in the TBI/PM infor-
type and environment (e.g. social determinants matics area. One category is the creation of
of health) to be transformed into more discrete more automated model- selection and tuning
data suitable for machine learning. Increasing methods. Without these it will be difficult to
availability of geocoded environmental data (e.g. scale a number of the approaches currently
climate data, pollution data, air quality data, being used since they are dependent on the
pollen counts, etc.) will enable cross-links to involvement of human data scientists. Similarly,
patient data. With patient engagement and per- there is foundational work being done in aca-
mission (and substantial work on standards and demia and industry that is seeking better unsu-
security), specific environmental data from the pervised learning approaches. These are needed
Internet of Things (e.g. lighting and temperature because the ability to develop gold-standard
data in a home) may also be linked with genom- training sets is now often constrained by the
ic, biological and clinical data for a patient. amount of human effort required.
Similarly other patient data can be integrated, Another broad category is methods that pro-
such as questionnaire and survey data (patient vide some explanatory power related to predic-
reported outcomes and mMeasures) and data tions. As one example, current machine learning
from consumer wearables (counts of steps, heart approaches identify correlation but generally
rate monitoring, sleep monitoring) as well as cannot provide insight into causation. New
consumer medical devices (home glucose and approaches show promise when they leverage
blood pressure monitoring, and, recently, more large enough data sets to begin to infer causa-
experimental transdermal monitoring of meta- tion. Another example is using automated tools
bolic processes). This increase in data about both to develop predictive models and then to
individual patients, as well as the number of use artificial intelligence techniques to develop
patients for which these rich data are available, an explanatory model. Both these examples
will greatly accelerate the T0–T1 discovery and illustrate ways in which new methods may begin
initial clinical application phases of TBI/ to address the concern raised by some overly
PM. The volume of data and potential out- opaque “black box” predictive models. A final
comes are such that the informatics tools will broad category is methods that begin to leverage
become ever more important for discovery. available data more effectively, including new
Already health care providers struggle with AI-based image-analysis approaches, next-gen-
information overload and with the need to be eration hybrid statistical and rules-based text-
current on new medical discoveries. The antici- mining approaches, and new approaches to
pated complexity and volume of new findings improve the use of temporal information in pre-
and correlations will be such that computer- diction algorithms (e.g. the slope and tempo of
based decision-support tools will be obligatory visits and laboratory values).
for application of these new findings. All of The rapidly rising costs of health care in the
these approaches will fit into the paradigm of United States, without a corresponding improve-
using predictions to provide early/preventive ment in quality, will influence the development
interventions that are tailored to the unique pro- of informatics tools for precision medicine.
998 J. J. Cimino et al.
Broadly this will mean that work in the TBI/PM the care-delivery process (e.g. the electronic
informatics space will need to factor in the per- health record and related data).
spectives of the Quadruple Aim: (1) enhancing Finally, in order to address the fourth ele-
patient experience, (2) improving population ment in the Quadruple Aim, we will need to
health, (3) reducing costs, and (4) improving the determine how best to deploy predictive ana-
work life of healthcare providers. Regarding the lytics tools. It will be important to preserve
first of these, it will be important to ensure that provider decision-making autonomy, to pro-
predictive model-based decision-support tools vide sufficient explanatory ability and rigor-
are built in ways to ensure the pursuit of shared ous validation to ensure that providers trust
decision making involving the patient. Tools the results, and to diminish the information
must also provide the appropriate support for and alert overload that providers face today.
ensuring that behavioral changes occur (e.g. if a In summary, we have just begun to see TBI
model predicts the need for increased aerobic and PM informatics discoveries and applica-
exercise, there must be methods to ensure that tions have an impact on achieving the broader
occurs). It will similarly be important to ensure, goals of improving health and the more focused
as data are shared and models are developed, goals of the Quadruple Aim. Over the next
that attention is paid to ethical, legal and social decade, with advances in data analytics methods
30 aspects of data sharing. This will help to main- and increasing sources of data regarding an
tain the trust of patients and to avoid unintend- increasing number of patients, we are likely to
ed biases in the models (consider, for example, see remarkable progress in the development of
the recent issues with facial recognition software more easily developed and more accurate pre-
that works well on white males but not on wom- dictive models that will allow us to intervene at
en of color). The ethical lens will be particularly the patient level. These advances will be inte-
important to ensure that the privacy and trust of grated into the broader trends in health care as
patients and public are preserved as these large encapsulated in the Quadruple Aim, which will
scale data-intensive methods are developed and require additional research and innovation to
deployed. The recent academic and popular ensure that the full potential of TBI and PM are
press discussions of the breeches of trust by large realized.
scale social media and other Internet companies 1. Khoury, M. J., Gwinn, M., Yoon, P. W.,
should serve as a cautionary tale. Dowling, N., Moore, C. A., & Bradley, L.
Regarding the next two elements in the (2007). The continuum of translation research
Quadruple Aim, it will be critical that the work in genomic medicine: How can we accelerate
in TBI/PM be subject to the same kinds of the appropriate integration of human genom-
assessments that we expect for other diagnostic ic discoveries into health care and disease pre-
and therapeutic interventions. Currently there vention? Genetics in Medicine, 9(10), 665–674.
are deployed tools that demonstrate a sensitiv- 2. Kalia, S. S., Adelman, K., Bale, S. J., Chung,
ity and specificity that are far below the values W. K., Eng, C., Evans, J. P., et al. (2017).
that we would otherwise demand of diagnostic Recommendations for reporting of second-
and screening tests. Informatics interventions ary findings in clinical exome and genome
have not been treated in quite the same way as sequencing, 2016 update (ACMG SF v2.0):
laboratory tests or medications. Efforts to dem- A policy statement of the American College
onstrate value and real-world impact of TBI/ of Medical Genetics and Genomics. Genet-
PM tools will align with broader efforts to dem- ics in Medicine, 19(2), 249–255.
onstrate effectiveness in the real world (e.g. 3. Mathias, P. C., Tarczy-Hornoch, P., &
Comparative Effectiveness Research). They Shirts, B. H. (2017). Modeling the costs of
will also form a key aspect of the Learning clinical decision support for genomic preci-
Healthcare System approach, since TBI/PM sion medicine. Clinical Pharmacology &
tools will help to assure that learning can occur Therapeutics, 102(2), 340–348.
from analysis of the data artifacts generated in
The Future of Informatics in Biomedicine
999 30
Box 30.2 The Future of Biomedical ing new capabilities and services into existing
Informatics: Bottlenecks and Opportu- clinical workflows.
nities Multiple advances coming with the march
Eric Horvitz of computer science will help to address chal-
lenges of translating ideas and methods that
I see the rich, interdisciplinary field of biomed- have been nurtured by biomedical informati-
ical informatics as the gateway to the future of cians for decades. At the base level, such
health care. The concepts, methods, rich histo- advances include ongoing leaps in computing
ry of contributions, and the aspirations of bio- power and in storage, but also key innovations
medical informatics define key opportunities with computing principles and methods in such
ahead in biomedicine—and shine light on the subdisciplines as databases, programming lan-
path to achieving true evidence-based health guages, security and privacy, human-computer
care. interaction, visualization, and sensing and
Progress with influences of biomedical ubiquitous computing.
informatics on health care over the last three Faster and more effective translation of
decades has been slower than I had hoped. ideas and methods from biomedical informat-
However, I remain optimistic about a forth- ics will also be enabled by jumps in the quality
coming biomedical informatics revolution, of available computing tools and infrastruc-
made possible by a confluence of advances ture. Increases in the power and ease-of-use of
across industry and academia. Such a revolu- cloud computing platforms are being fueled by
tion will accelerate discovery in biomedicine, unprecedented investments in research and
enhance the quality of health care, and reduce development by information technology com-
the costs of health care delivery. panies—companies that are competing intense-
From my perch as an investigator and direc- ly with one another for contracts with
tor of a worldwide system of computer science enterprises that are hungry for digital transfor-
research labs, I view key opportunities ahead as mation and the latest in modern computing
hinging on (1) addressing the often underap- tools. Cloud computing companies are packag-
preciated bottleneck of translation—moving ing in their offerings sets of development tools
biomedical informatics principles and proto- and constellations of specialized services. Many
types into real-world practice, and (2) making of these offerings are relevant to biomedical
progress on persisting challenges in principles informatics efforts, including machine learning
and applications of artificial intelligence (AI). I toolkits, suites for analysis and visualization of
am optimistic that we will make progress on data, and computer vision, speech recognition,
both fronts and that there will be synergies and natural language analysis services made
among these advances. available via programmatic interfaces.
On challenges of translation, I believe that Beyond developing generic platform capa-
the difficulties of transitioning ideas and imple- bilities, cloud service providers are motivated to
mentations from academic and industry gain understandings in key vertical markets,
research centers into the open world of medical such as health care, finance, and defense, and
practice have been widely underappreciated. have been working to custom-tailor their gen-
Numerous factors are at play, including poor eral platforms with tools, designs, and services
understanding of how computing solutions can for use in specific sectors. For example, there is
assist with the tasks and day-to-day needs of incentive to support rising standards on sche-
health care practitioners and patients, inade- mata (e.g., Fast Healthcare Interoperability
quate appreciation of the needs and difficulties Resources (FHIR)) for storing and transferring
of developing site-specific solutions, poor com- electronic health records and on methods to
pute infrastructure, and a constellation of chal- ensure the privacy of patient data. There is also
lenges with human factors, including entrenched pressure to develop special versions of comput-
patterns of practice and difficulties of integrat- ing services for medicine, such as language
1000 J. J. Cimino et al.
models and, more generally, natural language are multiple opportunities to build and to inte-
capabilities specialized for medical terminolo- grate pipelines where data flow via machine
gy, enabling more accurate understanding and learning to predictions and via automated deci-
analysis of medical text and speech. Competitor sion analyses to recommendations about test-
cloud providers have also worked to identify ing and treatment. Making key investments to
and provide efficient methods and tools for build and refine effective data-to-prediction-to-
important vertical needs, such as the rising decision pipelines will provide great value in
importance of determining DNA sequences multiple areas of medicine [2].
and interpreting protein expression data. Such Opportunities ahead for biomedical infor-
special needs of researchers and clinicians have matics include leveraging recent advances in
led to the availability of efficient and inexpen- deep learning in medical applications, especially
sive cloud-computing services for genomic and for image recognition and natural language
proteomic analyses. tasks. These multilayered neural network archi-
Moving on to the second realm of opportu- tectures are celebrated for providing surprising
nities, around harnessing advances in the con- boosts in classification accuracy in multiple
stellation of technologies that we call AI, I application areas and for easing engineering
believe that our community can do more to overhead, as they do not require special feature
30 leverage existing methods and also to closely engineering. The methods have been shown to
follow, push, and contribute to advances in AI perform well for recognition in the image-centric
subfields. Beyond methods available today, key areas of pathology and radiology. Different vari-
developments will be required in principles and ants of deep learning are also being explored for
applications to realize the long-term goals of building predictive models from clinical data
biomedical informatics. I am seeing good prog- drawn from electronic health records. Beyond
ress and am optimistic that the advances com- direct applications, deep learning methods have
ing over the next decade will be deeply enabling. led to enhanced capabilities in multiple areas of
On existing technologies, and focusing on AI with relevance to goals in biomedical infor-
the example of developing effective decision matics, including key advances in computer
support systems, we have been very slow to vision, speech recognition, text summarization,
leverage the visionary ideas proposed by Robert and language translation.
Ledley and Lee Lusted in 1959 [1]. Ledley and With all of the recent fanfare about deep
Lusted provided a blueprint for constructing learning, it is easy to overlook the applicability
differential diagnoses and to use decision-theo- of other machine learning methods, including
retic analyses to generate recommendations for probabilistic graphical models, generalized
action. Biomedical informatics investigators additive models, and even logistic regression for
have been top leaders with exploring proto- serving as the heart of predictions in recom-
types for decision support systems, and systems mendation engines. While excitement about
constructed over 60 years of research have been deep learning is appropriate, it is important to
shown to perform at expert levels. However, note that the methods typically require large
real-world impact has been limited to date. A amounts of data of the right form and that such
key bottleneck has been the scarcity and cost of datasets may not be available for medical appli-
expertise and data. I believe that harnessing cations of interest. Other approaches have
advances in machine learning will be particu- proven to be as accurate for clinical applications
larly critical for delivering on the vision of evi- and provide other benefits such as providing
dence-based clinical decision making. Machine more intelligible, explainable inferences. Also,
learning techniques available today can and when sufficiently large corpora of data labeled
should be playing a more central role in health with ground truth are not available, knowledge
care for assisting with pattern recognition, acquisition techniques, referred to broadly as
diagnosis, and prediction of outcomes. There machine teaching, can provide value. While work
The Future of Informatics in Biomedicine
1001 30
is moving forward on machine teaching, exist- new sources of biomedical knowledge, and to
ing methods and tools can be valuable in build- address the challenge of data scarcity and
ing models for prediction and classification. related difficulties with the generalizability of
I believe that it is important to note that hav- data resources for health care applications.
ing access to powerful machine learning proce- On data scarcity and generalizability, an
dures may be insufficient for addressing goals in important, often underappreciated challenge
biomedical informatics. Key challenges for mov- in biomedical informatics is that the accuracy
ing ahead with developing and deploying effective of diagnosis and decision support may not
decision support systems include identifying transfer well across institutions. In our work
where and when such systems would provide val- at Microsoft Research, we found that accura-
ue, collecting sufficient amounts of the right kind cies of a system trained on data obtained from
of data for applications, developing and integrat- a site can plummet when used at another loca-
ing automated decision analyses to move from tion. The poor generality of datasets is based
predictions to recommendations for action [2], on multiple factors, including differences in
maintaining systems over time, developing means patient populations—with site-specific inci-
to build and apply learned models at multiple dence rates, covariates, and presentations of
sites, and addressing human-factors, including illness, site-specific capture of evidence in the
formulating means for achieving smooth integra- electronic health record, and site-specific defi-
tion of inferences and recommendations into nitions of signs, symptoms, and lab results. As
clinical workflows, and providing explanations of an example, we found site-specificity when my
inferences to clinicians [3]. Providing explanations team studied the task of building models to
of predictions generated by machine-learned predict the likelihood that patients being dis-
models is a topic of rising interest [4]. I hope to charged from a hospital would be readmitted
see revitalized interest and similar enthusiasm within 30 days. The accuracy of prediction for
extended to addressing challenges identified in a model learned from a massive dataset drawn
biomedical informatics with the intelligibility from single large urban hospital dropped
and explanation of the advice provided by other when the model was applied at other hospitals.
forms of reasoning employed in decision support This observation of poor generalizability was
systems, including logical, probabilistic, and deci- behind our decision to develop a capability for
sion-theoretic inference [5]. performing automated, recurrent machine
Key opportunities in AI research for prog- learning separately at each site that would rely
ress with developing and fielding effective on local data for predictions. This local train-
decision support systems include efforts in and-test capability served as the core engine of
principles and applications of transfer learn- an advisory system for readmissions manage-
ing, unsupervised learning, and causal inference. ment, named Readmission Manager, that was
Transfer learning refers to methods that allow commercialized by Microsoft.
for data or task competencies learned in one Moving forward, research on a set of meth-
area to be applied to another [6]. Unsupervised ods jointly referred to as transfer learning may
and semi-supervised learning refers to meth- help to address challenges of data scarcity and
ods that can be used to build models and per- generalizability. Transfer learning algorithms
form tasks without having a complete set of for mapping the learnings from one hospital to
labeled data, such as labels about the final another show promise in medicine [6]. Such
diagnoses of patients when working with elec- methods include multitask learning. Also,
tronic health records data. Causal inference obtaining spanning datasets, composed of
refers to methods that can be used to identify large amounts of data drawn from multiple
causal knowledge, versus statistical associa- sites, may provide effective generalization. In
tions that are commonly inferred from data. support of this approach, methods called mul-
Advances in these areas promise to provide tiparty computation have been developed that
1002 J. J. Cimino et al.
can enable learning from multiple, privately General methodology and case study.
held databases, where there is no violation of PLOS One Medicine, 9(10), e109264.
privacy among the contributing organizations. https://doi.org/10.1371/journal.
Beyond the daily practice of health care, pone.0109264
and uses in such applications as diagnosis and 3. Teach, R. L., & Shortliffe, E. H. (1981). An
treatment, methods for learning and reasoning analysis of physician attitudes regarding com-
from data can provide the foundations for new puter-based clinical consultation systems.
directions in the clinical sciences via tools and Computers and Biomedical Research, 14(6),
analyses that identify subtle but important sig- 542–558. 7 https://doi.org/10.1016/0010-
nals in the fusing of clinical, behavioral, envi- 4809(81)90012-4.
ronmental, genetic, and epigenetic data. I see 4. Caruana, R., Koch, P., Lou, Y., Sturm, M.,
many directions springing from applications of Gehrke, J., & Elhadad, N. (2015). Intelligi-
machine learning, reasoning, planning, and ble models for healthcare: Predicting pneu-
causal inference for health care delivery as well monia risk and hospital 30-day readmission.
as in supporting efforts in health care policy KDD, August 10–13, 2015, Sydney, NSW,
and in the discovery of new biomedical under- Australia.
standings. 5. Horvitz, E., Heckerman, D., Nathwani, B.,
30 I remain excited about advances in biomed- & Fagan, L. M. (1986). The use of a heuris-
ical informatics and see a biomedical informat- tic problem-solving hierarchy to facilitate the
ics revolution on the horizon. Such a revolution explanation of hypothesis-directed reason-
will build on the glowing embers of decades of ing. Proceedings of Medical Informatics,
contributions and the flames of late-breaking Washington, DC (October 1986), North
activities that address long-term challenges and Holland: New York, pp. 27–31.
bottlenecks. 6. Wiens, J., Guttag, J., & Horvitz, E. (2014).
1. Ledley, R. S., & Lusted, L. B. (1959). Rea- A study in transfer learning: Leveraging
soning foundations of medical diagnosis. data from multiple hospitals to enhance
Science, 130(3366), 9–21. hospital-specific predictions. Journal of the
2. Bayati, M., Braverman, M., Gillam, M., American Medical Informatics Association,
Mack, K. M., Ruiz, G., Smith, M. S., & 21(4), 699–706. 7 https://doi.org/10.1136/
Horvitz, E. (2014). Data-driven decisions amiajnl-2013-002162.
for reducing readmissions for heart failure:
The Future of Informatics in Biomedicine
1003 30
Box 30.3 The Future of Nursing Infor- settings to achieve desired health and health-
matics care outcomes. This support is accomplished
Judy Murphy using information structures, information pro-
cesses, and information technology [6].
The focus of this commentary is on the future NI continues to grow. In the most recent
of biomedical informatics from a nursing per- Health Information and Management Systems
spective, but it is helpful to understand the Society (HIMSS) NI Workforce Survey, 57% of
background and history of nursing’s role in respondents held a post-graduate degree in
the field. Starting there, the focus will move to nursing or nursing informatics and 44% were
looking at nursing informatics today and then specialty certified by ANA in NI or other nurs-
looking to the future of the field from a nurs- ing specialty. Another 32% were currently pur-
ing point of view. suing NI certification, and over half have been
Nurses have contributed to the purchase, working in an informatics role for more than 7
design, and implementation of health infor- years [7].
mation technology (IT) since the 1970s. The Since the HITECH Act of 2009, nursing
term “nursing informatics” (NI) first appeared informatics specialists have played a pivotal
in the literature in the 1980s [1–3]. The defini- role in influencing the adoption of electronic
tion of NI has evolved ever since, molded by health records (EHR) for meaningful use.
maturation of the field and influenced by Having the breadth and depth of healthcare
health policy. In a classic article that described knowledge and understanding clinical practice
its domain, NI was defined as the combination workflows, nurses help all clinicians understand
of nursing, information, and computer sci- the application and value of the EHR. Nurses
ences to manage and process data into infor- have a perspective of the many venues of care
mation and knowledge for use in nursing and working with all care team members, as
practice [4]. Nurses who worked in NI during well as working with patients at different points
that time were pioneers who often got into in their care continuum. Nurses help the patient
informatics practice because they were good utilize health IT to improve engagement in
clinicians, were involved in IT projects as edu- their own care, take control of their own health
cators or project team members or were just and become an integral part of the decision-
technically curious and willing to try new making process and care team. As patient
things. Their roles, titles, and responsibilities advocates, nurses understand the power of the
varied greatly. patient in a participatory role and how this can
A solid foundation for the NI profession improve outcomes.
continued to be laid over the ensuing 40 years. The type and quality of care that nurses
Today, informatics has been built into under- provide to their patients will benefit immensely
graduate nursing education and there are over a from the continued advancement of technology
hundred schools offering post-graduate NI and informatics in healthcare. Although there
education. NI is recognized as a specialty by are many ways those advancements will impact
the American Nursing Association (ANA) and nursing, here are two areas that hold the great-
has a specialty certification [5]. NI is now est promise for nursing’s future.
described as the specialty that integrates nurs- Data and the Continuous Learning Health
ing science with multiple information and ana- System: Nursing research has not been as pro-
lytical sciences to identify, define, manage, and lific as medical research, so there is a lot less
communicate data, information, knowledge, known about the true impact/outcomes of
and wisdom in nursing practice. NI supports nursing interventions. But now that organiza-
nurses, consumers, patients, the interprofes- tions are aggregating health data electronical-
sional healthcare team, and other stakeholders ly in an EHR and other Health IT, nurses can
in their decision-making in all roles and in all more easily identify practices that measurably
1004 J. J. Cimino et al.
impact individuals by mining the data and care gaps for preventive and disease manage-
using prescriptive, predictive and cognitive ment services, monitor patients’ conditions while
analytics to correlate actions to improved out- they live their lives and not just when they visit a
comes. The collection, summarization and healthcare facility, and provide consulting and
analysis of data can be from multiple venues educational services.
and sources, including social determinants The future of nursing informatics has no
and patient-generated information for person- bounds; technologies of all kinds will continue
alization. Then, it’s not just about impacting to evolve, and informatics will help nurses both
traditional care, but about the impact across integrate new technologies into their practice as
the continuum for the individual and includ- well as manage the impact of new technologies
ing public health and population health man- on that practice. Informatics will help invent
agement. The learnings can be iterated back the future of nursing care transformation.
into nursing practice in months instead of 1. Ball, M., & Hannah, K. (1984). Using com-
years, using protocols/guidelines, documenta- puters in nursing. Reston, VA: Reston Pub-
tion templates, and clinical decision support – lishers.
making it easier to do the right thing and 2. Grobe, S. (1988). Nursing informatics compe-
‘hard-wiring’ new best practices – thus, creat- tencies for nurse educators and researchers. In
30 ing a continuous learning health system. H.E. Petersen & U. Gerdin-Jelger (Eds.), Pre-
Care Coordination and Healthcare Anywhere: paring nurses for using information systems:
The advancement of technology has provided us Recommended informatics competencies.
the opportunity to provide care anytime/any- New York: National League for Nursing.
where and there’s little question that both 3. Hannah, K. J. (1985). Current trends in
patients and providers are increasingly drawn to nursing informatics: Implications for cur-
the concept of healthcare services that are vir- riculum planning. In K. J. Hannah, E. J.
tual. This includes “visits” using communication Guillemin, & D. N. Conklin (Eds.), Nursing
technologies such as email, phone and videocon- uses of computers and information science.
ference, as well as telehealth technologies for Amsterdam: Elsevier.
remote monitoring and management of condi- 4. Graves, J., & Corcoran, S. (1989). The study
tions or chronic disease. Coupling this with of nursing informatics. Image: Journal of
engaged patients using portals and mobile apps Nursing Scholarship, 21(4), 227–231.
creates a new ecosystem for nurses and their 5. ANCC. (2018). Informatics nursing certifi-
patients to interact. Care coordination between cation. Retrieved from 7 https://www.
venues of care and across the continuum will be nursingworld.org/our-certifications/infor-
directly impacted in a positive way. As nurses matics-nurse/.
have primary responsibility for coordinating 6. ANA. (2014). Nursing informatics: Scope
care and helping patients navigate the complexi- and standards of practice (2nd ed.). Silver
ties of the healthcare system, this will be a way Spring: ANA.
for them to extend their reach to more patients 7. HIMSS. (2017). Nursing informatics work-
and to improve the quality of the care provided force survey. Retrieved from 7 https://www.
to each patient. Nurses can more easily close himss.org/ni-workforce-survey.
The Future of Informatics in Biomedicine
1005 30
Box 30.4 Biomedical Informatics: The strating the value of services provided and to
Future of the Field from a Health Policy manage resource use continuously over a
Perspective reporting period. Interoperability and exchange
David Blumenthal of health care data will become a business
imperative to the extent that accountable pro-
Policy issues and developments in the United viders must absorb the costs of services pro-
States will be vital to the evolution and effica- vided to their patients at other health care
cy of health information technology (HIT) in facilities in their communities.
the future. This is true because health policy HIT for value maximization will also put
has made HIT a mainstream feature of the much greater emphasis on improving clinical
U.S. health care system and a vital tool for decisions so as to enhance the value of services
improving it. performed. In a value-oriented environment,
Two types of health policy issues will vital- usable and helpful decision support will achieve
ly affect the future of HIT, its uses and its ben- a priority it has never had in the current fee-for-
efits. The first type is generic to the U.S. health service environment. Another priority will like-
care system but will indirectly affect how HIT ly be the capability to assess the comparative
evolves. The second type of policy issue focus- performance of clinicians within organizations
es particularly on HIT. so as to evaluate reasons for variation in
Generic policy issues include payment decision-making and health care outcomes.
reform and the push toward consumer empow- A bipartisan interest in making health care
erment. There is an urgent need for payment markets more competitive and responsive to
reform to address issues such as the high costs patients’ needs is also motivating a push
associated with the U.S. health system. HIT toward patient empowerment through sharing
has the potential to be a powerful tool in electronic data with patients and their fami-
health system improvement but whether that lies. This movement is reflected in legislation
potential is exploited will depend on the needs and regulations that encourage providers to
and priorities of its users, especially health share EHR data with individuals or their des-
care providers. In a fee-for-service environ- ignated third parties. The Office of the
ment, where volume and revenue maximiza- National Coordinator for Health Information
tion are prioritized, purchasers of HIT will Technology (ONC) issued a rule in 2015 that
demand that it serve these purposes. The requires certified EHRs to have standardized
requirement to capture detailed information application programming interfaces (APIs)
for billing purposes will be paramount to the [1], which will facilitate access to EHR data by
design and configuration of electronic health patients and their agents. A new ONC rule,
records (EHRs) and other IT. Information proposed in the spring of 2018 [2], would also
systems will be used to assure that providers discourage so called information blocking.
capture every billable service in a way that The growing interest in data-sharing with
maximizes revenue collected. patients is also apparent in Apple’s decision
Payment approaches that prioritize value to work with 13 prominent health systems [3]
will favor different HIT configurations, espe- to accept their patients’ EHR data. Large,
cially if those payment methods hold providers innovative technology companies like Apple
accountable through risk-sharing for the cost may be able to support patient empowerment
and quality of services. HIT will have to facili- by fashioning user-friendly applications that
tate the capture and reporting of quality and use patients’ data to inform their decision-
cost information for the purpose of demon- making.
1006 J. J. Cimino et al.
The emergence of such applications will EHR certification process will likely play a
raise a host of policy issues. Finding ways to role in pursuing improved safety of patient
assure the safety of these consumer-facing data.
applications will be a critical part of consumer To address these and other health IT safety
empowerment, and constitutes a key policy concerns, multiple experts have proposed the
agenda. To this end, the Food and Drug establishment of a safety collaborative com-
Administration (FDA) is making an effort to posed of EHR developers, hospitals, govern-
adjust its traditional regulatory approaches for ment, health practitioners, and other key
the special circumstances of HIT applications. organizations to work together to resolve safety
One example of their efforts is the Accelerated problems.
Digital Clinical Ecosystem (ADviCE), a partner- Finally, policy interventions may be required
ship between the University of California, San to improve equity of access to benefits of HIT in
Francisco, several other universities and health rural areas and for underserved populations.
systems, and the FDA to share best practices and Lack of connectivity and sophisticated technical
data for using, integrating, and deploying health support can handicap rural providers in their
technology services and applications. ADviCE efforts to use advanced HIT.
will make recommendations on the types of data With the increasing power of HIT in health
30 needed, data sharing, transparency, and use. care will come increased reliance on its capa-
Policymakers must also find ways to protect bilities for responding to policy challenges,
privacy of patients, either through enforceable both general and HIT-specific. For the most
voluntary standards or governmental regula- part, these challenges will stimulate evolution
tion of emerging private organizations, like in HIT design that makes it even more useful
Apple, that play the role of data stewards. and important for the future of our health care
Some HIT specific policy issues are also system, and its patients and providers.
likely to influence the future development of 1. 7 https://www.healthit.gov/sites/default/
health information technology. On this front, files/facas/HITSC_Onc_2015_edition_
the increased use of EHRs has also given rise final_rule_presentation_2015-11-03.pdf
to safety challenges, as enumerated in a recent 2. 7 https://www.reginfo.gov/public/do/
report from the Pew Charitable Trusts [4]. For e A g e n d a Vi ew Ru l e ? p u b I d = 2 0 1 8 0 4 &
example, patients may receive the incorrect RIN=0955-AA01
dose of a medication or clinicians may select 3. 7 https://hbr.o rg/2018/03/apples-pact-
the wrong person when inputting an order. with-13-health-care-systems-might-actu-
These safety issues are probably linked with ally-disrupt-the-industry
the usability of EHRs, and suggest the need 4. 7 https://www.pewtrusts.org/en/research-
for improved user-centered design focused on and-analysis/reports/2017/12/improving-
the needs of both clinicians and patients. The patient-care-through-safe-health-it
The Future of Informatics in Biomedicine
1007 30
Box 30.5 Future Perspective sets collected in near real-time. Patients and
Mark Frisse their families can also access much more infor-
mation and are have become truly central to
As this textbook demonstrates, biomedical health care; patients are speaking up, and our
informatics paradigms are changing. Original health system is listening.
paradigms of necessity were moored by an envi- Other academic disciplines, once working
ronment where data sets were small; data stor- on the periphery of biomedical informatics, are
age was limited; computation required massive converging and taking center stage. Social sci-
and costly hardware; and high-bandwidth net- entists explore care complexity both in delivery
work connections were rare. Most major bio- settings in and in the home. For example, cogni-
medical research was conducted in large tive scientists seek more effective and efficient
laboratories and, with a few exceptions, compu- ways of managing care tasks. Operations
tational needs were limited. Health care delivery research professionals seek to improve patient
and clinical research generally took place in access, scheduling, workflow analysis, capacity
hospitals and large clinics both affiliated with management, throughput, and systems science.
medical schools and endowed with talent, reve- Behavioral psychologists are studying how
nues, and capital necessary for their successful mobile technologies can “nudge” patients and
operation. Payment models and reimbursement providers into better behaviors. As a result,
for health care operations took place behind the informatics has become even more imaginative,
scenes without excessive complexity and with extensive, rigorous, broad, accessible, and inex-
few burdens on providers. Public health work- pensive.
ers, health policy researchers and related groups The accomplishments have been many, the
had access only to selective, retrospective, and future seems bright, and the potential for soci-
often manually-collected data and limited ana- etal good is promising. But, to paraphrase nov-
lytic capabilities. Informatics was a select, elist William Gibson, this bright future is not
expensive, and time-consuming endeavor. now nor will it quickly become evenly distrib-
Despite great challenges, remarkable feats were uted. Both in biomedicine and in society at
accomplished. large, new paradigms and technologies trans-
Recent paradigms are untethered from formed commerce, interpersonal communica-
many early constraints. Today, data sets are tion, social interactions, and behaviors have
massive and plentiful; data storage is inexpen- upended almost every aspect of society. By inte-
sive and seems unlimited; computation is ubiq- grating and analyzing the multiple data streams
uitous and extends from minute sensor devices emerging from our personal behavior, commu-
to massive cloud-based virtual machines; high- nication, reading habits, purchasing patterns,
bandwidth network connections are pervasive and social interactions, data and algorithms are
and central to American life. The range of bio- capable of startlingly accurate predictions that
medical research activities is far broader and is in turn can profoundly influence behavior. The
constrained more by funding and talent limita- velocity of these changes carries biomedical
tions than by facilities; and computation is not informatics – and all of society – into an uncer-
only central to traditional research approaches tain future full of promise and peril.
but has extended the reach of scientific investi- Consider the American healthcare system.
gation dramatically through the analysis of The United States has the highest per capita
data sets ranging from molecules to genomes. expenditures for health care in the world, yet, by
Largely because of Internet- based services, many measures, important health care quality
more providers and other care givers have measures lag far behind these of other countries
access to information they need. Public health [1]. Despite significant advances in technology
workers, health policy researchers and other and clinical informatics, this trend continues.
interest groups can access large and broad data This may be due in part because technologies are
1008 J. J. Cimino et al.
not capable only of reducing complexity, they cannot simply continue their current approach at
are also capable of introducing additional com- the expense of providing the cognitive support
plexity whether such complexity is warranted or patients and clinicians desperately need. To
not. One cannot argue against effective infor- improve clinician morale and productivity, the
matics support for prescribing decisions; biology urge to introduce even further unnecessary
and the clinical condition warrant extreme detail administrative burdens on care providers must
to complexity. Similarly, knowledge of total and be resisted.
out-of-pocket drug costs would be helpful if Given the many turbulent transformations
patients were presented with choices, but it is dif- in care delivery methods, care delivery organiza-
ficult to rationalize the hundreds (if not thou- tions, and patient-centered health technologies,
sands) of different formularies imposed by many clinical informatics advances will be the
health plans. One can argue that effective mea- realized through extension of traditional EHRs
surement of outcomes and care metrics is essen- and still others will be the product of experi-
tial for demonstrably increasing quality of care, mentation with clinical technologies address
but the value of many quality metrics is uncer- immediate consumer-directed needs and view
tain and the administrative burdens imposed on EHR connectivity as secondary objective. Since
clinicians who must collect these data borders on both models will be introduced, evaluated, and
30 the intolerable, often coming at the expense of adopted, one must understand how informatics
patient interaction. As the economist Uwe can influence the evolution of many different
Reinhardt wrote: “I have been at many confer- types of clinical systems.
ences at which concerned clinicians explore so- The ascendancy of data science has been a
called ‘evidence-based medicine,’ replete with central theme of biomedical informatics. Broadly
‘evidence-based best-clinical-practice guidelines’ construed, these activities expand fundamental
and the associated ‘clinical pathways.’ I cannot biomedical informatics activities through the
recall a conference on the topic of ‘evidence- introduction of new technologies and techniques.
based best administrative practices,’ (although I Findings emerging from increasingly interopera-
may have missed it.)” [2]. ble clinical databases like i2b2, OMOP, and
Consider the future role of the traditional PCORNet further stimulate essential large-scale,
institution-centric electronic health record. collaborative data standardization and ontology
Federal incentives greatly accelerated the intro- development. These in turn will simplify the
duction of EHRs into hospitals and clinics and inclusion of a broader array of personal, environ-
made transactions like e-Prescribing routine. mental, and biologic computable knowledge
Data and communications standards allow com- structures. Machine learning and related disci-
munication across different clinical systems and plines arising from these activities foster discovery
expand capabilities for medication management, of previously unknown medication interactions,
care coordination and other clinical activities genetic propensities, behavioral risks, predictions,
outside of hospitals and clinics. Common EHR and actionable care interventions.
data elements and organizational data ware- Social networks and other forms of infor-
houses are simplifying secondary data use for mal communication are having similar impacts.
quality reporting, administration, population In principle, these networks can gather isolated
health, research, and other uses. Web portals, individuals sharing common concerns and can
mobile communications, and patient-accessible reinforce positive behaviors and combat imped-
EHRs are engaging patients and their families to iments to health – social isolation, misinforma-
a greater degree. But this rapid introduction of tion, and costs. Some forms of “digital group
EHRs has been a mixed blessing. Critics claim therapy” or “group telemedicine” may be par-
that EHRs focus on administrative and payment ticularly well-suited in these circumstances.
at the expense of providing the cognitive support A dazzling array of new technologies must
patients and clinicians desperately need. EHRs also be understood and when appropriate intro-
The Future of Informatics in Biomedicine
1009 30
duced into clinical research and care delivery. tices challenge society’s very idea of a common
The collection, integration, and analysis of new truth.
data streams produced by these devices are Advances in data science and analytics,
already being used to manage diet, weight, exer- when combined with sensors and devices on the
cise, and even cardiac rhythm problems. person, in the home or in public spaces raise
Untethered from traditional EHRs, these prod- fears that “someone/something is always
ucts are producing new and valuable sources of watching.” If data are aggregated and used by
ambiently-collected data at lower costs. an unauthorized “data- industrial complex”
Speech and gesture recognition will simplify working outside of socially acceptable norms,
human-computer interaction. Ambient data col- privacy rights are threatened. Better means of
lection methods simplify collection of routine anonymizing data and more realistic privacy
data and provide additional context for docu- and data use policies will become even more
mentation and interpretation. Clinician-computer important.
interactions may be unobtrusive and allow great- Although paradigms change, an emphasis
er focus on patients rather than computer screens. on data, information, knowledge, and effective
Ambient data collection – including video inter- use remains foundational. A primary responsi-
pretation of clinician – patient interactions may bility of biomedical information is to ensure that
be used to more completely summarize the clini- everything from data generation to knowledge
cal encounter. Image recognition technologies generation is continually improving through
can diagnose skin disorders, radiographs, and greater consistency and efficiency. These
some other medical images. Machine learning improvements in turn should result in systems
algorithms will reliably screen for abnormalities that more effectively address real needs and not
and complement human judgement. merely automate flawed behaviors or practices.
We cannot can fully control how innova- Our future depends on the extent to which we
tions will be adopted, nor can we predict their can introduce efficient means of presenting
societal impact. Informatics – and innovation needed, reliable, and consistent information and
more broadly – is a two-edged sword. the extent to which our efforts ensure better out-
For example, clinical systems have improved comes for individuals and society. To be effec-
care, reduced costs, and contributed to new tive, informatics professionals proceed based on
insights through translational informatics and their experience, knowledge, and values. They
data science. At the same time, they have added must, in other words, practice wisdom.
considerably to administrative burdens and cost, 1. Schneider, E. C., Sarnak, D. O., Squires, D.,
and in practice, may emphasize administrative Shah, A., & Doty, M. M. (2017). Mirror, mir-
tasks over the critical cognitive work that is the ror 2017: International comparison reflects
foundation of clinical medicine. At the clinical flaws and opportunities for better U.S. Health
and policy level, efforts to simplify programs Care. Commonwealth Fund. 7 http://www.
and processes become even more important. commonwealthfund.org/interactives/2017/
Similarly, social networks and telemedicine july/mirror-mirror/.
allow previously isolated individuals to rein- 2. Reinhardt, U. E. (2013, September 13).
force possibly socially objectionable attitudes Waste vs value in American Health Care.
or behaviors. But these same networks can rap- New York Times. 7 https://economix.blogs.
idly distribute and reinforce exaggerated or nytimes.com/2013/09/13/waste-vs-value-in-
false claims about the efficacy of vaccinations, american-health-care/.
treatments, and scientific evidence; these prac-
1010 J. J. Cimino et al.
Box 30.6 The Future of Health IT: healthcare system look like? The answer is that
A Clinical Perspective the experience of being both patient and health-
Robert M. Wachter care professional will be far more satisfying.
Let’s turn first to the hospital. Much of the
About a decade ago, I hired a young clinical care that we currently think of as requiring hos-
informaticist for a faculty position at UCSF. I pitalization will undoubtedly be accomplished
told him he had an incredibly bright future, within less expensive settings (including the
since we would soon implement a well-respect- patient’s home), aided by a variety of technolo-
ed vendor-built electronic health record (EHR). gies ranging from clinical sensors to advanced
I was confident that this would be exciting and audio and video capabilities. The hospital will
important work, work that would keep him mostly exist to care for very sick patients – the
fully employed for years to come. types we might today associate with being in
I didn’t share with him my worry: what the ICU. And the ICU will likely no longer be a
would his job be after the EHR was installed? walled off physical space. Rather, every hospi-
Needless to say, despite the fact that our tal bed will be modular, capable of supporting
EHR has been up and running for 6 years, he ICU level care with the push of a few buttons.
Decision-making about who needs higher
30 remains gainfully employed. In fact, he is busi-
er than ever. His experience taught me some- levels of care will not be left to the clinician’s
thing I did not understand at the time: the “eyeball test.” Instead, clinicians’ experience
implementation of the EHR is merely the first will be augmented by sophisticated AI-based
step in the process of extracting value from prediction tools constantly humming in the
healthcare digitization. background, alerting doctors and nurses that,
In fact, I have come to see the process of say, a patient’s probability of death just spiked
digitization as involving four steps: up and thus she bears closer watching. Of
1. Digitizing the record course, taking advantage of all these AI-
2. Connecting all the digital parts (“interoper- generated predictions will require cracking the
ability”) tough nut of alert fatigue. This will be accom-
3. Gaining insights from the digital data now plished by markedly decreasing false positive
being generated by and traversing the sys- rates, implementing advanced data visualiza-
tem tion and other prioritization methods, and like-
4. Taking advantage of digitization to build ly through the discovery of approaches that
and/or implement new tools and approach- haven’t yet been invented.
es that deliver healthcare value (improving Patient rooms will have large video screens
quality, safety, patient experience, access, and sophisticated camera and audio equipment
and equity while also lowering costs and to allow for tele-visits. Patients and families will
improving efficiency and productivity) be able to review clinicians’ notes, test results,
and treatment recommendations, either on the
In the United States, the $30 billion of big screen or on their hospital-issued tablet
incentive payments distributed by the govern- computer. Patients will not only have full access
ment under the HITECH Act from 2010–2014 to their EHR but will also receive educational
succeeded in achieving the first step – nearly all materials (“here’s what to expect from your
hospitals and 90% of physician offices now use MRI tonight”) and motivation (“Good job on
an EHR. While we see sporadic examples of your incentive spirometer today!”) – provided
activities under Steps 2, 3, and even 4, they are by the technology. The dreaded nurse call but-
by far the exception. ton will be replaced by a voice-activated system
As we look beyond the present, let’s fantasize in which a patient’s request results in a nurse
about a future in world in which we have substan- appearing on screen and even taking some
tially accomplished all four steps. What might our actions (increasing the IV flow rate or adjusting
The Future of Informatics in Biomedicine
1011 30
the bed, for example) remotely. If a new pill is he used in the past day (through the “Internet
needed, likely as not a robot will deliver it. of Things”). The technology will integrate this
When the hospital doctor comes to visit the information, along with streaming data on
patient, the room’s telemedicine capabilities heart rate and blood pressure drawn from wear-
will allow additional parties to participate. For able or stick-on sensors, to offer recommenda-
example, a palliative care discussion can involve tions for drug and activity adjustments. Ditto
distant family members, the inpatient palliative for patients with diabetes, emphysema, asthma,
care team, and a physician at an outside hos- and the like.
pice. An infectious disease consult might Making all of this function will require a
involve a discussion between the patient, the new workflow, and with it a new set of health-
hospitalist, and the ID consultant in real time, care professionals. Sometimes called “care traffic
rather than the serial visits and imperfect com- controllers,” they will be clinicians (likely nurses
munication through chart notes that marks with advanced training in population health
current practice. and some informatics) who will monitor, via
Speaking of notes, in both inpatient and advanced digital dashboards, the status of 100,
outpatient settings physicians will no longer or 1000 such patients, contacting and coaching
spend hours typing notes into the EHR. Rather, the ones who seem to be having problems. The
natural language processing technology will initial contacts may be generated by AI-driven
“listen” to the doctor-patient conversation via algorithms and delivered by the technology, but
room-based microphones and create a useful the care traffic controller will intervene if the
note, improving itself over time as it learns each patient continues to have problems. For patients
physicians’ individual practice style and patient continuing to do poorly, a physician will become
population (“digital scribes”). Documentation engaged. Even then, many of these encounters
will increasingly become the byproduct of the will be IT-enabled remote ones.
doctor-patient encounter, not a central focus For patients with acute medical issues,
on the physician’s attention. much of the care will be delivered by apps,
On the other hand, clinicians will glean far which will also offer AI-derived recommenda-
more useful information and insights from tions for simple diagnoses and interventions.
their digital tools, including the EHR. As data Patients who require higher levels of care will
are entered into the patient’s chart, the EHR see a clinician through telemedicine or commu-
will suggest possible diagnoses and testing nity-based urgent care. Urgent care clinics will
approaches, and guidelines and recommended be conveniently placed in supermarkets and
treatment approaches will be a click or a voice pharmacies, and our eventual success in achiev-
command away. In essence, the EHR and the ing complete interoperability – the patient’s
electronic textbook will merge into one inte- record always available via the cloud – will
grated tool. enhance the ability to view the relevant parts of
Turning to the outpatient arena, much of the EHR and to record data that then becomes
the care that currently requires in-person visits available to all subsequent practitioners.
will be conducted via IT-enabled home care The promise of precision medicine will
and televisits. The care of patients with chronic finally be realized. For example, the guidelines
diseases will be utterly transformed, with a far for treating a 50-year old woman with high
greater emphasis on real-time, home-based, blood pressure or elevated cholesterol will
technology-enabled decision support and dis- become far more complex and customized,
ease management. The heart failure patient will considering a variety of patient- and popula-
begin his day by weighing himself on a digital tion-based risk factors and large amounts of
scale and answering a few questions on the genetic information. This same complexity also
computer (“How is your breathing? How did means that the clinician will depend on the
you sleep?”). It might even know how much salt computer to “know” all of these variables and
1012 J. J. Cimino et al.
suggest the best approach. Rather than remem- also the contribution of companies, large and
bering the correct approach to hypercholester- small, some built specifically to solve specific
emia in middle-aged women (there will no healthcare problems, others digital giants (the
longer be any one correct approach), the role of Apples, Googles, and Amazons of the world)
the clinician will become more about interpret- taking advantage of their capabilities in areas
ing the computer’s output (including interven- like app development, supply chain, and AI to
ing when it seems wrong), communicating the attack healthcare problems.
findings to the patient, and motivating the nec- Importantly, in such a multidimensional
essary behavioral change. Of course, this digital world, success cannot come simply by
changed role will require a significant evolution buying pieces of technology, peeling off the
in medical education. bubble wrap, and dropping them into health-
In fact, the ability to analyze vast amounts care systems and workflows. It will be up to
of digital data will transform all clinical clinical informaticists to deeply understand the
research. Rather than basing most of our treat- needs of patients, clinicians, families, and
ment recommendations on small randomized administrators, the complexities of the tech-
clinical trials, many advances will come through nologies, and the economic, regulatory, privacy,
analyses of actual clinical data, seeing which and often ethical context. Informatics profes-
30 approaches are associated with better out- sionals will be the ones making the clinical and
comes. Of course, this will require sophisticated business case for change, and working with
adjustment for confounders, which should also both vendors and clinicians to ensure that these
be facilitated by the vast amounts of fully inte- new approaches and technologies actually
grated digital clinical information. Individual achieve their aims.
healthcare systems will take advantage of these This is why the job of the clinician infor-
data as well, transforming themselves into so- maticist will remain highly secure for the fore-
called “learning healthcare systems” by mining seeable future. While the job of the informaticist
their own data and experience to determine will no longer be to implement a core enterprise
which approaches lead to the best outcomes. EHR, he or she will be doing something more
The vision that I’ve described here is not complex and likely more valuable: reimagining
around the corner – it is likely 10–15 years the work and the workflow to take advantage
away. And achieving it will require not just the of evolving digital capabilities to improve
technology of a few large EHR vendors, but healthcare value.
The Future of Informatics in Biomedicine
1013 30
Box 30.7 The Future of Biomedical ed divisions and agencies, including the Center
Informatics from the Federal Govern- for Disease Control and Prevention, the Center
ment Perspective for Medicare and Medicaid, and the Agency
Patricia Flatley Brennan for Health Care Research and Quality to rap-
idly respond to public health threats, monitor
Advances in biomedical informatics, including health care expenditures and quality and foster
computational bioinformatics, are essential to systemic interoperability. Partnerships between
accelerating scientific discovery and assuring the NIH and with other federal agencies out-
the health of society. Finally, after 40 years of side of the health sector will allow the invest-
promise, there is sufficient data and computing ments in biomedical informatics to benefit from
power to realize the visions of early biomedical generalized investment in data curation, large
informatics leaders that data-powered health scale data management and storage, privacy
could become a reality. Decades of slow but and network platforms.
steady progress towards formalizing biomedi- The NLM will do for data what it has done
cal knowledge through effective use of lan- for the literature – making them findable, acces-
guage and messaging standards is now sible, interoperable and re-usable (FAIR).
complemented by improvement in heuristics These attributes, linked under the rubric of the
and algorithms that can translate those formal- FAIR principles, provide guidance for how a
izations into actionable decisions. The atten- federal library makes its resources available to
tion of the field to key users has broadened to the public. Making data FAIR requires
include basic science researchers and clinicians improved curation strategies, ones that balance
as well as patients and families. automated approaches with human indexing
As a major provider of health care services, and metadata developments in a way that takes
as well as a key funder of health care services, advantage of the speed of automation while
supporter of biomedical and health related preserving human talent for the most-complex
research, and guardian of key health quality cases. The Library-of-the-Future will continue
initiatives, the United States federal govern- to see the NLM serving as the custodian of key
ment plays and will continue to play a signifi- collections, but also increasing its reach as a
cant role in advancing biomedical informatics connector of important information and data
over the next decade. Federal investment will resources that exist outside of its boundaries.
lead to advances in data management and pro- Future developments may also lead to a discov-
tection, new ways to draw knowledge out of ery-on-demand approach to locating and
health data, and delivery of better, accurate obtaining information that has not be previ-
and complete health information at the point ously archived. The NLM will invest in research
of need, anywhere. Perspectives of open sci- that advances use of these important collec-
ence, ensuring economic advancement through tions and provides novel methodologies to
research, and a recognition of the accountabil- interrogate them.
ity of the government to the taxpayer are Some agencies within the Department of
engendering a new commitment of openness Health and Human Services, such as the Office
and responsiveness to society. of the National Coordinator of Health
The National Library of Medicine (NLM), Information Technology (ONC), will continue
one of the 27 institutes and centers at the to invest in broad, societal resources to maintain
National Institutes of Health, is key among the the health information infrastructure. Other
several federal agencies committed to ensuring agencies, such as the Center for Medicare and
the availability of high-quality data to charac- Medicaid Services, will continue to be both data
terize patient problems, account for health care consumers (payment schemes resting on claims
resource expenditure and foster research driven for health services) as well as data contributors,
by greater understanding of clinical phenome- making their information accessible to consum-
na. The NLM partners with other health relat- ers for enhanced self-monitoring and to
1014 J. J. Cimino et al.
researchers to foster discovery informed by care. Federal resources should be spent on those
Several trans-federal initiatives are on the hori- things that only the federal government should
zon, designed to ensure efficient investment in do. These investments include short and long-
scalable, re-suable information resources. term research and d evelopment programs that
The recognition across NIH of the impor- advance the health and well-being of society, edu-
tance linking clinical information and biological cating the workforce of the future, and protecting
data portends expanded investment in the meth- key assets in perpetuity. Most of this investment is
ods for curating and integrating information likely to occur through the NLM. In the biomedi-
across time within a person and across people to cal informatics arena this means investing in
better understand the health of individuals and research to develop method that are scalable, sus-
populations. Rapid growth of data from research tainable and reproducible, creating computation-
taxes existing technical capabilities and demands al approaches to data management capable of
additional policy development and financial curation at scale, developing the libraries of the
investment to house important data resources. future that not only encompass literature and
The federal government fosters policies that pro- data but also the interim product of research such
tect patient privacy and develop the incentive as protocols, ethics and human subjects agree-
structures to accelerate the adoption of effective ment, as well as novel methods of documenting
30 computer systems for health care. With the rap- research activities, such as the next generation of
idly growing data generating initiatives, the fed- Jupyter notebooks. Development efforts should
eral government must take a critical role in be applied to the ever-growing amount of text-
determining how to best select and preserve the based journal articles and reports, to devise new
full range of information. The federal govern- and creative ways to expose the literature to a
ment will host public discussion and dialogs that variety of publics. Educational programs and
ensure the clinical information is sufficiently efforts of the future will infuse data science and
broad to reflect the clinical experience of all per- advanced biomedical informatics lessons not only
sons. It is responsible for ensuring the cross- in the training programs of specialists, but across
national arrangements needed to keep scientific the biomedical research and clinical training pro-
exchange of health data open and free-flowing. grams, and even extending into equipping patients
Interagency coordination is needed to and lay people with access to data and informa-
ensure that technological advances benefit tion and tools to make use of those resources.
health care and that health dollars leverage It’s worthy to note two very important
investment made in other sectors. The primary trends that will shape the future of the federal
point of coordination is through the engagement with health information technolo-
Networking and Information Technology gies. First, there will be an increase in public
Research and Development (NITRD) private partnerships to leverage knowledge in
Program. NITRD is a trans-agency initiative the technical and information technology sec-
designed to provide the research and develop- tor in support of health care. Such partnership
ment foundations for advancing information should lead to a more robust and interoperative
technologies, and also to deploy those tech- health information environment. Second, there
nologies in the service of the country. The NIH will be certain roles that the federal government
reports its technological research and develop- must preserve, such as maintaining accurate
ment expenditures to the President through the and freely-accessible information resources for
NITRD program. The NIH broadly, and the the public good and overseeing the develop-
NLM specifically, participate in the many ment of policies that foster data sharing while
workgroups that focus on broad ranging topics protecting individual and institutional rights.
such as computing-enabled human interac- Future federal efforts will be accompanied by
tion, communication and augmentation, collaborations with industry. These collabora-
cybersecurity and privacy, and high capability tions could take the form of joint investments in
computing infrastructure and applications. common problems, such as data quality or cura-
The Future of Informatics in Biomedicine
1015 30
tion. Other forms of partnership may emerge that and how it is used and valued. The federal
engage the federal investment for research and government has two key levers for expanding
development with accelerated pathways for tech- the definition of what constitutes health:
nology transfer. Including industrial members on investing in research to demonstrate the con-
Federal Advisory Committees will provide path- sideration of health data, including social
ways for exchange of information. and behavioral predictors, on the impact of
The NLM will continue to play a leadership what constitutes health is a major contribu-
role in maintaining accurate and freely-accessi- tion. Additionally, because of its role as a
ble information resources. The NLM has taken major funder of health care through the
a major step towards this by migrating all of CMS, the federal government shapes what is
our public facing information resources onto a considered of value in health care, such as
common, sustainable technical platform. This research that finds ways to incorporate the
migration will not only enhance efficiencies but social and behavioral predictors of health
also allow for increased interoperability across into routine data collection, and then to
our resources. A common technical platform, ensure the use of this information in the diag-
coupled with enhancement of terminology and nostic, treatment and evaluation aspects of
vocabulary systems, will make it more feasible the health care process.
for intended users to traverse the information The future of biomedical informatics from
resources housed here. the federal perspective is one characterized by
In the future there will be an increasing openness, partnerships and perpetual storage
role of the federal government in protecting of biomedical knowledge. A vibrant research
and preserving information in perpetuity. program will be needed to develop and deploy
The enabling legislation of the NLM directs the tools needed to accomplish this vision.
it to collect the medical knowledge of the Thoughtful deliberation is essential to protect
time and store it permanently in ways that the privacy rights of individuals while fostering
make it accessible for a wide range of users. the greatest degree of sharing of data and
As the largest funder of public health and information needed to achieve the goals
heath care, the federal government indirectly enabled by data driven discovery.
shapes what constitutes health information
1016 J. J. Cimino et al.
Supplementary
Information
Glossary – 1018
Name Index – 1091
Subject Index – 1131
Glossary
21st Century Cures Act A comprehensive bill from participants receiving an intervention
that promotes and funds the acceleration of or interventions under study. It is also com-
research into preventing and curing serious mon to monitor study participants for adverse
illnesses; accelerates drug and medical device events during this phase.
development; attempts to address the opioid
abuse crisis; and tries to improve mental Active storage In a hierarchical data-storage
health service delivery. It also includes a scheme, the devices used to store data that
health IT-related provisions on interoperabil- have long-term validity and that must be
ity, data sharing/exchange and electronic accessed rapidly.
health records.
Acute Physiology and Chronic Health
Abductive reasoning Can be characterized as Evaluation, Version III [APACHE III] A scoring
a cyclical process of generating possible expla- system for rating the disease severity for par-
nations or a set of hypotheses that are able to ticular use in intensive care units.
account for the available data and then each
of these hypotheses is evaluated on the basis Adaptive learning Adapting the presenta-
of its potential consequences. In this regard, tion of learning content in response to
abductive reasoning is a data-driven process continuous assessment of the learner’s per-
that relies heavily on the domain expertise of formance.
the person.
Address An indicator of location; typically
Accountability Security function that ensures a number that refers to a specific position in
users are responsible for their access to and a computer’s memory or storage device; see
use of information based on a documented also: Internet Address.
need and right to know.
ADE See: Adverse Drug Events.
Accountable care A descendant of man-
aged care, accountable care is an approach Admission-discharge-transfer (ADT) The
to improving care and reducing costs. See: core component of a hospital information
Accountable Care Organizations. system that maintains and updates the hos-
pital census, including bed assignments of
Accountable Care Organizations (ACOs) An patients.
organization of health care providers that
agrees to be accountable for the quality, cost, ADT See: Admission-discharge-transfer.
and overall care of their patients. An ACO
will be reimbursed on the basis of managing Advanced Cardiac Life Support A course to
the care of a population of patients and are train providers on the procedure and set of
determined by quality scores and reductions clinical interventions for urgent treatment of
in total costs of care. cardiovascular emergencies.
ACO See: Accountable Care Organizations. Advanced Research Projects Agency Network
(ARPANET) A large wide-area network cre-
Active failures Errors that occur in an acute ated in the 1960s by the U.S. Department
situation, the effects of which are immediately of Defense Advanced Research Projects
felt. Agency (DARPA) for the free exchange of
information among universities and research
Active phase The phase of a clinical research organizations; the precursor to today’s
study during which investigators collect data Internet.
1019
Glossary
AMRS See: Ambulatory medical record sys- tate the integration and communication of
tems. information, perform bookkeeping functions,
monitor patient status, aid in education.
Analog signal A signal that takes on a con-
tinuous range of values. Application programming interface (API) A
specification that enables distinct software
Analog-to-digital conversion (ADC) Conver modules or components to communicate with
sion of sampled values from a continuous- each other.
valued signal to a discrete-valued digital
representation. Applications (applied) research Systematic
investigation or experimentation with the
Anchoring and adjustment A heuristic used goal of applying knowledge to achieve practi-
when estimating probability, in which a per- cal ends.
son first makes a rough approximation (the
anchor), then adjusts this estimate to account Apps Software applications, especially ones
for additional information. downloaded to mobile devices.
Anonymize Applied to health data and infor- Arden Syntax for Medical Logic Module A
mation about a unique individual, the act of coding scheme or language that provides a
de-identifying or stripping away any and all canonical means for writing rules that relate
data which could be used to identify that indi- specific patient situations to appropriate
vidual. actions for practitioners to follow. The Arden
Syntax standard is maintained by HL7.
ANSI See: American National Standards
Institute. Argument A word or phrase that helps com-
plete the meaning of a predicate.
Antibiogram Pattern of sensitivity of a micro-
organism to various antibiotics. ARPANET See Advanced Research Projects
Agency Network.
APACHE III See Acute Physiology and Chronic
Health Evaluation, Version III. Artificial intelligence (AI) The branch of com-
puter science concerned with endowing com-
Apache Open source Web server software puters with the ability to simulate intelligent
that was significant in facilitating the initial human behavior.
growth of the World Wide Web.
Artificial neural network A computer pro-
Applets Small computer programs that can gram that performs classification by taking as
be embedded in an HTML document and input a set of findings that describe a given
that will execute on the user’s computer when situation, propagating calculated weights
referenced. through a network of several layers of inter-
connected nodes, and generating as output
Application program A computer program a set of numbers, where each output corre-
that automates routine operations that store sponds to the likelihood of a particular clas-
and organize data, perform analyses, facili- sification that could explain the findings.
1021
Glossary
ASCII See: American Standard Code for Availability In decision making, a heuristic
Information Interchange. method by which a person estimates the prob-
ability of an event based on the ease with
Assembler A computer program that trans- which he can recall similar events. In secu-
lates assembly-language programs into rity systems, a function that ensures delivery
machine-language instructions. of accurate and up-to-date information to
authorized users when needed.
Assembly language A low-level language for
writing computer programs using symbolic Averaging out at chance nodes The process
names and addresses within the computer’s by which each chance node of a decision tree
memory. is replaced in the tree by the expected value of
the event that it represents.
Association of American Medical Colleges
(AAMC) A non-profit organization that Backbone links Sections of high-capacity
includes all US and Canadian medical col- trunk (backbone) network that interconnect
leges and many teaching hospitals, and sup- regional and local networks.
ports them in their education and research
mission. Backbone Network A high-speed commu-
nication network that carries major traffic
Asynchronous Transfer Mode (ATM) A net- between smaller networks.
work protocol designed for sending streams
of small, fixed length cells of information over Background question A question that asks
very high-speed, dedicated connections, often for general information on a topic (see also:
digital optical circuits. foreground question).
Audit trail A chronological record of all Backward chaining Also known as goal-
accesses and changes to data records, often directed reasoning. A form of inference used
used to promote accountability for use of, and in rule-based systems in which the inference
access to, medical data. engine determines whether the premise (left-
hand side) of a given rule is true by invok-
Augmented reality Imposition of synthetic ing other rules that can conclude the values
three-dimensional and text information on of variables that currently are unknown and
top of a view of the real world seen through that are referenced in the premise of the given
specialized glasses worn by the learner. rule. The process continues recursively until
all rules that can supply the required values
Authenticated A process for positive and have been considered.
unique identification of users, implemented
to control system access. Bag-of-words A language model where text is
represented as a collection of words, indepen-
Authorized Within a system, a process for dent of each other and disregarding word order.
limiting user activities only to actions defined
as appropriate based on the user’s role. Bandwidth The capacity for information
transmission; the number of bits that can be
Automated indexing The most common transmitted per unit of time.
method of full-text indexing; words in a
document are stripped of common suffixes, Baseline rate: population The prevalence of the
entered as items in the index, then assigned condition under consideration in the population
weights based on their ability to discrimi- from which the subject was selected; individual:
nate among documents (see vector-space The frequency, rate, or degree of a condition
model). before an intervention or other perturbation.
1022 Glossary
Basic Local Alignment Search Tool (BLAST) An Best of breed An information technology
algorithm for determining optimal genetic strategy that favors the selection of individual
sequence alignments based on the obser- applications based on their specific function-
vations that sections of proteins are often ality rather than a single application that inte-
conserved without gaps and that there are grates a variety of functions.
statistical analyses of the occurrence of small
subsequences within larger sequences that Best of cluster Best of cluster became a vari-
can be used to prune the search for matching ant of the “best of breed” strategy by selecting
sequences in a large database. a single vendor for a group of similar depart-
mental systems, such laboratory, pharmacy
Basic research Systematic investigation or and radiology.
experimentation with the goal of discovering
new knowledge, often by proposing new gen- Bibliographic content In information
eralizations from the results of several experi- retrieval, information abstracted from the
ments. original source.
Bayes’ theorem An algebraic expression often Binary The condition of having only two val-
used in clinical diagnosis for calculating post- ues or alternatives.
test probability of a condition (a disease, for
example) if the pretest probability. (preva- Biobank A repository for biological materi-
lence) of the condition, as well as the sensitiv- als that collects, processes, stores, and distrib-
ity and specificity of the test, are known (also utes biospecimens (usually human) for use in
called Bayes’ rule). Bayes’ theorem also has research.
broad applicability in other areas of biomedi-
cal informatics where probabilistic inference is Biocomputation The field encompassing the
pertinent, including the interpretation of data modeling and simulation of tissue, cell, and
in bioinformatics. genetic behavior; see biomedical computing.
development of biomedical instrumentation Bit depth The number of bits that represent an
and biomedical devices. individual pixel in an image; the more bits, the
more intensities or colors can be represented.
Biomedical informatics The interdisciplinary
field that studies and pursues the effective uses Bit rate The rate of information transfer; a
of biomedical data, information, and knowl- function of the rate at which signals can be
edge for scientific inquiry, problem solving, transmitted and the efficacy with which digi-
and decision making, driven by efforts to tal information is encoded in the signal.
improve human health.
BLAST See: Basic Local Alignment Search
Biomedical Information Science and Tool.
Technology Initiative (BISTI) An initiative
launched by the NIH in 2000 to make opti- Blinding In the context of clinical research,
mal use of computer science, mathematics, blinding refers to the process of obfuscat-
and technology to address problems in biol- ing from the participant and/or investigator
ogy and medicine. It includes a consortium what study intervention a given participant is
of senior-level representatives from each of receiving. This is commonly done to reduce
the NIH institutes and centers plus represen- study biases.
tatives of other Federal agencies concerned
with biocomputing. See: 7 http://www.bisti. Blog A type of Web site that provides discus-
nih.gov. sion or information on specific topics.
Biomedical taxonomy A formal system for Blue Button A feature of the Veteran
naming entities in biomedicine. Administration’s VistA system that exports
an entire patient’s record in electronic form.
Biomolecular imaging A discipline at the
intersection of molecular biology and in vivo BlueTooth A standard for the short-range
imaging, it enables the visualisation of cellular wireless interconnection of mobile phones,
function and the follow-up of the molecular computers, and other electronic devices.
processes in living organisms without perturb-
ing them. Body The portion of a simple electronic mail
message that contains the free-text content of
Biopsychosocial model A model of medi- the message.
cal care that emphasizes not only an under-
standing of disease processes, but also the Body of knowledge An information resource
psychological and social conditions of the that encapsulates the knowledge of a field or
patient that affect both the disease and its discipline.
therapy.
Boolean operators The mathematical opera-
Biosample Biological source material used in tors and, or, and not, which are used to com-
experimental assays. bine index terms in information retrieval
searching.
Biosurveillance A public health activity that
monitors a population for occurrence of a Boolean searching A search method in which
rare disease of increased occurrence of a com- search criteria are logically combined using
mon one. Also see Public Health Surveillance and, or, and not operators.
and Surveillance.
Bootstrap A small set of initial instruc-
Bit The logical atomic element for all digital tions that is stored in read-only memory and
computers. executed each time a computer is turned on.
1024 Glossary
Execution of the bootstrap is called boot- Cadaver An embalmed human body used for
ing the computer. By analogy, the process of teaching anatomy through the process of dis-
starting larger computer systems. secting tissue.
Bottom-up An algorithm for analyzing small Canonical form A preferred string or name
pieces of a problem and building them up into for a term or collection of names; the canoni-
larger components. cal form may be determined by a set of rules
(e.g., “all capital letters with words sorted in
Bound morpheme A morpheme that alphabetical order”) or may be simply chosen
creates a different form of a word but arbitrarily.
must always occur with another morpheme
(e.g., −ed, −s). Capitated payments System of health- care
reimbursement in which providers are paid a
B-pref A method for measuring retrieval per- fixed amount per patient to take care of all the
formance in which documents without rel- health-needs of a population of patients.
evance judgments are excluded.
Capitation Payments to providers, typically
Bridge A device that links or routes signals on an annual basis, in return for which the
from one network to another. clinicians provide all necessary care for the
patient and do not submit additional fee-for-
Broadband A data-transmission technique service bills.
in which multiple signals may be transmit-
ted simultaneously, each modulated within an Cardiac output A measure of blood volume
assigned frequency range. pumped out of the left or right ventricle of the
heart, expressed as liters per minute.
Browsing Scanning a database, a list of files,
or the Internet, either for a particular item or Care coordinator See: Case Manager.
for anything that seems to be of interest.
Care plan A document that provides direction
Bundled payments In the healthcare context, for individualized patient care.
refers to the practice of reimbursing provid-
ers based on the total expected costs of a par- Cascading finite state automata (FSA) A tag-
ticular episode of care. Generally occupies a ging method in natural language processing
“middle ground” between fee-for-service and in which as series of finite state automata are
capitation mechanisms. employed such that the output of one FSA
becomes the input for another.
Business logic layer A conceptual level of
system architecture that insulates the appli- Case Refers to the capitalization of letters in
cations and processing components from the a word.
underlying data and the user interfaces that
access the data. Case manager A person in charge of coordi-
nating all aspects of a patient’s care.
Buttons Graphic elements within a dialog
box or user-selectable areas within an HTML CCD See: Continuity of Care Document.
document that, when activated, perform a
specified function (such as invoking other CCOW See: Clinical Context Object
HTML documents and services). Workgroup.
C statistic The area under an receiver operat- CDC See Centers for Disease Control and
ing characteristic (ROC) curve. Prevention.
CDR See: Clinical data repository. tution using a common set of databases and
interfaces.
CDS Hooks A technical approach designed to
invoke external CDS services from within the Central processing unit (CPU) The “brain” of
EHR workflow based upon a triggering event. the computer. The CPU executes a program
Services may be in the form of (a) information stored in main memory by fetching and exe-
cards – provide text for the user to read; (b) cuting instructions in the program.
suggestion cards – provide a specific sugges-
tion for which the EHR renders a button that Central Test Node (CTN) DICOM software
the user can click to accept, with subsequent to foster cooperative demonstrations by the
population of the change into the EHR user medical imaging vendors.
interface; and (c) app link cards – provide a
link to an app. Certificate Coded authorization information
that can be verified by a certification author-
CDSS See: Clinical decision-support system. ity to grant system access.
Central computer system A single system that CHIN See: Community Health Information
handles all computer applications in an insti- Network.
1026 Glossary
Classroom Technologies All technology used Clinical decision support Any process that
in a classroom setting including projection provides health-care workers and patients
of two-dimensional slides or views of three- with situation-specific knowledge that can
dimensional objects, electronic markup of a inform their decisions regarding health and
screen presentation, real time feedback sys- health care.
tems such as class polling, and digital record-
ing of a class session. Clinical decision-support system (CDSS) A
computer-based system that assists physicians
CLIA certification See: Clinical Laboratory in making decisions about patient care.
Improvement Amendments of 1988
Certification. Clinical Document Architecture An HL7
standard for naming and structuring clinical
Client–server Information processing inter- documents, such as reports.
action that distributes application processing
between a local computer (the client) and a Clinical expert system A computer program
remote computer resource (the server). designed to provide decision support for
diagnosis or therapy planning at a level of
Clinical and translational research A broad sophistication that an expert physician might
spectrum of research activities involving the provide.
translation of findings from initial labora-
tory- based studies into early-stage clinical Clinical guidelines Systematically developed
studies, and subsequently, from the findings statements to assist practitioner and patient
of those studies in clinical and/or population- decisions about appropriate health care for
level practice. This broad area incorporates specific clinical circumstances.
1027
Glossary
Clinical informatics The application of bio- interacts with human subjects. Patient-
medical informatics methods in the patient- oriented research includes: (a) mechanisms
care domain; a combination of computer of human disease; (b) therapeutic interven-
science, information science, and clinical sci- tions; (c) clinical trial; and (d) development
ence designed to assist in the management of new technologies. (2) Epidemiologic and
and processing of clinical data, information, behavioral studies. (3) Outcomes research and
and knowledge to support clinical practice. health services research.
Clinical information system (CIS) The com- Clinical research informatics (CRI) The appli-
ponents of a health-care information system cation of biomedical informatics methods in
designed to support the delivery of patient the clinical research domain to support all
care, including order communications, results aspects of clinical research, from hypothesis
reporting, care planning, and clinical docu- generation, through study design, study exe-
mentation. cution and data collection, data analysis, and
dissemination of results.
Clinical judgment Decision making by cli-
nicians that incorporates professional expe- Clinical Research Management System
rience and social, ethical, psychological, (CRMS) A clinical research management sys-
financial, and other factors in addition to the tem is a technology platform that supports
objective medical data. and enables the conduct of clinical research,
including clinical trials, usually through a
Clinical Laboratory Improvement Amend combination of functional modules target-
ments of 1988 certification Clinical Labo ing the preparatory, enrollment, active, and
ratory Improvement Amendments of 1988, dissemination phases of such research pro-
establishing laboratory testing quality stan- grams. CRMS systems are often also referred
dards to ensure the accuracy, reliability and to as Clinical Trials Management Systems
timeliness of patient test results, regardless of (CTMS), particularly when they are used to
where the test was performed. manage only clinical trials rather than various
types of clinical research.
Clinical modifications A published set of
changes to the International Classification of Clinical subgroup A subset of a population in
Diseases (ICD) that provides additional levels which the members have similar characteris-
of detail necessary for statistical reporting in tics and symptoms, and therefore similar like-
the United States. lihood of disease.
Clinical pathway Disease-specific plan that Clinical trials Research projects that involve
identifies clinical goals, interventions, and the direct management of patients and are
expected outcomes by time period. generally aimed at determining optimal modes
of therapy, evaluation, or other interventions.
Clinical Quality Language An expression lan-
guage standardized by HL7 that is used to Clinical-event monitors Systems that elec-
characterize both quality measure logic and tronically and automatically record the occur-
decision-support logic. rence or changes of specific clinical events,
such as blood pressure, respiratory capability,
Clinical research The range of studies and tri- or heart rhythms.
als in human subjects that fall into the three
sub-categories: (1) Patient-oriented research: Clinically relevant population The population
Research conducted with human subjects of patients that is seen in actual practice. In
(or on material of human origin such as tis- the context of estimating the sensitivity and
sues, specimens and cognitive phenomena) for specificity of a diagnostic test, that group of
which an investigator (or colleague) directly patients in whom the test actually will be used.
1028 Glossary
Collaborative workspace A virtual environ- niques to analyze biological systems. See also
ment in which multiple participants can inter- bioinformatics.
act, synchronously or asynchronously, to
perform a collaborative task. Computed check A procedure applied to
entered data that detects errors based on
Color resolution A measure of the ability to whether values have the correct mathematical
distinguish among different colors (indicated relationship; (e.g., white blood cell differential
in a digital image by the number of bits per counts, reported as percentages, must sum to
pixel). Three sets of multiple bits are required 100.
to specify the intensity of red, green, and blue
components of each pixel color. Computed tomography (CT) An imag-
ing modality in which X rays are projected
Commodity internet A general-purpose con- through the body from multiple angles and
nection to the Internet, not configured for any the resultant absorption values are analyzed
particular purpose. by a computer to produce cross-sectional
slices.
Common Data Elements (CDEs) Standards for
data that stipulate the methods by which the Computer architecture The basic structure of
data are collected and the controlled termi- a computer, including memory organization,
nologies used to represent them. Many stan- a scheme for encoding data and instructions,
dard sets of CDEs have been developed, often and control mechanisms for performing com-
overlapping in nature. puting operations.
Communication Data transmission and infor- Computer memories Store programs and
mation exchange between computers using data that are being used actively by a CPU.
accepted protocols via an exchange medium
such as a telephone line or fiber optic cable. Computer program A set of instructions that
tells a computer which mathematical and logi-
Community Health Information Network cal operations to perform.
(CHIN) A computer network developed for
exchange of sharable health information Computer simulated patient See Virtual
among independent participant organizations patient.
in a geographic area (or community).
Computer-aided diagnosis (CAD) Any form
Comparative effectiveness research A form of diagnosis in which a computer program
of clinical research that compares examines out- helps suggest or rank diagnostic consider-
comes of two or more interventions to determine ations.
if one is statistically superior to another.
Computer-based (or computerized) physician
Compiler A program that translates a pro- order entry (CPOE) A clinical information sys-
gram written in a high-level programming tem that allows physicians and other clinicians
language to a machine-language program, to record patient-specific orders for commu-
which can then be executed. nication to other patient care team members
and to other information systems (such as
Comprehensibility and control Security func- test orders to laboratory systems or medica-
tion that ensures that data owners and data tion orders to pharmacy systems). Sometimes
stewards have effective control over informa- called provider order entry or practitioner
tion confidentiality and access. order entry to emphasize such systems’ uses
by clinicians other than physicians.
Computational biology The science of com-
puter-based mathematical and statistical tech-
1030 Glossary
Computer-based patient records (CPRs) An Conditioned event A chance event, the prob-
early name for electronic health records ability of which is affected by another chance
(EHRs) dating to the early 1990s. event (the conditioning event).
Concept A unit of thought made explicit Conditioning event A chance event that
through the representation of properties of an affects the probability of occurrence of
object or a set of common objects. An abstract another chance event (the conditioned event).
idea generalized from specific instances of
objects that occur in the world. Confidentiality The ability of data owners
and data stewards to control access to or
Conceptual graph A formal notation in which release of private information.
knowledge is represented through explicit
relationships between concepts. Graphs can Consistency check A procedure applied to
be depicted with diagrams consisting of entered data that detects errors based on inter-
shapes and arrows, or in a text format. nal inconsistencies; e.g., recognizing a problem
with the recording of cancer of the prostate as
Conceptual knowledge Knowledge about the diagnosis for a female patient.
concepts.
Constructivism Argues that humans generate
Concordant test results Test results that knowledge and meaning from an interaction
reflect the true patient state (true-positive and between their experiences and their ideas.
true- negative results).
Constructivist Argues that humans generate
Conditional probability The probability of knowledge and meaning from an interaction
an event, contingent on the occurrence of between their experiences and their ideas.
another event.
Consumer health informatics(CHI) Appli
Conditionally independent Two events, A cations of medical informatics technologies
and B, are conditionally independent if the that focus on patients or healthy individuals
occurrence of one does not influence the as the primary users.
probability of the occurrence of the other,
when both events are conditioned on a third Content In information retrieval, media
event C. Thus, p[A | B,C] = p[A | C] and p[B developed to communicate information or
| A,C] = p[B | C]. The conditional probability knowledge.
of two conditionally independent events both
occur- ring is the product of the individual Content based image retrieval Also known as
conditional probabilities: p[A,B | C] = p[A query by image content (QBIC) and content-
| C] × p[B | C]. For example, two tests for a based visual information retrieval (CBVIR)
disease are conditionally independent when is the application of computer vision tech-
the probability of the result of the second test niques to the image retrieval problem, that is,
does not depend on the result of the first test, the problem of searching for digital images in
given the disease state. For the case in which large databases.
disease is present, p[second test positive | first
test positive and disease present] = p[second Context free grammar A mathematical
test positive | first test negative and disease model of a set of strings whose members are
present] = p[second test positive | disease defined as capable of being generated from
present]. More succinctly, the tests are con- a starting symbol, using rules in which a
ditionally independent if the sensitivity and single symbol is expanded into one or more
specificity of one test do not depend on the symbols.
result of the other test (See independent).
1031
Glossary
Contingency table A 2 × 2 table that shows ticipants assigned to the control or compara-
the relative frequencies of true-positive, true- tor arm of a study. Depending on the study
negative, false-positive, and false-negative type, the goal is to generate data as the basis
results. of comparison with the experimental inter-
vention of interest in order to determine the
Continuity of care The coordination of care safety, efficacy, or benefits of an experimen-
received by a patient over time and across tal intervention.
multiple healthcare providers.
Controlled terminology A finite, enumerated
Continuity of Care Document (CCD) An HL7 set of terms intended to convey information
standard that enables specification of the patient unambiguously.
data that relate to one or more encounters with
the healthcare system. The CCD is used for Copyright law Protection of written materials
interchange of patient information (e.g., within and intellectual property from being copied
Health Information Exchanges). The format verbatim.
enables all the electronic information about a
patient to be aggregated within a standardized Coreference chains Provide a compact
data structure that can be parsed and inter- re
presentation for encoding the words and
preted by a variety of information systems. phrases in a text that all refer to the same
entity.
Continuous glucose monitor (CGM) A device
that automatically tracks a diabetic patient’s Coreference resolution In natural language
blood glucose levels throughout the day and processing, the assignment of specific mean-
night using a tiny sensor inserted under the ing to some indirect reference.
skin.
Correctional Telehealth The application of
Continuum of care The full spectrum of telehealth to the care of prison inmates, where
health services provided to patients, including physical delivery of the patient to the practi-
health maintenance, primary care, acute care, tioner is impractical.
critical care, rehabilitation, home care, skilled
nursing care, and hospice care. Covered entities Under the HIPAA Privacy
Rule, a covered entity is an organization or
Contract-management system A computer individual that handles personal health infor-
system used to support managed-care con- mation. Covered entities include providers,
tracting by estimating the costs and payments health plans, and clearinghouses.
associated with potential contract terms and
by comparing actual with expected payments COVID-19 A disease that was identified in late
based on contract terms. 2019 and was declared a global pandemic on
March 11, 2020. COVID-19 became an inter-
Contrast The difference in light intensity national public health emergency, affecting
between dark and light areas of an image. essentially all countries on the planet. It is
characterized by contagion before symptoms,
Contrast resolution A metric for how well an high rate of transmission between human
imaging modality can distinguish small differ- beings, variable severity among affected indi-
ences in signal intensity in different regions of viduals, and relatively high mortality rate.
the image.
CPOE See: Computer-based (or Compu
Control intervention In the context of clini- terized) Physician (or Provider) Order Entry.
cal research, a control intervention repre-
sents the intervention (e.g. placebo, standard CPR (or CPRS) See: Computer-based patient
care, etc.) given to the group of study par- records.
1032 Glossary
CPU See: Central processing unit. grams simultaneously and that allows users to
interact with those programs in standardized
CRI See: Clinical research informatics. ways.
CRMS (or CRDMS) See: Clinical Research Data buses An electronic pathway for trans-
Management System. ferring data—for instance, between a CPU
and memory.
Cryptographic encoding Scheme for pro-
tecting data through authentication and Data capture The process of collecting data
authorization controls based on use of keys to be stored in an information system; it
for encrypting and decrypting information. includes entry by a person using a keyboard
and collection of data from sensors.
CT (or CAT) See: Computed tomography.
Data Encryption Standard (DES) A widely-
Cumulative Index to Nursing and Allied Health used method of for securing encryption that
Literature (CINHL) A non-NLM bibliographic uses a private (secret) key for encryption and
database the covers nursing and allied health requires the same key for decryption (see also,
literature, including physical therapy, occupa- public key cryptography).
tional therapy, laboratory technology, health
education, physician assistants, and medical Data independence The insulation of appli
records. cations programs from changes in data-
storage structures and data-access strategies.
Curly Braces Problem The situation that
arises in Arden Syntax where the code used to Data layer A conceptual level of system archi-
enumerate the variables required by a medi- tecture that isolates the data collected and
cal logic module (MLM) cannot describe stored in the enterprise from the applications
how the variables actually derive their values and user interfaces used to access those data.
from data in the EHR database. Each variable
definition in an MLM has {curly braces} that Data Recording The documentation of infor-
enclose words in natural language that indi- mation for archival or future use through
cate the meaning of the corresponding vari- mechanisms such as handwritten text, draw-
able. The particular database query required ings, machine-generated traces, or photo-
to supply a value for the variable must be graphic images.
specified by the local implementer, however.
The curly braces problem makes it impossible Data science The field of study that uses
for an MLM developed at one institution to analytic, quantitative, and domain exper-
operate at another without local modification. tise for knowledge discovery, typically using
“big data” which could be structured and/or
Cursor A blinking region of a display moni- unstructured.
tor, or a symbol such as an arrow, that indi-
cates the currently active position on the Database A collection of stored data—typi-
screen. cally organized into fields, records, and files—
and an associated description (schema).
Cybersecurity Measures that seeks to protect
against the criminal or unauthorized use of Database management system (DBMS) An
electronic data. integrated set of programs that manages
access to databases.
CYK See: Cocke-Younger-Kasami.
Data-interchange standards Adopted for-
Dashboard A user-interface element that dis- mats and protocols for exchange of data
plays data produced by several computer pro- between independent computer systems.
1033
Glossary
Datum Any single observation of fact. A De-identified aggregate data Data reports
medical datum generally can be regarded as that are summarized or altered slightly in a
the value of a specific parameter (for example, way that makes the discernment of the iden-
red-blood-cell count) for a particular object tity of any of the individuals whose data was
(for example, a patient) at a given point in used for the report impossible or so difficult
time. as to be extremely improbable. The process of
de-identifying aggregate data is known as sta-
DBMIS See: Database Management System. tistical disclosure control.
DCMI See: Dublin Core Metadata Initiative. Delta check A procedure applied to entered
data that detects large and unlikely differences
Debugger A system program that provides between the values of a new result and of the
traces, memory dumps, and other tools to previous observations; e.g., a recorded weight
assist programmers in locating and eliminat- that changes by 100 lb in 2 weeks.
ing errors in their programs.
Demonstration study Study that establishes
Decision analysis A methodology for mak- a relation—which may be associational or cau
ing decisions by identifying alternatives and sal—between a set of measured variables.
assessing them with regard to both the likeli-
hood of possible outcomes and the costs and Dental informatics The application of bio-
benefits of those outcomes. medical informatics methods and techniques
to problems derived from the field of dentistry.
Decision node A symbol that represents a Viewed as a subarea of clinical informatics.
choice among actions. By convention, a deci-
sion node is represented in a decision tree by Deoxyribonucleic acid (DNA) The genetic
a square. material that is the basis for heredity. DNA
is a long polymer chemical made of four
Decision support The process of assisting basic subunits. The sequence in which these
humans in making decisions, such as inter- subunits occur in the polymer distinguishes
preting clinical information or choosing a one DNA molecule from another and in turn
diagnostic or therapeutic action. See: Clinical directs a cell’s production of proteins and all
Decision Support. other basic cellular processes.
Diagnosis The process of analyzing avail- Digital image An image that is stored as a
able data to determine the pathophysiologic grid of numbers, where each picture element
explanation for a patient’s symptoms. (pixel) in the grid represents the intensity, and
possibly color, of a small area.
Diagnosis-based reimbursement Payments
to providers (typically hospitals) based on the Digital Image Communications in Medicine
diagnosis made by a physician at the time of (DICOM) A standard for electronic exchang-
admission. ing digital health images, such as x-rays and
CT scans.
Diagnosis-related group (DRG) One of
almost 500 categories based on major diag- Digital library Organized collections of elec-
nosis, length of stay, secondary diagnosis, tronic content, intended for specific commu-
surgical procedure, age, and types of ser- nities or domains.
vices required. Used to determine the fixed
payment per case that Medicare will reim- Digital object identifier (DOI) A system for
burse hospitals for providing care to elderly providing unique identifiers for published
patients. digital objects, consisting of a prefix that is
1035
Glossary
assigned by the International DOI Foundation Discourse Large portions of text forming a
to the publishing entity and a suffix that is narrative, such as paragraphs and documents.
assigned and maintained by the entity.
Discrete event simulation model A modeling
Digital radiography (DR) The process of pro- approach that assesses interactions between
ducing X-ray images that are stored in digi- people, typically composed of patients
tal form in computer memory, rather than on that have attributes and that experience events.
film.
Discussion board An on-line environment for
Digital signal A signal that takes on discrete exchanging public messages among partici-
values from a specified range of values. pants.
Digital signal processing (DSP) An integrated Discussion lists and messaging boards Online
circuit designed for high-speed data manipu- tools for asynchronous text conversation.
lation and used in audio communications,
image manipulation, and other data acquisi- Disease Any condition in an organism that is
tion and control applications. other than the healthy state.
Digital subscriber line (DSL) A digital tele- Dissemination phase During the dissemina-
phone service that allows high-speed network tion phase of a clinical research study, inves-
communication using conventional (twisted tigators analyze and report upon the data
pair) telephone wiring. generated during the active phase.
Double blind A clinical study methodology in Dynamic A simulation program that mod-
which neither the researchers nor the subjects els changes in patient state over time and in
know to which study group a subject has been response to students’ therapeutic decisions.
assigned.
Dynamic programming A computationally
Double-blinded study In the context of intensive computer-science technique used,
clinical research, a double blinded study is for example, to determine optimal sequence
a study in which both the investigator and alignments in many computational biology
participant are blinded from the assignment applications.
of an intervention. In this scenario, a trusted
third party must maintain records of such Dynamic transmission model A model that
study arm assignments to inform later data divides a population into compartments (for
analyses. example, uninfected, infected, recovered,
dead), and for transitions between compart-
Draft standard for trial use A proposal for ments are governed by differential or differ-
a standard developed by HL7 that is suffi- ence equations.
ciently well defined that early adopters can
use the specification in the development of Dynamical systems models Models that
HIT. Ultimately, the draft standard may be describe and predict the interactions over time
refined and put to a ballot for endorsement by between multiple components of a phenom-
the members of the organization, thus creat- enon that are viewed as a system. Dynamical
ing an official standard. systems models are often used to construct
“controllers,” algorithms that adjust function-
DRG See Diagnosis-Related Groups. ing of the system (an airplane, artificial pan-
creas, etc.) to maximize a set of optimization
Drug repurposing Identifying existing drugs criteria.
that may be useful for indications other
than those for which they were initially Earley parsing A dynamic programming
approved. method for parsing context-free grammar.
DSP See: Digital signal processing. eCRF See: Electronic Case Report Form.
Dublin Core Metadata Initiative (DCMI) A EDC See: Electronic Data Capture.
standard metadata model for indexing pub-
lished documents. EDI See: Electronic data interchange.
1037
Glossary
Electronic Case Report Form (eCRF) A com- EMBASE A commercial biomedical and phar-
putational representation of paper case report macological database from ExcerptaMedica,
forms (CRFs) used to enable EDC. which provides information about medical
and drug-related subjects.
Electronic Data Capture (EDC) EDC is the
process of capturing study-related data ele- Emergent design Study where the design or
ments via computational mechanisms. plan of research can and does change as the
study progresses. Characteristic of subjectiv-
Electronic Data interchange (EDI) Electronic ist studies.
exchange of standard data transactions, such
as claims submission and electronic funds Emotion detection A natural language tech-
transfer. nique for determining the mental state of the
author of a text document.
Electronic Health Record (EHR) A repository
of electronically maintained information EMPI See: Enterprise master patient index.
about an individual’s lifetime health status
and health care, stored such that it can serve EMR (or EMRS) See: Electronic Medical
the multiple legitimate users of the record. See Record.
also EMR and CPR.
EMTREE A hierarchically structured, con-
Electronic health record system An elec- trolled vocabulary used for subject indexing,
tronic health record and the tools used used to index EMBASE.
to manage the information; also referred
to as a computer-based patient-record Enabling technology Any technology that
system and often shortened to electronic improves organizational processes through
health record. its use rather than on its own. Computers,
for example, are useless unless “enabled” by
Electronic Medical Record (EMR) The elec- operation systems and applications or imple-
tronic record documenting a patient’s care in mented in support of work flows that might
a provider organization such as a hospital or not otherwise be possible.
a physician’s office. Often used interchange-
ably with Electronic Health Record (EHR), Encryption The process of transforming
although EHRs refer more typically to an information such that its meaning is hidden,
individual’s lifetime health status and care with the intent of keeping it secret, such that
rather than the set of particular organiza- only those who know how to decrypt it can
tionally- based experiences. read it; see decryption.
1038 Glossary
Enterprise information warehouse (EIW) A Ethernet A network standard that uses a bus
data base in which data from clinical, finan- or star topology and regulates communica-
cial and other operational sources are col- tion traffic using the Carrier Sense Multiple
lected in order to be compared and contrasted Access with Collision Detection (CSMA/CD)
across the enterprise. approach.
Exome The entire sequence of all genes Extrinsic evaluation An evaluation of a com-
within a genome, approximately 1–3% of the ponent of a system based on an evaluation of
entire genome. the performance of the entire system.
Expected value The value that is expected on F measure A measure of overall accuracy
average for a specified chance event or deci- that is a combination of precision and recall.
sion.
Factual knowledge Knowledge of facts with-
Experimental intervention In the context of out necessarily having any in-depth under-
clinical research, an experimental interven- standing of their origin or implications.
tion represents the treatment or other inter-
vention delivered to a participant assigned to False negative A negative result that occurs
the experimental arm of the study in order to in a true situation. Examples include a desired
determine the safety, efficacy, or benefits of entity that is missed by a search routine or a
that intervention. test result that appears normal when it should
be abnormal.
Experimental science Systematic study
characterized by posing hypotheses, design- False positive A positive result that occurs in
ing experiments, performing analyses, and a false situation. Examples include an inap-
interpreting results to validate or disprove propriate entity that is returned by a search
hypotheses and to suggest new hypotheses routine or a test result that appears abnormal
for study. when it should be normal.
Extensible markup language (XML) A sub- False-negative rate (FNR) The probability of a
set of the Standard Generalized Markup negative result, given that the condition under
Language (SGML) from the World Wide consideration is true—for example, the proba-
Web Consortium (W3C), designed espe- bility of a negative test result in a patient who
cially for Web documents. It allows design- has the disease under consideration.
ers to create their own custom-tailored tags,
enabling the definition, transmission, vali- False-negative result (FN) A negative result
dation, and interpretation of data between when the condition under consideration is
applications and between organizations. true—for example, a negative test result in a
patient who has the disease under consider-
External router A computer that resides on ation.
multiple networks and that can forward and
translate message packets sent from a local False-positive rate (FPR) The probability of
or enterprise network to a regional network a positive result, given that the condition
beyond the bounds of the organization. under consideration is false—for example,
the probability of a positive test result in a
External validity In the context of clinical patient who does not have the disease under
research, external validity refers to the abil- consideration.
1040 Glossary
False-positive result (FP) A positive result Field user effect study A study of the actual
when the condition under consideration is actions or decisions of the users of the
false—for example, a positive test result in a resource.
patient who does not have the disease under
consideration. File In a database, a collection of similar
records.
Fast Healthcare Interoperability Resource
(FHIR) An HL7 standard for information File format Representation of data within
exchange using a well-defined and limited set a file; can refer to the method for individual
of resources. characters and values (for example, ASCII or
binary) or their organization within the file
FDDI See: Fiber Distributed Data Interface. (for example, XML or text).
Gene expression microarray Study the expres- Genomics database An organized collec-
sion of large numbers of genes with one tion of information from gene sequencing,
another and create multiple variations on a protein characterization, and other genomic
genetic theme to explore the implications of research.
changes in genome function on human disease.
Genotype The genetic makeup, as distin-
Gene Expression Omnibus (GEO) A central- guished from the physical appearance, of an
ized database of gene expression microarray organism or a group of organisms.
datasets.
Genotypic Refers to the genetic makeup of
Gene Ontology(GO) A structured controlled an organism.
vocabulary used for annotating genes and pro-
teins with molecular function. The vocabulary GEO See: Gene Expression Omnibus.
contains three distinct ontologies, Molecular
Function, Biological Process and Cellular Geographic Information System (GIS) A sys-
Component. tem designed to capture, store, manipulate,
analyze, manage, and visually present all types
Genes Units encoded in DNA and they of location-specific data.
are transcribed into ribonucleic acid (RNA).
Geographic Information System (GIS) A sys-
Genetic data An overarching term used to tem designed to capture, store, manipulate,
label various collections of facts about the analyze, manage, and visually present all types
genomes of individuals, groups or species. of location-specific data.
Genetic risk score (GRS) A calculation of the Gigabits per second (Gbps) A common unit
likelihood of a particular phenotype being of measure for data transmission over high-
present based on a weighed score of one or speed networks.
more genetic variants; also referred to as a
polygenic risk score (PRS). Gigabyte 230 or 1,073,741,824 bytes.
Genomics The study of all of the nucleo- Google A commercial search engine that
tide sequences, including structural genes, provides free searching of documents on the
regulatory sequences, and noncoding DNA World Wide Web.
1043
Glossary
GPS A system for calculating precise geo- the user to provide a tactile sensation as part
graphical location by triangulating informa- of a simulated activity.
tion obtained from satellites and/or cell towers.
Haptic sensation The sensation of touch or
GPU See: Graphics processing unit. feel. It can be applied to a simulation of such
sensation as presented within a virtual or aug-
Grammar A mathematical model of a poten- mented reality scenario.
tially infinite set of strings.
Hard disk A magnetic disk used for data
Graph In computer science, a set of nodes or storage and typically fixed in the disk-drive
circles connected by a set of edges or lines. unit.
Graphical user interface (GUI) A type of envi- Hardware The physical equipment of a com-
ronment that represents programs, files, and puter system, including the central processing
options by means of icons, menus, and dialog unit, memory, data-storage devices, worksta-
boxes on the screen. tions, terminals, and printers.
Gray scale A scheme for representing inten- HCI See: Human-computer interaction.
sity in a black-and-white image. Multiple bits
per pixel are used to represent intermediate HCO See: Healthcare organization.
levels of gray.
Head word The key word in a multi- word
Guardian Angel Proposal A proposed struc- phrase that conveys the central meaning of
ture for a lifetime, patient-centered health the phrase. For example, a phrase containing
information system. adjectives and a noun, the noun is typically
the head word.
GUI See: Graphical user interface.
Header (of email) The portion of a simple
Guidance In a computer-based education electronic mail message that contains informa-
program, proactive feedback, help facilities, tion about the date and time of the message,
and other tools designed to assist a student in the address of the sender, the addresses of
learning the covered material. the recipients, the subject, and other optional
information.
Guideline Element Model (GEM) An XML
specification for marking up textual docu- Health Evaluation and Logical Processing
ments that describe clinical practice guide- [HELP] On of the first electronic health record
lines. The guideline-related XML tags make it systems, developed at LDS Hospital in Sal
possible for information systems to determine Lake City, Utah. Still in use today, it was
the nature of the text that has been marked up innovative for its introduction of automated
and its role in the guideline specification. alerts.
Health information and communication tech- standards for electronic healthcare transac-
nology (HICT) The broad spectrum of hard- tions and national identifiers for providers,
ware and software used to capture, store and health plans, and employers. It also addresses
transmit health information. the security and privacy of health data.
Health Information exchange (HIE) The Health Level Seven (HL7) An ad hoc stan-
process of moving health information elec- dards group formed to develop standards for
tronically among disparate health care orga- exchange of health care data between inde-
nizations for clinical care and other purposes; pendent computer applications; more specifi-
or an organization that is dedicated to provid- cally, the health care data messaging standard
ing health information exchange. developed and adopted by the HL7 standards
group.
Health Information Infrastructure (HII) The
set of public and private resources, including Health literacy A constellation of skills,
networks, databases, and policies, for collect- including the ability to perform basic reading,
ing, storing, and transmitting health informa- math, and everyday health tasks like compre-
tion. hending prescription bottles and appointment
slips, required to function in the health care
Health Information Technology (HIT) These of environment.
computers and communications tech nology
in healthcare and public health settings. Health Maintenance Organization (HMO) A
group practice or affiliation of independent
Health Information Technology for Economic practitioners that contracts with patients to
and Clinical Health (HITECH) Also referred pro- vide comprehensive health care for a
to as HITECH Act. Passed by the Congress fixed periodic payment specified in advance.
as Title IV of the American Recovery and
Reinvestment Act of 2009 (ARRA) in 2009, Health on the Net[HON] A private organiza-
established four major goals that promote tion establishing ethical standards for health
the use of health information technology: information published on the World Wide
(1) Develop standards for the nationwide Web.
electronic exchange and use of health infor-
mation; (2) Invest $20B in incentives to Health Record Bank (HRB) An independent
encourage doctors and hospitals to use HIT organization that provides a secure elec-
to electronically exchange patients’ health tronic repository for storing and maintaining
information; (3) Generate $10B in savings an individual’s lifetime health and medical
through improvements in quality of care records from multiple sources and assuring
and care coordination, and reductions in that the individual always has complete con-
medical errors and duplicative care and (4) trol over who accesses their information.
Strengthen Federal privacy and security law
to protect identifiable health information Healthcare Effectiveness Data and Information
from misuse. Also codified the Office of the Set (HEDIS) Employers and individuals use
National Coordinator for Health Information HEDIS to measure the quality of health
Technology (ONC) within the Department of plans. HEDIS measures how well health plans
Health and Human Services. give service to their members. HEDIS is one
of health care’s most widely used performance
Health Insurance Portability and Accoun improvement tools. It is developed and main-
tability Act (HIPAA) A law enacted in 1996 to tained by the National Committee for Quality
protect health insurance coverage for workers Assurance.
and their families when they change or lose
their jobs. An “administrative simplification” Healthcare organization (HCO) Any
provision requires the Department of Health healthrelated organization that is involved in
and Human Services to establish national direct patient care.
1045
Glossary
Healthcare team A coordinated group of parts (e.g., sub-sub-tasks). The tasks are orga-
health professionals including physicians, nized according to specific goals.
nurses, case managers, dieticians, pharma-
cists, therapists, and other practitioners who High-bandwidth An information channel
collaborate in caring for a patient. that is capable of carrying delivering data at a
relatively high rate.
HEDIS See: Healthcare Effectiveness Data
and Information Set. Higher-level process A complex process com-
prising multiple lower-level processes.
HELP See Health Evaluation and Logical
Processing. HII See: Health Information Infrastructure.
HELP sector A decision rule encoded in the HII See: Health Information Infrastructure.
HELP system, a clinical information system
that was developed by researchers at LDS Hindsight bias The tendency to over-estimate
Hospital in Salt Lake City. the prior predictability of an event, once the
events has already taken place. For example,
Helper (plug- in) An application that are if event A occurs before event B, there may be
launched by a Web browser when the browser an assumption that A predicted B.
downloads a file that the browser is not able
to process itself. HIPAA See: Health Insurance Portability and
Accountability Act.
Heuristic A mental “trick” or rule of thumb;
a cognitive process used in learning or prob- HIPAA See: Health Insurance Portability and
lem solving. Accountability Act.
Heuristic evaluation (HE) A usability inspec- HIS See: Hospital information system.
tion method, in which the system is evaluated
on the basis of a small set of well-tested design Historical control In the context of clinical
principles such as visibility of system status, research, historical controls are subjects who
user control and freedom, consistency and represent the targeted population of interest
standards, flexibility and efficiency of use. for a study. Typically, their data are derived
from existing resources in a retrospective
HICT See: Health information and commu- manner and that represent targeted outcomes
nication technology. From standard of care in a non-interventional state (often resulting
practices), so as to provide the basis for com- among humans and other elements of a sys-
parison to data sets derived from participants tem, and the profession that applies theory,
who have received an experimental interven- principles, data, and other methods to design
tion under study. in order to optimize human well-being and
overall system performance.
HIE See: Health Information Exchange.
Historically controlled study See: before-after
HIE See: Health Information Exchange. study.
Economic and Clinical Health Act, passed by mine the complete sequence of human deoxy-
the Congress in 2009, which authorized finan- ribonucleic acid (DNA), as it is encoded in
cial incentives to be paid to eligible physicians each of the 23 chromosomes.
and hospitals for the adoption of “meaning-
ful use” of EHRs in the United States. The Human immunodeficiency virus (HIV) A ret-
law also called for the certification of EHR rovirus that invades and inactivates helper T
technology and for educational programs to cells of the immune system and is a cause of
enhance its dissemination and adoption. AIDS and AIDS-related complex.
ICMP See: Internet Control Message Protocol. Image metadata Data about images, such as
the type of image (e.g., modality), patient that
Icon In a graphical interface, a pictorial rep- was imaged, date of imaging, image features
resentation of an object or function. (quantitative or qualitative), and other infor-
mation pertaining to the image and its con-
ICT See: Information and communications tents.
technology.
Image processing The transformation of
IDF See: Inverse document frequency. one or more input images, either into one
or more output images, or into an abstract
IDN See: Integrated delivery network. representation of the contents of the input
images.
Image acquisition The process of generat-
ing images from the modality and converting Image quantitation The process of extracting
them to digital form if they are not intrinsi- useful numerical parameters or deriving cal-
cally digital. culations from the image or from ROIs in the
image.
Image compression A mathematical pro-
cess for removing redundant or relatively Image reasoning Computerized methods
unimportant information from an electronic that use images to formulate conclusions or
image such that the resulting file appears the answer questions that require knowledge and
same (lossless compression) or similar (lossy logical inference.
compression) when compared to the origi-
nal. Image rendering/visualization A variety
of techniques for creating image displays,
Image content representation Makes the diagrams, or animations to display images
infor-mation in images accessible to machines more in a different perspective from the raw
for processing. images.
Inspection method Class of usability evalu- Interactome The set of all molecular interac-
ation methods in which experts appraising a tions in a cell.
system, playing the role of a user to identify
potential usability and interaction issues with Interface engine Software that mediates
a system. the exchange of information among two or
more systems. Typically, each system must
Institute of Medicine The health arm of the know how to communicate with the interface
National Academy of Sciences, which pro- engine, but not need to know the information
vides unbiases, authoritative advice to deci- format of the other systems.
sion makers and the public. Renamed the
National Academy of Medicine in 2016. Intermediate effect process of continu-
ally learning, re-learning, and exercising new
Institutional Review Board (IRB) A commit- knowledge, punctuated by periods of appar-
tee responsible for reviewing an institution’s ent decrease in mastery and declines in perfor-
research projects involving human subjects in mance, which may be necessary for learning
order to protect their safety, rights, and wel- to take place. People at intermediate levels of
fare. expertise may perform more poorly than those
at lower level of expertise on some tasks, due to
Integrated circuit A circuit of transistors, the challenges of assimilating new knowledge or
resistors, and capacitors constructed on a skills over the course of the learning process.
single chip and interconnected to perform a
specific function. Internal validity In the context of clinical
research, internal validity refers to the mini-
Integrated delivery network (IDN) A large mization of potential biases during the design
conglomerate health-care organization devel- and execution of the trial.
oped to provide and manage comprehensive
health-care services. International Classification of Diseases, 9th
Edition, Clinical Modifications A US exten-
Integrated Service Digital Network (ISDN) A sion of the World Health Organization’s
digital telephone service that allows high- International Classification of Diseases, 9th
speed network communications using conven- Edition.
tional (twisted pair) telephone wiring.
International Medical Informatics Association
Integrative model Model for understanding (IMIA) An international organization dedicated
a phenomenon that draws from multiple dis- to advancing biomedical and health informat-
ciplines and is not necessarily based on first ics; an “organization of organizations”, it’s
principles. members are national informatics societies and
organizations, such as AMIA.
Intellectual property Software programs,
knowledge bases, Internet pages, and other International Organization for Standards
creative assets that require protection against (ISO) The international body for information
copying and other unauthorized use. and other standards.
Internet Control Message Protocol (ICMP) A user; (B) allows for complete access, exchange,
network-level Internet protocol that provides and use of all electronically accessible health
error correction and other information rel- information for authorized use under appli-
evant to processing data packets. cable State or Federal law; and (C) does not
constitute information blocking.
Internet Corporation for Assigned Names and
Numbers (ICANN) The organization respon- Interpreter A program that converts each
sible for managing Internet domain name and statement in a high-level program to a
IP address assignments. machine- language representation and then
executes the binary instruction(s).
Internet of Things (IoT) A system of intercon-
nected computing devices that can transfer Interventional radiology A subspecialty of
data and be controlled over a network. In the radiology that uses imaging to guide invasive
consumer space, IoT technologies are most diagnostic or therapeutic procedures.
commonly found in the built environment
where devices and appliances (such as lighting Intrinsic evaluation An evaluation of a com-
fixtures, security systems or thermostats) can ponent of a system that focuses only on the
be controlled via smartphones or smart speak- performance of the component. See also
ers, creating “smart” homes or offices. Extrinsic Evaluation.
Interoperability The 21st Century Cures Act ISG See: Internet Support Group.
defines interoperability as health informa-
tion technology that—(A) enables the secure ISO See: InternationalOrganization for Stan
exchange of electronic health information dards.
with, and use of electronic health informa-
tion from, other health information technol- Iso-semantic mapping A relationship bet
ogy without special effort on the part of the ween an entity in one dataset or model and an
1052 Glossary
entity in another dataset or model where the Keyboard A data-input device used to enter
meaning of the two entities is identical, even alphanumeric characters through typing.
if the syntax or lexical form is different.
Keyword A word or phrase that conveys spe-
ISP See: Internet service provider. cial meaning or to refer to information that is
relevant to such a meaning (as in an index).
Job A set of tasks submitted by a user for
processing by a computer system. Kilobyte 210 or 1024 bytes.
Key field A field in the record of a file that KPI See: Key Performance Indicator.
uniquely identifies the record within the file.
Laboratory function study Study that
Key Performance Indicator (KPI) A metric explores important properties of an informa-
defined to be an important factor in the suc- tion resource in isolation from the clinical set-
cess of an organization. Typically, several Key ting.
Performance indicators are displayed on a
Dashboard.
1053
Glossary
Laser printer Output device that uses an LED See: Light-emitting diode.
electromechanically controlled laser beam to
generate an image on a xerographic surface, Lexemes A minimal lexical unit in a language
which then is used to produce paper copies. that represents different forms of the same
word.
Latency The time required for a signal to
travel between two points in a network. Lexical-statistical retrieval Retrieval
based
on a combination of word matching and rel-
Latent failures Enduring systemic problems evance ranking.
that make errors possible but are less visible
or not evident for some time. Lexicon A catalogue of the words in a lan-
guage, usually containing syntactic informa-
Law of proximity Principle from Gestalt psy- tion such as parts of speech, pluralization
chology that states that visual entities that are rules, etc.
close together are perceptually grouped.
Light-emitting diode (LED) A semiconductor
Law of symmetry Principle from Gestalt psy- device that emits a particular frequency of
chology that states that symmetric objects are light when a current is passed through it; typi-
more readily perceived. cally used for indicator lights and computer
screens because low power requirement, mini-
LCD See: Liquid crystal display. mal heat generated, and durability.
Lean A management strategy that focuses Likelihood ratio (LR) A measure of the dis-
only on those process that are able to contrib- criminatory power of a test. The LR is the
ute specific and measurable value for the end ratio of the probability of a result when
customer. The LEAN concept originated with the condition under consideration is true to
Toyota’s focus on efficient manufacturing pro- the probability of a result when the condition
cesses. under consideration is false (for example, the
probability of a result in a diseased patient to
Learning Content Management System A the probability of a result in a non-diseased
software platform that allows educational patient). The LR for a positive test is the ratio
content creators to host, manage, and track of true-positive rate (TPR) to false-positive
changes in content. rate (FPR).
Learning health system A proposed model Link-based An indexing approach that gives
for health care in which outcomes from past relevance weight to web pages based on how
and current patient care provide are system- often they are cited by other pages.
atically collected, analyzed and then fed back
into decision making about best practices for Linux An open source operating system based
future patient care. on principles of Unix and first developed by
Linus Torvalds in 1991.
Learning healthcare system See: Learning
health system. Liquid crystal display (LCD) A display technol-
ogy that uses rod-shaped molecules to bend
1054 Glossary
light and alter contrast and viewing angle to to store data while still allowing for the re- cre-
produce images. ation of the original data.
Logical Observations, Identifiers, Names and Machine code The set of primitive instruc-
Codes (LOINC) A controlled terminology tions to a computer represented in binary
created for providing coded terms for obser- code (machine language).
vational procedures. Originally focused on
laboratory tests, it has expanded to include Machine language The set of primitive
many other diagnostic procedures. instructions represented in binary code
(machine code).
Logical positivist A philosophical position
that holds that there is a single truth that can Machine learning A computing technique in
be inferred from the right combination of which information learned from data is used
studies. to improve system performance.
Magnetic tape A long ribbon of material that Markov cycle The period of time specified
can accept and store magnetic charge. Data for a transition probability within a Markov
are encoded on magnetic tape as sequences of model.
charges along longitudinal tracks.
Markov model A mathematical model of a set
Magnetoencephalography (MEG) A method of strings in which the probability of a given
for measuring the electromagnetic fields gen- symbol occurring depends on the identity of
erated by the electrical activity of the neurons the immediately preceding symbol or the two
using a large arrays of scalp sensors, the out- immediately preceding symbols. Processes
put of which are processed in a similar way to modeled in this way are often called Markov
CT in order to localize the source of the elec- processes.
tromagnetic and metabolic shifts occurring in
the brain during trauma. Markov process A mathematical model of
a set of strings in which the probability of a
Mailing list A set of mailing addresses used given symbol occurring depends on the iden-
for bulk distribution of electronic or physical tity of the immediately preceding symbol or
mail. the two immediately preceding symbols.
Services (CMS) Incentive Programs that informatics is now viewed as the subfield of
governs the use of electronic health records clinical informatics that deals with the manage-
and allows eligible providers and hospitals ment of disease and the role of physicians.
to earn incentive payments by meeting spe-
cific criteria. The term refers to the belief Medical Information Bus (MIB) A data-
that health care providers using electronic communication system that supports data
health records in a meaningful, or effec- acquisition from a variety of independent
tive, way will be able to improve health care devices.
quality and efficiency.
Medical information science The field of
Measurement study Study to determine the study concerned with issues related to the
extent and nature of the errors with which a management and use of biomedical informa-
measurement is made using a specific instru- tion (see also biomedical informatics).
ment (cf. Demonstration study).
Medical Literature Analysis and Retrieval
Measures of concordance Measures of agree- System (MEDLARS) The initial electronic
ment in test performance: the true-positive version of Index Medicus developed by the
and true-negative rates. National Library of Medicine.
Medical computing The application of meth- Medical Subject Headings (MeSH) Some
ods of computing to medical topics (see medi- 18,000 terms used to identify the subject con-
cal computer science). tent of the biomedical literature. The National
Library of Medicine’s MeSH vocabulary has
Medical entities dictionary (MED) A com- emerged as the de facto standard for biomedi-
pendium of terms found in electronic medi- cal indexing.
cal record systems. Among the best known
MEDs is that developed and maintained by Medication A substance used for medical
the Columbia University Irving Medical treatment, typically a medicine or drug.
Center and Columbia University. Contains in
excess of 100,000 terms. MEDLARS Online (MEDLINE) The National
Library of Medicine’s electronic catalog of
Medical errors Errors or mistakes, committed the biomedical literature, which includes
by health professionals, that hold the poten- information abstracted from journal articles,
tial to result in harm to the patient. including author names, article title, journal
source, publication date, abstract, and medi-
Medical home A primary care practice that cal subject headings.
will maintain a comprehensive problem list to
make fully informed decisions in coordinating Medline Plus An online resource from the
their care. National Library of Medicine that con-
tains health topics, drug information,
Medical informatics An earlier term for the medical dictionaries, directories, and other
biomedical informatics discipline, medical resources, organized for use by health care
consumers.
1057
Glossary
Megabits per second (Mbps) A common unit Mental representations Internal cognitive
of measure for specifying a rate of data trans- states that have a certain correspondence with
mission. the external world.
Mosaic The first graphical web browser cred- rather than long network addresses, avoiding
ited with popularizing the World Wide Web complex lookups in a routing table.
and developed at the National Center for
Supercomputing Applications (NCSA) at the Multiuser system A computer system
University of Illinois. that shares its resources among multiple
simultaneous users.
Motion artifact Visual interference caused by
the difference between the frame rate of an Mutually exclusive State in which one, and
imaging device and the motion of the object only one, of the possible conditions is true;
being imaged. for example, either A or not A is true, and one
of the statements is false. When using Bayes’
Mouse A small boxlike device that is moved theorem to perform medical diagnosis, we
on a flat surface to position a cursor on the generally assume that diseases are mutually
screen of a display monitor. A user can select exclusive, meaning that the patient has exactly
and mark data for entry by depressing buttons one of the diseases under consideration and
on the mouse. not more.
Needs assessment A study carried out to help Network stack The method within a single
understand the users, their context and their machine by which the responsibilities for net-
needs and skills, to inform the design of the work communications are divided into differ-
information resource. ent levels, with clear interfaces between the
levels, thereby making network software more
Negative dictionary A list of stop words used modular.
in information retrieval.
Neuroinformatics An emerging subarea of
Negative predictive value (PV–) The probabil- applied biomedical informatics in which the
ity that the condition of interest is absent if discipline’s methods are applied to the man-
the result is negative—for example, the prob- agement of neurological data sets and the
ability that specific a disease is absent given a modeling of neural structures and function.
negative test result.
Next Generation Internet Initiative A feder-
Negligence theory A concept from tort law ally funded research program in the late 1990s
that states that providers of goods and ser- and early in the current decade that sought
vices are expected to uphold the standards of to provide technical enhancements to the
the community, thereby facing claims of neg- Internet to support future applications that
ligence if individuals are harmed by substan- currently are infeasible or are incapable of
dard goods or services. scaling for routine use.
1062 Glossary
Omics A set of areas of study in biology that Open standards development policy In stan-
use the suffix “-ome”, used to connote breadth dards group, a policy that allows anyone to
or completeness of the objects being studied, become involved in discussing and defining
for example genomics or proteomics. the standard.
On line transaction processing (OLTP) A sys- Optical Character Recognition (OCR) The con-
tem designed for use by thousands of simulta- version of typed text within scanned docu-
neous users doing repetitive queries. ments to computer understandable text.
Optical disk A round, flat plate of plastic or Page A partitioned component of a com-
metal that is used to store information. Data puter users’ programs and data that can be
are encoded through the use of a laser that kept in temporary storage and brought into
marks the surface of the disc. main memory by the operating system as
needed.
Order entry The use of a computer system for
entering treatments, requests for lab tests or Pager One of the first mobile devices for elec-
radiologic studies, or other interventions that tronic communication between a base station
the attending clinician wishes to have per- (typically a telephone, but later a computer)
formed for the benefit of a patient. and an individual person. Initially restricted
to receiving only numeric data (e.g., a tele-
Orienting issues/questions The initial ques- phone number), pagers later incorporated the
tions or issues that evaluators seek to answer ability to transmit a response (referred to as
in a subjectivist study, the answers to which “two way pagers”) as well as alpha characters
often in turn prompt further questions. so that a message of limited length could be
transmitted from a small keyboard. Pagers
Outcome data Formal information regarding have been gradually replaced by cellular
the results of interventions. phones because of their greater flexibility and
broader geographical coverage.
Outcome measurements Using metrics that
assess the end result of an intervention rather PageRank (PR) algorithm In indexing for
than an intervening process. For example, information retrieval on the Internet, an algo-
remembering to check a patient’s Hemoglobin rithmic scheme for giving more weight to a
A1C is a process measure, whereas reducing Web page when a large number of other pages
the complications of diabetes is an outcome link to it.
measure.
Parallel processing The use of multiple pro-
Outcome variable Similar to “dependent cessing units running in parallel to solve a
variable,” a variable that captures the end single problem (see multiprocessing).
result of a health care or educational process;
for example, long-term operative complica- Parse tree The representation of structural
tion rate or mastery of a subject area. relationships that results when using a gram-
mar (usually context free) to analyze a given
Outpatient A patient seen in a clinic rather sentence.
than in the hospital setting.
Partial parsing The analysis of structural rela-
Output The results produced when a process tionships that results when using a grammar
is applied to input. Some forms of output are to analyze a segment of a given sentence.
hardcopy documents, images displayed on
video display terminals, and calculated values Partial-match searching An approach
of variables. to information retrieval that recognizes
the inexact nature of both indexing and
P4 medicine P4 medicine: a term coined by retrieval, and attempts to return the user
Dr. Leroy Hood for healthcare that strives content ranked by how close it comes to the
to be personalized, predictive, preventive and user’s query.
participatory.
Participant calendaring Participant calendar-
Packets In networking, a variable-length ing refers to the capability of a CRMS to sup-
message containing data plus the network port the tracking of participant compliance with
addresses of the sending and receiving nodes, a study schema, usually represented as a calen-
and other control information. dar of temporal events.
1065
Glossary
Participant screening and registration partic- Patient portal An online application that
ipant screening and registration refers to the allows individuals to view health information
capability of a CTMS to support the enroll- and otherwise interact with their physicians
ment phase of a clinical study. and hospitals.
Participants The people or organizations who Patient record The collection of informa-
provide data for the study. According to the tion traditionally kept by a health care pro-
role of the information resource, these may vider or organization about an individual’s
include patients, friends and family, formal health status and health care; also referred
and informal carers, the general public, health to as the patient’s chart, medical record, or
professionals, system developers, guideline health record, and originally called the “unit
developers, students, health service managers, record”.
etc.
Patient safety The reduction in the risk of
Part-of-speech tags Assignment of syntac- unnecessary harm associated with health care
tic classes to a given sequence of words, e.g., to an acceptable minimum; also the name of a
determiner, adjective, noun and verb. movement and specific research area.
Parts of speech The categories to which words Patient triage The process of allocating
in a sentence are assigned in accordance with patients to different levels or urgency of care
their syntactic function. depending upon the complaints or symptoms
displayed.
Patent A specific legal approach for protect-
ing methods used in implementing or instanti- Patient-specific information Information
ating ideas (see intellectual property). derived and organized from a specific patient.
Personal Internetworked Notary and Guardian Phase I (clinical trial) Investigators evaluate
(PING) An early personally controlled health a novel therapy in a small group of partici-
record, later known as Indivo. pants in order to assess overall safety. This
safety assessment includes dosing levels in
Personalized medicine Also often call indi- the case of non-interventional therapeutic
vidualized medicine, refers to a medical model trials, and potential side effects or adverse
in which decisions are custom-tailored to the effects of the therapy. Often, Phase I trials of
patient based on that individual’s genomic non-interventional therapies involve the use
data, preferences, or other considerations. of normal volunteers who do not have the
Such decisions may involve diagnosis, treat- disease state targeted by the novel therapy.
1067
Glossary
Phase II (clinical trial) Investigators evaluate a Phenotype definition The process of deter-
novel therapy in a larger group of participants mining the set of observable descriptors that
in order to assess the efficacy of the treatment characterize an organism’s phenotype.
in the targeted disease state. During this phase,
assessment of overall safety is continued. Phenotype risk score (PheRS) A calculation
of the likelihood of a particular genetic vari-
Phase III (clinical trial) Investigators evaluate ant being present based on a weighed score of
a novel therapy in an even larger group of one or more phenotypic characteristics.
participants and compare its performance to
a reference standard which is usually the cur- Phenotypic Refers to the physical character-
rent standard of care for the targeted disease istics or appearance of an organism.
state. This phase typically employs a random-
ized controlled design, and often a multi-cen- Picture Archive and Communication Systems
ter RCT given the numbers of variation of (PACS) An integrated computer system that
subjects that must be recruited to adequately acquires, stores, retrieves, and displays digital
test the hypothesis. In general, this is the final images.
study phase to be performed before seeking
regulatory approval for the novel therapy Pixel One of the small picture elements that
and broader use in standard- of-care environ- makes up a digital image. The number of pix-
ments. els per square inch determines the spatial res-
olution. Pixels can be associated with a single
Phase IV (clinical trial) Investigators study the bit to indicate black and white or with mul-
performance and safety of a novel therapy tiple bits to indicate color or gray scale.
after it has been approved and marketed. This
type of study is performed in order to detect Placebo In the context of clinical research,
long-term outcomes and effects of the ther- a placebo is a false intervention (e.g., a mock
apy. It is often called “post-market surveil- intervention given to a participant that resem-
lance” and is, in fact, not an RCT at all, but a bles the intervention experienced by individu-
less formal, observational study. als receiving the experimental intervention,
except that it has no anticipated impact on the
PheKB.org A web site that houses EHR-based individual’s health or other indicated status),
algorithms for determining phenotypes. usually used in the context of a control group
or intervention.
Phenome characterization Identification of
the individual traits of an organism that char- Plain old telephone service (POTS) The stan-
acterize its phenotype. dard low speed, analog telephone service
that is still used by many homes and busi-
Phenome-wide association scan A study that nesses.
derives case and controls populations using
the EMR to define clinical phenotypes and Plastination A method of embalming a part
then examines the association of those phe- of a human body using plastic to suffuse
notypes with specific genotypes. human tissue.
Predatory journal A name given to journals population. Prevalence is the prior probability
that publish under the OA model and have no of a specific condition (or diagnosis), before
to minimal peer review of submitted papers. any other information is available.
Probes Genetic markers used it genetic assays Prognostic scoring system An approach to
to determine the presences or absence of a prediction of patient outcomes based on for-
particular variant. mal analysis of current variables, generally
through methods that compare the patient
Problem impact study A study carried out in in some way with large numbers of similar
the field with real users as participants and patients from the past.
real tasks to assess the impact of the informa-
tion resource on the original problem it was Progressive caution The idea that reason,
designed to resolve. caution and attention to ethical issues must
govern research and expanding applications
Problem space The range of possible solu- in the field of biomedical informatics.
tions to a problem.
Propositions An expression, generally in lan-
Problem-based learning Small groups of guage or other symbolic form, that can be
students, supported by a facilitator, learned believed, doubted, or denied or is either true
through discussion of individual case scenar- or false.
ios.
Prospective study An experiment in which
Procedural knowledge Knowledge of how to researchers, before collecting data for analy-
perform a task (as opposed to factual knowl- sis, define study questions and hypotheses, the
edge about the world). study population, and data to be collected.
Publication type One of several classes of Quality management A specific effort to let
articles or books into which a new publica- quality of care be the goal that determines
tion will fall (e.g., review articles, case reports, changes in processes, staffing, or investments.
original research, textbook, etc.).
1072 Glossary
Quality measurements Numeric metrics Radiology The medical field that deals with
that assess the quality of health care ser- the definition of health conditions through
vices. Examples of quality measures include the use of visual images that reflect informa-
the portion of a physician’s patients who are tion from within the human body.
screened for breast cancer and 30-day hospital
readmission rates. These measurements have Radiology Information System (RIS) Com
tradition- ally been derived from administra- puter-based information system that supports
tive claims data or paper charts but there is radiology department operations; includes
increasing interest in using clinical data form management of the film library, scheduling
electronic sources. of patient examinations, reporting of results,
and billing.
Quality-adjusted life year (QALY) A mea-
sure of the value of a health outcome that Random-access memory (RAM) The portion
reflects both longevity and morbidity; it is of a computer’s working memory that can
the expected length of life in years, adjusted be both read and written into. It is used to
to account for diminished quality of life due store the results of intermediate computation,
to physical or mental disability, pain, and and the programs and data that are currently
so on. in use (also called variable memory or core
memory).
Quasi-experiments A quasiexperiment is a
non-randomized, observational study design Randomized clinical trial (RCT) A prospective
in which conclusions are drawn from the experiment in which subjects are randomly
evaluation of naturally occurring and non- assigned to study subgroups to compare the
controlled events or cases. effects of alternate treatments.
Query-response cycle For a database system, Read-only memory (ROM) The portion of a
the process of submitting a single request for computer’s working memory that can be read,
information and receiving the results. but not written into.
receive. Such collections are typically orga- Remote Intensive Care Use of networked
nized to include patients with a specific dis- communications methods to monitor patients
ease or class of diseases. in an intensive care unit from a distance far
removed from the patients themselves. See
Regular expression A mathematical model remote monitoring.
of a set of strings, defined using characters of
an alphabet and the operators concatenation, Remote interpretation Evaluating tests (espe-
union and closure (zero or more occurrences cially imaging studies) by having them deliv-
of an expression). ered digitally to a location that may be far
removed from the patient.
Regulated Clinical Research Information
Management (RCRIM) An HL7 workgroup Remote monitoring The use of electronic
that is developing standards to improve infor- devices to monitor the condition of a patient
mation management for preclinical and clini- from a distant location. Typically used to refer
cal research. to the ability to record and review patient data
(such as vital signs) by a physician located in his/
Relations among named entities The charac- her office or a hospital while the patient remains
terization of two entities in NLP with respect at home. See also remote intensive care.
to the semantic nature of the relationship
between them. Remote-presence health care The use of
video teleconferencing, image transmission,
Relative recall An approach to measuring and other technologies that allow clinicians to
recall when it is unrealistic to enumerate all evaluate and treat patients in other than face-
the relevant documents in a database. Thus to-face situations.
the denominator in the calculation of recall is
redefined to represent the number of relevant Report generation A mechanism by which
documents identified by multiple searches on users specify their data requests on the input
the query topic. screen of a program that then produces the
actual query, using information stored in a data-
Relevance judgment In the context of infor- base schema, often at predetermined intervals.
mation retrieval, a judgment of which docu-
ments should be retrieved by which topics in a Representation A level of medical data
test collection. encoding, the process by which as much detail
as possible is coded.
Relevance ranking The degree to which the
results are relevant to the information need Representational effect The phenomenon by
specified in a query. which different representations of a common
abstract structure can have a significant effect
Reminder message A computer-generated on reasoning and decision making.
warning that is generated when a record meets
prespecified criteria, often referring to an Representational state A particular configu-
action that is expected but is frequently for- ration of an information-bearing structure,
gotten; e.g., a message that a patient is due for such as a monitor display, a verbal utterance,
an immunization. or a printed label, that plays some functional
role in a process within the system.
Remote access Access to a system or to infor-
mation therein, typically by telephone or Representativeness A heuristic by which a
communications network, by a user who is person judges the chance that a condition
physically removed from the system. is true based on the degree of similarity
between the current situation and the ste-
1075
Glossary
Resource Description Framework (RDF) An Rich text format (RTF) A format developed to
emerging standard for cataloging metadata allow the transfer of graphics and formatted
about information resources (such as Web text between different applications and oper-
pages) using the Extensible Markup Language ating systems.
(XML).
RIM See Reference Information Model.
RESTful API A “lightweight” application pro-
gramming interface that enables the transfer Risk attitude A person’s willingness to take
of data between two Web-based software sys- risks.
tems.
Risk-neutral Having the characteristic of
Results reporting A software system or sub- being indifferent between the expected value
system used to allow clinicians to access the of a gamble and the gamble itself.
results of laboratory, radiology, and other
tests for a patient. Role-limited access The mechanism by which
an individual’s access to information in a
Retrieval A process by which queries are database, such as a medical record, is limited
com- pared against an index to create depending upon that user’s job characteristics
results for the user who specified the query. and their need to have access to the informa-
tion.
Retrospective chart review The use of past
data from clinical charts (classically paper Router/switch In networking, a device that
records) of selected patients in order to per- sits on the network, receives messages, and
form research regarding a clinical question. for- wards them accordingly to their intended
See also retrospective study. destination.
Retrospective study A research study per- RS-232 A commonly used standard for serial
formed by analyzing data that were previously data communication that defines the number
gathered for another purpose, such as patient and type of the wire connections, the volt-
care. See also retrospective chart review. age, and the characteristics of the signal,
and thus allows data communication among
Return on investment A metric for the ben- electronic devices produced by different
efits of an investment, equal to the net benefits manufacturers.
of an investment divided by its cost.
1076 Glossary
RSS feed A bliographic message stream that and to decrypt information. Thus, the key must
provides content from Internet sources. be kept secret, known to only the sender and
intended receiver of information.
Rule engine A software component that
implements an inference engine that operates Secure Sockets Layer (SSL) A protocol for
on production rules. transmitting private documents via the
Internet. It has been replaced by Transport
Rule-based system A kind of knowledge- Layer Security. By convention, URLs that
based system that performs inference using require an SSL connection start with https:
production rules. instead of http:
Sampling rate The rate at which the continu- Security The process of protecting informa-
ously varying values of an analog signal are tion from destruction or misuse, including
measured and recorded. both physical and computer-based mecha-
nisms.
Scenario A method of teaching that presents
a clinical problem in a story format. Segmentation In image processing, the
extraction of selected regions of interest from
Schema In a database-management system, an image using automated or manual tech-
a machine-readable definition of the contents niques.
and organization of a database.
Selectivity In data collection and recording,
Schemata Higher-level kinds of knowledge the process that accounts for individual styles,
structures. reflecting an ongoing decision-making pro-
cess, and often reflecting marked distinctions
SCORM Shareable Content Object Reference among clinicians.
Model, a standard for interoperability
between learning content objects. Self-experimentation Experiments in which
experimenters themselves are subjects of their
Script In software systems, a keystroke-by- research.
keystroke record of the actions performed for
later reuse. Semantic analysis The study of how symbols
or signs are used to designate the meaning of
SDO See: Standards development organiza- words and the study of how words combine to
tions. form or fail to form meaning.
Semantic patterns The study of the patterns words and the study of how words combine to
formed by the co-occurrence of individual form or fail to form meaning.
words in a phrase of the co-occurrence of the
associated semantic types of the words. Sequence alignment An arrangement of two
or more sequences (usually of DNA or RNA),
Semantic relations A classification of the highlighting their similarity. The sequences
meaning of a linguistic relationship, e.g., are padded with gaps (usually denoted by
“treated in 1995” signifies time while “treated dashes) so that wherever possible, columns
in ER” signifies location. contain identical or similar characters from
the sequences involved.
Semantic sense In NLP, the distinction
between individual word meaning of terms Sequence database A database that stores the
that may be in the same semantic class. nucleotide or amino acid sequences of genes
(or genetic markers) and proteins respectively.
Semantic types The categorization of words
into semantic classes according to meaning. Sequence information Information from a
Usually, the classes that are formed are rel- database that captures the sequence of com-
evant to specific domains. ponent elements in a biological structure (e.g.,
the sequence of amino acids in a protein or of
Semantic Web A future view which envi- nucleotides in a DNA segment).
sions the Internet not only as a source of
content but also as a source of intelligently Sequential Bayes A reasoning method based
linked, agent-driven, structured collections of on a naïve Bayesian model, where Bayes’
machine-readable information. rule is applied sequentially for each new
piece of evidence that is provided to the sys-
Semantics The meaning of individual words tem. With each application of Bayes’ rule,
and the meaning of phrases or sentences con- the posterior probability of each diagnostic
sisting of combinations of words. possibility is used as the new prior probabil-
ity for that diagnosis the next time Bayes’
Semi structured interview Where the investi- rule is invoked.
gator specifies in advance a set of topics that
he would like to address but is flexible about Server A computer that shares its resources
the order in which these topics are addressed, with other computers and supports the activi-
and is open to discussion of topics not on the ties of many users simultaneously within an
pre-specified list. enterprise.
Sender In data interchange, the program or
Service An intangible activity provided to
system that sends a transmitted message.
consumers, generally at a price, by a (presum-
ably) qualified individual or system.
Sensitivity (of a test) The probability of a
positive result, given that the condition under
Service oriented architectures (SOA) A soft-
consideration is present—for example, the
ware design framework that allows specific
probability of a positive test result in a person
processing or information functions (services)
who has the disease under consideration (also
to run on an independent computing platform
called the true-positive rate).
that can be called by simple messages from
Sentence boundary In NLP, distinguishing another computer application. Often con-
the end of one sentence and the beginning of sidered to be more flexible and efficient than
the next. more traditional data base architectures. Best
known example is the Internet which is based
Sentiment analysis The study of how symbols largely on SOA design principles.
or signs are used to designate the meaning of
1078 Glossary
Set-top box A device, such as a cable box, Simultaneous access Access to shared, com-
that converts video content to analog or digi- puter-stored information by multiple con-
tal television signals. current users.
Simple Object Access Protocol (SOAP) A pro- Single-user systems Computers designed for
tocol for information exchange through the use by single individuals, such as personal
HTTP/HTTPS or SMTP transport protocol computers, as opposed to servers or other
using web services and utilizing Extensible resources that are designed to be shared by
Markup Language (XML) as the format for multiple people at the same time.
messages.
Six sigma A management strategy that seeks
Simulation A system that behaves according to improve the quality of work processes by
to a model of a process or another system; for identifying and removing the causes of defects
example, simulation of a patient’s response to and minimizing the variability of those pro-
therapeutic interventions allows a student to cesses. Statistically, a six sigma process is one
learn which techniques are effective without that is free of defects or errors 99.99966%,
risking human life. which equates to operating a process that fits
six standard deviations between the mean
Simulation center Specialized type of learn- value of the process and the specification limit
ing center, though its governance may reside of that process.
1079
Glossary
Slip A type of medical error that occurs when Social determinants of health Conditions
the actor selects the appropriate course of in which people live, learn, work, and play.
action, but it was executed inappropriately. Negative examples include: poverty, poor
access to healthy foods, substandard educa-
Slots In a frame-based representation, the tion, unsafe neighborhoods.
elements that are used to define the semantic
characteristics of the frame. Social networking The use of a dedicated
Web site to communicate informally (on the
SMART See: Substitutable Medical Appli site, by email, or via SMS messages) with
cations and Reusable Technologies. other members of the site, typically by post-
ing messages, photographs, etc.
SMART on FHIR An open, standards- based
platform for medical apps to access patients’ Sociotechnical systems An approach to the
data from electronic medical records. SMART study of work in complex settings that empha-
on FHIR builds on two technology efforts: the sizes the interaction between people and tech-
Substitutable Medical Applications, Reusable nology in workplaces.
Technologies (SMART) Platforms Project and
Fast Health Information Resources (FHIR). Software Computer programs that direct the
hardware how to carry out specific automated
Smart phones A mobile telephone that typi- processes.
cally integrates voice calls with access to the
Internet to enable both access to web sites and Software development life cycle (SDLC) or
the ability to download email and applica- software development process A framework
tions that then reside on the device. imposed over software development in order
to better ensure a repeatable, predictable pro-
Smartwatch A type of wearable computer cess that controls cost and improves quality of
in the form of a wristwatch. Typically pro- a software product.
vides health monitoring features, ability to
run simple third-party apps, and WiFi or Software oversight committee A groups
Bluetooth connectivity, in addition to tell- within organizations that is constituted to
ing time. oversee computer programs and to assess
their safety and efficacy in the local set-
SMS messaging The sending of messages ting.
using the text communication service compo-
nent of phone, web or mobile communication Software psychology A behavioral approach
system-Short Message Service. to understanding and furthering software
design, specifically studying human beings’
SNOMED Systematized Nomenclature of interactions with systems and software. It is
Medicine—A set of standardized medical the intellectual predecessor to the discipline
terms that can be processed electronically; of Human-Computer interaction.
useful for enhancing the standardized use of
medical terms in clinical systems. Solid state drive (SSD) A data storage device
using integrated circuit assemblies as memory
SNOMED-CT The result of the merger of an to store data persistently. SSDs have no mov-
earlier version of SNOMED with the Read ing mechanical components, which distinguish
Clinical Terms. them from traditional electromechanical mag-
netic disks such as hard disk drives (HDDs)
SNP See Single nucleotide polymorphism. or floppy disks, which contain spinning disks
and movable read/write heads.
1080 Glossary
Spamming The process of sending unso- Standard order sets Predefined lists of steps
licited email to large numbers of unwilling that should be taken to deal with certain
recipients, typically to sell a product or make recurring situations in the care of patients,
a political statement. typically in hospitals; e.g., orders to be fol-
lowed routinely when a patient is in the post-
Spatial resolution A measure of the ability surgical recovery room.
to distinguish among points that are close to
each other (indicated in a digital image by the Standard-gamble A technique for utility
number of pixels per square inch). assessment that enables an analyst to deter-
mine the utility of an outcome by comparing
Specialist Lexicon One of three UMLS an individual’s preference for a chance event
Knowledge Sources, this lexicon is intended when compared with a situation of certain
to be a general English lexicon that includes outcome.
many biomedical terms and supports natural
language processing. Standards development organizations An
organization charged with developing a stan-
Specificity (of a test) The probability of a dard that is accepted by the community of
negative result, given that the condition under affected individuals.
consideration is absent—for example, the
probability of a negative test result in a person Static In patient simulations, a program that
who does not have a disease under consider- presents a predefined case in detail but which
ation (also called the true-negative rate). does not vary in its response depending on the
actions taken by the learner.
Spectrum bias Systematic error in the esti-
mate of a study parameter that results when Stemming The process of converting a word
the study population includes only selected to its root form by removing common suffixes
subgroups of the clinically relevant popula- from the end.
tion—for example, the systematic error in the
estimates of sensitivity and specificity that Stop words In full-text indexing, a list of words
results when test performance is measured in that are low in semantic content (e.g., “the”,
a study population consisting of only healthy “a”, “an”) and are generally not useful as mech-
volunteers and patients with advanced dis- anisms for retrieving documents.
ease.
Storage devices A piece of computer equip-
Speech recognition Translation by computer ment on which information can be stored.
of voice input, spoken using a natural vocab-
ulary and cadence, into appropriate natural Store-and-forward A telecommunications
language text, codes, and commands. technique in which information is sent to an
intermediate station where it is kept and sent
Spelling check A procedure that checks at a later time to the final destination or to
the spelling of individual words in entered another intermediate station.
data.
Strict product liability The principle that
Spirometer An instrument for measuring the states that a product must not be harmful.
air capacity of the lungs.
Structural alignment The study of methods
Standard of care The community-accepted for organizing and managing diverse sources
norm for management of a specified clinical of information about the physical organiza-
problem. tion of the body and other physical structures.
1081
Glossary
Structural informatics The study of methods Study population The population of sub-
for organizing and managing diverse sources jects—usually a subset of the clinically rel-
of information about the physical organiza- evant population—in whom experimental
tion of the body and other physical structures. outcomes (for example, the performance of a
Often used synonymously with “imaging diagnostic test) are measured.
informatics”.
Subheadings In MeSH, qualifiers of subject
Structure validation A study carried out to headings that narrow the focus of a term.
help understand the needs for an information
resource, and demonstrate that its proposed Subjectivist approaches Class of approaches
structure makes sense to key stakeholders. to evaluation that rely primarily on qualitative
data derived from observation, interview, and
Structured data entry A method of human- analysis of documents and other artifacts.
computer interaction in which users fill in Studies under this rubric focus on description
missing values by making selections from pre- and explanation; they tend to evolve rather
defined menus. The approach discretizes user than be prescribed in advance.
input and makes it possible for a computer
system to reason directly with the data that Sublanguage Language of a specialized
are provided. domain, such as medicine, biology, or law.
Structured encounter form A form for col- Substitutable Medical Applications and
lecting and recording specific information Reusable Technologies (SMART) A technical
during a patient visit. platform enables EHR systems to behave as
“iPhone-like platforms” through an applica-
Structured interview An interview with a tion programming interface (API) and a set
schedule of questions that are always pre- of core services that support easy addition
sented in the same words and in the same and deletion of third party apps, such that
order. the core system is stable and the apps are sub-
stitutable.
Structured Query Language (SQL) A com-
monly used syntax for retrieving information Summarization A computer system that
from relational databases. attempts to automatically summarize a larger
body of content.
Structured reports A report where the con-
tent of the report has coded values for the key Summary ROC curve A composite ROC curve
information in each pre-specified part of the developed by using estimates from many
report, enabling efficient and reliable compu- studies.
tation on the report.
Summative evaluation after the product is in
Study arm in the context of clinical research, use, is valuable both to justify the completed
a study arm represents a specific modality of project and to learn from one’s mistakes.
an experimental intervention to which a par-
ticipant is assigned, usually through a process Supervised learning An approach to machine
of randomization (e.g., random assigned in a learning in which an algorithm uses a set of
balanced manner to such an arm). Arms are inputs and corresponding outputs to try to
used in clinical study designs where multiple learn a model that will enable prediction of an
variants of a given experimental intervention output when faced with a previously unseen
are under study, for example, varying the tim- input.
ing or dose of a given medication between
arms to determine an optimal therapeutic Supervised learning technique A method
strategy. for determining how data values may sug-
1082 Glossary
gest classifications, where the possible classi- with a computer system and that allow users
fications are enumerated in advance, and the to operate the hardware.
performance of a system is enhanced by eval-
uating how well the system classifies a training Systematic review A type of journal article
set of data. Statistical regression, neural net- that reviews the literature related to a specific
works, and support vector machines are forms clinical question, analyzing the data in accor-
supervised learning. dance with formal methods to assure that
data are suitably compared and pooled.
Supervised machine learning A machine
learning approach that uses a gold standard Systems biology Research on biological
set as input to learn classifiers. networks or biochemical pathways. Often,
systems biology analyses take a comprehen-
Surveillance The ongoing collection, analy- sive approach to model biological function
sis, interpretation, and dissemination of by taking the interactions (physical, regula-
data on health conditions (e.g., breast can- tory, similarity, etc.) of a set of genes as a
cer) and threats to health (e.g., smoking whole.
prevalence). In a computer-based medical
record system, systematic review of patients’ Tablet Generally refers to a personal com-
clinical data to detect and flag conditions puting device that resembles a paper tablet in
that merit attention. Also see public health size and incorporates features such as a touch
surveillance and biosurveillance. screen to facilitate data entry.
public health and health administration. See information retrieval, a word or phrase which
telemedicine. forms part of the basis for a search request.
Telehome care The use of communications Term frequency (TF) In information retrieval,
and information technology to deliver health a measurement of how frequently a term
services and to exchange health information occurs in a document.
to and from the home (or community) when
distance separates the participants. Term weighting The assignment of metrics
to terms so as to help specify their utility in
Tele-ICU See remote intensive care. retrieving documents well matched to a query.
Telemedicine A broad term used to describe Terminal A simple device that has no process-
the delivery of health care at a distance, ing capability of its own but allows a user to
increasingly but not exclusively by means of access a server.
the Internet.
Terminology A set of terms representing the
Teleophthalmology The use of telemedicine system of concepts of a particular subject
methods to deliver ophthalmology services. field.
information only from that provider organi- Thick-client A computer node in a network
zation. or client–server architecture that provides
rich functionality independent of the central
Text generation Methods that create coher- server. See also thin client.
ent natural language text from structured data
or from textual documents in order to satisfy Thin client A program on a local computer
a communication goal. system that mostly provides connectivity to
a larger resource over a computer network,
Text mining The use of large text collec- thereby providing access to computational
tions (e.g., medical histories, consultation power that is not provided by the machine,
reports, articles from the literature, web-based which is local to the user.
resources) and natural language processing
to allow inferences to be drawn, often in the Think-aloud protocol In cognitive science,
form of associations or knowledge that were the generation of a description of what a per-
not previously apparent. See also data mining. son is thinking or considering as they solve a
problem.
Text processing The analysis of text by com-
puter. Thread The smallest sequence of pro-
grammed instructions that can be managed
Text readability assessment and simplifica- independently by an operating system sched-
tion An application of NLP in which compu- uler.
tational methods are used to assess the clarity
of writing for a certain audience or to revise Three-dimensional printing Construction of
the exposition using simpler terminology and a physical model of anatomy or other object
sentence construction. by laying down plastic versions of a stack of
cross-sectional slices through the object.
Text REtrieval Conference (TREC) Organized
by NIST, an annual conference on text Three-dimensional structure information In
retrieval that has provided a testbed for evalu- a biological database, information regarding
ation and a forum for presentation of results. the three-dimensional relationships among
(see 7 trec.nist.gov). elements in a molecular structure.
Text summarization Takes one or several doc- Time-sharing networks An historical term
uments as input and produces a single, coher- describing some of the earliest computer net-
ent text that synthesizes the main points of works allowing remote access to systems.
the input documents.
Time-trade-off A common approach to util-
Text-comprehension A process in which text ity assessment, comparing a better state of
can be described at multiple levels of realiza- health lasting a shorter time, with a lesser state
tion from surface codes (e.g., words and syn- of health lasting a longer time. The time-trad-
tax) to deeper level of semantics. eoff technique provides a convenient method
for valuing outcomes that accounts for gains
TF*IDF weighting A specific approach to term (or losses) in both length and quality of life.
weighting which combines the inverse docu-
ment frequency (IDF) and term frequency Tokenization The process of breaking an
(TF). unstructured sequence of characters into
larger units called “token,” e.g., words, num-
Thesaurus A set of subject headings or bers, dates and punctuation.
descriptors, usually with a cross-reference sys-
tem for use in the organization of a collection Tokens In language processing, the compos-
of documents for reference and retrieval. ite entities constructed from individual char-
1085
Glossary
acters, typically words, numbers, dates, or into proactive, predictive, preventive, and par-
punctuation. ticipatory health.
patient who does not have the disease under UMLS See: Unified Medical Language
consideration. System.
Universal Serial Bus(USB) A connection tech- Utility In decision making, a number that
nology for attaching peripheral devices to a represents the value of a specific outcome to
computer, providing fast data exchange. a decision maker (see, for example, quality-
adjusted life-years).
Unobtrusive measures Measures made using
the records accrued as part of the routine use Validity check A set of procedures applied to
of the information resource, including, for data entered into an EHR intended to detect
example, user log files. or prevent the entry of erroneous data; e.g.,
range checks and pattern checks.
Unstructured interview An interview where
there are no predetermined questions. Value-based reimbursement In health care,
an alternative to traditional fee-for-
service
Unsupervised machine learning A machine reimbursement, aimed at rewarding quality
learning approach that learns patterns from rather than quantity of services.
the data without labeled training sets.
Variable Quantity measured in a study.
URAC An organization that accredits the Variables can be measured at the nominal,
quality of information from various sources, ordinal, interval, or ratio levels.
including health-related Web sites.
Vector mathematics In the context of infor-
Usability The quality of being able to provide mation retrieval, mathematical systems for
good service to one who wishes to use a prod- measuring and comparing vector representa-
uct. tions of documents and their contents.
Usability testing A class of methods for col- Vector-space model A method of full- text
lecting empirical data of representative users indexing in which documents can be concep-
performing representative tasks; considered tualized as vectors of terms, with retrieval
the gold standard in usability evaluation based on the cosine similarity of the angle
methods. between the query and document vectors.
Veterinary informatics The application of Virtual Private Network (VPN) A private com-
biomedical informatics methods and tech- munications network, usually used within a
niques to problems derived from the field of company or organization, or by several dif-
veterinary medicine. Viewed as a subarea of ferent companies or organizations, commu-
clinical informatics. nicating over a public network. VPN message
traffic is carried on public networking infra-
Video-display terminal (VDT) A device for structure (e.g., the Internet) using standard
displaying input signals as characters on a (often insecure) protocols.
screen, typically a computer monitor.
Virtual reality A collection of interface meth-
View In a database-management system, a ods that simulate reality more closely than
logical submodel of the contents and struc- does the standard display monitor, gener-
ture of a database used to support one or a ally with a response to user maneuvers that
subset of applications. heighten the sense of being connected to the
simulation.
View schemas An application-specific
description of a view that supports that pro- Virtual world A three-dimensional represen-
gram’s activities with respect to some general tation of an environment such as a hospital, a
database for which there are multiple views. clinic or a home-care location. The represented
space usually includes a virtual patient, and
Virtual address A technique in memory man- interactive equipment and supplies that can
agement such that each address referenced by be used to examine and care for the patient.
the CPU goes through an address mapping Some virtual worlds are multi-user and allow
from the virtual address of the program to a multiple learners to manifest themselves as
physical address in main memory. characters in the virtual world for interaction
with each other and the patient.
Virtual medical record A standard model
of the data elements found in EHR systems. Virus/worm A software program that is writ-
The virtual medical record approach assumes ten for malicious purposes to spread from one
that, even if particular EHR implementations machine to another and to do some kind of
adopt nonstandard data dictionaries and dis- damage. Such programs are generally self-
parate ways for storing clinical data, map- replicating, which has led to the comparison
ping the contents of each EHR to a canonical with biological viruses.
model greatly simplifies interoperability with
CDS Systems and other applications that may Visual-analog scale A method for valuing
need to access the data. health outcomes, wherein a person simply
rates the quality of life with a health outcome
Virtual memory A scheme by which users can on a scale from 0 to 100.
access information stored in auxiliary mem-
ory as though it were in main memory. Virtual Vocabulary A dictionary containing the ter-
memory addresses are automatically trans- minology of a subject field.
lated into actual addresses by the hardware.
Volatile A characteristic of a computer’s
Virtual patient A digital representation of memory, in that contents are changed when
a patient encounter that can range from a the next program runs and are not retained
simple review of clinical findings to a realis- when power is turned off.
tic graphical view of a person who can con-
verse and can be examined for various clinical Volume rendering A method whereby a com-
symptoms and laboratory tests. puter program projects a two-
dimensional
1089
Glossary
image directly from a three- dimensional WebMD An American company that provides
voxel array by casting rays from the eye of web-based health information services.
the observer through the volume array to the
image plane. Whole Slide Digitization The process of cap-
turing an entire specimen on a slide into a dig-
vonNeuman machine A computer architec- ital image. Compared with capturing images
ture that comprises a single processing unit, of a single field of view from a microscope,
computer memory, and a memory bus. this captures the entire specimen, and can be
millions of pixels on a side. This allows subse-
Voxel A volume element, or small cubic area quent or remote review of the specimen with-
of a three-dimensional digital image (see out requiring capture of individual fields.
pixel).
Wide-area networks (WANs) A network that
Washington DC Principles for Free Access to connects computers owned by independent
Science An organization of non-profit pub- institutions and distributed over long dis-
lishers that aims to balance wide access with tances.
the need to maintain sustainable revenue
models. Wi-Fi A common wireless networking tech-
nology