TPP RX Monitoring
TPP RX Monitoring
Some rights reserved. This work is available under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 IGO
licence (CC BY-NC-SA 3.0 IGO; https://creativecommons.org/licenses/by-nc-sa/3.0/igo).
Under the terms of this licence, you may copy, redistribute and adapt the work for non-commercial purposes, provided the
work is appropriately cited, as indicated below. In any use of this work, there should be no suggestion that WHO endorses
any specific organization, products or services. The use of the WHO logo is not permitted. If you adapt the work, then you
must license your work under the same or equivalent Creative Commons licence. If you create a translation of this work,
you should add the following disclaimer along with the suggested citation: “This translation was not created by the World
Health Organization (WHO). WHO is not responsible for the content or accuracy of this translation. The original English
edition shall be the binding and authentic edition”.
Any mediation relating to disputes arising under the licence shall be conducted in accordance with the mediation rules of
the World Intellectual Property Organization (http://www.wipo.int/amc/en/mediation/rules/).
Suggested citation. Target product profiles for tests for tuberculosis treatment monitoring and optimization. Geneva:
World Health Organization; 2023. Licence: CC BY-NC-SA 3.0 IGO.
Sales, rights and licensing. To purchase WHO publications, see https://www.who.int/publications/book-orders. To submit
requests for commercial use and queries on rights and licensing, see https://www.who.int/copyright.
Third-party materials. If you wish to reuse material from this work that is attributed to a third party, such as tables, figures
or images, it is your responsibility to determine whether permission is needed for that reuse and to obtain permission from
the copyright holder. The risk of claims resulting from infringement of any third-party-owned component in the work rests
solely with the user.
General disclaimers. The designations employed and the presentation of the material in this publication do not imply
the expression of any opinion whatsoever on the part of WHO concerning the legal status of any country, territory, city or
area or of its authorities, or concerning the delimitation of its frontiers or boundaries. Dotted and dashed lines on maps
represent approximate border lines for which there may not yet be full agreement.
The mention of specific companies or of certain manufacturers’ products does not imply that they are endorsed or
recommended by WHO in preference to others of a similar nature that are not mentioned. Errors and omissions excepted,
the names of proprietary products are distinguished by initial capital letters.
All reasonable precautions have been taken by WHO to verify the information contained in this publication. However, the
published material is being distributed without warranty of any kind, either expressed or implied. The responsibility for the
interpretation and use of the material lies with the reader. In no event shall WHO be liable for damages arising from its use.
Acknowledgements iv
Glossary vii
1. Introduction 1
1.1 Background 1
1.2 Patient care pathway 2
1.3 Purpose 5
1.4 Target audience 5
1.5 Targets 5
2. Methodology 7
2.1 Stakeholder and Task Force consultation 7
2.2 Cost–effectiveness modelling 7
2.3 Delphi process and technical consultation 10
2.4 Public consultation and Scientific Target Product Profile Development Group meeting 11
References 27
Annexes 31
Annex 1. Declarations of interests 31
Annex 2. Results of the stakeholder consultation and Delphi survey 33
Annex 3. Technical consultation for the development of target product profiles
for tests and biomarkers for monitoring and optimizing tuberculosis treatment,
26–28 September 2022 (virtual meeting) 35
Annex 4. Scientific TPP Development Group meeting, 27–29 March 2023, Istanbul,
Türkiye (hybrid meeting with remote connection) 38
iii
Acknowledgements
This document has been prepared by the Global Tuberculosis Programme of the World Health
Organization (WHO) with support from the Target Product Profiles (TPPs) Core Group, consisting
of Saskia den Boon (WHO, Switzerland), Claudia Denkinger (University of Heidelberg, Germany),
Dennis Falzon (WHO, Switzerland), Ankur Gupta-Wright (University College London, United Kingdom
of Great Britain and Northern Ireland, and University of Heidelberg, Germany) and Emily MacLean
(University of Sydney, Australia), with input from the TPP Task Force, consisting of Daniela Cirillo (San
Raffaele Institute, Italy), Frank Cobelens (Amsterdam Institute for Global Health and Development,
Netherlands (Kingdom of the)), Stephen Gillespie (University of St Andrews, United Kingdom),
Mikashmi Kohli (FIND, Switzerland), Morten Ruhwald (FIND, Switzerland) and Rada Savic (University
of California San Francisco, United States of America).
WHO thanks all other members of the Scientific TPP Development Group who met 27–29
March 2023: Mustapha Gidado (KNCV TB Plus, Netherlands (Kingdom of the)), Delia Goletti
(Translational Research Unit, National Institute for Infectious Diseases–Scientific Institute for Research,
Hospitalization and Healthcare [INMI-IRCCS], Italy), Rumina Hasan (Aga Khan University, Supranational
Reference Laboratory, Karachi, Pakistan, and London School of Hygiene and Tropical Medicine,
United Kingdom), Cathy Hewison (Médecins Sans Frontières [MSF], France), Kobto Koura (The
International Union Against Tuberculosis and Lung Disease [UNION], France), Christian Lienhardt (French
National Research Institute for Sustainable Development, France, and FAST-TB Initiative, CRDF Global,
USA), Patrick Lungu (East, Central and Southern Africa Health Community [ECSA], Zambia), Timothy
McHugh (University College London, United Kingdom), Lindsay McKenna (Treatment Action Group
[TAG], USA), Thomas Scriba (University of Cape Town, South Africa) and Christine Sekaggya-Wiltshire
(Infectious Diseases Institute, Uganda). Funding agencies were represented by the following: Grania
Brigden (The Global Fund to Fight AIDS, Tuberculosis and Malaria, Switzerland), Debra Hanna (Bill &
Melinda Gates Foundation, USA) and Cherise Scott (Unitaid, Switzerland).
WHO also thanks the modelling team that conducted a health economic analysis to inform the
development of the TPPs: Abdulkadir Civan (University of Heidelberg, Türkiye) and Florian Marx
(University of Heidelberg, Germany), with input from Hae-Young Kim (New York University, USA) and
Hojoon Sohn (Seoul National University, Republic of Korea).
WHO also appreciates the input of those who participated in the virtual technical consultation
held 26–28 September 2022: Macarthur Charles (Centers for Disease Control and Prevention [CDC],
USA), Keertan Dheda (University of Cape Town and London School of Hygiene and Tropical Medicine,
South Africa), Kathy Eisenach (independent consultant, USA), Ronald Allan Fabella (Disease Prevention
and Control Bureau, Department of Health, Philippines), Anneke Hesseling (Stellenbosch University,
South Africa), Ravinder Kumar (Central TB Division, National Tuberculosis Elimination Programme,
India), Yuhong Liu (Beijing Chest Hospital, China), Sanjay Kumar Mattoo (Central TB Division, National
Tuberculosis Elimination Programme, India), Norbert Ndjeka (National TB Programme, South Africa),
Ezio Tavora dos Santos Filho (WHO Civil Society Task Force and Rio de Janeiro Federal University,
Brazil), Boitumelo Semete-Makokotlela (South African Health Products Regulatory Authority,
South Africa), Jose Lapa e Silva (Ministry of Health, Brazil), Kelly Stinson (Cultura, LLC, USA), Nguyen
Thuy Thuong (Oxford University Clinical Research Unit, Viet Nam), Cesar Ugarte-Gil (Universidad
Peruana Cayetano Heredia, Peru) and Hui Xia (National Center for TB Control and Prevention, China
Center for Disease Control, China). The United States Agency for International Development [USAID],
USA was represented by Sevim Ahmedov.
iv Target product profiles for tests for tuberculosis treatment monitoring and optimization
WHO acknowledges the participation in this virtual technical consultation of several commercial
developers of tests for TB treatment monitoring and optimization: Devasena Gnanashanmugam
(Cepheid, USA), Ammar Jagirdar (former employee of Qure.ai, India), Nakaishi Kazunari
(Tauns Laboratories, Japan), Ahmed Maged (Abbott, USA), Megumi Komada (LSI Medience, Japan),
Jerome Nigou (Institut de Pharmacologie et Biologie Structurale, France), Akos Somoskovi (Roche,
USA) and Sruti Sridhar (Qure.ai, India).
WHO also acknowledges the participation of members of the WHO Secretariat: Nazir Ismail,
Fuad Mirzayev, Samuel Schumacher, Kerri Viney, Matteo Zignol (Global Tuberculosis Programme,
Switzerland), Martin van den Boom (Regional Office for the Eastern Mediterranean), Ernesto Montoro
(Regional Office for the Americas), Askar Yedilbayev (Regional Office for Europe), Kleydson Andrade
(Country Office for Brazil), Nkateko Mkhondo (Country Office for South Africa), Kirankumar Rade
(Country Office for India) and Chen Zhongdan (Country Office for China), as well as Corinne Merle
(Special Programme for Research and Training in Tropical Diseases). Overall guidance and direction
were provided by Tereza Kasaeva, Director of the Global Tuberculosis Programme.
The meeting, reviews and document were funded through a grant provided by USAID.
Acknowledgements v
Abbreviations and acronyms
vi Target product profiles for tests for tuberculosis treatment monitoring and optimization
Glossary
Unless otherwise specified, the definitions listed below apply to the terms as used in this publication.
They may have different meanings in other contexts.
• Good outcome (also known as good treatment outcome) is considered to be bacteriological
or clinical improvement, or both, at the end of treatment for tuberculosis (TB) without evidence
of relapse within 6 months. This definition is aligned with revised World Health Organization
(WHO) treatment outcome definitions and incorporates the definition of “cured” (i.e. evidence of
bacteriological response) as well as includes people without bacteriologically confirmed TB who
have a good clinical response (1, 2). This definition also incorporates the operational research
definition of “sustained treatment success” (1).
• Poor outcome (also known as poor treatment outcome) is considered to be a lack of bacteriological
or clinical improvement, or both, by the end of TB treatment; early relapse; the need to prematurely
terminate or switch TB treatment; or death related to TB. This does not consider all post-treatment
complications or other aspects of cure that may be important to people with TB. It does not include
people who were lost to follow up from TB treatment, as it is unlikely any tests would be able to
predict who will be lost to follow up.
• Bacteriological response refers to bacteriological conversion of positive cultures (for drug-resistant
TB and drug-susceptible TB) or smears (for drug-susceptible TB only) to negative without reversion.
Reversion refers to cultures or smears becoming positive after bacteriological conversion.
Bacteriological response is relevant only to those people with bacteriologically confirmed TB who
have had serial samples analysed.
• Early relapse is reversion of bacteriological response or recurrence of TB symptoms in those who
have completed TB treatment or been declared cured within 6 months of the end of TB treatment.
• TB treatment optimization refers to initiating or switching to an effective TB treatment regimen
that results in a high likelihood of a good outcome. This includes using treatment stratification at
treatment initiation when it is determined that some people could achieve a good outcome on a
less intensive regimen (which may be a shorter regimen or a regimen with fewer medicines), while
others might need a more intensive regimen (which might be longer, include more medicines or
include adjuvant interventions) to achieve a good outcome. It also includes changing treatment as
a result of poor response to the current treatment regimen.
• Test refers to biomarker-based and non-biomarker-based tests, such as imaging-based tests, a score
based on clinical features or an assessment of cough sounds. While many of the characteristics
and targets in these TPPs assume that a test (or tests) will be biomarker based, the TPPs also apply
to non-biomarker-based tests. Such tests may even be more acceptable to people with TB and
health care workers and preferred to biomarker-based tests.
Glossary vii
1. Introduction
1.1 Background
Tuberculosis (TB) continues to be a major cause of morbidity and mortality globally despite being
curable and preventable. In 2021, an estimated 10.6 million people developed TB disease, and an
estimated 1.6 million people died from TB (3). Recent increases in the incidence of and mortality
from TB after many years of steady declines have been attributed to disruptions associated with the
coronavirus disease (COVID-19) pandemic. Although the largest gap in the TB care cascade remains
between the estimated incidence and number of cases notified – that is, an estimated 4.2 million
people with TB who are not notified and thus probably not diagnosed or treated – worldwide
only 86% of people who started on TB treatment in 2020 successfully completed it. Treatment success
remains lower in some World Health Organization (WHO) regions than in others (e.g. it is 72% in the
Region of the Americas), in people living with HIV (77% globally in 2020) and in those with drug-
resistant TB (DR-TB), for whom it is 60% (3).
TB treatment regimens are long and arduous for people with TB and can be associated with both
serious adverse effects and significant financial costs, which impact adherence and treatment
outcomes (4). Monitoring TB treatment to identify those who are at risk of poor outcomes could
improve overall treatment success. WHO currently recommends the regular use of sputum smear
microscopy and mycobacterial culture to monitor the response to treatment in adults with pulmonary
TB (4–6). However, these methods have important limitations as they rely on people on treatment
producing sputum samples and, therefore, are less useful in certain populations (e.g. in people
with extrapulmonary TB, children and people living with HIV). Sputum smear microscopy has poor
diagnostic accuracy for predicting poor outcomes, and culture is expensive, slow and not readily
available in many settings with a high burden of TB (7). While rapid, near-patient molecular assays
to detect Mycobacterium tuberculosis have transformed the diagnosis of TB, these assays are not
currently suitable for monitoring the response to TB treatment due to the persistence of nucleic acids
in nonviable TB bacilli during and sometimes beyond successful TB treatment (8).
WHO guidelines also recommend sputum smear microscopy or mycobacterial culture, or both, as
a test of cure, and the treatment outcome definition of cure includes a negative sputum culture or
smear during the last month of treatment (1, 4–6).1 Relying on sputum samples as tests of cure for
TB has limitations similar to those for monitoring TB treatment, with even fewer people being able
to produce quality samples at the end of treatment (9, 10). The current definition of cure for routine
programmatic monitoring also does not consider relapse, and sputum culture at the end of treatment
has poor sensitivity for relapse (11, 12). Although other indicators (e.g. weight, clinical symptoms,
1
Prior to 2021, cure was defined by WHO as a “pulmonary TB patient with bacteriologically confirmed TB at the beginning of treatment
who was smear- or culture-negative in the last month of treatment and on at least one previous occasion”. Since 2021, cure is now
defined as a patient who has “completed treatment as recommended by the national policy with evidence of bacteriological response
and no evidence of failure”. Bacteriological response is defined as the conversion of sputum culture or smear without reversion.
1. Introduction 1
chest X-ray or other imaging) are recommended in some guidelines to monitor TB treatment, including
in children and in extrapulmonary or sputum-negative TB disease, there is a need for accurate tests
to identify people who have been cured of TB (5, 6, 13, 14).
Clinical trials of novel, shorter TB treatment regimens for drug-susceptible TB (DS-TB) have consistently
shown that the vast majority of people with TB achieve relapse-free cure with 4 months of treatment
(15). This suggests that the standard 6-month first-line regimen is longer than needed for most people
with TB to achieve sustained cure (i.e. most people with TB are overtreated). A 4-month regimen
composed of rifapentine, isoniazid, pyrazinamide and moxifloxacin is now conditionally recommended
by WHO as an alternative to the current standard 6-month regimen (5, 16). A 4-month regimen has
also been recommended for non-severe TB in children and young adolescents, with non-severe being
defined by chest X-ray findings and clinical signs (16, 17). An alternative strategy of treating people
with TB with even shorter regimens (i.e. 2 months of treatment with bedaquiline and linezolid added
to isoniazid, pyrazinamide and ethambutol) and then treating those who relapse for longer was non-
inferior for a composite outcome of death, ongoing treatment or active disease when compared with
the standard 6-month treatment in a clinical trial (18). WHO target regimen profiles for TB treatment
aim for new TB regimens to be 2 months or shorter for both DS-TB and DR-TB (19).
Being able to accurately predict who will achieve a good outcome with shorter treatment regimens and
who will need longer or different regimens could have important clinical and public health benefits,
and could improve the quality of care for individual people with TB (20). This is even more pertinent
for DR-TB, for which a range of regimens with differing durations and adverse effects are currently
recommended (5). People with TB differ in their risk of relapsing after treatment. Therefore, there is
a clear need for new tools that can accurately predict outcomes in people who are starting or already
taking TB treatment and that allow for treatment to be optimized to improve outcomes.
There are novel platforms and biomarkers in the pipeline that offer the potential to monitor
TB treatment, predict outcomes, identify cure and allow optimization of management. These have
been summarized in reviews (21, 22) (a detailed review is beyond the scope of this document), and
potential tests include:
• host characteristic assays, including assays for cytokines, transcriptomic profiles and other biomarkers;
• pathogen burden and fitness assays;
• imaging-based assays;
• clinical scores, clinical symptoms and signs, cough sounds and lung function tests.
2 Target product profiles for tests for tuberculosis treatment monitoring and optimization
Fig. 1 presents a diagrammatic representation of a typical care pathway for people diagnosed
with DS-TB; it is based on a review of international TB treatment guidelines, including those from
countries with a high burden of TB, as well as the published literature on the implementation
of sputum-based tools for monitoring TB treatment.
The results of the review were supported by a survey about the implementation of monitoring for TB
treatment sent to NTPs and others who support TB treatment programmes in high-burden countries. The
results demonstrate that TB treatment monitoring is implemented in most settings using sputum smear
microscopy and culture, and, to a lesser extent, chest X-ray, clinical assessment and a patient’s weight.
Monitoring most commonly occurs at the primary care level, with visits at 2 months after treatment
initiation and at the end of treatment; note that monitoring for treatment of DR-TB usually occurs
monthly at the secondary care level. The main barriers to implementing treatment monitoring were
cost and laboratory capacity to support the tests, delays in receiving test results, difficulties in people
attending health care facilities for follow up and the workload for health care workers looking after
people with TB. Only a small minority of countries have started to implement 4-month TB treatment
regimens for DS-TB, and no settings have yet considered how monitoring will be adapted for newer,
shorter regimens. These factors were considered while developing the characteristics and targets for
these TPPs.
1. Introduction 3
Fig. 1. Graphical representation of a current typical care pathway for treating people
with drug-susceptible TB in high-burden countries
Current TB care pathway Notes New tests for treatment
monitoring and
optimizationa
Baseline visit(s)
2-month visit
Sputum for SSM with Recommended at 2 months, Identify patients with a poor
or without culture although suboptimally response to treatment
implemented to inform the decision to step
down to continuation phase or
switch to different TB regimen
• Check adherence and as new regimens become
Decision informed by results of
adherence interventions available (e.g. from less to
Step down to SSM and culture, severity of
• Review diagnosis No more intensive regimen); can
continuation phase of disease (e.g. SSM grade, chest
• Refer to specialist be done days or weeks after
treatment? X-ray), improvements in
• Repeat tests for drug treatment initiation
symptoms, weight gain,
resistance
adherence and treatment
• Restart TB treatment
support, comorbidities
Yes
Successful treatment
completion Cured or treatment completed as
per WHO guidelines
Follow up and
management for
post-TB lung disease
DR-TB: drug-resistant TB; DS-TB: drug-susceptible TB; mWRD: molecular WHO-approved rapid diagnostic test; SSM:
sputum smear microscopy; TB: tuberculosis.
a
The timing for new tests for TB treatment monitoring and optimization are likely to change, depending on test
characteristics and new TB regimens. Monitoring tests for TB treatment may be done before the 2-month treatment visit.
4 Target product profiles for tests for tuberculosis treatment monitoring and optimization
1.3 Purpose
The overall purpose of these TPPs is to provide a set of parameters to guide the development and
manufacture of new tests to monitor and optimize TB treatment while considering the needs of
TB programmes and people with TB. While tests for TB optimization may well be useful in clinical
trials of TB treatment (25), the primary purpose of these TPPs is to develop tests for programmatic
use in high-burden settings. In parallel, Maclean et al. have developed guidance for generating
evidence to advise researchers about how to evaluate candidate tests (MacLean EL, et al., manuscript
in preparation, 2023). Recent advances in TB treatment regimens and important uncertainties about
the potential role for novel tests in monitoring and optimizing TB treatment have created a demand
for these TPPs. However, it is acknowledged that they may need reviewing and updating before the
typical 5-year period ends.
1.5 Targets
These TPPs provide both minimal and optimal targets for each included characteristic. The minimal
requirements are the lowest acceptable level for that characteristic, and the optimal requirements are
the ideal levels for that characteristic, expected to have the greatest public health impact (Table 1).
The minimal and optimal targets represent a range. Ideally, products will meet all minimal targets
and as many of the optimal targets as possible.
Term Definition
Characteristic A test requirement or specification that is measurable
Minimal For a specific characteristic, minimal refers to the lowest acceptable output for that
characteristic. To be acceptable, tests should meet the minimal target.
Optimal For a specific characteristic, optimal represents the ideal output that is believed to be
realistically achievable. Meeting the optimal targets will have the greatest impact for
end-users, clinicians and people with TB. Ideally, developers would design and develop
their solutions to meet the optimal targets for all characteristics.
1. Introduction 5
2. Methodology
WHO followed a stepwise approach to identify test characteristics that are important for people with
TB and TB programmes, as well as for test developers and other stakeholders.
A TPP Core Group was formed to coordinate the development of the TPPs and lead the writing process.
WHO also constituted a Scientific TPP Development Group, consisting of leading scientists and experts,
public health officials, and in-country end-user representatives. Members of this group were engaged
throughout the TPP development process and proposed the final TPPs during an in-person meeting.
Members of the Scientific TPP Development Group completed standard WHO declarations of interest
procedures (Annex 1). A sub-set of this group formed the TPP Task Force and was consulted more
frequently to help direct the TPP development.
The initial draft TPP document was prepared by the TPP Core Group based on a systematic literature
review (22), a draft (unpublished) TPP developed by FIND, and multiple meetings of the Core Group
and the TPP Task Force.
A Markov multistate model was developed to explore the potential health impacts and costs of
hypothetical novel tests used during TB treatment to identify people at high risk of a poor treatment
outcome2 who would benefit from further investigation or extended or modified treatment, or a
2
Tests used for treatment monitoring are assumed to identify patients for whom treatment produces an inadequate response and who
are therefore at high risk of a poor treatment outcome.
2. Methodology 7
combination of these (i.e. treatment monitoring; see also Section 3.1 about use cases). The model
was used to simulate standard treatment for non-rifampicin-resistant TB in a hypothetical cohort of
1 000 adults diagnosed with pulmonary TB.
The core model structure (Fig. 2) consists of three states used to distinguish between people who
will have an adequate response to treatment and those who will have an inadequate response due
to drug resistance, or poor adherence or other causes.3 People initiating TB treatment move into one
of the three states conditional on undetected drug resistance and other reasons. During the course
of treatment, people transition from an adequate response to an inadequate response to treatment
at rates describing the incidence of poor adherence and the acquisition of DR-TB. An inadequate
response due to poor adherence includes individuals who take their medication irregularly and those
who interrupt their treatment.
The model’s structure includes subdivisions for time spent with an adequate response to treatment (0
to 6 months; Fig. 2). People who complete the final month of treatment transition into one of two
outcome states, cure or failure/relapse, at probabilities depending on the total time spent with an
adequate response to treatment. People with fewer months of an adequate response have a lower
probability of cure compared with those with up to 6 months of adequate response. Failure/relapse
is a composite outcome that includes people with persistent active disease during treatment or early
disease reactivation (i.e. relapse). Cure is assumed in the absence of failure and of relapse during a
period of at least 2 years after treatment completion. Individuals can die at any time during treatment,
with higher rates of death assumed for people during their first month of treatment and among those
with an inadequate response to treatment.
Treatment monitoring is implemented as a process that can be enabled (i.e. switched on or off) at the
end of each month of treatment (Fig. 2). A probabilistic decision-analytic process is implemented to
simulate interventions after a positive test. These consist of (i) adherence assessment and counselling
plus a 2-month extension of treatment (i.e. in the event of poor adherence) and (ii) drug-resistance
testing and initiation of treatment for DR-TB (i.e. if resistance is confirmed).
Four model scenarios were considered for the analysis (Table 2). The base-case scenario assumes no
treatment monitoring. Each test scenario assumes a reference test is conducted prior to initiation
of treatment.
Monitoring scenario 1 assumes the use of smear microscopy to detect an inadequate response
to treatment. Two additional scenarios assume the use of hypothetical novel tests that meet the
minimal (scenario 2) and optimal (scenario 3) criteria for sensitivity and specificity to detect an
inadequate response to treatment.
Costs incurred under each scenario are estimated in 2023 US dollars (US$), adopting a health care
system perspective. Average costs reflect the costs of conducting a particular monitoring test,
assessing adherence, providing counselling, conducting drug-resistance testing (i.e. in the event
of a positive monitoring test) and providing treatment. Model parameters were obtained from the
published literature. The number of unfavourable treatment outcomes averted (i.e. death, failure/
relapse) and disability-adjusted life years (DALYs) averted were estimated to measure the health impact
achieved under the monitoring scenarios. Costs and DALYs averted were discounted at an annual
rate of 3.0%. Incremental cost–effectiveness ratios were estimated using fixed estimates of costs per
test performed. In addition, for each hypothetical test scenario, costs per monitoring test performed
were estimated in relation to variable willingness-to-pay thresholds.
3
Other causes of an inadequate response to treatment may include, for example, malabsorption of the medicine or severe TB disease
resulting in a delayed treatment response.
8 Target product profiles for tests for tuberculosis treatment monitoring and optimization
The model was used to explore the potential health impacts and to estimate incremental cost–
effectiveness ratios of hypothetical monitoring tests (Table 2). Incremental cost–effectiveness ratios
represent the additional costs incurred per additional DALY averted, using smear microscopy as
a reference. These ratios were estimated for different price levels per monitoring test performed and
compared against different willingness-to-pay thresholds, including the estimated cost per DALY
averted with monitoring using smear microscopy. The model uses probabilistic analysis to account
for parameter uncertainty. Best estimates represent the mean of 1 000 model trajectories with 95%
uncertainty intervals calculated as the 2.5th and 97.5th percentile values.
Fig. 2. Structure of the Markov multistate model used to assess the potential health
and cost–effectiveness impacts of hypothetical novel tests used during TB treatment to
identify people at high risk of a poor outcome
Adequate response to
treatment
Treatment for
drug-resistant TB
Monitoring t
Failure/
Cure
relapse
0 1 2 3 4 5 6
2. Methodology 9
Table 2. Scenarios and their key parameters used in the Markov multistate model
to assess the potential health and cost–effectiveness impacts of hypothetical novel
tests used during TB treatment to identify people at high risk of a poor outcome
Description Mean Uncertainty Source
value interval
Base-case scenario
(no monitoring)
Monitoring scenario 1
(Sputum smear microscopy, month 2)
Monitoring coverage (sputum-based) 0.750 0.650–0.850 Assumptiona
Sensitivity 0.448 0.225–0.673 Model estimate, based on
Horne et al. 2010 (26), Davis et al. 2013 (27)
Specificity 0.826 0.732–0.924 Model estimate, based on
Horne et al. 2010 (26), Davis et al. 2013 (27)
Cost per testb 5.00 2.00–8.00 Unit cost study repository
(https://www.ghcosting.org)
Monitoring scenario 2c
(Hypothetical test with TPP minimum criteria, month 2)
Monitoring coverage (sputum-based) 0.750 0.650–0.850 Assumptiona
Sensitivity 0.825 0.750–0.900 TPP test criteria (minimal)
Specificity 0.850 0.800–0.900 TPP test criteria (minimal)
Cost per testb Variesd NA NA
Monitoring scenario 3 c
10 Target product profiles for tests for tuberculosis treatment monitoring and optimization
2.4 Public consultation and Scientific Target Product Profile
Development Group meeting
Public comment on draft v. 0.1 of the TPPs was invited through an online survey that was distributed
through the WHO mailing list. The draft TPP and online survey were available from 9 February to 9
March 2023. The results of the public consultation were presented at a final Scientific TPP Development
Group meeting (Annex 4), at which agreement on the final TPPs was obtained (v. 1.0, May 2023).
2. Methodology 11
3. Target product profiles
3.1 Use cases for tests for TB treatment monitoring and optimization
Based on the decisions facing clinicians who manage people with TB, as outlined in the patient care
pathway in Fig. 1, and possible stratified TB treatment regimens in the future, three use cases for
tests for TB treatment monitoring and optimization have been identified. These use cases are for
tests conducted (i) at the time of TB treatment initiation, to identify people who will not achieve
a good treatment outcome (see Glossary) with a less intensive TB treatment regimen; (ii) during
TB treatment, to identify people at high risk of a poor treatment outcome and who would benefit
from further investigation or a more intensive TB treatment regimen, or both; and (iii) at the end of
TB treatment, to identify those who have not achieved a good treatment outcome. These use cases
are further described in Table 3.
Less intensive TB treatment regimens may be shorter in duration or contain fewer medicines. A more
intensive TB treatment regimen may be longer, contain more medicines or include adjuvant therapies. It
is assumed that these different regimens will be non-inferior to the standard of care in terms of efficacy,
but less intensive regimens will have some advantages to people with TB and TB programmes (e.g.
shorter duration, smaller pill burden, less risk of adverse reactions to medicines).
Table 3. Summary of use cases for tests for TB treatment monitoring and optimization
Use case Timing Explanatory notes Consequences
Identify people Treatment Prioritizes avoiding undertreatment If the test predicts a poor treatment
who require a initiation of people who would have a poor outcome, the person will be started on
more intensive TB treatment outcome on a less intensive a more intensive TB regimen, which may
treatment regimen regimen (e.g. those with more severe be longer or contain more medicines,
disease). The test would, therefore, aim or the patient may need additional
to predict with high accuracy those interventions.
likely to have a poor treatment outcome
If the test predicts a good treatment
on a less intensive treatment regimen
outcome, the person can be initiated on
(i.e. the tests need high sensitivity for
a less intensive TB regimen (albeit with
predicting poor outcomes).
ongoing monitoring).
Identify people During Aims to identify people who are not If the test shows a poor treatment
at risk of a treatment adequately responding to TB treatment. response or risk of poor treatment
poor outcome These are sometimes known as tests for outcome, the person may need a
with current TB TB treatment monitoring or treatment different, more intensive optimized
treatment response, and they have aims similar treatment regimen (e.g. longer or with
to using sputum-based microscopy or more medicines or adjuvant therapies),
culture during treatment. These tests may need to undergo further testing
need to be accurate enough to not (e.g. for drug resistance) or may need
miss people responding poorly and adherence support interventions or
also to minimize the number of people adjuvant therapies. If a test shows
incorrectly identified as at risk of a poor a good treatment response, the
treatment outcome. person can continue with the current
treatment.
TB: tuberculosis.
• While there are many different aspects of TB disease severity and response to treatment (e.g.
bacteriological, clinical, radiological, functional), these tests will aim to identify markers that predict
the outcome of TB treatment and instances in which this outcome could be improved by modifying
treatment (i.e. in terms of medicines or length of treatment) or by adjuvant interventions.
• Ideally, one test will be developed that can be used for all use cases, but this is not mandatory.
The TPPs outline the characteristics that tests for all use cases will have in common, and they list
separately those characteristics for which the targets are different between use cases.
• Based on the TB care pathways for the 6-month DS-TB regimens currently implemented by TB
programmes in high-burden settings, follow-up visits and tests for TB treatment monitoring are
usually done at around 1–2 months after treatment initiation and tests of cure at around 5–6 months.
New tests may also be done several times and at different times, according to what they measure
and how they predict poor outcomes, and based on implementation of new treatment regimens
(e.g. the 4-month rifapentine-based regimen). TB programmes may change their TB care pathways
in response to newer regimens and diagnostic tests.
• In the patient care pathway analysis and cost–effectiveness modelling, the example of a 6-
month DS-TB scenario was used, but the TPPs also apply to the monitoring and optimization of
shorter regimens or those for DR-TB.
• Tests that identify drug resistance will also predict a poor outcome with standard DS-TB regimens,
and switching to appropriate TB regimens will improve outcomes. However, these TPPs are not
referring to tests of drug resistance. TPPs for tests of drug resistance are described elsewhere (28).
• A reference standard is useful when considering the diagnostic accuracy of tests. However, there is
no perfect test of treatment response or optimization. Therefore, the best proxy for the reference
standard for these tests will be the final treatment outcome 6 months after the end of treatment.
A more detailed discussion of reference standards will be available in a manuscript that provides
methodological guidance for evaluating tests for treatment monitoring and optimization (MacLean
EL et al., manuscript in preparation, 2023).
14 Target product profiles for tests for tuberculosis treatment monitoring and optimization
3.2. Minimal and optimal targets for key characteristics for tests for TB
treatment monitoring and optimization
Tables 4–6 describe the target characteristics for tests to be used when people start treatment, during
treatment and at the end of treatment; Table 7 describes the operational characteristics that all TPP
tests have in common.
TB: tuberculosis.
a
If a test detects those with a high likelihood of a good treatment outcome, the targets for sensitivity and specificity should
be reversed.
TB: tuberculosis.
a
If a test detects those likely to have a good treatment outcome, the targets for sensitivity and specificity should be
reversed.
16 Target product profiles for tests for tuberculosis treatment monitoring and optimization
Table 6. Target characteristics for tests used to identify people with a poor treatment
outcome at the end of TB treatment
Characteristic Minimal target Optimal target Explanatory notes
Sensitivity ≥80% ≥95% The test should have high sensitivity to identify people who
(for detecting have a poor outcome despite completing their anticipated TB
those with a treatment. People identified as having a poor outcome will
poor treatment have further investigations or optimized treatment, or both,
outcome)a and be followed up appropriately.
Specificity (for ≥90% ≥95% The test should correctly identify those persons with a good
detecting those treatment outcome to reduce overtreatment, as this has
with a good important costs and consequences for people with TB and TB
treatment programmes. The prevalence of poor outcomes will be lower at
outcome) the end of TB treatment, which is why the specificity is higher
than for the other use cases.
Timing For use at the For use at the The test should be done at the time of or just before the
last possible last follow- anticipated completion of TB treatment to maximize its
point prior up visit or last accuracy for detecting poor treatment outcomes.
to the end of medication refill
Ideally, the test would be performed and still be accurate at the
TB treatment prior to the
final follow-up visit or final medication refill because, from an
(e.g. within the anticipated end
operational perspective, people may not return at the end of
last week of of TB treatment
treatment.
treatment)
Ideally, the timing should reduce the risk of people with a
poor outcome having a break in their treatment while waiting
for results; therefore, the timing of the test also requires
consideration of its turnaround time. Note that the duration of
treatment may vary as newer TB regimens are introduced or for
DR-TB or central nervous system disease.
Frequency For use at two For use only The test needs to identify people with a poor treatment
time points (so once, at outcome at the end of TB treatment.
two results can the end of
Ideally, the test would be done only once, at the end of
be compared) treatment
treatment, to reduce costs and resources. It is possible the
(without
test will need to be done at least twice to compare serial
the need
measurements (e.g. at treatment initiation and early during
for baseline
treatment). Ideally, the test would be the same test as for one
measurement)
or more of the other use cases; therefore, at least one other
result would be available for comparison.
18 Target product profiles for tests for tuberculosis treatment monitoring and optimization
Characteristic Minimal target Optimal target Explanatory notes
Target For use by For use by all These tests are aimed at people starting or already receiving
population people with people starting TB treatment and at health care workers seeking to optimize
pulmonary TB or already subsequent treatment.
or any form of receiving
Ideally, the test would be applicable to all people starting
bacteriologically treatment for
TB treatment, including children, elderly people, pregnant
confirmed TB TB
women, people living with HIV, people with severe malnutrition
or comorbidities, and people with DR-TB or extrapulmonary
or disseminated TB, or both. However, it may be that a
test is applicable only to a subpopulation of people being
treated for TB (e.g. those with pulmonary TB or those with
bacteriologically confirmed TB), as current monitoring tests are.
Although not ideal, it may be acceptable if the nature of the
test precludes its use in all people, but it provides substantial
improvement over existing similar tests (e.g. sputum smear
microscopy) and its use is still cost effective; however, this may
impair uptake and implementation of the test.
Sample type (if For use with For use with Optimally, tests should use not only sputum as the sample type,
a clinical sample a minimally a minimally as sputum is not produced by all people with TB (particularly
is required) invasive invasive, easily subgroups such as children, people living with HIV and those
respiratory accessible with extrapulmonary TB), it is difficult to obtain safely and it
sample (i.e. sample (e.g. tends to become scarcer as TB treatment progresses.
not limited to urine, breath,
As a minimal requirement, tests should use respiratory samples
sputum alone) capillary blood)
and not be limited to sputum. Tests based on respiratory samples
(e.g. oral swabs, saliva or breath) are likely to have a higher yield
and thus perform better for people with pulmonary TB.
Ideally, samples that are minimally invasive and easy to access
are preferred, such as urine or capillary blood. The volume
required should be reasonable to collect with one sample.
Samples should require minimal processing or preparation prior
to testing, and they should pose minimal risks to health care
workers with respect to infection prevention and control.
Tests may not require clinical samples at all (e.g. imaging
modalities such as chest X-ray or digital chest X-ray with
computer-aided detection, ultrasound) or may be scores based
on multiple clinical observations, with or without extra clinical
samples being required. Tests with artificial intelligence–based
biomarkers of voice or cough sounds also would not require
clinical samples.
Time to result ≤1 day ≤2 hours The time to result reflects the time from when a sample is
received to the release of results under optimal programmatic
conditions.
Most people with TB will have treatment initiation and follow
up at primary care–level health facilities. Ideally, the results
should be available during the same clinical encounter so
health care workers can make decisions about management
and treatment immediately; this is particularly important for
tests done at the time of treatment initiation.
If it takes longer to receive the results, then people may need
to be contacted (e.g. by phone or SMS) for a return visit,
implying more cost for people treated for TB and risk of loss to
follow up. This may be necessary for tests requiring samples to
be transported to laboratories. Complementary measures will
need to be in place to facilitate sample transportation to the
place of testing and to automate the production and electronic
transmission of results to the clinician and patient.
20 Target product profiles for tests for tuberculosis treatment monitoring and optimization
Characteristic Minimal target Optimal target Explanatory notes
Maintenance Yearly servicing Ideally, The need for instrument maintenance should be minimal,
(for instrument- of instrument maintenance requiring at most yearly maintenance and minimal expertise for
based assays) free, or maintenance and service.
maintenance
Maintenance and technical support should be available
should be
in-country or local staff could be trained to provide this, or
done locally or
both. Alternatively, maintenance could be done remotely. It
remotely
should be possible to receive software updates for instruments
over a low-bandwidth mobile internet connection. The cost of
maintenance should be low, and ideally, service agreements
will be detailed in the purchase contract.
Quality control Provision of Integrated Quality control would ideally be integrated into the test,
reagents for quality control including point-of-care tests.
quality control
If external reagents are required for quality control, these
should be provided with the test kits.
Training and ≤3 days ≤1 day Ideally, training for those performing the test should be minimal.
education
Optimally, health care workers at community or peripheral
health facilities should be able to conduct the tests with brief
training, but tests requiring training as a laboratory technician
would also be acceptable.
In a hypothetical cohort of 1 000 people with TB and using a test meeting the minimal TPP targets
for diagnostic accuracy, of the 150 who will have a poor treatment outcomes, 135 would be correctly
identified by this test and 15 would be missed (i.e. inappropriately started on a less intensive regimen).
Of the 850 who would have a good treatment outcome on a less intensive regimen, 595 would be
correctly identified and started on a less intensive regimen, while 255 would be incorrectly classified
as likely to have a poor treatment outcome and started on a more intensive regimen. Under the
optimal TPP diagnostic accuracy targets (≥95% sensitivity and ≥80% specificity), only 7 people likely
to have a poor treatment outcome on less intensive TB treatment would be missed, and 170 likely
to have a good treatment outcome on less intensive TB treatment would be overtreated with a more
intensive regimen.
100 100
98.4 97.5 96.6
90 PPV 90
NPV
80 80
70 70
Predictive value (%)
60 60
50 50
40 42.9 40
30 34.6 30
20 25.0 20
10 10
0 0
0 5 15 20 25 30
3.3.2 Tests to identify the risk of a poor treatment outcome during TB treatment
Assuming that the prevalence of poor outcomes during treatment is 10%4 and assuming minimal
TPP accuracy targets – that is, 75% sensitivity and 80% specificity for a poor treatment outcome –
the test would have an NPV of 96.6% and a PPV of 29.4%. Therefore, in the hypothetical cohort of
1 000 people with TB, 25 of the 100 with a poor response to treatment would be missed, and 180 of
the 900 people with a good response to treatment would be incorrectly identified as having a poor
response and, therefore, might be overtreated. Under optimal TPP accuracy targets of ≥90% sensitivity
and specificity, only 9 of 100 people would be missed, and 90 of 900 would be overtreated (Fig. 4).
Fig. 4. Negative predictive values (NPVs) and positive predictive values (PPVs) for a
test with 90% sensitivity and 90% specificity for detecting those at risk of a poor
TB treatment outcome at varying prevalences of poor outcomes
100 100
98.8 98.1 97.3
90 PPV 90
NPV
80 80
70 70
Predictive value (%)
69.2
60 60
61.4
50 50
50
40 40
30 30
20 20
10 10
0 0
0 5 10 15 20 25 30
4
The prevalence of poor outcomes is assumed to decrease during and at the end of treatment because treatment would already have
failed some patients or they would have had another unfavourable outcome.
22 Target product profiles for tests for tuberculosis treatment monitoring and optimization
3.3.3 Tests to detect people with poor outcomes at the end of TB treatment
Assuming a 5%4 prevalence of poor treatment outcomes by the end of treatment, including
early relapse, a test with 80% sensitivity and 90% specificity (i.e. the TPP minimal accuracy targets)
for detecting poor outcomes would have an NPV of 98.8%, and a PPV of 29.6% (Fig. 5). Therefore,
in the hypothetical cohort of 1 000 TB people, 10 of the 50 with a poor treatment outcome would
be missed, and 95 of the 950 people with a good treatment response would be incorrectly identified as
having a poor response and, therefore, might be overtreated. Under optimal accuracy targets of ≥95%
sensitivity and ≥95% specificity, only 5 out of 100 people would be missed and 48 of 950 overtreated.
Fig. 5. Negative predictive values (NPVs) and positive predictive values (PPVs) for a
test with 80% sensitivity and 90% specificity for detecting a poor outcome at
varying prevalences of poor TB treatment outcomes
100 100
98.8 97.6 96.2
90 90
80 80
70 70
Predictive value (%)
60 60
58.5
50 50
47.1
40 40
30 30
29.6
PPV
20 NPV 20
10 10
0 0
0 5 10 15 20 25 30
Currently, the tests most commonly used by TB programmes for monitoring TB treatment are sputum
smear microscopy and, sometimes, sputum culture or chest X-ray. These range in cost from US$
3.00 to US$ 20.00. Based on the cost–effectiveness modelling, tests meeting the optimal criteria for
sensitivity and specificity to detect a poor response to treatment should cost less than US$ 25.00
to achieve better health impacts at lower costs compared with sputum smear microscopy (Fig. 6).
The acceptable costs of tests may depend on how well they meet or surpass the targets described
in these TPPs.
The cost of less than US$ 25.00 is indicative only and allows for different pricing. Novel tests for
monitoring and optimizing TB treatment should be priced so that their use and implementation are
cost effective given the reduction in costs and morbidity associated with identifying people at risk
of poor outcomes from TB treatment. It is expected that public support for the development of the
tests and market shaping would lead to lower test costs.
60
50
40
30
US$ 24.75
20
10
0
1000 2000 3000 4000 5000
Willingness to pay per additional DALY averted (US$)
a
The figure shows estimates for a hypothetical, non-sputum-based test for TB treatment monitoring with optimal sensitivity
and specificity, according to criteria in the target product profiles. At a willingness-to-pay threshold of US$ 1 575 per additional
DALY averted (the best estimate of cost per DALY averted under smear microscopy–based testing), the test should cost ≤
US$ 24.75. The purple line indicates the best estimate; the grey shaded area indicates the 95% uncertainty interval.
There are additional cost considerations for the different use cases addressed in these TPPs. A test
that is done at the start of treatment and can direct people to optimal treatment early is likely to lead
to cost savings (e.g. shorter treatment regimens) and these can be reflected in the cost of the test.
Using serial testing – that is, monitoring tests conducted during consecutive months of treatment – to
identify individuals with an inadequate response to treatment may be advisable if the test is affordable.
Modelling suggests that serial testing should start during the early phase of treatment (Fig. 7). Serial
24 Target product profiles for tests for tuberculosis treatment monitoring and optimization
testing strategies may also provide opportunities to detect trends in treatment response over time and to
follow up on people for whom treatment was modified to ensure that the modified regimen is beneficial.
Ideally, tests will not require instruments, but if instruments are required these should be affordable
for TB and other health programmes, as capital costs are often a barrier to implementation. Higher
instrument costs may be more acceptable for multiuse platforms, but they would have to be supported
by evidence of their cost effectiveness. Instrument costs should include warranties, service contracts
and technical support for ≥3 years. Different models of instrument provision should be considered,
such as rental contracts or cost-per-result models.
Fig. 7. Model projections of costs per unfavourable treatment outcome averted for different
strategies of single time point and serial testing (i.e. repeated monitoring for
consecutive months) during TB treatment using a hypothetical sputum-based test that meets
optimal target criteriaa
3.3
Monitoring started
during month
1
3.2 2
3
Average cost / unfavorable treatment
outcome averted (thousand US$)
3.1
3.0
2.9
2.8
2.7
2.6
2.5
1 2 3 4 5 6
Month of treatment
a
The figure shows that the cost per unfavourable treatment outcome averted can be lower for serial testing compared with
single time point testing if it is started early in treatment, and that the cost is higher in later stages of treatment
References 27
14. Migliori GB, Tiberi S, Zumla A, Petersen E, Chakaya JM, Wejse C, et al. MDR/XDR-TB management
of patients and contacts: challenges facing the new decade. The 2020 clinical update by the Global
Tuberculosis Network. Int J Infect Dis. 2020;92S:S15–25. doi:10.1016/J.IJID.2020.01.042.
15. Imperial MZ, Nahid P, Phillips PPJ, Davies GR, Fielding K, Hanna D, et al. A patient-level pooled
analysis of treatment-shortening regimens for drug-susceptible pulmonary tuberculosis. Nat Med.
2018;24:1708–15. doi:10.1038/S41591–018–0224–2.
16. Turkova A, Wills GH, Wobudeya E, Chabala C, Palmer M, Kinikar A, et al. Shorter treatment
for nonsevere tuberculosis in African and Indian children. New Engl J Med. 2022;386:911–22.
doi:10.1056/NEJMoa2104535.
17. WHO consolidated guidelines on tuberculosis. Module 5: management of tuberculosis
in children and adolescents. Geneva: World Health Organization; 2022 (https://
apps.who.int/iris/handle/10665/352522, accessed 5 February 2023).
18. Paton NI, Cousins C, Suresh C, Burhan E, Chew KL, Dalay VB, et al. Treatment strategy for rifampin-
susceptible tuberculosis. N Engl J Med. 2023;388:873–87. doi:10.1056/NEJMoa2212537.
19. Target regimen profiles for tuberculosis treatment. 2023 edition. Geneva: World Health Organization;
2023.
20. Papineni P, Phillips P, Lu Q, Cheung YB, Nunn A, Paton N. TRUNCATE-TB: an innovative trial design
for drug-sensitive tuberculosis. Int J Infect Dis. 2016;45:404. doi:10.1016/J.IJID.2016.02.863.
21. Heyckendorf J, Georghiou SB, Frahm N, Heinrich N, Kontsevaya I, Reimann M, et al. Tuberculosis
treatment monitoring and outcome measures: new interest and new strategies. Clin Microbiol Rev.
2022;35:e0022721. doi:10.1128/CMR.00227–21.
22. Zimmer AJ, Lainati F, Aguilera Vasquez N, Chedid C, McGrath S, Benedetti A, et al. Biomarkers that
correlate with active pulmonary tuberculosis treatment response: a systematic review and meta-
analysis. J Clin Microbiol. 2022;60:e0185921. doi:10.1128/JCM.01859–21.
23. Mapping care pathways. London: National Institute for Health and Care Excellence;
2015 (https://www.nice.org.uk/media/default/About/what-we-do/Into-practice/HTAP/
HTAPMappingCarePathwaysResource.pdf, accessed 8 June 2023).
24. Micocci M, Gordon AL, Allen AJ, Hicks T, Kierkegaard P, Mclister A, et al. COVID-19 testing in English
care homes and implications for staff and residents. Age Ageing. 2021;50:668–72. doi:10.1093/
ageing/afab015.
25. Hills NK, Lyimo J, Nahid P, Savic RM, Lienhardt C, Phillips PPJ. A systematic review of endpoint
definitions in late phase pulmonary tuberculosis therapeutic trials. Trials. 2021;22:515. doi:10.1186/
S13063–021–05388–1.
26. Horne DJ, Royce S, Gooze L, Narita M, Hopewell PC, Nahid P, Steingart KR. Sputum monitoring
during tuberculosis treatment for predicting outcome: systematic review and meta-analysis. Lancet
Infect Dis 2010;10(6):387–94. doi:10.1016/S1473–3099(10)70071–2.
27. Davis JL, Cattamanchi A, Cuevas LE, Hopewell PC, Steingart KR. Diagnostic accuracy of same-day
microscopy versus standard microscopy for pulmonary tuberculosis: a systematic review and meta-
analysis. Lancet Infect Dis 2013; 13(2):147–54. doi:10.1016/S1473–3099(12)70232–3.
28. Target product profile for next-generation TB drug-susceptibility testing at peripheral centres. Geneva:
World Health Organization; 2021 (https://apps.who.int/iris/handle/10665/343656, accessed 8 June
2023).
29. Peetluk LS, Ridolfi FM, Rebeiro PF, Liu D, Rolla VC, Sterling TR. Systematic review of prediction
models for pulmonary tuberculosis treatment outcomes in adults. BMJ Open. 2021;11:e044687.
doi:10.1136/BMJOPEN-2020–044687.
28 Target product profiles for tests for tuberculosis treatment monitoring and optimization
30. Imperial MZ, Phillips PPJ, Nahid P, Savic RM. Precision-enhancing risk stratification tools for selecting
optimal treatment durations in tuberculosis clinical trials. American J Respir Crit Care Med.
2021;204:1086–96. doi:10.1164/RCCM.202101–0117OC.
31. Moberg J, Oxman AD, Rosenbaum S, Schünemann HJ, Guyatt G, Flottorp S, et al. The GRADE
Evidence to Decision (EtD) framework for health system and public health decisions. Health Res
Policy Syst. 2018;16:45. doi:10.1186/s12961–018–0320–2.
32. Evaluating and publicly designating regulatory authorities as WHO listed authorities: policy document.
Geneva: World Health Organization; 2021 (https://apps.who.int/iris/handle/10665/341749, accessed
8 June 2023).
References 29
Annexes
The following members of the Scientific TPP Development Group declared no interests that could
conflict with the objectives of the Target Product Profiles: Abdulkadir Civan, Frank Cobelens,
Mustapha Gidado, Ankur Gupta-Wright, Rumina Hasan, Cathy Hewison, Kobto Koura, Mikashmi Kohli,
Christian Lienhardt, Patrick Lungu, Emily MacLean, Florian Marx and Lindsay McKenna.
The following members of the development group declared interests that were judged not to conflict
with the objectives of the Target Product Profiles:
Daniela Cirillo declared research support to the Ospedale San Raffaele of US$ 38 600 from the
TB Alliance in an unrestricted grant for a multipartner test of minimum inhibitory concentrations
for pretomanid, which ended in 2020, and a grant of US$ 62 629 from the European Committee on
Antimicrobial Susceptibility Testing (EUCAST) to coordinate work on a standard protocol that involved
reference laboratories for different anti-TB medicines.
Claudia Denkinger declared that during 2014–2019 she was head of the Tuberculosis Programme
at the FIND, a non-profit global health organization based in Geneva, Switzerland. At FIND she was
involved in exploratory work on TPP development and tools for treatment monitoring. FIND never
received industry funding for this purpose. In her current role, she continues to work on TB diagnostics,
specifically on research related to treatment monitoring tools, but she has never received industry
funding for this purpose.
Stephen Gillespie declared research grants to the University of St Andrews from the European Union
and the European and Developing Countries Clinical Trials Partnership (EDCTP) of € 1 million over
5 years and noted that his group has received payments to support testing of the LifeArc TB-MBLA
(Mycobacterium tuberculosis molecular bacterial load assay) kit (< £ 10 000). His group has received
grants that have been used to test the utility of the TB-MBLA kit as noted above. The University has
registered the trademark VitalBacteria to allow them to sell test kits for research use at cost. The
group has knowledge related to TB-MBLA, and a patent filing with regard to some aspects of the
work is in process.
Delia Goletti declared receiving honoraria for public lectures from bioMérieux (€ 1 500 in 2021) and
QIAGEN (€ 2 500 for 2021 and 2022), as well as honoraria for contributing to the development of
tests (US$ 122 500 from Quidel for 2019, 2020 and 2021; € 650 from PBD Biotech in 2022). She
also declared that the National Institute for Infectious Diseases L. Spallanzani in Rome, Italy, where
she works, received grants to evaluate new tests for diagnosing TB infections from bioMérieux
(€45 000 for each of two grants).
Annexes 31
Anneke Hesseling declared research support to Stellenbosch University in the form of US$ 4 million
per annum for several investigator-initiated research studies, with the US National Institutes of
Health (NIH)–funded IMPAACT network, the Tuberculosis Trials Consortium, Unitaid, Biomedical
Research Computing–Wellcome Trust, the South African Medical Research Council and the EDCTP.
Timothy McHugh declared a consultancy with Lumora Ltd (Erba Molecular) (£ 2 000, ceased in 2019),
a research collaboration with the TB Alliance (worth approximately £ 2 000 000 during 20162022), as
well as a research award received from the European Union’s Innovative Medicines Initiative UNITE4TB
project (€ 1 000 000, current).
Morten Ruhwald declared previous employment at the Statens Serum Institut and notes that employees
and former employees can be paid up to € 40 000 in taxable income if a license agreement involving
patents with the employee as an inventor generates an extraordinarily high income for the Statens
Serum Institut.
Rada Savic declared research funding or grants to her employer from the Bill & Melinda Gates
Foundation (BMGF), Critical Path Institute/BMGF, CZ BioHub Unitaid, the Global Alliance for TB
Drug Development, Innovative Medicines Initiative, US NIH, National Institute of Allergy and Infectious
Diseases (NIAID, part of the NIH), NIH/NIAID/Johns Hopkins University (JHU), NIH / NIAID / Rutgers, NIH/
NIAID/Stellenbosch, and WHO for a value of US$ 18.5 million (2016–2028), as well as a leadership
grant from the AIDS Clinical Trials Group.
Thomas Scriba declared research support including grants from the US NIH, the Bill & Melinda
Gates Foundation, the South African Medical Research Council and the EDCTP to his employer in
addition to a patent held by the University of Cape Town.
32 Target product profiles for tests for tuberculosis treatment monitoring and optimization
Annex 2. Results of the stakeholder consultation and Delphi survey
1. Stakeholder consultation
At the start of the target product profile (TPP) development process, stakeholders were surveyed to
gain input about their needs as they related to the TPPs.
A total of 45 individuals responded, including researchers, scientists and clinicians; staff at national
TB programmes and ministries of health, nongovernmental organizations, laboratories, public health
agencies and implementing partners; as well as developers of diagnostics, biomarkers and tests for TB;
and representatives of civil society organizations. Respondents came from all WHO regions except
for the Eastern Mediterranean. Altogether 35 (78%) respondents had worked in TB for more than
10 years.
The first set of questions in the survey focused on the proposal to examine three distinct use
case scenarios: a test to monitor treatment (i.e. during treatment), a test for cure (i.e. at the end
of treatment) and a test for disease severity (i.e. at the start of treatment). Respondents were asked
to indicate their agreement with definitions proposed for each of the tests. There was 84–100%
agreement (38–45 respondents) with the proposed TPP categories and definitions, but comments
highlighted a preference for programmes to have one test that could be used in all three scenarios.
In the same spirit, it was proposed that one TPP document should be developed to cover the three
scenarios instead of three separate TPPs.
When asked to prioritize the goals that should guide the development of the TPPs, the highest priority
was given to identifying people who would benefit from a change in their TB treatment, followed by
identifying people who have had a poor bacteriological response to treatment and identifying those
with an adequate response to treatment. The most important characteristic was considered to be
diagnostic accuracy, followed by the turnaround time for results and the setting in which the test
could be used. In addition, participants suggested that the acceptability of the test by people with
TB was important and indicated that a simple self-test could be advantageous.
Most participants considered it important that the TPPs include recommendations for special
populations (76% agreement, 35 respondents) so that their needs are specifically considered. The
most important population groups in order of priority were children, people living with HIV, people
with drug-resistant TB, pregnant and lactating women, and people with extrapulmonary TB.
There was general agreement (82%, 37 respondents) that it is important to be able to monitor treatment
using samples other than sputum, and the preferred sample types in order of priority were saliva,
urine, blood and imaging, followed by monitoring based on clinical features or measurements alone.
The ideal times to conduct a test to monitor treatment in order of priority were at 1 month after
starting treatment, within days to 1–2 weeks of starting treatment and if a patient was experiencing
clinical deterioration. The preferred frequencies for tests used to monitor treatment were in order
of priority, at the start of treatment and then again when a decision about treatment needs to
be made, and multiple times after starting treatment, followed by only once, when a decision about
treatment needs to be made. The ideal timing for conducting a test of cure was in order of priority,
at the end of TB treatment, 1 month prior to completing treatment and after completion to check
for early relapse, followed by early during TB treatment.
Annexes 33
2. Delphi survey
Participants of the technical consultation were sent the draft v. 0.0 TPP and a Delphi survey, as an
integral part of the consultation process. The survey was sent to 47 people of whom 29 (62%)
submitted a complete response. Participants were asked to express their level of agreement with
the proposed targets according to a predefined Likert scale ranging from 1 to 5 (1 – agree, 2 –
somewhat agree, 3 – neither agree nor disagree, 4 – somewhat disagree and 5 – disagree). Individuals
were asked to provide comments or alternative targets when they did not agree with a proposed
target (i.e. those scored at 3, 4 or 5). The targets on which fewer than 80% of participants agreed
were discussed further at the virtual technical consultation.
There was general agreement on the targets for assay or instrument design, target placement of test,
target user of test, maintenance, quality control, sample throughput, target population, sensitivity
and specificity. Fewer than 80% of participants (68%) agreed on the proposed minimal target for time
to result of 3 days or less and this was changed to 1 day or less in the subsequent discussion. There was
also disagreement (only 65% agreed) on the minimal requirement for sample type which was initially
proposed to be sputum only. After further discussion this was changed to “for use with a minimally
invasive respiratory sample (i.e. not limited to sputum alone)”. The Delphi survey also showed that
fewer than 80% of participants agreed (79%) on the optimal target proposed for timing for tests used
at treatment initiation to identify people with TB who require a more intensive treatment regimen.
In the subsequent discussions this target was adapted from “≤ 3 days of starting treatment” to “up
to 7 days after starting treatment”. Although the Delphi survey indicated a low level of agreement
(76%) on the proposed target for frequency for tests used to identify people at risk of a poor
treatment outcome during TB treatment, after further discussion in the meeting participants agreed
to maintain this target as “for use once at follow-up visit during treatment (without the need for
baseline measurement)”.
34 Target product profiles for tests for tuberculosis treatment monitoring and optimization
Annex 3. Technical consultation for the development of target product
profiles for tests and biomarkers for monitoring and optimizing
tuberculosis treatment, 26–28 September 2022 (virtual meeting)
Agenda
Day 1: Monday, 26 September Chair: Daniela Cirillo
14.00–14.15 Welcome and opening remarks Tereza Kasaeva
14.15–14.30 Introduction and background to the meeting, TPP Saskia den Boon
development process and meeting agenda
14.30–14.45 Limitations of current tests and overview of tests in the Emily MacLean
pipeline: results from a systematic review
14.45–15.30 Draft TPPs and results of Delphi survey Ankur Gupta-Wright
15.30–15.45 Break
15:45–17:00 Discussion of TPP categories and attributes: All
sensitivity and specificity Introduction by Ankur
Gupta-Wright
Annexes 35
Participants
Macarthur Charles, US Centers for Disease Control and Prevention, Atlanta, Georgia, USA
Daniela Cirillo*, San Raffaele Institute, Milan, Italy
Abdulkadir Civan, University of Heidelberg, İzmir, Türkiye
Frank Cobelens*, Amsterdam Institute for Global Health and Development, Amsterdam, Netherlands
(Kingdom of the)
Claudia Denkinger*, University of Heidelberg, Heidelberg, Germany
Keertan Dheda, University of Cape Town and London School of Hygiene and Tropical Medicine, Cape
Town, South Africa
Norbert Djeka, National TB Programme, Pretoria, South Africa
Kathy Eisenach, independent consultant, Little Rock, Arkansas, USA
Ronald Allan Fabella, Disease Prevention and Control Bureau, Department of Health, Manila, Philippines
Mustapha Gidado, KNCV TB Plus, The Hague, Netherlands (Kingdom of the)
Stephen Gillespie*, University of St Andrews, St Andrews, United Kingdom
Delia Goletti, Translational Research Unit, National Institute for Infectious Diseases–Scientific Institute
for Research, Hospitalization and Healthcare (INMI-IRCCS), Rome, Italy
Ankur Gupta-Wright*, University College London, United Kingdom, and University of Heidelberg,
Heidelberg, Germany
Rumina Hasan, Aga Khan University, Supranational Reference Laboratory Karachi, Pakistan, and
London School of Hygiene and Tropical Medicine, United Kingdom
Anneke Hesseling, Stellenbosch University, Cape Town, South Africa
Cathy Hewison, Médecins Sans Frontières (MSF), Paris, France
Kobto Koura, The International Union Against Tuberculosis and Lung Disease, Paris, France
Mikashmi Kohli*, FIND, Geneva, Switzerland
Ravinder Kumar, Central TB Division, National Tuberculosis Elimination Programme, Delhi, India
Jose Lapa e Silva, Ministry of Health, Rio de Janeiro, Brazil
Christian Lienhardt, French National Research Institute for Sustainable Development, Montpellier,
France and FAST-TB Initiative, Civilian Research and Development Foundation Global, Arlington,
Virginia, USA
Yuhong Liu, Beijing Chest Hospital, Beijing, China
Emily MacLean*, University of Sydney, Sydney, Australia
Sanjay Kumar Mattoo, Central TB Division, National Tuberculosis Elimination Programme, Delhi, India
Florian Marx, University of Heidelberg, Berlin, Germany
Timothy McHugh, University College London, London, United Kingdom
Lindsay McKenna, Treatment Action Group, New York, New York, USA
Morten Ruhwald*, FIND, Geneva, Switzerland
Rada Savic*, University of California San Francisco, San Francisco, California, USA
Thomas Scriba, University of Cape Town, Cape Town, South Africa
Christine Sekaggya-Wilthsire, Infectious Diseases Institute, Kampala, Uganda
Boitumelo Semete-Makokotlela, South African Health Products Regulatory Authority, Pretoria, South
Africa
Kelly Stinson, Cultura, LCC, Atlanta, Georgia, USA
36 Target product profiles for tests for tuberculosis treatment monitoring and optimization
Ezio Tavora dos Santos Filho, WHO Civil Society Task Force and Rio de Janeiro Federal University, Rio
de Janeiro, Brazil
Nguyen Thuy Thuong, Oxford University Clinical Research Unit, Ho Chi Minh City, Viet Nam
Cesar Ugarte-Gil, Universidad Peruana Cayetano Heredia, Lima, Peru
Hui Xia, National Center for TB Control and Prevention, China Center for Disease Control, Beijing,
China
* member of the task force
Funding agencies
Sevim Ahmedov, United States Agency for International Development, Washington, DC, USA
Grania Brigden, The Global Fund to Fight AIDS, Tuberculosis and Malaria, Geneva, Switzerland
Debra Hanna, Bill & Melinda Gates Foundation, Seattle, Washington, USA
Annexes 37
Annex 4. Scientific TPP Development Group meeting, 27–29 March 2023,
Istanbul, Türkiye (hybrid meeting with remote connection)
Agenda
Day 1: Monday, 27 March Chairs: Ankur Gupta-Wright
and Mikashmi Kohl
9.30–10.00 Arrival and registration
10.00–10.10 Welcome and opening remarks Matteo Zignol
10.10–10.30 Meeting objectives, presentation of participants and review Dennis Falzon
of declarations of interest
10.30–10.50 TPP development process and meeting agenda Saskia den Boon
10:50–11.20 Break
11.20–11:50 Feedback from the public consultation Emily MacLean
11:50–12:30 Discussion All
12.30–13.30 Lunch
13.30– 14.00 Analysis of patient care pathways Ankur Gupta-Wright
14:00–15:00 Discussion All
15:00–15:30 Break
15:00– 17:30 Discussion and consensus-seeking about targets for test All
characteristics common to all TPPs
10.30–11.00 Break
11.00– 11.30 WHO target regimen profiles for TB treatment Fuad Mirzayev and Samuel
Schumacher
11.30–11.45 Recap and summary Claudia Denkinger
11.45–12.00 Next steps and closing Saskia den Boon and Matteo
Zignol
12.00–14.00 Lunch
38 Target product profiles for tests for tuberculosis treatment monitoring and optimization
Members of the Scientific TPP Development Group
Frank Cobelens*, Amsterdam Institute for Global Health and Development, Amsterdam, Netherlands
(Kingdom of the) (could not attend the meeting)
Daniela Cirillo*, San Raffaele Institute, Milan, Italy
Claudia Denkinger*, University of Heidelberg, Heidelberg, Germany
Mustapha Gidado, KNCV TB Plus, Netherlands (Kingdom of the)
Stephen Gillespie*, University of St Andrews, St Andrews, United Kingdom
Delia Goletti, Translational Research Unit, National Institute for Infectious Diseases–Scientific Institute
for Research, Hospitalization and Healthcare (INMI-IRCCS), Rome, Italy
Ankur Gupta-Wright*, University College London, United Kingdom, and University of Heidelberg,
Heidelberg, Germany
Rumina Hasan, Aga Khan University, Supranational Reference Laboratory, Karachi, Pakistan, and
London School of Hygiene and Tropical Medicine, United Kingdom
Cathy Hewison, Médecins Sans Frontières, Paris, France
Kobto Koura, The International Union Against Tuberculosis and Lung Disease, Paris, France
Mikashmi Kohli*, FIND, Geneva, Switzerland
Christian Lienhardt, French National Research Institute for Sustainable Development, Montpellier,
France, and and FAST-TB Initiative, CRDF (Civilian Research and Development Foundation) Global,
Arlington, Virginia, USA
Patrick Lungu, East, Central and Southern Africa Health Community, Lusaka, Zambia
Emily MacLean*, University of Sydney, Sydney, Australia
Timothy McHugh, University College London, London, United Kingdom
Lindsay McKenna, Treatment Action Group, New York, New York, USA
Morten Ruhwald*, FIND, Geneva, Switzerland (could not attend the meeting)
Rada Savic*, University of California San Francisco, San Francisco, California, USA
Thomas Scriba, University of Cape Town, Cape Town, South Africa
Christine Sekaggya-Wiltshire, Infectious Diseases Institute, Kampala, Uganda
* member of the task force
Funding agencies
Grania Brigden, The Global Fund to Fight AIDS, Tuberculosis and Malaria, Geneva, Switzerland
Debra Hanna, Bill & Melinda Gates Foundation, Seattle, Washington, USA
Cherise Scott, Unitaid, Geneva, Switzerland
Mathematical modellers
Abdulkadir Civan, University of Heidelberg, İzmir, Türkiye
Florian Marx, University of Heidelberg, Berlin, Germany
42 Target product profiles for tests for tuberculosis treatment monitoring and optimization