0% found this document useful (0 votes)
17 views2 pages

Coombes 2005

The commentary discusses the maturation of serum proteomics profiling technology since the initial NCI-FDA study, highlighting the challenges of reproducibility and sensitivity in mass spectrometry. Despite numerous studies attempting to identify cancer biomarkers, many findings have not been validated independently, and issues related to experimental design and data analysis persist. The authors emphasize the need for improved protocols and methodologies to enhance the reliability of proteomic profiling in clinical applications.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
17 views2 pages

Coombes 2005

The commentary discusses the maturation of serum proteomics profiling technology since the initial NCI-FDA study, highlighting the challenges of reproducibility and sensitivity in mass spectrometry. Despite numerous studies attempting to identify cancer biomarkers, many findings have not been validated independently, and issues related to experimental design and data analysis persist. The authors emphasize the need for improved protocols and methodologies to enhance the reliability of proteomic profiling in clinical applications.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

C O M M E N TA R Y

Serum proteomics profiling—a young


© 2005 Nature Publishing Group http://www.nature.com/naturebiotechnology

technology begins to mature


Kevin R Coombes, Jeffrey S Morris, Jianhua Hu, Sarah R Edmonson & Keith A Baggerly

During the three years since the US National reproducibility remain unresolved. This article
Cancer Institute–Food and Drug Administration outlines some possible improvements in experi-
(NCI-FDA) proteomics group published their mental design and data analysis that might help
seminal1 (but flawed2–4) study using mass in this regard.
spectrometry to profile the serum proteome
of ovarian cancer patients, more than 60 pub- Potential problems
lished studies have applied similar technology It is important to recognize the provisional
to a wide range of cancers and other diseases. nature of the conclusions of most serum pro-
Although some studies have looked directly at teomic studies carried out to date. In most
tumor tissue, most have targeted easily accessible cases where the protein peaks have been iden-
fluids such as serum. A few studies used high- tified, they have turned out to be well-known,
resolution instruments with a matrix-assisted acute-phase proteins. Critics of this application
P.M. Motta & S. Makabe/Photo Researchers, Inc.
laser desorption and ionization (MALDI) ion of mass spectrometry have claimed that it is
source and a time-of-flight (TOF) ion detector. inherently limited in its depth of coverage, with The seminal NCI-FDA study used mass
The majority, however, have relied on lower res- a dynamic range that prevents detection of low- spectrometry to analyze the serum proteome of
olution instruments and surface-enhanced laser abundance proteins12,13. As for predictive pat- patients with ovarian cancer, cells of which are
shown above in a scanning electron micrograph
desorption and ionization (SELDI), a MALDI terns, no two independent studies have found
(magnification: 414×).
variant that uses commercially prepared chro- the same patterns to date. Moreover, none of the
matographic surfaces (ProteinChips; Ciphergen, patterns has been validated in an independent
Fremont, CA, USA) to separate proteins. study by another laboratory. In one study, both contrasts of interest. Similar problems appear to
These studies have a common goal: to identify the sensitivity and the specificity of the pattern have affected the NCI-FDA studies4,16.
biomarker patterns in the proteome that can be declined significantly when samples were pro- In another MALDI study of the serum pro-
used for diagnosis, prognosis or monitoring of cessed in the same laboratory after a delay of teome of 50 cancer patients carried out by our
disease. The preponderance of evidence from several months14. groups at M.D. Anderson15, hierarchical clus-
these studies seems to suggest that proteomic Our experience at the M.D. Anderson tering produced two clusters. In this case, the
profiling is able to detect anonymous protein Cancer Center suggests that it can be difficult sample collection protocol had been changed
peaks that are expressed differently in cancer to obtain stable, reproducible mass spectrom- after the first 20 samples had been collected;
patients than in healthy individuals5–7. The etry results over time and across laboratories. the clusters matched the sample collection
studies also provide evidence that we may be In one unsuccessful study, SELDI experiments protocols. The change in protocols resulted in
able to discover patterns that are useful for pre- were performed to analyze the serum proteome reproducible changes in the serum proteome,
diction of the presence or absence of disease8–11. from 247 patients with five different subtypes and these changes were significantly larger than
For serum proteomics to realize its full poten- of cancer, with 40, 60, 65, 62 and 20 patients those resulting from different histological sub-
tial, however, several problems in sensitivity and with each subtype. Simple hierarchical cluster- types of cancer15.
ing (average clustering with distance based on The source of these problems appears to be
Kevin R. Coombes, Jeffrey S. Morris, Jianhua Pearson correlation) of the data produced six inextricably linked to the promise of the tech-
Hu and Keith A. Baggerly are at the Section of clusters. Unfortunately, the clusters matched nology. Mass spectrometry can be exquisitely
Bioinformatics, Department of Biostatistics and the run dates of the samples, not the biological sensitive to changes in the part of the proteome
Applied Mathematics, The University of Texas subtypes of cancer15. The same clustering was that it can measure accurately. Differences in
M.D. Anderson Cancer Center, 1515 Holcombe observed using repeated runs of a quality con- sample collection or sample handling affect the
Blvd., Box 447, Houston, Texas 77030, USA trol sample, which was run concurrently with proteome, to a degree that can dominate bio-
and Sarah R. Edmonson is in the Department the experimental samples. Because some of the logical changes. In addition, as with many sen-
of Family and Community Medicine, Baylor biological subtypes were confounded with the sitive instruments, mass spectrometers can be
College of Medicine, Houston, Texas 77098, USA. run order, it was impossible in this data set to temperamental. They provide the most accurate
e-mail: kcoombes@mail.mdanderson.org separate technological effects from biological and highest resolution results over a very small

NATURE BIOTECHNOLOGY VOLUME 23 NUMBER 3 MARCH 2005 291


C O M M E N TA R Y

subset of their mass range, where they have extreme, the NCI-FDA proteomics group per- used to uncover proteomic expression patterns
been carefully focused. Relatively small changes forms minimal preprocessing, and they allow the linked to a disease state. Careful studies are now
to the operating conditions can be amplified to intensity at every measured mass-to-charge (m/ underway to develop reproducible protocols. An
produce fairly large differences in mass spectra, z) value to be a potential feature for distinguish- increased awareness of the importance of sound
making it difficult to maintain consistent, repro- ing cancer patients from healthy controls1,23,24. experimental design is enhancing the ability
ducible results. Because their experimental design confounded of studies to yield reproducible results. Rapid
biology with technology2–4,16, their approach is improvements are being made in developing
Potential solutions not yet firmly established. At the other extreme, methods to analyze the data produced by mass
Reproducibility can be enhanced in several a majority of published studies perform pre- spectrometry protein profiling experiments. The
© 2005 Nature Publishing Group http://www.nature.com/naturebiotechnology

ways: by improving instrumentation, experi- processing and peak detection using software next step will be to show that these improved
mental protocols, study design or analysis tools. from Ciphergen, the manufacturer of the SELDI protocols, designs and analytical tools produce
Although improved instruments might help, the instrument25. In our opinion, the Ciphergen results that are sufficiently reliable and repro-
studies conducted to date using higher resolu- software is overly conservative when peak find- ducible to change the way patients are treated.
tion MALDI instruments have not convincingly ing, and its baseline correction algorithm pro-
1. Petricoin, E.F. et al. Lancet 359, 572–577 (2002).
demonstrated that they produce more robust duces biased estimates of peak heights. These 2. Sorace, J.M. & Zhan, M. BMC Bioinformatics 4, 24
proteomics patterns than the SELDI approach. algorithmic weaknesses can reduce the effective (2004).
3. Baggerly, K.A., et al. Bioinformatics 20, 777–785
This may change as the technology evolves. sensitivity of the instrument below its true capa-
(2004).
Performing peptide profiling instead of protein bilities and can hamper its reproducibility. 4. Baggerly, K.A., et al. Signal in noise: evaluating reported
profiling is promising because it concentrates Researchers have been actively developing reproducibility of serum proteomics tests for ovarian
cancer. J. Natl. Cancer Inst., 97, in press (2005).
all measurements in a small region of extremely better methods for processing mass spectra. 5. Vlahou, A. et al. Clin. Chem. 50, 1438–1441 (2004).
high mass resolution17,18. In addition, high-end Yasui et al.26,27 have used a super smoother to 6. Pusztai, L. et al. Cancer 100, 1814–1822 (2004).
Fourier transform mass spectrometry is being find peaks, but had to settle for binary indica- 7. Lin, Z., Jenson, S.D., Lim, M.S. & Elenitoba-Johnson,
K.S. Mod. Pathol. 17, 670–678 (2004).
evaluated for its reproducibility in serum pro- tors of presence or absence in lieu of quanti- 8. Zhang, Z. et al. Cancer Res. 64, 5882–5890 (2004).
teomic profiling19. fication. Our group28 introduced an iterative 9. Wadsworth, J.T. et al. Clin. Cancer Res. 10, 1625–1632
(2004).
The Early Detection Research Network method to perform simultaneous background
10. Soltys, S.G. et al. Clin. Cancer Res. 10, 4806–4812
(EDRN) is currently conducting a multi- correction and peak finding, but this method (2004).
institutional study to develop and validate required manual adjustment of numerous 11. Xiao, Z. et al. Cancer Res. 64, 2904–2909 (2004).
12. Diamandis, E.P. Expert Rev. Mol. Diagn. 4, 575–577
protocols to produce reproducible mass parameters. Qu et al.29 have applied wavelets for (2004).
spectra20. Their initial report computed the data reduction. Because the wavelet coefficients 13. Diamandis, E.P. Mol. Cell Proteomics 3, 367–378
coefficient of variation in mass location, sig- do not directly correspond to physical quanti- (2004).
14. Rogers, M.A. et al. Cancer Res. 63, 6971–6983
nal-to-noise ratio and normalized intensity for ties, however, they are difficult to interpret. (2003).
three protein peaks, concluding that between- We also used wavelets for noise reduction30, 15. Hu, J., et al. Brief. Funct. Genomic. Proteomic. 3, 322–
331 (2005).
laboratory reproducibility was as good as especially in concert with the use of the mean
16. Baggerly, K.A., et al. Endocr. Relat. Cancer 11, 583–584
within-laboratory reproducibility21. It is still spectrum to borrow strength across spectra31. (2004).
unclear, however, if reproducibility of the loca- Our studies benefited from a computer model 17. Villanueva, J. et al. Anal. Chem. 76, 1560–1570
(2004).
tion and intensity of a few peaks is enough to based on the physics of a mass spectrometer, 18. Koomen, J.M. et al. Rapid Commun. Mass Spectrom.
ensure the reproducibility of the entire spec- which used a simple model of the detector32. 18, 2537–1848 (2004).
trum. It is possible that more global measures In another approach, Sauve and Speed33 have 19. Johnson, K.L., et al. Anal. Chem. 76, 5097–5103
(2004).
of reproducibility (that take into account all used dynamic programming to improve calibra- 20. Grizzle, W.E. et al. Dis. Markers 19, 185–195 (2004).
or most peaks) will be required. tion across multiple spectra and morphological 21. Semmes, O.J. et al. Clin. Chem. 51, 102–112 (2005).
22. Ransohoff, D.F. Nat. Rev. Cancer 4, 309–314 (2004).
The need for better study design and experi- filters for baseline correction.
23. Petricoin, E.F. 3rd et al. J. Natl. Cancer Inst. 94, 1576–
mental design for proteomic profiling has been Elsewhere, Malyarenko et al.34 have 1578 (2002).
discussed elsewhere15,22. Epidemiologists and approached mass spectra from the vantage of 24. Conrads, T.P. et al. Endocr. Relat. Cancer 11, 163–178
(2004).
statisticians have known the necessary design time series. They obtained better estimates of 25. Fung, E.T. & Enderwick, C. Biotechniques Suppl, 34–41
principles for a long time. It often seems, how- baseline by realizing that its primary source (2002).
ever, that these principles are forgotten in the resided in the physics of the ion detector. They 26. Yasui, Y. et al. J. Biomed. Biotechnol. 2003, 242–248
(2003).
excitement that accompanies the first applica- also used deconvolution filters to smooth the 27. Yasui, Y. et al. Biostatistics 4, 449–463 (2003).
tions of a new technology. The spectrum of sub- signal and obtain better resolution of mass 28. Coombes, K.R. et al. Clin. Chem. 49, 1615–1623
(2003).
jects in the study, both with and without disease, peaks. We expect that incorporating knowledge
29. Qu, Y. et al. Biometrics 59, 143–151 (2003).
must reflect the actual population. Validation of the physical and chemical properties of mass 30. Coombes, K.R. et al. Improved peak detection and
data sets must be independent and large. Care spectrometry instruments will lead to further quantification of mass spectrometry data acquired from
surface-enhanced laser desorption and ionization by
must be taken to avoid confounding biological improvements in analytical methods. denoising spectra with the undecimated discrete wavelet
variables of interest (e.g., presence or absence transform. Proteomics, in press (2005).
of disease) with technological factors (e.g., Conclusions 31. Morris, J.S., Coombes, K.R., Koomen, J., Baggerly, K.A.
& Kobayashi, R. Bioinformatics, published online 26
sample collection or run order) that might bias Mass spectrometry profiling of the serum pro- January 2005.
the results. Appropriate measures to minimize teome remains an exciting tool with the poten- 32. Coombes, K.R., et al. Understanding the characteristics
of mass spectrometry data through the use of simulation.
confounding and bias include running blinded tial to transform medicine. As often happens
Cancer Informatics, in press.
tests, randomization and blocking. with a young technology, the spectacular claims 33. Sauve, A.C. & Speed, T.P. Normalization, baseline cor-
The current state of the art in the analysis of of early studies have not held up under closer rection and alignment of high-throughput mass spec-
trometry data Proceedings Gensips 2004, in press
serum proteome profiling experiments allows scrutiny. Nevertheless, numerous studies have (2005).
considerable room for improvement. At one provided evidence that this technology can be 34. Malyarenko, D.I. et al. Clin. Chem. 51, 65–74 (2005).

292 VOLUME 23 NUMBER 3 MARCH 2005 NATURE BIOTECHNOLOGY

You might also like