Abstract
This work presents an optimization tool that finds the optimal number of threads for multi-thread data-flow software. Threads are assumed to encapsulate parallel executable key functionalities, are connected through finite capacity queues, and require certain hardware resources. We show how a combination of measurement and calculation, based on queueing theory, leads to an algorithm that recursively determines the best combination of threads, i.e. the best configuration of the multi-thread data-flow software on a given host. The algorithm proceeds on the directed graph of a queueing network that models this software. Experiments on different machines verify our optimization approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Frigo, M., Johnson, S.G.: The design and implementation of FFTW3. Proceedings of the IEEE 93, 216–231 (2005), http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.136.7045
Püschel, M., Moura, J.M.F., Johnson, J., Padua, D., Veloso, M., Singer, B., Xiong, J., Franchetti, F., Gacic, A., Voronenko, Y., Chen, K., Johnson, R.W., Rizzolo, N.: SPIRAL: Code Generation for DSP Transforms. Proceedings of the IEEE, Special Issue on Program Generation, Optimization, and Adaptation 93(2), 232–275 (2005)
Whaley, R.C., Dongarra, J.J.: Automatically tuned linear algebra software. In: Supercomputing 1998: Proceedings of the 1998 ACM/IEEE Conference on Supercomputing (CDROM), pp. 1–27. IEEE Computer Society, Washington, DC (1998)
Osogami, T., Kato, S.: Optimizing system configurations quickly by guessing at the performance. SIGMETRICS Perform. Eval. Rev. 35(1), 145–156 (2007)
Balsamo, S., Person, V.D.N., Inverardi, P.: A review on queueing network models with finite capacity queues for software architectures performance prediction. Performance Evaluation 51(2-4), 269–288 (2003)
Jain, R.K.: The Art of Computer Systems Performance Analysis: Techniques for Experimental Design, Measurement, Simulation, and Modeling. Wiley, Chichester (1991), http://www.cse.wustl.edu/~jain/books/perfbook.htm
Zukerman, M.: Introduction to Queueing Theory and Stochastic Teletraffic Models. Zukerman (2009), http://www.ee.cityu.edu.hk/~zukerman/classnotes.pdf
Bolch, G., Greiner, S., de Meer, H., Trivedi, K.S.: Queueing Networks and Markov Chains: Modeling and Performance Evaluation with Computer Science Applications, 2nd edn. Wiley Blackwell (May 2006), http://www4.informatik.uni-erlangen.de/QNMC
Kobayashi, H., Mark, B.L.: System Modeling And Analysis - Foundations of System Performance Evaluation, 1st edn., vol. 1. Prentice-Hall, Englewood Cliffs (2008), http://www.princeton.edu/kobayashi/Book/book.html
Agresti, A.: Categorical Data Analysis, 2nd edn. Wiley-Interscience, Hoboken (2002)
Weidlich, R., Nussbaumer, M., Hlavacs, H.: “Optimization towards consolidation or throughput for multi-thread software. In: International Symposium on Parallel Architectures, Algorithms and Programming, pp. 161–168 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hlavacs, H., Nussbaumer, M. (2011). Optimization for Multi-thread Data-Flow Software. In: Thomas, N. (eds) Computer Performance Engineering. EPEW 2011. Lecture Notes in Computer Science, vol 6977. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24749-1_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-24749-1_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24748-4
Online ISBN: 978-3-642-24749-1
eBook Packages: Computer ScienceComputer Science (R0)