skip to main content
10.1145/301104.301111acmconferencesArticle/Chapter ViewAbstractPublication PagesppoppConference Proceedingsconference-collections
Article
Free access

Automatic parallelization of divide and conquer algorithms

Published: 01 May 1999 Publication History

Abstract

Divide and conquer algorithms are a good match for modern parallel machines: they tend to have large amounts of inherent parallelism and they work well with caches and deep memory hierarchies. But these algorithms pose challenging problems for parallelizing compilers. They are usually coded as recursive procedures and often use pointers into dynamically allocated memory blocks and pointer arithmetic. All of these features are incompatible with the analysis algorithms in traditional parallelizing compilers.This paper presents the design and implementation of a compiler that is designed to parallelize divide and conquer algorithms whose subproblems access disjoint regions of dynamically allocated arrays. The foundation of the compiler is a flow-sensitive, context-sensitive, and interprocedural pointer analysis algorithm. A range of symbolic analysis algorithms build on the pointer analysis information to extract symbolic bounds for the memory regions accessed by (potentially recursive) procedures that use pointers and pointer arithmetic. The symbolic bounds information allows the compiler to find procedure calls that can execute in parallel without violating the data dependences. The compiler generates code that executes these calls in parallel. We have used the compiler to parallelize several programs that use divide and conquer algorithms. Our results show that the programs perform well and exhibit good speedup.

References

[1]
S. Amarasinghe, J. Anderson, M. Lain, and C. Tseng. The SUIF compiler for scalable parallel machines. In Proceedings of the Eighth SIAM Conference on Parallel Processing for Scientific Computing, February 1995.
[2]
W. Blume and R. Eigenmann. Symbolic range propagation. In Proceedings of the 9th International Parallel Processing Symposium, pages 357-363, Santa Barbara, CA, April 1995. IEEE Computer Society Press, Los Alamitos, Calif.
[3]
R. Blumofe, C. Joerg, B. Kuszmaul, C. Leiserson, K. Randall, and Y. Zhou. Cilk: An efficient multithreaded runtime system. In Proceedings of the 5th A CM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Santa Barbara, CA, July 1995. ACM, New York.
[4]
D. Chase, M. Wegman, and F. Zadek. Analysis of pointers and structures. In Proceedings of the SIGPLAN '90 Conference on Program Language Design and Implementation, pages 296-310, White Plains, NY, June 1990. ACM, New York.
[5]
T.H. Cormen, C.E. Leiserson, and R.L. Rivest. Introductions to Algorithms. The MIT Press, Cambridge, Mass., Cambridge, MA, 1990.
[6]
M. Emami, R. Ghiya, and L. Hendren. Contextsensitive interprocedural points-to analysis in the presence of function pointers. In Proceedings of the SIC- PLAN '94 Conference on Program Language Design and Implementation, pages 242-256, Orlando, FL, June 1994. ACM, New York.
[7]
J. Frens and D. Wise. Auto-blocking matrixmultiplication or tracking BLAS3 performance from source code. In Proceedings of the 6th A CM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Las Vegas, NV, June 1997.
[8]
graph? a shape analysis for heap-directed pointers in C. In Proceedings of the gJrd Annual A CM Symposium on the Principles of Programming Languages, pages 1-15, January 1996.
[9]
V. Guarna. A technique for analyzing pointer and structure references in parallel restructuring compilers. Proceedings of the 1988 international Conference on Parallel Processing, August 1988.
[10]
M. Haghighat and C. Polychronopoulos. Symbolic analysis: A basis for parallelization, optimization, and scheduling of programs. In Proceedings " the ... Workshop on Languages and Compilers for Parallel Computing, Portland, OR, August 1993.
[11]
M.W. Hall, S.P. Amarasinghe, B.tt. Murphy, S. Liao, #,#rl {# q T.#,m T"h.-,*,a,-t;r,-r,-r,#,vc,#.,-r.-#;r,-,#vallollc#,#,#_ #--#,A,LVL.#.;. a.,fl.#aJJ. J#,.,v,.,.,v,tu.6 #.,vl, l#l,l.=l#.,--#,l, lt,l#a,t }#ts.#z. casjc#asos-s #L#- ing an interprocedural parallelizing compiler. In Proceedings of Supereomputing '95, San Diego, CA, December 1995. IEEE Computer Society Press, Los Alamitos, Calif.
[12]
P. Havlak and K. Kennedy. An implementation of interprocedural bounded regular section analysis. 1#'b'1#" "l#nsactions on Parallel and Distributed Systems, 2(3):350-360, July 1991.
[13]
L. Hendren and A. Nicolau. Parallelizing programs with recursive data structures. IEEE Transactions on Paraltet and Distributed " .... 1 "# ... #- Jwtuaxy #u. oys#emn, x ) :oo-, t, 1
[14]
J. Larus and P. Hilfinger. Detecting conflicts between structure accesses. In Proceedings of the SIGPLAN '88 Conference on Program Language Design and Imple-
[15]
E. Molar, D. Kranz, and R. Halstead. Lazy task creation: A technique for increasing the granularity of parallel programs. In Proceedings of the 1990 A CM Conference on Lisp and Functional Programming, pages 185-- 197. ACM, New York, June 1990.
[16]
r1#1 # #1^^,. XX T.T.II .--.1 1:) #,f .. k. T).A;.#.A ..#. L.m,u.i .#. #.v,uuu, #vx. xxa., #,a u. ;v-uxl.,.,. . x#.,.o,w,#, (,.#.#y data-flow analysis for run-time paraUelization. In Proceedings of the 1998 A CM International Conference on Supercomputing, Melbourne, Australia, July 1993.
[17]
M l:/innrri and p F/isis. #Amrn,tat.ivity analvri.#" h new framework for parallelizing compilers. In Proceedings of the SIGPLAN '96 Conference on Program Language Design and Implementation, pages 54--67, Philadelphia, PA, May 1996. ACM, New York.
[18]
R. Rugina and M. Rinard. Pointer analysis for multithreaded programs, in Proceedings of the SIGPLAN '99 Conference on t'rogram Language ues;gn and #mplementation, Atlanta, GA, May 1999.
[19]
M. Sagiv, T. Reps, and R. Wilhelm. Solving shapeanalysis problems in languages with destructive updatand Systems, 20(1)'1-50, January 1998.
[20]
R,. T#rioieL F. irigoin, and P. Feautrier. Direct parallelization of CALL statements. In Proceedings of the ~ _q!GPLAN '86 Symposium on Compiler Construction, Palo Alto, CA, June 1986.
[21]
D 1XT:I ... .1 x i r ___ r#m __. .. L-#j #. vv-,ou,. #,u #,.,#m, -.mcm,# context-sensitive pointer analysis for C programs. In Proceedings of the SIGPLAN '95 Conference on Program Language De. sign and Implementation, La Jolla, CA, June~ 1995. ACM, New York.

Cited By

View all
  • (2024)Equivalence by Canonicalization for Synthesis-Backed RefactoringProceedings of the ACM on Programming Languages10.1145/36564538:PLDI(1879-1904)Online publication date: 20-Jun-2024
  • (2024)Scaling Type-Based Points-to Analysis with SaturationProceedings of the ACM on Programming Languages10.1145/36564178:PLDI(990-1013)Online publication date: 20-Jun-2024
  • (2024)When Is Parallelism Fearless and Zero-Cost with Rust?Proceedings of the 36th ACM Symposium on Parallelism in Algorithms and Architectures10.1145/3626183.3659966(27-40)Online publication date: 17-Jun-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
PPoPP '99: Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming
May 1999
192 pages
ISBN:1581131003
DOI:10.1145/301104
  • Chairmen:
  • Marc Snir,
  • Andrew A. Chien
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 May 1999

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

PPoPP99
Sponsor:

Acceptance Rates

PPoPP '99 Paper Acceptance Rate 17 of 79 submissions, 22%;
Overall Acceptance Rate 230 of 1,014 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)66
  • Downloads (Last 6 weeks)9
Reflects downloads up to 24 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Equivalence by Canonicalization for Synthesis-Backed RefactoringProceedings of the ACM on Programming Languages10.1145/36564538:PLDI(1879-1904)Online publication date: 20-Jun-2024
  • (2024)Scaling Type-Based Points-to Analysis with SaturationProceedings of the ACM on Programming Languages10.1145/36564178:PLDI(990-1013)Online publication date: 20-Jun-2024
  • (2024)When Is Parallelism Fearless and Zero-Cost with Rust?Proceedings of the 36th ACM Symposium on Parallelism in Algorithms and Architectures10.1145/3626183.3659966(27-40)Online publication date: 17-Jun-2024
  • (2021)An order-aware dataflow model for parallel Unix pipelinesProceedings of the ACM on Programming Languages10.1145/34735705:ICFP(1-28)Online publication date: 19-Aug-2021
  • (2020)Deriving parametric multi-way recursive divide-and-conquer dynamic programming algorithms using polyhedral compilersProceedings of the 18th ACM/IEEE International Symposium on Code Generation and Optimization10.1145/3368826.3377916(317-329)Online publication date: 22-Feb-2020
  • (2019)Safe automated refactoring for intelligent parallelization of Java 8 streamsProceedings of the 41st International Conference on Software Engineering10.1109/ICSE.2019.00072(619-630)Online publication date: 25-May-2019
  • (2018)Soft document clustering using a novel graph covering approachBioData Mining10.1186/s13040-018-0172-x11:1Online publication date: 14-Jun-2018
  • (2018)LernaProceedings of the 11th ACM International Systems and Storage Conference10.1145/3211890.3211897(37-48)Online publication date: 4-Jun-2018
  • (2018)Towards a compiler analysis for parallel algorithmic skeletonsProceedings of the 27th International Conference on Compiler Construction10.1145/3178372.3179513(174-184)Online publication date: 24-Feb-2018
  • (2018)Distributed Memory Futures for Compile-Time, Deterministic-by-Default Concurrency in Distributed C++ Applications2018 IEEE/ACM 4th International Workshop on Extreme Scale Programming Models and Middleware (ESPM2)10.1109/ESPM2.2018.00004(1-8)Online publication date: Nov-2018
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media