Downloads & Free Reading Options - Results

DTIC AD1003134: FRPA: A Framework For Recursive Parallel Algorithms by Defense Technical Information Center

Read "DTIC AD1003134: FRPA: A Framework For Recursive Parallel Algorithms" by Defense Technical Information Center through these free online access and download options.

Books Results

Source: The Internet Archive

The Internet Archive Search Results

Books available for download and borrowing from The Internet Archive

1. DTIC AD1003134: FRPA: A Framework For Recursive Parallel Algorithms

By Defense Technical Information Center

Recursion continues to play an important role in high-performance computing. However, parallelizing recursive algorithms while achieving high performance is nontrivial and can result in complex, hard-to-maintain code. In particular, assigning processors to subproblems is complicated by recent observations that communication costs often dominate computation costs. Previous work [1][3] demonstrates that carefully choosing which divide-and-conquer steps to execute in parallel (breadth-first steps) and which to execute sequentially (depth-first steps) can result in significant performance gains over naive scheduling. Our Framework for Recursive Parallel Algorithms (FRPA) allows for the separation of an algorithm's implementation from its parallelization. The programmer must simply define how to split a problem, solve the base case, and merge solved subproblems; FRPA handles parallelizing the code and tuning the recursive parallelization strategy, enabling algorithms to achieve high performance. To demonstrate FRPA's performance capabilities, we present a detailed analysis of two algorithms: Strassen-Winograd [1] and Communication-Optimal Parallel Recursive Rectangular Matrix Multiplication (CARMA) [3]. Our single-precision CARMA implementation is fewer than 80 lines of code and achieves a speedup of up to 11x over the matrix multiplication routine in Intel's Math Kernel Library (MKL) [4] on skinny matrices. Our double-precision Strassen-Winograd implementation, at just 150 lines of code, is up to 45% faster than MKL for large square matrix multiplications. To show FRPA's generality and simplicity, we implement six additional algorithms: mergesort, quicksort, TRSM, SYRK, Cholesky decomposition, and Delaunay triangulation [5]. FRPA is implemented in C++, runs in shared-memory environments, uses Intel's Cilk Plus [6] for task-based parallelism, and leverages OpenTuner [7] to tune the parallelization strategy.
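
The split / base case / merge separation described in the abstract can be illustrated with a minimal C++ sketch. This is not FRPA's actual API: `SortProblem`, `solve`, and the `bfs_levels` parameter are hypothetical names chosen for illustration, `std::async` stands in for Cilk Plus, and the number of parallel levels is passed by hand rather than tuned with OpenTuner. As a further simplification, the breadth-first (parallel) steps are assumed to be the top `bfs_levels` levels of the recursion, with depth-first (sequential) steps below them.

```cpp
#include <cassert>
#include <cstddef>
#include <future>
#include <utility>
#include <vector>

// Hypothetical problem description (not FRPA's real interface): mergesort
// expressed only as split / base case / merge, with no parallel code.
struct SortProblem {
    std::vector<int> data;

    bool is_base_case() const { return data.size() <= 1; }

    // Base case: a vector of size 0 or 1 is already sorted.
    std::vector<int> solve_base() const { return data; }

    // Split the input into a left half and a right half.
    std::pair<SortProblem, SortProblem> split() const {
        const auto mid = data.begin() + static_cast<std::ptrdiff_t>(data.size() / 2);
        return {SortProblem{std::vector<int>(data.begin(), mid)},
                SortProblem{std::vector<int>(mid, data.end())}};
    }

    // Merge two sorted halves into one sorted vector.
    static std::vector<int> merge(const std::vector<int>& a, const std::vector<int>& b) {
        std::vector<int> out;
        out.reserve(a.size() + b.size());
        std::size_t i = 0, j = 0;
        while (i < a.size() && j < b.size()) out.push_back(a[i] <= b[j] ? a[i++] : b[j++]);
        while (i < a.size()) out.push_back(a[i++]);
        while (j < b.size()) out.push_back(b[j++]);
        return out;
    }
};

// Generic driver: the first `bfs_levels` recursion levels solve both
// subproblems concurrently (breadth-first steps); deeper levels solve them
// one after the other (depth-first steps).
template <typename Problem>
auto solve(const Problem& p, int bfs_levels) -> decltype(p.solve_base()) {
    assert(bfs_levels >= 0);
    if (p.is_base_case()) return p.solve_base();
    auto parts = p.split();
    if (bfs_levels > 0) {
        // Breadth-first step: the left half runs on another thread.
        auto left = std::async(std::launch::async, [&parts, bfs_levels] {
            return solve(parts.first, bfs_levels - 1);
        });
        auto right = solve(parts.second, bfs_levels - 1);
        return Problem::merge(left.get(), right);
    }
    // Depth-first step: both halves run sequentially on this thread.
    auto left = solve(parts.first, 0);
    auto right = solve(parts.second, 0);
    return Problem::merge(left, right);
}
```

Under these assumptions, `solve(SortProblem{{5, 2, 9, 1}}, 2)` runs the top two recursion levels in parallel, and `solve(SortProblem{{5, 2, 9, 1}}, 0)` runs the same algorithm fully sequentially; the problem definition is identical in both cases, which is the separation the abstract describes.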

“DTIC AD1003134: FRPA: A Framework For Recursive Parallel Algorithms” Metadata:

  • Title: ➤  DTIC AD1003134: FRPA: A Framework For Recursive Parallel Algorithms
  • Author: ➤  Defense Technical Information Center
  • Language: English

Downloads Information:

The book is available for download in "texts" format. The files total 20.22 MB, have been downloaded 63 times, and went public on Fri Jan 24 2020.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR
