Downloads & Free Reading Options - Results

Multiprocessors by Tabak%2c Daniel%2c 1934

Read "Multiprocessors" by Tabak%2c Daniel%2c 1934 through these free online access and download options.

Search for Downloads

Search by Title or Author

Books Results

Source: The Internet Archive

The internet Archive Search Results

Available books for downloads and borrow from The internet Archive

1Energy And Time Efficient Scheduling Of Tasks With Dependencies On Asymmetric Multiprocessors

By

In this work we study the problem of scheduling tasks with dependencies in multiprocessor architectures where processors have different speeds. We present the preemptive algorithm "Save-Energy" that given a schedule of tasks it post processes it to improve the energy efficiency without any deterioration of the makespan. In terms of time efficiency, we show that preemptive scheduling in an asymmetric system can achieve the same or better optimal makespan than in a symmetric system. Motivited by real multiprocessor systems, we investigate architectures that exhibit limited asymmetry: there are two essentially different speeds. Interestingly, this special case has not been studied in the field of parallel computing and scheduling theory; only the general case was studied where processors have $K$ essentially different speeds. We present the non-preemptive algorithm ``Remnants'' that achieves almost optimal makespan. We provide a refined analysis of a recent scheduling method. Based on this analysis, we specialize the scheduling policy and provide an algorithm of $(3 + o(1))$ expected approximation factor. Note that this improves the previous best factor (6 for two speeds). We believe that our work will convince researchers to revisit this well studied scheduling problem for these simple, yet realistic, asymmetric multiprocessor architectures.

“Energy And Time Efficient Scheduling Of Tasks With Dependencies On Asymmetric Multiprocessors” Metadata:

  • Title: ➤  Energy And Time Efficient Scheduling Of Tasks With Dependencies On Asymmetric Multiprocessors
  • Authors:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 8.75 Mbs, the file-s for this book were downloaded 74 times, the file-s went public at Sat Jul 20 2013.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVu - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find Energy And Time Efficient Scheduling Of Tasks With Dependencies On Asymmetric Multiprocessors at online marketplaces:


2The Impact Of Memory Models On Software Reliability In Multiprocessors

By

The memory consistency model is a fundamental system property characterizing a multiprocessor. The relative merits of strict versus relaxed memory models have been widely debated in terms of their impact on performance, hardware complexity and programmability. This paper adds a new dimension to this discussion: the impact of memory models on software reliability. By allowing some instructions to reorder, weak memory models may expand the window between critical memory operations. This can increase the chance of an undesirable thread-interleaving, thus allowing an otherwise-unlikely concurrency bug to manifest. To explore this phenomenon, we define and study a probabilistic model of shared-memory parallel programs that takes into account such reordering. We use this model to formally derive bounds on the \emph{vulnerability} to concurrency bugs of different memory models. Our results show that for 2 (or a small constant number of) concurrent threads, weaker memory models do indeed have a higher likelihood of allowing bugs. On the other hand, we show that as the number of parallel threads increases, the gap between the different memory models becomes proportionally insignificant. This suggests the counter-intuitive rule that \emph{as the number of parallel threads in the system increases, the importance of using a strict memory model diminishes}; which potentially has major implications on the choice of memory consistency models in future multi-core systems.

“The Impact Of Memory Models On Software Reliability In Multiprocessors” Metadata:

  • Title: ➤  The Impact Of Memory Models On Software Reliability In Multiprocessors
  • Authors:
  • Language: English

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 11.97 Mbs, the file-s for this book were downloaded 74 times, the file-s went public at Sun Sep 22 2013.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVu - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find The Impact Of Memory Models On Software Reliability In Multiprocessors at online marketplaces:


3NASA Technical Reports Server (NTRS) 19940006188: Low Latency Messages On Distributed Memory Multiprocessors

By

Many of the issues in developing an efficient interface for communication on distributed memory machines are described and a portable interface is proposed. Although the hardware component of message latency is less than one microsecond on many distributed memory machines, the software latency associated with sending and receiving typed messages is on the order of 50 microseconds. The reason for this imbalance is that the software interface does not match the hardware. By changing the interface to match the hardware more closely, applications with fine grained communication can be put on these machines. Based on several tests that were run on the iPSC/860, an interface that will better match current distributed memory machines is proposed. The model used in the proposed interface consists of a computation processor and a communication processor on each node. Communication between these processors and other nodes in the system is done through a buffered network. Information that is transmitted is either data or procedures to be executed on the remote processor. The dual processor system is better suited for efficiently handling asynchronous communications compared to a single processor system. The ability to send data or procedure is very flexible for minimizing message latency, based on the type of communication being performed. The test performed and the proposed interface are described.

“NASA Technical Reports Server (NTRS) 19940006188: Low Latency Messages On Distributed Memory Multiprocessors” Metadata:

  • Title: ➤  NASA Technical Reports Server (NTRS) 19940006188: Low Latency Messages On Distributed Memory Multiprocessors
  • Author: ➤  
  • Language: English

“NASA Technical Reports Server (NTRS) 19940006188: Low Latency Messages On Distributed Memory Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 29.34 Mbs, the file-s for this book were downloaded 86 times, the file-s went public at Tue Oct 04 2016.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find NASA Technical Reports Server (NTRS) 19940006188: Low Latency Messages On Distributed Memory Multiprocessors at online marketplaces:


4DTIC ADA232861: Performance Effects Of Irregular Communications Patterns On Massively Parallel Multiprocessors

By

We conduct a detailed study of the performance effects of irregular communications patterns on the CM-2. We characterized the communications capabilities of the CM-2 under a variety of controlled conditions. In the process of carrying out our performance evaluation, we develop and make extensive use of a parameterized synthetic mesh. In addition we carry out timings with unstructured meshes generated for aerodynamic codes and a set of sparse matrices with banded patterns of non-zeros. This benchmarking suite stresses the communications capabilities of the CM-2 in a range of different ways. Our benchmark results demonstrate that it is possible to make effective use of much of the massive concurrency available in the communications network.

“DTIC ADA232861: Performance Effects Of Irregular Communications Patterns On Massively Parallel Multiprocessors” Metadata:

  • Title: ➤  DTIC ADA232861: Performance Effects Of Irregular Communications Patterns On Massively Parallel Multiprocessors
  • Author: ➤  
  • Language: English

“DTIC ADA232861: Performance Effects Of Irregular Communications Patterns On Massively Parallel Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 19.04 Mbs, the file-s for this book were downloaded 59 times, the file-s went public at Thu Mar 01 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA232861: Performance Effects Of Irregular Communications Patterns On Massively Parallel Multiprocessors at online marketplaces:


5Scalable Shared Memory Multiprocessors

We conduct a detailed study of the performance effects of irregular communications patterns on the CM-2. We characterized the communications capabilities of the CM-2 under a variety of controlled conditions. In the process of carrying out our performance evaluation, we develop and make extensive use of a parameterized synthetic mesh. In addition we carry out timings with unstructured meshes generated for aerodynamic codes and a set of sparse matrices with banded patterns of non-zeros. This benchmarking suite stresses the communications capabilities of the CM-2 in a range of different ways. Our benchmark results demonstrate that it is possible to make effective use of much of the massive concurrency available in the communications network.

“Scalable Shared Memory Multiprocessors” Metadata:

  • Title: ➤  Scalable Shared Memory Multiprocessors
  • Language: English

“Scalable Shared Memory Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 668.95 Mbs, the file-s for this book were downloaded 10 times, the file-s went public at Sat Oct 21 2023.

Available formats:
ACS Encrypted PDF - Cloth Cover Detection Log - DjVuTXT - Djvu XML - Dublin Core - EPUB - Item Tile - JPEG Thumb - LCP Encrypted EPUB - LCP Encrypted PDF - Log - MARC - MARC Binary - Metadata - OCR Page Index - OCR Search Text - PNG - Page Numbers JSON - RePublisher Final Processing Log - RePublisher Initial Processing Log - Scandata - Single Page Original JP2 Tar - Single Page Processed JP2 ZIP - Text PDF - Title Page Detection Log - chOCR - hOCR -

Related Links:

Online Marketplaces

Find Scalable Shared Memory Multiprocessors at online marketplaces:


6NASA Technical Reports Server (NTRS) 19830010018: Queueing Analysis Of A Canonical Model Of Real-time Multiprocessors

By

A logical classification of multiprocessor structures from the point of view of control applications is presented. A computation of the response time distribution for a canonical model of a real time multiprocessor is presented. The multiprocessor is approximated by a blocking model. Two separate models are derived: one created from the system's point of view, and the other from the point of view of an incoming task.

“NASA Technical Reports Server (NTRS) 19830010018: Queueing Analysis Of A Canonical Model Of Real-time Multiprocessors” Metadata:

  • Title: ➤  NASA Technical Reports Server (NTRS) 19830010018: Queueing Analysis Of A Canonical Model Of Real-time Multiprocessors
  • Author: ➤  
  • Language: English

“NASA Technical Reports Server (NTRS) 19830010018: Queueing Analysis Of A Canonical Model Of Real-time Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 19.90 Mbs, the file-s for this book were downloaded 53 times, the file-s went public at Wed Aug 17 2016.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find NASA Technical Reports Server (NTRS) 19830010018: Queueing Analysis Of A Canonical Model Of Real-time Multiprocessors at online marketplaces:


7DTIC ADA267484: Optimal Cube-Connected Cube Multiprocessors

By

Many CFD (computational fluid dynamics) and other scientific applications can be partitioned into subproblems. However, in general the partitioned subproblems are very large. They demand high performance computing power themselves, and the solutions of the subproblems have to be combined at each time step. In this paper, the cubeconnect cube (CCCube) architecture is studied. The CCCube architecture is an extended hypercube structure with each node represented as a cube. It requires fewer physical links between nodes than the hypercube, and provides the same communication support as the hypercube does on many applications. The reduced physical links can be used to enhance the bandwidth of the remanding links and, therefore, enhance the overall performance. The concept and the method to obtain optimal CCCubes, which are the CCCubes with a minimum number of links under a given total number of nodes, are proposed. The superiority of optimal CCCubes over standard hypercubes has also been shown in terms of the link usage in the embedding of a binomial tree. A useful computation structure based on a semi-binomial tree for divide-and- conquer type of parallel algorithms has been identified. We have shown that this structure can be implemented in optimal CCCubes without performance degradation compared with regular hypercubes. The result presented in this paper should provide a useful approach to design of scientific parallel computers.... Parallel processing, Parallel architectures, Hypercube, Cubeconnected cube, Optimal cube-connected cube, Divide-and-conquer paradigm, CFD applications.

“DTIC ADA267484: Optimal Cube-Connected Cube Multiprocessors” Metadata:

  • Title: ➤  DTIC ADA267484: Optimal Cube-Connected Cube Multiprocessors
  • Author: ➤  
  • Language: English

“DTIC ADA267484: Optimal Cube-Connected Cube Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 10.79 Mbs, the file-s for this book were downloaded 73 times, the file-s went public at Sun Mar 11 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA267484: Optimal Cube-Connected Cube Multiprocessors at online marketplaces:


8DTIC ADA161552: A Performance Analysis Of Multiprocessors Using Two-Level Caches.

By

This thesis proposes a two-level cache organization for multiprocessors. The first level of cache consists of a private cache per processor. The second level of caches is shared by all processors. The main memory is also similarly shared. A cache coherence solution is proposed for such an organization. The performance of the proposed multi-processor is evaluated with analytical methods. The factors that affect the performance are quantitatively discussed. A variation of the proposed coherence algorithm is presented to improve the performance. Keywords: High reliability; Cache memories; Mathematical analysis. (Author)

“DTIC ADA161552: A Performance Analysis Of Multiprocessors Using Two-Level Caches.” Metadata:

  • Title: ➤  DTIC ADA161552: A Performance Analysis Of Multiprocessors Using Two-Level Caches.
  • Author: ➤  
  • Language: English

“DTIC ADA161552: A Performance Analysis Of Multiprocessors Using Two-Level Caches.” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 34.50 Mbs, the file-s for this book were downloaded 48 times, the file-s went public at Sun Feb 04 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA161552: A Performance Analysis Of Multiprocessors Using Two-Level Caches. at online marketplaces:


9Multiprocessors System Of Realization Binaural Hearing With A Array Of Microphones

By

In article the variant of practical realization of system binaural hearing inherent is offered to the person and many representatives of fauna. Properties binaural hearing are realized by operations above images of the acoustic signals received from a array of microphones. Realization assumes the coherent, synchronized work of 3 microprocessors which are carrying out reading of signals from 4 microphones through two 16 digit ADC with frequency of digitization 100 kHz on each channel with synchronization between channels with accuracy 5 uS. In realized the Flash card in capacity 2 GB and the microprocessor realizing interface USB from the personal COMPUTER is included. Realization provides independent work of system and together with the personal COMPUTER. The system has properties not inherent in hearing of the person – zero time of adaptation at transition from a strong signal to weak and on the contrary, localization of very short signals.

“Multiprocessors System Of Realization Binaural Hearing With A Array Of Microphones” Metadata:

  • Title: ➤  Multiprocessors System Of Realization Binaural Hearing With A Array Of Microphones
  • Author:
  • Language: rus

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 6.08 Mbs, the file-s for this book were downloaded 26 times, the file-s went public at Fri Mar 15 2024.

Available formats:
Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find Multiprocessors System Of Realization Binaural Hearing With A Array Of Microphones at online marketplaces:


10New Analytic Models For Multiprocessors With Various Interconnection Structures

By

Book Source: Digital Library of India Item 2015.191727 dc.contributor.author: Adarshpal Singh Sethi dc.date.accessioned: 2015-07-08T01:09:06Z dc.date.available: 2015-07-08T01:09:06Z dc.date.digitalpublicationdate: 2005-08-20 dc.identifier.barcode: 1990010090218 dc.identifier.origpath: /rawdataupload/upload/0090/218 dc.identifier.copyno: 1 dc.identifier.uri: http://www.new.dli.ernet.in/handle/2015/191727 dc.description.scannerno: 12 dc.description.scanningcentre: IIIT, Allahabad dc.description.main: 1 dc.description.tagged: 0 dc.description.totalpages: 102 dc.format.mimetype: application/pdf dc.language.iso: English dc.rights: Out_of_copyright dc.source.library: Indian Institute Of Technology Kanpur dc.subject.classification: Technology dc.subject.classification: Engineering. Technology In General dc.subject.classification: Computer Science Engineering dc.title: New Analytic Models For Multiprocessors With Various Interconnection Structures

“New Analytic Models For Multiprocessors With Various Interconnection Structures” Metadata:

  • Title: ➤  New Analytic Models For Multiprocessors With Various Interconnection Structures
  • Author:
  • Language: English

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 98.56 Mbs, the file-s for this book were downloaded 144 times, the file-s went public at Sun Jan 22 2017.

Available formats:
Abbyy GZ - Additional Text PDF - Archive BitTorrent - DjVuTXT - Djvu XML - Image Container PDF - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP -

Related Links:

Online Marketplaces

Find New Analytic Models For Multiprocessors With Various Interconnection Structures at online marketplaces:


11NASA Technical Reports Server (NTRS) 19970009334: Impact Of Load Balancing On Unstructured Adaptive Grid Computations For Distributed-Memory Multiprocessors

By

The computational requirements for an adaptive solution of unsteady problems change as the simulation progresses. This causes workload imbalance among processors on a parallel machine which, in turn, requires significant data movement at runtime. We present a new dynamic load-balancing framework, called JOVE, that balances the workload across all processors with a global view. Whenever the computational mesh is adapted, JOVE is activated to eliminate the load imbalance. JOVE has been implemented on an IBM SP2 distributed-memory machine in MPI for portability. Experimental results for two model meshes demonstrate that mesh adaption with load balancing gives more than a sixfold improvement over one without load balancing. We also show that JOVE gives a 24-fold speedup on 64 processors compared to sequential execution.

“NASA Technical Reports Server (NTRS) 19970009334: Impact Of Load Balancing On Unstructured Adaptive Grid Computations For Distributed-Memory Multiprocessors” Metadata:

  • Title: ➤  NASA Technical Reports Server (NTRS) 19970009334: Impact Of Load Balancing On Unstructured Adaptive Grid Computations For Distributed-Memory Multiprocessors
  • Author: ➤  
  • Language: English

“NASA Technical Reports Server (NTRS) 19970009334: Impact Of Load Balancing On Unstructured Adaptive Grid Computations For Distributed-Memory Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 18.02 Mbs, the file-s for this book were downloaded 58 times, the file-s went public at Thu Oct 13 2016.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVuTXT - Djvu XML - JPEG Thumb - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find NASA Technical Reports Server (NTRS) 19970009334: Impact Of Load Balancing On Unstructured Adaptive Grid Computations For Distributed-Memory Multiprocessors at online marketplaces:


12NASA Technical Reports Server (NTRS) 19880003553: Principles For Problem Aggregation And Assignment In Medium Scale Multiprocessors

By

One of the most important issues in parallel processing is the mapping of workload to processors. This paper considers a large class of problems having a high degree of potential fine grained parallelism, and execution requirements that are either not predictable, or are too costly to predict. The main issues in mapping such a problem onto medium scale multiprocessors are those of aggregation and assignment. We study a method of parameterized aggregation that makes few assumptions about the workload. The mapping of aggregate units of work onto processors is uniform, and exploits locality of workload intensity to balance the unknown workload. In general, a finer aggregate granularity leads to a better balance at the price of increased communication/synchronization costs; the aggregation parameters can be adjusted to find a reasonable granularity. The effectiveness of this scheme is demonstrated on three model problems: an adaptive one-dimensional fluid dynamics problem with message passing, a sparse triangular linear system solver on both a shared memory and a message-passing machine, and a two-dimensional time-driven battlefield simulation employing message passing. Using the model problems, the tradeoffs are studied between balanced workload and the communication/synchronization costs. Finally, an analytical model is used to explain why the method balances workload and minimizes the variance in system behavior.

“NASA Technical Reports Server (NTRS) 19880003553: Principles For Problem Aggregation And Assignment In Medium Scale Multiprocessors” Metadata:

  • Title: ➤  NASA Technical Reports Server (NTRS) 19880003553: Principles For Problem Aggregation And Assignment In Medium Scale Multiprocessors
  • Author: ➤  
  • Language: English

“NASA Technical Reports Server (NTRS) 19880003553: Principles For Problem Aggregation And Assignment In Medium Scale Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 46.83 Mbs, the file-s for this book were downloaded 55 times, the file-s went public at Mon Sep 19 2016.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find NASA Technical Reports Server (NTRS) 19880003553: Principles For Problem Aggregation And Assignment In Medium Scale Multiprocessors at online marketplaces:


13DTIC ADA591420: An Optimal Real-Time Voltage And Frequency Scaling For Uniform Multiprocessors

By

One of the most important issues in parallel processing is the mapping of workload to processors. This paper considers a large class of problems having a high degree of potential fine grained parallelism, and execution requirements that are either not predictable, or are too costly to predict. The main issues in mapping such a problem onto medium scale multiprocessors are those of aggregation and assignment. We study a method of parameterized aggregation that makes few assumptions about the workload. The mapping of aggregate units of work onto processors is uniform, and exploits locality of workload intensity to balance the unknown workload. In general, a finer aggregate granularity leads to a better balance at the price of increased communication/synchronization costs; the aggregation parameters can be adjusted to find a reasonable granularity. The effectiveness of this scheme is demonstrated on three model problems: an adaptive one-dimensional fluid dynamics problem with message passing, a sparse triangular linear system solver on both a shared memory and a message-passing machine, and a two-dimensional time-driven battlefield simulation employing message passing. Using the model problems, the tradeoffs are studied between balanced workload and the communication/synchronization costs. Finally, an analytical model is used to explain why the method balances workload and minimizes the variance in system behavior.

“DTIC ADA591420: An Optimal Real-Time Voltage And Frequency Scaling For Uniform Multiprocessors” Metadata:

  • Title: ➤  DTIC ADA591420: An Optimal Real-Time Voltage And Frequency Scaling For Uniform Multiprocessors
  • Author: ➤  
  • Language: English

“DTIC ADA591420: An Optimal Real-Time Voltage And Frequency Scaling For Uniform Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 14.90 Mbs, the file-s for this book were downloaded 54 times, the file-s went public at Mon Sep 17 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA591420: An Optimal Real-Time Voltage And Frequency Scaling For Uniform Multiprocessors at online marketplaces:


14DTIC ADA272838: The Effects Of Block Size On The Performance Of Coherent Caches In Shared-Memory Multiprocessors

By

Several studies have shown that the performance of coherent caches depends on the relationship between the cache block size and the granularity of sharing and locality exhibited by the program. Large cache blocks exploit processor and spatial locality, but may cause unnecessary cache invalidations due to false sharing. Small cache blocks can reduce the number of cache invalidation, but increase the number of bus or network transactions required to load data into the cache. In this dissertation we use reference traces from a variety of parallel programs and detailed simulation of a scalable shared-memory multiprocessor to examine the effects of cache block size on the performance of coherent caches and quantify this impact with respect to the network bandwidth and latency. Our results suggest that, regardless of the available bandwidth or latency, applications with good spatial locality favor long cache lines, and for these applications the relative benefits of longer cache lines increase with the bandwidth and latency.

“DTIC ADA272838: The Effects Of Block Size On The Performance Of Coherent Caches In Shared-Memory Multiprocessors” Metadata:

  • Title: ➤  DTIC ADA272838: The Effects Of Block Size On The Performance Of Coherent Caches In Shared-Memory Multiprocessors
  • Author: ➤  
  • Language: English

“DTIC ADA272838: The Effects Of Block Size On The Performance Of Coherent Caches In Shared-Memory Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 81.14 Mbs, the file-s for this book were downloaded 84 times, the file-s went public at Tue Mar 13 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA272838: The Effects Of Block Size On The Performance Of Coherent Caches In Shared-Memory Multiprocessors at online marketplaces:


15DTIC ADA300197: Exploiting Bandwidth To Reduce Average Memory Access Time In Scalable Multiprocessors.

By

The overhead of remote memory accesses is a major impediment to achieving good application performance on scalable shared-memory multiprocessors. This dissertation explores ways in which to exploit network and memory bandwidth in order to reduce the average cost of memory accesses. We consider scenarios (1) where the remote access cost is dominated by contention, and (2) where the hardware provides abundant band- width and the remote access time is dominated by the unsaturated request/access/reply sequence of operations. We introduce and evaluate two techniques for increasing the effective bandwidth available to processors, software interleaving and eager combining. We also evaluate strategies for hiding the high cost of remote accesses, including several forms of prefetching and update-based coherence protocols. We use both analytic models and detailed simulations of multiprocessor systems to quantify the effectiveness of these techniques, and to provide insight into the potential and limitations of exploiting bandwidth to reduce average memory access cost.

“DTIC ADA300197: Exploiting Bandwidth To Reduce Average Memory Access Time In Scalable Multiprocessors.” Metadata:

  • Title: ➤  DTIC ADA300197: Exploiting Bandwidth To Reduce Average Memory Access Time In Scalable Multiprocessors.
  • Author: ➤  
  • Language: English

“DTIC ADA300197: Exploiting Bandwidth To Reduce Average Memory Access Time In Scalable Multiprocessors.” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 214.27 Mbs, the file-s for this book were downloaded 67 times, the file-s went public at Sat Mar 24 2018.

Available formats:
Abbyy GZ - Additional Text PDF - Archive BitTorrent - DjVuTXT - Djvu XML - Image Container PDF - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA300197: Exploiting Bandwidth To Reduce Average Memory Access Time In Scalable Multiprocessors. at online marketplaces:


16DTIC ADA457629: Contention-Conscious Transaction Ordering In Embedded Multiprocessors Systems

By

This paper explores the problem of efficiently ordering interprocessor communication operations in statically-scheduled multiprocessors for iterative dataflow graphs. In most digital signal processing applications, the throughput of the system is significantly affected by communication costs. By explicitly modeling these costs within an effective graph-theoretic analysis framework, we show that ordered transaction schedules can significantly outperform self-timed schedules even when synchronization costs are low. However, we also show that when communication latencies are non-negligible, finding an optimal transaction order given a static schedule is an NP-complete problem, and that this intractability holds both under iterative and non-iterative execution. We develop new heuristics for finding efficient transaction orders, and perform an experimental comparison to gauge the performance of these heuristics.

“DTIC ADA457629: Contention-Conscious Transaction Ordering In Embedded Multiprocessors Systems” Metadata:

  • Title: ➤  DTIC ADA457629: Contention-Conscious Transaction Ordering In Embedded Multiprocessors Systems
  • Author: ➤  
  • Language: English

“DTIC ADA457629: Contention-Conscious Transaction Ordering In Embedded Multiprocessors Systems” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 16.90 Mbs, the file-s for this book were downloaded 58 times, the file-s went public at Thu Jun 07 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - JPEG Thumb - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA457629: Contention-Conscious Transaction Ordering In Embedded Multiprocessors Systems at online marketplaces:


17DTIC ADA448087: System Synthesis For Optically-Connected, Multiprocessors On-Chip

By

Optical interconnects are being considered as a possible solution to the well known problems of scaling in VLSI interconnects. Along with enabling higher speed interconnects, optics allows the construction of highly connected and irregular networks that are streamlined for particular applications. Using these networks, it is possible to implement application mappings that allow flexible, single-hop communication patterns between processors. This has advantages for reduced system latency and power. Such optically connected multiprocessors are particularly promising for embedded digital signal processing (DSP) applications, which are highly parallel, and typically have tight constraints on latency and power consumption. This paper addresses novel trade-offs involving communication routing flexibility, power consumption, and performance that arise in the context of system synthesis of optically-interconnected multiprocessors. We report on experimental results that expose these trade-offs, and propose systematic techniques to address them efficiently. We demonstrate the performance of these techniques on several benchmark examples.

“DTIC ADA448087: System Synthesis For Optically-Connected, Multiprocessors On-Chip” Metadata:

  • Title: ➤  DTIC ADA448087: System Synthesis For Optically-Connected, Multiprocessors On-Chip
  • Author: ➤  
  • Language: English

“DTIC ADA448087: System Synthesis For Optically-Connected, Multiprocessors On-Chip” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 6.26 Mbs, the file-s for this book were downloaded 64 times, the file-s went public at Sat Jun 02 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - JPEG Thumb - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA448087: System Synthesis For Optically-Connected, Multiprocessors On-Chip at online marketplaces:


18DTIC ADA272948: Scheduling For Locality In Shared-Memory Multiprocessors

By

The last decade has produced enormous improvements in processor speed without a corresponding improvement in bus or interconnection network speeds. As a result, the relative costs of communication and computation in shared-memory multiprocessors have changed dramatically, and many parallel applications do not execute efficiently on today's multiprocessors. In this dissertation we quantify the effect of this trend-in architecture on parallel program performance, explain the implications of this trend on popular parallel programming models, and propose system software to efficiently map parallel programs and programming models to modern shared-memory multiprocessors. We propose new decomposition and scheduling algorithms that significantly reduce communication overhead. Our experiments over a wide variety of shared-memory multiprocessors demonstrate that the performance benefits of our scheduling-for-locality algorithms are significant, improving performance by up to 60% for some applications. We conclude that communication overhead need not dominate performance, given an appropriate programming model, multiprogramming scheduling policy, and user- level decomposition and scheduling algorithms. Shared-memory multiprocessors, Architecture trends, Loop scheduling, Lightweight thread scheduling, Multiprogramming

“DTIC ADA272948: Scheduling For Locality In Shared-Memory Multiprocessors” Metadata:

  • Title: ➤  DTIC ADA272948: Scheduling For Locality In Shared-Memory Multiprocessors
  • Author: ➤  
  • Language: English

“DTIC ADA272948: Scheduling For Locality In Shared-Memory Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 82.84 Mbs, the file-s for this book were downloaded 49 times, the file-s went public at Tue Mar 13 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA272948: Scheduling For Locality In Shared-Memory Multiprocessors at online marketplaces:


19NASA Technical Reports Server (NTRS) 19920002452: Vienna FORTRAN: A FORTRAN Language Extension For Distributed Memory Multiprocessors

By

Exploiting the performance potential of distributed memory machines requires a careful distribution of data across the processors. Vienna FORTRAN is a language extension of FORTRAN which provides the user with a wide range of facilities for such mapping of data structures. However, programs in Vienna FORTRAN are written using global data references. Thus, the user has the advantage of a shared memory programming paradigm while explicitly controlling the placement of data. The basic features of Vienna FORTRAN are presented along with a set of examples illustrating the use of these features.

“NASA Technical Reports Server (NTRS) 19920002452: Vienna FORTRAN: A FORTRAN Language Extension For Distributed Memory Multiprocessors” Metadata:

  • Title: ➤  NASA Technical Reports Server (NTRS) 19920002452: Vienna FORTRAN: A FORTRAN Language Extension For Distributed Memory Multiprocessors
  • Author: ➤  
  • Language: English

“NASA Technical Reports Server (NTRS) 19920002452: Vienna FORTRAN: A FORTRAN Language Extension For Distributed Memory Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 32.05 Mbs, the file-s for this book were downloaded 73 times, the file-s went public at Tue Sep 27 2016.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find NASA Technical Reports Server (NTRS) 19920002452: Vienna FORTRAN: A FORTRAN Language Extension For Distributed Memory Multiprocessors at online marketplaces:


20Building Portable Thread Schedulers For Hierarchical Multiprocessors: The BubbleSched Framework

By

Exploiting full computational power of current more and more hierarchical multiprocessor machines requires a very careful distribution of threads and data among the underlying non-uniform architecture. Unfortunately, most operating systems only provide a poor scheduling API that does not allow applications to transmit valuable scheduling hints to the system. In a previous paper, we showed that using a bubble-based thread scheduler can significantly improve applications' performance in a portable way. However, since multithreaded applications have various scheduling requirements, there is no universal scheduler that could meet all these needs. In this paper, we present a framework that allows scheduling experts to implement and experiment with customized thread schedulers. It provides a powerful API for dynamically distributing bubbles among the machine in a high-level, portable, and efficient way. Several examples show how experts can then develop, debug and tune their own portable bubble schedulers.

“Building Portable Thread Schedulers For Hierarchical Multiprocessors: The BubbleSched Framework” Metadata:

  • Title: ➤  Building Portable Thread Schedulers For Hierarchical Multiprocessors: The BubbleSched Framework
  • Authors:
  • Language: English

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 12.39 Mbs, the file-s for this book were downloaded 85 times, the file-s went public at Sat Sep 21 2013.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVu - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find Building Portable Thread Schedulers For Hierarchical Multiprocessors: The BubbleSched Framework at online marketplaces:


21Unstructured Scientific Computation On Scalable Multiprocessors

By

Exploiting full computational power of current more and more hierarchical multiprocessor machines requires a very careful distribution of threads and data among the underlying non-uniform architecture. Unfortunately, most operating systems only provide a poor scheduling API that does not allow applications to transmit valuable scheduling hints to the system. In a previous paper, we showed that using a bubble-based thread scheduler can significantly improve applications' performance in a portable way. However, since multithreaded applications have various scheduling requirements, there is no universal scheduler that could meet all these needs. In this paper, we present a framework that allows scheduling experts to implement and experiment with customized thread schedulers. It provides a powerful API for dynamically distributing bubbles among the machine in a high-level, portable, and efficient way. Several examples show how experts can then develop, debug and tune their own portable bubble schedulers.

“Unstructured Scientific Computation On Scalable Multiprocessors” Metadata:

  • Title: ➤  Unstructured Scientific Computation On Scalable Multiprocessors
  • Authors:
  • Language: English

“Unstructured Scientific Computation On Scalable Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 1515.28 Mbs, the file-s for this book were downloaded 80 times, the file-s went public at Sat Nov 04 2017.

Available formats:
ACS Encrypted EPUB - ACS Encrypted PDF - Abbyy GZ - Archive BitTorrent - Cloth Cover Detection Log - DjVuTXT - Djvu XML - Dublin Core - EPUB - Grayscale PDF - Item Tile - JPEG-Compressed PDF - JSON - LCP Encrypted EPUB - LCP Encrypted PDF - MARC - MARC Binary - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Remediated EPUB - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find Unstructured Scientific Computation On Scalable Multiprocessors at online marketplaces:


22DTIC ADA268985: Low Latency Messages On Distributed Memory Multiprocessors

By

This paper describes many of the issues in developing an efficient interface for communication on distributed memory machines and proposes a portable interface. Although the hardware component of message latency is less than one microsecond on many distributed memory machines, the software latency associated with sending and receiving typed messages is on the order of 50 microseconds. The reason for this imbalance is that the software interface does not match the hardware. By changing the interface to match the hardware more closely, applications with fine grained communication can be put on these machines. Based on several tests that we have run on the iPSC/860, we propose an interface that will better match current distributed memory machines. The model used in the proposed interface consists of a computation processor and a communication processor on each node. Communication between these processors and other nodes in the system is done through a buffered network. Information that is transmitted is either data or procedures to be executed on the remote processor. The dual processor system is better suited for efficiently handling asynchronous communications compared to a single processor system. The ability to send data or procedure invocations is very flexible for minimizing message latency, based on the type of communication being performed. This paper describes the tests performed and the proposed interface.... Parallel computers, Distributed memory, Communication, Message latency.

“DTIC ADA268985: Low Latency Messages On Distributed Memory Multiprocessors” Metadata:

  • Title: ➤  DTIC ADA268985: Low Latency Messages On Distributed Memory Multiprocessors
  • Author: ➤  
  • Language: English

“DTIC ADA268985: Low Latency Messages On Distributed Memory Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 19.05 Mbs, the file-s for this book were downloaded 61 times, the file-s went public at Mon Mar 12 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA268985: Low Latency Messages On Distributed Memory Multiprocessors at online marketplaces:


23DTIC ADA281628: Software Cache Coherence For Large Scale Multiprocessors

By

Shared memory provides an attractive and intuitive programming model that makes good use of programmer time and effort. Shared memory however requires a coherence mechanism to allow caching for performance and to ensure that processors do not use stale data in their caches. We evaluate several algorithmic and architectural alternatives in the design space of NCC-NUMA machines with a globally-accessible physical address space. We present a new adaptive algorithm for software cache coherence that reduces interprocessor communication and scales to large numbers of processors; we compare it to existing software and hardware coherence schemes. We also evaluate the tradeoffs among various write policies (write-through, write-through with a write-collect buffer) and the effect on performance of using remote memory access. Finally, we observe that certain simple program changes can greatly improve performance. For example, we find that the use of reader-writer locks, synchronization variable relocation, and data structure padding and alignment can allow a protocol to avoid significant amounts of coherence overhead.

“DTIC ADA281628: Software Cache Coherence For Large Scale Multiprocessors” Metadata:

  • Title: ➤  DTIC ADA281628: Software Cache Coherence For Large Scale Multiprocessors
  • Author: ➤  
  • Language: English

“DTIC ADA281628: Software Cache Coherence For Large Scale Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 18.29 Mbs, the file-s for this book were downloaded 63 times, the file-s went public at Sun Mar 18 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA281628: Software Cache Coherence For Large Scale Multiprocessors at online marketplaces:


24DTIC ADA281502: A Preliminary Evaluation Of Cache-Miss-Initiated Prefetching Techniques In Scalable Multiprocessors

By

Prefetching is an important technique for reducing the average latency of memory accesses in scalable cache-coherent multiprocessors. Aggressive prefetching can significantly reduce the number of cache misses, but may introduce bursty network and memory traffic, and increase data sharing and cache pollution. Given that we anticipate enormous increases in both network bandwidth and latency, we examine whether aggressive prefetching triggered by a miss (cache-miss-initiated prefetching) can substantially improve the running time of parallel programs. Using execution-driven simulation of parallel programs on scalable cache-coherent maching, we study the performance of three cache-miss-initiated prefetching techniques: large cache blocks, sequential prefetching, and hybrid prefetching. Large cache blocks (which fetch multiple words within a single block) and sequential prefetching (which fetches multiple consecutive blocks) are well-known prefetching strategies. Hybrid prefetching is a novel technique combining hardware and software support for stride-directed prefetching. Our simulation results show that large cache blocks rarely provide significant performance improvements; the improvement in the miss rate is often too small (or nonexistent) to offset a corresponding increase in the miss penalty. Our results also show that sequential and hybrid prefetching perform better than prefetching via large cache blocks, and that hybrid prefetching performs at least as well as sequential prefetching.

“DTIC ADA281502: A Preliminary Evaluation Of Cache-Miss-Initiated Prefetching Techniques In Scalable Multiprocessors” Metadata:

  • Title: ➤  DTIC ADA281502: A Preliminary Evaluation Of Cache-Miss-Initiated Prefetching Techniques In Scalable Multiprocessors
  • Author: ➤  
  • Language: English

“DTIC ADA281502: A Preliminary Evaluation Of Cache-Miss-Initiated Prefetching Techniques In Scalable Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 21.91 Mbs, the file-s for this book were downloaded 52 times, the file-s went public at Mon Mar 19 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA281502: A Preliminary Evaluation Of Cache-Miss-Initiated Prefetching Techniques In Scalable Multiprocessors at online marketplaces:


25Microsoft Research Audio 104198: Log-Based Architectures: Using Chip Multiprocessors To Help Software Behave Correctly

By

Software tends to misbehave more often than we would like due to bugs and security attacks. The good news is that there are a variety of dynamic program monitoring tools (which we call 'lifeguards') that can detect and sometimes repair these problems at runtime. The bad news, however, is that these lifeguards (e.g., Valgrind) often slow down execution by an order of magnitude or more, thereby limiting their usefulness. With the emergence of chip multiprocessing as the dominant computing platform, the raw horsepower for performing these monitoring tasks may exist on-chip. In order for the lifeguards to effectively exploit this raw horsepower, however, they need two forms of additional support: the ability to unobtrusively observe the dynamic behavior of the monitored application, and the ability to rewind that application for the sake of repairing a problem. In this joint project that we recently started at Intel Research Pittsburgh and Carnegie Mellon University, we are exploring whether this support can be provided by a single new architectural mechanism: a log that is captured by the hardware, managed by the system, and exposed to the lifeguard software.Our initial results using three diverse lifeguards demonstrate an order-of-magnitude speedup relative to equivalent tools written in Valgrind. Our design also includes a prediction-based compression scheme that reduces the log-related bandwidth and storage requirements by an order of magnitude, to less than one byte per instruction. ©2006 Microsoft Corporation. All rights reserved.

“Microsoft Research Audio 104198: Log-Based Architectures: Using Chip Multiprocessors To Help Software Behave Correctly” Metadata:

  • Title: ➤  Microsoft Research Audio 104198: Log-Based Architectures: Using Chip Multiprocessors To Help Software Behave Correctly
  • Author:
  • Language: English

“Microsoft Research Audio 104198: Log-Based Architectures: Using Chip Multiprocessors To Help Software Behave Correctly” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "audio" format, the size of the file-s is: 57.75 Mbs, the file-s for this book were downloaded 5 times, the file-s went public at Sat Nov 23 2013.

Available formats:
Archive BitTorrent - Item Tile - Metadata - Ogg Vorbis - PNG - VBR MP3 -

Related Links:

Online Marketplaces

Find Microsoft Research Audio 104198: Log-Based Architectures: Using Chip Multiprocessors To Help Software Behave Correctly at online marketplaces:


26"Superluminal" FITS File Processing On Multiprocessors: Zero Time Endian Conversion Technique

By

The FITS is the standard file format in astronomy, and it has been extended to agree with astronomical needs of the day. However, astronomical datasets have been inflating year by year. In case of ALMA telescope, a ~ TB scale 4-dimensional data cube may be produced for one target. Considering that typical Internet bandwidth is a few 10 MB/s at most, the original data cubes in FITS format are hosted on a VO server, and the region which a user is interested in should be cut out and transferred to the user (Eguchi et al., 2012). The system will equip a very high-speed disk array to process a TB scale data cube in a few 10 seconds, and disk I/O speed, endian conversion and data processing one will be comparable. Hence to reduce the endian conversion time is one of issues to realize our system. In this paper, I introduce a technique named "just-in-time endian conversion", which delays the endian conversion for each pixel just before it is really needed, to sweep out the endian conversion time; by applying this method, the FITS processing speed increases 20% for single threading, and 40% for multi-threading compared to CFITSIO. The speed-up by the method tightly relates to modern CPU architecture to improve the efficiency of instruction pipelines due to break of "causality", a programmed instruction code sequence.

“"Superluminal" FITS File Processing On Multiprocessors: Zero Time Endian Conversion Technique” Metadata:

  • Title: ➤  "Superluminal" FITS File Processing On Multiprocessors: Zero Time Endian Conversion Technique
  • Author:
  • Language: English

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 10.52 Mbs, the file-s for this book were downloaded 332 times, the file-s went public at Sat Jul 20 2013.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVu - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find "Superluminal" FITS File Processing On Multiprocessors: Zero Time Endian Conversion Technique at online marketplaces:


27DTIC ADA289474: Internationnal Conference On Software For Multiprocessors And Supercomputers Theory, Practice, Experience (2nd) Held In Moscow On September 19 - 23, 1994.

By

The FITS is the standard file format in astronomy, and it has been extended to agree with astronomical needs of the day. However, astronomical datasets have been inflating year by year. In case of ALMA telescope, a ~ TB scale 4-dimensional data cube may be produced for one target. Considering that typical Internet bandwidth is a few 10 MB/s at most, the original data cubes in FITS format are hosted on a VO server, and the region which a user is interested in should be cut out and transferred to the user (Eguchi et al., 2012). The system will equip a very high-speed disk array to process a TB scale data cube in a few 10 seconds, and disk I/O speed, endian conversion and data processing one will be comparable. Hence to reduce the endian conversion time is one of issues to realize our system. In this paper, I introduce a technique named "just-in-time endian conversion", which delays the endian conversion for each pixel just before it is really needed, to sweep out the endian conversion time; by applying this method, the FITS processing speed increases 20% for single threading, and 40% for multi-threading compared to CFITSIO. The speed-up by the method tightly relates to modern CPU architecture to improve the efficiency of instruction pipelines due to break of "causality", a programmed instruction code sequence.

“DTIC ADA289474: Internationnal Conference On Software For Multiprocessors And Supercomputers Theory, Practice, Experience (2nd) Held In Moscow On September 19 - 23, 1994.” Metadata:

  • Title: ➤  DTIC ADA289474: Internationnal Conference On Software For Multiprocessors And Supercomputers Theory, Practice, Experience (2nd) Held In Moscow On September 19 - 23, 1994.
  • Author: ➤  
  • Language: English

“DTIC ADA289474: Internationnal Conference On Software For Multiprocessors And Supercomputers Theory, Practice, Experience (2nd) Held In Moscow On September 19 - 23, 1994.” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 455.14 Mbs, the file-s for this book were downloaded 426 times, the file-s went public at Wed Mar 21 2018.

Available formats:
Abbyy GZ - Additional Text PDF - Archive BitTorrent - DjVuTXT - Djvu XML - Image Container PDF - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA289474: Internationnal Conference On Software For Multiprocessors And Supercomputers Theory, Practice, Experience (2nd) Held In Moscow On September 19 - 23, 1994. at online marketplaces:


28DTIC ADA289883: Scalability Of Atomic Primitives On Distributed Shared Memory Multiprocessors.

By

Many hardware primitives have been proposed for synchronization and atomic memory update on shared-memory multiprocessors. In this paper, we focus on general-purpose primitives that have proven popular on small-scale bus-based machines, but have yet to become widely available on large-scale, distributed-memory machines. Specifically, we propose several alternative implementations of fetch and Phi compare and swap, and load inked/store-conditional. We then analyze the performance of these implementations for various data sharing patterns, in both real and synthetic applications. Our results indicate that good overall performance can be obtained by implementing compare and swap in a multiprocessor's cache controllers, and by providing an additional instruction to load an exclusive copy of a line. (AN)

“DTIC ADA289883: Scalability Of Atomic Primitives On Distributed Shared Memory Multiprocessors.” Metadata:

  • Title: ➤  DTIC ADA289883: Scalability Of Atomic Primitives On Distributed Shared Memory Multiprocessors.
  • Author: ➤  
  • Language: English

“DTIC ADA289883: Scalability Of Atomic Primitives On Distributed Shared Memory Multiprocessors.” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 17.20 Mbs, the file-s for this book were downloaded 66 times, the file-s went public at Wed Mar 21 2018.

Available formats:
Abbyy GZ - Additional Text PDF - Archive BitTorrent - DjVuTXT - Djvu XML - Image Container PDF - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA289883: Scalability Of Atomic Primitives On Distributed Shared Memory Multiprocessors. at online marketplaces:


29DTIC ADA204321: Sparse Elimination On Vector Multiprocessors

By

The research of this grant spanned number of topical areas in its four years duration (1) Blocked parallel solution of dense and sparse systems. Closely-related to the original proposal, this research involved a study of the relationship between task granularity and block partitioning size in the solution of linear algebra problems. The rationale for this blocking was the restricted effective memory bandwidth of the shared-memory CRAY-2 due to memory conflicts. The final result was development of unique black-box models of the CRAY-2 memory system based on dedicated machine measurements. In the realization that the limited parallelism of the CRAY-2 was restrictive for future algorithm studies, a new effort precursing future research cooperative with WPAFB personnel was initiated.

“DTIC ADA204321: Sparse Elimination On Vector Multiprocessors” Metadata:

  • Title: ➤  DTIC ADA204321: Sparse Elimination On Vector Multiprocessors
  • Author: ➤  
  • Language: English

“DTIC ADA204321: Sparse Elimination On Vector Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 6.66 Mbs, the file-s for this book were downloaded 71 times, the file-s went public at Wed Feb 21 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA204321: Sparse Elimination On Vector Multiprocessors at online marketplaces:


30Simulation-based Fault-tolerant Multiprocessors System

By

System reliability is an important issue in designing modern multiprocessor systems. This paper proposes a fault-tolerant, scalable, multiprocessor system architecture that adopts a pipeline scheme. To verify the performance of the proposed system, the SimEvent/Stateflow tool of the MATLAB program was used to simulate the system. The proposed system uses twelve processors (P), connected in a linear array, to build a ten-stage system with two backup processors (BP). However, the system can be expanded by adding more processors to increase pipeline stages and performance, and more backup processors to increase system reliability. The system can automatically reorganize itself in the event of a failure of one or two processors and execution continues without interruption. Each processor communicates with its neighboring processors through input/output (I/O) ports which are used as bypass links between the processors. In the event of a processor failure, the function of the faulty processor is assigned to the next processor that is free from faults. The fast Fourier transform (FFT) algorithm is implemented on the simulated circuit to evaluate the performance of the proposed system. The results showed that the system can continue to execute even if one or two processors fail without a noticeable decrease in performance.

“Simulation-based Fault-tolerant Multiprocessors System” Metadata:

  • Title: ➤  Simulation-based Fault-tolerant Multiprocessors System
  • Author: ➤  

“Simulation-based Fault-tolerant Multiprocessors System” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 8.78 Mbs, the file-s for this book were downloaded 54 times, the file-s went public at Fri Apr 28 2023.

Available formats:
Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find Simulation-based Fault-tolerant Multiprocessors System at online marketplaces:


31Mode Change Protocol For Multi-Mode Real-Time Systems Upon Identical Multiprocessors

By

In this paper, we propose a synchronous protocol without periodicity for scheduling multi-mode real-time systems upon identical multiprocessor platforms. Our proposal can be considered to be a multiprocessor extension of the uniprocessor protocol called "Minimal Single Offset protocol".

“Mode Change Protocol For Multi-Mode Real-Time Systems Upon Identical Multiprocessors” Metadata:

  • Title: ➤  Mode Change Protocol For Multi-Mode Real-Time Systems Upon Identical Multiprocessors
  • Authors:
  • Language: English

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 3.18 Mbs, the file-s for this book were downloaded 81 times, the file-s went public at Wed Sep 18 2013.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVu - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find Mode Change Protocol For Multi-Mode Real-Time Systems Upon Identical Multiprocessors at online marketplaces:


32NASA Technical Reports Server (NTRS) 19960049753: Impact Of Load Balancing On Unstructured Adaptive Grid Computations For Distributed-Memory Multiprocessors

By

The computational requirements for an adaptive solution of unsteady problems change as the simulation progresses. This causes workload imbalance among processors on a parallel machine which, in turn, requires significant data movement at runtime. We present a new dynamic load-balancing framework, called JOVE, that balances the workload across all processors with a global view. Whenever the computational mesh is adapted, JOVE is activated to eliminate the load imbalance. JOVE has been implemented on an IBM SP2 distributed-memory machine in MPI for portability. Experimental results for two model meshes demonstrate that mesh adaption with load balancing gives more than a sixfold improvement over one without load balancing. We also show that JOVE gives a 24-fold speedup on 64 processors compared to sequential execution.

“NASA Technical Reports Server (NTRS) 19960049753: Impact Of Load Balancing On Unstructured Adaptive Grid Computations For Distributed-Memory Multiprocessors” Metadata:

  • Title: ➤  NASA Technical Reports Server (NTRS) 19960049753: Impact Of Load Balancing On Unstructured Adaptive Grid Computations For Distributed-Memory Multiprocessors
  • Author: ➤  
  • Language: English

“NASA Technical Reports Server (NTRS) 19960049753: Impact Of Load Balancing On Unstructured Adaptive Grid Computations For Distributed-Memory Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 16.03 Mbs, the file-s for this book were downloaded 53 times, the file-s went public at Thu Oct 13 2016.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find NASA Technical Reports Server (NTRS) 19960049753: Impact Of Load Balancing On Unstructured Adaptive Grid Computations For Distributed-Memory Multiprocessors at online marketplaces:


33Fault Tolerance In Real Time Multiprocessors - Embedded Systems

By

All real time tasks which are termed as critical tasks by nature have to complete its execution before its deadline, even in presence of faults. The most popularly used real time task assignment algorithms are First Fit (FF), Best Fit (BF), Bin Packing (BP).The common task scheduling algorithms are Rate Monotonic (RM), Earliest Deadline First (EDF) etc.All the current approaches deal with either fault tolerance or criticality in real time. In this paper we have proposed an integrated approach with a new algorithm, called SASA (Sorting And Sequential Assignment) which maps the real time task assignment with task schedule and fault tolerance

“Fault Tolerance In Real Time Multiprocessors - Embedded Systems” Metadata:

  • Title: ➤  Fault Tolerance In Real Time Multiprocessors - Embedded Systems
  • Authors:
  • Language: English

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 2.35 Mbs, the file-s for this book were downloaded 149 times, the file-s went public at Sun Sep 22 2013.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVu - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find Fault Tolerance In Real Time Multiprocessors - Embedded Systems at online marketplaces:


34DTIC ADA213910: Memory Management For Large-Scale NUMA (NonUniform Memory Access) Multiprocessors

By

Large-scale shared-memory multiprocessors such as the BBN Butterfly and IBM RP3 introduce a new level in the memory hierarchy; multiple physical memories with different memory access times. An operating system for these NUMA (NonUniform Memory Access) multiprocessors should provide traditional virtual memory management, facilitate dynamic and widespread memory sharing, and minimize the apparent disparity between local and nonlocal memory. In addition, the implementation must be scalable to configurations with hundreds or thousands of processors. This paper describes memory management in the Psyche multiprocessor operating system, under development at the University of Rochester. The Psyche kernel manages a multi-level memory hierarchy consisting of local memory, nonlocal memory, and backing store. Local memory stores private data and serves as a cache for shared data; nonlocal memory stores shared data and serves as a disk cache. The system structure isolates the policies and mechanisms that manage different layers in the memory hierarchy, so that customized data structures and policies can be constructed for each layer. Local memory management policies are implemented using mechanisms that are independent of the architectural configuration; global policies are implemented using multiple processes that increase in number as the architecture scales. Psyche currently runs on the BBN Butterfly Plus multiprocessor. (kr)

“DTIC ADA213910: Memory Management For Large-Scale NUMA (NonUniform Memory Access) Multiprocessors” Metadata:

  • Title: ➤  DTIC ADA213910: Memory Management For Large-Scale NUMA (NonUniform Memory Access) Multiprocessors
  • Author: ➤  
  • Language: English

“DTIC ADA213910: Memory Management For Large-Scale NUMA (NonUniform Memory Access) Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 19.18 Mbs, the file-s for this book were downloaded 69 times, the file-s went public at Fri Feb 23 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA213910: Memory Management For Large-Scale NUMA (NonUniform Memory Access) Multiprocessors at online marketplaces:


35Memory Management In Symunix II: A Design For Large-scale Shared Memory Multiprocessors

By

19 p. 28 cm

“Memory Management In Symunix II: A Design For Large-scale Shared Memory Multiprocessors” Metadata:

  • Title: ➤  Memory Management In Symunix II: A Design For Large-scale Shared Memory Multiprocessors
  • Authors:
  • Language: English

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 35.90 Mbs, the file-s for this book were downloaded 347 times, the file-s went public at Tue Sep 30 2008.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - Cloth Cover Detection Log - DjVu - DjVuTXT - Djvu XML - Dublin Core - Grayscale PDF - Item Tile - MARC - MARC Binary - MARC Source - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scan Factors - Scandata - Single Page Original JP2 Tar - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find Memory Management In Symunix II: A Design For Large-scale Shared Memory Multiprocessors at online marketplaces:


36DTIC ADA203348: SPRINGNET: A Network Of Multiprocessors For Hard Real-Time

By

The Spring project has been very active in multiple areas of real-time computing. We have developed significant results in many aspects of real-time scheduling, in real-time operating system design, in real-time transactions, and in time constrained communication protocols. We have also made progress in implementing a software (simulation) testbed, in developing a hardware operating system kernel testbed called SpringNet, and in implementing real-time transactions on a hardware testbed called CARAT. We have also begun a substantial research effort concerning dependable real-time systems. In the following sections we provide a brief overview of the status and plans for the Spring project as a whole.

“DTIC ADA203348: SPRINGNET: A Network Of Multiprocessors For Hard Real-Time” Metadata:

  • Title: ➤  DTIC ADA203348: SPRINGNET: A Network Of Multiprocessors For Hard Real-Time
  • Author: ➤  
  • Language: English

“DTIC ADA203348: SPRINGNET: A Network Of Multiprocessors For Hard Real-Time” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 6.90 Mbs, the file-s for this book were downloaded 42 times, the file-s went public at Wed Feb 21 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA203348: SPRINGNET: A Network Of Multiprocessors For Hard Real-Time at online marketplaces:


37DTIC ADA158274: Sparse Elimination On Vector Multiprocessors.

By

Studies of microtasking with up to 16 CRAY X-MP processors for LU decomposition of dense systems of equations have given rise to hybrid algorithms. One issue addressed has been the problem of memory bank conflicts, which increases with the number of processors. Conflict resistant algorithms have been developed. It is possible to assembly-code the X-MP so that accesses are pre-fetched into vector registers. Several reports have been prepared recently under this effort, and a paper entitled 'Task Granularity Studies in a Many-Processor Cray X-MP' has been accepted for publication in 'Parallel Computing.'

“DTIC ADA158274: Sparse Elimination On Vector Multiprocessors.” Metadata:

  • Title: ➤  DTIC ADA158274: Sparse Elimination On Vector Multiprocessors.
  • Author: ➤  
  • Language: English

“DTIC ADA158274: Sparse Elimination On Vector Multiprocessors.” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 5.57 Mbs, the file-s for this book were downloaded 73 times, the file-s went public at Fri Feb 02 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - JPEG Thumb - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA158274: Sparse Elimination On Vector Multiprocessors. at online marketplaces:


38Microsoft Research Video 104198: Log-Based Architectures: Using Chip Multiprocessors To Help Software Behave Correctly

By

Software tends to misbehave more often than we would like due to bugs and security attacks. The good news is that there are a variety of dynamic program monitoring tools (which we call 'lifeguards') that can detect and sometimes repair these problems at runtime. The bad news, however, is that these lifeguards (e.g., Valgrind) often slow down execution by an order of magnitude or more, thereby limiting their usefulness. With the emergence of chip multiprocessing as the dominant computing platform, the raw horsepower for performing these monitoring tasks may exist on-chip. In order for the lifeguards to effectively exploit this raw horsepower, however, they need two forms of additional support: the ability to unobtrusively observe the dynamic behavior of the monitored application, and the ability to rewind that application for the sake of repairing a problem. In this joint project that we recently started at Intel Research Pittsburgh and Carnegie Mellon University, we are exploring whether this support can be provided by a single new architectural mechanism: a log that is captured by the hardware, managed by the system, and exposed to the lifeguard software.Our initial results using three diverse lifeguards demonstrate an order-of-magnitude speedup relative to equivalent tools written in Valgrind. Our design also includes a prediction-based compression scheme that reduces the log-related bandwidth and storage requirements by an order of magnitude, to less than one byte per instruction. ©2006 Microsoft Corporation. All rights reserved.

“Microsoft Research Video 104198: Log-Based Architectures: Using Chip Multiprocessors To Help Software Behave Correctly” Metadata:

  • Title: ➤  Microsoft Research Video 104198: Log-Based Architectures: Using Chip Multiprocessors To Help Software Behave Correctly
  • Author:
  • Language: English

“Microsoft Research Video 104198: Log-Based Architectures: Using Chip Multiprocessors To Help Software Behave Correctly” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 902.03 Mbs, the file-s for this book were downloaded 54 times, the file-s went public at Thu May 01 2014.

Available formats:
Animated GIF - Archive BitTorrent - Item Tile - Metadata - Ogg Video - Thumbnail - Windows Media - h.264 -

Related Links:

Online Marketplaces

Find Microsoft Research Video 104198: Log-Based Architectures: Using Chip Multiprocessors To Help Software Behave Correctly at online marketplaces:


39DTIC ADA604029: Estimating Performance Of Single Bus, Shared Memory Multiprocessors

By

Given standard characteristics of processors and memory, we present two simple ways of estimating the performance of shared memory multiprocessors. At the cost of a few simple arithmetic operations, a computer designer can estimate the range of performance using our 4-point bound model. If more accuracy is required, we show that a one page program can estimate performance within 3% of trace-driven simulation, while reducing software development time, disk space, and CPU time by orders of magnitude. To demonstrate the use of our models, an application to the SPUR multiprocessor design is presented.

“DTIC ADA604029: Estimating Performance Of Single Bus, Shared Memory Multiprocessors” Metadata:

  • Title: ➤  DTIC ADA604029: Estimating Performance Of Single Bus, Shared Memory Multiprocessors
  • Author: ➤  
  • Language: English

“DTIC ADA604029: Estimating Performance Of Single Bus, Shared Memory Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 28.06 Mbs, the file-s for this book were downloaded 54 times, the file-s went public at Fri Sep 21 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA604029: Estimating Performance Of Single Bus, Shared Memory Multiprocessors at online marketplaces:


40Univac :: 1100 :: Brochures :: S6818r3 Sperry Series 1100 Multiprocessors Brochure Oct83

From the bitsavers.org collection, a scanned-in computer-related document. univac :: 1100 :: brochures :: S6818r3 Sperry Series 1100 Multiprocessors Brochure Oct83

“Univac :: 1100 :: Brochures :: S6818r3 Sperry Series 1100 Multiprocessors Brochure Oct83” Metadata:

  • Title: ➤  Univac :: 1100 :: Brochures :: S6818r3 Sperry Series 1100 Multiprocessors Brochure Oct83
  • Language: English

“Univac :: 1100 :: Brochures :: S6818r3 Sperry Series 1100 Multiprocessors Brochure Oct83” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 19.12 Mbs, the file-s for this book were downloaded 165 times, the file-s went public at Tue Feb 18 2014.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVu - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find Univac :: 1100 :: Brochures :: S6818r3 Sperry Series 1100 Multiprocessors Brochure Oct83 at online marketplaces:


41NASA Technical Reports Server (NTRS) 19850025419: Modeling And Measurement Of Fault-tolerant Multiprocessors

By

The workload effects on computer performance are addressed first for a highly reliable unibus multiprocessor used in real-time control. As an approach to studing these effects, a modified Stochastic Petri Net (SPN) is used to describe the synchronous operation of the multiprocessor system. From this model the vital components affecting performance can be determined. However, because of the complexity in solving the modified SPN, a simpler model, i.e., a closed priority queuing network, is constructed that represents the same critical aspects. The use of this model for a specific application requires the partitioning of the workload into job classes. It is shown that the steady state solution of the queuing model directly produces useful results. The use of this model in evaluating an existing system, the Fault Tolerant Multiprocessor (FTMP) at the NASA AIRLAB, is outlined with some experimental results. Also addressed is the technique of measuring fault latency, an important microscopic system parameter. Most related works have assumed no or a negligible fault latency and then performed approximate analyses. To eliminate this deficiency, a new methodology for indirectly measuring fault latency is presented.

“NASA Technical Reports Server (NTRS) 19850025419: Modeling And Measurement Of Fault-tolerant Multiprocessors” Metadata:

  • Title: ➤  NASA Technical Reports Server (NTRS) 19850025419: Modeling And Measurement Of Fault-tolerant Multiprocessors
  • Author: ➤  
  • Language: English

“NASA Technical Reports Server (NTRS) 19850025419: Modeling And Measurement Of Fault-tolerant Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 43.24 Mbs, the file-s for this book were downloaded 77 times, the file-s went public at Wed Sep 14 2016.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find NASA Technical Reports Server (NTRS) 19850025419: Modeling And Measurement Of Fault-tolerant Multiprocessors at online marketplaces:


42DTIC ADA175121: Sparse Elimination On Vector Multiprocessors.

By

The availability of instruction-level simulators for the CRAY X-MP and the CRAY-2, together with early access to the MFECC and NAS CRAY-2's, has made possible the study of a variety of equation-solving issues for many-processor VMP configurations. These include: (1) the development of equation-solving algorithms on the CRAY-2, and; (2) task granularity studies; and (3) memory conflict studies.

“DTIC ADA175121: Sparse Elimination On Vector Multiprocessors.” Metadata:

  • Title: ➤  DTIC ADA175121: Sparse Elimination On Vector Multiprocessors.
  • Author: ➤  
  • Language: English

“DTIC ADA175121: Sparse Elimination On Vector Multiprocessors.” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 4.62 Mbs, the file-s for this book were downloaded 57 times, the file-s went public at Mon Feb 12 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA175121: Sparse Elimination On Vector Multiprocessors. at online marketplaces:


43NASA Technical Reports Server (NTRS) 19870017079: MPF: A Portable Message Passing Facility For Shared Memory Multiprocessors

By

The design, implementation, and performance evaluation of a message passing facility (MPF) for shared memory multiprocessors are presented. The MPF is based on a message passing model conceptually similar to conversations. Participants (parallel processors) can enter or leave a conversation at any time. The message passing primitives for this model are implemented as a portable library of C function calls. The MPF is currently operational on a Sequent Balance 21000, and several parallel applications were developed and tested. Several simple benchmark programs are presented to establish interprocess communication performance for common patterns of interprocess communication. Finally, performance figures are presented for two parallel applications, linear systems solution, and iterative solution of partial differential equations.

“NASA Technical Reports Server (NTRS) 19870017079: MPF: A Portable Message Passing Facility For Shared Memory Multiprocessors” Metadata:

  • Title: ➤  NASA Technical Reports Server (NTRS) 19870017079: MPF: A Portable Message Passing Facility For Shared Memory Multiprocessors
  • Author: ➤  
  • Language: English

“NASA Technical Reports Server (NTRS) 19870017079: MPF: A Portable Message Passing Facility For Shared Memory Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 15.90 Mbs, the file-s for this book were downloaded 62 times, the file-s went public at Mon Sep 19 2016.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVuTXT - Djvu XML - JPEG Thumb - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find NASA Technical Reports Server (NTRS) 19870017079: MPF: A Portable Message Passing Facility For Shared Memory Multiprocessors at online marketplaces:


44DTIC ADA433094: Application-Specific Optical Interconnects For Embedded Multiprocessors

By

As transistor sizes shrink and we approach the end of Moore's law interconnects-both on-chip and off-chip-will represent the biggest bottleneck for embedded systems designers. Several groups are researching optical interconnects to cope with this tread. Optical interconnects enable new system architectures. These new architectures in turn require new methods for high-level application mapping and hardware/software co-design. In this presentation, we discuss high-level scheduling and interconnect topology synthesis techniques for embedded multiprocessors. We focus on designs that are streamlined for one or more digital signal processing (DSP) applications. That is, we seek to synthesize an application-specific interconnect topology for a multiprocessor DSP design. We show that flexible interconnect topologies that allow single- hop communication between processors offer advantages for reduced power and latency. We have previously shown that multiprocessor scheduling algorithms can deadlock in the general case of a topology graph that is not strongly connected or if communication is limited to be single hop. We have also demonstrated an efficient algorithm that can be used in conjunction with existing scheduling algorithms for avoiding this deadlock 1. In this presentation we discuss the advantages of performing application scheduling and interconnect synthesis jointly and present a probabilistic scheduling/ interconnect algorithm utilizing graph isomorphism to pare the design space. We demonstrate the performance advantages that an application-specific interconnect topology can produce for several DSP beachmarks.

“DTIC ADA433094: Application-Specific Optical Interconnects For Embedded Multiprocessors” Metadata:

  • Title: ➤  DTIC ADA433094: Application-Specific Optical Interconnects For Embedded Multiprocessors
  • Author: ➤  
  • Language: English

“DTIC ADA433094: Application-Specific Optical Interconnects For Embedded Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 5.76 Mbs, the file-s for this book were downloaded 54 times, the file-s went public at Fri May 25 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA433094: Application-Specific Optical Interconnects For Embedded Multiprocessors at online marketplaces:


45DTIC ADA192799: An Evaluation Methodology For Dependable Multiprocessors.

By

This report outlines an approach to a methodology for evaluating high performance, reliable computers. The purpose of the methodology is to provide a framework and a basis for tool development that will make it possible to conduct such evaluations systematically and efficiently. The increasing complexities of high performance computer systems and the stringent requirement for high reliability in harsh environments (e.g., space) make such an evaluation methodology an absolute necessity. The report discusses sources of difficulty in evaluation, such as the many complexities of multiprocessing, the difficulty of distinguishing various factor (algorithms, software), operating systems, fault diagnostics, etc.) that affect performance and fault tolerance, the use of formal and experimental analyses, and the special problems of computer security. Criteria and suggestions are given for the design of unified working environments and specific classes of tools that support the methodology.

“DTIC ADA192799: An Evaluation Methodology For Dependable Multiprocessors.” Metadata:

  • Title: ➤  DTIC ADA192799: An Evaluation Methodology For Dependable Multiprocessors.
  • Author: ➤  
  • Language: English

“DTIC ADA192799: An Evaluation Methodology For Dependable Multiprocessors.” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 67.12 Mbs, the file-s for this book were downloaded 64 times, the file-s went public at Sat Feb 17 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA192799: An Evaluation Methodology For Dependable Multiprocessors. at online marketplaces:


46DTIC ADA286047: A Periodic Scheduling Heuristic For Mapping Iterative Task Graphs Onto Distributed Memory Multiprocessors

By

This thesis investigates the problem of statically assigning the tasks of applications represented by repetitive task graphs (such as sonar or radar signal processing) to the processors of a distributed memory multiprocessor system with the objective of maximizing graph instance throughput. The repetitive nature of these task graphs allows for pipelining and the overlapping of successive graph instances, suggesting a departure from classical directed acyclic graph scheduling techniques. To investigate such a claim, a version of the Mapping Heuristic (MH) ELR 90 is extended for use with iterative applications. Then a new heuristic, Periodic Scheduling (PS), is developed to capitalize on the repetitive nature of these task graphs by overlapping successive graph instances. The PS heuristic assigns tasks to processors in such a way so as to minimize the maximal utilization of the processors and the communications links between them. This maximal utilization figure dictates the interval between successive instances of the task graph. We conduct experiments in which the graph instance throughput of PS is compared to that of MH across a broad range of processor topologies, utilizing several communications/computation ratios. It is shown that, compared to MH, the PS heuristic improves the throughput performance between two and 50 percent. Particularly noteworthy improvement is noted on systems with high average inter-node communications costs. Assignment, Distributed processors, Heuristic algorithm, Mapping problem, Scheduling.

“DTIC ADA286047: A Periodic Scheduling Heuristic For Mapping Iterative Task Graphs Onto Distributed Memory Multiprocessors” Metadata:

  • Title: ➤  DTIC ADA286047: A Periodic Scheduling Heuristic For Mapping Iterative Task Graphs Onto Distributed Memory Multiprocessors
  • Author: ➤  
  • Language: English

“DTIC ADA286047: A Periodic Scheduling Heuristic For Mapping Iterative Task Graphs Onto Distributed Memory Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 27.98 Mbs, the file-s for this book were downloaded 64 times, the file-s went public at Tue Mar 20 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA286047: A Periodic Scheduling Heuristic For Mapping Iterative Task Graphs Onto Distributed Memory Multiprocessors at online marketplaces:


47DTIC ADA206305: Highly Parallel Iterative Methods For Massively Parallel Multiprocessors

By

There are at least two-critical components required to obtain extremely fast methods for solving sparse linear systems. One is the use of efficient and robust numerical algorithms, and the other is the employment of effective techniques for delivering a large amount of computing power. These requirements can conflict with one another in a variety of ways. Many modern methods of solving reasonably general classes of linear systems involve a degree of implicitness; this implicitness can limit the amount of available concurrency. When Krylov space linear solvers are used, the choice of preconditioner can play a major role in determining the operation count of the resulting algorithm. Unfortunately, some of the most powerful preconditioners are obtained by using incompletely factored matrices. To apply these preconditioners, we must repeatedly solve sparse triangular systems. The efficiency with which such solutions could be carried out was characterized by Saad and Schultz. Sparse triangular systems arising from a range of problems have been solved efficiently on a number of shared memory architectures 1,3,4,9. Because data dependencies limit the concurrency available from a sparse triangular solve, it has not been clear that triangular solves could be employed usefully in programs written for massively parallel architectures.

“DTIC ADA206305: Highly Parallel Iterative Methods For Massively Parallel Multiprocessors” Metadata:

  • Title: ➤  DTIC ADA206305: Highly Parallel Iterative Methods For Massively Parallel Multiprocessors
  • Author: ➤  
  • Language: English

“DTIC ADA206305: Highly Parallel Iterative Methods For Massively Parallel Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 13.17 Mbs, the file-s for this book were downloaded 70 times, the file-s went public at Thu Feb 22 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DTIC ADA206305: Highly Parallel Iterative Methods For Massively Parallel Multiprocessors at online marketplaces:


48NASA Technical Reports Server (NTRS) 19910021435: Asynchronous And Corrected-asynchronous Numerical Solutions Of Parabolic PDES On MIMD Multiprocessors

By

A major problem in achieving significant speed-up on parallel machines is the overhead involved with synchronizing the concurrent process. Removing the synchronization constraint has the potential of speeding up the computation. The authors present asynchronous (AS) and corrected-asynchronous (CA) finite difference schemes for the multi-dimensional heat equation. Although the discussion concentrates on the Euler scheme for the solution of the heat equation, it has the potential for being extended to other schemes and other parabolic partial differential equations (PDEs). These schemes are analyzed and implemented on the shared memory multi-user Sequent Balance machine. Numerical results for one and two dimensional problems are presented. It is shown experimentally that the synchronization penalty can be about 50 percent of run time: in most cases, the asynchronous scheme runs twice as fast as the parallel synchronous scheme. In general, the efficiency of the parallel schemes increases with processor load, with the time level, and with the problem dimension. The efficiency of the AS may reach 90 percent and over, but it provides accurate results only for steady-state values. The CA, on the other hand, is less efficient, but provides more accurate results for intermediate (non steady-state) values.

“NASA Technical Reports Server (NTRS) 19910021435: Asynchronous And Corrected-asynchronous Numerical Solutions Of Parabolic PDES On MIMD Multiprocessors” Metadata:

  • Title: ➤  NASA Technical Reports Server (NTRS) 19910021435: Asynchronous And Corrected-asynchronous Numerical Solutions Of Parabolic PDES On MIMD Multiprocessors
  • Author: ➤  
  • Language: English

“NASA Technical Reports Server (NTRS) 19910021435: Asynchronous And Corrected-asynchronous Numerical Solutions Of Parabolic PDES On MIMD Multiprocessors” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 13.20 Mbs, the file-s for this book were downloaded 47 times, the file-s went public at Mon Sep 26 2016.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find NASA Technical Reports Server (NTRS) 19910021435: Asynchronous And Corrected-asynchronous Numerical Solutions Of Parabolic PDES On MIMD Multiprocessors at online marketplaces:


49TLB Consistency On Highly-parallel Shared-memory Multiprocessors

By

22 p. 28 cm

“TLB Consistency On Highly-parallel Shared-memory Multiprocessors” Metadata:

  • Title: ➤  TLB Consistency On Highly-parallel Shared-memory Multiprocessors
  • Authors:
  • Language: English

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 38.66 Mbs, the file-s for this book were downloaded 601 times, the file-s went public at Tue Sep 30 2008.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - Cloth Cover Detection Log - DjVu - DjVuTXT - Djvu XML - Dublin Core - Grayscale PDF - Item Tile - MARC - MARC Binary - MARC Source - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scan Factors - Scandata - Single Page Original JP2 Tar - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find TLB Consistency On Highly-parallel Shared-memory Multiprocessors at online marketplaces:


50Synchronization Costs On Multiprocessors

By

16 p. 28 cm

“Synchronization Costs On Multiprocessors” Metadata:

  • Title: ➤  Synchronization Costs On Multiprocessors
  • Author:
  • Language: English

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 26.34 Mbs, the file-s for this book were downloaded 283 times, the file-s went public at Tue Sep 30 2008.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - Cloth Cover Detection Log - DjVu - DjVuTXT - Djvu XML - Dublin Core - Grayscale PDF - Item Tile - MARC - MARC Binary - MARC Source - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scan Factors - Scandata - Single Page Original JP2 Tar - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find Synchronization Costs On Multiprocessors at online marketplaces:


Buy “Multiprocessors” online:

Shop for “Multiprocessors” on popular online marketplaces.