|
|
|||
|
|
|||
|
Next: Microkernel Technology Support to Parallel Applications Up: Research Subfields Previous: ParallelizationData Distribution and Loop Transformations Index: Contents Page Algorithms and Architectures for Linear Algebra and Sparse Matrix computationsWe are working in multilevel block algorithms applied to dense and general sparse matrices. First of all, it has been studied, proposed, developed and evaluated multilevel block algorithms. The main goal of multilevel block algorithms is to optimize the locality exploitation of data in linear algebra operations. The target architectures of multilevel block algorithms are those with several levels in the memory hierarchy. The different forms of the algorithms have been classified and we have proved that one of them, that we call MOB, is optimal in performance and can be easily designed. The performance improvements obtained with the MOB algorithms are outstanding. In the design of the multilevel block algorithms, in addition to the conflicts that appear in cache lines, we consider TLB misses and page faults. The evaluation of MOB algorithms is done theoretically on a memory hierarchy model and is validated with trace-driven simulations and experimental measurements on two new computers that use high performance microprocessors.
For the case of sparse linear systems, parallel algorithms have been implemented.
The basic approach of this piece of work is to extract the coarse grain parallelism that those applications have. In order to do so, domain decomposition techniques have been used and a message passing programming model on a distributed memory multicomputer have been assumed. All this work is related to project ESPRIT-6753 (IDENTIFY) in collaboration with the British Rutherford Appleton Laboratory and the French company Bertin et Cie. In particular, the developed software can be applied to Fluid Dynamics problems. The role of our research group in the project was to implement the linear solvers for the project. Some of the methods developed are: i) Element by element methods, ii) Iterative methods on the global matrix, iii) Iterative methods on the Schur Complement matrix. In addition, we have also implemented the corresponding software for the assembly of the matrices which is necessary for methods (ii) and (iii). This latter software allows the assembly both from a finite volume discretization and from a finite element discretization. With the objective of improving the efficiency of the parallel solvers, different reordering algorithms have been included in the code. Those algorithms have the aim of reducing the band and increment the parallelism. Some of those algorithms process elements within submatrices and others process submatrices within the global matrix. Within the field of sparse linear systems, one can find banded linear systems. This type of problems appears in the discretization of partial differential equations. The size of the band can vary from one only subdiagonal (bidiagonal matrices) to tens of diagonals. The work developed has focussed on three different parts. In a first part, algorithms have been studied for narrow banded systems (bidiagonal and tridiagonal). Those methods have been studied for their execution on vector processors in:
In the second part, the methods studied above have been analyzed for strictly diagonal dominant systems of equations. New methods have been proposed for the solution of those systems. The study has a mathematical component to prove the accuracy of the methods and an architectural component to study the methods on specialized architectures:
Finally, a study of the algorithms for different applications has been carried out:
Next: Microkernel Technology Support to Parallel Applications Up: Research Subfields Previous: ParallelizationData Distribution and Loop Transformations Index: Contents Page
|
| Inici | Presentació | Docència | Recerca | Centres de Recerca | Novetats |
| ||
|
|