In the first part of this work, a description of the mathematical and numerical models employed is presented.
The mathematical model implemented in numerical form solves the 2D hydrodynamic equations for shallow waters, obtained by vertically integrating pressure-velocity 3D Navier Stokes equations.
A finite differences scheme is being employed over a staggered mesh to solve the discrete form of the shallow water equations. As a result of this scheme, a tri-diagonal system must be solved. A block domain decomposition technique with overlapped regions is used to parallelize the program.
Two approaches are employed for the parallel solver:
The advantages of each method are studied and explained below.
A stability analysis of the the scheme is performed in one and two dimensions employing Fourier transform of the constant coefficients shallow water PDE. Condition for both time and space step (cell size) in the discrete solution are found.
In the second part of this work, extensive applications of the above mentioned models are made to test speedup, accuracy and convergence of the numerical results to both analytical solutions and real world data.
A comparison of the numerical results and the analytical solution, for the one dimensional case, is carried out. Comparison of results from the extensive application of the code to a real tide simulation over the Río de la Plata are presented in chapter 8 and a graphical output of an application to predict the behavior of oil spills is introduced.
The speedup of a parallel program, defined as the serial runtime (ts) over parallel runtime (tp), is measured using several grids resolutions on the same geometry: a rectangular channel. The meshes employed in this study have 5.000, 20.000 and 80.000 grid points.
Numerical accuracy validation is performed using a grid refinement study over the same meshes.
The "parallel computers" available to test the performance have been:
The RISC workstations cluster have shown advantages over the SMP equipment, and other parallel computers, in the performance/price ratio, scalability, standard components usage, as it's documented in Warren et al. (1997). That has been confirmed along this work. The advantages are more evident in the case of the cluster of PCs thanks to the components low price.