Back| P- |
| P_i P_i factor main partition P_i a_i = l_i u_i on each processo lbwl, lbwu: lower and upper bandwidth of local solver factor main partition P_i a_i = l_i u_i on each processo lbwl, lbwu: lower and upper bandwidth of local solver factor main partition P_i a_i = l_i u_i on each processo lbwl, lbwu: lower and upper bandwidth of local solver factor main partition P_i a_i = l_i u_i on each processo lbwl, lbwu: lower and upper bandwidth of local solver |
| Pack Pack Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec Pack params and positions into arrays for global consistency chec |
| packed packed on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of |
| pair pair dlasorte sorts eigenpairs so that real eigenpairs are together an since every 2nd subdiagonal is guaranteed to be zero. bm1 = m for 1st block on proc pair, bm2 2nd bloc bm1 = m for 1st block on proc pair, bm2 2nd bloc bm1 = m for 1st block on proc pair, bm2 2nd bloc bm1 = m for 1st block on proc pair, bm2 2nd bloc slasorte sorts eigenpairs so that real eigenpairs are together an since every 2nd subdiagonal is guaranteed to be zero. |
| paired paired this is set if the input matrix had an odd number of real eigenvalues and things couldn't be paired or if the inpu 0 indicates successful completion. this is set if the input matrix had an odd number of real eigenvalues and things couldn't be paired or if the inpu 0 indicates successful completion. |
| panel panel = 1: the data layout blocksize; = 2: the panel blocking factor = 4: execution path control; |
| Parallel Parallel since every 2nd subdiagonal is guaranteed to be zero. this routine does no Parallel work arguments the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. this is the right-looking Parallel level 2 blas version of th this is the right-looking Parallel level 3 blas version of th reference: f. tisseur and j. dongarra, "a Parallel divide an on distributed memory architectures", see "on the correctness of Parallel bisection in floatin see "on the correctness of Parallel bisection in floatin pchentrd is a prototype version of pchetrd which uses tailored codes (either the serial, chetrd, or the Parallel code, pchettrd pclacp3 is an auxiliary routine that copies from a global Parallel the entire submatrix that is copied gets placed on one node or the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. pcstein computes the eigenvectors of a symmetric tridiagonal matrix in Parallel, using inverse iteration. the eigenvectors foun orthogonalize vectors that are on different processes. the extent pctrevc computes some or all of the right and/or left eigenvectors of a complex upper triangular matrix t in Parallel the right eigenvector x and the left eigenvector y of t corresponding the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. this is the right-looking Parallel level 2 blas version of th this is the right-looking Parallel level 3 blas version of th pdlacp3 is an auxiliary routine that copies from a global Parallel the entire submatrix that is copied gets placed on one node or matrix after modification by a rank-one symmetric matrix, in Parallel t = q(in) ( d(in) + rho * z*z' ) q'(in) = q(out) * d(out) * q'(out) the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. pdstebz computes the eigenvalues of a symmetric tridiagonal matrix in Parallel. the user may ask for all eigenvalues, all eigenvalues i static partitioning of work is done at the beginning of pdstebz which pdstedc computes all eigenvalues and eigenvectors of a symmetric tridiagonal matrix in Parallel, using the divide an pdstein computes the eigenvectors of a symmetric tridiagonal matrix in Parallel, using inverse iteration. the eigenvectors foun orthogonalize vectors that are on different processes. the extent reference: f. tisseur and j. dongarra, "a Parallel divide an on distributed memory architectures", see "on the correctness of Parallel bisection in floatin see "on the correctness of Parallel bisection in floatin pdsyntrd is a prototype version of pdsytrd which uses tailored codes (either the serial, dsytrd, or the Parallel code, pdsyttrd the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. this is the right-looking Parallel level 2 blas version of th this is the right-looking Parallel level 3 blas version of th pslacp3 is an auxiliary routine that copies from a global Parallel the entire submatrix that is copied gets placed on one node or matrix after modification by a rank-one symmetric matrix, in Parallel t = q(in) ( d(in) + rho * z*z' ) q'(in) = q(out) * d(out) * q'(out) the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. psstebz computes the eigenvalues of a symmetric tridiagonal matrix in Parallel. the user may ask for all eigenvalues, all eigenvalues i static partitioning of work is done at the beginning of psstebz which psstedc computes all eigenvalues and eigenvectors of a symmetric tridiagonal matrix in Parallel, using the divide an psstein computes the eigenvectors of a symmetric tridiagonal matrix in Parallel, using inverse iteration. the eigenvectors foun orthogonalize vectors that are on different processes. the extent reference: f. tisseur and j. dongarra, "a Parallel divide an on distributed memory architectures", see "on the correctness of Parallel bisection in floatin see "on the correctness of Parallel bisection in floatin pssyntrd is a prototype version of pssytrd which uses tailored codes (either the serial, ssytrd, or the Parallel code, pssyttrd the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. this is the right-looking Parallel level 2 blas version of th this is the right-looking Parallel level 3 blas version of th reference: f. tisseur and j. dongarra, "a Parallel divide an on distributed memory architectures", see "on the correctness of Parallel bisection in floatin see "on the correctness of Parallel bisection in floatin pzhentrd is a prototype version of pzhetrd which uses tailored codes (either the serial, zhetrd, or the Parallel code, pzhettrd pzlacp3 is an auxiliary routine that copies from a global Parallel the entire submatrix that is copied gets placed on one node or the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. the mapping for matrices must be blocked, reflecting the nature of the divide and conquer algorithm as a task-Parallel algorithm chunk of the matrix. pzstein computes the eigenvectors of a symmetric tridiagonal matrix in Parallel, using inverse iteration. the eigenvectors foun orthogonalize vectors that are on different processes. the extent pztrevc computes some or all of the right and/or left eigenvectors of a complex upper triangular matrix t in Parallel the right eigenvector x and the left eigenvector y of t corresponding since every 2nd subdiagonal is guaranteed to be zero. this routine does no Parallel work arguments |
| parallelism parallelism communication can be broken into several parts for efficient parallelism loop over all the bulges, just doing the work communication can be broken into several parts for efficient parallelism loop over all the bulges, just doing the work |
| parallelization parallelization details of the parallelization we delay spreading v across to all processor columns (which this code is basically a parallelization of the following sni this code is basically a parallelization of the following sni details of the parallelization we delay spreading v across to all processor columns (which this code is basically a parallelization of the following sni details of the parallelization we delay spreading v across to all processor columns (which details of the parallelization we delay spreading v across to all processor columns (which this code is basically a parallelization of the following sni |
| parameter parameter workspace needs are larger for pcheevx. liwork parameter adde orfac, icluster() and gap() parameters added orthogonalize vectors that are on different processes. the extent of orthogonalization is controlled by the input parameter lwork process. pcstein decides on the allocation of work among the point arithmetic. cure: increase the parameter "fudge", recompile orthogonalize vectors that are on different processes. the extent of orthogonalization is controlled by the input parameter lwork process. pdstein decides on the allocation of work among the workspace needs are larger for pdsyevx. liwork parameter adde orfac, icluster() and gap() parameters added tailored eigen-routines to choose problem-dependent parameters for the local environment. see ispe point arithmetic. cure: increase the parameter "fudge", recompile orthogonalize vectors that are on different processes. the extent of orthogonalization is controlled by the input parameter lwork process. psstein decides on the allocation of work among the workspace needs are larger for pssyevx. liwork parameter adde orfac, icluster() and gap() parameters added workspace needs are larger for pzheevx. liwork parameter adde orfac, icluster() and gap() parameters added orthogonalize vectors that are on different processes. the extent of orthogonalization is controlled by the input parameter lwork process. pzstein decides on the allocation of work among the |
| parameters parameters .. parameters . test the input parameters .. parameters . sn (output) complex parameters of the rotation matrix further details .. parameters . test the input parameters .. parameters . .. parameters . test the input parameters test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . internal parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . on output, work(1) returns the workspace needed to guarantee completion. if the input parameters are incorrect, work(1 on output rwork(1) returns the real workspace needed to guarantee completion. if the input parameters are incorrect orfac, icluster() and gap() parameters adde .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . where a' is the conjugate transpose of a, and pclacon must be re-called with all the other parameters unchanged ix (global input) integer .. parameters . .. parameters . .. parameters . .. parameters . lrwork (local input) integer dimension of rwork .. parameters . .. parameters . .. parameters . .. parameters . get grid parameters and local indexes get grid parameters get grid parameters and local indexes get grid parameters .. parameters . .. parameters . internal parameters internal parameters get grid parameters .. parameters . get grid parameters .. parameters . .. parameters . get grid parameters .. parameters . get grid parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . get grid parameters .. parameters . test the input parameters .. parameters . .. parameters . .. parameters . .. parameters . the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters .. parameters . .. parameters . internal parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . internal parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . a' * x, if kase=2, pdlacon must be re-called with all the other parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . lwork (local input) integer dimension of work .. parameters . .. parameters . pdlamch determines double precision machine parameters arguments .. parameters . .. parameters . get grid parameters get grid parameters and local indexes get grid parameters .. parameters . .. parameters . internal parameters internal parameters .. parameters . .. parameters . get grid parameters .. parameters . .. parameters . .. parameters . get grid parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . get grid parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters .. parameters . .. parameters . internal parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters .. parameters . internal parameters .. parameters . needed to guarantee completion. if the input parameters are incorrect, work(1) may also b .. parameters . orfac, icluster() and gap() parameters adde .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . parameters tailored eigen-routines to choose problem-dependent parameters for the local environment. see ispe parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . internal parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . a' * x, if kase=2, pslacon must be re-called with all the other parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . lwork (local input) integer dimension of work .. parameters . .. parameters . pslamch determines single precision machine parameters arguments .. parameters . .. parameters . get grid parameters get grid parameters and local indexes get grid parameters .. parameters . .. parameters . internal parameters internal parameters .. parameters . .. parameters . get grid parameters .. parameters . .. parameters . .. parameters . get grid parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . get grid parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters .. parameters . .. parameters . internal parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters .. parameters . internal parameters .. parameters . needed to guarantee completion. if the input parameters are incorrect, work(1) may also b .. parameters . orfac, icluster() and gap() parameters adde .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters .. parameters . the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . internal parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . on output, work(1) returns the workspace needed to guarantee completion. if the input parameters are incorrect, work(1 on output rwork(1) returns the real workspace needed to guarantee completion. if the input parameters are incorrect orfac, icluster() and gap() parameters adde .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . where a' is the conjugate transpose of a, and pzlacon must be re-called with all the other parameters unchanged ix (global input) integer .. parameters . .. parameters . .. parameters . .. parameters . lrwork (local input) integer dimension of rwork .. parameters . .. parameters . .. parameters . .. parameters . get grid parameters and local indexes get grid parameters get grid parameters and local indexes get grid parameters .. parameters . .. parameters . internal parameters internal parameters get grid parameters .. parameters . get grid parameters .. parameters . .. parameters . get grid parameters .. parameters . get grid parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . get grid parameters .. parameters . test the input parameters .. parameters . .. parameters . .. parameters . .. parameters . the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters .. parameters . .. parameters . internal parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters the following are restrictions on the input parameters. some of thes may reflect fundamental technical limitations. test the input parameters .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . .. parameters . test the input parameters .. parameters . .. parameters . test the input parameters test the input parameters .. parameters . test the input parameters .. parameters . sn (output) complex*16 parameters of the rotation matrix further details |
| params params pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec |
| paramters paramters test the input paramters test the input paramters test the input paramters test the input paramters |
| paranoid paranoid pivmin (input) double precision the minimum absolute of a "pivot" in the "paranoid least max_j |e(j)^2| *safe_min, and at least safe_min, where the innermost loop to avoid overflow and determine the sign of a floating point number. pdlapdct will be referred to as the "paranoid pivmin (input) real the minimum absolute of a "pivot" in the "paranoid least max_j |e(j)^2| *safe_min, and at least safe_min, where the innermost loop to avoid overflow and determine the sign of a floating point number. pslapdct will be referred to as the "paranoid |
| part part ccombamax1 finds the element having maximum real part absolut the active part of the matrix is partitione a11 a12 a13 before entry with uplo = 'u' or 'u', the leading n by n upper triangular part of the array t must contain the uppe t is not referenced. the active part of the matrix is partitione a11 a12 a13 before entry with uplo = 'u' or 'u', the leading n by n upper triangular part of the array t must contain the uppe t is not referenced. adjust addressing into matrix space to properly get into the beginning part of the relevant dat adjust addressing into matrix space to properly get into the beginning part of the relevant dat dl (local input/local output) complex pointer to local part of global vector storing the lower diagonal of th aligned with d. adjust addressing into matrix space to properly get into the beginning part of the relevant dat dl (local input/local output) complex pointer to local part of global vector storing the lower diagonal of th aligned with d. adjust addressing into matrix space to properly get into the beginning part of the relevant dat adjust addressing into matrix space to properly get into the beginning part of the relevant dat uplo (global input) character*1 specifies whether the upper or lower triangular part of th = 'u': upper triangular uplo (global input) character*1 specifies whether the upper or lower triangular part of th = 'u': upper triangular uplo (global input) character*1 specifies whether the upper or lower triangular part of th = 'u': upper triangular n-by-n hermitian distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contain lower triangular part is not referenced. if uplo = 'l', the n-by-n hermitian distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contain lower triangular part is not referenced. if uplo = 'l', the n-by-n hermitian distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contain leading n-by-n lower triangular part of sub( a ) contains n-by-n hermitian distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contain lower triangular part is not referenced. if uplo = 'l', the pchentrd is faster than pchetrd on almost all matrices, particularly small ones (i.e. n < 500 * sqrt(p) ), provided tha uplo (global input) character specifies whether the upper or lower triangular part of th = 'u': upper triangular uplo (global input) character specifies whether the upper or lower triangular part of th = 'u': upper triangular uplo (global input) character specifies whether the upper or lower triangular part of th = 'u': upper triangular returns the matrices x and y which are needed to apply the transfor- mation to the unreduced part of sub( a ) if m >= n, sub( a ) is reduced to upper bidiagonal form; if m < n, to (desca(lld_),*) on entry, the hessenberg matrix whose tridiagonal part i unchanged on exit. pclacp2 copies all or part of a distributed matrix a to anothe performs a local copy sub( a ) := sub( b ), where sub( a ) denotes pclacpy copies all or part of a distributed matrix a to anothe performs a local copy sub( a ) := sub( b ), where sub( a ) denotes more clever. we break each transformation down into 3 parts a group of rotn transformations (this is on v which is needed, with t and y, to apply the transformation to the unreduced part of the matrix, using an update of the form icurcol : process column containing diagonal block irsc0 : pointer to part of work used to store the rowsums whil irsr0 : pointer to part of work used to store the rowsums after icurcol : process column containing diagonal block irsc0 : pointer to part of work used to store the rowsums whil irsr0 : pointer to part of work used to store the rowsums after uplo (global input) character specifies whether the upper or lower triangular part of th = 'u': upper triangular the columns of the distributed submatrix sub( a ) containing the meaningful part of the householder reflectors uplo (global input) character specifies the part of the distributed matrix sub( a ) to b = 'u': upper triangular part is set; the strictly lower uplo (global input) character specifies the part of the distributed matrix sub( a ) to b = 'u': upper triangular part is set; the strictly lower a (global input) complex array, dimension (desca(lld_),*) on entry, the hessenberg matrix whose tridiagonal part i unchanged on exit. q' * sub( a ) * q, and returns the matrices v and w which are needed to apply the transformation to the unreduced part of sub( a ) if uplo = 'u', pclatrd reduces the last nb rows and columns of a the columns of the distributed submatrix sub( a ) containing the meaningful part of the householder reflectors. l > 0 a (local input/local output) complex pointer into the pclauu2 computes the product u * u' or l' * l, where the triangular factor u or l is stored in the upper or lower triangular part o pclauum computes the product u * u' or l' * l, where the triangular factor u or l is stored in the upper or lower triangular part o if incx = m_x, then sub( x ) is a vector distributed over a process row. each process part of this row receives the result if incx = 1, then sub( x ) is a vector distributed over a process adjust addressing into matrix space to properly get into the beginning part of the relevant dat adjust addressing into matrix space to properly get into the beginning part of the relevant dat uplo (global input) character*1 specifies whether the upper or lower triangular part of th = 'u': upper triangular n-by-n symmetric distributed matrix sub( a ) to be factored. if uplo = 'u', the leading n-by-n upper triangular part o and its strictly lower triangular part is not referenced. diag(sr)*a*diag(sc). if uplo = 'u', the leading n-by-n upper triangular part of a contains the uppe triangular part of a is not referenced. if uplo = 'l', the n-by-n symmetric distributed matrix sub( a ) to be factored. if uplo = 'u', the leading n-by-n upper triangular part o and its strictly lower triangular part is not referenced. n-by-n hermitian distributed matrix sub( a ) to be factored. if uplo = 'u', the leading n-by-n upper triangular part o and its strictly lower triangular part is not referenced. d (local input/local output) complex pointer to local part of global vector storing the main diagonal of th on exit, this array contains information containing the adjust addressing into matrix space to properly get into the beginning part of the relevant dat d (local input/local output) complex pointer to local part of global vector storing the main diagonal of th on exit, this array contains information containing the adjust addressing into matrix space to properly get into the beginning part of the relevant dat matrix a(ia:ia+n-1,ja:ja+n-1). if uplo = 'u', the leading n-by-n upper triangular part of this distributed matrix con triangular part is not referenced. if uplo = 'l', the distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part o and its strictly lower triangular part is not referenced. sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of the matrix sub( a ) contains the upper triangula is not referenced. if uplo = 'l', the leading n-by-n lower triangular matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of the matrix sub( a ) contain lower triangular part of sub( a ) is not referenced. matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contains the upper triangula is not referenced. if uplo = 'l', the leading n-by-n lower sub( a ) which is to be factored. on exit, the leading m-by-m upper triangular part of sub( a ) contains the upper trian sub( a ), with the array tau, represent the unitary matrix z the columns of the distributed submatrix sub( a ) containing the meaningful part of the householder reflectors the columns of the distributed submatrix sub( a ) containing the meaningful part of the householder reflectors adjust addressing into matrix space to properly get into the beginning part of the relevant dat adjust addressing into matrix space to properly get into the beginning part of the relevant dat dl (local input/local output) double precision pointer to local part of global vector storing the lower diagonal of th aligned with d. adjust addressing into matrix space to properly get into the beginning part of the relevant dat dl (local input/local output) double precision pointer to local part of global vector storing the lower diagonal of th aligned with d. adjust addressing into matrix space to properly get into the beginning part of the relevant dat adjust addressing into matrix space to properly get into the beginning part of the relevant dat and returns the matrices x and y which are needed to apply the transformation to the unreduced part of sub( a ) if m >= n, sub( a ) is reduced to upper bidiagonal form; if m < n, to (desca(lld_),*) on entry, the hessenberg matrix whose tridiagonal part i unchanged on exit. pdlacp2 copies all or part of a distributed matrix a to anothe performs a local copy sub( a ) := sub( b ), where sub( a ) denotes pdlacpy copies all or part of a distributed matrix a to anothe performs a local copy sub( a ) := sub( b ), where sub( a ) denotes v which is needed, with t and y, to apply the transformation to the unreduced part of the matrix, using an update of the form icurcol : process column containing diagonal block irsc0 : pointer to part of work used to store the rowsums whil irsr0 : pointer to part of work used to store the rowsums after uplo (global input) character specifies whether the upper or lower triangular part of th = 'u': upper triangular the columns of the distributed submatrix sub( a ) containing the meaningful part of the householder reflectors uplo (global input) character specifies the part of the distributed matrix sub( a ) to b = 'u': upper triangular part is set; the strictly lower uplo (global input) character specifies the part of the distributed matrix sub( a ) to b = 'u': upper triangular part is set; the strictly lower (desca(lld_),*) on entry, the hessenberg matrix whose tridiagonal part i unchanged on exit. and returns the matrices v and w which are needed to apply the transformation to the unreduced part of sub( a ) if uplo = 'u', pdlatrd reduces the last nb rows and columns of a the columns of the distributed submatrix sub( a ) containing the meaningful part of the householder reflectors. l > 0 a (local input/local output) double precision pointer into the pdlauu2 computes the product u * u' or l' * l, where the triangular factor u or l is stored in the upper or lower triangular part o pdlauum computes the product u * u' or l' * l, where the triangular factor u or l is stored in the upper or lower triangular part o the columns of the distributed submatrix sub( a ) containing the meaningful part of the householder reflectors the columns of the distributed submatrix sub( a ) containing the meaningful part of the householder reflectors adjust addressing into matrix space to properly get into the beginning part of the relevant dat adjust addressing into matrix space to properly get into the beginning part of the relevant dat uplo (global input) character*1 specifies whether the upper or lower triangular part of th = 'u': upper triangular n-by-n symmetric distributed matrix sub( a ) to be factored. if uplo = 'u', the leading n-by-n upper triangular part o and its strictly lower triangular part is not referenced. diag(sr)*a*diag(sc). if uplo = 'u', the leading n-by-n upper triangular part of a contains the uppe triangular part of a is not referenced. if uplo = 'l', the n-by-n symmetric distributed matrix sub( a ) to be factored. if uplo = 'u', the leading n-by-n upper triangular part o and its strictly lower triangular part is not referenced. n-by-n symmetric distributed matrix sub( a ) to be factored. if uplo = 'u', the leading n-by-n upper triangular part o and its strictly lower triangular part is not referenced. d (local input/local output) double precision pointer to local part of global vector storing the main diagonal of th on exit, this array contains information containing the adjust addressing into matrix space to properly get into the beginning part of the relevant dat d (local input/local output) double precision pointer to local part of global vector storing the main diagonal of th on exit, this array contains information containing the adjust addressing into matrix space to properly get into the beginning part of the relevant dat uplo (global input) character*1 specifies whether the upper or lower triangular part of th = 'u': upper triangular uplo (global input) character*1 specifies whether the upper or lower triangular part of th = 'u': upper triangular uplo (global input) character*1 specifies whether the upper or lower triangular part of th = 'u': upper triangular n-by-n symmetric distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contain lower triangular part is not referenced. if uplo = 'l', the n-by-n symmetric distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contain lower triangular part is not referenced. if uplo = 'l', the n-by-n symmetric distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contain leading n-by-n lower triangular part of sub( a ) contains n-by-n hermitian distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contain lower triangular part is not referenced. if uplo = 'l', the pdsyntrd is faster than pdsytrd on almost all matrices, particularly small ones (i.e. n < 500 * sqrt(p) ), provided tha uplo (global input) character specifies whether the upper or lower triangular part of th = 'u': upper triangular uplo (global input) character specifies whether the upper or lower triangular part of th = 'u': upper triangular uplo (global input) character specifies whether the upper or lower triangular part of th = 'u': upper triangular matrix a(ia:ia+n-1,ja:ja+n-1). if uplo = 'u', the leading n-by-n upper triangular part of this distributed matrix con triangular part is not referenced. if uplo = 'l', the distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part o and its strictly lower triangular part is not referenced. sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of the matrix sub( a ) contains the upper triangula is not referenced. if uplo = 'l', the leading n-by-n lower triangular matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of the matrix sub( a ) contain lower triangular part of sub( a ) is not referenced. matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contains the upper triangula is not referenced. if uplo = 'l', the leading n-by-n lower sub( a ) which is to be factored. on exit, the leading m-by-m upper triangular part of sub( a ) contains the upper trian sub( a ), with the array tau, represent the orthogonal matrix if incx = m_x, then sub( x ) is a vector distributed over a process row. each process part of this row receives the result if incx = 1, then sub( x ) is a vector distributed over a process if incx = m_x, then sub( x ) is a vector distributed over a process row. each process part of this row receives the result if incx = 1, then sub( x ) is a vector distributed over a process adjust addressing into matrix space to properly get into the beginning part of the relevant dat adjust addressing into matrix space to properly get into the beginning part of the relevant dat dl (local input/local output) real pointer to local part of global vector storing the lower diagonal of th aligned with d. adjust addressing into matrix space to properly get into the beginning part of the relevant dat dl (local input/local output) real pointer to local part of global vector storing the lower diagonal of th aligned with d. adjust addressing into matrix space to properly get into the beginning part of the relevant dat adjust addressing into matrix space to properly get into the beginning part of the relevant dat and returns the matrices x and y which are needed to apply the transformation to the unreduced part of sub( a ) if m >= n, sub( a ) is reduced to upper bidiagonal form; if m < n, to (desca(lld_),*) on entry, the hessenberg matrix whose tridiagonal part i unchanged on exit. pslacp2 copies all or part of a distributed matrix a to anothe performs a local copy sub( a ) := sub( b ), where sub( a ) denotes pslacpy copies all or part of a distributed matrix a to anothe performs a local copy sub( a ) := sub( b ), where sub( a ) denotes v which is needed, with t and y, to apply the transformation to the unreduced part of the matrix, using an update of the form icurcol : process column containing diagonal block irsc0 : pointer to part of work used to store the rowsums whil irsr0 : pointer to part of work used to store the rowsums after uplo (global input) character specifies whether the upper or lower triangular part of th = 'u': upper triangular the columns of the distributed submatrix sub( a ) containing the meaningful part of the householder reflectors uplo (global input) character specifies the part of the distributed matrix sub( a ) to b = 'u': upper triangular part is set; the strictly lower uplo (global input) character specifies the part of the distributed matrix sub( a ) to b = 'u': upper triangular part is set; the strictly lower (desca(lld_),*) on entry, the hessenberg matrix whose tridiagonal part i unchanged on exit. and returns the matrices v and w which are needed to apply the transformation to the unreduced part of sub( a ) if uplo = 'u', pslatrd reduces the last nb rows and columns of a the columns of the distributed submatrix sub( a ) containing the meaningful part of the householder reflectors. l > 0 a (local input/local output) real pointer into the pslauu2 computes the product u * u' or l' * l, where the triangular factor u or l is stored in the upper or lower triangular part o pslauum computes the product u * u' or l' * l, where the triangular factor u or l is stored in the upper or lower triangular part o the columns of the distributed submatrix sub( a ) containing the meaningful part of the householder reflectors the columns of the distributed submatrix sub( a ) containing the meaningful part of the householder reflectors adjust addressing into matrix space to properly get into the beginning part of the relevant dat adjust addressing into matrix space to properly get into the beginning part of the relevant dat uplo (global input) character*1 specifies whether the upper or lower triangular part of th = 'u': upper triangular n-by-n symmetric distributed matrix sub( a ) to be factored. if uplo = 'u', the leading n-by-n upper triangular part o and its strictly lower triangular part is not referenced. diag(sr)*a*diag(sc). if uplo = 'u', the leading n-by-n upper triangular part of a contains the uppe triangular part of a is not referenced. if uplo = 'l', the n-by-n symmetric distributed matrix sub( a ) to be factored. if uplo = 'u', the leading n-by-n upper triangular part o and its strictly lower triangular part is not referenced. n-by-n symmetric distributed matrix sub( a ) to be factored. if uplo = 'u', the leading n-by-n upper triangular part o and its strictly lower triangular part is not referenced. d (local input/local output) real pointer to local part of global vector storing the main diagonal of th on exit, this array contains information containing the adjust addressing into matrix space to properly get into the beginning part of the relevant dat d (local input/local output) real pointer to local part of global vector storing the main diagonal of th on exit, this array contains information containing the adjust addressing into matrix space to properly get into the beginning part of the relevant dat uplo (global input) character*1 specifies whether the upper or lower triangular part of th = 'u': upper triangular uplo (global input) character*1 specifies whether the upper or lower triangular part of th = 'u': upper triangular uplo (global input) character*1 specifies whether the upper or lower triangular part of th = 'u': upper triangular n-by-n symmetric distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contain lower triangular part is not referenced. if uplo = 'l', the n-by-n symmetric distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contain lower triangular part is not referenced. if uplo = 'l', the n-by-n symmetric distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contain leading n-by-n lower triangular part of sub( a ) contains n-by-n hermitian distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contain lower triangular part is not referenced. if uplo = 'l', the pssyntrd is faster than pssytrd on almost all matrices, particularly small ones (i.e. n < 500 * sqrt(p) ), provided tha uplo (global input) character specifies whether the upper or lower triangular part of th = 'u': upper triangular uplo (global input) character specifies whether the upper or lower triangular part of th = 'u': upper triangular uplo (global input) character specifies whether the upper or lower triangular part of th = 'u': upper triangular matrix a(ia:ia+n-1,ja:ja+n-1). if uplo = 'u', the leading n-by-n upper triangular part of this distributed matrix con triangular part is not referenced. if uplo = 'l', the distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part o and its strictly lower triangular part is not referenced. sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of the matrix sub( a ) contains the upper triangula is not referenced. if uplo = 'l', the leading n-by-n lower triangular matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of the matrix sub( a ) contain lower triangular part of sub( a ) is not referenced. matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contains the upper triangula is not referenced. if uplo = 'l', the leading n-by-n lower sub( a ) which is to be factored. on exit, the leading m-by-m upper triangular part of sub( a ) contains the upper trian sub( a ), with the array tau, represent the orthogonal matrix adjust addressing into matrix space to properly get into the beginning part of the relevant dat adjust addressing into matrix space to properly get into the beginning part of the relevant dat dl (local input/local output) complex*16 pointer to local part of global vector storing the lower diagonal of th aligned with d. adjust addressing into matrix space to properly get into the beginning part of the relevant dat dl (local input/local output) complex*16 pointer to local part of global vector storing the lower diagonal of th aligned with d. adjust addressing into matrix space to properly get into the beginning part of the relevant dat adjust addressing into matrix space to properly get into the beginning part of the relevant dat uplo (global input) character*1 specifies whether the upper or lower triangular part of th = 'u': upper triangular uplo (global input) character*1 specifies whether the upper or lower triangular part of th = 'u': upper triangular uplo (global input) character*1 specifies whether the upper or lower triangular part of th = 'u': upper triangular n-by-n hermitian distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contain lower triangular part is not referenced. if uplo = 'l', the n-by-n hermitian distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contain lower triangular part is not referenced. if uplo = 'l', the n-by-n hermitian distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contain leading n-by-n lower triangular part of sub( a ) contains n-by-n hermitian distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contain lower triangular part is not referenced. if uplo = 'l', the pzhentrd is faster than pzhetrd on almost all matrices, particularly small ones (i.e. n < 500 * sqrt(p) ), provided tha uplo (global input) character specifies whether the upper or lower triangular part of th = 'u': upper triangular uplo (global input) character specifies whether the upper or lower triangular part of th = 'u': upper triangular uplo (global input) character specifies whether the upper or lower triangular part of th = 'u': upper triangular returns the matrices x and y which are needed to apply the transfor- mation to the unreduced part of sub( a ) if m >= n, sub( a ) is reduced to upper bidiagonal form; if m < n, to (desca(lld_),*) on entry, the hessenberg matrix whose tridiagonal part i unchanged on exit. pzlacp2 copies all or part of a distributed matrix a to anothe performs a local copy sub( a ) := sub( b ), where sub( a ) denotes pzlacpy copies all or part of a distributed matrix a to anothe performs a local copy sub( a ) := sub( b ), where sub( a ) denotes more clever. we break each transformation down into 3 parts a group of rotn transformations (this is on v which is needed, with t and y, to apply the transformation to the unreduced part of the matrix, using an update of the form icurcol : process column containing diagonal block irsc0 : pointer to part of work used to store the rowsums whil irsr0 : pointer to part of work used to store the rowsums after icurcol : process column containing diagonal block irsc0 : pointer to part of work used to store the rowsums whil irsr0 : pointer to part of work used to store the rowsums after uplo (global input) character specifies whether the upper or lower triangular part of th = 'u': upper triangular the columns of the distributed submatrix sub( a ) containing the meaningful part of the householder reflectors uplo (global input) character specifies the part of the distributed matrix sub( a ) to b = 'u': upper triangular part is set; the strictly lower uplo (global input) character specifies the part of the distributed matrix sub( a ) to b = 'u': upper triangular part is set; the strictly lower a (global input) complex*16 array, dimension (desca(lld_),*) on entry, the hessenberg matrix whose tridiagonal part i unchanged on exit. q' * sub( a ) * q, and returns the matrices v and w which are needed to apply the transformation to the unreduced part of sub( a ) if uplo = 'u', pzlatrd reduces the last nb rows and columns of a the columns of the distributed submatrix sub( a ) containing the meaningful part of the householder reflectors. l > 0 a (local input/local output) complex*16 pointer into the pzlauu2 computes the product u * u' or l' * l, where the triangular factor u or l is stored in the upper or lower triangular part o pzlauum computes the product u * u' or l' * l, where the triangular factor u or l is stored in the upper or lower triangular part o if incx = m_x, then sub( x ) is a vector distributed over a process row. each process part of this row receives the result if incx = 1, then sub( x ) is a vector distributed over a process adjust addressing into matrix space to properly get into the beginning part of the relevant dat adjust addressing into matrix space to properly get into the beginning part of the relevant dat uplo (global input) character*1 specifies whether the upper or lower triangular part of th = 'u': upper triangular n-by-n symmetric distributed matrix sub( a ) to be factored. if uplo = 'u', the leading n-by-n upper triangular part o and its strictly lower triangular part is not referenced. diag(sr)*a*diag(sc). if uplo = 'u', the leading n-by-n upper triangular part of a contains the uppe triangular part of a is not referenced. if uplo = 'l', the n-by-n symmetric distributed matrix sub( a ) to be factored. if uplo = 'u', the leading n-by-n upper triangular part o and its strictly lower triangular part is not referenced. n-by-n hermitian distributed matrix sub( a ) to be factored. if uplo = 'u', the leading n-by-n upper triangular part o and its strictly lower triangular part is not referenced. d (local input/local output) complex*16 pointer to local part of global vector storing the main diagonal of th on exit, this array contains information containing the adjust addressing into matrix space to properly get into the beginning part of the relevant dat d (local input/local output) complex*16 pointer to local part of global vector storing the main diagonal of th on exit, this array contains information containing the adjust addressing into matrix space to properly get into the beginning part of the relevant dat matrix a(ia:ia+n-1,ja:ja+n-1). if uplo = 'u', the leading n-by-n upper triangular part of this distributed matrix con triangular part is not referenced. if uplo = 'l', the distributed matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part o and its strictly lower triangular part is not referenced. sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of the matrix sub( a ) contains the upper triangula is not referenced. if uplo = 'l', the leading n-by-n lower triangular matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of the matrix sub( a ) contain lower triangular part of sub( a ) is not referenced. matrix sub( a ). if uplo = 'u', the leading n-by-n upper triangular part of sub( a ) contains the upper triangula is not referenced. if uplo = 'l', the leading n-by-n lower sub( a ) which is to be factored. on exit, the leading m-by-m upper triangular part of sub( a ) contains the upper trian sub( a ), with the array tau, represent the unitary matrix z the columns of the distributed submatrix sub( a ) containing the meaningful part of the householder reflectors the columns of the distributed submatrix sub( a ) containing the meaningful part of the householder reflectors the active part of the matrix is partitione a11 a12 a13 before entry with uplo = 'u' or 'u', the leading n by n upper triangular part of the array t must contain the uppe t is not referenced. zcombamax1 finds the element having maximum real part absolut the active part of the matrix is partitione a11 a12 a13 before entry with uplo = 'u' or 'u', the leading n by n upper triangular part of the array t must contain the uppe t is not referenced. |
| partial partial cdbtrf computes an lu factorization of a real m-by-n band matrix a without using partial pivoting with row interchanges this is the unblocked version of the algorithm, calling level 2 blas. cdttrf computes an lu factorization of a complex tridiagonal matrix a using elimination without partial pivoting the factorization has the form ddbtrf computes an lu factorization of a real m-by-n band matrix a without using partial pivoting with row interchanges this is the unblocked version of the algorithm, calling level 2 blas. ddttrf computes an lu factorization of a complex tridiagonal matrix a using elimination without partial pivoting the factorization has the form compute lu factors with partial pivoting ( pt = lu lu factorization with partial pivotin the lu decomposition with partial pivoting and row interchanges i tation matrix, l is unit lower triangular, and u is upper triangular. distributed matrix sub( a ) = a(ia:ia+m-1,ja:ja+n-1) using partial pivoting with row interchanges the factorization has the form sub( a ) = p * l * u, where p is a pcgetrf computes an lu factorization of a general m-by-n distributed matrix sub( a ) = (ia:ia+m-1,ja:ja+n-1) using partial pivoting wit lu factorization with partial pivotin the lu decomposition with partial pivoting and row interchanges i tation matrix, l is unit lower triangular, and u is upper triangular. distributed matrix sub( a ) = a(ia:ia+m-1,ja:ja+n-1) using partial pivoting with row interchanges the factorization has the form sub( a ) = p * l * u, where p is a pdgetrf computes an lu factorization of a general m-by-n distributed matrix sub( a ) = (ia:ia+m-1,ja:ja+n-1) using partial pivoting wit lu factorization with partial pivotin the lu decomposition with partial pivoting and row interchanges i tation matrix, l is unit lower triangular, and u is upper triangular. distributed matrix sub( a ) = a(ia:ia+m-1,ja:ja+n-1) using partial pivoting with row interchanges the factorization has the form sub( a ) = p * l * u, where p is a psgetrf computes an lu factorization of a general m-by-n distributed matrix sub( a ) = (ia:ia+m-1,ja:ja+n-1) using partial pivoting wit lu factorization with partial pivotin the lu decomposition with partial pivoting and row interchanges i tation matrix, l is unit lower triangular, and u is upper triangular. distributed matrix sub( a ) = a(ia:ia+m-1,ja:ja+n-1) using partial pivoting with row interchanges the factorization has the form sub( a ) = p * l * u, where p is a pzgetrf computes an lu factorization of a general m-by-n distributed matrix sub( a ) = (ia:ia+m-1,ja:ja+n-1) using partial pivoting wit sdbtrf computes an lu factorization of a real m-by-n band matrix a without using partial pivoting with row interchanges this is the unblocked version of the algorithm, calling level 2 blas. sdttrf computes an lu factorization of a complex tridiagonal matrix a using elimination without partial pivoting the factorization has the form compute lu factors with partial pivoting ( pt = lu zdbtrf computes an lu factorization of a real m-by-n band matrix a without using partial pivoting with row interchanges this is the unblocked version of the algorithm, calling level 2 blas. zdttrf computes an lu factorization of a complex tridiagonal matrix a using elimination without partial pivoting the factorization has the form |
| participate participate the last processor does not participate in the factorization o the last processor does not participate in the solution of th the last processor does not participate in the factorization o the last processor does not participate in the solution of th the last processor does not participate in the factorization o the last processor does not participate in the solution of th the last processor does not participate in the factorization o the last processor does not participate in the solution of th the last processor does not participate in the factorization o the last processor does not participate in the solution of th the last processor does not participate in the factorization o the last processor does not participate in the solution of th the last processor does not participate in the factorization o the last processor does not participate in the solution of th the last processor does not participate in the factorization o the last processor does not participate in the solution of th the last processor does not participate in the factorization o the last processor does not participate in the solution of th the last processor does not participate in the factorization o the last processor does not participate in the solution of th the last processor does not participate in the factorization o the last processor does not participate in the solution of th the last processor does not participate in the factorization o the last processor does not participate in the solution of th the last processor does not participate in the factorization o the last processor does not participate in the solution of th the last processor does not participate in the factorization o the last processor does not participate in the solution of th the last processor does not participate in the factorization o the last processor does not participate in the solution of th the last processor does not participate in the factorization o the last processor does not participate in the solution of th |
| particular particular a(ia:ia+n-1,ja:ja+n-1) after equilibration (if done). if rcond is less than the machine precision (in particular, i this condition is indicated by a return code of info > 0. in particular, if sub( b ) is square and nonsingular, the gq factorization of inv( sub( b ) )* sub( a ): in particular, if sub( b ) is square and nonsingular, the gr factorization of sub( a )*inv( sub( b ) ): a after equilibration (if done). if rcond is less than the machine precision (in particular, if rcond = 0), the matri indicated by a return code of info > 0, and the solution and because vectors may be seen as particular matrices, a distribute a(ia:ia+n-1,ja:ja+n-1) after equilibration (if done). if rcond is less than the machine precision (in particular, i this condition is indicated by a return code of info > 0. in particular, if sub( b ) is square and nonsingular, the gq factorization of inv( sub( b ) )* sub( a ): in particular, if sub( b ) is square and nonsingular, the gr factorization of sub( a )*inv( sub( b ) ): a after equilibration (if done). if rcond is less than the machine precision (in particular, if rcond = 0), the matri indicated by a return code of info > 0, and the solution and because vectors may be seen as particular matrices, a distribute computers. users are encouraged to modify this subroutine to set the tuning parameters for their particular machine using the optio a(ia:ia+n-1,ja:ja+n-1) after equilibration (if done). if rcond is less than the machine precision (in particular, i this condition is indicated by a return code of info > 0. in particular, if sub( b ) is square and nonsingular, the gq factorization of inv( sub( b ) )* sub( a ): in particular, if sub( b ) is square and nonsingular, the gr factorization of sub( a )*inv( sub( b ) ): a after equilibration (if done). if rcond is less than the machine precision (in particular, if rcond = 0), the matri indicated by a return code of info > 0, and the solution and because vectors may be seen as particular matrices, a distribute because vectors may be seen as particular matrices, a distribute a(ia:ia+n-1,ja:ja+n-1) after equilibration (if done). if rcond is less than the machine precision (in particular, i this condition is indicated by a return code of info > 0. in particular, if sub( b ) is square and nonsingular, the gq factorization of inv( sub( b ) )* sub( a ): in particular, if sub( b ) is square and nonsingular, the gr factorization of sub( a )*inv( sub( b ) ): a after equilibration (if done). if rcond is less than the machine precision (in particular, if rcond = 0), the matri indicated by a return code of info > 0, and the solution and |
| particularly particularly pchentrd is faster than pchetrd on almost all matrices, particularly small ones (i.e. n < 500 * sqrt(p) ), provided tha pdsyntrd is faster than pdsytrd on almost all matrices, particularly small ones (i.e. n < 500 * sqrt(p) ), provided tha pssyntrd is faster than pssytrd on almost all matrices, particularly small ones (i.e. n < 500 * sqrt(p) ), provided tha pzhentrd is faster than pzhetrd on almost all matrices, particularly small ones (i.e. n < 500 * sqrt(p) ), provided tha |
| partition partition partition d( start:endd ) and stack parts, largest one firs choose partition entry as median of 3 partition d( start:endd ) and stack parts, largest one firs choose partition entry as median of 3 user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz user-input value of partition siz partition d( start:endd ) and stack parts, largest one firs choose partition entry as median of 3 partition d( start:endd ) and stack parts, largest one firs choose partition entry as median of 3 |
| partitioned partitioned the active part of the matrix is partitioned a11 a12 a13 the active part of the matrix is partitioned a11 a12 a13 the active part of the matrix is partitioned a11 a12 a13 the active part of the matrix is partitioned a11 a12 a13 |
| partitioning partitioning which is about to be factorized. the number of rows in the partitioning are jb, i2, i3 respectively, and the number and the subdiagonal elements of a31 lie outside the band. which is about to be factorized. the number of rows in the partitioning are jb, i2, i3 respectively, and the number and the subdiagonal elements of a31 lie outside the band. implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithms are the appropriate choice. implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithms are the appropriate choice. implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithms are the appropriate choice. implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithms are the appropriate choice. implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithms are the appropriate choice. implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithms are the appropriate choice. implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithms are the appropriate choice. implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithms are the appropriate choice. the interval [vl, vu], or the eigenvalues indexed il through iu. a static partitioning of work is done at the beginning of pdstebz whic eigenvalues. implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithms are the appropriate choice. implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithms are the appropriate choice. implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithms are the appropriate choice. implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithms are the appropriate choice. the interval [vl, vu], or the eigenvalues indexed il through iu. a static partitioning of work is done at the beginning of psstebz whic eigenvalues. implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithms are the appropriate choice. implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithms are the appropriate choice. implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithm description: divide and conquer implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithms are the appropriate choice. implemented. these go by many names, including divide and conquer, partitioning, domain decomposition-type, etc algorithms are the appropriate choice. which is about to be factorized. the number of rows in the partitioning are jb, i2, i3 respectively, and the number and the subdiagonal elements of a31 lie outside the band. which is about to be factorized. the number of rows in the partitioning are jb, i2, i3 respectively, and the number and the subdiagonal elements of a31 lie outside the band. |
| parts parts partition d( start:endd ) and stack parts, largest one firs choose partition entry as median of 3 partition d( start:endd ) and stack parts, largest one firs choose partition entry as median of 3 the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: more clever. we break each transformation down into 3 parts a group of rotn transformations (this is on the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: proc (iqrow, iqcol) receive the parts of z more clever. we break each transformation down into 3 parts a group of rotn transformations (this is on the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: proc (iqrow, iqcol) receive the parts of z more clever. we break each transformation down into 3 parts a group of rotn transformations (this is on the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: more clever. we break each transformation down into 3 parts a group of rotn transformations (this is on the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: the matrices factored on each processor. the factors of these submatrices overwrite the corresponding parts of 2) reduced system phase: partition d( start:endd ) and stack parts, largest one firs choose partition entry as median of 3 partition d( start:endd ) and stack parts, largest one firs choose partition entry as median of 3 |
| pass pass for example if the row pivots should be applied to the columns of sub( a ), pass rowcol='c' and pivroc='c' notes the routine makes only one pass through the vector sub( x ) notes for example if the row pivots should be applied to the columns of sub( a ), pass rowcol='c' and pivroc='c' notes the routine makes only one pass through the vector sub( x ) notes for example if the row pivots should be applied to the columns of sub( a ), pass rowcol='c' and pivroc='c' notes the routine makes only one pass through the vector sub( x ) notes for example if the row pivots should be applied to the columns of sub( a ), pass rowcol='c' and pivroc='c' notes the routine makes only one pass through the vector sub( x ) notes |
| passed passed scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same work is done on that. at the end of the border, the data is passed back and everything stays a lot simpler scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same the first k values of the final deflation-altered z-vector which will be passed to slaed3 q2 (output) double precision array, dimension (ldq2, nq) the first k values of the final deflation-altered z-vector which will be passed to slaed3 z (global input) double precision array, dimension (n) work is done on that. at the end of the border, the data is passed back and everything stays a lot simpler scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same first, n2 second, and so on, and unused problem dimensions are passed a value of -1 in the calling subroutine. for example, pjlaenv is used to scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same the first k values of the final deflation-altered z-vector which will be passed to slaed3 q2 (output) real array, dimension (ldq2, nq) the first k values of the final deflation-altered z-vector which will be passed to slaed3 z (global input) real array, dimension (n) work is done on that. at the end of the border, the data is passed back and everything stays a lot simpler scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same work is done on that. at the end of the border, the data is passed back and everything stays a lot simpler scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same scalapack routines *do not support intercontext operations* so that the grid passed to a single scalapack routine *must be the same |
| passes passes needs, they will be sent and received. then the next major loop passes over the data and searches for two consecutiv needs, they will be sent and received. then the next major loop passes over the data and searches for two consecutiv needs, they will be sent and received. then the next major loop passes over the data and searches for two consecutiv needs, they will be sent and received. then the next major loop passes over the data and searches for two consecutiv |
| past past however, because there are many bulges, k1(ki) & k2(ki) might go past that range while later bulges (ki+1,ki+2,etc..) ar communication sometimes k1(ki)=hbl-2 & k2(ki)=hbl-1 so both however, because there are many bulges, k1(ki) & k2(ki) might go past that range while later bulges (ki+1,ki+2,etc..) ar however, because there are many bulges, k1(ki) & k2(ki) might go past that range while later bulges (ki+1,ki+2,etc..) ar however, because there are many bulges, k1(ki) & k2(ki) might go past that range while later bulges (ki+1,ki+2,etc..) ar communication sometimes k1(ki)=hbl-2 & k2(ki)=hbl-1 so both |
| path path a group of rotn transformations (this is on the critical path.) (loops 50-120 2.) the small work it takes so that each of the rows a group of rotn transformations (this is on the critical path.) (loops 130-180 and columns is at the same place. for example, = 3: the algorithmic blocking factor; = 4: execution path control a group of rotn transformations (this is on the critical path.) (loops 130-180 and columns is at the same place. for example, a group of rotn transformations (this is on the critical path.) (loops 50-120 2.) the small work it takes so that each of the rows |
| patterned patterned pjlaenv is patterned after ilaenv and keeps the same interface i used at present in scalapack. most scalapack codes use the input |
| PBLAS PBLAS compute a bound on the computed solution vector to see if the level 2 PBLAS routine pctrsv can be used this is the blocked form of the algorithm, calling level 3 PBLAS notes when the result of a vector-oriented PBLAS call is a scalar, it wil being operated on. let x be a generic term for the input vector(s). this is the blocked form of the algorithm, calling level 3 PBLAS notes based on pdzasum from the level 1 PBLAS. the change i based on pscasum from the level 1 PBLAS. the change i this is the blocked form of the algorithm, calling level 3 PBLAS notes compute a bound on the computed solution vector to see if the level 2 PBLAS routine pztrsv can be used this is the blocked form of the algorithm, calling level 3 PBLAS notes when the result of a vector-oriented PBLAS call is a scalar, it wil being operated on. let x be a generic term for the input vector(s). |
| PCAMAX PCAMAX based on PCAMAX from level 1 pblas. the change is to use th |
| PCDBSV PCDBSV PCDBSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PCDBTRF PCDBTRF see PCDBTRF and pcdbtrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PCDBTRF banded diagonally dominant-like distributed |
| PCDBTRS PCDBTRS see pcdbtrf and PCDBTRS for details ===================================================================== PCDBTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PCDOTU PCDOTU if the scaling needed for a in the dot product is 1, call PCDOTU to perform the dot product |
| PCDTSV PCDTSV PCDTSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PCDTTRF PCDTTRF see PCDTTRF and pcdttrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PCDTTRF tridiagonal diagonally dominant-like distributed |
| PCDTTRS PCDTTRS see pcdttrf and PCDTTRS for details ===================================================================== PCDTTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PCGBSV PCGBSV PCGBSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PCGBTRF PCGBTRF see PCGBTRF and pcgbtrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PCGBTRF banded distributed |
| PCGBTRS PCGBTRS see pcgbtrf and PCGBTRS for details ===================================================================== PCGBTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PCGEBD2 PCGEBD2 PCGEBD2 reduces a complex general m-by-n distributed matri form b by an unitary transformation: q' * sub( a ) * p = b. |
| PCGEBRD PCGEBRD PCGEBRD reduces a complex general m-by-n distributed matri form b by an unitary transformation: q' * sub( a ) * p = b. watobd = max(max(wpclange,wPCGEBRD) this is an auxiliary routine called by PCGEBRD notes here q and p**h are the unitary distributed matrices determined by PCGEBRD when reducing a complex distributed matrix a(ia:*,ja:*) t as products of elementary reflectors h(i) and g(i) respectively. |
| PCGECON PCGECON PCGECON estimates the reciprocal of the condition number of a genera 1-norm or the infinity-norm, using the lu factorization computed by lwork is local input and must be at least lwork = max( PCGECON( lwork ), pcgerfs( lwork ) |
| PCGEEQU PCGEEQU PCGEEQU computes row and column scalings intended to equilibrate a reduce its condition number. r returns the row scale factors and c |
| PCGEHD2 PCGEHD2 PCGEHD2 reduces a complex general distributed matrix sub( a q' * sub( a ) * q = h, where |
| PCGEHRD PCGEHRD PCGEHRD reduces a complex general distributed matrix sub( a q' * sub( a ) * q = h, where this is an auxiliary routine called by PCGEHRD. in the followin nq = m if side = 'l' and nq = n if side = 'r'. q is defined as the product of ihi-ilo elementary reflectors, as returned by PCGEHRD q = h(ilo) h(ilo+1) . . . h(ihi-1). |
| PCGELQ2 PCGELQ2 PCGELQ2 computes a lq factorization of a complex distributed m-by- |
| PCGELQF PCGELQF PCGELQF computes a lq factorization of a complex distributed m-by- if m < n, sub( a ) is overwritten by details of its lq factorization as returned by PCGELQF ia (global input) integer as returned by PCGELQF notes as returned by PCGELQF notes as returned by PCGELQF. q is of order m if side = 'l' and of order as returned by PCGELQF. q is of order m if side = 'l' and of order |
| PCGELS PCGELS PCGELS solves overdetermined or underdetermined complex linea or its conjugate-transpose, using a qr or lq factorization of |
| PCGEMR2D PCGEMR2D although all processes call PCGEMR2D, only the processes that ow first column of b receive data. the calls to cgebs2d/cgebr2d |
| PCGEQL2 PCGEQL2 PCGEQL2 computes a ql factorization of a complex distributed m-by- |
| PCGEQLF PCGEQLF PCGEQLF computes a ql factorization of a complex distributed m-by- as returned by PCGEQLF notes as returned by PCGEQLF notes as returned by PCGEQLF. q is of order m if side = 'l' and of order as returned by PCGEQLF. q is of order m if side = 'l' and of order |
| PCGEQPF PCGEQPF PCGEQPF computes a qr factorization with column pivoting of |
| PCGEQR2 PCGEQR2 PCGEQR2 computes a qr factorization of a complex distributed m-by- |
| PCGEQRF PCGEQRF if m >= n, sub( a ) is overwritten by details of its qr factorization as returned by PCGEQRF factorization as returned by pcgelqf. PCGEQRF computes a qr factorization of a complex distributed m-by- as returned by PCGEQRF notes as returned by PCGEQRF notes as returned by PCGEQRF. q is of order m if side = 'l' and of order as returned by PCGEQRF. q is of order m if side = 'l' and of order |
| PCGERFS PCGERFS PCGERFS improves the computed solution to a system of linea the solutions. lwork is local input and must be at least lwork = max( pcgecon( lwork ), PCGERFS( lwork ) |
| PCGERQ2 PCGERQ2 PCGERQ2 computes a rq factorization of a complex distributed m-by- |
| PCGERQF PCGERQF PCGERQF computes a rq factorization of a complex distributed m-by- as returned by PCGERQF notes as returned by PCGERQF notes as returned by PCGERQF. q is of order m if side = 'l' and of order as returned by PCGERQF. q is of order m if side = 'l' and of order |
| PCGESV PCGESV PCGESV computes the solution to a complex system of linear equation sub( a ) * x = sub( b ), |
| PCGESVD PCGESVD PCGESVD computes the singular value decomposition (svd) of a singular vectors. the svd is written as |
| PCGESVX PCGESVX PCGESVX uses the lu factorization to compute the solution to |
| PCGETF2 PCGETF2 PCGETF2 computes an lu factorization of a general m-by- partial pivoting with row interchanges. |
| PCGETRF PCGETRF 1-norm or the infinity-norm, using the lu factorization computed by PCGETRF an estimate is obtained for norm(inv(a(ia:ia+n-1,ja:ja+n-1))), and factors of the matrix sub( a ) = p * l * u as computed by PCGETRF iaf (global input) integer entry contains the factors l and u from the factorization a(ia:ia+n-1,ja:ja+n-1) = p*l*u as computed by PCGETRF equilibrated matrix a(ia:ia+n-1,ja:ja+n-1). PCGETRF computes an lu factorization of a general m-by-n distribute row interchanges. pcgetri computes the inverse of a distributed matrix using the lu factorization computed by PCGETRF. this method inverts u and the inva by solving the system inva*l = inv(u) for inva. with a general n-by-n distributed matrix sub( a ) using the lu factorization computed by PCGETRF and sub( b ) denotes b(ib:ib+n-1,jb:jb+nrhs-1). |
| PCGETRI PCGETRI PCGETRI computes the inverse of a distributed matrix using the l computes the inverse of sub( a ) = a(ia:ia+n-1,ja:ja+n-1) denoted |
| PCGETRS PCGETRS PCGETRS solves a system of distributed linear equation op( sub( a ) ) * x = sub( b ) |
| PCGGQRF PCGGQRF PCGGQRF computes a generalized qr factorization o an n-by-p matrix sub( b ) = b(ib:ib+n-1,jb:jb+p-1): |
| PCGGRQF PCGGRQF PCGGRQF computes a generalized rq factorization o and a p-by-n matrix sub( b ) = b(ib:ib+p-1,jb:jb+n-1): |
| PCHEEV PCHEEV PCHEEV computes selected eigenvalues and, optionally, eigenvector of scalapack routines. |
| PCHEEVD PCHEEVD PCHEEVD computes all the eigenvalues and eigenvectors of a hermitia |
| PCHEEVX PCHEEVX PCHEEVX computes selected eigenvalues and, optionally, eigenvector of scalapack routines. eigenvalues/vectors can be selected by |
| PCHEGS2 PCHEGS2 PCHEGS2 reduces a complex hermitian-definite generalized eigenproble |
| PCHEGST PCHEGST PCHEGST reduces a complex hermitian-definite generalized eigenproble pchengst performs the same function as PCHEGST, but is based o triangular solves (the basis of pchengst). |
| PCHEGVX PCHEGVX PCHEGVX computes all the eigenvalues, and optionally of a complex generalized hermitian-definite eigenproblem, of the form |
| PCHENGST PCHENGST PCHENGST reduces a complex hermitian-definite generalize |
| PCHENTRD PCHENTRD PCHENTRD is a prototype version of pchetrd which uses tailore when the workspace provided by the user is adequate. |
| PCHEPTRD PCHEPTRD pclamr1d has not been tested except withint the contect of PCHEPTRD, the prototype reduction to tridiagonal form code purpose |
| PCHETD2 PCHETD2 PCHETD2 reduces a complex hermitian matrix sub( a ) to hermitia q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1). |
| PCHETRD PCHETRD support for uplo='u' is limited to calling the old, slow, PCHETRD PCHETRD reduces a complex hermitian matrix sub( a ) to hermitia q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1). pchettrd is not intended to be called directly. all users are encourage to call PCHETRD which will then call pchettrd i the process grid must be square ( i.e. nprow = npcol ) and this is an auxiliary routine called by PCHETRD to redistribute d, this is an auxiliary routine called by PCHETRD notes nq = m if side = 'l' and nq = n if side = 'r'. q is defined as the product of nq-1 elementary reflectors, as returned by PCHETRD if uplo = 'u', q = h(nq-1) . . . h(2) h(1); |
| PCHETTRD PCHETTRD ictxt = desca( ctxt_ ) anb = pjlaenv( ictxt, 3, 'PCHETTRD', 'l', 0, 0, 0, 0 nps = max( numroc( n, 1, 0, 0, sqnpc ), 2*anb ) ictxt = desca( ctxt_ ) anb = pjlaenv( ictxt, 3, 'PCHETTRD', 'l', 0, 0, 0, 0 nps = max( numroc( n, 1, 0, 0, sqnpc ), 2*anb ) pchentrd is a prototype version of pchetrd which uses tailored codes (either the serial, chetrd, or the parallel code, PCHETTRD PCHETTRD reduces a complex hermitian matrix sub( a ) to hermitia q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1). |
| PCLABRD PCLABRD PCLABRD reduces the first nb rows and columns of a complex genera or lower bidiagonal form by an unitary transformation q' * a * p, and |
| PCLACGV PCLACGV PCLACGV conjugates a complex vector of length n, sub( x ), wher x(ix:ix+n-1,jx) if incx = 1, and |
| PCLACON PCLACON PCLACON estimates the 1-norm of a square, complex distributed matri products. x and v are aligned with the distributed matrix a, this |
| PCLACONSB PCLACONSB PCLACONSB looks for two consecutive small subdiagonal elements b given by h44, h33, & h43h34 and see if this would make a look for two consecutive small subdiagonal elements: PCLACONSB is the routine that does this |
| PCLACP2 PCLACP2 PCLACP2 copies all or part of a distributed matrix a to anothe performs a local copy sub( a ) := sub( b ), where sub( a ) denotes |
| PCLACP3 PCLACP3 PCLACP3 is an auxiliary routine that copies from a global paralle the entire submatrix that is copied gets placed on one node or |
| PCLACPY PCLACPY PCLACPY copies all or part of a distributed matrix a to anothe performs a local copy sub( a ) := sub( b ), where sub( a ) denotes |
| PCLAEVSWP PCLAEVSWP PCLAEVSWP moves the eigenvectors (potentially unsorted) fro array, sorted so that the corresponding eigenvalues are sorted. |
| PCLAHRD PCLAHRD PCLAHRD reduces the first nb columns of a complex genera elements below the k-th subdiagonal are zero. the reduction is |
| PCLAMR1D PCLAMR1D PCLAMR1D has not been tested except withint the contect o |
| PCLANGE PCLANGE watobd = max(max(wPCLANGE,wpcgebrd) PCLANGE returns the value of the one norm, or the frobenius norm distributed matrix sub( a ) = a(ia:ia+m-1, ja:ja+n-1). |
| PCLAPIV PCLAPIV PCLAPIV applies either p (permutation matrix indicated by ipiv sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column same mb (or nb) block. if you want to pivot a full matrix, use PCLAPIV notes |
| PCLAPV2 PCLAPV2 PCLAPV2 applies either p (permutation matrix indicated by ipiv a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the |
| PCLAQGE PCLAQGE PCLAQGE equilibrates a general m-by-n distributed matri factors in the vectors r and c. |
| PCLAQSY PCLAQSY PCLAQSY equilibrates a symmetric distributed matri vectors sr and sc. |
| PCLARFB PCLARFB PCLARFB applies a complex block reflector q or its conjugat denoting c(ic:ic+m-1,jc:jc+n-1), from the left or the right. |
| PCLARFG PCLARFG PCLARFG generates a complex elementary reflector h of order n, suc |
| PCLARFT PCLARFT PCLARFT forms the triangular factor t of a complex block reflector |
| PCLARZB PCLARZB PCLARZB applies a complex block reflector q or its conjugat denoting c(ic:ic+m-1,jc:jc+n-1), from the left or the right. |
| PCLARZT PCLARZT PCLARZT forms the triangular factor t of a complex block reflecto reflectors as returned by pctzrzf. |
| PCLASCL PCLASCL PCLASCL multiplies the m-by-n complex distributed matrix sub( a is done without over/underflow as long as the final result |
| PCLASE2 PCLASE2 PCLASE2 initializes an m-by-n distributed matrix sub( a ) denotin offdiagonals. pclase2 requires that only dimension of the matrix |
| PCLASET PCLASET PCLASET initializes an m-by-n distributed matrix sub( a ) denotin offdiagonals. |
| PCLASMSUB PCLASMSUB PCLASMSUB looks for a small subdiagonal element from the botto |
| PCLASSQ PCLASSQ PCLASSQ returns the values scl and smsq such tha ( scl**2 )*smsq = x( 1 )**2 +...+ x( n )**2 + ( scale**2 )*sumsq, |
| PCLASWP PCLASWP PCLASWP performs a series of row or column interchanges o interchange is initiated for each of rows or columns k1 trough k2 of |
| PCLATRA PCLATRA PCLATRA computes the trace of an n-by-n distributed matrix sub( a process of the grid. |
| PCLATRD PCLATRD PCLATRD reduces nb rows and columns of a complex hermitia tridiagonal form by an unitary similarity transformation |
| PCLATRZ PCLATRZ PCLATRZ reduces the m-by-n ( m<=n ) complex upper trapezoida to upper triangular form by means of unitary transformations. |
| PCLATTRS PCLATTRS dimension ( 2*desct(lld_) ) additional workspace may be required if PCLATTRS is update |
| PCLAUU2 PCLAUU2 PCLAUU2 computes the product u * u' or l' * l, where the triangula the matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). |
| PCLAUUM PCLAUUM PCLAUUM computes the product u * u' or l' * l, where the triangula the distributed matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). |
| PCLAWIL PCLAWIL PCLAWIL gets the transform given by h44,h33, & h43h34 into |
| PCMAX1 PCMAX1 PCMAX1 computes the global index of the maximum element in absolut in indx and the value is returned in amax, |
| PCPBSV PCPBSV PCPBSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PCPBTRF PCPBTRF see PCPBTRF and pcpbtrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PCPBTRF banded symmetric positive definite distributed |
| PCPBTRS PCPBTRS see pcpbtrf and PCPBTRS for details ===================================================================== PCPBTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PCPOCON PCPOCON PCPOCON estimates the reciprocal of the condition number (in th using the cholesky factorization a = u**h*u or a = l*l**h computed by lwork is local input and must be at least lwork = max( PCPOCON( lwork ), pcporfs( lwork ) lwork = 3*desca( lld_ ) |
| PCPOEQU PCPOEQU PCPOEQU computes row and column scalings intended t sub( a ) = a(ia:ia+n-1,ja:ja+n-1) and reduce its condition number |
| PCPORFS PCPORFS PCPORFS improves the computed solution to a system of linea and provides error bounds and backward error estimates for the lwork is local input and must be at least lwork = max( pcpocon( lwork ), PCPORFS( lwork ) lwork = 3*desca( lld_ ) |
| PCPOSV PCPOSV PCPOSV computes the solution to a complex system of linear equation sub( a ) * x = sub( b ), |
| PCPOSVX PCPOSVX PCPOSVX uses the cholesky factorization a = u**h*u or a = l*l**h t |
| PCPOTF2 PCPOTF2 PCPOTF2 computes the cholesky factorization of a complex hermitia |
| PCPOTRF PCPOTRF sub( b ) must have been previously factorized as u**h*u or l*l**h by PCPOTRF notes sub( b ) must have been previously factorized as u**h*u or l*l**h by PCPOTRF notes sub( b ) must have been previously factorized as u**h*u or l*l**h by PCPOTRF notes using the cholesky factorization a = u**h*u or a = l*l**h computed by PCPOTRF an estimate is obtained for norm(inv(a(ia:ia+n-1,ja:ja+n-1))), and cholesky factorization sub( a ) = l*l**h or u**h*u, as computed by PCPOTRF iaf (global input) integer PCPOTRF computes the cholesky factorization of an n-by-n comple a(ia:ia+n-1, ja:ja+n-1). cholesky factorization sub( a ) = u**h*u or l*l**h computed by PCPOTRF notes hermitian positive definite distributed matrix using the cholesky factorization sub( a ) = u**h*u or l*l**h computed by PCPOTRF |
| PCPOTRI PCPOTRI PCPOTRI computes the inverse of a complex hermitian positive definit cholesky factorization sub( a ) = u**h*u or l*l**h computed by |
| PCPOTRS PCPOTRS PCPOTRS solves a system of linear equation sub( a ) * x = sub( b ) |
| PCPTSV PCPTSV PCPTSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PCPTTRF PCPTTRF see PCPTTRF and pcpttrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PCPTTRF tridiagonal symmetric positive definite distributed |
| PCPTTRS PCPTTRS see pcpttrf and PCPTTRS for details ===================================================================== PCPTTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PCSRSCL PCSRSCL PCSRSCL multiplies an n-element complex distributed vecto underflow as long as the final sub( x )/a does not overflow or |
| PCSTEBZ PCSTEBZ computed is returned in nz. if (mod(info/8,2).ne.0), then PCSTEBZ failed to comput send e-mail to scalapack@cs.utk.edu computed is returned in nz. if (mod(info/8,2).ne.0), then PCSTEBZ failed t send e-mail to scalapack@cs.utk.edu |
| PCSTEIN PCSTEIN performance. in the limit (i.e. clustersize = n-1) PCSTEIN will perform no better than cstein on for clustersize = n/sqrt(nprow*npcol) reorthogonalizing performance. in the limit (i.e. clustersize = n-1) PCSTEIN will perform no better than cstein on 1 processor all eigenvectors will increase the total execution time PCSTEIN computes the eigenvectors of a symmetric tridiagonal matri correspond to user specified eigenvalues. pcstein does not |
| PCTRCON PCTRCON PCTRCON estimates the reciprocal of the condition number of 1-norm or the infinity-norm. |
| PCTREVC PCTREVC PCTREVC computes some or all of the right and/or left eigenvectors o |
| PCTRRFS PCTRRFS PCTRRFS provides error bounds and backward error estimates for th coefficient matrix. |
| PCTRSV PCTRSV compute a bound on the computed solution vector to see if the level 2 pblas routine PCTRSV can be used been implemented in pclattrs which is called by this routine to solve the triangular systems. pclattrs just calls PCTRSV each eigenvector is normalized so that the element of largest |
| PCTRTI2 PCTRTI2 PCTRTI2 computes the inverse of a complex upper or lower triangula contained in one and only one process memory space (local operation). |
| PCTRTRI PCTRTRI PCTRTRI computes the inverse of a upper or lower triangula |
| PCTRTRS PCTRTRS the solution matrix x must be computed by PCTRTRS or some othe refinement because doing so cannot improve the backward error. PCTRTRS solves a triangular system of the for sub( a ) * x = sub( b ) or sub( a )**t * x = sub( b ) or |
| PCTZRZF PCTZRZF q is a product of k elementary reflectors as returned by PCTZRZF currently, only storev = 'r' and direct = 'b' are supported. h of order > n, which is defined as a product of k elementary reflectors as returned by PCTZRZF if direct = 'f', h = h(1) h(2) . . . h(k) and t is upper triangular; PCTZRZF reduces the m-by-n ( m<=n ) complex upper trapezoidal matri of unitary transformations. as returned by PCTZRZF. q is of order m if side = 'l' and of order as returned by PCTZRZF. q is of order m if side = 'l' and of order |
| PCUNG2L PCUNG2L PCUNG2L generates an m-by-n complex distributed matrix q denotin the last n columns of a product of k elementary reflectors of order m |
| PCUNG2R PCUNG2R PCUNG2R generates an m-by-n complex distributed matrix q denotin the first n columns of a product of k elementary reflectors of order |
| PCUNGL2 PCUNGL2 PCUNGL2 generates an m-by-n complex distributed matrix q denotin the first m rows of a product of k elementary reflectors of order n |
| PCUNGLQ PCUNGLQ PCUNGLQ generates an m-by-n complex distributed matrix q denotin the first m rows of a product of k elementary reflectors of order n |
| PCUNGQL PCUNGQL PCUNGQL generates an m-by-n complex distributed matrix q denotin the last n columns of a product of k elementary reflectors of order m |
| PCUNGQR PCUNGQR a(ia+i:ia+n-1,ja+i-1), and taua in taua(ja+i-1). to form q explicitly, use scalapack subroutine PCUNGQR b(ib+i:ib+p-1,jb+i-1), and taub in taub(jb+i-1). to form z explicitly, use scalapack subroutine PCUNGQR PCUNGQR generates an m-by-n complex distributed matrix q denotin the first n columns of a product of k elementary reflectors of order |
| PCUNGR2 PCUNGR2 PCUNGR2 generates an m-by-n complex distributed matrix q denotin last m rows of a product of k elementary reflectors of order n |
| PCUNGRQ PCUNGRQ exit in b(ib+n-k+i-1,jb:jb+p-k+i-2), and taub in taub(ib+n-k+i-1). to form z explicitly, use scalapack subroutine PCUNGRQ exit in a(ia+m-k+i-1,ja:ja+n-k+i-2), and taua in taua(ia+m-k+i-1). to form q explicitly, use scalapack subroutine PCUNGRQ PCUNGRQ generates an m-by-n complex distributed matrix q denotin last m rows of a product of k elementary reflectors of order n |
| PCUNM2L PCUNM2L PCUNM2L overwrites the general complex m-by-n distributed matri |
| PCUNM2R PCUNM2R PCUNM2R overwrites the general complex m-by-n distributed matri |
| PCUNMBR PCUNMBR to the workspace required for the subprograms cbdsqr, PCUNMBR(qln), and pcunmbr(prt), where qln and prt are th to pcunmbr. nru is equal to the local number of rows of if vect = 'q', PCUNMBR overwrites the general complex distribute |
| PCUNMHR PCUNMHR PCUNMHR overwrites the general complex m-by-n distributed matri |
| PCUNML2 PCUNML2 PCUNML2 overwrites the general complex m-by-n distributed matri |
| PCUNMLQ PCUNMLQ PCUNMLQ overwrites the general complex m-by-n distributed matri |
| PCUNMQL PCUNMQL PCUNMQL overwrites the general complex m-by-n distributed matri |
| PCUNMQR PCUNMQR to form q explicitly, use scalapack subroutine pcungqr. to use q to update another matrix, use scalapack subroutine PCUNMQR the matrix z is represented as a product of elementary reflectors to form z explicitly, use scalapack subroutine pcungqr. to use z to update another matrix, use scalapack subroutine PCUNMQR alignment requirements PCUNMQR overwrites the general complex m-by-n distributed matri |
| PCUNMR2 PCUNMR2 PCUNMR2 overwrites the general complex m-by-n distributed matri |
| PCUNMR3 PCUNMR3 PCUNMR3 overwrites the general complex m-by-n distributed matri |
| PCUNMRQ PCUNMRQ to form z explicitly, use scalapack subroutine pcungrq. to use z to update another matrix, use scalapack subroutine PCUNMRQ alignment requirements to form q explicitly, use scalapack subroutine pcungrq. to use q to update another matrix, use scalapack subroutine PCUNMRQ the matrix z is represented as a product of elementary reflectors PCUNMRQ overwrites the general complex m-by-n distributed matri |
| PCUNMRZ PCUNMRZ PCUNMRZ overwrites the general complex m-by-n distributed matri |
| PCUNMTR PCUNMTR PCUNMTR overwrites the general complex m-by-n distributed matri |
| PDDBSV PDDBSV PDDBSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PDDBTRF PDDBTRF see PDDBTRF and pddbtrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PDDBTRF banded diagonally dominant-like distributed |
| PDDBTRS PDDBTRS see pddbtrf and PDDBTRS for details ===================================================================== PDDBTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PDDTSV PDDTSV PDDTSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PDDTTRF PDDTTRF see PDDTTRF and pddttrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PDDTTRF tridiagonal diagonally dominant-like distributed |
| PDDTTRS PDDTTRS see pddttrf and PDDTTRS for details ===================================================================== PDDTTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PDGBSV PDGBSV PDGBSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PDGBTRF PDGBTRF see PDGBTRF and pdgbtrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PDGBTRF banded distributed |
| PDGBTRS PDGBTRS see pdgbtrf and PDGBTRS for details ===================================================================== PDGBTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PDGEBD2 PDGEBD2 PDGEBD2 reduces a real general m-by-n distributed matri form b by an orthogonal transformation: q' * sub( a ) * p = b. |
| PDGEBRD PDGEBRD elementary reflector h(i) or g(i), which determines q or p, as returned by PDGEBRD in its array argument tauq or taup PDGEBRD reduces a real general m-by-n distributed matri form b by an orthogonal transformation: q' * sub( a ) * p = b. watobd = max(max(wpdlange,wPDGEBRD) this is an auxiliary routine called by PDGEBRD notes here q and p**t are the orthogonal distributed matrices determined by PDGEBRD when reducing a real distributed matrix a(ia:*,ja:*) t as products of elementary reflectors h(i) and g(i) respectively. elementary reflector h(i) or g(i), which determines q or p, as returned by PDGEBRD in its array argument tauq or taup elementary reflector h(i) or g(i), which determines q or p, as returned by PDGEBRD in its array argument tauq or taup |
| PDGECON PDGECON PDGECON estimates the reciprocal of the condition number of a genera or the infinity-norm, using the lu factorization computed by pdgetrf. lwork is local input and must be at least lwork = max( PDGECON( lwork ), pdgerfs( lwork ) |
| PDGEEQU PDGEEQU PDGEEQU computes row and column scalings intended to equilibrate a reduce its condition number. r returns the row scale factors and c |
| PDGEHD2 PDGEHD2 PDGEHD2 reduces a real general distributed matrix sub( a tion: q' * sub( a ) * q = h, where |
| PDGEHRD PDGEHRD PDGEHRD reduces a real general distributed matrix sub( a tion: q' * sub( a ) * q = h, where this is an auxiliary routine called by PDGEHRD. in the followin nq = m if side = 'l' and nq = n if side = 'r'. q is defined as the product of ihi-ilo elementary reflectors, as returned by PDGEHRD q = h(ilo) h(ilo+1) . . . h(ihi-1). |
| PDGELQ2 PDGELQ2 PDGELQ2 computes a lq factorization of a real distributed m-by- |
| PDGELQF PDGELQF PDGELQF computes a lq factorization of a real distributed m-by- if m < n, sub( a ) is overwritten by details of its lq factorization as returned by PDGELQF ia (global input) integer as returned by PDGELQF notes as returned by PDGELQF notes as returned by PDGELQF. q is of order m if side = 'l' and of order as returned by PDGELQF. q is of order m if side = 'l' and of order |
| PDGELS PDGELS PDGELS solves overdetermined or underdetermined real linea or its transpose, using a qr or lq factorization of sub( a ). it is |
| PDGEMR2D PDGEMR2D although all processes call PDGEMR2D, only the processes that ow first column of b receive data. the calls to dgebs2d/dgebr2d |
| PDGEQL2 PDGEQL2 PDGEQL2 computes a ql factorization of a real distributed m-by- |
| PDGEQLF PDGEQLF PDGEQLF computes a ql factorization of a real distributed m-by- as returned by PDGEQLF notes as returned by PDGEQLF notes as returned by PDGEQLF. q is of order m if side = 'l' and of order as returned by PDGEQLF. q is of order m if side = 'l' and of order |
| PDGEQPF PDGEQPF PDGEQPF computes a qr factorization with column pivoting of |
| PDGEQR2 PDGEQR2 PDGEQR2 computes a qr factorization of a real distributed m-by- |
| PDGEQRF PDGEQRF if m >= n, sub( a ) is overwritten by details of its qr factorization as returned by PDGEQRF factorization as returned by pdgelqf. PDGEQRF computes a qr factorization of a real distributed m-by- as returned by PDGEQRF notes as returned by PDGEQRF notes as returned by PDGEQRF. q is of order m if side = 'l' and of order as returned by PDGEQRF. q is of order m if side = 'l' and of order |
| PDGERFS PDGERFS PDGERFS improves the computed solution to a system of linea the solutions. lwork is local input and must be at least lwork = max( pdgecon( lwork ), PDGERFS( lwork ) |
| PDGERQ2 PDGERQ2 PDGERQ2 computes a rq factorization of a real distributed m-by- |
| PDGERQF PDGERQF PDGERQF computes a rq factorization of a real distributed m-by- as returned by PDGERQF notes as returned by PDGERQF notes as returned by PDGERQF. q is of order m if side = 'l' and of order as returned by PDGERQF. q is of order m if side = 'l' and of order |
| PDGESV PDGESV PDGESV computes the solution to a real system of linear equation sub( a ) * x = sub( b ), |
| PDGESVD PDGESVD PDGESVD computes the singular value decomposition (svd) of a singular vectors. the svd is written as |
| PDGESVX PDGESVX PDGESVX uses the lu factorization to compute the solution to a rea |
| PDGETF2 PDGETF2 PDGETF2 computes an lu factorization of a general m-by- partial pivoting with row interchanges. |
| PDGETRF PDGETRF distributed real matrix a(ia:ia+n-1,ja:ja+n-1), in either the 1-norm or the infinity-norm, using the lu factorization computed by PDGETRF an estimate is obtained for norm(inv(a(ia:ia+n-1,ja:ja+n-1))), and factors of the matrix sub( a ) = p * l * u as computed by PDGETRF iaf (global input) integer entry contains the factors l and u from the factorization a(ia:ia+n-1,ja:ja+n-1) = p*l*u as computed by PDGETRF equilibrated matrix a(ia:ia+n-1,ja:ja+n-1). PDGETRF computes an lu factorization of a general m-by-n distribute row interchanges. pdgetri computes the inverse of a distributed matrix using the lu factorization computed by PDGETRF. this method inverts u and the inva by solving the system inva*l = inv(u) for inva. with a general n-by-n distributed matrix sub( a ) using the lu factorization computed by PDGETRF sub( b ) denotes b(ib:ib+n-1,jb:jb+nrhs-1). |
| PDGETRI PDGETRI PDGETRI computes the inverse of a distributed matrix using the l computes the inverse of sub( a ) = a(ia:ia+n-1,ja:ja+n-1) denoted |
| PDGETRS PDGETRS PDGETRS solves a system of distributed linear equation op( sub( a ) ) * x = sub( b ) |
| PDGGQRF PDGGQRF PDGGQRF computes a generalized qr factorization o an n-by-p matrix sub( b ) = b(ib:ib+n-1,jb:jb+p-1): |
| PDGGRQF PDGGRQF PDGGRQF computes a generalized rq factorization o and a p-by-n matrix sub( b ) = b(ib:ib+p-1,jb:jb+n-1): |
| PDHEGST PDHEGST pdsyngst performs the same function as PDHEGST, but is based o triangular solves (the basis of pdsyngst). |
| PDHENGST PDHENGST pdsyngst calls pdhegst when uplo='u', hence PDHENGST provide |
| PDHETTRD PDHETTRD pdsyttrd is not intended to be called directly. all users are encourage to call pdsytrd which will then call PDHETTRD i the process grid must be square ( i.e. nprow = npcol ) and |
| PDLABAD PDLABAD PDLABAD takes as input the values computed by pdlamch for underflo the log of large is sufficiently large. this subroutine is intended |
| PDLABRD PDLABRD PDLABRD reduces the first nb rows and columns of a real genera or lower bidiagonal form by an orthogonal transformation q' * a * p, |
| PDLACON PDLACON PDLACON estimates the 1-norm of a square, real distributed matrix a x and v are aligned with the distributed matrix a, this information |
| PDLACONSB PDLACONSB PDLACONSB looks for two consecutive small subdiagonal elements b given by h44, h33, & h43h34 and see if this would make a look for two consecutive small subdiagonal elements: PDLACONSB is the routine that does this |
| PDLACP2 PDLACP2 PDLACP2 copies all or part of a distributed matrix a to anothe performs a local copy sub( a ) := sub( b ), where sub( a ) denotes |
| PDLACP3 PDLACP3 PDLACP3 is an auxiliary routine that copies from a global paralle the entire submatrix that is copied gets placed on one node or |
| PDLACPY PDLACPY PDLACPY copies all or part of a distributed matrix a to anothe performs a local copy sub( a ) := sub( b ), where sub( a ) denotes |
| PDLAEBZ PDLAEBZ PDLAEBZ contains the iteration loop which computes the eigenvalue j = 1,...,minp. it uses and computes the function n(w), which is will, in general, be reordered on output. see the comments in PDLAEBZ for more on the function n(w) nval (input/output) integer array, dimension (2*(kl-kf)) |
| PDLAECV PDLAECV PDLAECV checks if the input intervals [ intvl(2*i-1), intvl(2*i) ] pdlaecv modifies kf to be the index of the last converged interval, |
| PDLAED0 PDLAED0 PDLAED0 computes all eigenvalues and corresponding eigenvectors of |
| PDLAED1 PDLAED1 PDLAED1 computes the updated eigensystem of a diagona in parallel. nn (global output) integer, the order of matrix u, (PDLAED1) nn2 (global output) integer, the order of matrix q2, (pdlaed1). |
| PDLAED2 PDLAED2 secular equation problem is reduced by one. this stage is performed by the routine PDLAED2 the second stage consists of calculating the updated PDLAED2 sorts the two sets of eigenvalues together into a singl there are two ways in which deflation can occur: when two or more |
| PDLAED3 PDLAED3 eigenvalues. this is done by finding the roots of the secular equation via the routine slaed4 (as called by PDLAED3) problem. on exit, rho has been modified to the value required by PDLAED3 z (global input) double precision array, dimension (n) PDLAED3 finds the roots of the secular equation, as defined by th appropriate calls to slaed4 > 0: if info = 1 through n, the i(th) eigenvalue did not converge in PDLAED3 alignment requirements |
| PDLAEVSWP PDLAEVSWP PDLAEVSWP moves the eigenvectors (potentially unsorted) fro array, sorted so that the corresponding eigenvalues are sorted. |
| pdlahqr pdlahqr following differs in comparison to pdlahqr |
| PDLAHRD PDLAHRD PDLAHRD reduces the first nb columns of a real general n-by-(n-k+1 k-th subdiagonal are zero. the reduction is performed by an orthogo- |
| pdlaiect pdlaiect the appropriate slmake.inc file to include the compiler switch -dno_ieee. this switch only affects the compilation of pdlaiect.c arguments the appropriate slmake.inc file to include the compiler switch -dno_ieee. this switch only affects the compilation of pdlaiect.c arguments |
| PDLAMCH PDLAMCH pdlabad takes as input the values computed by PDLAMCH for underflo the log of large is sufficiently large. this subroutine is intended PDLAMCH determines double precision machine parameters arguments note : if eigenvectors are desired later by inverse iteration
( pdstein ), abstol should be set to 2*PDLAMCH('s')
d (global input) double precision array, dimension (n)
abstol (global input) double precision if jobz='v', setting abstol to PDLAMCH( context, 'u') yield abstol (global input) double precision if jobz='v', setting abstol to PDLAMCH( context, 'u') yield abstol (global input) double precision if jobz='v', setting abstol to PDLAMCH( context, 'u') yield abstol (global input) double precision if jobz='v', setting abstol to PDLAMCH( context, 'u') yield |
| PDLAMR1D PDLAMR1D PDLAMR1D has not been tested except withint the contect o |
| PDLANGE PDLANGE watobd = max(max(wPDLANGE,wpdgebrd) PDLANGE returns the value of the one norm, or the frobenius norm distributed matrix sub( a ) = a(ia:ia+m-1, ja:ja+n-1). |
| PDLAPDCT PDLAPDCT without overflow. see PDLAPDCT for the "paranoid" implementation of the stur PDLAPDCT counts the number of negative eigenvalues of (t - sigma i) the innermost loop to avoid overflow and determine the sign of a |
| PDLAPIV PDLAPIV PDLAPIV applies either p (permutation matrix indicated by ipiv sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column same mb (or nb) block. if you want to pivot a full matrix, use PDLAPIV notes |
| PDLAPV2 PDLAPV2 PDLAPV2 applies either p (permutation matrix indicated by ipiv a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the |
| PDLAQGE PDLAQGE PDLAQGE equilibrates a general m-by-n distributed matri factors in the vectors r and c. |
| PDLAQSY PDLAQSY PDLAQSY equilibrates a symmetric distributed matri vectors sr and sc. |
| PDLARED1D PDLARED1D where wpdlange, wPDLARED1D, wpdlared2d, wpdgebrd are th pdlange, pdlared1d, pdlared2d, pdgebrd. using the PDLARED1D redistributes a 1d arra it assumes that the input array, bycol, is distributed across workspaces required respectively for the subprograms pzlange, PDLARED1D, pdlared2d, pzgebrd. using th |
| PDLARED2D PDLARED2D watobd = max(max(wpdlange,wpdgebrd), max(wPDLARED2D,wp(pre)lared1d)) where wpdlange, wpdlared1d, wpdlared2d, wpdgebrd are the PDLARED2D redistributes a 1d arra it assumes that the input array, byrow, is distributed across workspaces required respectively for the subprograms pzlange, pdlared1d, PDLARED2D, pzgebrd. using th |
| PDLARFB PDLARFB PDLARFB applies a real block reflector q or its transpose q**t to from the left or the right. |
| PDLARFG PDLARFG PDLARFG generates a real elementary reflector h of order n, suc |
| PDLARFT PDLARFT PDLARFT forms the triangular factor t of a real block reflector |
| PDLARZB PDLARZB PDLARZB applies a real block reflector q or its transpose q**t t from the left or the right. |
| PDLARZT PDLARZT PDLARZT forms the triangular factor t of a real block reflecto reflectors as returned by pdtzrzf. |
| PDLASCL PDLASCL PDLASCL multiplies the m-by-n real distributed matrix sub( a is done without over/underflow as long as the final result |
| PDLASE2 PDLASE2 PDLASE2 initializes an m-by-n distributed matrix sub( a ) denotin offdiagonals. pdlase2 requires that only dimension of the matrix |
| PDLASET PDLASET PDLASET initializes an m-by-n distributed matrix sub( a ) denotin offdiagonals. |
| PDLASMSUB PDLASMSUB PDLASMSUB looks for a small subdiagonal element from the botto |
| PDLASRT PDLASRT PDLASRT sort the numbers in d in increasing order and th |
| PDLASSQ PDLASSQ PDLASSQ returns the values scl and smsq such tha ( scl**2 )*smsq = x( 1 )**2 +...+ x( n )**2 + ( scale**2 )*sumsq, |
| PDLASWP PDLASWP PDLASWP performs a series of row or column interchanges o interchange is initiated for each of rows or columns k1 trough k2 of |
| PDLATRA PDLATRA PDLATRA computes the trace of an n-by-n distributed matrix sub( a process of the grid. |
| PDLATRD PDLATRD PDLATRD reduces nb rows and columns of a real symmetric distribute form by an orthogonal similarity transformation q' * sub( a ) * q, |
| PDLATRZ PDLATRZ PDLATRZ reduces the m-by-n ( m<=n ) real upper trapezoidal matri upper triangular form by means of orthogonal transformations. |
| PDLAUU2 PDLAUU2 PDLAUU2 computes the product u * u' or l' * l, where the triangula the matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). |
| PDLAUUM PDLAUUM PDLAUUM computes the product u * u' or l' * l, where the triangula the distributed matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). |
| PDLAWIL PDLAWIL PDLAWIL gets the transform given by h44,h33, & h43h34 into |
| PDORG2L PDORG2L PDORG2L generates an m-by-n real distributed matrix q denotin the last n columns of a product of k elementary reflectors of order m |
| PDORG2R PDORG2R PDORG2R generates an m-by-n real distributed matrix q denotin the first n columns of a product of k elementary reflectors of order |
| PDORGL2 PDORGL2 PDORGL2 generates an m-by-n real distributed matrix q denotin the first m rows of a product of k elementary reflectors of order n |
| PDORGLQ PDORGLQ PDORGLQ generates an m-by-n real distributed matrix q denotin the first m rows of a product of k elementary reflectors of order n |
| PDORGQL PDORGQL PDORGQL generates an m-by-n real distributed matrix q denotin the last n columns of a product of k elementary reflectors of order m |
| PDORGQR PDORGQR a(ia+i:ia+n-1,ja+i-1), and taua in taua(ja+i-1). to form q explicitly, use scalapack subroutine PDORGQR b(ib+i:ib+p-1,jb+i-1), and taub in taub(jb+i-1). to form z explicitly, use scalapack subroutine PDORGQR PDORGQR generates an m-by-n real distributed matrix q denotin the first n columns of a product of k elementary reflectors of order |
| PDORGR2 PDORGR2 PDORGR2 generates an m-by-n real distributed matrix q denotin last m rows of a product of k elementary reflectors of order n |
| PDORGRQ PDORGRQ b(ib+n-k+i-1,jb:jb+p-k+i-2), and taub in taub(ib+n-k+i-1). to form z explicitly, use scalapack subroutine PDORGRQ a(ia+m-k+i-1,ja:ja+n-k+i-2), and taua in taua(ia+m-k+i-1). to form q explicitly, use scalapack subroutine PDORGRQ PDORGRQ generates an m-by-n real distributed matrix q denotin last m rows of a product of k elementary reflectors of order n |
| PDORM2L PDORM2L PDORM2L overwrites the general real m-by-n distributed matri |
| PDORM2R PDORM2R PDORM2R overwrites the general real m-by-n distributed matri |
| PDORMBR PDORMBR max(wdbdsqr, max(wantu*wPDORMBRqln, wantvt*wpdormbrprt)) where if vect = 'q', PDORMBR overwrites the general real distributed m-by- |
| PDORMHR PDORMHR PDORMHR overwrites the general real m-by-n distributed matri |
| PDORML2 PDORML2 PDORML2 overwrites the general real m-by-n distributed matri |
| PDORMLQ PDORMLQ PDORMLQ overwrites the general real m-by-n distributed matri |
| PDORMQL PDORMQL PDORMQL overwrites the general real m-by-n distributed matri |
| PDORMQR PDORMQR to form q explicitly, use scalapack subroutine pdorgqr. to use q to update another matrix, use scalapack subroutine PDORMQR the matrix z is represented as a product of elementary reflectors to form z explicitly, use scalapack subroutine pdorgqr. to use z to update another matrix, use scalapack subroutine PDORMQR alignment requirements PDORMQR overwrites the general real m-by-n distributed matri |
| PDORMR2 PDORMR2 PDORMR2 overwrites the general real m-by-n distributed matri |
| PDORMR3 PDORMR3 PDORMR3 overwrites the general real m-by-n distributed matri |
| PDORMRQ PDORMRQ to form z explicitly, use scalapack subroutine pdorgrq. to use z to update another matrix, use scalapack subroutine PDORMRQ alignment requirements to form q explicitly, use scalapack subroutine pdorgrq. to use q to update another matrix, use scalapack subroutine PDORMRQ the matrix z is represented as a product of elementary reflectors PDORMRQ overwrites the general real m-by-n distributed matri |
| PDORMRZ PDORMRZ PDORMRZ overwrites the general real m-by-n distributed matri |
| PDORMTR PDORMTR PDORMTR overwrites the general real m-by-n distributed matri ldc = max( 1, nrc ) sizemqrleft = the workspace requirement for PDORMTR |
| PDPBSV PDPBSV PDPBSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PDPBTRF PDPBTRF see PDPBTRF and pdpbtrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PDPBTRF banded symmetric positive definite distributed |
| PDPBTRS PDPBTRS see pdpbtrf and PDPBTRS for details ===================================================================== PDPBTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PDPOCON PDPOCON PDPOCON estimates the reciprocal of the condition number (in th using the cholesky factorization a = u**t*u or a = l*l**t computed by lwork is local input and must be at least lwork = max( PDPOCON( lwork ), pdporfs( lwork ) lwork = 3*desca( lld_ ) |
| PDPOEQU PDPOEQU PDPOEQU computes row and column scalings intended t sub( a ) = a(ia:ia+n-1,ja:ja+n-1) and reduce its condition number |
| PDPORFS PDPORFS PDPORFS improves the computed solution to a system of linea and provides error bounds and backward error estimates for the lwork is local input and must be at least lwork = max( pdpocon( lwork ), PDPORFS( lwork ) lwork = 3*desca( lld_ ) |
| PDPOSV PDPOSV PDPOSV computes the solution to a real system of linear equation sub( a ) * x = sub( b ), |
| PDPOSVX PDPOSVX PDPOSVX uses the cholesky factorization a = u**t*u or a = l*l**t t |
| PDPOTF2 PDPOTF2 PDPOTF2 computes the cholesky factorization of a real symmetri |
| PDPOTRF PDPOTRF using the cholesky factorization a = u**t*u or a = l*l**t computed by PDPOTRF an estimate is obtained for norm(inv(a(ia:ia+n-1,ja:ja+n-1))), and cholesky factorization sub( a ) = l*l**t or u**t*u, as computed by PDPOTRF iaf (global input) integer PDPOTRF computes the cholesky factorization of an n-by-n rea a(ia:ia+n-1, ja:ja+n-1). cholesky factorization sub( a ) = u**t*u or l*l**t computed by PDPOTRF notes symmetric positive definite distributed matrix using the cholesky factorization sub( a ) = u**t*u or l*l**t computed by PDPOTRF sub( b ) must have been previously factorized as u**t*u or l*l**t by PDPOTRF notes sub( b ) must have been previously factorized as u**t*u or l*l**t by PDPOTRF notes sub( b ) must have been previously factorized as u**h*u or l*l**h by PDPOTRF notes |
| PDPOTRI PDPOTRI PDPOTRI computes the inverse of a real symmetric positive definit cholesky factorization sub( a ) = u**t*u or l*l**t computed by |
| PDPOTRS PDPOTRS PDPOTRS solves a system of linear equation sub( a ) * x = sub( b ) |
| PDPTSV PDPTSV PDPTSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PDPTTRF PDPTTRF see PDPTTRF and pdpttrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PDPTTRF tridiagonal symmetric positive definite distributed |
| PDPTTRS PDPTTRS see pdpttrf and PDPTTRS for details ===================================================================== PDPTTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PDPTTRSV PDPTTRSV end of PDPTTRSV |
| PDRSCL PDRSCL PDRSCL multiplies an n-element real distributed vector sub( x ) b long as the final result sub( x )/a does not overflow or underflow. |
| PDSTEBZ PDSTEBZ PDSTEBZ computes the eigenvalues of a symmetric tridiagonal matrix i the interval [vl, vu], or the eigenvalues indexed il through iu. a from smallest to largest within the block (the output array w from PDSTEBZ with order='b' is expected here). thi on output, the first m elements contain the input computed is returned in nz. if (mod(info/8,2).ne.0), then PDSTEBZ failed to comput send e-mail to scalapack@cs.utk.edu computed is returned in nz. if (mod(info/8,2).ne.0), then PDSTEBZ failed t send e-mail to scalapack@cs.utk.edu from smallest to largest within the block (the output array w from PDSTEBZ with order='b' is expected here). thi on output, the first m elements contain the input |
| PDSTEDC PDSTEDC ======= PDSTEDC computes all eigenvalues and eigenvectors of conquer algorithm. |
| PDSTEIN PDSTEIN note : if eigenvectors are desired later by inverse iteration
( PDSTEIN ), abstol should be set to 2*pdlamch('s')
d (global input) double precision array, dimension (n)
PDSTEIN computes the eigenvectors of a symmetric tridiagonal matri correspond to user specified eigenvalues. pdstein does not performance. in the limit (i.e. clustersize = n-1) PDSTEIN will perform no better than dstein on for clustersize = n/sqrt(nprow*npcol) reorthogonalizing performance. in the limit (i.e. clustersize = n-1) PDSTEIN will perform no better than dstein on 1 processor all eigenvectors will increase the total execution time |
| PDSYEV PDSYEV PDSYEV computes all eigenvalues and, optionally, eigenvector of scalapack routines. |
| PDSYEVD PDSYEVD PDSYEVD computes all the eigenvalues and eigenvector of scalapack routines. |
| PDSYEVX PDSYEVX PDSYEVX computes selected eigenvalues and, optionally, eigenvector of scalapack routines. eigenvalues/vectors can be selected by |
| PDSYGS2 PDSYGS2 PDSYGS2 reduces a real symmetric-definite generalized eigenproble |
| PDSYGST PDSYGST PDSYGST reduces a real symmetric-definite generalized eigenproble |
| PDSYGVX PDSYGVX PDSYGVX computes all the eigenvalues, and optionally of a real generalized sy-definite eigenproblem, of the form |
| PDSYNGST PDSYNGST PDSYNGST reduces a complex hermitian-definite generalize |
| PDSYNTRD PDSYNTRD PDSYNTRD is a prototype version of pdsytrd which uses tailore when the workspace provided by the user is adequate. |
| PDSYPTRD PDSYPTRD pdlamr1d has not been tested except withint the contect of PDSYPTRD, the prototype reduction to tridiagonal form code purpose |
| PDSYTD2 PDSYTD2 PDSYTD2 reduces a real symmetric matrix sub( a ) to symmetri q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1). |
| PDSYTRD PDSYTRD this is an auxiliary routine called by PDSYTRD to redistribute d, this is an auxiliary routine called by PDSYTRD notes nq = m if side = 'l' and nq = n if side = 'r'. q is defined as the product of nq-1 elementary reflectors, as returned by PDSYTRD if uplo = 'u', q = h(nq-1) . . . h(2) h(1); where sizesytrd = the workspace requirement for PDSYTRD if eigenvectors are requested (jobz = 'v' ) then support for uplo='u' is limited to calling the old, slow, PDSYTRD PDSYTRD reduces a real symmetric matrix sub( a ) to symmetri q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1). pdsyttrd is not intended to be called directly. all users are encourage to call PDSYTRD which will then call pdhettrd i the process grid must be square ( i.e. nprow = npcol ) and |
| PDSYTTRD PDSYTTRD anb = pjlaenv( desca( ctxt_), 3, 'PDSYTTRD', 'l' sqnpc = int( sqrt( dble( nprow * npcol ) ) ) anb = pjlaenv( desca( ctxt_), 3, 'PDSYTTRD', 'l' sqnpc = int( sqrt( dble( nprow * npcol ) ) ) pdsyntrd is a prototype version of pdsytrd which uses tailored codes (either the serial, dsytrd, or the parallel code, PDSYTTRD PDSYTTRD reduces a complex hermitian matrix sub( a ) to hermitia q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1). |
| PDTRCON PDTRCON PDTRCON estimates the reciprocal of the condition number of 1-norm or the infinity-norm. |
| PDTRRFS PDTRRFS PDTRRFS provides error bounds and backward error estimates for th coefficient matrix. |
| PDTRTI2 PDTRTI2 PDTRTI2 computes the inverse of a real upper or lower triangula contained in one and only one process memory space (local operation). |
| PDTRTRI PDTRTRI PDTRTRI computes the inverse of a upper or lower triangula |
| PDTRTRS PDTRTRS the solution matrix x must be computed by PDTRTRS or some othe refinement because doing so cannot improve the backward error. PDTRTRS solves a triangular system of the for sub( a ) * x = sub( b ) or sub( a )**t * x = sub( b ), |
| PDTZRZF PDTZRZF q is a product of k elementary reflectors as returned by PDTZRZF currently, only storev = 'r' and direct = 'b' are supported. h of order > n, which is defined as a product of k elementary reflectors as returned by PDTZRZF if direct = 'f', h = h(1) h(2) . . . h(k) and t is upper triangular; as returned by PDTZRZF. q is of order m if side = 'l' and of order as returned by PDTZRZF. q is of order m if side = 'l' and of order PDTZRZF reduces the m-by-n ( m<=n ) real upper trapezoidal matri of orthogonal transformations. |
| PDZASUM PDZASUM based on PDZASUM from the level 1 pblas. the change i |
| PDZSUM1 PDZSUM1 PDZSUM1 returns the sum of absolute values of a comple |
| per per adjust addressing into matrix space to properly get int locq( n ) = numroc( n, nb_a, mycol, csrc_a, npcol ). an upper bound for these quantities may be computed by locq( n ) <= ceil( ceil(n/nb_a)/npcol )*nb_a locc( n ) = numroc( n, nb_a, mycol, csrc_a, npcol ). an upper bound for these quantities may be computed by locc( n ) <= ceil( ceil(n/nb_a)/npcol )*nb_a adjust addressing into matrix space to properly get int adjust addressing into matrix space to properly get int locc( n ) = numroc( n, nb_a, mycol, csrc_a, npcol ). an upper bound for these quantities may be computed by locc( n ) <= ceil( ceil(n/nb_a)/npcol )*nb_a adjust addressing into matrix space to properly get int locc( n ) = numroc( n, nb_a, mycol, csrc_a, npcol ). an upper bound for these quantities may be computed by locc( n ) <= ceil( ceil(n/nb_a)/npcol )*nb_a locc( n ) = numroc( n, nb_a, mycol, csrc_a, npcol ). an upper bound for these quantities may be computed by locc( n ) <= ceil( ceil(n/nb_a)/npcol )*nb_a locq( n ) = numroc( n, nb_a, mycol, csrc_a, npcol ). an upper bound for these quantities may be computed by locq( n ) <= ceil( ceil(n/nb_a)/npcol )*nb_a adjust addressing into matrix space to properly get int locc( n ) = numroc( n, nb_a, mycol, csrc_a, npcol ). an upper bound for these quantities may be computed by locc( n ) <= ceil( ceil(n/nb_a)/npcol )*nb_a adjust addressing into matrix space to properly get int locc( n ) = numroc( n, nb_a, mycol, csrc_a, npcol ). an upper bound for these quantities may be computed by locc( n ) <= ceil( ceil(n/nb_a)/npcol )*nb_a locc( n ) = numroc( n, nb_a, mycol, csrc_a, npcol ). an upper bound for these quantities may be computed by locc( n ) <= ceil( ceil(n/nb_a)/npcol )*nb_a locq( n ) = numroc( n, nb_a, mycol, csrc_a, npcol ). an upper bound for these quantities may be computed by locq( n ) <= ceil( ceil(n/nb_a)/npcol )*nb_a adjust addressing into matrix space to properly get int locq( n ) = numroc( n, nb_a, mycol, csrc_a, npcol ). an upper bound for these quantities may be computed by locq( n ) <= ceil( ceil(n/nb_a)/npcol )*nb_a locc( n ) = numroc( n, nb_a, mycol, csrc_a, npcol ). an upper bound for these quantities may be computed by locc( n ) <= ceil( ceil(n/nb_a)/npcol )*nb_a adjust addressing into matrix space to properly get int |
| perfectly perfectly same processes. these conditions ensure that sub( a ) and sub( af ) (resp. sub( x ) and sub( b ) ) are "perfectly" aligned moreover, this routine requires the distributed submatrices sub( a ), same processes. these conditions ensure that sub( a ) and sub( af ) (resp. sub( x ) and sub( b ) ) are "perfectly" aligned moreover, this routine requires the distributed submatrices sub( a ), distributed the same way on the same processes. these conditions ensure that sub( x ) and sub( b ) are "perfectly" aligned moreover, this routine requires the distributed submatrices sub( a ), same processes. these conditions ensure that sub( a ) and sub( af ) (resp. sub( x ) and sub( b ) ) are "perfectly" aligned moreover, this routine requires the distributed submatrices sub( a ), same processes. these conditions ensure that sub( a ) and sub( af ) (resp. sub( x ) and sub( b ) ) are "perfectly" aligned moreover, this routine requires the distributed submatrices sub( a ), distributed the same way on the same processes. these conditions ensure that sub( x ) and sub( b ) are "perfectly" aligned moreover, this routine requires the distributed submatrices sub( a ), same processes. these conditions ensure that sub( a ) and sub( af ) (resp. sub( x ) and sub( b ) ) are "perfectly" aligned moreover, this routine requires the distributed submatrices sub( a ), same processes. these conditions ensure that sub( a ) and sub( af ) (resp. sub( x ) and sub( b ) ) are "perfectly" aligned moreover, this routine requires the distributed submatrices sub( a ), distributed the same way on the same processes. these conditions ensure that sub( x ) and sub( b ) are "perfectly" aligned moreover, this routine requires the distributed submatrices sub( a ), same processes. these conditions ensure that sub( a ) and sub( af ) (resp. sub( x ) and sub( b ) ) are "perfectly" aligned moreover, this routine requires the distributed submatrices sub( a ), same processes. these conditions ensure that sub( a ) and sub( af ) (resp. sub( x ) and sub( b ) ) are "perfectly" aligned moreover, this routine requires the distributed submatrices sub( a ), distributed the same way on the same processes. these conditions ensure that sub( x ) and sub( b ) are "perfectly" aligned moreover, this routine requires the distributed submatrices sub( a ), |
| Perform Perform Perform qr iterations on rows and columns ilo to i until subdiagonal element has become negligible.
Perform the triangular system solve {l_i}{{bu'}_i} = {b_i
Perform the triangular solve {u_i}^c{bl'}_i^c = {bl_i}^
work(1) returns workspace adequate workspace to allow optimal Performance lwork (local input) integer for optimal Performance, greater workspace is needed, i.e nhegst_lwopt ) Perform qr iterations on rows and columns ilo to i until subdiagonal element has become negligible. Perform the global scaled su Perform the local computation within a process colum Perform the local computation within a process colum Perform the local computation within a process colum Perform the local computation within a process colum if the scaling needed for a in the dot product is 1, call pcdotu to Perform the dot product
Perform the triangular system solve {l_i}{{b'}_i}^c = {b_i}^
Perform the triangular system solve {l_i}{{b'}_i}^c = {b_i}^
Perform the triangular system solve {l_i}{{bu'}_i} = {b_i
Perform the triangular solve {u_i}^t{bl'}_i^t = {bl_i}^
interval with a desired value of n(w). = 2 : Perform bisection iteration to find eigenvalues of t n (input) integer Perform qr iterations on rows and columns ilo to i until subdiagonal element has become negligible. Perform the global scaled su Perform the local computation within a process colum Perform the local computation within a process colum
Perform the triangular system solve {l_i}{{b'}_i}^t = {b_i}^
Perform the triangular system solve {l_i}{{b'}_i}^t = {b_i}^
if you want to guarantee orthogonality (at the cost of potentially poor Performance) you should ad (clustersize-1)*n if you want to guarantee orthogonality (at the cost of potentially poor Performance) you should ad (clustersize-1)*n
Perform the triangular system solve {l_i}{{bu'}_i} = {b_i
Perform the triangular solve {u_i}^t{bl'}_i^t = {bl_i}^
interval with a desired value of n(w). = 2 : Perform bisection iteration to find eigenvalues of t n (input) integer Perform qr iterations on rows and columns ilo to i until subdiagonal element has become negligible. Perform the global scaled su Perform the local computation within a process colum Perform the local computation within a process colum
Perform the triangular system solve {l_i}{{b'}_i}^t = {b_i}^
Perform the triangular system solve {l_i}{{b'}_i}^t = {b_i}^
if you want to guarantee orthogonality (at the cost of potentially poor Performance) you should ad (clustersize-1)*n if you want to guarantee orthogonality (at the cost of potentially poor Performance) you should ad (clustersize-1)*n
Perform the triangular system solve {l_i}{{bu'}_i} = {b_i
Perform the triangular solve {u_i}^c{bl'}_i^c = {bl_i}^
work(1) returns workspace adequate workspace to allow optimal Performance lwork (local input) integer for optimal Performance, greater workspace is needed, i.e nhegst_lwopt ) Perform qr iterations on rows and columns ilo to i until subdiagonal element has become negligible. Perform the global scaled su Perform the local computation within a process colum Perform the local computation within a process colum Perform the local computation within a process colum Perform the local computation within a process colum if the scaling needed for a in the dot product is 1, call pzdotu to Perform the dot product
Perform the triangular system solve {l_i}{{b'}_i}^c = {b_i}^
Perform the triangular system solve {l_i}{{b'}_i}^c = {b_i}^
Perform qr iterations on rows and columns ilo to i until subdiagonal element has become negligible. |
| performance performance work(1) returns workspace adequate workspace to allow optimal performance lwork (local input) integer for optimal performance, greater workspace is needed, i.e nhegst_lwopt ) pchengst calls pchegst when uplo='u', hence pchengst provides improved performance only when uplo='l', ibtype=1 pchengst also calls pchegst when insufficient workspace is the tailored codes provide performance that is essentiall assumed to be interleaved in memory for better cache performance. the diagonal entries of t are in the entrie entries are d(2),d(4),...,d(2*n-2). to avoid overflow, the assumed to be interleaved in memory for better cache performance. the diagonal entries of t are in the entrie entries are d(2),d(4),...,d(2*n-2). to avoid overflow, the if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n pdsyngst calls pdhegst when uplo='u', hence pdhengst provides improved performance only when uplo='l', ibtype=1 pdsyngst also calls pdhegst when insufficient workspace is the tailored codes provide performance that is essentiall this version provides a set of parameters which should give good, but not optimal, performance on many of the currently availabl the tuning parameters for their particular machine using the option assumed to be interleaved in memory for better cache performance. the diagonal entries of t are in the entrie entries are d(2),d(4),...,d(2*n-2). to avoid overflow, the assumed to be interleaved in memory for better cache performance. the diagonal entries of t are in the entrie entries are d(2),d(4),...,d(2*n-2). to avoid overflow, the if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n pssyngst calls pshegst when uplo='u', hence pshengst provides improved performance only when uplo='l', ibtype=1 pssyngst also calls pshegst when insufficient workspace is the tailored codes provide performance that is essentiall work(1) returns workspace adequate workspace to allow optimal performance lwork (local input) integer for optimal performance, greater workspace is needed, i.e nhegst_lwopt ) pzhengst calls pzhegst when uplo='u', hence pzhengst provides improved performance only when uplo='l', ibtype=1 pzhengst also calls pzhegst when insufficient workspace is the tailored codes provide performance that is essentiall |
| performed performed of the factorization. note that permutations are performed on the matrix, so tha by lapack. followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on of the factorization. note that permutations are performed on the matrix, so tha by lapack. in this case the loop over the levels will not be performed followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on the following steps are performed 1. if fact = 'e', real scaling factors are computed to equilibrate if lrwork is too small to compute all the eigenvectors requested, no computation is performed and info=-2 not know how many eigenvectors are requested until amount by which the eigenvalues should be scaled to compensate for the scaling performed in this routine returned here to allow for future enhancement. if lrwork is too small to compute all the eigenvectors requested, no computation is performed and info=-2 not know how many eigenvectors are requested until amount by which the eigenvalues should be scaled to compensate for the scaling performed in this routine returned here to allow for future enhancement. triangular portion of a is updated, av is computed as: tril(a) * v + v^t * tril(a,-1). this is performed a mvr2) followed by a transpose and a sum across the columns. pclacp2 copies all or part of a distributed matrix a to another distributed matrix b. no communication is performed, pclacp a(ia:ia+m-1,ja:ja+n-1) and sub( b ) denotes b(ib:ib+m-1,jb:jb+n-1). pclacpy copies all or part of a distributed matrix a to another distributed matrix b. no communication is performed, pclacp a(ia:ia+m-1,ja:ja+n-1) and sub( b ) denotes b(ib:ib+m-1,jb:jb+n-1). elements below the k-th subdiagonal are zero. the reduction is performed by an unitary similarity transformation q' * a * q. th reflector i - v*t*v', and also the matrix y = a * v * t. this is the unblocked form of the algorithm, calling level 2 blas. no communication is performed by this routine, the matrix to operat of the factorization. note that permutations are performed on the matrix, so tha by lapack. followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on the following steps are performed 1. if fact = 'e', real scaling factors are computed to equilibrate followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on of the factorization. note that permutations are performed on the matrix, so tha by lapack. followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on of the factorization. note that permutations are performed on the matrix, so tha by lapack. in this case the loop over the levels will not be performed followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on the following steps are performed 1. if fact = 'e', real scaling factors are computed to equilibrate pdlacp2 copies all or part of a distributed matrix a to another distributed matrix b. no communication is performed, pdlacp a(ia:ia+m-1,ja:ja+n-1) and sub( b ) denotes b(ib:ib+m-1,jb:jb+n-1). pdlacpy copies all or part of a distributed matrix a to another distributed matrix b. no communication is performed, pdlacp a(ia:ia+m-1,ja:ja+n-1) and sub( b ) denotes b(ib:ib+m-1,jb:jb+n-1). secular equation problem is reduced by one. this stage is performed by the routine pdlaed2 the second stage consists of calculating the updated distributed matrix a(ia:ia+n-1,ja:ja+n-k) so that elements below the k-th subdiagonal are zero. the reduction is performed by an orthogo matrices v and t which determine q as a block reflector i - v*t*v', this is the unblocked form of the algorithm, calling level 2 blas. no communication is performed by this routine, the matrix to operat of the factorization. note that permutations are performed on the matrix, so tha by lapack. followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on the following steps are performed 1. if fact = 'e', real scaling factors are computed to equilibrate followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on if lwork is too small to compute all the eigenvectors requested, no computation is performed and info=-2 not know how many eigenvectors are requested until amount by which the eigenvalues should be scaled to compensate for the scaling performed in this routine returned here to allow for future enhancement. if lwork is too small to compute all the eigenvectors requested, no computation is performed and info=-2 not know how many eigenvectors are requested until amount by which the eigenvalues should be scaled to compensate for the scaling performed in this routine returned here to allow for future enhancement. triangular portion of a is updated, av is computed as: tril(a) * v + v^t * tril(a,-1). this is performed a mvr2) followed by a transpose and a sum across the columns. of the factorization. note that permutations are performed on the matrix, so tha by lapack. followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on of the factorization. note that permutations are performed on the matrix, so tha by lapack. in this case the loop over the levels will not be performed followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on the following steps are performed 1. if fact = 'e', real scaling factors are computed to equilibrate pslacp2 copies all or part of a distributed matrix a to another distributed matrix b. no communication is performed, pslacp a(ia:ia+m-1,ja:ja+n-1) and sub( b ) denotes b(ib:ib+m-1,jb:jb+n-1). pslacpy copies all or part of a distributed matrix a to another distributed matrix b. no communication is performed, pslacp a(ia:ia+m-1,ja:ja+n-1) and sub( b ) denotes b(ib:ib+m-1,jb:jb+n-1). secular equation problem is reduced by one. this stage is performed by the routine pslaed2 the second stage consists of calculating the updated distributed matrix a(ia:ia+n-1,ja:ja+n-k) so that elements below the k-th subdiagonal are zero. the reduction is performed by an orthogo matrices v and t which determine q as a block reflector i - v*t*v', this is the unblocked form of the algorithm, calling level 2 blas. no communication is performed by this routine, the matrix to operat of the factorization. note that permutations are performed on the matrix, so tha by lapack. followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on the following steps are performed 1. if fact = 'e', real scaling factors are computed to equilibrate followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on if lwork is too small to compute all the eigenvectors requested, no computation is performed and info=-2 not know how many eigenvectors are requested until amount by which the eigenvalues should be scaled to compensate for the scaling performed in this routine returned here to allow for future enhancement. if lwork is too small to compute all the eigenvectors requested, no computation is performed and info=-2 not know how many eigenvectors are requested until amount by which the eigenvalues should be scaled to compensate for the scaling performed in this routine returned here to allow for future enhancement. triangular portion of a is updated, av is computed as: tril(a) * v + v^t * tril(a,-1). this is performed a mvr2) followed by a transpose and a sum across the columns. of the factorization. note that permutations are performed on the matrix, so tha by lapack. followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on of the factorization. note that permutations are performed on the matrix, so tha by lapack. in this case the loop over the levels will not be performed followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on the following steps are performed 1. if fact = 'e', real scaling factors are computed to equilibrate if lrwork is too small to compute all the eigenvectors requested, no computation is performed and info=-2 not know how many eigenvectors are requested until amount by which the eigenvalues should be scaled to compensate for the scaling performed in this routine returned here to allow for future enhancement. if lrwork is too small to compute all the eigenvectors requested, no computation is performed and info=-2 not know how many eigenvectors are requested until amount by which the eigenvalues should be scaled to compensate for the scaling performed in this routine returned here to allow for future enhancement. triangular portion of a is updated, av is computed as: tril(a) * v + v^t * tril(a,-1). this is performed a mvr2) followed by a transpose and a sum across the columns. pzlacp2 copies all or part of a distributed matrix a to another distributed matrix b. no communication is performed, pzlacp a(ia:ia+m-1,ja:ja+n-1) and sub( b ) denotes b(ib:ib+m-1,jb:jb+n-1). pzlacpy copies all or part of a distributed matrix a to another distributed matrix b. no communication is performed, pzlacp a(ia:ia+m-1,ja:ja+n-1) and sub( b ) denotes b(ib:ib+m-1,jb:jb+n-1). elements below the k-th subdiagonal are zero. the reduction is performed by an unitary similarity transformation q' * a * q. th reflector i - v*t*v', and also the matrix y = a * v * t. this is the unblocked form of the algorithm, calling level 2 blas. no communication is performed by this routine, the matrix to operat of the factorization. note that permutations are performed on the matrix, so tha by lapack. followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on the following steps are performed 1. if fact = 'e', real scaling factors are computed to equilibrate followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on followed by an analagous backsolve, both using the structure of the factored matrix, are performed for a linear system, a local backsubstitution is performed on |
| performs performs ctrmvt performs the matrix-vector operation x := conjg( t' ) *y, and w := t *z, dtrmvt performs the matrix-vector operation x := t' *y, and w := t *z, pchengst performs the same function as pchegst, but is based o triangular solves (the basis of pchengst). the above formula allows tau to be spread down in the same call to sgsum2d which performs the sum-to-all of c the computation of v, which could be performed in any processor distributed matrix b. no communication is performed, pclacp2 performs a local copy sub( a ) := sub( b ), where sub( a ) denote pclacp2 requires that only dimension of the matrix operands is distributed matrix b. no communication is performed, pclacpy performs a local copy sub( a ) := sub( b ), where sub( a ) denote pclaswp performs a series of row or column interchanges o interchange is initiated for each of rows or columns k1 trough k2 of in addition, this routine performs a global minimization and maximi distributed matrix b. no communication is performed, pdlacp2 performs a local copy sub( a ) := sub( b ), where sub( a ) denote pdlacp2 requires that only dimension of the matrix operands is distributed matrix b. no communication is performed, pdlacpy performs a local copy sub( a ) := sub( b ), where sub( a ) denote pdlaswp performs a series of row or column interchanges o interchange is initiated for each of rows or columns k1 trough k2 of pdsyngst performs the same function as pdhegst, but is based o triangular solves (the basis of pdsyngst). the above formula allows tau to be spread down in the same call to dgsum2d which performs the sum-to-all of c the computation of v, which could be performed in any processor in addition, this routine performs a global minimization and maximi distributed matrix b. no communication is performed, pslacp2 performs a local copy sub( a ) := sub( b ), where sub( a ) denote pslacp2 requires that only dimension of the matrix operands is distributed matrix b. no communication is performed, pslacpy performs a local copy sub( a ) := sub( b ), where sub( a ) denote pslaswp performs a series of row or column interchanges o interchange is initiated for each of rows or columns k1 trough k2 of pssyngst performs the same function as pshegst, but is based o triangular solves (the basis of pssyngst). the above formula allows tau to be spread down in the same call to sgsum2d which performs the sum-to-all of c the computation of v, which could be performed in any processor pzhengst performs the same function as pzhegst, but is based o triangular solves (the basis of pzhengst). the above formula allows tau to be spread down in the same call to dgsum2d which performs the sum-to-all of c the computation of v, which could be performed in any processor distributed matrix b. no communication is performed, pzlacp2 performs a local copy sub( a ) := sub( b ), where sub( a ) denote pzlacp2 requires that only dimension of the matrix operands is distributed matrix b. no communication is performed, pzlacpy performs a local copy sub( a ) := sub( b ), where sub( a ) denote pzlaswp performs a series of row or column interchanges o interchange is initiated for each of rows or columns k1 trough k2 of strmvt performs the matrix-vector operation x := t' *y, and w := t *z, ztrmvt performs the matrix-vector operation x := conjg( t' ) *y, and w := t *z, |
| permu permu the lu decomposition with partial pivoting and row interchanges is used to factor sub( a ) as sub( a ) = p * l * u, where p is a permu l and u are stored in sub( a ). the factored form of sub( a ) is then the lu decomposition with partial pivoting and row interchanges is used to factor sub( a ) as sub( a ) = p * l * u, where p is a permu l and u are stored in sub( a ). the factored form of sub( a ) is then the lu decomposition with partial pivoting and row interchanges is used to factor sub( a ) as sub( a ) = p * l * u, where p is a permu l and u are stored in sub( a ). the factored form of sub( a ) is then the lu decomposition with partial pivoting and row interchanges is used to factor sub( a ) as sub( a ) = p * l * u, where p is a permu l and u are stored in sub( a ). the factored form of sub( a ) is then |
| Permutation Permutation Permutation and forward elimination (triang. solve a = p * l * u, where p is a Permutation matrix, l is a unit lower triangula the factorization has the form sub( a ) = p * l * u, where p is a Permutation matrix, l is lower triangular with unit diagona (upper trapezoidal if m < n). the factorization has the form sub( a ) = p * l * u, where p is a Permutation matrix, l is lower triangular with unit diagonal ele (upper trapezoidal if m < n). l and u are stored in sub( a ). pclapiv applies either p (Permutation matrix indicated by ipiv sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pclapv2 applies either p (Permutation matrix indicated by ipiv a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the direc (global input) character specifies in which order the Permutation is applied = 'b' (backward) Permutation and forward elimination (triang. solve a = p * l * u, where p is a Permutation matrix, l is a unit lower triangula the factorization has the form sub( a ) = p * l * u, where p is a Permutation matrix, l is lower triangular with unit diagona (upper trapezoidal if m < n). the factorization has the form sub( a ) = p * l * u, where p is a Permutation matrix, l is lower triangular with unit diagonal ele (upper trapezoidal if m < n). l and u are stored in sub( a ). indx (workspace) integer array, dimension (n) the Permutation used to sort the contents of dlamda int indx (workspace) integer array, dimension (n) the Permutation used to sort the contents of dlamda int pdlapiv applies either p (Permutation matrix indicated by ipiv sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pdlapv2 applies either p (Permutation matrix indicated by ipiv a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the direc (global input) character specifies in which order the Permutation is applied = 'b' (backward) Permutation and forward elimination (triang. solve a = p * l * u, where p is a Permutation matrix, l is a unit lower triangula the factorization has the form sub( a ) = p * l * u, where p is a Permutation matrix, l is lower triangular with unit diagona (upper trapezoidal if m < n). the factorization has the form sub( a ) = p * l * u, where p is a Permutation matrix, l is lower triangular with unit diagonal ele (upper trapezoidal if m < n). l and u are stored in sub( a ). indx (workspace) integer array, dimension (n) the Permutation used to sort the contents of dlamda int indx (workspace) integer array, dimension (n) the Permutation used to sort the contents of dlamda int pslapiv applies either p (Permutation matrix indicated by ipiv sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pslapv2 applies either p (Permutation matrix indicated by ipiv a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the direc (global input) character specifies in which order the Permutation is applied = 'b' (backward) Permutation and forward elimination (triang. solve a = p * l * u, where p is a Permutation matrix, l is a unit lower triangula the factorization has the form sub( a ) = p * l * u, where p is a Permutation matrix, l is lower triangular with unit diagona (upper trapezoidal if m < n). the factorization has the form sub( a ) = p * l * u, where p is a Permutation matrix, l is lower triangular with unit diagonal ele (upper trapezoidal if m < n). l and u are stored in sub( a ). pzlapiv applies either p (Permutation matrix indicated by ipiv sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pzlapv2 applies either p (Permutation matrix indicated by ipiv a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the direc (global input) character specifies in which order the Permutation is applied = 'b' (backward) |
| permutations permutations of the factorization. note that permutations are performed on the matrix, so tha by lapack. of the factorization. note that permutations are performed on the matrix, so tha by lapack. of the factorization. note that permutations are performed on the matrix, so tha by lapack. of the factorization. note that permutations are performed on the matrix, so tha by lapack. of the factorization. note that permutations are performed on the matrix, so tha by lapack. of the factorization. note that permutations are performed on the matrix, so tha by lapack. of the factorization. note that permutations are performed on the matrix, so tha by lapack. of the factorization. note that permutations are performed on the matrix, so tha by lapack. of the factorization. note that permutations are performed on the matrix, so tha by lapack. of the factorization. note that permutations are performed on the matrix, so tha by lapack. of the factorization. note that permutations are performed on the matrix, so tha by lapack. of the factorization. note that permutations are performed on the matrix, so tha by lapack. |
| permuted permuted rowcol (global input) character*1 specifies if the rows or columns are to be permuted = 'c' columns will be permuted. rowcol (global input) character specifies if the rows or columns are to be permuted = 'c' columns will be permuted. rowcol (global input) character specifies if the rows or columns are permuted = 'c' (columns) rowcol (global input) character*1 specifies if the rows or columns are to be permuted = 'c' columns will be permuted. rowcol (global input) character specifies if the rows or columns are to be permuted = 'c' columns will be permuted. rowcol (global input) character specifies if the rows or columns are permuted = 'c' (columns) rowcol (global input) character*1 specifies if the rows or columns are to be permuted = 'c' columns will be permuted. rowcol (global input) character specifies if the rows or columns are to be permuted = 'c' columns will be permuted. rowcol (global input) character specifies if the rows or columns are permuted = 'c' (columns) rowcol (global input) character*1 specifies if the rows or columns are to be permuted = 'c' columns will be permuted. rowcol (global input) character specifies if the rows or columns are to be permuted = 'c' columns will be permuted. rowcol (global input) character specifies if the rows or columns are permuted = 'c' (columns) |
| perturbation perturbation if eigenvalues j and j-1 are too close, add a relatively small perturbation if eigenvalues j and j-1 are too close, add a relatively small perturbation |
| perturbed perturbed d (global input/output) double precision array, dimension (n) on entry,the eigenvalues of the rank-1-perturbed matrix d (global input/output) real array, dimension (n) on entry,the eigenvalues of the rank-1-perturbed matrix |
| Peter Peter and marbwus hegland, australian natonal university. feb., 1997. based on code written by : Peter arbenz, eth zurich, 1996 ===================================================================== and markus hegland, australian national university. feb., 1997. based on code written by : Peter arbenz, eth zurich, 1996 eth, zurich. and markus hegland, australian national university. feb., 1997. based on code written by : Peter arbenz, eth zurich, 1996 eth, zurich. and marbwus hegland, australian natonal university. feb., 1997. based on code written by : Peter arbenz, eth zurich, 1996 ===================================================================== |
| PHASE PHASE p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ***************************************** local computation PHASE p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ***************************************** local computation PHASE p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ***************************************** local computation PHASE p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ***************************************** local computation PHASE p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ***************************************** local computation PHASE p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ***************************************** local computation PHASE p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ***************************************** local computation PHASE p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ***************************************** local computation PHASE p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ***************************************** local computation PHASE p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ***************************************** local computation PHASE p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ***************************************** local computation PHASE p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ***************************************** local computation PHASE p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ***************************************** local computation PHASE p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ***************************************** local computation PHASE p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ***************************************** local computation PHASE p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ******************************************************************* PHASE 1: local computation phase p pieces with one stored on each processor, and then proceeds in 2 PHASEs for the factorization or 3 for th 1) local phase: ***************************************** local computation PHASE |
| phases phases p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: |
| physically physically the dimension of the array iwork used as workspace for physically transposing the pivots if nprow == npcol then the dimension of the array iwork used as workspace for physically transposing the pivots if nprow == npcol then the dimension of the array iwork used as workspace for physically transposing the pivots if nprow == npcol then the dimension of the array iwork used as workspace for physically transposing the pivots if nprow == npcol then |
| pictures pictures triangular matrix, stopping/starting at the diagonal, which is the point of reflection. the pictures below demonstrate this refered to as rowsums, and the column sums shown by | are refered triangular matrix, stopping/starting at the diagonal, which is the point of reflection. the pictures below demonstrate this refered to as rowsums, and the column sums shown by | are refered triangular matrix, stopping/starting at the diagonal, which is the point of reflection. the pictures below demonstrate this refered to as rowsums, and the column sums shown by | are refered triangular matrix, stopping/starting at the diagonal, which is the point of reflection. the pictures below demonstrate this refered to as rowsums, and the column sums shown by | are refered triangular matrix, stopping/starting at the diagonal, which is the point of reflection. the pictures below demonstrate this refered to as rowsums, and the column sums shown by | are refered triangular matrix, stopping/starting at the diagonal, which is the point of reflection. the pictures below demonstrate this refered to as rowsums, and the column sums shown by | are refered |
| piece piece lld_a >=(bwl+bwu+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(bwl+bwu+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions lld_a >=(2*bwl+2*bwu+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(2*bwl+2*bwu+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th interchanges will be applied. on exit, the local pieces local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this local array contains the local pieces of th interchanges will be applied. on exit, this array contains lld_a >=(bw+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(bw+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions lld_a >=(bwl+bwu+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(bwl+bwu+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions lld_a >=(2*bwl+2*bwu+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(2*bwl+2*bwu+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th interchanges will be applied. on exit, the local pieces local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this local array contains the local pieces of th interchanges will be applied. on exit, this array contains lld_a >=(bw+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(bw+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions lld_a >=(bwl+bwu+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(bwl+bwu+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions lld_a >=(2*bwl+2*bwu+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(2*bwl+2*bwu+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th interchanges will be applied. on exit, the local pieces local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this local array contains the local pieces of th interchanges will be applied. on exit, this array contains lld_a >=(bw+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(bw+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions lld_a >=(bwl+bwu+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(bwl+bwu+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions lld_a >=(2*bwl+2*bwu+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(2*bwl+2*bwu+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th interchanges will be applied. on exit, the local pieces local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this local array contains the local pieces of th interchanges will be applied. on exit, this array contains lld_a >=(bw+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(bw+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions |
| pieces pieces lld_a >=(bwl+bwu+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(bwl+bwu+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions lld_a >=(2*bwl+2*bwu+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(2*bwl+2*bwu+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th the diagonal and the first superdiagonal of sub( a ) are local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th the diagonal and the first superdiagonal of sub( a ) are to an array of dimension ( lld_a, locc(ja+n-1) ). on entry, this array contains the local pieces of the factors l and unit diagonal elements of l are not stored. to an array of dimension ( lld_a, locc(ja+n-1) ), the local pieces of the m-by-n distributed matrix whos local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of the n-by- the upper triangle and the first subdiagonal of sub( a ) are local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of the n-by- the upper triangle and the first subdiagonal of sub( a ) are local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and below the diagonal of sub( a ) contain the m by min(m,n) local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and below the diagonal of sub( a ) contain the m by min(m,n) (lld_b, locc(jb+nrhs-1)). on entry, this array contains the local pieces of the distributed matrix b of right hand sid sub( b ) is m-by-nrhs if trans='n', and n-by-nrhs otherwise. local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri lower triangle of the distributed submatrix local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri lower triangle of the distributed submatrix local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and above the diagonal of sub( a ) contain the min(m,n) by n local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and above the diagonal of sub( a ) contain the min(m,n) by n local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and above the diagonal of sub( a ) contain the min(m,n) by n memory to an array of local dimension (lld_a,locc(ja+n-1)). this array contains the local pieces of the distribute local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangle of a( ia:ia+m-1, ja+n-m:ja+n-1 ) contains the local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangle of a( ia:ia+m-1, ja+n-m:ja+n-1 ) contains the local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, the local pieces of the n-by-n distributed matri local pieces of the factors l and u from the factorization local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of the m-by- the local pieces of the factors l and u from the factoriza- local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of the m-by- array contains the local pieces of the factors l and u from local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, the local pieces of the l and u obtained by th exit, if info = 0, sub( a ) contains the inverse of the memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of the factor diagonal elements of l are not stored. local memory to an array of dimension (lld_a, locc(ja+m-1)). on entry, the local pieces of the n-by-m distributed matri and above the diagonal of sub( a ) contain the min(n,m) by m local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangle of a( ia:ia+m-1, ja+n-m:ja+n-1 ) contains the local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th the leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th the leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th the leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th the leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locq(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th the first nb rows and columns of the matrix are overwritten; to an array of dimension (lld_a, locc(ja+n-1) ). this array contains the local pieces of the distributed matrix sub( a to an array of dimension (lld_a, locc(ja+n-1) ). this array contains the local pieces of the distributed matrix sub( a locc(ja+n-k)). on entry, this array contains the the local pieces of the n-by-(n-k+1) general distributed matri the k-th subdiagonal in the first nb columns are overwritten to an array of dimension (lld_a, locc(ja+n-1)) containing the local pieces of the distributed matrix sub( a ) ia (global input) integer local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th interchanges will be applied. on exit, the local pieces local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this local array contains the local pieces of th interchanges will be applied. on exit, this array contains memory to an array of local dimension (lld_a,locc(ja+n-1)). on entry, the local pieces of the distributed symmetri triangular part of sub( a ) contains the upper triangular side = 'l', ( lld_v, locc(jv+n-1) ) if storev = 'r' and side = 'r'. it contains the local pieces of the distribute see further details. local memory to an array of dimension (lld_x,*). this array contains the local pieces of the distributed vector sub( x ) the vector x. on exit, it is overwritten with the vector v. (lld_v, locc(jv+n-1)) if side = 'r'. it contains the local pieces of the distributed vectors v representing th lld_v >= locr(iv+k-1). local memory to an array of dimension (lld_a,locc(ja+n-1)). this array contains the local pieces of the distribute pieces of the distributed matrix multiplied by cto/cfrom. to an array of dimension (lld_a,locc(ja+n-1)). this array contains the local pieces of the distributed matrix sub( a is set as follows: to an array of dimension (lld_a,locc(ja+n-1)). this array contains the local pieces of the distributed matrix sub( a is set as follows: local memory to an array of dimension (lld_a, * ). on entry, this array contains the local pieces of the distri applied. on exit the permuted distributed matrix. to an array of dimension ( lld_a, locc(ja+n-1) ). this array contains the local pieces of the distributed matrix the trac local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangular part of sub( a ) contains the upper trian- local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the triangular factor l or u matrix sub( a ) is overwritten with the upper triangle of the local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the triangular factor l or u matrix sub( a ) is overwritten with the upper triangle of the x (local input) complex array containing the local pieces of a distributed matrix of dimension of at leas this array contains the entries of the distributed vector lld_a >=(bw+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(bw+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). an array of dimension ( lld_a, locc(ja+n-1) ). on entry, this array contains the local pieces of the factors l or u fro l*l', as computed by pcpotrf. memory to an array of local dimension (lld_a,locc(ja+n-1) ). this array contains the local pieces of the n-by-n hermitia if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the triangular factor u or sub( a ) = u**h*u or l*l**h, as computed by pcpotrf. (lld_b,locc(jb+nrhs-1)). on entry, this array contains the the local pieces of the right hand sides sub( b ) distributed matrix x. on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions sx (local input/local output) complex array containing the local pieces of a distributed matrix o ( (jx-1)*m_x + ix + ( n - 1 )*abs( incx ) ) to an array of dimension ( lld_a, locc(ja+n-1) ). this array contains the local pieces of the triangular distribute n-by-n upper triangular part of this distributed matrix con- to an array of local dimension (lld_a,locc(ja+n-1) ). this array contains the local pieces of the original triangula if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a,locc(ja+n-1)), this array contains the local pieces of the triangular matri part of the matrix sub( a ) contains the upper triangular local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th n-by-n upper triangular part of the matrix sub( a ) contains to an array of dimension (lld_a,locc(ja+n-1) ). this array contains the local pieces of the distributed triangula triangular part of sub( a ) contains the upper triangular local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangular part of sub( a ) contains the upper trian- matrix argument a(ia:*,ja+n-k:ja+n-1). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer argument a(ia:*,ja:ja+k-1). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer argument a(ia:ia+k-1,ja:*). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer argument a(ia:ia+k-1,ja:*). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer matrix argument a(ia:*,ja+n-k:ja+n-1). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer matrix argument a(ia:*,ja:ja+k-1). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer matrix argument a(ia+m-k:ia+m-1,ja:*). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer matrix argument a(ia+m-k:ia+m-1,ja:*). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or q'*sub( c ) or sub( c )*q' or sub( c )*q; if vect='p, local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. lld_a >=(bwl+bwu+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(bwl+bwu+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions lld_a >=(2*bwl+2*bwu+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(2*bwl+2*bwu+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th the diagonal and the first superdiagonal of sub( a ) are local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th the diagonal and the first superdiagonal of sub( a ) are to an array of dimension ( lld_a, locc(ja+n-1) ). on entry, this array contains the local pieces of the factors l and unit diagonal elements of l are not stored. to an array of dimension ( lld_a, locc(ja+n-1) ), the local pieces of the m-by-n distributed matrix whos local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of the n-by- the upper triangle and the first subdiagonal of sub( a ) are local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of the n-by- the upper triangle and the first subdiagonal of sub( a ) are local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and below the diagonal of sub( a ) contain the m by min(m,n) local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and below the diagonal of sub( a ) contain the m by min(m,n) (lld_b, locc(jb+nrhs-1)). on entry, this array contains the local pieces of the distributed matrix b of right hand sid sub( b ) is m-by-nrhs if trans='n', and n-by-nrhs otherwise. local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri lower triangle of the distributed submatrix local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri lower triangle of the distributed submatrix local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and above the diagonal of sub( a ) contain the min(m,n) by n local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and above the diagonal of sub( a ) contain the min(m,n) by n local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and above the diagonal of sub( a ) contain the min(m,n) by n memory to an array of local dimension (lld_a,locc(ja+n-1)). this array contains the local pieces of the distribute local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangle of a( ia:ia+m-1, ja+n-m:ja+n-1 ) contains the local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangle of a( ia:ia+m-1, ja+n-m:ja+n-1 ) contains the local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, the local pieces of the n-by-n distributed matri local pieces of the factors l and u from the factorization local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of the m-by- the local pieces of the factors l and u from the factoriza- local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of the m-by- array contains the local pieces of the factors l and u from local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, the local pieces of the l and u obtained by th exit, if info = 0, sub( a ) contains the inverse of the memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of the factor diagonal elements of l are not stored. local memory to an array of dimension (lld_a, locc(ja+m-1)). on entry, the local pieces of the n-by-m distributed matri and above the diagonal of sub( a ) contain the min(n,m) by m local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangle of a( ia:ia+m-1, ja+n-m:ja+n-1 ) contains the local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th the first nb rows and columns of the matrix are overwritten; to an array of dimension (lld_a, locc(ja+n-1) ). this array contains the local pieces of the distributed matrix sub( a to an array of dimension (lld_a, locc(ja+n-1) ). this array contains the local pieces of the distributed matrix sub( a locc(ja+n-k)). on entry, this array contains the the local pieces of the n-by-(n-k+1) general distributed matri the k-th subdiagonal in the first nb columns are overwritten to an array of dimension (lld_a, locc(ja+n-1)) containing the local pieces of the distributed matrix sub( a ) ia (global input) integer local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th interchanges will be applied. on exit, the local pieces local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this local array contains the local pieces of th interchanges will be applied. on exit, this array contains memory to an array of local dimension (lld_a,locc(ja+n-1)). on entry, the local pieces of the distributed symmetri triangular part of sub( a ) contains the upper triangular side = 'l', ( lld_v, locc(jv+n-1) ) if storev = 'r' and side = 'r'. it contains the local pieces of the distribute see further details. local memory to an array of dimension (lld_x,*). this array contains the local pieces of the distributed vector sub( x ) the vector x. on exit, it is overwritten with the vector v. (lld_v, locc(jv+n-1)) if side = 'r'. it contains the local pieces of the distributed vectors v representing th lld_v >= locr(iv+k-1). local memory to an array of dimension (lld_a,locc(ja+n-1)). this array contains the local pieces of the distribute pieces of the distributed matrix multiplied by cto/cfrom. to an array of dimension (lld_a,locc(ja+n-1)). this array contains the local pieces of the distributed matrix sub( a is set as follows: to an array of dimension (lld_a,locc(ja+n-1)). this array contains the local pieces of the distributed matrix sub( a is set as follows: to an array of dimension (lld_q, locc(jq+n-1) ). this array contains the local pieces of the distributed matrix sub( a local memory to an array of dimension (lld_a, * ). on entry, this array contains the local pieces of the distri applied. on exit the permuted distributed matrix. to an array of dimension ( lld_a, locc(ja+n-1) ). this array contains the local pieces of the distributed matrix the trac local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangular part of sub( a ) contains the upper trian- local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the triangular factor l or u matrix sub( a ) is overwritten with the upper triangle of the local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the triangular factor l or u matrix sub( a ) is overwritten with the upper triangle of the matrix argument a(ia:*,ja+n-k:ja+n-1). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer argument a(ia:*,ja:ja+k-1). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer argument a(ia:ia+k-1,ja:*). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer argument a(ia:ia+k-1,ja:*). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer matrix argument a(ia:*,ja+n-k:ja+n-1). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer matrix argument a(ia:*,ja:ja+k-1). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer matrix argument a(ia+m-k:ia+m-1,ja:*). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer matrix argument a(ia+m-k:ia+m-1,ja:*). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or q'*sub( c ) or sub( c )*q' or sub( c )*q; if vect='p, local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. lld_a >=(bw+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(bw+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). to an array of dimension ( lld_a, locc(ja+n-1) ). on entry, this array contains the local pieces of the factors l or or l*l', as computed by pdpotrf. memory to an array of local dimension (lld_a,locc(ja+n-1) ). this array contains the local pieces of the n-by-n symmetri if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the triangular factor u or sub( a ) = u**t*u or l*l**t, as computed by pdpotrf. (lld_b,locc(jb+nrhs-1)). on entry, this array contains the the local pieces of the right hand sides sub( b ) distributed matrix x. on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions sx (local input/local output) double precision array containing the local pieces of a distributed matrix o ( (jx-1)*m_x + ix + ( n - 1 )*abs( incx ) ) local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th the leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th the leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th the leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th the leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locq(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains to an array of dimension ( lld_a, locc(ja+n-1) ). this array contains the local pieces of the triangular distribute n-by-n upper triangular part of this distributed matrix con- to an array of local dimension (lld_a,locc(ja+n-1) ). this array contains the local pieces of the original triangula if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a,locc(ja+n-1)), this array contains the local pieces of the triangular matri part of the matrix sub( a ) contains the upper triangular local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th n-by-n upper triangular part of the matrix sub( a ) contains to an array of dimension (lld_a,locc(ja+n-1) ). this array contains the local pieces of the distributed triangula triangular part of sub( a ) contains the upper triangular local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangular part of sub( a ) contains the upper trian- x (local input) complex*16 array containing the local pieces of a distributed matrix of dimension of at leas this array contains the entries of the distributed vector x (local input) complex array containing the local pieces of a distributed matrix of dimension of at leas this array contains the entries of the distributed vector lld_a >=(bwl+bwu+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(bwl+bwu+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions lld_a >=(2*bwl+2*bwu+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(2*bwl+2*bwu+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th the diagonal and the first superdiagonal of sub( a ) are local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th the diagonal and the first superdiagonal of sub( a ) are to an array of dimension ( lld_a, locc(ja+n-1) ). on entry, this array contains the local pieces of the factors l and unit diagonal elements of l are not stored. to an array of dimension ( lld_a, locc(ja+n-1) ), the local pieces of the m-by-n distributed matrix whos local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of the n-by- the upper triangle and the first subdiagonal of sub( a ) are local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of the n-by- the upper triangle and the first subdiagonal of sub( a ) are local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and below the diagonal of sub( a ) contain the m by min(m,n) local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and below the diagonal of sub( a ) contain the m by min(m,n) (lld_b, locc(jb+nrhs-1)). on entry, this array contains the local pieces of the distributed matrix b of right hand sid sub( b ) is m-by-nrhs if trans='n', and n-by-nrhs otherwise. local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri lower triangle of the distributed submatrix local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri lower triangle of the distributed submatrix local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and above the diagonal of sub( a ) contain the min(m,n) by n local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and above the diagonal of sub( a ) contain the min(m,n) by n local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and above the diagonal of sub( a ) contain the min(m,n) by n memory to an array of local dimension (lld_a,locc(ja+n-1)). this array contains the local pieces of the distribute local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangle of a( ia:ia+m-1, ja+n-m:ja+n-1 ) contains the local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangle of a( ia:ia+m-1, ja+n-m:ja+n-1 ) contains the local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, the local pieces of the n-by-n distributed matri local pieces of the factors l and u from the factorization local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of the m-by- the local pieces of the factors l and u from the factoriza- local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of the m-by- array contains the local pieces of the factors l and u from local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, the local pieces of the l and u obtained by th exit, if info = 0, sub( a ) contains the inverse of the memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of the factor diagonal elements of l are not stored. local memory to an array of dimension (lld_a, locc(ja+m-1)). on entry, the local pieces of the n-by-m distributed matri and above the diagonal of sub( a ) contain the min(n,m) by m local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangle of a( ia:ia+m-1, ja+n-m:ja+n-1 ) contains the local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th the first nb rows and columns of the matrix are overwritten; to an array of dimension (lld_a, locc(ja+n-1) ). this array contains the local pieces of the distributed matrix sub( a to an array of dimension (lld_a, locc(ja+n-1) ). this array contains the local pieces of the distributed matrix sub( a locc(ja+n-k)). on entry, this array contains the the local pieces of the n-by-(n-k+1) general distributed matri the k-th subdiagonal in the first nb columns are overwritten to an array of dimension (lld_a, locc(ja+n-1)) containing the local pieces of the distributed matrix sub( a ) ia (global input) integer local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th interchanges will be applied. on exit, the local pieces local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this local array contains the local pieces of th interchanges will be applied. on exit, this array contains memory to an array of local dimension (lld_a,locc(ja+n-1)). on entry, the local pieces of the distributed symmetri triangular part of sub( a ) contains the upper triangular side = 'l', ( lld_v, locc(jv+n-1) ) if storev = 'r' and side = 'r'. it contains the local pieces of the distribute see further details. local memory to an array of dimension (lld_x,*). this array contains the local pieces of the distributed vector sub( x ) the vector x. on exit, it is overwritten with the vector v. (lld_v, locc(jv+n-1)) if side = 'r'. it contains the local pieces of the distributed vectors v representing th lld_v >= locr(iv+k-1). local memory to an array of dimension (lld_a,locc(ja+n-1)). this array contains the local pieces of the distribute pieces of the distributed matrix multiplied by cto/cfrom. to an array of dimension (lld_a,locc(ja+n-1)). this array contains the local pieces of the distributed matrix sub( a is set as follows: to an array of dimension (lld_a,locc(ja+n-1)). this array contains the local pieces of the distributed matrix sub( a is set as follows: to an array of dimension (lld_q, locc(jq+n-1) ). this array contains the local pieces of the distributed matrix sub( a local memory to an array of dimension (lld_a, * ). on entry, this array contains the local pieces of the distri applied. on exit the permuted distributed matrix. to an array of dimension ( lld_a, locc(ja+n-1) ). this array contains the local pieces of the distributed matrix the trac local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangular part of sub( a ) contains the upper trian- local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the triangular factor l or u matrix sub( a ) is overwritten with the upper triangle of the local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the triangular factor l or u matrix sub( a ) is overwritten with the upper triangle of the matrix argument a(ia:*,ja+n-k:ja+n-1). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer argument a(ia:*,ja:ja+k-1). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer argument a(ia:ia+k-1,ja:*). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer argument a(ia:ia+k-1,ja:*). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer matrix argument a(ia:*,ja+n-k:ja+n-1). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer matrix argument a(ia:*,ja:ja+k-1). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer matrix argument a(ia+m-k:ia+m-1,ja:*). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer matrix argument a(ia+m-k:ia+m-1,ja:*). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or q'*sub( c ) or sub( c )*q' or sub( c )*q; if vect='p, local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. lld_a >=(bw+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(bw+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). an array of dimension ( lld_a, locc(ja+n-1) ). on entry, this array contains the local pieces of the factors l or u fro l*l', as computed by pspotrf. memory to an array of local dimension (lld_a,locc(ja+n-1) ). this array contains the local pieces of the n-by-n symmetri if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the triangular factor u or sub( a ) = u**t*u or l*l**t, as computed by pspotrf. (lld_b,locc(jb+nrhs-1)). on entry, this array contains the the local pieces of the right hand sides sub( b ) distributed matrix x. on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions sx (local input/local output) real array containing the local pieces of a distributed matrix o ( (jx-1)*m_x + ix + ( n - 1 )*abs( incx ) ) local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th the leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th the leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th the leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th the leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locq(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains to an array of dimension ( lld_a, locc(ja+n-1) ). this array contains the local pieces of the triangular distribute n-by-n upper triangular part of this distributed matrix con- to an array of local dimension (lld_a,locc(ja+n-1) ). this array contains the local pieces of the original triangula if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a,locc(ja+n-1)), this array contains the local pieces of the triangular matri part of the matrix sub( a ) contains the upper triangular local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th n-by-n upper triangular part of the matrix sub( a ) contains to an array of dimension (lld_a,locc(ja+n-1) ). this array contains the local pieces of the distributed triangula triangular part of sub( a ) contains the upper triangular local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangular part of sub( a ) contains the upper trian- lld_a >=(bwl+bwu+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(bwl+bwu+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). sx (local input/local output) complex*16 array containing the local pieces of a distributed matrix o ( (jx-1)*m_x + ix + ( n - 1 )*abs( incx ) ) on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions lld_a >=(2*bwl+2*bwu+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(2*bwl+2*bwu+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th the diagonal and the first superdiagonal of sub( a ) are local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th the diagonal and the first superdiagonal of sub( a ) are to an array of dimension ( lld_a, locc(ja+n-1) ). on entry, this array contains the local pieces of the factors l and unit diagonal elements of l are not stored. to an array of dimension ( lld_a, locc(ja+n-1) ), the local pieces of the m-by-n distributed matrix whos local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of the n-by- the upper triangle and the first subdiagonal of sub( a ) are local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of the n-by- the upper triangle and the first subdiagonal of sub( a ) are local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and below the diagonal of sub( a ) contain the m by min(m,n) local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and below the diagonal of sub( a ) contain the m by min(m,n) (lld_b, locc(jb+nrhs-1)). on entry, this array contains the local pieces of the distributed matrix b of right hand sid sub( b ) is m-by-nrhs if trans='n', and n-by-nrhs otherwise. local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri lower triangle of the distributed submatrix local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri lower triangle of the distributed submatrix local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and above the diagonal of sub( a ) contain the min(m,n) by n local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and above the diagonal of sub( a ) contain the min(m,n) by n local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri and above the diagonal of sub( a ) contain the min(m,n) by n memory to an array of local dimension (lld_a,locc(ja+n-1)). this array contains the local pieces of the distribute local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangle of a( ia:ia+m-1, ja+n-m:ja+n-1 ) contains the local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangle of a( ia:ia+m-1, ja+n-m:ja+n-1 ) contains the local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, the local pieces of the n-by-n distributed matri local pieces of the factors l and u from the factorization local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of the m-by- the local pieces of the factors l and u from the factoriza- local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of the m-by- array contains the local pieces of the factors l and u from local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, the local pieces of the l and u obtained by th exit, if info = 0, sub( a ) contains the inverse of the memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of the factor diagonal elements of l are not stored. local memory to an array of dimension (lld_a, locc(ja+m-1)). on entry, the local pieces of the n-by-m distributed matri and above the diagonal of sub( a ) contain the min(n,m) by m local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangle of a( ia:ia+m-1, ja+n-m:ja+n-1 ) contains the local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th the leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th the leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th the leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th the leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locq(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th the first nb rows and columns of the matrix are overwritten; to an array of dimension (lld_a, locc(ja+n-1) ). this array contains the local pieces of the distributed matrix sub( a to an array of dimension (lld_a, locc(ja+n-1) ). this array contains the local pieces of the distributed matrix sub( a locc(ja+n-k)). on entry, this array contains the the local pieces of the n-by-(n-k+1) general distributed matri the k-th subdiagonal in the first nb columns are overwritten to an array of dimension (lld_a, locc(ja+n-1)) containing the local pieces of the distributed matrix sub( a ) ia (global input) integer local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th interchanges will be applied. on exit, the local pieces local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this local array contains the local pieces of th interchanges will be applied. on exit, this array contains memory to an array of local dimension (lld_a,locc(ja+n-1)). on entry, the local pieces of the distributed symmetri triangular part of sub( a ) contains the upper triangular side = 'l', ( lld_v, locc(jv+n-1) ) if storev = 'r' and side = 'r'. it contains the local pieces of the distribute see further details. local memory to an array of dimension (lld_x,*). this array contains the local pieces of the distributed vector sub( x ) the vector x. on exit, it is overwritten with the vector v. (lld_v, locc(jv+n-1)) if side = 'r'. it contains the local pieces of the distributed vectors v representing th lld_v >= locr(iv+k-1). local memory to an array of dimension (lld_a,locc(ja+n-1)). this array contains the local pieces of the distribute pieces of the distributed matrix multiplied by cto/cfrom. to an array of dimension (lld_a,locc(ja+n-1)). this array contains the local pieces of the distributed matrix sub( a is set as follows: to an array of dimension (lld_a,locc(ja+n-1)). this array contains the local pieces of the distributed matrix sub( a is set as follows: local memory to an array of dimension (lld_a, * ). on entry, this array contains the local pieces of the distri applied. on exit the permuted distributed matrix. to an array of dimension ( lld_a, locc(ja+n-1) ). this array contains the local pieces of the distributed matrix the trac local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th leading n-by-n upper triangular part of sub( a ) contains local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangular part of sub( a ) contains the upper trian- local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the triangular factor l or u matrix sub( a ) is overwritten with the upper triangle of the local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the triangular factor l or u matrix sub( a ) is overwritten with the upper triangle of the x (local input) complex*16 array containing the local pieces of a distributed matrix of dimension of at leas this array contains the entries of the distributed vector lld_a >=(bw+1) (stored in desca). on entry, this array contains the local pieces of th used in lapack. please see the notes below and the lld_a >=(bw+1) (stored in desca). on entry, this array contains the local pieces of th l^t a(1:n, ja:ja+n-1). an array of dimension ( lld_a, locc(ja+n-1) ). on entry, this array contains the local pieces of the factors l or u fro l*l', as computed by pzpotrf. memory to an array of local dimension (lld_a,locc(ja+n-1) ). this array contains the local pieces of the n-by-n hermitia if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, this array contains the local pieces of th if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the triangular factor u or sub( a ) = u**h*u or l*l**h, as computed by pzpotrf. (lld_b,locc(jb+nrhs-1)). on entry, this array contains the the local pieces of the right hand sides sub( b ) distributed matrix x. on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions on entry, this array contains the the local pieces of the right hand side on exit, this contains the local piece of the solutions to an array of dimension ( lld_a, locc(ja+n-1) ). this array contains the local pieces of the triangular distribute n-by-n upper triangular part of this distributed matrix con- to an array of local dimension (lld_a,locc(ja+n-1) ). this array contains the local pieces of the original triangula if uplo = 'u', the leading n-by-n upper triangular part of local memory to an array of dimension (lld_a,locc(ja+n-1)), this array contains the local pieces of the triangular matri part of the matrix sub( a ) contains the upper triangular local memory to an array of dimension (lld_a,locc(ja+n-1)). on entry, this array contains the local pieces of th n-by-n upper triangular part of the matrix sub( a ) contains to an array of dimension (lld_a,locc(ja+n-1) ). this array contains the local pieces of the distributed triangula triangular part of sub( a ) contains the upper triangular local memory to an array of dimension (lld_a, locc(ja+n-1)). on entry, the local pieces of the m-by-n distributed matri upper triangular part of sub( a ) contains the upper trian- matrix argument a(ia:*,ja+n-k:ja+n-1). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer argument a(ia:*,ja:ja+k-1). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer argument a(ia:ia+k-1,ja:*). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer argument a(ia:ia+k-1,ja:*). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer matrix argument a(ia:*,ja+n-k:ja+n-1). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer matrix argument a(ia:*,ja:ja+k-1). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer matrix argument a(ia+m-k:ia+m-1,ja:*). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer matrix argument a(ia+m-k:ia+m-1,ja:*). on exit, this array contains the local pieces of the m-by-n distributed matrix q ia (global input) integer local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or q'*sub( c ) or sub( c )*q' or sub( c )*q; if vect='p, local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. local memory to an array of dimension (lld_c,locc(jc+n-1)). on entry, the local pieces of the distributed matrix sub(c) or sub( c )*q' or sub( c )*q. |
| PIVMIN PIVMIN PIVMIN (input) double precisio implementation of the sturm sequence loop. this must be at PIVMIN (input) double precisio implementation of the sturm sequence loop. this must be at PIVMIN (input) rea implementation of the sturm sequence loop. this must be at PIVMIN (input) rea implementation of the sturm sequence loop. this must be at |
| pivot pivot find pivot and test for singularity. km is the number o find pivot and test for singularity. km is the number o gaussian elimination with pivotin of the matrix into p l u. ipiv (local output) integer array, dimension >= desca( nb ). pivot indices for local factorizations factorization and solve. locr(m_a)+mb_a. if fact = 'f', then ipiv is an input argu- ment and on entry contains the pivot indices from the fac pcgetrf; ipiv(i) -> the global row local row i was sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the pivot vector may be distributed across a process ro matrix a. this routine will transpose the pivot vector if necessary. or inv( p ) to a m-by-n distributed matrix sub( a ) denoting a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. th pivoting the rows of sub( a ), ipiv should be distributed along a interchange is initiated for each of rows or columns k1 trough k2 of sub( a ). this routine assumes that the pivoting information ha also note that this routine will only work for k1-k2 being in the gaussian elimination with pivotin of the matrix into p l u. ipiv (local output) integer array, dimension >= desca( nb ). pivot indices for local factorizations factorization and solve. locr(m_a)+mb_a. if fact = 'f', then ipiv is an input argu- ment and on entry contains the pivot indices from the fac pdgetrf; ipiv(i) -> the global row local row i was pivmin (input) double precision the minimum absolute of a "pivot" in the "paranoid least max_j |e(j)^2| *safe_min, and at least safe_min, where pivmin (input) double precision the minimum absolute of a "pivot" in this "paranoid least max_j |e(j)^2| *safe_min, and at least safe_min, where sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the pivot vector may be distributed across a process ro matrix a. this routine will transpose the pivot vector if necessary. or inv( p ) to a m-by-n distributed matrix sub( a ) denoting a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. th pivoting the rows of sub( a ), ipiv should be distributed along a interchange is initiated for each of rows or columns k1 trough k2 of sub( a ). this routine assumes that the pivoting information ha also note that this routine will only work for k1-k2 being in the gaussian elimination with pivotin of the matrix into p l u. ipiv (local output) integer array, dimension >= desca( nb ). pivot indices for local factorizations factorization and solve. locr(m_a)+mb_a. if fact = 'f', then ipiv is an input argu- ment and on entry contains the pivot indices from the fac psgetrf; ipiv(i) -> the global row local row i was pivmin (input) real the minimum absolute of a "pivot" in the "paranoid least max_j |e(j)^2| *safe_min, and at least safe_min, where pivmin (input) real the minimum absolute of a "pivot" in this "paranoid least max_j |e(j)^2| *safe_min, and at least safe_min, where sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the pivot vector may be distributed across a process ro matrix a. this routine will transpose the pivot vector if necessary. or inv( p ) to a m-by-n distributed matrix sub( a ) denoting a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. th pivoting the rows of sub( a ), ipiv should be distributed along a interchange is initiated for each of rows or columns k1 trough k2 of sub( a ). this routine assumes that the pivoting information ha also note that this routine will only work for k1-k2 being in the gaussian elimination with pivotin of the matrix into p l u. ipiv (local output) integer array, dimension >= desca( nb ). pivot indices for local factorizations factorization and solve. locr(m_a)+mb_a. if fact = 'f', then ipiv is an input argu- ment and on entry contains the pivot indices from the fac pzgetrf; ipiv(i) -> the global row local row i was sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the pivot vector may be distributed across a process ro matrix a. this routine will transpose the pivot vector if necessary. or inv( p ) to a m-by-n distributed matrix sub( a ) denoting a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. th pivoting the rows of sub( a ), ipiv should be distributed along a interchange is initiated for each of rows or columns k1 trough k2 of sub( a ). this routine assumes that the pivoting information ha also note that this routine will only work for k1-k2 being in the find pivot and test for singularity. km is the number o find pivot and test for singularity. km is the number o |
| pivoting pivoting cdbtrf computes an lu factorization of a real m-by-n band matrix a without using partial pivoting with row interchanges this is the unblocked version of the algorithm, calling level 2 blas. cdttrf computes an lu factorization of a complex tridiagonal matrix a using elimination without partial pivoting the factorization has the form ddbtrf computes an lu factorization of a real m-by-n band matrix a without using partial pivoting with row interchanges this is the unblocked version of the algorithm, calling level 2 blas. ddttrf computes an lu factorization of a complex tridiagonal matrix a using elimination without partial pivoting the factorization has the form compute lu factors with partial pivoting ( pt = lu gaussian elimination without pivoting of the matrix into l u. gaussian elimination without pivoting of the matrix into l u. gaussian elimination with pivoting of the matrix into p l u. lu factorization with partial pivoting pcgeqpf computes a qr factorization with column pivoting of ipiv (local input) integer array of dimension locr(m_af)+mb_af. this array contains the pivoting information as compute was swapped with. this array is tied to the distributed the lu decomposition with partial pivoting and row interchanges i tation matrix, l is unit lower triangular, and u is upper triangular. distributed matrix sub( a ) = a(ia:ia+m-1,ja:ja+n-1) using partial pivoting with row interchanges the factorization has the form sub( a ) = p * l * u, where p is a pcgetrf computes an lu factorization of a general m-by-n distributed matrix sub( a ) = (ia:ia+m-1,ja:ja+n-1) using partial pivoting wit ipiv (local input) integer array, dimension locr(m_a)+mb_a keeps track of the pivoting information. ipiv(i) is th array is tied to the distributed matrix a. ipiv (local input) integer array, dimension ( locr(m_a)+mb_a ) this array contains the pivoting information this array is tied to the distributed matrix a. sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the pivot vector may be distributed across a process ro matrix a. this routine will transpose the pivot vector if necessary. or inv( p ) to a m-by-n distributed matrix sub( a ) denoting a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. th pivoting the rows of sub( a ), ipiv should be distributed along a interchange is initiated for each of rows or columns k1 trough k2 of sub( a ). this routine assumes that the pivoting information ha also note that this routine will only work for k1-k2 being in the gaussian elimination without pivoting of the matrix into l u. gaussian elimination without pivoting of the matrix into l u. gaussian elimination with pivoting of the matrix into p l u. lu factorization with partial pivoting pdgeqpf computes a qr factorization with column pivoting of ipiv (local input) integer array of dimension locr(m_af)+mb_af. this array contains the pivoting information as compute was swapped with. this array is tied to the distributed the lu decomposition with partial pivoting and row interchanges i tation matrix, l is unit lower triangular, and u is upper triangular. distributed matrix sub( a ) = a(ia:ia+m-1,ja:ja+n-1) using partial pivoting with row interchanges the factorization has the form sub( a ) = p * l * u, where p is a pdgetrf computes an lu factorization of a general m-by-n distributed matrix sub( a ) = (ia:ia+m-1,ja:ja+n-1) using partial pivoting wit ipiv (local input) integer array, dimension locr(m_a)+mb_a keeps track of the pivoting information. ipiv(i) is th array is tied to the distributed matrix a. ipiv (local input) integer array, dimension ( locr(m_a)+mb_a ) this array contains the pivoting information this array is tied to the distributed matrix a. sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the pivot vector may be distributed across a process ro matrix a. this routine will transpose the pivot vector if necessary. or inv( p ) to a m-by-n distributed matrix sub( a ) denoting a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. th pivoting the rows of sub( a ), ipiv should be distributed along a interchange is initiated for each of rows or columns k1 trough k2 of sub( a ). this routine assumes that the pivoting information ha also note that this routine will only work for k1-k2 being in the gaussian elimination without pivoting of the matrix into l u. gaussian elimination without pivoting of the matrix into l u. gaussian elimination with pivoting of the matrix into p l u. lu factorization with partial pivoting psgeqpf computes a qr factorization with column pivoting of ipiv (local input) integer array of dimension locr(m_af)+mb_af. this array contains the pivoting information as compute was swapped with. this array is tied to the distributed the lu decomposition with partial pivoting and row interchanges i tation matrix, l is unit lower triangular, and u is upper triangular. distributed matrix sub( a ) = a(ia:ia+m-1,ja:ja+n-1) using partial pivoting with row interchanges the factorization has the form sub( a ) = p * l * u, where p is a psgetrf computes an lu factorization of a general m-by-n distributed matrix sub( a ) = (ia:ia+m-1,ja:ja+n-1) using partial pivoting wit ipiv (local input) integer array, dimension locr(m_a)+mb_a keeps track of the pivoting information. ipiv(i) is th array is tied to the distributed matrix a. ipiv (local input) integer array, dimension ( locr(m_a)+mb_a ) this array contains the pivoting information this array is tied to the distributed matrix a. sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the pivot vector may be distributed across a process ro matrix a. this routine will transpose the pivot vector if necessary. or inv( p ) to a m-by-n distributed matrix sub( a ) denoting a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. th pivoting the rows of sub( a ), ipiv should be distributed along a interchange is initiated for each of rows or columns k1 trough k2 of sub( a ). this routine assumes that the pivoting information ha also note that this routine will only work for k1-k2 being in the gaussian elimination without pivoting of the matrix into l u. gaussian elimination without pivoting of the matrix into l u. gaussian elimination with pivoting of the matrix into p l u. lu factorization with partial pivoting pzgeqpf computes a qr factorization with column pivoting of ipiv (local input) integer array of dimension locr(m_af)+mb_af. this array contains the pivoting information as compute was swapped with. this array is tied to the distributed the lu decomposition with partial pivoting and row interchanges i tation matrix, l is unit lower triangular, and u is upper triangular. distributed matrix sub( a ) = a(ia:ia+m-1,ja:ja+n-1) using partial pivoting with row interchanges the factorization has the form sub( a ) = p * l * u, where p is a pzgetrf computes an lu factorization of a general m-by-n distributed matrix sub( a ) = (ia:ia+m-1,ja:ja+n-1) using partial pivoting wit ipiv (local input) integer array, dimension locr(m_a)+mb_a keeps track of the pivoting information. ipiv(i) is th array is tied to the distributed matrix a. ipiv (local input) integer array, dimension ( locr(m_a)+mb_a ) this array contains the pivoting information this array is tied to the distributed matrix a. sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the pivot vector may be distributed across a process ro matrix a. this routine will transpose the pivot vector if necessary. or inv( p ) to a m-by-n distributed matrix sub( a ) denoting a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. th pivoting the rows of sub( a ), ipiv should be distributed along a interchange is initiated for each of rows or columns k1 trough k2 of sub( a ). this routine assumes that the pivoting information ha also note that this routine will only work for k1-k2 being in the sdbtrf computes an lu factorization of a real m-by-n band matrix a without using partial pivoting with row interchanges this is the unblocked version of the algorithm, calling level 2 blas. sdttrf computes an lu factorization of a complex tridiagonal matrix a using elimination without partial pivoting the factorization has the form compute lu factors with partial pivoting ( pt = lu zdbtrf computes an lu factorization of a real m-by-n band matrix a without using partial pivoting with row interchanges this is the unblocked version of the algorithm, calling level 2 blas. zdttrf computes an lu factorization of a complex tridiagonal matrix a using elimination without partial pivoting the factorization has the form |
| pivots pivots the dimension of the array iwork used as workspace for physically transposing the pivots if nprow == npcol then matrix a. this routine will transpose the pivot vector if necessary. for example if the row pivots should be applied to the columns o specifies in which order the permutation is applied: = 'f' (forward) applies pivots forward from top of matrix = 'b' (backward) applies pivots backward from bottom of the dimension of the array iwork used as workspace for physically transposing the pivots if nprow == npcol then matrix a. this routine will transpose the pivot vector if necessary. for example if the row pivots should be applied to the columns o specifies in which order the permutation is applied: = 'f' (forward) applies pivots forward from top of matrix = 'b' (backward) applies pivots backward from bottom of the dimension of the array iwork used as workspace for physically transposing the pivots if nprow == npcol then matrix a. this routine will transpose the pivot vector if necessary. for example if the row pivots should be applied to the columns o specifies in which order the permutation is applied: = 'f' (forward) applies pivots forward from top of matrix = 'b' (backward) applies pivots backward from bottom of the dimension of the array iwork used as workspace for physically transposing the pivots if nprow == npcol then matrix a. this routine will transpose the pivot vector if necessary. for example if the row pivots should be applied to the columns o specifies in which order the permutation is applied: = 'f' (forward) applies pivots forward from top of matrix = 'b' (backward) applies pivots backward from bottom of |
| PIVROC PIVROC for example if the row pivots should be applied to the columns of sub( a ), pass rowcol='c' and PIVROC='c' notes for example if the row pivots should be applied to the columns of sub( a ), pass rowcol='c' and PIVROC='c' notes for example if the row pivots should be applied to the columns of sub( a ), pass rowcol='c' and PIVROC='c' notes for example if the row pivots should be applied to the columns of sub( a ), pass rowcol='c' and PIVROC='c' notes |
| PJLAENV PJLAENV ictxt = desca( ctxt_ ) anb = PJLAENV( ictxt, 3, 'pchettrd', 'l', 0, 0, 0, 0 nps = max( numroc( n, 1, 0, 0, sqnpc ), 2*anb ) ictxt = desca( ctxt_ ) anb = PJLAENV( ictxt, 3, 'pchettrd', 'l', 0, 0, 0, 0 nps = max( numroc( n, 1, 0, 0, sqnpc ), 2*anb ) ictxt = desca( ctxt_ ) anb = PJLAENV( ictxt, 3, 'pchettrd', 'l', 0, 0, 0, 0 nps = max( numroc( n, 1, 0, 0, sqnpc ), 2*anb ) nps = max( numroc( n, 1, 0, 0, nprow ), 2*anb ) anb = PJLAENV( desca( ctxt_ ), 3, 'pchettrd', 'l', 0, 0 anb = PJLAENV( desca( ctxt_), 3, 'pdsyttrd', 'l' sqnpc = int( sqrt( dble( nprow * npcol ) ) ) anb = PJLAENV( desca( ctxt_), 3, 'pdsyttrd', 'l' sqnpc = int( sqrt( dble( nprow * npcol ) ) ) ictxt = desca( ctxt_ ) anb = PJLAENV( ictxt, 3, 'pdsyttrd', 'l', 0, 0, 0, 0 nps = max( numroc( n, 1, 0, 0, sqnpc ), 2*anb ) nps = max( numroc( n, 1, 0, 0, nprow ), 2*anb ) anb = PJLAENV( desca( ctxt_ ), 3, 'pdsyttrd', 'l', 0, 0 PJLAENV is called from the scalapack symmetric and hermitia problem-dependent parameters for the local environment. see ispec anb = PJLAENV( desca( ctxt_), 3, 'pssyttrd', 'l' sqnpc = int( sqrt( dble( nprow * npcol ) ) ) anb = PJLAENV( desca( ctxt_), 3, 'pssyttrd', 'l' sqnpc = int( sqrt( dble( nprow * npcol ) ) ) ictxt = desca( ctxt_ ) anb = PJLAENV( ictxt, 3, 'pssyttrd', 'l', 0, 0, 0, 0 nps = max( numroc( n, 1, 0, 0, sqnpc ), 2*anb ) nps = max( numroc( n, 1, 0, 0, nprow ), 2*anb ) anb = PJLAENV( desca( ctxt_ ), 3, 'pssyttrd', 'l', 0, 0 ictxt = desca( ctxt_ ) anb = PJLAENV( ictxt, 3, 'pzhettrd', 'l', 0, 0, 0, 0 nps = max( numroc( n, 1, 0, 0, sqnpc ), 2*anb ) ictxt = desca( ctxt_ ) anb = PJLAENV( ictxt, 3, 'pzhettrd', 'l', 0, 0, 0, 0 nps = max( numroc( n, 1, 0, 0, sqnpc ), 2*anb ) ictxt = desca( ctxt_ ) anb = PJLAENV( ictxt, 3, 'pzhettrd', 'l', 0, 0, 0, 0 nps = max( numroc( n, 1, 0, 0, sqnpc ), 2*anb ) nps = max( numroc( n, 1, 0, 0, nprow ), 2*anb ) anb = PJLAENV( desca( ctxt_ ), 3, 'pzhettrd', 'l', 0, 0 |
| place place move block into place that it will be expected to be fo move block into place that it will be expected to be fo where eps is the machine precision. if abstol is less than or equal to zero, then eps*norm(t) will be used in its place obtained by reducing a to tridiagonal form. where eps is the machine precision. if abstol is less than or equal to zero, then eps*norm(t) will be used in its place obtained by reducing a to tridiagonal form. the tailored codes place no restrictions on ia, ja, mb or nb by pchetrd to keep the interface simple. these restrictions are 2.) the small work it takes so that each of the rows and columns is at the same place. for example through some column tmp. (loops 250-260) to 1. indeed, i suspect that ib should always be set to 1 or ignored with 1 used in its place pclamr1d has not been tested except withint the contect of move block into place that it will be expected to be fo move block into place that it will be expected to be fo move block into place that it will be expected to be fo the blacs context handle in which the computation takes place small (local input/local output) double precision indxp (workspace) integer array, dimension (n) the permutation used to place deflated values of d at the en and indxp(k+1:n) points to the deflated eigenvalues. 2.) the small work it takes so that each of the rows and columns is at the same place. for example through some column tmp. (loops within 190) the blacs context handle in which the computation takes place cmach (global input) character*1 to 1. indeed, i suspect that ib should always be set to 1 or ignored with 1 used in its place pdlamr1d has not been tested except withint the contect of move block into place that it will be expected to be fo move block into place that it will be expected to be fo where eps is the machine precision. if abstol is less than or equal to zero, then eps*norm(t) will be used in its place obtained by reducing a to tridiagonal form. where eps is the machine precision. if abstol is less than or equal to zero, then eps*norm(t) will be used in its place obtained by reducing a to tridiagonal form. the tailored codes place no restrictions on ia, ja, mb or nb by pdsytrd to keep the interface simple. these restrictions are move block into place that it will be expected to be fo the blacs context handle in which the computation takes place small (local input/local output) real indxp (workspace) integer array, dimension (n) the permutation used to place deflated values of d at the en and indxp(k+1:n) points to the deflated eigenvalues. 2.) the small work it takes so that each of the rows and columns is at the same place. for example through some column tmp. (loops within 190) the blacs context handle in which the computation takes place cmach (global input) character*1 to 1. indeed, i suspect that ib should always be set to 1 or ignored with 1 used in its place pslamr1d has not been tested except withint the contect of move block into place that it will be expected to be fo move block into place that it will be expected to be fo where eps is the machine precision. if abstol is less than or equal to zero, then eps*norm(t) will be used in its place obtained by reducing a to tridiagonal form. where eps is the machine precision. if abstol is less than or equal to zero, then eps*norm(t) will be used in its place obtained by reducing a to tridiagonal form. the tailored codes place no restrictions on ia, ja, mb or nb by pssytrd to keep the interface simple. these restrictions are move block into place that it will be expected to be fo move block into place that it will be expected to be fo where eps is the machine precision. if abstol is less than or equal to zero, then eps*norm(t) will be used in its place obtained by reducing a to tridiagonal form. where eps is the machine precision. if abstol is less than or equal to zero, then eps*norm(t) will be used in its place obtained by reducing a to tridiagonal form. the tailored codes place no restrictions on ia, ja, mb or nb by pzhetrd to keep the interface simple. these restrictions are 2.) the small work it takes so that each of the rows and columns is at the same place. for example through some column tmp. (loops 250-260) to 1. indeed, i suspect that ib should always be set to 1 or ignored with 1 used in its place pzlamr1d has not been tested except withint the contect of move block into place that it will be expected to be fo move block into place that it will be expected to be fo |
| placed placed array into a local replicated array or vise versa. notice that the entire submatrix that is copied gets placed on one node o can receive, or just one row or column of nodes. = 'y': equilibration was done, i.e., sub( a ) has been re- placed by array into a local replicated array or vise versa. notice that the entire submatrix that is copied gets placed on one node o can receive, or just one row or column of nodes. = 'y': equilibration was done, i.e., sub( a ) has been re- placed by array into a local replicated array or vise versa. notice that the entire submatrix that is copied gets placed on one node o can receive, or just one row or column of nodes. = 'y': equilibration was done, i.e., sub( a ) has been re- placed by array into a local replicated array or vise versa. notice that the entire submatrix that is copied gets placed on one node o can receive, or just one row or column of nodes. = 'y': equilibration was done, i.e., sub( a ) has been re- placed by |
| Please Please this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. the array descriptor for the distributed matrix a. contains information of mapping of a to memory. Please the array descriptor for the distributed matrix a. contains information of mapping of a to memory. Please this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. the array descriptor for the distributed matrix a. contains information of mapping of a to memory. Please the array descriptor for the distributed matrix a. contains information of mapping of a to memory. Please this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. the array descriptor for the distributed matrix a. contains information of mapping of a to memory. Please the array descriptor for the distributed matrix a. contains information of mapping of a to memory. Please this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. the array descriptor for the distributed matrix a. contains information of mapping of a to memory. Please the array descriptor for the distributed matrix a. contains information of mapping of a to memory. Please this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. the array descriptor for the distributed matrix a. contains information of mapping of a to memory. Please the array descriptor for the distributed matrix a. contains information of mapping of a to memory. Please this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. the array descriptor for the distributed matrix a. contains information of mapping of a to memory. Please the array descriptor for the distributed matrix a. contains information of mapping of a to memory. Please this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. the array descriptor for the distributed matrix a. contains information of mapping of a to memory. Please the array descriptor for the distributed matrix a. contains information of mapping of a to memory. Please this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. this local portion is stored in the packed banded format used in lapack. Please see the notes below and th distributed matrices. the array descriptor for the distributed matrix a. contains information of mapping of a to memory. Please the array descriptor for the distributed matrix a. contains information of mapping of a to memory. Please |
| Point Point ia (global input) integer a's global row index, which Points to the beginning of th a (local input/local output) complex Pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex Pointer into th on entry, this array contains the local pieces of the triangular matrix, stopping/starting at the diagonal, which is the Point of reflection. the pictures below demonstrate this refered to as rowsums, and the column sums shown by | are refered triangular matrix, stopping/starting at the diagonal, which is the Point of reflection. the pictures below demonstrate this refered to as rowsums, and the column sums shown by | are refered = 0 : find an interval with desired values of n(w) at the endPoints of the interval interval with a desired value of n(w). this code makes very mild assumptions about floating Point add/subtract, or on those binary machines without guard digits triangular matrix, stopping/starting at the diagonal, which is the Point of reflection. the pictures below demonstrate this refered to as rowsums, and the column sums shown by | are refered the innermost loop to avoid overflow and determine the sign of a floating Point number. pdlapdct will be referred to as the "paranoid arithmetic (b) the sign bit of a single precision floating Point number is assumed be in the 32nd bit positio this code makes very mild assumptions about floating Point add/subtract, or on those binary machines without guard digits ia (global input) integer a's global row index, which Points to the beginning of th a (local input/local output) double precision Pointer into th on entry, this array contains the local pieces of the a (local input/local output) double precision Pointer into th on entry, this array contains the local pieces of the = 0 : find an interval with desired values of n(w) at the endPoints of the interval interval with a desired value of n(w). this code makes very mild assumptions about floating Point add/subtract, or on those binary machines without guard digits triangular matrix, stopping/starting at the diagonal, which is the Point of reflection. the pictures below demonstrate this refered to as rowsums, and the column sums shown by | are refered the innermost loop to avoid overflow and determine the sign of a floating Point number. pslapdct will be referred to as the "paranoid arithmetic (b) the sign bit of a double precision floating Point number is assumed be in the 32nd or 64th bit positio this code makes very mild assumptions about floating Point add/subtract, or on those binary machines without guard digits ia (global input) integer a's global row index, which Points to the beginning of th a (local input/local output) real Pointer into th on entry, this array contains the local pieces of the a (local input/local output) real Pointer into th on entry, this array contains the local pieces of the ia (global input) integer a's global row index, which Points to the beginning of th a (local input/local output) complex*16 Pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex*16 Pointer into th on entry, this array contains the local pieces of the triangular matrix, stopping/starting at the diagonal, which is the Point of reflection. the pictures below demonstrate this refered to as rowsums, and the column sums shown by | are refered triangular matrix, stopping/starting at the diagonal, which is the Point of reflection. the pictures below demonstrate this refered to as rowsums, and the column sums shown by | are refered |
| pointer pointer a (local input/local output) complex pointer int lld_a >=(bwl+bwu+1) (stored in desca). a (local input/local output) complex pointer int lld_a >=(bwl+bwu+1) (stored in desca). dl (local input/local output) complex pointer to loca matrix. globally, dl(1) is not referenced, and dl must be dl (local input/local output) complex pointer to loca matrix. globally, dl(1) is not referenced, and dl must be a (local input/local output) complex pointer int lld_a >=(2*bwl+2*bwu+1) (stored in desca). is nr+bwu where nr is the number of columns on the last processor finally aptr is the pointer to the first element of a. as lapac has to be adjusted on processor mycol=0. a (local input/local output) complex pointer int lld_a >=(2*bwl+2*bwu+1) (stored in desca). a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the a (local input) complex pointer into the local memor this array contains the local pieces of the factors l and u a (local input) complex pointer into the local memor local pieces of the m-by-n distributed matrix whose a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the n-by-n a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the n-by-n a (local input/local output) complex pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex pointer into th ( lld_a, locc(ja+n-1) ). on entry, the m-by-n matrix a. a (local input/local output) complex pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input) complex pointer into the loca this array contains the local pieces of the distributed a (local input/local output) complex pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex pointer into th on entry, the local pieces of the n-by-n distributed matrix a (local input/local output) complex pointer int (lld_a,locc(ja+n-1)). on entry, the n-by-n matrix a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the m-by-n a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the m-by-n a (local input/local output) complex pointer into th on entry, the local pieces of the l and u obtained by the a (local input) complex pointer into the loca on entry, this array contains the local pieces of the factors a (local input/local output) complex pointer into th on entry, the local pieces of the n-by-m distributed matrix a (local input/local output) complex pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the x (local input/local output) complex pointer into th on entry the vector to be conjugated v (local workspace) complex pointer into the loca the final return, v = a*w, where est = norm(v)/norm(w) a (local input) complex pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (local input) complex pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (local input/local output) complex pointer int locc(ja+n-k)). on entry, this array contains the the local a (local output) complex*16 pointer into th on output, a is replicated across all processes in a (local input) complex pointer into the local memor local pieces of the distributed matrix sub( a ). icurcol : process column containing diagonal block irsc0 : pointer to part of work used to store the rowsums whil irsr0 : pointer to part of work used to store the rowsums after icurcol : process column containing diagonal block irsc0 : pointer to part of work used to store the rowsums whil irsr0 : pointer to part of work used to store the rowsums after a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex pointer into th on entry, this local array contains the local pieces of the a (local input/local output) complex pointer into th containing on entry the m-by-n matrix sub( a ). on exit, a (input/output) complex pointer into the loca on entry, the local pieces of the distributed symmetric v (local input) complex pointer into the local memor storev = 'c', ( lld_v, locc(jv+m-1)) if storev = 'r' and x (local input/local output) complex, pointer into th contains the local pieces of the distributed vector sub( x ). v (input/output) complex pointer into the local memor if storev = 'c', and (locr(iv+k-1),locc(jv+n-1)) if v (local input) complex pointer into the local memor (lld_v, locc(jv+n-1)) if side = 'r'. it contains the local v (input/output) complex pointer into the local memor the distributed matrix v contains the householder vectors. a (local input/local output) complex pointer into th this array contains the local pieces of the distributed a (local output) complex pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (local output) complex pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the distri- a (local input) complex pointer into the local memor contains the local pieces of the distributed matrix the trace a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex pointer into th on entry, the local pieces of the triangular factor l or u. a (local input/local output) complex pointer into th on entry, the local pieces of the triangular factor l or u. n (global input) pointer to intege n >= 0. a (local input/local output) complex pointer int lld_a >=(bw+1) (stored in desca). a (local input/local output) complex pointer int lld_a >=(bw+1) (stored in desca). a (local input) complex pointer into the local memory t array contains the local pieces of the factors l or u from a (local input) complex pointer into the local memory to a n-by-n hermitian positive definite distributed matrix a (local input) complex pointer into the loca this array contains the local pieces of the n-by-n hermitian a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex pointer int ( lld_a, locc(ja+n-1) ). a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex pointer into th on entry, the local pieces of the triangular factor u or l a (local input) complex pointer into local memory t array contains the factors l or u from the cholesky facto- d (local input/local output) complex pointer to loca matrix. d (local input/local output) complex pointer to loca matrix. n (global input) pointer to intege n >= 0. a (local input) complex pointer into the local memor contains the local pieces of the triangular distributed a (local input) complex pointer into the local memor array contains the local pieces of the original triangular a (local input/local output) complex pointer into th this array contains the local pieces of the triangular matrix a (local input/local output) complex pointer into th on entry, this array contains the local pieces of the a (local input) complex pointer into the local memor contains the local pieces of the distributed triangular a (local input/local output) complex pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex pointer into th on entry, the j-th column must contain the vector which a (local input/local output) complex pointer into th on entry, the j-th column must contain the vector which a (local input/local output) complex pointer into th on entry, the i-th row must contain the vector which defines a (local input/local output) complex pointer into th on entry, the i-th row must contain the vector which defines a (local input/local output) complex pointer into th on entry, the j-th column must contain the vector which a (local input/local output) complex pointer into th on entry, the j-th column must contain the vector which a (local input/local output) complex pointer into th on entry, the i-th row must contain the vector which defines a (local input/local output) complex pointer into th on entry, the i-th row must contain the vector which defines a (local input) complex pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) complex pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) complex pointer into the local memor vect='q', and (lld_a,locc(ja+nq-1)) if vect = 'p'. nq = m a (local input) complex pointer into the local memor and (lld_a,locc(ja+n-1)) if side = 'r'. the vectors which a (local input) complex pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) complex pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) complex pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) complex pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) complex pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) complex pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) complex pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) complex pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) complex pointer into the local memor or (lld_a,locc(ja+n-1)) if side = 'r'. the vectors which a (local input/local output) double precision pointer int lld_a >=(bwl+bwu+1) (stored in desca). a (local input/local output) double precision pointer int lld_a >=(bwl+bwu+1) (stored in desca). dl (local input/local output) double precision pointer to loca matrix. globally, dl(1) is not referenced, and dl must be dl (local input/local output) double precision pointer to loca matrix. globally, dl(1) is not referenced, and dl must be a (local input/local output) double precision pointer int lld_a >=(2*bwl+2*bwu+1) (stored in desca). is nr+bwu where nr is the number of columns on the last processor finally aptr is the pointer to the first element of a. as lapac has to be adjusted on processor mycol=0. a (local input/local output) double precision pointer int lld_a >=(2*bwl+2*bwu+1) (stored in desca). a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the a (local input) double precision pointer into the local memor this array contains the local pieces of the factors l and u a (local input) double precision pointer into the local memor local pieces of the m-by-n distributed matrix whose a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the n-by-n a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the n-by-n a (local input/local output) double precision pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double precision pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double precision pointer into th ( lld_a, locc(ja+n-1) ). on entry, the m-by-n matrix a. a (local input/local output) double precision pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double precision pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double precision pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double precision pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double precision pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input) double precision pointer into the loca this array contains the local pieces of the distributed a (local input/local output) double precision pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double precision pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double precision pointer into th on entry, the local pieces of the n-by-n distributed matrix a (local input/local output) double precision pointer int (lld_a,locc(ja+n-1)). on entry, the n-by-n matrix a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the m-by-n a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the m-by-n a (local input/local output) double precision pointer into th on entry, the local pieces of the l and u obtained by the a (local input) double precision pointer into the loca on entry, this array contains the local pieces of the factors a (local input/local output) double precision pointer into th on entry, the local pieces of the n-by-m distributed matrix a (local input/local output) double precision pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the v (local workspace) double precision pointer into the loca the final return, v = a*w, where est = norm(v)/norm(w) a (local input) double precision pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (local input) double precision pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (local input/local output) double precision pointer int locc(ja+n-k)). on entry, this array contains the the local a (local output) complex*16 pointer into th on output, a is replicated across all processes in a (local input) double precision pointer into the local memor local pieces of the distributed matrix sub( a ). icurcol : process column containing diagonal block irsc0 : pointer to part of work used to store the rowsums whil irsr0 : pointer to part of work used to store the rowsums after a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the a (local input/local output) double precision pointer into th on entry, this local array contains the local pieces of the a (local input/local output) double precision pointer into th containing on entry the m-by-n matrix sub( a ). on exit, a (input/output) double precision pointer into the loca on entry, the local pieces of the distributed symmetric v (local input) double precision pointer into the local memor storev = 'c', ( lld_v, locc(jv+m-1)) if storev = 'r' and x (local input/local output) double precision, pointer into th contains the local pieces of the distributed vector sub( x ). v (input/output) double precision pointer into the local memor if storev = 'c', and (locr(iv+k-1),locc(jv+n-1)) if v (local input) double precision pointer into the local memor (lld_v, locc(jv+n-1)) if side = 'r'. it contains the local v (input/output) double precision pointer into the local memor the distributed matrix v contains the householder vectors. a (local input/local output) double precision pointer into th this array contains the local pieces of the distributed a (local output) double precision pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (local output) double precision pointer into the local memor contains the local pieces of the distributed matrix sub( a ) q (local input) double precision pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the distri- a (local input) double precision pointer into the local memor contains the local pieces of the distributed matrix the trace a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the a (local input/local output) double precision pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double precision pointer into th on entry, the local pieces of the triangular factor l or u. a (local input/local output) double precision pointer into th on entry, the local pieces of the triangular factor l or u. a (local input/local output) double precision pointer into th on entry, the j-th column must contain the vector which a (local input/local output) double precision pointer into th on entry, the j-th column must contain the vector which a (local input/local output) double precision pointer into th on entry, the i-th row must contain the vector which defines a (local input/local output) double precision pointer into th on entry, the i-th row must contain the vector which defines a (local input/local output) double precision pointer into th on entry, the j-th column must contain the vector which a (local input/local output) double precision pointer into th on entry, the j-th column must contain the vector which a (local input/local output) double precision pointer into th on entry, the i-th row must contain the vector which defines a (local input/local output) double precision pointer into th on entry, the i-th row must contain the vector which defines a (local input) double precision pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) double precision pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) double precision pointer into the local memor vect='q', and (lld_a,locc(ja+nq-1)) if vect = 'p'. nq = m a (local input) double precision pointer into the local memor and (lld_a,locc(ja+n-1)) if side = 'r'. the vectors which a (local input) double precision pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) double precision pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) double precision pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) double precision pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) double precision pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) double precision pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) double precision pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) double precision pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) double precision pointer into the local memor or (lld_a,locc(ja+n-1)) if side = 'r'. the vectors which a (local input/local output) double precision pointer int lld_a >=(bw+1) (stored in desca). a (local input/local output) double precision pointer int lld_a >=(bw+1) (stored in desca). a (local input) double precision pointer into the local memor this array contains the local pieces of the factors l or u a (local input) double precision pointer into the local memor n-by-n symmetric positive definite distributed matrix a (local input) double precision pointer into the loca this array contains the local pieces of the n-by-n symmetric a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the a (local input/local output) double precision pointer int ( lld_a, locc(ja+n-1) ). a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the a (local input/local output) double precision pointer into th on entry, the local pieces of the triangular factor u or l a (local input) double precision pointer into local memory t array contains the factors l or u from the cholesky facto- d (local input/local output) double precision pointer to loca matrix. d (local input/local output) double precision pointer to loca matrix. n (global input) pointer to intege n >= 0. a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the a (local input) double precision pointer into the local memor contains the local pieces of the triangular distributed a (local input) double precision pointer into the local memor array contains the local pieces of the original triangular a (local input/local output) double precision pointer into th this array contains the local pieces of the triangular matrix a (local input/local output) double precision pointer into th on entry, this array contains the local pieces of the a (local input) double precision pointer into the local memor contains the local pieces of the distributed triangular a (local input/local output) double precision pointer into th on entry, the local pieces of the m-by-n distributed matrix n (global input) pointer to intege n >= 0. n (global input) pointer to intege n >= 0. a (local input/local output) real pointer int lld_a >=(bwl+bwu+1) (stored in desca). a (local input/local output) real pointer int lld_a >=(bwl+bwu+1) (stored in desca). dl (local input/local output) real pointer to loca matrix. globally, dl(1) is not referenced, and dl must be dl (local input/local output) real pointer to loca matrix. globally, dl(1) is not referenced, and dl must be a (local input/local output) real pointer int lld_a >=(2*bwl+2*bwu+1) (stored in desca). is nr+bwu where nr is the number of columns on the last processor finally aptr is the pointer to the first element of a. as lapac has to be adjusted on processor mycol=0. a (local input/local output) real pointer int lld_a >=(2*bwl+2*bwu+1) (stored in desca). a (local input/local output) real pointer into th on entry, this array contains the local pieces of the a (local input/local output) real pointer into th on entry, this array contains the local pieces of the a (local input) real pointer into the local memor this array contains the local pieces of the factors l and u a (local input) real pointer into the local memor local pieces of the m-by-n distributed matrix whose a (local input/local output) real pointer into th on entry, this array contains the local pieces of the n-by-n a (local input/local output) real pointer into th on entry, this array contains the local pieces of the n-by-n a (local input/local output) real pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) real pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) real pointer into th ( lld_a, locc(ja+n-1) ). on entry, the m-by-n matrix a. a (local input/local output) real pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) real pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) real pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) real pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) real pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input) real pointer into the loca this array contains the local pieces of the distributed a (local input/local output) real pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) real pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) real pointer into th on entry, the local pieces of the n-by-n distributed matrix a (local input/local output) real pointer int (lld_a,locc(ja+n-1)). on entry, the n-by-n matrix a (local input/local output) real pointer into th on entry, this array contains the local pieces of the m-by-n a (local input/local output) real pointer into th on entry, this array contains the local pieces of the m-by-n a (local input/local output) real pointer into th on entry, the local pieces of the l and u obtained by the a (local input) real pointer into the loca on entry, this array contains the local pieces of the factors a (local input/local output) real pointer into th on entry, the local pieces of the n-by-m distributed matrix a (local input/local output) real pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) real pointer into th on entry, this array contains the local pieces of the v (local workspace) real pointer into the loca the final return, v = a*w, where est = norm(v)/norm(w) a (local input) real pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (local input) real pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (local input/local output) real pointer int locc(ja+n-k)). on entry, this array contains the the local a (local output) complex*16 pointer into th on output, a is replicated across all processes in a (local input) real pointer into the local memor local pieces of the distributed matrix sub( a ). icurcol : process column containing diagonal block irsc0 : pointer to part of work used to store the rowsums whil irsr0 : pointer to part of work used to store the rowsums after a (local input/local output) real pointer into th on entry, this array contains the local pieces of the a (local input/local output) real pointer into th on entry, this local array contains the local pieces of the a (local input/local output) real pointer into th containing on entry the m-by-n matrix sub( a ). on exit, a (input/output) real pointer into the loca on entry, the local pieces of the distributed symmetric v (local input) real pointer into the local memor storev = 'c', ( lld_v, locc(jv+m-1)) if storev = 'r' and x (local input/local output) real, pointer into th contains the local pieces of the distributed vector sub( x ). v (input/output) real pointer into the local memor if storev = 'c', and (locr(iv+k-1),locc(jv+n-1)) if v (local input) real pointer into the local memor (lld_v, locc(jv+n-1)) if side = 'r'. it contains the local v (input/output) real pointer into the local memor the distributed matrix v contains the householder vectors. a (local input/local output) real pointer into th this array contains the local pieces of the distributed a (local output) real pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (local output) real pointer into the local memor contains the local pieces of the distributed matrix sub( a ) q (local input) real pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (local input/local output) real pointer into th on entry, this array contains the local pieces of the distri- a (local input) real pointer into the local memor contains the local pieces of the distributed matrix the trace a (local input/local output) real pointer into th on entry, this array contains the local pieces of the a (local input/local output) real pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) real pointer into th on entry, the local pieces of the triangular factor l or u. a (local input/local output) real pointer into th on entry, the local pieces of the triangular factor l or u. a (local input/local output) real pointer into th on entry, the j-th column must contain the vector which a (local input/local output) real pointer into th on entry, the j-th column must contain the vector which a (local input/local output) real pointer into th on entry, the i-th row must contain the vector which defines a (local input/local output) real pointer into th on entry, the i-th row must contain the vector which defines a (local input/local output) real pointer into th on entry, the j-th column must contain the vector which a (local input/local output) real pointer into th on entry, the j-th column must contain the vector which a (local input/local output) real pointer into th on entry, the i-th row must contain the vector which defines a (local input/local output) real pointer into th on entry, the i-th row must contain the vector which defines a (local input) real pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) real pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) real pointer into the local memor vect='q', and (lld_a,locc(ja+nq-1)) if vect = 'p'. nq = m a (local input) real pointer into the local memor and (lld_a,locc(ja+n-1)) if side = 'r'. the vectors which a (local input) real pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) real pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) real pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) real pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) real pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) real pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) real pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) real pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) real pointer into the local memor or (lld_a,locc(ja+n-1)) if side = 'r'. the vectors which a (local input/local output) real pointer int lld_a >=(bw+1) (stored in desca). a (local input/local output) real pointer int lld_a >=(bw+1) (stored in desca). a (local input) real pointer into the local memory t array contains the local pieces of the factors l or u from a (local input) real pointer into the local memory to a n-by-n symmetric positive definite distributed matrix a (local input) real pointer into the loca this array contains the local pieces of the n-by-n symmetric a (local input/local output) real pointer into th on entry, this array contains the local pieces of the a (local input/local output) real pointer int ( lld_a, locc(ja+n-1) ). a (local input/local output) real pointer into th on entry, this array contains the local pieces of the a (local input/local output) real pointer into th on entry, this array contains the local pieces of the a (local input/local output) real pointer into th on entry, the local pieces of the triangular factor u or l a (local input) real pointer into local memory t array contains the factors l or u from the cholesky facto- d (local input/local output) real pointer to loca matrix. d (local input/local output) real pointer to loca matrix. n (global input) pointer to intege n >= 0. a (local input/local output) real pointer into th on entry, this array contains the local pieces of the a (local input/local output) real pointer into th on entry, this array contains the local pieces of the a (local input/local output) real pointer into th on entry, this array contains the local pieces of the a (local input/local output) real pointer into th on entry, this array contains the local pieces of the a (local input/local output) real pointer into th on entry, this array contains the local pieces of the a (local input/local output) real pointer into th on entry, this array contains the local pieces of the a (local input/local output) real pointer into th on entry, this array contains the local pieces of the a (local input/local output) real pointer into th on entry, this array contains the local pieces of the a (local input) real pointer into the local memor contains the local pieces of the triangular distributed a (local input) real pointer into the local memor array contains the local pieces of the original triangular a (local input/local output) real pointer into th this array contains the local pieces of the triangular matrix a (local input/local output) real pointer into th on entry, this array contains the local pieces of the a (local input) real pointer into the local memor contains the local pieces of the distributed triangular a (local input/local output) real pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex*16 pointer int lld_a >=(bwl+bwu+1) (stored in desca). a (local input/local output) complex*16 pointer int lld_a >=(bwl+bwu+1) (stored in desca). n (global input) pointer to intege n >= 0. dl (local input/local output) complex*16 pointer to loca matrix. globally, dl(1) is not referenced, and dl must be dl (local input/local output) complex*16 pointer to loca matrix. globally, dl(1) is not referenced, and dl must be a (local input/local output) complex*16 pointer int lld_a >=(2*bwl+2*bwu+1) (stored in desca). is nr+bwu where nr is the number of columns on the last processor finally aptr is the pointer to the first element of a. as lapac has to be adjusted on processor mycol=0. a (local input/local output) complex*16 pointer int lld_a >=(2*bwl+2*bwu+1) (stored in desca). a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the a (local input) complex*16 pointer into the local memor this array contains the local pieces of the factors l and u a (local input) complex*16 pointer into the local memor local pieces of the m-by-n distributed matrix whose a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the n-by-n a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the n-by-n a (local input/local output) complex*16 pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex*16 pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex*16 pointer into th ( lld_a, locc(ja+n-1) ). on entry, the m-by-n matrix a. a (local input/local output) complex*16 pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex*16 pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex*16 pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex*16 pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex*16 pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input) complex*16 pointer into the loca this array contains the local pieces of the distributed a (local input/local output) complex*16 pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex*16 pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex*16 pointer into th on entry, the local pieces of the n-by-n distributed matrix a (local input/local output) complex*16 pointer int (lld_a,locc(ja+n-1)). on entry, the n-by-n matrix a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the m-by-n a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the m-by-n a (local input/local output) complex*16 pointer into th on entry, the local pieces of the l and u obtained by the a (local input) complex*16 pointer into the loca on entry, this array contains the local pieces of the factors a (local input/local output) complex*16 pointer into th on entry, the local pieces of the n-by-m distributed matrix a (local input/local output) complex*16 pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the x (local input/local output) complex*16 pointer into th on entry the vector to be conjugated v (local workspace) complex*16 pointer into the loca the final return, v = a*w, where est = norm(v)/norm(w) a (local input) complex*16 pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (local input) complex*16 pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (local input/local output) complex*16 pointer int locc(ja+n-k)). on entry, this array contains the the local a (local output) complex*16 pointer into th on output, a is replicated across all processes in a (local input) complex*16 pointer into the local memor local pieces of the distributed matrix sub( a ). icurcol : process column containing diagonal block irsc0 : pointer to part of work used to store the rowsums whil irsr0 : pointer to part of work used to store the rowsums after icurcol : process column containing diagonal block irsc0 : pointer to part of work used to store the rowsums whil irsr0 : pointer to part of work used to store the rowsums after a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex*16 pointer into th on entry, this local array contains the local pieces of the a (local input/local output) complex*16 pointer into th containing on entry the m-by-n matrix sub( a ). on exit, a (input/output) complex*16 pointer into the loca on entry, the local pieces of the distributed symmetric v (local input) complex*16 pointer into the local memor storev = 'c', ( lld_v, locc(jv+m-1)) if storev = 'r' and x (local input/local output) complex*16, pointer into th contains the local pieces of the distributed vector sub( x ). v (input/output) complex*16 pointer into the local memor if storev = 'c', and (locr(iv+k-1),locc(jv+n-1)) if v (local input) complex*16 pointer into the local memor (lld_v, locc(jv+n-1)) if side = 'r'. it contains the local v (input/output) complex*16 pointer into the local memor the distributed matrix v contains the householder vectors. a (local input/local output) complex*16 pointer into th this array contains the local pieces of the distributed a (local output) complex*16 pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (local output) complex*16 pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the distri- a (local input) complex*16 pointer into the local memor contains the local pieces of the distributed matrix the trace a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex*16 pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex*16 pointer into th on entry, the local pieces of the triangular factor l or u. a (local input/local output) complex*16 pointer into th on entry, the local pieces of the triangular factor l or u. n (global input) pointer to intege n >= 0. a (local input/local output) complex*16 pointer int lld_a >=(bw+1) (stored in desca). a (local input/local output) complex*16 pointer int lld_a >=(bw+1) (stored in desca). a (local input) complex*16 pointer into the local memory t array contains the local pieces of the factors l or u from a (local input) complex*16 pointer into the local memory to a n-by-n hermitian positive definite distributed matrix a (local input) complex*16 pointer into the loca this array contains the local pieces of the n-by-n hermitian a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex*16 pointer int ( lld_a, locc(ja+n-1) ). a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the a (local input/local output) complex*16 pointer into th on entry, the local pieces of the triangular factor u or l a (local input) complex*16 pointer into local memory t array contains the factors l or u from the cholesky facto- d (local input/local output) complex*16 pointer to loca matrix. d (local input/local output) complex*16 pointer to loca matrix. a (local input) complex*16 pointer into the local memor contains the local pieces of the triangular distributed a (local input) complex*16 pointer into the local memor array contains the local pieces of the original triangular a (local input/local output) complex*16 pointer into th this array contains the local pieces of the triangular matrix a (local input/local output) complex*16 pointer into th on entry, this array contains the local pieces of the a (local input) complex*16 pointer into the local memor contains the local pieces of the distributed triangular a (local input/local output) complex*16 pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) complex*16 pointer into th on entry, the j-th column must contain the vector which a (local input/local output) complex*16 pointer into th on entry, the j-th column must contain the vector which a (local input/local output) complex*16 pointer into th on entry, the i-th row must contain the vector which defines a (local input/local output) complex*16 pointer into th on entry, the i-th row must contain the vector which defines a (local input/local output) complex*16 pointer into th on entry, the j-th column must contain the vector which a (local input/local output) complex*16 pointer into th on entry, the j-th column must contain the vector which a (local input/local output) complex*16 pointer into th on entry, the i-th row must contain the vector which defines a (local input/local output) complex*16 pointer into th on entry, the i-th row must contain the vector which defines a (local input) complex*16 pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) complex*16 pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) complex*16 pointer into the local memor vect='q', and (lld_a,locc(ja+nq-1)) if vect = 'p'. nq = m a (local input) complex*16 pointer into the local memor and (lld_a,locc(ja+n-1)) if side = 'r'. the vectors which a (local input) complex*16 pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) complex*16 pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) complex*16 pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) complex*16 pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) complex*16 pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) complex*16 pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) complex*16 pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) complex*16 pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) complex*16 pointer into the local memor or (lld_a,locc(ja+n-1)) if side = 'r'. the vectors which |
| pointers pointers initialize pointers initialize pointers |
| pointeur pointeur nn2 (global output) integer, the order of matrix q2, (pdlaed1). ib1 (global output) integer, pointeur on q1, (pdlaed1) nn2 (global output) integer, the order of matrix q2, (pslaed1). ib1 (global output) integer, pointeur on q1, (pslaed1) |
| points points ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th column (i.e. mycol .eq. curcol) they are the same. however, on some processors, a( lii, lij ) points to an elemen iz (global input) integer z's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning o ip (global input) integer ipiv's global row index, which points to the beginning of th ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). isplit (global input) integer array, dimension (n) the splitting points, at which t breaks up into submatrices the second of rows/columns isplit(1)+1 through isplit(2), ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). iq (global input) integer q's global row index, which points to the beginning of th id (global input) integer q's global row/col index, which points to the beginnin the permutation used to place deflated values of d at the end of the array. indxp(1:k) points to the nondeflated d-value iz (global input) integer z's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning o ip (global input) integer ipiv's global row index, which points to the beginning of th ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). isplit (global output) integer array, dimension (n) the splitting points, at which t breaks up into submatrices the second of rows/columns isplit(1)+1 through isplit(2), iq (global input) integer q's global row index, which points to the beginning of th isplit (global input) integer array, dimension (n) the splitting points, at which t breaks up into submatrices the second of rows/columns isplit(1)+1 through isplit(2), ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th column (i.e. mycol .eq. curcol) they are the same. however, on some processors, a( lii, lij ) points to an elemen ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). iq (global input) integer q's global row index, which points to the beginning of th id (global input) integer q's global row/col index, which points to the beginnin the permutation used to place deflated values of d at the end of the array. indxp(1:k) points to the nondeflated d-value iz (global input) integer z's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning o ip (global input) integer ipiv's global row index, which points to the beginning of th ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). isplit (global output) integer array, dimension (n) the splitting points, at which t breaks up into submatrices the second of rows/columns isplit(1)+1 through isplit(2), iq (global input) integer q's global row index, which points to the beginning of th isplit (global input) integer array, dimension (n) the splitting points, at which t breaks up into submatrices the second of rows/columns isplit(1)+1 through isplit(2), ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th column (i.e. mycol .eq. curcol) they are the same. however, on some processors, a( lii, lij ) points to an elemen ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning of th column (i.e. mycol .eq. curcol) they are the same. however, on some processors, a( lii, lij ) points to an elemen iz (global input) integer z's global row index, which points to the beginning of th ia (global input) integer a's global row index, which points to the beginning o ip (global input) integer ipiv's global row index, which points to the beginning of th ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). ja (global input) integer the index in the global array a that points to the start o or a submatrix of a). isplit (global input) integer array, dimension (n) the splitting points, at which t breaks up into submatrices the second of rows/columns isplit(1)+1 through isplit(2), |
| poor poor o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: of the values computed by pdlamch. this subroutine is needed because pdlamch does not compensate for poor arithmetic in the upper half o o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: of the values computed by pslamch. this subroutine is needed because pslamch does not compensate for poor arithmetic in the upper half o o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: o(nb) on each processor. if this is too small, divide and conquer is a poor choice of algorithm submatrix reference: |
| port port pcheevx assumes ieee 754 standard compliant arithmetic. to port the appropriate slmake.inc file to include the compiler switch pdsyevx assumes ieee 754 standard compliant arithmetic. to port the appropriate slmake.inc file to include the compiler switch pssyevx assumes ieee 754 standard compliant arithmetic. to port the appropriate slmake.inc file to include the compiler switch pzheevx assumes ieee 754 standard compliant arithmetic. to port the appropriate slmake.inc file to include the compiler switch |
| portion portion on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of use factorization of odd-even connection block to modify locally stored portion of right hand side(s use factorization of odd-even connection block to modify locally stored portion of right hand side(s on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of array ( nq x anb-1 ) for efficiency. since only the lower triangular portion of a is updated, av is computed as two local triangular matrix-vector multiplications (both in relatively even if each is not very small. thus it is necessary to scan the "tridiagonal portion of the matrix." i l and examines b (local input/output) complex array of size (ldb,m) if rev=0, this is the global portion of the arra if rev=1, this is the unchanged on exit. if the matrix is hermitian, we address only a triangular portion can be obtained by adding along row i and column i of the the if the matrix is symmetric, we address only a triangular portion can be obtained by adding along row i and column i of the the k (global output) integer on exit, this yields the bottom portion of the unreduce on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of use factorization of odd-even connection block to modify locally stored portion of right hand side(s use factorization of odd-even connection block to modify locally stored portion of right hand side(s on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of use factorization of odd-even connection block to modify locally stored portion of right hand side(s use factorization of odd-even connection block to modify locally stored portion of right hand side(s on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of relatively even if each is not very small. thus it is necessary to scan the "tridiagonal portion of the matrix." i l and examines b (local input/output) double precision array of size (ldb,m) if rev=0, this is the global portion of the arra if rev=1, this is the unchanged on exit. if the matrix is symmetric, we address only a triangular portion can be obtained by adding along row i and column i of the the k (global output) integer on exit, this yields the bottom portion of the unreduce on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of use factorization of odd-even connection block to modify locally stored portion of right hand side(s use factorization of odd-even connection block to modify locally stored portion of right hand side(s array ( nq x anb-1 ) for efficiency. since only the lower triangular portion of a is updated, av is computed as two local triangular matrix-vector multiplications (both in on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of use factorization of odd-even connection block to modify locally stored portion of right hand side(s use factorization of odd-even connection block to modify locally stored portion of right hand side(s on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of relatively even if each is not very small. thus it is necessary to scan the "tridiagonal portion of the matrix." i l and examines b (local input/output) real array of size (ldb,m) if rev=0, this is the global portion of the arra if rev=1, this is the unchanged on exit. if the matrix is symmetric, we address only a triangular portion can be obtained by adding along row i and column i of the the k (global output) integer on exit, this yields the bottom portion of the unreduce on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of use factorization of odd-even connection block to modify locally stored portion of right hand side(s use factorization of odd-even connection block to modify locally stored portion of right hand side(s array ( nq x anb-1 ) for efficiency. since only the lower triangular portion of a is updated, av is computed as two local triangular matrix-vector multiplications (both in on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of use factorization of odd-even connection block to modify locally stored portion of right hand side(s use factorization of odd-even connection block to modify locally stored portion of right hand side(s on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of array ( nq x anb-1 ) for efficiency. since only the lower triangular portion of a is updated, av is computed as two local triangular matrix-vector multiplications (both in relatively even if each is not very small. thus it is necessary to scan the "tridiagonal portion of the matrix." i l and examines b (local input/output) complex*16 array of size (ldb,m) if rev=0, this is the global portion of the arra if rev=1, this is the unchanged on exit. if the matrix is hermitian, we address only a triangular portion can be obtained by adding along row i and column i of the the if the matrix is symmetric, we address only a triangular portion can be obtained by adding along row i and column i of the the k (global output) integer on exit, this yields the bottom portion of the unreduce on entry, this array contains the local pieces of the this local portion is stored in the packed banded forma scalapack manual for more detail on the format of l^t a(1:n, ja:ja+n-1). this local portion is stored in the packed banded forma scalapack manual for more detail on the format of use factorization of odd-even connection block to modify locally stored portion of right hand side(s use factorization of odd-even connection block to modify locally stored portion of right hand side(s |
| position position arithmetic (b) the sign bit of a single precision floating point number is assumed be in the 32nd bit position arithmetic (b) the sign bit of a double precision floating point number is assumed be in the 32nd or 64th bit position |
| positions positions pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec pack params and positions into arrays for global consistency chec |
| positive positive u * x = b, or u**h * x = b, where l or u is the cholesky factor of a hermitian positive a = u**h*d*u or a = l*d*l**h (computed by cpttrf). l**t* x = b, or l * x = b, where l is the cholesky factor of a hermitian positive a = l*d*l**h (computed by dpttrf). if fact = 'f' and equed = 'r' or 'b', each element of r must be positive with the distributed matrix a. hermitian, and sub( b ) denoting b( ib:ib+n-1, jb:jb+n-1 ) is assumed to be hermitian positive definite notes where a(1:n, ja:ja+n-1) is an n-by-n complex banded symmetric positive definite distribute a(1:n, ja:ja+n-1) is an n-by-n complex banded symmetric positive definite distribute depending on the value of uplo, a stores either u or l in the equn pcpocon estimates the reciprocal of the condition number (in the 1-norm) of a complex hermitian positive definite distributed matri pcpotrf. pcpoequ computes row and column scalings intended to equilibrate a distributed hermitian positive definite matri (with respect to the two-norm). sr and sc contain the scale pcporfs improves the computed solution to a system of linear equations when the coefficient matrix is hermitian positive definit solutions. where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is an n-by-n hermitian distributed positive definite matrix and x and sub( b matrices. if fact = 'f' and equed = 'y', each element of sr must be positive sc (local input/local output) complex array, pcpotf2 computes the cholesky factorization of a complex hermitian positive definite distributed matrix sub( a )=a(ia:ia+n-1,ja:ja+n-1) the factorization has the form pcpotrf computes the cholesky factorization of an n-by-n complex hermitian positive definite distributed matrix sub( a ) denotin pcpotri computes the inverse of a complex hermitian positive definit cholesky factorization sub( a ) = u**h*u or l*l**h computed by where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is a n-by-n hermitian positive definite distributed matrix using the cholesk sub( b ) denotes the distributed matrix b(ib:ib+n-1,jb:jb+nrhs-1). where a(1:n, ja:ja+n-1) is an n-by-n complex tridiagonal symmetric positive definite distribute a(1:n, ja:ja+n-1) is an n-by-n complex tridiagonal symmetric positive definite distribute depending on the value of uplo, a stores either u or l in the equn if fact = 'f' and equed = 'r' or 'b', each element of r must be positive with the distributed matrix a. where a(1:n, ja:ja+n-1) is an n-by-n real banded symmetric positive definite distribute a(1:n, ja:ja+n-1) is an n-by-n real banded symmetric positive definite distribute depending on the value of uplo, a stores either u or l in the equn pdpocon estimates the reciprocal of the condition number (in the 1-norm) of a real symmetric positive definite distributed matri pdpotrf. pdpoequ computes row and column scalings intended to equilibrate a distributed symmetric positive definite matri (with respect to the two-norm). sr and sc contain the scale pdporfs improves the computed solution to a system of linear equations when the coefficient matrix is symmetric positive definit solutions. where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is an n-by-n symmetric distributed positive definite matrix and x and sub( b matrices. if fact = 'f' and equed = 'y', each element of sr must be positive sc (local input/local output) double precision array, pdpotf2 computes the cholesky factorization of a real symmetric positive definite distributed matrix sub( a )=a(ia:ia+n-1,ja:ja+n-1) the factorization has the form pdpotrf computes the cholesky factorization of an n-by-n real symmetric positive definite distributed matrix sub( a ) denotin pdpotri computes the inverse of a real symmetric positive definit cholesky factorization sub( a ) = u**t*u or l*l**t computed by where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is a n-by-n symmetric positive definite distributed matrix using the cholesk sub( b ) denotes the distributed matrix b(ib:ib+n-1,jb:jb+nrhs-1). where a(1:n, ja:ja+n-1) is an n-by-n real tridiagonal symmetric positive definite distribute a(1:n, ja:ja+n-1) is an n-by-n real tridiagonal symmetric positive definite distribute sy, and sub( b ) denoting b( ib:ib+n-1, jb:jb+n-1 ) is assumed to be symmetric positive definite notes if fact = 'f' and equed = 'r' or 'b', each element of r must be positive with the distributed matrix a. where a(1:n, ja:ja+n-1) is an n-by-n real banded symmetric positive definite distribute a(1:n, ja:ja+n-1) is an n-by-n real banded symmetric positive definite distribute depending on the value of uplo, a stores either u or l in the equn pspocon estimates the reciprocal of the condition number (in the 1-norm) of a real symmetric positive definite distributed matri pspotrf. pspoequ computes row and column scalings intended to equilibrate a distributed symmetric positive definite matri (with respect to the two-norm). sr and sc contain the scale psporfs improves the computed solution to a system of linear equations when the coefficient matrix is symmetric positive definit solutions. where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is an n-by-n symmetric distributed positive definite matrix and x and sub( b matrices. if fact = 'f' and equed = 'y', each element of sr must be positive sc (local input/local output) real array, pspotf2 computes the cholesky factorization of a real symmetric positive definite distributed matrix sub( a )=a(ia:ia+n-1,ja:ja+n-1) the factorization has the form pspotrf computes the cholesky factorization of an n-by-n real symmetric positive definite distributed matrix sub( a ) denotin pspotri computes the inverse of a real symmetric positive definit cholesky factorization sub( a ) = u**t*u or l*l**t computed by where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is a n-by-n symmetric positive definite distributed matrix using the cholesk sub( b ) denotes the distributed matrix b(ib:ib+n-1,jb:jb+nrhs-1). where a(1:n, ja:ja+n-1) is an n-by-n real tridiagonal symmetric positive definite distribute a(1:n, ja:ja+n-1) is an n-by-n real tridiagonal symmetric positive definite distribute sy, and sub( b ) denoting b( ib:ib+n-1, jb:jb+n-1 ) is assumed to be symmetric positive definite notes if fact = 'f' and equed = 'r' or 'b', each element of r must be positive with the distributed matrix a. hermitian, and sub( b ) denoting b( ib:ib+n-1, jb:jb+n-1 ) is assumed to be hermitian positive definite notes where a(1:n, ja:ja+n-1) is an n-by-n complex banded symmetric positive definite distribute a(1:n, ja:ja+n-1) is an n-by-n complex banded symmetric positive definite distribute depending on the value of uplo, a stores either u or l in the equn pzpocon estimates the reciprocal of the condition number (in the 1-norm) of a complex hermitian positive definite distributed matri pzpotrf. pzpoequ computes row and column scalings intended to equilibrate a distributed hermitian positive definite matri (with respect to the two-norm). sr and sc contain the scale pzporfs improves the computed solution to a system of linear equations when the coefficient matrix is hermitian positive definit solutions. where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is an n-by-n hermitian distributed positive definite matrix and x and sub( b matrices. if fact = 'f' and equed = 'y', each element of sr must be positive sc (local input/local output) complex*16 array, pzpotf2 computes the cholesky factorization of a complex hermitian positive definite distributed matrix sub( a )=a(ia:ia+n-1,ja:ja+n-1) the factorization has the form pzpotrf computes the cholesky factorization of an n-by-n complex hermitian positive definite distributed matrix sub( a ) denotin pzpotri computes the inverse of a complex hermitian positive definit cholesky factorization sub( a ) = u**h*u or l*l**h computed by where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is a n-by-n hermitian positive definite distributed matrix using the cholesk sub( b ) denotes the distributed matrix b(ib:ib+n-1,jb:jb+nrhs-1). where a(1:n, ja:ja+n-1) is an n-by-n complex tridiagonal symmetric positive definite distribute a(1:n, ja:ja+n-1) is an n-by-n complex tridiagonal symmetric positive definite distribute depending on the value of uplo, a stores either u or l in the equn l**t* x = b, or l * x = b, where l is the cholesky factor of a hermitian positive a = l*d*l**h (computed by spttrf). u * x = b, or u**h * x = b, where l or u is the cholesky factor of a hermitian positive a = u**h*d*u or a = l*d*l**h (computed by zpttrf). |
| possible possible quick return if possible quick return if possible quick return if possible quick return if possible quick return if possible quick return if possible quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, only spot checks of the consistency of the eigenvalues across the different processes. because of this, it is possible that messages. set a subdiagonal to zero now if it's possible quick return if possible quick return if possible quick return if possible quick return if possible quick return if possible quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible the diagonal. this choice of sr and sc puts the condition number of b within a factor n of the smallest possible condition numbe note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible substitution. it is the hope that scaling would be used to make the the code robust against possible overflow. but scaling has not ye the triangular systems. pclattrs just calls pctrsv. note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, set a subdiagonal to zero now if it's possible h11 = smalla(1,1,ki) quick return if possible quick return if possible quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible the diagonal. this choice of sr and sc puts the condition number of b within a factor n of the smallest possible condition numbe note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible no checks for consistency of the eigenvalues or eigenvectors across the different processes. because of this, it is possible that messages. no checks for consistency of the eigenvalues or eigenvectors across the different processes. because of this, it is possible that messages. note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, set a subdiagonal to zero now if it's possible h11 = smalla(1,1,ki) quick return if possible quick return if possible quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible the diagonal. this choice of sr and sc puts the condition number of b within a factor n of the smallest possible condition numbe note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible no checks for consistency of the eigenvalues or eigenvectors across the different processes. because of this, it is possible that messages. no checks for consistency of the eigenvalues or eigenvectors across the different processes. because of this, it is possible that messages. note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, only spot checks of the consistency of the eigenvalues across the different processes. because of this, it is possible that messages. set a subdiagonal to zero now if it's possible quick return if possible quick return if possible quick return if possible quick return if possible quick return if possible quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible the diagonal. this choice of sr and sc puts the condition number of b within a factor n of the smallest possible condition numbe note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible note that a consequence of this chart is that it is not possible to opposite requirements for the orientation of the blacs grid, quick return if possible substitution. it is the hope that scaling would be used to make the the code robust against possible overflow. but scaling has not ye the triangular systems. pzlattrs just calls pztrsv. quick return if possible quick return if possible quick return if possible quick return if possible quick return if possible quick return if possible quick return if possible |
| post post multiplied by diag(r(ia:ia+m-1)), = 'c': column equilibration, i.e., sub( a ) has been post = 'b': both row and column equilibration, i.e., sub( a ) multiplied by diag(r(ia:ia+m-1)), = 'c': column equilibration, i.e., sub( a ) has been post = 'b': both row and column equilibration, i.e., sub( a ) multiplied by diag(r(ia:ia+m-1)), = 'c': column equilibration, i.e., sub( a ) has been post = 'b': both row and column equilibration, i.e., sub( a ) multiplied by diag(r(ia:ia+m-1)), = 'c': column equilibration, i.e., sub( a ) has been post = 'b': both row and column equilibration, i.e., sub( a ) |
| postmultiplied postmultiplied = 'c': column equilibration, i.e., a(ia:ia+n-1,ja:ja+n-1) has been postmultiplied by diag(c) a(ia:ia+n-1,ja:ja+n-1) has been replaced by = 'c': column equilibration, i.e., a(ia:ia+n-1,ja:ja+n-1) has been postmultiplied by diag(c) a(ia:ia+n-1,ja:ja+n-1) has been replaced by = 'c': column equilibration, i.e., a(ia:ia+n-1,ja:ja+n-1) has been postmultiplied by diag(c) a(ia:ia+n-1,ja:ja+n-1) has been replaced by = 'c': column equilibration, i.e., a(ia:ia+n-1,ja:ja+n-1) has been postmultiplied by diag(c) a(ia:ia+n-1,ja:ja+n-1) has been replaced by |
| potentially potentially this node will potentially do more work late if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n pclaevswp moves the eigenvectors (potentially unsorted) fro array, sorted so that the corresponding eigenvalues are sorted. this node will potentially do more work late pdlaevswp moves the eigenvectors (potentially unsorted) fro array, sorted so that the corresponding eigenvalues are sorted. if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n this node will potentially do more work late pslaevswp moves the eigenvectors (potentially unsorted) fro array, sorted so that the corresponding eigenvalues are sorted. if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n this node will potentially do more work late if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n if you want to guarantee orthogonality (at the cost of potentially poor performance) you should ad (clustersize-1)*n pzlaevswp moves the eigenvectors (potentially unsorted) fro array, sorted so that the corresponding eigenvalues are sorted. |
| PpB0 PpB0 lwork >= max( mb_a * ( mpa0 + nqa0 + mb_a ), max( (mb_a*(mb_a-1))/2, (PpB0 + nqb0)*mb_a ) nb_b * ( ppb0 + nqb0 + nb_b ) ), where lwork >= max( mb_a * ( mpa0 + nqa0 + mb_a ), max( (mb_a*(mb_a-1))/2, (PpB0 + nqb0)*mb_a ) nb_b * ( ppb0 + nqb0 + nb_b ) ), where lwork >= max( mb_a * ( mpa0 + nqa0 + mb_a ), max( (mb_a*(mb_a-1))/2, (PpB0 + nqb0)*mb_a ) nb_b * ( ppb0 + nqb0 + nb_b ) ), where lwork >= max( mb_a * ( mpa0 + nqa0 + mb_a ), max( (mb_a*(mb_a-1))/2, (PpB0 + nqb0)*mb_a ) nb_b * ( ppb0 + nqb0 + nb_b ) ), where |
| PqB0 PqB0 lwork >= max( nb_a * ( npa0 + mqa0 + nb_a ), max( (nb_a*(nb_a-1))/2, (PqB0 + npb0)*nb_a ) mb_b * ( npb0 + pqb0 + mb_b ) ), where lwork >= max( nb_a * ( npa0 + mqa0 + nb_a ), max( (nb_a*(nb_a-1))/2, (PqB0 + npb0)*nb_a ) mb_b * ( npb0 + pqb0 + mb_b ) ), where lwork >= max( nb_a * ( npa0 + mqa0 + nb_a ), max( (nb_a*(nb_a-1))/2, (PqB0 + npb0)*nb_a ) mb_b * ( npb0 + pqb0 + mb_b ) ), where lwork >= max( nb_a * ( npa0 + mqa0 + nb_a ), max( (nb_a*(nb_a-1))/2, (PqB0 + npb0)*nb_a ) mb_b * ( npb0 + pqb0 + mb_b ) ), where |
| practice practice factors is not guaranteed to reduce the condition number of sub( a ) but works well in practice notes factors is not guaranteed to reduce the condition number of sub( a ) but works well in practice notes factors is not guaranteed to reduce the condition number of sub( a ) but works well in practice notes factors is not guaranteed to reduce the condition number of sub( a ) but works well in practice notes |
| Pre Pre watobd = max(max(wpclange,wpcgebrd), max(wpclared2d,wp(Pre)lared1d)) where wpclange, wpclared1d, wpclared2d, wpcgebrd are the = 'n': no equilibration = 'r': row equilibration, i.e., sub( a ) has been Pre = 'c': column equilibration, i.e., sub( a ) has been post- Pre-calculate bw^ Pre-calculate bw^ a (local input/workspace) block cyclic double Precisio global dimension (m, n), local dimension (mp, nq) a (local input/local output) double Precision pointer into th containing on entry the m-by-n matrix sub( a ). on exit, Pre-calculate bw^ Pre-calculate bw^ watobd = max(max(wpslange,wpsgebrd), max(wpslared2d,wp(Pre)lared1d)) where wpslange, wpslared1d, wpslared2d, wpsgebrd are the = 'n': no equilibration = 'r': row equilibration, i.e., sub( a ) has been Pre = 'c': column equilibration, i.e., sub( a ) has been post- Pre-calculate bw^ Pre-calculate bw^ s (global output) double Precision array, dimension siz r (local input) double Precision array, dimension locr(m_a distributed matrix a, and replicated across every process Pre-calculate bw^ Pre-calculate bw^ |
| prec prec pdlamch determines double precision machine parameters arguments pslamch determines single precision machine parameters arguments |
| precisely precisely the entire submatrix that is copied gets placed on one node or more. the receiving node can be specified precisely, or all node the entire submatrix that is copied gets placed on one node or more. the receiving node can be specified precisely, or all node the entire submatrix that is copied gets placed on one node or more. the receiving node can be specified precisely, or all node the entire submatrix that is copied gets placed on one node or more. the receiving node can be specified precisely, or all node |
| PRECISION PRECISION ulp (local input) real on entry, machine PRECISION ab (input/output) double PRECISION array, dimension (ldab,n 2*kl+ku+1; rows 1 to kl of the array need not be set. s (local input/output) double PRECISION array, (lds,* referenced. it is assumed that s has jblk double shifts a (global input/output) double PRECISION array, (lda,* the updated matrix on exit. s (local input/output) double PRECISION array, dimension ld on exit, the diagonal blocks of s have been rewritten to pair t - double PRECISION array of dimension ( ldt, n ) upper triangular part of the array t must contain the upper of the matrix a. if the reciprocal of the condition number is less than machine PRECISION, steps 4-6 are skipped 4. the system of equations is solved for x using the factored form where eps is the machine PRECISION. if abstol is less tha where norm(t) is the 1-norm of the tridiagonal matrix where eps is the machine PRECISION. if abstol is less tha where norm(t) is the 1-norm of the tridiagonal matrix of the matrix a. if the reciprocal of the condition number is less than machine PRECISION, steps 4-6 are skipped 4. the system of equations is solved for x using the factored form a (local input/local output) double PRECISION pointer int lld_a >=(bwl+bwu+1) (stored in desca). a (local input/local output) double PRECISION pointer int lld_a >=(bwl+bwu+1) (stored in desca). dl (local input/local output) double PRECISION pointer to loca matrix. globally, dl(1) is not referenced, and dl must be dl (local input/local output) double PRECISION pointer to loca matrix. globally, dl(1) is not referenced, and dl must be a (local input/local output) double PRECISION pointer int lld_a >=(2*bwl+2*bwu+1) (stored in desca). a (local input/local output) double PRECISION pointer int lld_a >=(2*bwl+2*bwu+1) (stored in desca). a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the a (local input) double PRECISION pointer into the local memor this array contains the local pieces of the factors l and u a (local input) double PRECISION pointer into the local memor local pieces of the m-by-n distributed matrix whose a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the n-by-n a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the n-by-n a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double PRECISION pointer into th ( lld_a, locc(ja+n-1) ). on entry, the m-by-n matrix a. a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input) double PRECISION pointer into the loca this array contains the local pieces of the distributed a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the n-by-n distributed matrix a (local input/workspace) block cyclic double PRECISION global dimension (m, n), local dimension (mp, nq) of the matrix a. if the reciprocal of the condition number is less than machine PRECISION, steps 4-6 are skipped 4. the system of equations is solved for x using the factored form a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the m-by-n a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the m-by-n a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the l and u obtained by the a (local input) double PRECISION pointer into the loca on entry, this array contains the local pieces of the factors a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the n-by-m distributed matrix a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the m-by-n distributed matrix small (local input/local output) double PRECISION on exit, if log10(large) is sufficiently large, the square a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the v (local workspace) double PRECISION pointer into the loca the final return, v = a*w, where est = norm(v)/norm(w) a (global input) double PRECISION array, dimensio on entry, the hessenberg matrix whose tridiagonal part is a (local input) double PRECISION pointer into the local memor contains the local pieces of the distributed matrix sub( a ) a (global input/output) double PRECISION array, dimensio on entry, the parallel matrix to be copied into or from. a (local input) double PRECISION pointer into the local memor contains the local pieces of the distributed matrix sub( a ) abstol (input) double PRECISION is narrower than abstol, or than reltol times the larger (in intvl (input/output) double PRECISION array, dimension (2*(kl-kf) oendpoint f the j-th interval, and intvl(2*j) is the right d (global input/output) double PRECISION array, dimension (n on exit, if info = 0, the eigenvalues in descending order. d (global input/output) double PRECISION array, dimension (n on exit, the eigenvalues of the repaired matrix. d (input/output) double PRECISION array, dimension (n be combined. d (input/output) double PRECISION array, dimension (n be combined. zin (local input) double PRECISION array the eigenvectors on input. each eigenvector resides entirely a (local input/local output) double PRECISION pointer int locc(ja+n-k)). on entry, this array contains the the local pdlamch determines double PRECISION machine parameters arguments a (local input) double PRECISION pointer into the local memor local pieces of the distributed matrix sub( a ). sigma (input) double PRECISION than or equal to sigma. a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the a (local input/local output) double PRECISION pointer into th on entry, this local array contains the local pieces of the a (local input/local output) double PRECISION pointer into th containing on entry the m-by-n matrix sub( a ). on exit, a (input/output) double PRECISION pointer into the loca on entry, the local pieces of the distributed symmetric bycol (local input) distributed block cyclic double PRECISION arra bycol is distributed across the process rows byrow (local input) distributed block cyclic double PRECISION arra byrow is distributed across the process columns v (local input) double PRECISION pointer into the local memor storev = 'c', ( lld_v, locc(jv+m-1)) if storev = 'r' and alpha (local output) double PRECISION vector sub( x ). v (input/output) double PRECISION pointer into the local memor if storev = 'c', and (locr(iv+k-1),locc(jv+n-1)) if v (local input) double PRECISION pointer into the local memor (lld_v, locc(jv+n-1)) if side = 'r'. it contains the local v (input/output) double PRECISION pointer into the local memor the distributed matrix v contains the householder vectors. cfrom (global input) double PRECISION the distributed matrix sub( a ) is multiplied by cto/cfrom. alpha (global input) double PRECISION set. alpha (global input) double PRECISION set. a (global input) double PRECISION array, dimensio on entry, the hessenberg matrix whose tridiagonal part is d (global input/output) double PRECISION array, dimmension (n x (input) double PRECISION x( i ) = x(ix+(jx-1)*m_x +(i-1)*incx ), 1 <= i <= n. a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the distri- a (local input) double PRECISION pointer into the local memor contains the local pieces of the distributed matrix the trace a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the m-by-n distributed matrix a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the triangular factor l or u. a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the triangular factor l or u. a (global input) double PRECISION array, dimensio on entry, the hessenberg matrix. a (local input/local output) double PRECISION pointer into th on entry, the j-th column must contain the vector which a (local input/local output) double PRECISION pointer into th on entry, the j-th column must contain the vector which a (local input/local output) double PRECISION pointer into th on entry, the i-th row must contain the vector which defines a (local input/local output) double PRECISION pointer into th on entry, the i-th row must contain the vector which defines a (local input/local output) double PRECISION pointer into th on entry, the j-th column must contain the vector which a (local input/local output) double PRECISION pointer into th on entry, the j-th column must contain the vector which a (local input/local output) double PRECISION pointer into th on entry, the i-th row must contain the vector which defines a (local input/local output) double PRECISION pointer into th on entry, the i-th row must contain the vector which defines a (local input) double PRECISION pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) double PRECISION pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) double PRECISION pointer into the local memor vect='q', and (lld_a,locc(ja+nq-1)) if vect = 'p'. nq = m a (local input) double PRECISION pointer into the local memor and (lld_a,locc(ja+n-1)) if side = 'r'. the vectors which a (local input) double PRECISION pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) double PRECISION pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) double PRECISION pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) double PRECISION pointer into the local memor j-th column must contain the vector which defines the elemen- a (local input) double PRECISION pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) double PRECISION pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) double PRECISION pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) double PRECISION pointer into the local memor and (lld_a,locc(ja+n-1)) if side='r', where a (local input) double PRECISION pointer into the local memor or (lld_a,locc(ja+n-1)) if side = 'r'. the vectors which a (local input/local output) double PRECISION pointer int lld_a >=(bw+1) (stored in desca). a (local input/local output) double PRECISION pointer int lld_a >=(bw+1) (stored in desca). a (local input) double PRECISION pointer into the local memor this array contains the local pieces of the factors l or u a (local input) double PRECISION pointer into the local memor n-by-n symmetric positive definite distributed matrix a (local input) double PRECISION pointer into the loca this array contains the local pieces of the n-by-n symmetric a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the of the matrix a. if the reciprocal of the condition number is less than machine PRECISION, steps 4-6 are skipped 4. the system of equations is solved for x using the factored form a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the triangular factor u or l a (local input) double PRECISION pointer into local memory t array contains the factors l or u from the cholesky facto- d (local input/local output) double PRECISION pointer to loca matrix. d (local input/local output) double PRECISION pointer to loca matrix. sa (global input) double PRECISION sub( x ). sa must be >= 0, or the subroutine will divide by are needed for the "fast" sturm count are : (a) infinity arithmetic (b) the sign bit of a single PRECISION floatin (c) the sign of negative zero. d (global input/output) double PRECISION array, dimension (n on exit, if info = 0, the eigenvalues in descending order. d (global input) double PRECISION array, dimension (n a (local input/workspace) block cyclic double PRECISION array locc(ja+n-1) ) a (local input/workspace) block cyclic double PRECISION array locc(ja+n-1) ) a (local input/workspace) block cyclic double PRECISION array local dimension ( lld_a, locc(ja+n-1) ) a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the a (local input) double PRECISION pointer into the local memor contains the local pieces of the triangular distributed a (local input) double PRECISION pointer into the local memor array contains the local pieces of the original triangular a (local input/local output) double PRECISION pointer into th this array contains the local pieces of the triangular matrix a (local input/local output) double PRECISION pointer into th on entry, this array contains the local pieces of the a (local input) double PRECISION pointer into the local memor contains the local pieces of the distributed triangular a (local input/local output) double PRECISION pointer into th on entry, the local pieces of the m-by-n distributed matrix asum (local output) pointer to double PRECISION only in its scope. of the matrix a. if the reciprocal of the condition number is less than machine PRECISION, steps 4-6 are skipped 4. the system of equations is solved for x using the factored form pslamch determines single PRECISION machine parameters arguments of the matrix a. if the reciprocal of the condition number is less than machine PRECISION, steps 4-6 are skipped 4. the system of equations is solved for x using the factored form are needed for the "fast" sturm count are : (a) infinity arithmetic (b) the sign bit of a double PRECISION floatin (c) the sign of negative zero. a (local input/workspace) block cyclic double PRECISION array locc(ja+n-1) ) where eps is the machine PRECISION. if abstol is less tha where norm(t) is the 1-norm of the tridiagonal matrix where eps is the machine PRECISION. if abstol is less tha where norm(t) is the 1-norm of the tridiagonal matrix sa (global input) double PRECISION sub( x ). sa must be >= 0, or the subroutine will divide by d (local output) double PRECISION array, dimensio the distributed diagonal elements of the bidiagonal matrix d (local output) double PRECISION array, dimensio the distributed diagonal elements of the bidiagonal matrix anorm (global input) double PRECISION matrix a(ia:ia+n-1,ja:ja+n-1). r (local output) double PRECISION array, dimension locr(m_a scale factors for sub( a ). r is aligned with the distributed rwork (local workspace/local output) double PRECISION array on exit, rwork(1) returns the minimal and optimal lrwork. ferr (local output) double PRECISION array of local dimensio the estimated forward error bound for each solution vector s (global output) double PRECISION array, dimension siz of the matrix a. if the reciprocal of the condition number is less than machine PRECISION, steps 4-6 are skipped 4. the system of equations is solved for x using the factored form w (global output) double PRECISION array, dimension (n eigenvalues in ascending order. w (global output) double PRECISION array, dimension (n vl (global input) double PRECISION for eigenvalues. not referenced if range = 'a' or 'i'. scale (global output) double PRECISION compensate for the scaling performed in this routine. vl (global input) double PRECISION for eigenvalues. not referenced if range = 'a' or 'i'. scale (global output) double PRECISION compensate for the scaling performed in this routine. d (local output) double PRECISION array, dimension locc(ja+n-1 d(i) = a(i,i). d is tied to the distributed matrix a. d (local output) double PRECISION array, dimension locc(ja+n-1 d(i) = a(i,i). d is tied to the distributed matrix a. d (local output) double PRECISION array, dimension locc(ja+n-1 d(i) = a(i,i). d is tied to the distributed matrix a. d (local output) double PRECISION array, dim locq(ja+n-1 d(i) = a(i,i). d is tied to the distributed matrix a. d (local output) double PRECISION array, dimensio the distributed diagonal elements of the bidiagonal matrix est (global output) double PRECISION zin (local input) double PRECISION array the eigenvectors on input. each eigenvector resides entirely work (local workspace) double PRECISION array dimension (lwork nq0 if norm = '1', 'o' or 'o', r (local input) double PRECISION array, dimension locr(m_a distributed matrix a, and replicated across every process sr (local input) double PRECISION array, dimension locr(m_a with the distributed matrix a, and replicated across every cfrom (global input) double PRECISION the distributed matrix sub( a ) is multiplied by cto/cfrom. smlnum (global input) double PRECISION unchanged on exit. scale (local input/local output) double PRECISION on exit, scale is overwritten with scl , the scaling factor d (local output) double PRECISION array, dimension locc(ja+n-1 d(i) = a(i,i). d is tied to the distributed matrix a. amax (global output) pointer to double PRECISION vector sub( x ) only in the scope of sub( x ). anorm (global input) double PRECISION matrix a(ia:ia+n-1,ja:ja+n-1). sr (local output) double PRECISION array, dimension locr(m_a for sub( a ). sr is aligned with the distributed matrix a, ferr (local output) double PRECISION array of local dimensio the estimated forward error bound for each solution vector of the matrix a. if the reciprocal of the condition number is less than machine PRECISION, steps 4-6 are skipped 4. the system of equations is solved for x using the factored form d (global input) double PRECISION array, dimension (n rcond (global output) double PRECISION matrix a(ia:ia+n-1,ja:ja+n-1), computed as rwork (local workspace) double PRECISION array ferr (local output) double PRECISION array of local dimensio each solution vector of sub( x ). if xtrue is the true ulp (local input) real on entry, machine PRECISION ulp (local input) double PRECISION unchanged on exit. cs (output) double PRECISION parameters of the rotation matrix. |
| premultiplied premultiplied 6. if fact = 'e' and equilibration was used, the matrix x is premultiplied by diag(c) (if trans = 'n') or diag(r) (i before equilibration. 6. if equilibration was used, the matrix x is premultiplied b equilibration. 6. if fact = 'e' and equilibration was used, the matrix x is premultiplied by diag(c) (if trans = 'n') or diag(r) (i before equilibration. 6. if equilibration was used, the matrix x is premultiplied b equilibration. 6. if fact = 'e' and equilibration was used, the matrix x is premultiplied by diag(c) (if trans = 'n') or diag(r) (i before equilibration. 6. if equilibration was used, the matrix x is premultiplied b equilibration. 6. if fact = 'e' and equilibration was used, the matrix x is premultiplied by diag(c) (if trans = 'n') or diag(r) (i before equilibration. 6. if equilibration was used, the matrix x is premultiplied b equilibration. |
| preparation preparation apply factorization to lower connection block bl_i conjugate transpose the connection block in preparation move the connection block in preparation. reduced system has been solved, communicate solutions to nearest neighbors in preparation for local computation phase reduced system has been solved, communicate solutions to nearest neighbors in preparation for local computation phase conjugate transpose the connection block in preparation reduced system has been solved, communicate solutions to nearest neighbors in preparation for local computation phase reduced system has been solved, communicate solutions to nearest neighbors in preparation for local computation phase apply factorization to lower connection block bl_i transpose the connection block in preparation move the connection block in preparation. reduced system has been solved, communicate solutions to nearest neighbors in preparation for local computation phase reduced system has been solved, communicate solutions to nearest neighbors in preparation for local computation phase transpose the connection block in preparation reduced system has been solved, communicate solutions to nearest neighbors in preparation for local computation phase reduced system has been solved, communicate solutions to nearest neighbors in preparation for local computation phase apply factorization to lower connection block bl_i transpose the connection block in preparation move the connection block in preparation. reduced system has been solved, communicate solutions to nearest neighbors in preparation for local computation phase reduced system has been solved, communicate solutions to nearest neighbors in preparation for local computation phase transpose the connection block in preparation reduced system has been solved, communicate solutions to nearest neighbors in preparation for local computation phase reduced system has been solved, communicate solutions to nearest neighbors in preparation for local computation phase apply factorization to lower connection block bl_i conjugate transpose the connection block in preparation move the connection block in preparation. reduced system has been solved, communicate solutions to nearest neighbors in preparation for local computation phase reduced system has been solved, communicate solutions to nearest neighbors in preparation for local computation phase conjugate transpose the connection block in preparation reduced system has been solved, communicate solutions to nearest neighbors in preparation for local computation phase reduced system has been solved, communicate solutions to nearest neighbors in preparation for local computation phase |
| Prepare Prepare Prepare to use wilkinson's shift Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul copy submatrix of size 2*jblk and Prepare to do generalize Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul copy submatrix of size 2*jblk and Prepare to do generalize Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul copy submatrix of size 2*jblk and Prepare to do generalize Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul copy submatrix of size 2*jblk and Prepare to do generalize Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare output: set info = 0 if no error, and divide by descmul Prepare to use wilkinson's shift |
| present present in its present form, pcheev assumes a homogeneous system and make different processes. because of this, it is possible that a compensate for the scaling performed in this routine. at present, scale is always returned as 1.0, it i compensate for the scaling performed in this routine. at present, scale is always returned as 1.0, it i the tailored codes place no restrictions on ia, ja, mb or nb. at present, ia, ja, mb and nb are restricted to those values allowe documented below. (search for "restrictions".) in its present form, pdsyev assumes a homogeneous system and make the different processes. because of this, it is possible that a in its present form, pdsyevd assumes a homogeneous system and make the different processes. because of this, it is possible that a compensate for the scaling performed in this routine. at present, scale is always returned as 1.0, it i compensate for the scaling performed in this routine. at present, scale is always returned as 1.0, it i the tailored codes place no restrictions on ia, ja, mb or nb. at present, ia, ja, mb and nb are restricted to those values allowe documented below. (search for "restrictions".) at present, only n1 is used, and it (n1) is used only fo in its present form, pssyev assumes a homogeneous system and make the different processes. because of this, it is possible that a in its present form, pssyevd assumes a homogeneous system and make the different processes. because of this, it is possible that a compensate for the scaling performed in this routine. at present, scale is always returned as 1.0, it i compensate for the scaling performed in this routine. at present, scale is always returned as 1.0, it i the tailored codes place no restrictions on ia, ja, mb or nb. at present, ia, ja, mb and nb are restricted to those values allowe documented below. (search for "restrictions".) in its present form, pzheev assumes a homogeneous system and make different processes. because of this, it is possible that a compensate for the scaling performed in this routine. at present, scale is always returned as 1.0, it i compensate for the scaling performed in this routine. at present, scale is always returned as 1.0, it i the tailored codes place no restrictions on ia, ja, mb or nb. at present, ia, ja, mb and nb are restricted to those values allowe documented below. (search for "restrictions".) |
| prevented prevented iclustr. if (mod(info/4,2).ne.0), then space limit prevented between vl and vu. the number of eigenvectors iclustr. if (mod(info/4,2).ne.0), then space limit prevented between vl and vu. the number of eigenvectors iclustr. if (mod(info/4,2).ne.0), then space limit prevented between vl and vu. the number of eigenvectors iclustr. if (mod(info/4,2).ne.0), then space limit prevented between vl and vu. the number of eigenvectors iclustr. if (mod(info/4,2).ne.0), then space limit prevented between vl and vu. the number of eigenvectors iclustr. if (mod(info/4,2).ne.0), then space limit prevented between vl and vu. the number of eigenvectors iclustr. if (mod(info/4,2).ne.0), then space limit prevented between vl and vu. the number of eigenvectors iclustr. if (mod(info/4,2).ne.0), then space limit prevented between vl and vu. the number of eigenvectors |
| prevents prevents ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ===================================================================== ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ===================================================================== ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication ja = ib alignment restriction that prevents unnecessary communication |
| previous previous receive previously transmitted matrix section, which form the "spike" fillin. use the "spike" fillin to calculate contribution to previous
calculate the update block for previous proc, e_i = gl_i{gu_i
use the "spike" fillin to calculate contribution to previous
receive triangle b_{i-1} from previous processo
h = h( liip1:n, bindex ) and bindex = 0 indeed, the previous loop invariant as stated above for th are null matrices. receive previously transmitted matrix section, which form the "spike" fillin. use the "spike" fillin to calculate contribution to previous receive previously transmitted matrix section, which form the "spike" fillin. use the "spike" fillin to calculate contribution to previous ihi (global input) integer ilo and ihi must have the same values as in the previous cal distributed submatrix q(ia+ilo:ia+ihi-1,ia+ilo:ja+ihi-1). receive previously transmitted matrix section, which form the "spike" fillin. use the "spike" fillin to calculate contribution to previous
calculate the update block for previous proc, e_i = gl_i{gu_i
use the "spike" fillin to calculate contribution to previous
receive triangle b_{i-1} from previous processo
ihi (global input) integer ilo and ihi must have the same values as in the previous cal distributed submatrix q(ia+ilo:ia+ihi-1,ia+ilo:ja+ihi-1). receive previously transmitted matrix section, which form the "spike" fillin. use the "spike" fillin to calculate contribution to previous receive previously transmitted matrix section, which form the "spike" fillin. use the "spike" fillin to calculate contribution to previous h = h( liip1:n, bindex ) and bindex = 0 indeed, the previous loop invariant as stated above for th are null matrices. receive previously transmitted matrix section, which form the "spike" fillin. use the "spike" fillin to calculate contribution to previous
calculate the update block for previous proc, e_i = gl_i{gu_i
use the "spike" fillin to calculate contribution to previous
receive triangle b_{i-1} from previous processo
ihi (global input) integer ilo and ihi must have the same values as in the previous cal distributed submatrix q(ia+ilo:ia+ihi-1,ia+ilo:ja+ihi-1). receive previously transmitted matrix section, which form the "spike" fillin. use the "spike" fillin to calculate contribution to previous receive previously transmitted matrix section, which form the "spike" fillin. use the "spike" fillin to calculate contribution to previous h = h( liip1:n, bindex ) and bindex = 0 indeed, the previous loop invariant as stated above for th are null matrices. receive previously transmitted matrix section, which form the "spike" fillin. use the "spike" fillin to calculate contribution to previous
calculate the update block for previous proc, e_i = gl_i{gu_i
use the "spike" fillin to calculate contribution to previous
receive triangle b_{i-1} from previous processo
h = h( liip1:n, bindex ) and bindex = 0 indeed, the previous loop invariant as stated above for th are null matrices. receive previously transmitted matrix section, which form the "spike" fillin. use the "spike" fillin to calculate contribution to previous receive previously transmitted matrix section, which form the "spike" fillin. use the "spike" fillin to calculate contribution to previous ihi (global input) integer ilo and ihi must have the same values as in the previous cal distributed submatrix q(ia+ilo:ia+ihi-1,ia+ilo:ja+ihi-1). |
| previously previously receive previously transmitted matrix section, which form the "spike" fillin. sub( b ) must have been previously factorized as u**h*u or l*l**h b sub( b ) must have been previously factorized as u**h*u or l*l**h b sub( b ) must have been previously factorized as u**h*u or l*l**h b receive previously transmitted matrix section, which form the "spike" fillin. receive previously transmitted matrix section, which form the "spike" fillin. receive previously transmitted matrix section, which form the "spike" fillin. receive previously transmitted matrix section, which form the "spike" fillin. receive previously transmitted matrix section, which form the "spike" fillin. where: lwork, as defined previously, depends upon the numbe nsytrd_lwopt = n + 2*( anb+1 )*( 4*nps+2 ) + sub( b ) must have been previously factorized as u**t*u or l*l**t b sub( b ) must have been previously factorized as u**t*u or l*l**t b where: lwork, as defined previously, depends upon the numbe nsytrd_lwopt = n + 2*( anb+1 )*( 4*nps+2 ) + sub( b ) must have been previously factorized as u**h*u or l*l**h b receive previously transmitted matrix section, which form the "spike" fillin. receive previously transmitted matrix section, which form the "spike" fillin. receive previously transmitted matrix section, which form the "spike" fillin. where: lwork, as defined previously, depends upon the numbe nsytrd_lwopt = n + 2*( anb+1 )*( 4*nps+2 ) + sub( b ) must have been previously factorized as u**t*u or l*l**t b sub( b ) must have been previously factorized as u**t*u or l*l**t b where: lwork, as defined previously, depends upon the numbe nsytrd_lwopt = n + 2*( anb+1 )*( 4*nps+2 ) + sub( b ) must have been previously factorized as u**h*u or l*l**h b receive previously transmitted matrix section, which form the "spike" fillin. sub( b ) must have been previously factorized as u**h*u or l*l**h b sub( b ) must have been previously factorized as u**h*u or l*l**h b sub( b ) must have been previously factorized as u**h*u or l*l**h b receive previously transmitted matrix section, which form the "spike" fillin. receive previously transmitted matrix section, which form the "spike" fillin. |
| principal principal space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of space af. mathematically, this is equivalent to reordering the matrix a as p a p^t and then factoring the principal the matrices factored on each processor. the factors of |
| prior prior send modifications to prior processor's right hand side send modifications to prior processor's right hand side processor column that owns a( :, i+1 ) so that a( :, i+1 ) can be updated prior to spreading v across we keep the block column of a up-to-date to minimize the send modifications to prior processor's right hand side send modifications to prior processor's right hand side send modifications to prior processor's right hand side send modifications to prior processor's right hand side send modifications to prior processor's right hand side send modifications to prior processor's right hand side processor column that owns a( :, i+1 ) so that a( :, i+1 ) can be updated prior to spreading v across we keep the block column of a up-to-date to minimize the send modifications to prior processor's right hand side send modifications to prior processor's right hand side send modifications to prior processor's right hand side send modifications to prior processor's right hand side processor column that owns a( :, i+1 ) so that a( :, i+1 ) can be updated prior to spreading v across we keep the block column of a up-to-date to minimize the send modifications to prior processor's right hand side send modifications to prior processor's right hand side processor column that owns a( :, i+1 ) so that a( :, i+1 ) can be updated prior to spreading v across we keep the block column of a up-to-date to minimize the send modifications to prior processor's right hand side send modifications to prior processor's right hand side |
| priori priori (only the first nsplit elements will actually be used, but since the user cannot know a priori what value nsplit wil (only the first nsplit elements will actually be used, but since the user cannot know a priori what value nsplit wil |
| Probable Probable used was incorrect. no eigenvalues were computed. Probable cause: your machine has sloppy floatin cure: increase the parameter "fudge", recompile, used was incorrect. no eigenvalues were computed. Probable cause: your machine has sloppy floatin cure: increase the parameter "fudge", recompile, |
| problem problem goto put in by g. henry to fix alpha problem gp = ( ( oldgp+p )-( d( l )-p ) ) / 1. if trans = 'n' and m >= n: find the least squares solution of an overdetermined system, i.e., solve the least squares problem reference: f. tisseur and j. dongarra, "a parallel divide and conquer algorithm for the symmetric eigenvalue problem siam j. sci. comput., 6:20 (1999), pp. 2223--2236. pchegs2 reduces a complex hermitian-definite generalized eigenproblem pchegst reduces a complex hermitian-definite generalized eigenproblem the eigenvectors of a complex generalized hermitian-definite eigenproblem, of the for sub( b )*sub( a )*x=(lambda)*x. pchengst reduces a complex hermitian-definite generalized eigenproblem to standard form pchengst performs the same function as pchegst, but is based on 1. if trans = 'n' and m >= n: find the least squares solution of an overdetermined system, i.e., solve the least squares problem the first stage consists of deflating the size of the problem the z vector. for each such occurence the dimension of the pdlaed2 sorts the two sets of eigenvalues together into a single sorted set. then it tries to deflate the size of the problem eigenvalues are close together or if there is a tiny entry in the reference: f. tisseur and j. dongarra, "a parallel divide and conquer algorithm for the symmetric eigenvalue problem siam j. sci. comput., 6:20 (1999), pp. 2223--2236. reference: f. tisseur and j. dongarra, "a parallel divide and conquer algorithm for the symmetric eigenvalue problem siam j. sci. comput., 6:20 (1999), pp. 2223--2236. pdsygs2 reduces a real symmetric-definite generalized eigenproblem pdsygst reduces a real symmetric-definite generalized eigenproblem the eigenvectors of a real generalized sy-definite eigenproblem, of the for sub( b )*sub( a )*x=(lambda)*x. pdsyngst reduces a complex hermitian-definite generalized eigenproblem to standard form pdsyngst performs the same function as pdhegst, but is based on tailored eigen-routines to choose problem-dependent parameters for the local environment. see ispe 1. if trans = 'n' and m >= n: find the least squares solution of an overdetermined system, i.e., solve the least squares problem the first stage consists of deflating the size of the problem the z vector. for each such occurence the dimension of the pslaed2 sorts the two sets of eigenvalues together into a single sorted set. then it tries to deflate the size of the problem eigenvalues are close together or if there is a tiny entry in the reference: f. tisseur and j. dongarra, "a parallel divide and conquer algorithm for the symmetric eigenvalue problem siam j. sci. comput., 6:20 (1999), pp. 2223--2236. reference: f. tisseur and j. dongarra, "a parallel divide and conquer algorithm for the symmetric eigenvalue problem siam j. sci. comput., 6:20 (1999), pp. 2223--2236. pssygs2 reduces a real symmetric-definite generalized eigenproblem pssygst reduces a real symmetric-definite generalized eigenproblem the eigenvectors of a real generalized sy-definite eigenproblem, of the for sub( b )*sub( a )*x=(lambda)*x. pssyngst reduces a complex hermitian-definite generalized eigenproblem to standard form pssyngst performs the same function as pshegst, but is based on 1. if trans = 'n' and m >= n: find the least squares solution of an overdetermined system, i.e., solve the least squares problem reference: f. tisseur and j. dongarra, "a parallel divide and conquer algorithm for the symmetric eigenvalue problem siam j. sci. comput., 6:20 (1999), pp. 2223--2236. pzhegs2 reduces a complex hermitian-definite generalized eigenproblem pzhegst reduces a complex hermitian-definite generalized eigenproblem the eigenvectors of a complex generalized hermitian-definite eigenproblem, of the for sub( b )*sub( a )*x=(lambda)*x. pzhengst reduces a complex hermitian-definite generalized eigenproblem to standard form pzhengst performs the same function as pzhegst, but is based on goto put in by g. henry to fix alpha problem gp = ( ( oldgp+p )-( d( l )-p ) ) / |
| proc proc check consistency across processor source processor must be the sam check consistency across processor source processor must be the sam check consistency across processor check consistency across processor source processor must be the sam check consistency across processor source processor must be the sam check consistency across processor source processor must be the sam check consistency across processor source processor must be the sam check consistency across processor proc (iqrow, iqcol) receive the parts of z check consistency across processor source processor must be the sam check consistency across processor source processor must be the sam check consistency across processor source processor must be the sam check consistency across processor source processor must be the sam check consistency across processor proc (iqrow, iqcol) receive the parts of z check consistency across processor source processor must be the sam check consistency across processor source processor must be the sam check consistency across processor source processor must be the sam check consistency across processor source processor must be the sam check consistency across processor check consistency across processor source processor must be the sam check consistency across processor source processor must be the sam |
| procedure procedure 1-dimensional "row" of processes. calling the lapack procedure cbdsqr require wcbdsqr = max(1, 4*size ) 1-dimensional "row" of processes. calling the lapack procedure dbdsqr require wdbdsqr = max(1, 4*size ) this is a scalapack internal procedure and arguments are not checke this is a scalapack internal procedure and arguments are not checke 1-dimensional "row" of processes. calling the lapack procedure sbdsqr require wsbdsqr = max(1, 4*size ) this is a scalapack internal procedure and arguments are not checke this is a scalapack internal procedure and arguments are not checke 1-dimensional "row" of processes. calling the lapack procedure zbdsqr require wzbdsqr = max(1, 4*size ) |
| proceeds proceeds p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: p pieces with one stored on each processor, and then proceeds in 2 phases for the factorization or 3 for th 1) local phase: |
| procesor procesor the computation of v, which could be performed in any processor column (or other procesor subsets), is performed in th can be updated prior to spreading v across. the computation of v, which could be performed in any processor column (or other procesor subsets), is performed in th can be updated prior to spreading v across. the computation of v, which could be performed in any processor column (or other procesor subsets), is performed in th can be updated prior to spreading v across. the computation of v, which could be performed in any processor column (or other procesor subsets), is performed in th can be updated prior to spreading v across. |
| procesors procesors on all processors and hence pjlaenv will return the same value to all procesors (i.e. global output). however some on each processor and hence pjlaenv can return different |
| process process info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo diagonally dominant-like, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo diagonally dominant-like, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo nonsingular, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process only spot checks of the consistency of the eigenvalues across the different processes. because of this, it is possible that messages. np = the number of rows local to a given process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process the mapping between an object element and its corresponding process and memory location let a be a generic term for any 2d block cyclicly distributed vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process ii, jj : local indices into array a icurrow : process row containing diagonal bloc irsc0 : pointer to part of work used to store the rowsums while only one process ro ii, jj : local indices into array a icurrow : process row containing diagonal bloc irsc0 : pointer to part of work used to store the rowsums while gather the intermediate results to process (0,0) sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the pivot vector may be distributed across a process ro matrix a. this routine will transpose the pivot vector if necessary. pivoting the rows of sub( a ), ipiv should be distributed along a process column and replicated over all process rows. similarly all process columns for column pivoting. vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process is sub( c ) only distributed over a process row vector. this vector stores the information required to establish the mapping between an object element and its corresponding process is sub( c ) only distributed over a process row vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process is sub( c ) only distributed over a process row vector. this vector stores the information required to establish the mapping between an object element and its corresponding process is sub( c ) only distributed over a process row vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process sub( a ). this routine assumes that the pivoting information has already been broadcast along the process row or column same mb (or nb) block. if you want to pivot a full matrix, use denoting a( ia:ia+n-1, ja:ja+n-1 ). the result is left on every process of the grid notes vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process no communication is performed by this routine, the matrix to operate on should be strictly local to one process notes vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo positive definite, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on vector. this vector stores the information required to establish the mapping between an object element and its corresponding process the scaling factor are stored along process rows in sr and alon greatly the application of the factors. vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo positive definite, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on vector. this vector stores the information required to establish the mapping between an object element and its corresponding process correspond to user specified eigenvalues. pcstein does not orthogonalize vectors that are on different processes. the exten eigenvectors that are to be orthogonalized are computed by the same vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process block matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). this matrix should be contained in one and only one process memory space (local operation) notes vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo diagonally dominant-like, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo diagonally dominant-like, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo nonsingular, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process drow (global input) integer the process row over which the first row of the matrix d i drow (global input) integer the process row over which the first row of the matrix d i vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process only one process ro ii, jj : local indices into array a icurrow : process row containing diagonal bloc irsc0 : pointer to part of work used to store the rowsums while gather the intermediate results to process (0,0) sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the pivot vector may be distributed across a process ro matrix a. this routine will transpose the pivot vector if necessary. pivoting the rows of sub( a ), ipiv should be distributed along a process column and replicated over all process rows. similarly all process columns for column pivoting. vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process it assumes that the input array, bycol, is distributed across rows and that all process columns contain the same copy o and will contain the entire array. it assumes that the input array, byrow, is distributed across columns and that all process rows contain the same copy o and will contain the entire array. is sub( c ) only distributed over a process row vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process is sub( c ) only distributed over a process row vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process sub( a ). this routine assumes that the pivoting information has already been broadcast along the process row or column same mb (or nb) block. if you want to pivot a full matrix, use denoting a( ia:ia+n-1, ja:ja+n-1 ). the result is left on every process of the grid notes vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process no communication is performed by this routine, the matrix to operate on should be strictly local to one process notes vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo positive definite, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on vector. this vector stores the information required to establish the mapping between an object element and its corresponding process the scaling factor are stored along process rows in sr and alon greatly the application of the factors. vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo positive definite, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on vector. this vector stores the information required to establish the mapping between an object element and its corresponding process correspond to user specified eigenvalues. pdstein does not orthogonalize vectors that are on different processes. the exten eigenvectors that are to be orthogonalized are computed by the same no checks for consistency of the eigenvalues or eigenvectors across the different processes. because of this, it is possible that messages. no checks for consistency of the eigenvalues or eigenvectors across the different processes. because of this, it is possible that messages. vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process the mapping between an object element and its corresponding process and memory location let a be a generic term for any 2d block cyclicly distributed vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process block matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). this matrix should be contained in one and only one process memory space (local operation) notes vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo diagonally dominant-like, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo diagonally dominant-like, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo nonsingular, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process drow (global input) integer the process row over which the first row of the matrix d i drow (global input) integer the process row over which the first row of the matrix d i vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process only one process ro ii, jj : local indices into array a icurrow : process row containing diagonal bloc irsc0 : pointer to part of work used to store the rowsums while gather the intermediate results to process (0,0) sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the pivot vector may be distributed across a process ro matrix a. this routine will transpose the pivot vector if necessary. pivoting the rows of sub( a ), ipiv should be distributed along a process column and replicated over all process rows. similarly all process columns for column pivoting. vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process it assumes that the input array, bycol, is distributed across rows and that all process columns contain the same copy o and will contain the entire array. it assumes that the input array, byrow, is distributed across columns and that all process rows contain the same copy o and will contain the entire array. is sub( c ) only distributed over a process row vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process is sub( c ) only distributed over a process row vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process sub( a ). this routine assumes that the pivoting information has already been broadcast along the process row or column same mb (or nb) block. if you want to pivot a full matrix, use denoting a( ia:ia+n-1, ja:ja+n-1 ). the result is left on every process of the grid notes vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process no communication is performed by this routine, the matrix to operate on should be strictly local to one process notes vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo positive definite, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on vector. this vector stores the information required to establish the mapping between an object element and its corresponding process the scaling factor are stored along process rows in sr and alon greatly the application of the factors. vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo positive definite, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on vector. this vector stores the information required to establish the mapping between an object element and its corresponding process correspond to user specified eigenvalues. psstein does not orthogonalize vectors that are on different processes. the exten eigenvectors that are to be orthogonalized are computed by the same no checks for consistency of the eigenvalues or eigenvectors across the different processes. because of this, it is possible that messages. no checks for consistency of the eigenvalues or eigenvectors across the different processes. because of this, it is possible that messages. vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process the mapping between an object element and its corresponding process and memory location let a be a generic term for any 2d block cyclicly distributed vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process block matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). this matrix should be contained in one and only one process memory space (local operation) notes vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo diagonally dominant-like, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on vector. this vector stores the information required to establish the mapping between an object element and its corresponding process info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo diagonally dominant-like, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo nonsingular, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process only spot checks of the consistency of the eigenvalues across the different processes. because of this, it is possible that messages. np = the number of rows local to a given process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process the mapping between an object element and its corresponding process and memory location let a be a generic term for any 2d block cyclicly distributed vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process ii, jj : local indices into array a icurrow : process row containing diagonal bloc irsc0 : pointer to part of work used to store the rowsums while only one process ro ii, jj : local indices into array a icurrow : process row containing diagonal bloc irsc0 : pointer to part of work used to store the rowsums while gather the intermediate results to process (0,0) sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the pivot vector may be distributed across a process ro matrix a. this routine will transpose the pivot vector if necessary. pivoting the rows of sub( a ), ipiv should be distributed along a process column and replicated over all process rows. similarly all process columns for column pivoting. vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process is sub( c ) only distributed over a process row vector. this vector stores the information required to establish the mapping between an object element and its corresponding process is sub( c ) only distributed over a process row vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process is sub( c ) only distributed over a process row vector. this vector stores the information required to establish the mapping between an object element and its corresponding process is sub( c ) only distributed over a process row vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process sub( a ). this routine assumes that the pivoting information has already been broadcast along the process row or column same mb (or nb) block. if you want to pivot a full matrix, use denoting a( ia:ia+n-1, ja:ja+n-1 ). the result is left on every process of the grid notes vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process no communication is performed by this routine, the matrix to operate on should be strictly local to one process notes vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo positive definite, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on vector. this vector stores the information required to establish the mapping between an object element and its corresponding process the scaling factor are stored along process rows in sr and alon greatly the application of the factors. vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process info = -i. > 0: if info = k<=nprocs, the submatrix stored on processo positive definite, and of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on correspond to user specified eigenvalues. pzstein does not orthogonalize vectors that are on different processes. the exten eigenvectors that are to be orthogonalized are computed by the same vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process block matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). this matrix should be contained in one and only one process memory space (local operation) notes vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process vector. this vector stores the information required to establish the mapping between an object element and its corresponding process |
| processes processes it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a the number of elements of k that a process would receive if k were distributed over the r processes of its process column. similarly receive if k were distributed over the c processes of its process locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a only spot checks of the consistency of the eigenvalues across the different processes. because of this, it is possible that messages. locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locp( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locq( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a although all processes call pcgemr2d, only the processes that ow first column of b receive data. the calls to cgebs2d/cgebr2d locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a correspond to user specified eigenvalues. pcstein does not orthogonalize vectors that are on different processes. the exten eigenvectors that are to be orthogonalized are computed by the same locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the r processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a the number of elements of k that a process would receive if k were distributed over the r processes of its process column. similarly receive if k were distributed over the c processes of its process locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a tridiagonal matrix. on output, q is distributed across the p processes in bloc locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a although all processes call pdgemr2d, only the processes that ow first column of b receive data. the calls to dgebs2d/dgebr2d locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a rows and that all process columns contain the same copy of bycol. the output array, byall, will be identical on all processes columns and that all process rows contain the same copy of byrow. the output array, byall, will be identical on all processes locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a static partitioning of work is done at the beginning of pdstebz which results in all processes finding an (almost) equal number o tridiagonal matrix. on output, q is distributed across the p processes in bloc correspond to user specified eigenvalues. pdstein does not orthogonalize vectors that are on different processes. the exten eigenvectors that are to be orthogonalized are computed by the same no checks for consistency of the eigenvalues or eigenvectors across the different processes. because of this, it is possible that messages. no checks for consistency of the eigenvalues or eigenvectors across the different processes. because of this, it is possible that messages. locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locp( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locq( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a the number of elements of k that a process would receive if k were distributed over the r processes of its process column. similarly receive if k were distributed over the c processes of its process locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a tridiagonal matrix. on output, q is distributed across the p processes in bloc locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a although all processes call psgemr2d, only the processes that ow first column of b receive data. the calls to sgebs2d/sgebr2d locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a rows and that all process columns contain the same copy of bycol. the output array, byall, will be identical on all processes columns and that all process rows contain the same copy of byrow. the output array, byall, will be identical on all processes locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a static partitioning of work is done at the beginning of psstebz which results in all processes finding an (almost) equal number o tridiagonal matrix. on output, q is distributed across the p processes in bloc correspond to user specified eigenvalues. psstein does not orthogonalize vectors that are on different processes. the exten eigenvectors that are to be orthogonalized are computed by the same no checks for consistency of the eigenvalues or eigenvectors across the different processes. because of this, it is possible that messages. no checks for consistency of the eigenvalues or eigenvectors across the different processes. because of this, it is possible that messages. locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locp( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locq( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a the number of elements of k that a process would receive if k were distributed over the r processes of its process column. similarly receive if k were distributed over the c processes of its process locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a only spot checks of the consistency of the eigenvalues across the different processes. because of this, it is possible that messages. locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locp( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locq( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a although all processes call pzgemr2d, only the processes that ow first column of b receive data. the calls to zgebs2d/zgebr2d locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, it is best to distribute the input matrix a one-dimensionally, with columns atomic and rows divided amongst the processes p pieces with one stored on each processor, correspond to user specified eigenvalues. pzstein does not orthogonalize vectors that are on different processes. the exten eigenvectors that are to be orthogonalized are computed by the same locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the r processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a locr( k ) denotes the number of elements of k that a process would receive if k were distributed over the p processes of it similarly, locc( k ) denotes the number of elements of k that a |
| processor processor info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor diagonally dominant-like, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on source processor must be the sam info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor diagonally dominant-like, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on source processor must be the sam info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor nonsingular, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on the workspace is given by a workspace required on processor (0,0) watobd <= nb*(mp0 + nq0 + 1) + nq0. pcstein will perform no better than cstein on 1 processor all eigenvectors will increase the total execution time performance. in the limit (i.e. clustersize = n-1) pcstein will perform no better than cstein on 1 processor all eigenvectors will increase the total execution time arrays v and h are replicated across all processor columns on output, a is replicated across all processes in this processor column ia (global input) integer info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor positive definite, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on source processor must be the sam info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor positive definite, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on source processor must be the sam info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor diagonally dominant-like, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on source processor must be the sam info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor diagonally dominant-like, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on source processor must be the sam info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor nonsingular, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on the workspace is given by a workspace required on processor (0,0) watobd <= nb*(mp0 + nq0 + 1) + nq0. on output, a is replicated across all processes in this processor column ia (global input) integer info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor positive definite, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on source processor must be the sam info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor positive definite, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on source processor must be the sam pdstein will perform no better than dstein on 1 processor all eigenvectors will increase the total execution time performance. in the limit (i.e. clustersize = n-1) pdstein will perform no better than dstein on 1 processor all eigenvectors will increase the total execution time arrays v and h are replicated across all processor columns most parameters set via a call to pjlaenv must be identical on all processors and hence pjlaenv will return the sam in particular, the panel blocking factor can be different info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor diagonally dominant-like, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on source processor must be the sam info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor diagonally dominant-like, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on source processor must be the sam info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor nonsingular, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on the workspace is given by a workspace required on processor (0,0) watobd <= nb*(mp0 + nq0 + 1) + nq0. on output, a is replicated across all processes in this processor column ia (global input) integer info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor positive definite, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on source processor must be the sam info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor positive definite, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on source processor must be the sam psstein will perform no better than sstein on 1 processor all eigenvectors will increase the total execution time performance. in the limit (i.e. clustersize = n-1) psstein will perform no better than sstein on 1 processor all eigenvectors will increase the total execution time arrays v and h are replicated across all processor columns info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor diagonally dominant-like, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on source processor must be the sam info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor diagonally dominant-like, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on source processor must be the sam info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor nonsingular, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on the workspace is given by a workspace required on processor (0,0) watobd <= nb*(mp0 + nq0 + 1) + nq0. pzstein will perform no better than zstein on 1 processor all eigenvectors will increase the total execution time performance. in the limit (i.e. clustersize = n-1) pzstein will perform no better than zstein on 1 processor all eigenvectors will increase the total execution time arrays v and h are replicated across all processor columns on output, a is replicated across all processes in this processor column ia (global input) integer info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor positive definite, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on source processor must be the sam info = -i. > 0: if info = k<=nprocs, the submatrix stored on processor positive definite, and check consistency across processor of the divide and conquer algorithm as a task-parallel algorithm. this formula in words is: no processor may have more than on source processor must be the sam |
| processors processors info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors check consistency across processors info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors check consistency across processors info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors column (i.e. mycol .eq. curcol) they are the same. however, on some processors, a( lii, lij ) points to an elemen h(m+2,m-1). since these elements may be on separate processors, the first major loop (10) goes over the tridiagona the node owning h(m,m) does not. this will occur on a border when we hit a border, there are row and column transforms that overlap over several processors and the code gets ver *local* matrix is generated on one node (called smalla) and info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors check consistency across processors info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors check consistency across processors info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors check consistency across processors info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors check consistency across processors info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors h(m+2,m-1). since these elements may be on separate processors, the first major loop (10) goes over the tridiagona the node owning h(m,m) does not. this will occur on a border when we hit a border, there are row and column transforms that overlap over several processors and the code gets ver *local* matrix is generated on one node (called smalla) and info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors check consistency across processors info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors check consistency across processors column (i.e. mycol .eq. curcol) they are the same. however, on some processors, a( lii, lij ) points to an elemen most parameters set via a call to pjlaenv must be identical on all processors and hence pjlaenv will return the sam in particular, the panel blocking factor can be different info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors check consistency across processors info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors check consistency across processors info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors h(m+2,m-1). since these elements may be on separate processors, the first major loop (10) goes over the tridiagona the node owning h(m,m) does not. this will occur on a border when we hit a border, there are row and column transforms that overlap over several processors and the code gets ver *local* matrix is generated on one node (called smalla) and info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors check consistency across processors info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors check consistency across processors column (i.e. mycol .eq. curcol) they are the same. however, on some processors, a( lii, lij ) points to an elemen info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors check consistency across processors info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors check consistency across processors info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors column (i.e. mycol .eq. curcol) they are the same. however, on some processors, a( lii, lij ) points to an elemen h(m+2,m-1). since these elements may be on separate processors, the first major loop (10) goes over the tridiagona the node owning h(m,m) does not. this will occur on a border when we hit a border, there are row and column transforms that overlap over several processors and the code gets ver *local* matrix is generated on one node (called smalla) and info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors check consistency across processors info-nprocs representing interactions with other processors was no and the factorization was not completed. check consistency across processors check consistency across processors |
| procs procs form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. byall(i) = bycol( numroc(i,desc( nb_ ),myrow,0,nprow ) on the procs byall(i) = byrow( numroc(i,desc( mb_ ),mycol,0,npcol ) on the procs form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. byall(i) = bycol( numroc(i,desc( nb_ ),myrow,0,nprow ) on the procs byall(i) = byrow( numroc(i,desc( mb_ ),mycol,0,npcol ) on the procs form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. form a new blacs grid (the "standard form" grid) with only procs starting at csrc=0, with ja modified to reflect dropped procs. |
| produce produce where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n complex where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n complex where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n complex where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n complex where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n complex where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n real where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n real where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n real where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n real where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n real where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n real where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n real where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n real where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n real where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n real where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n complex where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n complex where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n complex where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n complex where a(1:n, ja:ja+n-1) is the matrix used to produce the factor a(1:n, ja:ja+n-1) is an n-by-n complex |
| product product a = l * u where l is a product of unit lower bidiagona diagonal and first superdiagonal. a = l * u where l is a product of unit lower bidiagona diagonal and first superdiagonal. below the diagonal, with the array tauq, represent the unitary matrix q as a product of elementary reflectors, an taup, represent the orthogonal matrix p as a product of below the diagonal, with the array tauq, represent the unitary matrix q as a product of elementary reflectors, an taup, represent the orthogonal matrix p as a product of ments below the first subdiagonal, with the array tau, repre- sent the unitary matrix q as a product of elementar ments below the first subdiagonal, with the array tau, repre- sent the unitary matrix q as a product of elementar the elements above the diagonal, with the array tau, repre- sent the unitary matrix q as a product of elementar the elements above the diagonal, with the array tau, repre- sent the unitary matrix q as a product of elementar trapezoidal matrix l; the remaining elements, with the array tau, represent the unitary matrix q as a product o trapezoidal matrix l; the remaining elements, with the array tau, represent the unitary matrix q as a product o the elements below the diagonal, with the array tau, repre- sent the unitary matrix q as a product of elementar the elements below the diagonal, with the array tau, represent the unitary matrix q as a product of elementar the elements below the diagonal, with the array tau, represent the unitary matrix q as a product of elementar trapezoidal matrix r; the remaining elements, with the array tau, represent the unitary matrix q as a product o trapezoidal matrix r; the remaining elements, with the array tau, represent the unitary matrix q as a product o the elements below the diagonal, with the array taua, represent the unitary matrix q as a product of min(n,m trapezoidal matrix r; the remaining elements, with the array taua, represent the unitary matrix q as a product o values in this array correspond to the clusters indicated by the array iclustr. as a result, the dot product betwee as ( c * n ) / gap(i) where c is a small constant. values in this array correspond to the clusters indicated by the array iclustr. as a result, the dot product betwee as ( c * n ) / gap(i) where c is a small constant. with the array tau, represent the unitary matrix q as a product of elementary reflectors; if uplo = 'l', the diagona corresponding elements of the tridiagonal matrix t, and the with the array tau, represent the unitary matrix q as a product of elementary reflectors; if uplo = 'l', the diagona corresponding elements of the tridiagonal matrix t, and the with the array tau, represent the unitary matrix q as a product of elementary reflectors; if uplo = 'l', the diagona corresponding elements of the tridiagonal matrix t, and the with the array tau, represent the unitary matrix q as a product of elementary reflectors; if uplo = 'l', the diagona corresponding elements of the tridiagonal matrix t, and the columns, with the array tauq, represent the unitary matrix q as a product of elementary reflectors; an array taup, represent the unitary matrix p as a product two consecutive small subdiagonal elements will stall convergence of a double shift if their product is smal necessary to scan the "tridiagonal portion of the matrix." in matrix; the elements below the k-th subdiagonal, with the array tau, represent the matrix q as a product of elementar unchanged. see further details. direct (global input) character indicates how q is formed from a product of elementar = 'f': q = h(1) h(2) . . . h(k) (forward) pclarft forms the triangular factor t of a complex block reflector h of order n, which is defined as a product of k elementary reflectors if direct = 'f', h = h(1) h(2) . . . h(k) and t is upper triangular; q is a product of k elementary reflectors as returned by pctzrzf currently, only storev = 'r' and direct = 'b' are supported. pclarzt forms the triangular factor t of a complex block reflector h of order > n, which is defined as a product of k elementar diagonal with the array tau, represent the unitary matrix q as a product of elementary reflectors. if uplo = 'l', th the diagonal elements overwriting the diagonal elements of of sub( a ), with the array tau, represent the unitary matrix z as a product of m elementary reflectors ia (global input) integer if the scaling needed for a in the dot product is 1 pclauu2 computes the product u * u' or l' * l, where the triangula the matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). pclauum computes the product u * u' or l' * l, where the triangula the distributed matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). values in this array correspond to the info/(m+1) clusters indicated by the array iclustr. as a result, the dot product as high as ( o(n)*macheps ) / gap(i). sub( a ), with the array tau, represent the unitary matrix z as a product of m elementary reflectors ia (global input) integer a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined as the last n columns of a product of k elementary reflectors of order q = h(k) . . . h(2) h(1) a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined as the first n columns of a product of k elementary reflectors of orde a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as the first m rows of a product of k elementary reflectors of order q = h(k)' . . . h(2)' h(1)' a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as the first m rows of a product of k elementary reflectors of order q = h(k)' . . . h(2)' h(1)' a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined as the last n columns of a product of k elementary reflectors of order q = h(k) . . . h(2) h(1) a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined as the first n columns of a product of k elementary reflectors of orde a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as the last m rows of a product of k elementary reflectors of order q = h(1)' h(2)' . . . h(k)' a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as the last m rows of a product of k elementary reflectors of order q = h(1)' h(2)' . . . h(k)' where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(k) . . . h(2) h(1) where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(1) h(2) . . . h(k) nq = m if side = 'l' and nq = n if side = 'r'. q is defined as the product of ihi-ilo elementary reflectors, as returned by pcgehrd q = h(ilo) h(ilo+1) . . . h(ihi-1). where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(k)' . . . h(2)' h(1)' where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(k)' . . . h(2)' h(1)' where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(k) . . . h(2) h(1) where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(1) h(2) . . . h(k) where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(1)' h(2)' . . . h(k)' where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(1)' h(2)' . . . h(k)' where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(1)' h(2)' . . . h(k)' where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(1)' h(2)' . . . h(k)' nq = m if side = 'l' and nq = n if side = 'r'. q is defined as the product of nq-1 elementary reflectors, as returned by pchetrd if uplo = 'u', q = h(nq-1) . . . h(2) h(1); below the diagonal, with the array tauq, represent the orthogonal matrix q as a product of elementary reflectors array taup, represent the orthogonal matrix p as a product below the diagonal, with the array tauq, represent the orthogonal matrix q as a product of elementary reflectors array taup, represent the orthogonal matrix p as a product ments below the first subdiagonal, with the array tau, repre- sent the orthogonal matrix q as a product of elementar ments below the first subdiagonal, with the array tau, repre- sent the orthogonal matrix q as a product of elementar the elements above the diagonal, with the array tau, repre- sent the orthogonal matrix q as a product of elementar the elements above the diagonal, with the array tau, repre- sent the orthogonal matrix q as a product of elementar trapezoidal matrix l; the remaining elements, with the array tau, represent the orthogonal matrix q as a product o trapezoidal matrix l; the remaining elements, with the array tau, represent the orthogonal matrix q as a product o the elements below the diagonal, with the array tau, repre- sent the orthogonal matrix q as a product of elementar the elements below the diagonal, with the array tau, represent the orthogonal matrix q as a product of elementar the elements below the diagonal, with the array tau, represent the orthogonal matrix q as a product of elementar trapezoidal matrix r; the remaining elements, with the array tau, represent the orthogonal matrix q as a product o trapezoidal matrix r; the remaining elements, with the array tau, represent the orthogonal matrix q as a product o the elements below the diagonal, with the array taua, represent the orthogonal matrix q as a product of min(n,m trapezoidal matrix r; the remaining elements, with the array taua, represent the orthogonal matrix q as a product o columns, with the array tauq, represent the orthogonal matrix q as a product of elementary reflectors; an array taup, represent the orthogonal matrix p as a product two consecutive small subdiagonal elements will stall convergence of a double shift if their product is smal necessary to scan the "tridiagonal portion of the matrix." in matrix; the elements below the k-th subdiagonal, with the array tau, represent the matrix q as a product of elementar unchanged. see further details. direct (global input) character indicates how q is formed from a product of elementar = 'f': q = h(1) h(2) . . . h(k) (forward) pdlarft forms the triangular factor t of a real block reflector h of order n, which is defined as a product of k elementary reflectors if direct = 'f', h = h(1) h(2) . . . h(k) and t is upper triangular; q is a product of k elementary reflectors as returned by pdtzrzf currently, only storev = 'r' and direct = 'b' are supported. pdlarzt forms the triangular factor t of a real block reflector h of order > n, which is defined as a product of k elementar diagonal with the array tau, represent the orthogonal matrix q as a product of elementary reflectors. if uplo = 'l', th the diagonal elements overwriting the diagonal elements of of sub( a ), with the array tau, represent the orthogonal matrix z as a product of m elementary reflectors ia (global input) integer pdlauu2 computes the product u * u' or l' * l, where the triangula the matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). pdlauum computes the product u * u' or l' * l, where the triangula the distributed matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined as the last n columns of a product of k elementary reflectors of order q = h(k) . . . h(2) h(1) a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined as the first n columns of a product of k elementary reflectors of orde a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as the first m rows of a product of k elementary reflectors of order q = h(k) . . . h(2) h(1) a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as the first m rows of a product of k elementary reflectors of order q = h(k) . . . h(2) h(1) a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined as the last n columns of a product of k elementary reflectors of order q = h(k) . . . h(2) h(1) a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined as the first n columns of a product of k elementary reflectors of orde a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as the last m rows of a product of k elementary reflectors of order q = h(1) h(2) . . . h(k) a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as the last m rows of a product of k elementary reflectors of order q = h(1) h(2) . . . h(k) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(k) . . . h(2) h(1) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(1) h(2) . . . h(k) nq = m if side = 'l' and nq = n if side = 'r'. q is defined as the product of ihi-ilo elementary reflectors, as returned by pdgehrd q = h(ilo) h(ilo+1) . . . h(ihi-1). where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(k) . . . h(2) h(1) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(k) . . . h(2) h(1) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(k) . . . h(2) h(1) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(1) h(2) . . . h(k) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(1) h(2) . . . h(k) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(1) h(2) . . . h(k) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(1) h(2) . . . h(k) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(1) h(2) . . . h(k) nq = m if side = 'l' and nq = n if side = 'r'. q is defined as the product of nq-1 elementary reflectors, as returned by pdsytrd if uplo = 'u', q = h(nq-1) . . . h(2) h(1); values in this array correspond to the info/(m+1) clusters indicated by the array iclustr. as a result, the dot product as high as ( o(n)*macheps ) / gap(i). values in this array correspond to the clusters indicated by the array iclustr. as a result, the dot product betwee as ( c * n ) / gap(i) where c is a small constant. values in this array correspond to the clusters indicated by the array iclustr. as a result, the dot product betwee as ( c * n ) / gap(i) where c is a small constant. with the array tau, represent the orthogonal matrix q as a product of elementary reflectors; if uplo = 'l', the diagona corresponding elements of the tridiagonal matrix t, and the with the array tau, represent the orthogonal matrix q as a product of elementary reflectors; if uplo = 'l', the diagona corresponding elements of the tridiagonal matrix t, and the with the array tau, represent the orthogonal matrix q as a product of elementary reflectors; if uplo = 'l', the diagona corresponding elements of the tridiagonal matrix t, and the with the array tau, represent the unitary matrix q as a product of elementary reflectors; if uplo = 'l', the diagona corresponding elements of the tridiagonal matrix t, and the sub( a ), with the array tau, represent the orthogonal matrix z as a product of m elementary reflectors ia (global input) integer below the diagonal, with the array tauq, represent the orthogonal matrix q as a product of elementary reflectors array taup, represent the orthogonal matrix p as a product below the diagonal, with the array tauq, represent the orthogonal matrix q as a product of elementary reflectors array taup, represent the orthogonal matrix p as a product ments below the first subdiagonal, with the array tau, repre- sent the orthogonal matrix q as a product of elementar ments below the first subdiagonal, with the array tau, repre- sent the orthogonal matrix q as a product of elementar the elements above the diagonal, with the array tau, repre- sent the orthogonal matrix q as a product of elementar the elements above the diagonal, with the array tau, repre- sent the orthogonal matrix q as a product of elementar trapezoidal matrix l; the remaining elements, with the array tau, represent the orthogonal matrix q as a product o trapezoidal matrix l; the remaining elements, with the array tau, represent the orthogonal matrix q as a product o the elements below the diagonal, with the array tau, repre- sent the orthogonal matrix q as a product of elementar the elements below the diagonal, with the array tau, represent the orthogonal matrix q as a product of elementar the elements below the diagonal, with the array tau, represent the orthogonal matrix q as a product of elementar trapezoidal matrix r; the remaining elements, with the array tau, represent the orthogonal matrix q as a product o trapezoidal matrix r; the remaining elements, with the array tau, represent the orthogonal matrix q as a product o the elements below the diagonal, with the array taua, represent the orthogonal matrix q as a product of min(n,m trapezoidal matrix r; the remaining elements, with the array taua, represent the orthogonal matrix q as a product o columns, with the array tauq, represent the orthogonal matrix q as a product of elementary reflectors; an array taup, represent the orthogonal matrix p as a product two consecutive small subdiagonal elements will stall convergence of a double shift if their product is smal necessary to scan the "tridiagonal portion of the matrix." in matrix; the elements below the k-th subdiagonal, with the array tau, represent the matrix q as a product of elementar unchanged. see further details. direct (global input) character indicates how q is formed from a product of elementar = 'f': q = h(1) h(2) . . . h(k) (forward) pslarft forms the triangular factor t of a real block reflector h of order n, which is defined as a product of k elementary reflectors if direct = 'f', h = h(1) h(2) . . . h(k) and t is upper triangular; q is a product of k elementary reflectors as returned by pstzrzf currently, only storev = 'r' and direct = 'b' are supported. pslarzt forms the triangular factor t of a real block reflector h of order > n, which is defined as a product of k elementar diagonal with the array tau, represent the orthogonal matrix q as a product of elementary reflectors. if uplo = 'l', th the diagonal elements overwriting the diagonal elements of of sub( a ), with the array tau, represent the orthogonal matrix z as a product of m elementary reflectors ia (global input) integer pslauu2 computes the product u * u' or l' * l, where the triangula the matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). pslauum computes the product u * u' or l' * l, where the triangula the distributed matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined as the last n columns of a product of k elementary reflectors of order q = h(k) . . . h(2) h(1) a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined as the first n columns of a product of k elementary reflectors of orde a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as the first m rows of a product of k elementary reflectors of order q = h(k) . . . h(2) h(1) a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as the first m rows of a product of k elementary reflectors of order q = h(k) . . . h(2) h(1) a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined as the last n columns of a product of k elementary reflectors of order q = h(k) . . . h(2) h(1) a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined as the first n columns of a product of k elementary reflectors of orde a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as the last m rows of a product of k elementary reflectors of order q = h(1) h(2) . . . h(k) a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as the last m rows of a product of k elementary reflectors of order q = h(1) h(2) . . . h(k) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(k) . . . h(2) h(1) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(1) h(2) . . . h(k) nq = m if side = 'l' and nq = n if side = 'r'. q is defined as the product of ihi-ilo elementary reflectors, as returned by psgehrd q = h(ilo) h(ilo+1) . . . h(ihi-1). where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(k) . . . h(2) h(1) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(k) . . . h(2) h(1) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(k) . . . h(2) h(1) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(1) h(2) . . . h(k) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(1) h(2) . . . h(k) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(1) h(2) . . . h(k) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(1) h(2) . . . h(k) where q is a real orthogonal distributed matrix defined as the product of k elementary reflector q = h(1) h(2) . . . h(k) nq = m if side = 'l' and nq = n if side = 'r'. q is defined as the product of nq-1 elementary reflectors, as returned by pssytrd if uplo = 'u', q = h(nq-1) . . . h(2) h(1); values in this array correspond to the info/(m+1) clusters indicated by the array iclustr. as a result, the dot product as high as ( o(n)*macheps ) / gap(i). values in this array correspond to the clusters indicated by the array iclustr. as a result, the dot product betwee as ( c * n ) / gap(i) where c is a small constant. values in this array correspond to the clusters indicated by the array iclustr. as a result, the dot product betwee as ( c * n ) / gap(i) where c is a small constant. with the array tau, represent the orthogonal matrix q as a product of elementary reflectors; if uplo = 'l', the diagona corresponding elements of the tridiagonal matrix t, and the with the array tau, represent the orthogonal matrix q as a product of elementary reflectors; if uplo = 'l', the diagona corresponding elements of the tridiagonal matrix t, and the with the array tau, represent the orthogonal matrix q as a product of elementary reflectors; if uplo = 'l', the diagona corresponding elements of the tridiagonal matrix t, and the with the array tau, represent the unitary matrix q as a product of elementary reflectors; if uplo = 'l', the diagona corresponding elements of the tridiagonal matrix t, and the sub( a ), with the array tau, represent the orthogonal matrix z as a product of m elementary reflectors ia (global input) integer below the diagonal, with the array tauq, represent the unitary matrix q as a product of elementary reflectors, an taup, represent the orthogonal matrix p as a product of below the diagonal, with the array tauq, represent the unitary matrix q as a product of elementary reflectors, an taup, represent the orthogonal matrix p as a product of ments below the first subdiagonal, with the array tau, repre- sent the unitary matrix q as a product of elementar ments below the first subdiagonal, with the array tau, repre- sent the unitary matrix q as a product of elementar the elements above the diagonal, with the array tau, repre- sent the unitary matrix q as a product of elementar the elements above the diagonal, with the array tau, repre- sent the unitary matrix q as a product of elementar trapezoidal matrix l; the remaining elements, with the array tau, represent the unitary matrix q as a product o trapezoidal matrix l; the remaining elements, with the array tau, represent the unitary matrix q as a product o the elements below the diagonal, with the array tau, repre- sent the unitary matrix q as a product of elementar the elements below the diagonal, with the array tau, represent the unitary matrix q as a product of elementar the elements below the diagonal, with the array tau, represent the unitary matrix q as a product of elementar trapezoidal matrix r; the remaining elements, with the array tau, represent the unitary matrix q as a product o trapezoidal matrix r; the remaining elements, with the array tau, represent the unitary matrix q as a product o the elements below the diagonal, with the array taua, represent the unitary matrix q as a product of min(n,m trapezoidal matrix r; the remaining elements, with the array taua, represent the unitary matrix q as a product o values in this array correspond to the clusters indicated by the array iclustr. as a result, the dot product betwee as ( c * n ) / gap(i) where c is a small constant. values in this array correspond to the clusters indicated by the array iclustr. as a result, the dot product betwee as ( c * n ) / gap(i) where c is a small constant. with the array tau, represent the unitary matrix q as a product of elementary reflectors; if uplo = 'l', the diagona corresponding elements of the tridiagonal matrix t, and the with the array tau, represent the unitary matrix q as a product of elementary reflectors; if uplo = 'l', the diagona corresponding elements of the tridiagonal matrix t, and the with the array tau, represent the unitary matrix q as a product of elementary reflectors; if uplo = 'l', the diagona corresponding elements of the tridiagonal matrix t, and the with the array tau, represent the unitary matrix q as a product of elementary reflectors; if uplo = 'l', the diagona corresponding elements of the tridiagonal matrix t, and the columns, with the array tauq, represent the unitary matrix q as a product of elementary reflectors; an array taup, represent the unitary matrix p as a product two consecutive small subdiagonal elements will stall convergence of a double shift if their product is smal necessary to scan the "tridiagonal portion of the matrix." in matrix; the elements below the k-th subdiagonal, with the array tau, represent the matrix q as a product of elementar unchanged. see further details. direct (global input) character indicates how q is formed from a product of elementar = 'f': q = h(1) h(2) . . . h(k) (forward) pzlarft forms the triangular factor t of a complex block reflector h of order n, which is defined as a product of k elementary reflectors if direct = 'f', h = h(1) h(2) . . . h(k) and t is upper triangular; q is a product of k elementary reflectors as returned by pztzrzf currently, only storev = 'r' and direct = 'b' are supported. pzlarzt forms the triangular factor t of a complex block reflector h of order > n, which is defined as a product of k elementar diagonal with the array tau, represent the unitary matrix q as a product of elementary reflectors. if uplo = 'l', th the diagonal elements overwriting the diagonal elements of of sub( a ), with the array tau, represent the unitary matrix z as a product of m elementary reflectors ia (global input) integer if the scaling needed for a in the dot product is 1 pzlauu2 computes the product u * u' or l' * l, where the triangula the matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). pzlauum computes the product u * u' or l' * l, where the triangula the distributed matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). values in this array correspond to the info/(m+1) clusters indicated by the array iclustr. as a result, the dot product as high as ( o(n)*macheps ) / gap(i). sub( a ), with the array tau, represent the unitary matrix z as a product of m elementary reflectors ia (global input) integer a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined as the last n columns of a product of k elementary reflectors of order q = h(k) . . . h(2) h(1) a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined as the first n columns of a product of k elementary reflectors of orde a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as the first m rows of a product of k elementary reflectors of order q = h(k)' . . . h(2)' h(1)' a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as the first m rows of a product of k elementary reflectors of order q = h(k)' . . . h(2)' h(1)' a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined as the last n columns of a product of k elementary reflectors of order q = h(k) . . . h(2) h(1) a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined as the first n columns of a product of k elementary reflectors of orde a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as the last m rows of a product of k elementary reflectors of order q = h(1)' h(2)' . . . h(k)' a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as the last m rows of a product of k elementary reflectors of order q = h(1)' h(2)' . . . h(k)' where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(k) . . . h(2) h(1) where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(1) h(2) . . . h(k) nq = m if side = 'l' and nq = n if side = 'r'. q is defined as the product of ihi-ilo elementary reflectors, as returned by pzgehrd q = h(ilo) h(ilo+1) . . . h(ihi-1). where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(k)' . . . h(2)' h(1)' where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(k)' . . . h(2)' h(1)' where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(k) . . . h(2) h(1) where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(1) h(2) . . . h(k) where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(1)' h(2)' . . . h(k)' where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(1)' h(2)' . . . h(k)' where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(1)' h(2)' . . . h(k)' where q is a complex unitary distributed matrix defined as the product of k elementary reflector q = h(1)' h(2)' . . . h(k)' nq = m if side = 'l' and nq = n if side = 'r'. q is defined as the product of nq-1 elementary reflectors, as returned by pzhetrd if uplo = 'u', q = h(nq-1) . . . h(2) h(1); a = l * u where l is a product of unit lower bidiagona diagonal and first superdiagonal. a = l * u where l is a product of unit lower bidiagona diagonal and first superdiagonal. |
| products products the matrices q and p are represented as products of elementar the matrices q and p are represented as products of elementar the matrices q and p are represented as products of elementar a. reverse communication is used for evaluating matrix-vector products. x and v are aligned with the distributed matrix a, thi matrices x and/or y of right or left eigenvectors of t, or the products q*x and/or q*y, where q is an input unitar original matrix a = q*t*q', then q*x and q*y are the matrices of bidiagonal form: a(ia:*,ja:*) = q * b * p**h. q and p**h are defined as products of elementary reflectors h(i) and g(i) respectively let nq = m if side = 'l' and nq = n if side = 'r'. thus nq is the the matrices q and p are represented as products of elementar the matrices q and p are represented as products of elementar the matrices q and p are represented as products of elementar pdlacon estimates the 1-norm of a square, real distributed matrix a. reverse communication is used for evaluating matrix-vector products is implicitly contained within iv, ix, descv, and descx. bidiagonal form: a(ia:*,ja:*) = q * b * p**t. q and p**t are defined as products of elementary reflectors h(i) and g(i) respectively let nq = m if side = 'l' and nq = n if side = 'r'. thus nq is the the matrices q and p are represented as products of elementar the matrices q and p are represented as products of elementar the matrices q and p are represented as products of elementar pslacon estimates the 1-norm of a square, real distributed matrix a. reverse communication is used for evaluating matrix-vector products is implicitly contained within iv, ix, descv, and descx. bidiagonal form: a(ia:*,ja:*) = q * b * p**t. q and p**t are defined as products of elementary reflectors h(i) and g(i) respectively let nq = m if side = 'l' and nq = n if side = 'r'. thus nq is the the matrices q and p are represented as products of elementar the matrices q and p are represented as products of elementar the matrices q and p are represented as products of elementar a. reverse communication is used for evaluating matrix-vector products. x and v are aligned with the distributed matrix a, thi matrices x and/or y of right or left eigenvectors of t, or the products q*x and/or q*y, where q is an input unitar original matrix a = q*t*q', then q*x and q*y are the matrices of bidiagonal form: a(ia:*,ja:*) = q * b * p**h. q and p**h are defined as products of elementary reflectors h(i) and g(i) respectively let nq = m if side = 'l' and nq = n if side = 'r'. thus nq is the |
| program program on entry, lda specifies the first dimension of a as declared in the calling (sub) program. lda must be at leas unchanged on exit. on entry, lda specifies the first dimension of a as declared in the calling (sub) program. lda must be at leas unchanged on exit. the algorithm used in this program is basically backward (forward the code robust against possible overflow. but scaling has not yet the algorithm used in this program is basically backward (forward the code robust against possible overflow. but scaling has not yet on entry, lda specifies the first dimension of a as declared in the calling (sub) program. lda must be at leas unchanged on exit. on entry, lda specifies the first dimension of a as declared in the calling (sub) program. lda must be at leas unchanged on exit. |
| progress progress $ ( two*( c*oldrp-b )+safmin ) make sure that we are making progress skip the current step: the subdiagonal info is just noise. $ ( two*( c*oldrp-b )+safmin ) make sure that we are making progress skip the current step: the subdiagonal info is just noise. |
| promise promise functional differences: pcheevx does not promise orthogonality for eigenvectors associate pcheevx does not reorthogonalize eigenvectors functional differences: pdsyevx does not promise orthogonality for eigenvectors associate pdsyevx does not reorthogonalize eigenvectors functional differences: pssyevx does not promise orthogonality for eigenvectors associate pssyevx does not reorthogonalize eigenvectors functional differences: pzheevx does not promise orthogonality for eigenvectors associate pzheevx does not reorthogonalize eigenvectors |
| proper proper (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa .and. iroffa.eq.0 ) with the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) with the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa .and. iroffa.eq.0 ) with sub( c ) is a proper distributed matri sub( c ) is a proper distributed matri sub( c ) is a proper distributed matri sub( c ) is a proper distributed matri (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) sub( c ) is a proper distributed matri sub( c ) is a proper distributed matri (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa .and. iroffa.eq.0 ) with the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) with the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa .and. iroffa.eq.0 ) with (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) sub( c ) is a proper distributed matri sub( c ) is a proper distributed matri (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa .and. iroffa.eq.0 ) with the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) with the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa .and. iroffa.eq.0 ) with (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa .and. iroffa.eq.0 ) with the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa ) with the distributed submatrix sub( a ) must verify some alignment proper ( mb_a.eq.nb_a .and. iroffa.eq.icoffa .and. iroffa.eq.0 ) with sub( c ) is a proper distributed matri sub( c ) is a proper distributed matri sub( c ) is a proper distributed matri sub( c ) is a proper distributed matri (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* |
| properly properly (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int (501 or 502) is input. this routine will interpret the grid properly either way the grid passed to a single scalapack routine *must be the same* adjust addressing into matrix space to properly get int |
| properties properties the distributed submatrices sub( a ) and sub( b ) must verify some alignment properties, namely the following expression should be true ( mb_a.eq.mb_b .and. iroffa.eq.iroffb .and. iarow.eq.ibrow ) the distributed submatrices sub( a ) and sub( b ) must verify some alignment properties, namely the following expression should be true ( nb_a.eq.nb_b .and. icoffa.eq.icoffb .and. iacol.eq.ibcol ) the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices sub( a ), sub( z ) must verify some alignment properties, namely the following expressio ( mb_a.eq.nb_a.eq.mb_z.eq.nb_z .and. iroffa.eq.icoffa .and. the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*), c(ic:ic+m-1,jc:jc+n-1), and b( ib:ib+n-1, jb:jb+n-1 ) must verify some alignment properties the distributed submatrices v(iv:*, jv:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices v(iv:*, jv:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices sub( a ) and sub( b ) must verify some alignment properties, namely the following expression should be true ( mb_a.eq.mb_b .and. iroffa.eq.iroffb .and. iarow.eq.ibrow ) the distributed submatrices sub( a ) and sub( b ) must verify some alignment properties, namely the following expression should be true ( nb_a.eq.nb_b .and. icoffa.eq.icoffb .and. iacol.eq.ibcol ) the distributed submatrices v(iv:*, jv:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices v(iv:*, jv:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and z(iz:iz+m-1,jz:jz+n-1) must verify some alignment properties, namely the followin the distributed submatrices sub( a ), sub( z ) must verify some alignment properties, namely the following expressio ( mb_a.eq.nb_a.eq.mb_z.eq.nb_z .and. iroffa.eq.icoffa .and. the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*), c(ic:ic+m-1,jc:jc+n-1), and b( ib:ib+n-1, jb:jb+n-1 ) must verify some alignment properties the distributed submatrices sub( a ) and sub( b ) must verify some alignment properties, namely the following expression should be true ( mb_a.eq.mb_b .and. iroffa.eq.iroffb .and. iarow.eq.ibrow ) the distributed submatrices sub( a ) and sub( b ) must verify some alignment properties, namely the following expression should be true ( nb_a.eq.nb_b .and. icoffa.eq.icoffb .and. iacol.eq.ibcol ) the distributed submatrices v(iv:*, jv:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices v(iv:*, jv:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and z(iz:iz+m-1,jz:jz+n-1) must verify some alignment properties, namely the followin the distributed submatrices sub( a ), sub( z ) must verify some alignment properties, namely the following expressio ( mb_a.eq.nb_a.eq.mb_z.eq.nb_z .and. iroffa.eq.icoffa .and. the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*), c(ic:ic+m-1,jc:jc+n-1), and b( ib:ib+n-1, jb:jb+n-1 ) must verify some alignment properties the distributed submatrices sub( a ) and sub( b ) must verify some alignment properties, namely the following expression should be true ( mb_a.eq.mb_b .and. iroffa.eq.iroffb .and. iarow.eq.ibrow ) the distributed submatrices sub( a ) and sub( b ) must verify some alignment properties, namely the following expression should be true ( nb_a.eq.nb_b .and. icoffa.eq.icoffb .and. iacol.eq.ibcol ) the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices sub( a ), sub( z ) must verify some alignment properties, namely the following expressio ( mb_a.eq.nb_a.eq.mb_z.eq.nb_z .and. iroffa.eq.icoffa .and. the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*), c(ic:ic+m-1,jc:jc+n-1), and b( ib:ib+n-1, jb:jb+n-1 ) must verify some alignment properties the distributed submatrices v(iv:*, jv:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices v(iv:*, jv:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin the distributed submatrices a(ia:*, ja:*) and c(ic:ic+m-1,jc:jc+n-1) must verify some alignment properties, namely the followin |
| property property can send through without breaking the consecutive small subdiagonal property can send through without breaking the consecutive small subdiagonal property |
| prototype prototype pchentrd is a prototype version of pchetrd which uses tailore when the workspace provided by the user is adequate. pclamr1d has not been tested except withint the contect of pcheptrd, the prototype reduction to tridiagonal form code purpose pdlamr1d has not been tested except withint the contect of pdsyptrd, the prototype reduction to tridiagonal form code purpose pdsyntrd is a prototype version of pdsytrd which uses tailore when the workspace provided by the user is adequate. pslamr1d has not been tested except withint the contect of pssyptrd, the prototype reduction to tridiagonal form code purpose pssyntrd is a prototype version of pssytrd which uses tailore when the workspace provided by the user is adequate. pzhentrd is a prototype version of pzhetrd which uses tailore when the workspace provided by the user is adequate. pzlamr1d has not been tested except withint the contect of pzheptrd, the prototype reduction to tridiagonal form code purpose |
| provide provide codes (either the serial, chetrd, or the parallel code, pchettrd) when the workspace provided by the user is adequate greater performance can be achieved if adequate workspace is provided. on the other hand, in some situations increases above the workspace amount shown below: greater performance can be achieved if adequate workspace is provided. on the other hand, in some situations increases above the workspace amount shown below: codes (either the serial, dsytrd, or the parallel code, pdsyttrd) when the workspace provided by the user is adequate greater performance can be achieved if adequate workspace is provided. on the other hand, in some situations increases above the workspace amount shown below: greater performance can be achieved if adequate workspace is provided. on the other hand, in some situations increases above the workspace amount shown below: codes (either the serial, ssytrd, or the parallel code, pssyttrd) when the workspace provided by the user is adequate codes (either the serial, zhetrd, or the parallel code, pzhettrd) when the workspace provided by the user is adequate |
| provided provided the following options are provided 1. if trans = 'n' and m >= n: find the least squares solution of error bounds on the solution and a condition estimate are also provided notes pchengst also calls pchegst when insufficient workspace is provided, hence pchengst provides improve codes (either the serial, chetrd, or the parallel code, pchettrd) when the workspace provided by the user is adequate error bounds on the solution and a condition estimate are also provided. in the following comments y denotes y(iy:iy+m-1,jy:jy+k-1 the following options are provided 1. if trans = 'n' and m >= n: find the least squares solution of error bounds on the solution and a condition estimate are also provided notes error bounds on the solution and a condition estimate are also provided. in the following comments y denotes y(iy:iy+m-1,jy:jy+k-1 greater performance can be achieved if adequate workspace is provided. on the other hand, in some situations increases above the workspace amount shown below: greater performance can be achieved if adequate workspace is provided. on the other hand, in some situations increases above the workspace amount shown below: pdsyngst also calls pdhegst when insufficient workspace is provided, hence pdsyngst provides improve codes (either the serial, dsytrd, or the parallel code, pdsyttrd) when the workspace provided by the user is adequate the following options are provided 1. if trans = 'n' and m >= n: find the least squares solution of error bounds on the solution and a condition estimate are also provided notes error bounds on the solution and a condition estimate are also provided. in the following comments y denotes y(iy:iy+m-1,jy:jy+k-1 greater performance can be achieved if adequate workspace is provided. on the other hand, in some situations increases above the workspace amount shown below: greater performance can be achieved if adequate workspace is provided. on the other hand, in some situations increases above the workspace amount shown below: pssyngst also calls pshegst when insufficient workspace is provided, hence pssyngst provides improve codes (either the serial, ssytrd, or the parallel code, pssyttrd) when the workspace provided by the user is adequate the following options are provided 1. if trans = 'n' and m >= n: find the least squares solution of error bounds on the solution and a condition estimate are also provided notes pzhengst also calls pzhegst when insufficient workspace is provided, hence pzhengst provides improve codes (either the serial, zhetrd, or the parallel code, pzhettrd) when the workspace provided by the user is adequate error bounds on the solution and a condition estimate are also provided. in the following comments y denotes y(iy:iy+m-1,jy:jy+k-1 |
| provides provides pcgerfs improves the computed solution to a system of linear equations and provides error bounds and backward error estimates fo ifail (output) integer array, dimension (n) ifail provides additional information when info .ne. the smallest minor which is not positive definite. pchengst calls pchegst when uplo='u', hence pchengst provides equations when the coefficient matrix is hermitian positive definite and provides error bounds and backward error estimates for th pctrrfs provides error bounds and backward error estimates for th coefficient matrix. pdgerfs improves the computed solution to a system of linear equations and provides error bounds and backward error estimates fo equations when the coefficient matrix is symmetric positive definite and provides error bounds and backward error estimates for th ifail (output) integer array, dimension (n) ifail provides additional information when info .ne. the smallest minor which is not positive definite. pdsyngst calls pdhegst when uplo='u', hence pdhengst provides pdtrrfs provides error bounds and backward error estimates for th coefficient matrix. this version provides a set of parameters which should give good computers. users are encouraged to modify this subroutine to set psgerfs improves the computed solution to a system of linear equations and provides error bounds and backward error estimates fo equations when the coefficient matrix is symmetric positive definite and provides error bounds and backward error estimates for th ifail (output) integer array, dimension (n) ifail provides additional information when info .ne. the smallest minor which is not positive definite. pssyngst calls pshegst when uplo='u', hence pshengst provides pstrrfs provides error bounds and backward error estimates for th coefficient matrix. pzgerfs improves the computed solution to a system of linear equations and provides error bounds and backward error estimates fo ifail (output) integer array, dimension (n) ifail provides additional information when info .ne. the smallest minor which is not positive definite. pzhengst calls pzhegst when uplo='u', hence pzhengst provides equations when the coefficient matrix is hermitian positive definite and provides error bounds and backward error estimates for th pztrrfs provides error bounds and backward error estimates for th coefficient matrix. |
| providing providing relationship between workspace, orthogonality & performance: if clustersize >= n/sqrt(nprow*npcol), then providing orthogonally will cause serious degradation in relationship between workspace, orthogonality & performance: if clustersize >= n/sqrt(nprow*npcol), then providing orthogonally will cause serious degradation in if clustersize >= n/sqrt(nprow*npcol), then providing orthogonally will cause serious degradation in if clustersize >= n/sqrt(nprow*npcol), then providing orthogonally will cause serious degradation in if clustersize >= n/sqrt(nprow*npcol), then providing orthogonally will cause serious degradation in if clustersize >= n/sqrt(nprow*npcol), then providing orthogonally will cause serious degradation in relationship between workspace, orthogonality & performance: if clustersize >= n/sqrt(nprow*npcol), then providing orthogonally will cause serious degradation in relationship between workspace, orthogonality & performance: if clustersize >= n/sqrt(nprow*npcol), then providing orthogonally will cause serious degradation in |
| PRT PRT max(wcbdsqr, max(wantu*wpcormbrqln, wantvt*wpcormbrPRT)) where max(wdbdsqr, max(wantu*wpdormbrqln, wantvt*wpdormbrPRT)) where max(wsbdsqr, max(wantu*wpsormbrqln, wantvt*wpsormbrPRT)) where max(wzbdsqr, max(wantu*wpzormbrqln, wantvt*wpzormbrPRT)) where |
| PSCASUM PSCASUM based on PSCASUM from the level 1 pblas. the change i |
| PSCSUM1 PSCSUM1 PSCSUM1 returns the sum of absolute values of a comple |
| PSDBSV PSDBSV PSDBSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PSDBTRF PSDBTRF see PSDBTRF and psdbtrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PSDBTRF banded diagonally dominant-like distributed |
| PSDBTRS PSDBTRS see psdbtrf and PSDBTRS for details ===================================================================== PSDBTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PSDTSV PSDTSV PSDTSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PSDTTRF PSDTTRF see PSDTTRF and psdttrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PSDTTRF tridiagonal diagonally dominant-like distributed |
| PSDTTRS PSDTTRS see psdttrf and PSDTTRS for details ===================================================================== PSDTTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PSGBSV PSGBSV PSGBSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PSGBTRF PSGBTRF see PSGBTRF and psgbtrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PSGBTRF banded distributed |
| PSGBTRS PSGBTRS see psgbtrf and PSGBTRS for details ===================================================================== PSGBTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PSGEBD2 PSGEBD2 PSGEBD2 reduces a real general m-by-n distributed matri form b by an orthogonal transformation: q' * sub( a ) * p = b. |
| PSGEBRD PSGEBRD PSGEBRD reduces a real general m-by-n distributed matri form b by an orthogonal transformation: q' * sub( a ) * p = b. watobd = max(max(wpslange,wPSGEBRD) this is an auxiliary routine called by PSGEBRD notes here q and p**t are the orthogonal distributed matrices determined by PSGEBRD when reducing a real distributed matrix a(ia:*,ja:*) t as products of elementary reflectors h(i) and g(i) respectively. |
| PSGECON PSGECON PSGECON estimates the reciprocal of the condition number of a genera or the infinity-norm, using the lu factorization computed by psgetrf. lwork is local input and must be at least lwork = max( PSGECON( lwork ), psgerfs( lwork ) |
| PSGEEQU PSGEEQU PSGEEQU computes row and column scalings intended to equilibrate a reduce its condition number. r returns the row scale factors and c |
| PSGEHD2 PSGEHD2 PSGEHD2 reduces a real general distributed matrix sub( a tion: q' * sub( a ) * q = h, where |
| PSGEHRD PSGEHRD PSGEHRD reduces a real general distributed matrix sub( a tion: q' * sub( a ) * q = h, where this is an auxiliary routine called by PSGEHRD. in the followin nq = m if side = 'l' and nq = n if side = 'r'. q is defined as the product of ihi-ilo elementary reflectors, as returned by PSGEHRD q = h(ilo) h(ilo+1) . . . h(ihi-1). |
| PSGELQ2 PSGELQ2 PSGELQ2 computes a lq factorization of a real distributed m-by- |
| PSGELQF PSGELQF PSGELQF computes a lq factorization of a real distributed m-by- if m < n, sub( a ) is overwritten by details of its lq factorization as returned by PSGELQF ia (global input) integer as returned by PSGELQF notes as returned by PSGELQF notes as returned by PSGELQF. q is of order m if side = 'l' and of order as returned by PSGELQF. q is of order m if side = 'l' and of order |
| PSGELS PSGELS PSGELS solves overdetermined or underdetermined real linea or its transpose, using a qr or lq factorization of sub( a ). it is |
| PSGEMR2D PSGEMR2D although all processes call PSGEMR2D, only the processes that ow first column of b receive data. the calls to sgebs2d/sgebr2d |
| PSGEQL2 PSGEQL2 PSGEQL2 computes a ql factorization of a real distributed m-by- |
| PSGEQLF PSGEQLF PSGEQLF computes a ql factorization of a real distributed m-by- as returned by PSGEQLF notes as returned by PSGEQLF notes as returned by PSGEQLF. q is of order m if side = 'l' and of order as returned by PSGEQLF. q is of order m if side = 'l' and of order |
| PSGEQPF PSGEQPF PSGEQPF computes a qr factorization with column pivoting of |
| PSGEQR2 PSGEQR2 PSGEQR2 computes a qr factorization of a real distributed m-by- |
| PSGEQRF PSGEQRF if m >= n, sub( a ) is overwritten by details of its qr factorization as returned by PSGEQRF factorization as returned by psgelqf. PSGEQRF computes a qr factorization of a real distributed m-by- as returned by PSGEQRF notes as returned by PSGEQRF notes as returned by PSGEQRF. q is of order m if side = 'l' and of order as returned by PSGEQRF. q is of order m if side = 'l' and of order |
| PSGERFS PSGERFS PSGERFS improves the computed solution to a system of linea the solutions. lwork is local input and must be at least lwork = max( psgecon( lwork ), PSGERFS( lwork ) |
| PSGERQ2 PSGERQ2 PSGERQ2 computes a rq factorization of a real distributed m-by- |
| PSGERQF PSGERQF PSGERQF computes a rq factorization of a real distributed m-by- as returned by PSGERQF notes as returned by PSGERQF notes as returned by PSGERQF. q is of order m if side = 'l' and of order as returned by PSGERQF. q is of order m if side = 'l' and of order |
| PSGESV PSGESV PSGESV computes the solution to a real system of linear equation sub( a ) * x = sub( b ), |
| PSGESVD PSGESVD PSGESVD computes the singular value decomposition (svd) of a singular vectors. the svd is written as |
| PSGESVX PSGESVX PSGESVX uses the lu factorization to compute the solution to a rea |
| PSGETF2 PSGETF2 PSGETF2 computes an lu factorization of a general m-by- partial pivoting with row interchanges. |
| PSGETRF PSGETRF distributed real matrix a(ia:ia+n-1,ja:ja+n-1), in either the 1-norm or the infinity-norm, using the lu factorization computed by PSGETRF an estimate is obtained for norm(inv(a(ia:ia+n-1,ja:ja+n-1))), and factors of the matrix sub( a ) = p * l * u as computed by PSGETRF iaf (global input) integer entry contains the factors l and u from the factorization a(ia:ia+n-1,ja:ja+n-1) = p*l*u as computed by PSGETRF equilibrated matrix a(ia:ia+n-1,ja:ja+n-1). PSGETRF computes an lu factorization of a general m-by-n distribute row interchanges. psgetri computes the inverse of a distributed matrix using the lu factorization computed by PSGETRF. this method inverts u and the inva by solving the system inva*l = inv(u) for inva. with a general n-by-n distributed matrix sub( a ) using the lu factorization computed by PSGETRF sub( b ) denotes b(ib:ib+n-1,jb:jb+nrhs-1). |
| PSGETRI PSGETRI PSGETRI computes the inverse of a distributed matrix using the l computes the inverse of sub( a ) = a(ia:ia+n-1,ja:ja+n-1) denoted |
| PSGETRS PSGETRS PSGETRS solves a system of distributed linear equation op( sub( a ) ) * x = sub( b ) |
| PSGGQRF PSGGQRF PSGGQRF computes a generalized qr factorization o an n-by-p matrix sub( b ) = b(ib:ib+n-1,jb:jb+p-1): |
| PSGGRQF PSGGRQF PSGGRQF computes a generalized rq factorization o and a p-by-n matrix sub( b ) = b(ib:ib+p-1,jb:jb+n-1): |
| PSHEGST PSHEGST pssyngst performs the same function as PSHEGST, but is based o triangular solves (the basis of pssyngst). |
| PSHENGST PSHENGST pssyngst calls pshegst when uplo='u', hence PSHENGST provide |
| PSHETTRD PSHETTRD pssyttrd is not intended to be called directly. all users are encourage to call pssytrd which will then call PSHETTRD i the process grid must be square ( i.e. nprow = npcol ) and |
| PSLABAD PSLABAD PSLABAD takes as input the values computed by pslamch for underflo the log of large is sufficiently large. this subroutine is intended |
| PSLABRD PSLABRD PSLABRD reduces the first nb rows and columns of a real genera or lower bidiagonal form by an orthogonal transformation q' * a * p, |
| PSLACON PSLACON PSLACON estimates the 1-norm of a square, real distributed matrix a x and v are aligned with the distributed matrix a, this information |
| PSLACONSB PSLACONSB PSLACONSB looks for two consecutive small subdiagonal elements b given by h44, h33, & h43h34 and see if this would make a look for two consecutive small subdiagonal elements: PSLACONSB is the routine that does this |
| PSLACP2 PSLACP2 PSLACP2 copies all or part of a distributed matrix a to anothe performs a local copy sub( a ) := sub( b ), where sub( a ) denotes |
| PSLACP3 PSLACP3 PSLACP3 is an auxiliary routine that copies from a global paralle the entire submatrix that is copied gets placed on one node or |
| PSLACPY PSLACPY PSLACPY copies all or part of a distributed matrix a to anothe performs a local copy sub( a ) := sub( b ), where sub( a ) denotes |
| PSLAEBZ PSLAEBZ PSLAEBZ contains the iteration loop which computes the eigenvalue j = 1,...,minp. it uses and computes the function n(w), which is will, in general, be reordered on output. see the comments in PSLAEBZ for more on the function n(w) nval (input/output) integer array, dimension (2*(kl-kf)) |
| PSLAECV PSLAECV PSLAECV checks if the input intervals [ intvl(2*i-1), intvl(2*i) ] pslaecv modifies kf to be the index of the last converged interval, |
| PSLAED0 PSLAED0 PSLAED0 computes all eigenvalues and corresponding eigenvectors of |
| PSLAED1 PSLAED1 PSLAED1 computes the updated eigensystem of a diagona in parallel. nn (global output) integer, the order of matrix u, (PSLAED1) nn2 (global output) integer, the order of matrix q2, (pslaed1). |
| PSLAED2 PSLAED2 secular equation problem is reduced by one. this stage is performed by the routine PSLAED2 the second stage consists of calculating the updated PSLAED2 sorts the two sets of eigenvalues together into a singl there are two ways in which deflation can occur: when two or more |
| PSLAED3 PSLAED3 > 0: if info = 1 through n, the i(th) eigenvalue did not converge in PSLAED3 alignment requirements eigenvalues. this is done by finding the roots of the secular equation via the routine slaed4 (as called by PSLAED3) problem. on exit, rho has been modified to the value required by PSLAED3 z (global input) real array, dimension (n) PSLAED3 finds the roots of the secular equation, as defined by th appropriate calls to slaed4 |
| PSLAEVSWP PSLAEVSWP PSLAEVSWP moves the eigenvectors (potentially unsorted) fro array, sorted so that the corresponding eigenvalues are sorted. |
| pslahqr pslahqr following differs in comparison to pslahqr |
| PSLAHRD PSLAHRD PSLAHRD reduces the first nb columns of a real general n-by-(n-k+1 k-th subdiagonal are zero. the reduction is performed by an orthogo- |
| pslaiect pslaiect the appropriate slmake.inc file to include the compiler switch -dno_ieee. this switch only affects the compilation of pslaiect.c arguments the appropriate slmake.inc file to include the compiler switch -dno_ieee. this switch only affects the compilation of pslaiect.c arguments |
| PSLAMCH PSLAMCH abstol (global input) real if jobz='v', setting abstol to PSLAMCH( context, 'u') yield abstol (global input) real if jobz='v', setting abstol to PSLAMCH( context, 'u') yield pslabad takes as input the values computed by PSLAMCH for underflo the log of large is sufficiently large. this subroutine is intended PSLAMCH determines single precision machine parameters arguments note : if eigenvectors are desired later by inverse iteration
( psstein ), abstol should be set to 2*PSLAMCH('s')
d (global input) real array, dimension (n)
abstol (global input) real if jobz='v', setting abstol to PSLAMCH( context, 'u') yield abstol (global input) real if jobz='v', setting abstol to PSLAMCH( context, 'u') yield |
| PSLAMR1D PSLAMR1D PSLAMR1D has not been tested except withint the contect o |
| PSLANGE PSLANGE watobd = max(max(wPSLANGE,wpsgebrd) PSLANGE returns the value of the one norm, or the frobenius norm distributed matrix sub( a ) = a(ia:ia+m-1, ja:ja+n-1). |
| PSLAPDCT PSLAPDCT without overflow. see PSLAPDCT for the "paranoid" implementation of the stur PSLAPDCT counts the number of negative eigenvalues of (t - sigma i) the innermost loop to avoid overflow and determine the sign of a |
| PSLAPIV PSLAPIV PSLAPIV applies either p (permutation matrix indicated by ipiv sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column same mb (or nb) block. if you want to pivot a full matrix, use PSLAPIV notes |
| PSLAPV2 PSLAPV2 PSLAPV2 applies either p (permutation matrix indicated by ipiv a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the |
| PSLAQGE PSLAQGE PSLAQGE equilibrates a general m-by-n distributed matri factors in the vectors r and c. |
| PSLAQSY PSLAQSY PSLAQSY equilibrates a symmetric distributed matri vectors sr and sc. |
| PSLARED1D PSLARED1D workspaces required respectively for the subprograms pclange, PSLARED1D, pslared2d, pcgebrd. using th where wpslange, wPSLARED1D, wpslared2d, wpsgebrd are th pslange, pslared1d, pslared2d, psgebrd. using the PSLARED1D redistributes a 1d arra it assumes that the input array, bycol, is distributed across |
| PSLARED2D PSLARED2D workspaces required respectively for the subprograms pclange, pslared1d, PSLARED2D, pcgebrd. using th watobd = max(max(wpslange,wpsgebrd), max(wPSLARED2D,wp(pre)lared1d)) where wpslange, wpslared1d, wpslared2d, wpsgebrd are the PSLARED2D redistributes a 1d arra it assumes that the input array, byrow, is distributed across |
| PSLARFB PSLARFB PSLARFB applies a real block reflector q or its transpose q**t to from the left or the right. |
| PSLARFG PSLARFG PSLARFG generates a real elementary reflector h of order n, suc |
| PSLARFT PSLARFT PSLARFT forms the triangular factor t of a real block reflector |
| PSLARZB PSLARZB PSLARZB applies a real block reflector q or its transpose q**t t from the left or the right. |
| PSLARZT PSLARZT PSLARZT forms the triangular factor t of a real block reflecto reflectors as returned by pstzrzf. |
| PSLASCL PSLASCL PSLASCL multiplies the m-by-n real distributed matrix sub( a is done without over/underflow as long as the final result |
| PSLASE2 PSLASE2 PSLASE2 initializes an m-by-n distributed matrix sub( a ) denotin offdiagonals. pslase2 requires that only dimension of the matrix |
| PSLASET PSLASET PSLASET initializes an m-by-n distributed matrix sub( a ) denotin offdiagonals. |
| PSLASMSUB PSLASMSUB PSLASMSUB looks for a small subdiagonal element from the botto |
| PSLASRT PSLASRT PSLASRT sort the numbers in d in increasing order and th |
| PSLASSQ PSLASSQ PSLASSQ returns the values scl and smsq such tha ( scl**2 )*smsq = x( 1 )**2 +...+ x( n )**2 + ( scale**2 )*sumsq, |
| PSLASWP PSLASWP PSLASWP performs a series of row or column interchanges o interchange is initiated for each of rows or columns k1 trough k2 of |
| PSLATRA PSLATRA PSLATRA computes the trace of an n-by-n distributed matrix sub( a process of the grid. |
| PSLATRD PSLATRD PSLATRD reduces nb rows and columns of a real symmetric distribute form by an orthogonal similarity transformation q' * sub( a ) * q, |
| PSLATRZ PSLATRZ PSLATRZ reduces the m-by-n ( m<=n ) real upper trapezoidal matri upper triangular form by means of orthogonal transformations. |
| PSLAUU2 PSLAUU2 PSLAUU2 computes the product u * u' or l' * l, where the triangula the matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). |
| PSLAUUM PSLAUUM PSLAUUM computes the product u * u' or l' * l, where the triangula the distributed matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). |
| PSLAWIL PSLAWIL PSLAWIL gets the transform given by h44,h33, & h43h34 into |
| PSM PSM PSM (workspace) integer array, dimension( npcol, 4 npcol (global input) integer PSM (workspace) integer array, dimension( npcol, 4 npcol (global input) integer |
| PSORG2L PSORG2L PSORG2L generates an m-by-n real distributed matrix q denotin the last n columns of a product of k elementary reflectors of order m |
| PSORG2R PSORG2R PSORG2R generates an m-by-n real distributed matrix q denotin the first n columns of a product of k elementary reflectors of order |
| PSORGL2 PSORGL2 PSORGL2 generates an m-by-n real distributed matrix q denotin the first m rows of a product of k elementary reflectors of order n |
| PSORGLQ PSORGLQ PSORGLQ generates an m-by-n real distributed matrix q denotin the first m rows of a product of k elementary reflectors of order n |
| PSORGQL PSORGQL PSORGQL generates an m-by-n real distributed matrix q denotin the last n columns of a product of k elementary reflectors of order m |
| PSORGQR PSORGQR a(ia+i:ia+n-1,ja+i-1), and taua in taua(ja+i-1). to form q explicitly, use scalapack subroutine PSORGQR b(ib+i:ib+p-1,jb+i-1), and taub in taub(jb+i-1). to form z explicitly, use scalapack subroutine PSORGQR PSORGQR generates an m-by-n real distributed matrix q denotin the first n columns of a product of k elementary reflectors of order |
| PSORGR2 PSORGR2 PSORGR2 generates an m-by-n real distributed matrix q denotin last m rows of a product of k elementary reflectors of order n |
| PSORGRQ PSORGRQ b(ib+n-k+i-1,jb:jb+p-k+i-2), and taub in taub(ib+n-k+i-1). to form z explicitly, use scalapack subroutine PSORGRQ a(ia+m-k+i-1,ja:ja+n-k+i-2), and taua in taua(ia+m-k+i-1). to form q explicitly, use scalapack subroutine PSORGRQ PSORGRQ generates an m-by-n real distributed matrix q denotin last m rows of a product of k elementary reflectors of order n |
| PSORM2L PSORM2L PSORM2L overwrites the general real m-by-n distributed matri |
| PSORM2R PSORM2R PSORM2R overwrites the general real m-by-n distributed matri |
| PSORMBR PSORMBR max(wsbdsqr, max(wantu*wPSORMBRqln, wantvt*wpsormbrprt)) where if vect = 'q', PSORMBR overwrites the general real distributed m-by- |
| PSORMHR PSORMHR PSORMHR overwrites the general real m-by-n distributed matri |
| PSORML2 PSORML2 PSORML2 overwrites the general real m-by-n distributed matri |
| PSORMLQ PSORMLQ PSORMLQ overwrites the general real m-by-n distributed matri |
| PSORMQL PSORMQL PSORMQL overwrites the general real m-by-n distributed matri |
| PSORMQR PSORMQR to form q explicitly, use scalapack subroutine psorgqr. to use q to update another matrix, use scalapack subroutine PSORMQR the matrix z is represented as a product of elementary reflectors to form z explicitly, use scalapack subroutine psorgqr. to use z to update another matrix, use scalapack subroutine PSORMQR alignment requirements PSORMQR overwrites the general real m-by-n distributed matri |
| PSORMR2 PSORMR2 PSORMR2 overwrites the general real m-by-n distributed matri |
| PSORMR3 PSORMR3 PSORMR3 overwrites the general real m-by-n distributed matri |
| PSORMRQ PSORMRQ to form z explicitly, use scalapack subroutine psorgrq. to use z to update another matrix, use scalapack subroutine PSORMRQ alignment requirements to form q explicitly, use scalapack subroutine psorgrq. to use q to update another matrix, use scalapack subroutine PSORMRQ the matrix z is represented as a product of elementary reflectors PSORMRQ overwrites the general real m-by-n distributed matri |
| PSORMRZ PSORMRZ PSORMRZ overwrites the general real m-by-n distributed matri |
| PSORMTR PSORMTR PSORMTR overwrites the general real m-by-n distributed matri ldc = max( 1, nrc ) sizemqrleft = the workspace requirement for PSORMTR |
| PSPBSV PSPBSV PSPBSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PSPBTRF PSPBTRF see PSPBTRF and pspbtrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PSPBTRF banded symmetric positive definite distributed |
| PSPBTRS PSPBTRS see pspbtrf and PSPBTRS for details ===================================================================== PSPBTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PSPOCON PSPOCON PSPOCON estimates the reciprocal of the condition number (in th using the cholesky factorization a = u**t*u or a = l*l**t computed by lwork is local input and must be at least lwork = max( PSPOCON( lwork ), psporfs( lwork ) lwork = 3*desca( lld_ ) |
| PSPOEQU PSPOEQU PSPOEQU computes row and column scalings intended t sub( a ) = a(ia:ia+n-1,ja:ja+n-1) and reduce its condition number |
| PSPORFS PSPORFS PSPORFS improves the computed solution to a system of linea and provides error bounds and backward error estimates for the lwork is local input and must be at least lwork = max( pspocon( lwork ), PSPORFS( lwork ) lwork = 3*desca( lld_ ) |
| PSPOSV PSPOSV PSPOSV computes the solution to a real system of linear equation sub( a ) * x = sub( b ), |
| PSPOSVX PSPOSVX PSPOSVX uses the cholesky factorization a = u**t*u or a = l*l**t t |
| PSPOTF2 PSPOTF2 PSPOTF2 computes the cholesky factorization of a real symmetri |
| PSPOTRF PSPOTRF using the cholesky factorization a = u**t*u or a = l*l**t computed by PSPOTRF an estimate is obtained for norm(inv(a(ia:ia+n-1,ja:ja+n-1))), and cholesky factorization sub( a ) = l*l**t or u**t*u, as computed by PSPOTRF iaf (global input) integer PSPOTRF computes the cholesky factorization of an n-by-n rea a(ia:ia+n-1, ja:ja+n-1). cholesky factorization sub( a ) = u**t*u or l*l**t computed by PSPOTRF notes symmetric positive definite distributed matrix using the cholesky factorization sub( a ) = u**t*u or l*l**t computed by PSPOTRF sub( b ) must have been previously factorized as u**t*u or l*l**t by PSPOTRF notes sub( b ) must have been previously factorized as u**t*u or l*l**t by PSPOTRF notes sub( b ) must have been previously factorized as u**h*u or l*l**h by PSPOTRF notes |
| PSPOTRI PSPOTRI PSPOTRI computes the inverse of a real symmetric positive definit cholesky factorization sub( a ) = u**t*u or l*l**t computed by |
| PSPOTRS PSPOTRS PSPOTRS solves a system of linear equation sub( a ) * x = sub( b ) |
| PSPTSV PSPTSV PSPTSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PSPTTRF PSPTTRF see PSPTTRF and pspttrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PSPTTRF tridiagonal symmetric positive definite distributed |
| PSPTTRS PSPTTRS see pspttrf and PSPTTRS for details ===================================================================== PSPTTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PSPTTRSV PSPTTRSV end of PSPTTRSV |
| PSRSCL PSRSCL PSRSCL multiplies an n-element real distributed vector sub( x ) b long as the final result sub( x )/a does not overflow or underflow. |
| PSSTEBZ PSSTEBZ from smallest to largest within the block (the output array w from PSSTEBZ with order='b' is expected here). thi on output, the first m elements contain the input PSSTEBZ computes the eigenvalues of a symmetric tridiagonal matrix i the interval [vl, vu], or the eigenvalues indexed il through iu. a from smallest to largest within the block (the output array w from PSSTEBZ with order='b' is expected here). thi on output, the first m elements contain the input computed is returned in nz. if (mod(info/8,2).ne.0), then PSSTEBZ failed to comput send e-mail to scalapack@cs.utk.edu computed is returned in nz. if (mod(info/8,2).ne.0), then PSSTEBZ failed t send e-mail to scalapack@cs.utk.edu |
| PSSTEDC PSSTEDC ======= PSSTEDC computes all eigenvalues and eigenvectors of conquer algorithm. |
| PSSTEIN PSSTEIN note : if eigenvectors are desired later by inverse iteration
( PSSTEIN ), abstol should be set to 2*pslamch('s')
d (global input) real array, dimension (n)
PSSTEIN computes the eigenvectors of a symmetric tridiagonal matri correspond to user specified eigenvalues. psstein does not performance. in the limit (i.e. clustersize = n-1) PSSTEIN will perform no better than sstein on for clustersize = n/sqrt(nprow*npcol) reorthogonalizing performance. in the limit (i.e. clustersize = n-1) PSSTEIN will perform no better than sstein on 1 processor all eigenvectors will increase the total execution time |
| PSSYEV PSSYEV PSSYEV computes all eigenvalues and, optionally, eigenvector of scalapack routines. |
| PSSYEVD PSSYEVD PSSYEVD computes all the eigenvalues and eigenvector of scalapack routines. |
| PSSYEVX PSSYEVX PSSYEVX computes selected eigenvalues and, optionally, eigenvector of scalapack routines. eigenvalues/vectors can be selected by |
| PSSYGS2 PSSYGS2 PSSYGS2 reduces a real symmetric-definite generalized eigenproble |
| PSSYGST PSSYGST PSSYGST reduces a real symmetric-definite generalized eigenproble |
| PSSYGVX PSSYGVX PSSYGVX computes all the eigenvalues, and optionally of a real generalized sy-definite eigenproblem, of the form |
| PSSYNGST PSSYNGST PSSYNGST reduces a complex hermitian-definite generalize |
| PSSYNTRD PSSYNTRD PSSYNTRD is a prototype version of pssytrd which uses tailore when the workspace provided by the user is adequate. |
| PSSYPTRD PSSYPTRD pslamr1d has not been tested except withint the contect of PSSYPTRD, the prototype reduction to tridiagonal form code purpose |
| PSSYTD2 PSSYTD2 PSSYTD2 reduces a real symmetric matrix sub( a ) to symmetri q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1). |
| PSSYTRD PSSYTRD this is an auxiliary routine called by PSSYTRD to redistribute d, this is an auxiliary routine called by PSSYTRD notes nq = m if side = 'l' and nq = n if side = 'r'. q is defined as the product of nq-1 elementary reflectors, as returned by PSSYTRD if uplo = 'u', q = h(nq-1) . . . h(2) h(1); where sizesytrd = the workspace requirement for PSSYTRD if eigenvectors are requested (jobz = 'v' ) then support for uplo='u' is limited to calling the old, slow, PSSYTRD PSSYTRD reduces a real symmetric matrix sub( a ) to symmetri q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1). pssyttrd is not intended to be called directly. all users are encourage to call PSSYTRD which will then call pshettrd i the process grid must be square ( i.e. nprow = npcol ) and |
| PSSYTTRD PSSYTTRD anb = pjlaenv( desca( ctxt_), 3, 'PSSYTTRD', 'l' sqnpc = int( sqrt( dble( nprow * npcol ) ) ) anb = pjlaenv( desca( ctxt_), 3, 'PSSYTTRD', 'l' sqnpc = int( sqrt( dble( nprow * npcol ) ) ) pssyntrd is a prototype version of pssytrd which uses tailored codes (either the serial, ssytrd, or the parallel code, PSSYTTRD PSSYTTRD reduces a complex hermitian matrix sub( a ) to hermitia q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1). |
| PSTRCON PSTRCON PSTRCON estimates the reciprocal of the condition number of 1-norm or the infinity-norm. |
| PSTRRFS PSTRRFS PSTRRFS provides error bounds and backward error estimates for th coefficient matrix. |
| PSTRTI2 PSTRTI2 PSTRTI2 computes the inverse of a real upper or lower triangula contained in one and only one process memory space (local operation). |
| PSTRTRI PSTRTRI PSTRTRI computes the inverse of a upper or lower triangula |
| PSTRTRS PSTRTRS the solution matrix x must be computed by PSTRTRS or some othe refinement because doing so cannot improve the backward error. PSTRTRS solves a triangular system of the for sub( a ) * x = sub( b ) or sub( a )**t * x = sub( b ), |
| PSTZRZF PSTZRZF q is a product of k elementary reflectors as returned by PSTZRZF currently, only storev = 'r' and direct = 'b' are supported. h of order > n, which is defined as a product of k elementary reflectors as returned by PSTZRZF if direct = 'f', h = h(1) h(2) . . . h(k) and t is upper triangular; as returned by PSTZRZF. q is of order m if side = 'l' and of order as returned by PSTZRZF. q is of order m if side = 'l' and of order PSTZRZF reduces the m-by-n ( m<=n ) real upper trapezoidal matri of orthogonal transformations. |
| publicly publicly arithmetic, this needs to be larger. the default for publicly released versions should be large enough to handl on the accuracy of the solution. arithmetic, this needs to be larger. the default for publicly released versions should be large enough to handl on the accuracy of the solution. |
| Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose ======= Purpose ======= Purpose ======= Purpose Purpose Purpose ======= Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose ======= Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose ======= Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose pdstedc computes all eigenvalues and eigenvectors of a Purpose Purpose Purpose Purpose Purpose Purpose Purpose ======= Purpose ======= Purpose ======= Purpose Purpose Purpose ======= Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose ======= Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose ======= Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose psstedc computes all eigenvalues and eigenvectors of a Purpose Purpose Purpose Purpose Purpose Purpose Purpose ======= Purpose ======= Purpose Purpose Purpose Purpose ======= Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose ======= Purpose ======= Purpose ======= Purpose Purpose Purpose ======= Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose ======= Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose Purpose |
| put put if remaining matrix is 2-by-2, use slae2 or slaev2 to compute its eigensystem test the input parameter test the input parameter test the input parameter test the input parameter test the input parameter test the input parameter test the input parameter test the input parameter test the input parameter test the input parameter test the input parameter test the input parameter test the input parameter test the input parameter test the input parameter test the input parameter test the input parameter test the input parameter test the input parameter test the input parameter if remaining matrix is 2-by-2, use dlae2 or dlaev2 to compute its eigensystem |
| puts puts buted matrix b with elements b(i,j) = s(i)*a(i,j)*s(j) has ones on the diagonal. this choice of sr and sc puts the condition numbe over all possible diagonal scalings. buted matrix b with elements b(i,j) = s(i)*a(i,j)*s(j) has ones on the diagonal. this choice of sr and sc puts the condition numbe over all possible diagonal scalings. buted matrix b with elements b(i,j) = s(i)*a(i,j)*s(j) has ones on the diagonal. this choice of sr and sc puts the condition numbe over all possible diagonal scalings. buted matrix b with elements b(i,j) = s(i)*a(i,j)*s(j) has ones on the diagonal. this choice of sr and sc puts the condition numbe over all possible diagonal scalings. |
| Px1 Px1 dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok ok ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok ok ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok ok ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok ok ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok ok ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok ok ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok ok ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok ok ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok ok ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok ok ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok ok ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok ok ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok ok ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok ok ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok no ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok ok ok no dtype 501 502 1 1 blacs grid 1xp or Px1 1xp or px1 1xp px a ok ok ok no |
| PXERBLA PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) real array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) real array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) real array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer as the first element of work and no error message is issued by PXERBLA rwork (workspace) real array, dimension (1+4*sizeb) values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) real array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer as the first element of work and no error message is issued by PXERBLA rwork (local workspace/output) complex array, entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/output) real array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/output) real array, in the first entry of the correspondingwork array, and no error message is issued by PXERBLA rwork (local workspace/output) real array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) real array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) real array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) real array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/global output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) real array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) real array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer as the first element of work and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace) integer array, dimension ( max( 4*n, 14 ) ) as the first element of work and no error message is issued by PXERBLA iwork (local workspace/output) integer array, dimension (liwork) values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/global output) integer array, as the first element of work and no error message is issued by PXERBLA info (global output) integer as the first element of work and no error message is issued by PXERBLA iwork (local workspace/output) integer array, dimension (liwork) corresponding work arrays, and no error message is issued by PXERBLA iwork (local workspace) integer array corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer as the first element of work and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace) integer array, dimension ( max( 4*n, 14 ) ) as the first element of work and no error message is issued by PXERBLA iwork (local workspace/output) integer array, dimension (liwork) values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/global output) integer array, as the first element of work and no error message is issued by PXERBLA info (global output) integer as the first element of work and no error message is issued by PXERBLA iwork (local workspace/output) integer array, dimension (liwork) corresponding work arrays, and no error message is issued by PXERBLA iwork (local workspace) integer array corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) double precision array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) double precision array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) double precision array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer as the first element of work and no error message is issued by PXERBLA rwork (workspace) real array, dimension (1+4*sizeb) values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) double precision array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/local output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer as the first element of work and no error message is issued by PXERBLA rwork (local workspace/output) complex*16 array, entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/output) double precision array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/output) double precision array, in the first entry of the correspondingwork array, and no error message is issued by PXERBLA rwork (local workspace/output) double precision array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (local output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) double precision array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) double precision array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) double precision array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA iwork (local workspace/global output) integer array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) double precision array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA rwork (local workspace/local output) double precision array, values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA info (global output) integer values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA |
| pXYYtevx pXYYtevx pXYYtevx.f and pxyytgvx.f and pxyyttrd.f are the only codes whic the data to the best data layout for each transformation. pxyyttrd.f |
| pXYYtgvx pXYYtgvx pxyytevx.f and pXYYtgvx.f and pxyyttrd.f are the only codes whic the data to the best data layout for each transformation. pxyyttrd.f |
| pXYYttrd pXYYttrd pxyytevx.f and pxyytgvx.f and pXYYttrd.f are the only codes whic the data to the best data layout for each transformation. pxyyttrd.f |
| PZAMAX PZAMAX based on PZAMAX from level 1 pblas. the change is to use th |
| PZDBSV PZDBSV PZDBSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PZDBTRF PZDBTRF see PZDBTRF and pzdbtrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PZDBTRF banded diagonally dominant-like distributed |
| PZDBTRS PZDBTRS see pzdbtrf and PZDBTRS for details ===================================================================== PZDBTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PZDOTU PZDOTU if the scaling needed for a in the dot product is 1, call PZDOTU to perform the dot product |
| PZDRSCL PZDRSCL PZDRSCL multiplies an n-element complex distributed vecto underflow as long as the final sub( x )/a does not overflow or |
| PZDTSV PZDTSV PZDTSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PZDTTRF PZDTTRF see PZDTTRF and pzdttrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PZDTTRF tridiagonal diagonally dominant-like distributed |
| PZDTTRS PZDTTRS see pzdttrf and PZDTTRS for details ===================================================================== PZDTTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PZGBSV PZGBSV PZGBSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PZGBTRF PZGBTRF see PZGBTRF and pzgbtrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PZGBTRF banded distributed |
| PZGBTRS PZGBTRS see pzgbtrf and PZGBTRS for details ===================================================================== PZGBTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PZGEBD2 PZGEBD2 PZGEBD2 reduces a complex general m-by-n distributed matri form b by an unitary transformation: q' * sub( a ) * p = b. |
| PZGEBRD PZGEBRD PZGEBRD reduces a complex general m-by-n distributed matri form b by an unitary transformation: q' * sub( a ) * p = b. watobd = max(max(wpzlange,wPZGEBRD) this is an auxiliary routine called by PZGEBRD notes here q and p**h are the unitary distributed matrices determined by PZGEBRD when reducing a complex distributed matrix a(ia:*,ja:*) t as products of elementary reflectors h(i) and g(i) respectively. |
| PZGECON PZGECON PZGECON estimates the reciprocal of the condition number of a genera 1-norm or the infinity-norm, using the lu factorization computed by lwork is local input and must be at least lwork = max( PZGECON( lwork ), pzgerfs( lwork ) |
| PZGEEQU PZGEEQU PZGEEQU computes row and column scalings intended to equilibrate a reduce its condition number. r returns the row scale factors and c |
| PZGEHD2 PZGEHD2 PZGEHD2 reduces a complex general distributed matrix sub( a q' * sub( a ) * q = h, where |
| PZGEHRD PZGEHRD PZGEHRD reduces a complex general distributed matrix sub( a q' * sub( a ) * q = h, where this is an auxiliary routine called by PZGEHRD. in the followin nq = m if side = 'l' and nq = n if side = 'r'. q is defined as the product of ihi-ilo elementary reflectors, as returned by PZGEHRD q = h(ilo) h(ilo+1) . . . h(ihi-1). |
| PZGELQ2 PZGELQ2 PZGELQ2 computes a lq factorization of a complex distributed m-by- |
| PZGELQF PZGELQF PZGELQF computes a lq factorization of a complex distributed m-by- if m < n, sub( a ) is overwritten by details of its lq factorization as returned by PZGELQF ia (global input) integer as returned by PZGELQF notes as returned by PZGELQF notes as returned by PZGELQF. q is of order m if side = 'l' and of order as returned by PZGELQF. q is of order m if side = 'l' and of order |
| PZGELS PZGELS PZGELS solves overdetermined or underdetermined complex linea or its conjugate-transpose, using a qr or lq factorization of |
| PZGEMR2D PZGEMR2D although all processes call PZGEMR2D, only the processes that ow first column of b receive data. the calls to zgebs2d/zgebr2d |
| PZGEQL2 PZGEQL2 PZGEQL2 computes a ql factorization of a complex distributed m-by- |
| PZGEQLF PZGEQLF PZGEQLF computes a ql factorization of a complex distributed m-by- as returned by PZGEQLF notes as returned by PZGEQLF notes as returned by PZGEQLF. q is of order m if side = 'l' and of order as returned by PZGEQLF. q is of order m if side = 'l' and of order |
| PZGEQPF PZGEQPF PZGEQPF computes a qr factorization with column pivoting of |
| PZGEQR2 PZGEQR2 PZGEQR2 computes a qr factorization of a complex distributed m-by- |
| PZGEQRF PZGEQRF if m >= n, sub( a ) is overwritten by details of its qr factorization as returned by PZGEQRF factorization as returned by pzgelqf. PZGEQRF computes a qr factorization of a complex distributed m-by- as returned by PZGEQRF notes as returned by PZGEQRF notes as returned by PZGEQRF. q is of order m if side = 'l' and of order as returned by PZGEQRF. q is of order m if side = 'l' and of order |
| PZGERFS PZGERFS PZGERFS improves the computed solution to a system of linea the solutions. lwork is local input and must be at least lwork = max( pzgecon( lwork ), PZGERFS( lwork ) |
| PZGERQ2 PZGERQ2 PZGERQ2 computes a rq factorization of a complex distributed m-by- |
| PZGERQF PZGERQF PZGERQF computes a rq factorization of a complex distributed m-by- as returned by PZGERQF notes as returned by PZGERQF notes as returned by PZGERQF. q is of order m if side = 'l' and of order as returned by PZGERQF. q is of order m if side = 'l' and of order |
| PZGESV PZGESV PZGESV computes the solution to a complex system of linear equation sub( a ) * x = sub( b ), |
| PZGESVD PZGESVD PZGESVD computes the singular value decomposition (svd) of a singular vectors. the svd is written as |
| PZGESVX PZGESVX PZGESVX uses the lu factorization to compute the solution to |
| PZGETF2 PZGETF2 PZGETF2 computes an lu factorization of a general m-by- partial pivoting with row interchanges. |
| PZGETRF PZGETRF 1-norm or the infinity-norm, using the lu factorization computed by PZGETRF an estimate is obtained for norm(inv(a(ia:ia+n-1,ja:ja+n-1))), and factors of the matrix sub( a ) = p * l * u as computed by PZGETRF iaf (global input) integer entry contains the factors l and u from the factorization a(ia:ia+n-1,ja:ja+n-1) = p*l*u as computed by PZGETRF equilibrated matrix a(ia:ia+n-1,ja:ja+n-1). PZGETRF computes an lu factorization of a general m-by-n distribute row interchanges. pzgetri computes the inverse of a distributed matrix using the lu factorization computed by PZGETRF. this method inverts u and the inva by solving the system inva*l = inv(u) for inva. with a general n-by-n distributed matrix sub( a ) using the lu factorization computed by PZGETRF and sub( b ) denotes b(ib:ib+n-1,jb:jb+nrhs-1). |
| PZGETRI PZGETRI PZGETRI computes the inverse of a distributed matrix using the l computes the inverse of sub( a ) = a(ia:ia+n-1,ja:ja+n-1) denoted |
| PZGETRS PZGETRS PZGETRS solves a system of distributed linear equation op( sub( a ) ) * x = sub( b ) |
| PZGGQRF PZGGQRF PZGGQRF computes a generalized qr factorization o an n-by-p matrix sub( b ) = b(ib:ib+n-1,jb:jb+p-1): |
| PZGGRQF PZGGRQF PZGGRQF computes a generalized rq factorization o and a p-by-n matrix sub( b ) = b(ib:ib+p-1,jb:jb+n-1): |
| PZHEEV PZHEEV PZHEEV computes selected eigenvalues and, optionally, eigenvector of scalapack routines. PZHEEVd computes all the eigenvalues and eigenvectors of a hermitia |
| PZHEEVD PZHEEVD PZHEEVD computes all the eigenvalues and eigenvectors of a hermitia |
| PZHEEVX PZHEEVX PZHEEVX computes selected eigenvalues and, optionally, eigenvector of scalapack routines. eigenvalues/vectors can be selected by |
| PZHEGS2 PZHEGS2 PZHEGS2 reduces a complex hermitian-definite generalized eigenproble |
| PZHEGST PZHEGST PZHEGST reduces a complex hermitian-definite generalized eigenproble pzhengst performs the same function as PZHEGST, but is based o triangular solves (the basis of pzhengst). |
| PZHEGVX PZHEGVX PZHEGVX computes all the eigenvalues, and optionally of a complex generalized hermitian-definite eigenproblem, of the form |
| PZHENGST PZHENGST PZHENGST reduces a complex hermitian-definite generalize |
| PZHENTRD PZHENTRD PZHENTRD is a prototype version of pzhetrd which uses tailore when the workspace provided by the user is adequate. |
| PZHEPTRD PZHEPTRD pzlamr1d has not been tested except withint the contect of PZHEPTRD, the prototype reduction to tridiagonal form code purpose |
| PZHETD2 PZHETD2 PZHETD2 reduces a complex hermitian matrix sub( a ) to hermitia q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1). |
| PZHETRD PZHETRD support for uplo='u' is limited to calling the old, slow, PZHETRD PZHETRD reduces a complex hermitian matrix sub( a ) to hermitia q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1). pzhettrd is not intended to be called directly. all users are encourage to call PZHETRD which will then call pzhettrd i the process grid must be square ( i.e. nprow = npcol ) and this is an auxiliary routine called by PZHETRD to redistribute d, this is an auxiliary routine called by PZHETRD notes nq = m if side = 'l' and nq = n if side = 'r'. q is defined as the product of nq-1 elementary reflectors, as returned by PZHETRD if uplo = 'u', q = h(nq-1) . . . h(2) h(1); |
| PZHETTRD PZHETTRD ictxt = desca( ctxt_ ) anb = pjlaenv( ictxt, 3, 'PZHETTRD', 'l', 0, 0, 0, 0 nps = max( numroc( n, 1, 0, 0, sqnpc ), 2*anb ) ictxt = desca( ctxt_ ) anb = pjlaenv( ictxt, 3, 'PZHETTRD', 'l', 0, 0, 0, 0 nps = max( numroc( n, 1, 0, 0, sqnpc ), 2*anb ) pzhentrd is a prototype version of pzhetrd which uses tailored codes (either the serial, zhetrd, or the parallel code, PZHETTRD PZHETTRD reduces a complex hermitian matrix sub( a ) to hermitia q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1). |
| PZLABRD PZLABRD PZLABRD reduces the first nb rows and columns of a complex genera or lower bidiagonal form by an unitary transformation q' * a * p, and |
| PZLACGV PZLACGV PZLACGV conjugates a complex vector of length n, sub( x ), wher x(ix:ix+n-1,jx) if incx = 1, and |
| PZLACON PZLACON PZLACON estimates the 1-norm of a square, complex distributed matri products. x and v are aligned with the distributed matrix a, this |
| PZLACONSB PZLACONSB PZLACONSB looks for two consecutive small subdiagonal elements b given by h44, h33, & h43h34 and see if this would make a look for two consecutive small subdiagonal elements: PZLACONSB is the routine that does this |
| PZLACP2 PZLACP2 PZLACP2 copies all or part of a distributed matrix a to anothe performs a local copy sub( a ) := sub( b ), where sub( a ) denotes |
| PZLACP3 PZLACP3 PZLACP3 is an auxiliary routine that copies from a global paralle the entire submatrix that is copied gets placed on one node or |
| PZLACPY PZLACPY PZLACPY copies all or part of a distributed matrix a to anothe performs a local copy sub( a ) := sub( b ), where sub( a ) denotes |
| PZLAEVSWP PZLAEVSWP PZLAEVSWP moves the eigenvectors (potentially unsorted) fro array, sorted so that the corresponding eigenvalues are sorted. |
| PZLAHRD PZLAHRD PZLAHRD reduces the first nb columns of a complex genera elements below the k-th subdiagonal are zero. the reduction is |
| PZLAMR1D PZLAMR1D PZLAMR1D has not been tested except withint the contect o |
| PZLANGE PZLANGE watobd = max(max(wPZLANGE,wpzgebrd) PZLANGE returns the value of the one norm, or the frobenius norm distributed matrix sub( a ) = a(ia:ia+m-1, ja:ja+n-1). |
| PZLAPIV PZLAPIV PZLAPIV applies either p (permutation matrix indicated by ipiv sub( a ) = a(ia:ia+m-1,ja:ja+n-1), resulting in row or column same mb (or nb) block. if you want to pivot a full matrix, use PZLAPIV notes |
| PZLAPV2 PZLAPV2 PZLAPV2 applies either p (permutation matrix indicated by ipiv a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting. the |
| PZLAQGE PZLAQGE PZLAQGE equilibrates a general m-by-n distributed matri factors in the vectors r and c. |
| PZLAQSY PZLAQSY PZLAQSY equilibrates a symmetric distributed matri vectors sr and sc. |
| PZLARFB PZLARFB PZLARFB applies a complex block reflector q or its conjugat denoting c(ic:ic+m-1,jc:jc+n-1), from the left or the right. |
| PZLARFG PZLARFG PZLARFG generates a complex elementary reflector h of order n, suc |
| PZLARFT PZLARFT PZLARFT forms the triangular factor t of a complex block reflector |
| PZLARZB PZLARZB PZLARZB applies a complex block reflector q or its conjugat denoting c(ic:ic+m-1,jc:jc+n-1), from the left or the right. |
| PZLARZT PZLARZT PZLARZT forms the triangular factor t of a complex block reflecto reflectors as returned by pztzrzf. |
| PZLASCL PZLASCL PZLASCL multiplies the m-by-n complex distributed matrix sub( a is done without over/underflow as long as the final result |
| PZLASE2 PZLASE2 PZLASE2 initializes an m-by-n distributed matrix sub( a ) denotin offdiagonals. pzlase2 requires that only dimension of the matrix |
| PZLASET PZLASET PZLASET initializes an m-by-n distributed matrix sub( a ) denotin offdiagonals. |
| PZLASMSUB PZLASMSUB PZLASMSUB looks for a small subdiagonal element from the botto |
| PZLASSQ PZLASSQ PZLASSQ returns the values scl and smsq such tha ( scl**2 )*smsq = x( 1 )**2 +...+ x( n )**2 + ( scale**2 )*sumsq, |
| PZLASWP PZLASWP PZLASWP performs a series of row or column interchanges o interchange is initiated for each of rows or columns k1 trough k2 of |
| PZLATRA PZLATRA PZLATRA computes the trace of an n-by-n distributed matrix sub( a process of the grid. |
| pzlatrd pzlatrd the traditional way of computing v (and the one used in pzlatrd.f an v = tau * v the traditional way of computing v (and the one used in pzlatrd.f an v = tau * v the traditional way of computing v (and the one used in pzlatrd.f an v = tau * v the traditional way of computing v (and the one used in pzlatrd.f an v = tau * v pzlatrd reduces nb rows and columns of a complex hermitia tridiagonal form by an unitary similarity transformation |
| PZLATRZ PZLATRZ PZLATRZ reduces the m-by-n ( m<=n ) complex upper trapezoida to upper triangular form by means of unitary transformations. |
| PZLATTRS PZLATTRS dimension ( 2*desct(lld_) ) additional workspace may be required if PZLATTRS is update |
| PZLAUU2 PZLAUU2 PZLAUU2 computes the product u * u' or l' * l, where the triangula the matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). |
| PZLAUUM PZLAUUM PZLAUUM computes the product u * u' or l' * l, where the triangula the distributed matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1). |
| PZLAWIL PZLAWIL PZLAWIL gets the transform given by h44,h33, & h43h34 into |
| PZMAX1 PZMAX1 PZMAX1 computes the global index of the maximum element in absolut in indx and the value is returned in amax, |
| PZPBSV PZPBSV PZPBSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PZPBTRF PZPBTRF see PZPBTRF and pzpbtrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PZPBTRF banded symmetric positive definite distributed |
| PZPBTRS PZPBTRS see pzpbtrf and PZPBTRS for details ===================================================================== PZPBTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PZPOCON PZPOCON PZPOCON estimates the reciprocal of the condition number (in th using the cholesky factorization a = u**h*u or a = l*l**h computed by lwork is local input and must be at least lwork = max( PZPOCON( lwork ), pzporfs( lwork ) lwork = 3*desca( lld_ ) |
| PZPOEQU PZPOEQU PZPOEQU computes row and column scalings intended t sub( a ) = a(ia:ia+n-1,ja:ja+n-1) and reduce its condition number |
| PZPORFS PZPORFS PZPORFS improves the computed solution to a system of linea and provides error bounds and backward error estimates for the lwork is local input and must be at least lwork = max( pzpocon( lwork ), PZPORFS( lwork ) lwork = 3*desca( lld_ ) |
| PZPOSV PZPOSV PZPOSV computes the solution to a complex system of linear equation sub( a ) * x = sub( b ), |
| PZPOSVX PZPOSVX PZPOSVX uses the cholesky factorization a = u**h*u or a = l*l**h t |
| PZPOTF2 PZPOTF2 PZPOTF2 computes the cholesky factorization of a complex hermitia |
| PZPOTRF PZPOTRF sub( b ) must have been previously factorized as u**h*u or l*l**h by PZPOTRF notes sub( b ) must have been previously factorized as u**h*u or l*l**h by PZPOTRF notes sub( b ) must have been previously factorized as u**h*u or l*l**h by PZPOTRF notes using the cholesky factorization a = u**h*u or a = l*l**h computed by PZPOTRF an estimate is obtained for norm(inv(a(ia:ia+n-1,ja:ja+n-1))), and cholesky factorization sub( a ) = l*l**h or u**h*u, as computed by PZPOTRF iaf (global input) integer PZPOTRF computes the cholesky factorization of an n-by-n comple a(ia:ia+n-1, ja:ja+n-1). cholesky factorization sub( a ) = u**h*u or l*l**h computed by PZPOTRF notes hermitian positive definite distributed matrix using the cholesky factorization sub( a ) = u**h*u or l*l**h computed by PZPOTRF |
| PZPOTRI PZPOTRI PZPOTRI computes the inverse of a complex hermitian positive definit cholesky factorization sub( a ) = u**h*u or l*l**h computed by |
| PZPOTRS PZPOTRS PZPOTRS solves a system of linear equation sub( a ) * x = sub( b ) |
| PZPTSV PZPTSV PZPTSV solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PZPTTRF PZPTTRF see PZPTTRF and pzpttrs for details ===================================================================== where a(1:n, ja:ja+n-1) is the matrix used to produce the factors stored in a(1:n,ja:ja+n-1) and af by PZPTTRF tridiagonal symmetric positive definite distributed |
| PZPTTRS PZPTTRS see pzpttrf and PZPTTRS for details ===================================================================== PZPTTRS solves a system of linear equation a(1:n, ja:ja+n-1) * x = b(ib:ib+n-1, 1:nrhs) |
| PZSTEBZ PZSTEBZ computed is returned in nz. if (mod(info/8,2).ne.0), then PZSTEBZ failed to comput send e-mail to scalapack@cs.utk.edu computed is returned in nz. if (mod(info/8,2).ne.0), then PZSTEBZ failed t send e-mail to scalapack@cs.utk.edu |
| PZSTEIN PZSTEIN performance. in the limit (i.e. clustersize = n-1) PZSTEIN will perform no better than zstein on for clustersize = n/sqrt(nprow*npcol) reorthogonalizing performance. in the limit (i.e. clustersize = n-1) PZSTEIN will perform no better than zstein on 1 processor all eigenvectors will increase the total execution time PZSTEIN computes the eigenvectors of a symmetric tridiagonal matri correspond to user specified eigenvalues. pzstein does not |
| PZTRCON PZTRCON PZTRCON estimates the reciprocal of the condition number of 1-norm or the infinity-norm. |
| PZTREVC PZTREVC PZTREVC computes some or all of the right and/or left eigenvectors o |
| PZTRRFS PZTRRFS PZTRRFS provides error bounds and backward error estimates for th coefficient matrix. |
| PZTRSV PZTRSV compute a bound on the computed solution vector to see if the level 2 pblas routine PZTRSV can be used been implemented in pzlattrs which is called by this routine to solve the triangular systems. pzlattrs just calls PZTRSV each eigenvector is normalized so that the element of largest |
| PZTRTI2 PZTRTI2 PZTRTI2 computes the inverse of a complex upper or lower triangula contained in one and only one process memory space (local operation). |
| PZTRTRI PZTRTRI PZTRTRI computes the inverse of a upper or lower triangula |
| PZTRTRS PZTRTRS the solution matrix x must be computed by PZTRTRS or some othe refinement because doing so cannot improve the backward error. PZTRTRS solves a triangular system of the for sub( a ) * x = sub( b ) or sub( a )**t * x = sub( b ) or |
| PZTZRZF PZTZRZF q is a product of k elementary reflectors as returned by PZTZRZF currently, only storev = 'r' and direct = 'b' are supported. h of order > n, which is defined as a product of k elementary reflectors as returned by PZTZRZF if direct = 'f', h = h(1) h(2) . . . h(k) and t is upper triangular; PZTZRZF reduces the m-by-n ( m<=n ) complex upper trapezoidal matri of unitary transformations. as returned by PZTZRZF. q is of order m if side = 'l' and of order as returned by PZTZRZF. q is of order m if side = 'l' and of order |
| PZUNG2L PZUNG2L PZUNG2L generates an m-by-n complex distributed matrix q denotin the last n columns of a product of k elementary reflectors of order m |
| PZUNG2R PZUNG2R PZUNG2R generates an m-by-n complex distributed matrix q denotin the first n columns of a product of k elementary reflectors of order |
| PZUNGL2 PZUNGL2 PZUNGL2 generates an m-by-n complex distributed matrix q denotin the first m rows of a product of k elementary reflectors of order n |
| PZUNGLQ PZUNGLQ PZUNGLQ generates an m-by-n complex distributed matrix q denotin the first m rows of a product of k elementary reflectors of order n |
| PZUNGQL PZUNGQL PZUNGQL generates an m-by-n complex distributed matrix q denotin the last n columns of a product of k elementary reflectors of order m |
| PZUNGQR PZUNGQR a(ia+i:ia+n-1,ja+i-1), and taua in taua(ja+i-1). to form q explicitly, use scalapack subroutine PZUNGQR b(ib+i:ib+p-1,jb+i-1), and taub in taub(jb+i-1). to form z explicitly, use scalapack subroutine PZUNGQR PZUNGQR generates an m-by-n complex distributed matrix q denotin the first n columns of a product of k elementary reflectors of order |
| PZUNGR2 PZUNGR2 PZUNGR2 generates an m-by-n complex distributed matrix q denotin last m rows of a product of k elementary reflectors of order n |
| PZUNGRQ PZUNGRQ exit in b(ib+n-k+i-1,jb:jb+p-k+i-2), and taub in taub(ib+n-k+i-1). to form z explicitly, use scalapack subroutine PZUNGRQ exit in a(ia+m-k+i-1,ja:ja+n-k+i-2), and taua in taua(ia+m-k+i-1). to form q explicitly, use scalapack subroutine PZUNGRQ PZUNGRQ generates an m-by-n complex distributed matrix q denotin last m rows of a product of k elementary reflectors of order n |
| PZUNM2L PZUNM2L PZUNM2L overwrites the general complex m-by-n distributed matri |
| PZUNM2R PZUNM2R PZUNM2R overwrites the general complex m-by-n distributed matri |
| PZUNMBR PZUNMBR to the workspace required for the subprograms zbdsqr, PZUNMBR(qln), and pzunmbr(prt), where qln and prt are th to pzunmbr. nru is equal to the local number of rows of if vect = 'q', PZUNMBR overwrites the general complex distribute |
| PZUNMHR PZUNMHR PZUNMHR overwrites the general complex m-by-n distributed matri |
| PZUNML2 PZUNML2 PZUNML2 overwrites the general complex m-by-n distributed matri |
| PZUNMLQ PZUNMLQ PZUNMLQ overwrites the general complex m-by-n distributed matri |
| PZUNMQL PZUNMQL PZUNMQL overwrites the general complex m-by-n distributed matri |
| PZUNMQR PZUNMQR to form q explicitly, use scalapack subroutine pzungqr. to use q to update another matrix, use scalapack subroutine PZUNMQR the matrix z is represented as a product of elementary reflectors to form z explicitly, use scalapack subroutine pzungqr. to use z to update another matrix, use scalapack subroutine PZUNMQR alignment requirements PZUNMQR overwrites the general complex m-by-n distributed matri |
| PZUNMR2 PZUNMR2 PZUNMR2 overwrites the general complex m-by-n distributed matri |
| PZUNMR3 PZUNMR3 PZUNMR3 overwrites the general complex m-by-n distributed matri |
| PZUNMRQ PZUNMRQ to form z explicitly, use scalapack subroutine pzungrq. to use z to update another matrix, use scalapack subroutine PZUNMRQ alignment requirements to form q explicitly, use scalapack subroutine pzungrq. to use q to update another matrix, use scalapack subroutine PZUNMRQ the matrix z is represented as a product of elementary reflectors PZUNMRQ overwrites the general complex m-by-n distributed matri |
| PZUNMRZ PZUNMRZ PZUNMRZ overwrites the general complex m-by-n distributed matri |
| PZUNMTR PZUNMTR PZUNMTR overwrites the general complex m-by-n distributed matri |