the last processor does not participate in the solution of the
reduced system and just waits to receive its solution
determine number of steps in tree loop

PCDTTRSV()

the last processor does not participate in the solution of the
reduced system and just waits to receive its solution
determine number of steps in tree loop

PCPBTRSV()

the last processor does not participate in the solution of the
reduced system and just waits to receive its solution
determine number of steps in tree loop

PCPTTRSV()

the last processor does not participate in the solution of the
reduced system and just waits to receive its solution
determine number of steps in tree loop

PDDBTRSV()

the last processor does not participate in the solution of the
reduced system and just waits to receive its solution
determine number of steps in tree loop

PDDTTRSV()

the last processor does not participate in the solution of the
reduced system and just waits to receive its solution
determine number of steps in tree loop

PDPBTRSV()

the last processor does not participate in the solution of the
reduced system and just waits to receive its solution
determine number of steps in tree loop

PDPTTRSV()

the last processor does not participate in the solution of the
reduced system and just waits to receive its solution
determine number of steps in tree loop

PSDBTRSV()

the last processor does not participate in the solution of the
reduced system and just waits to receive its solution
determine number of steps in tree loop

PSDTTRSV()

the last processor does not participate in the solution of the
reduced system and just waits to receive its solution
determine number of steps in tree loop

PSPBTRSV()

the last processor does not participate in the solution of the
reduced system and just waits to receive its solution
determine number of steps in tree loop

PSPTTRSV()

the last processor does not participate in the solution of the
reduced system and just waits to receive its solution
determine number of steps in tree loop

PZDBTRSV()

the last processor does not participate in the solution of the
reduced system and just waits to receive its solution
determine number of steps in tree loop

PZDTTRSV()

the last processor does not participate in the solution of the
reduced system and just waits to receive its solution
determine number of steps in tree loop

PZPBTRSV()

the last processor does not participate in the solution of the
reduced system and just waits to receive its solution
determine number of steps in tree loop

PZPTTRSV()

the last processor does not participate in the solution of the
reduced system and just waits to receive its solution
determine number of steps in tree loop

Want

PCDBTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCDBTRSV()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCDTTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCDTTRSV()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCGBTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCHEEVX()

minimal workspace is supplied and orfac is too small.
if you Want to guarantee orthogonality (at the cos
the following to lrwork:

PCHEGVX()

minimal workspace is supplied and orfac is too small.
if you Want to guarantee orthogonality (at the cos
the following to lrwork:

PCLAHQR()

for schur form, use 2x2 blocks
if we don't Want the schur form, use bigger blocks
now the active submatrix is in rows and columns l to i. if

PCLASWP()

also note that this routine will only work for k1-k2 being in the
same mb (or nb) block.  if you Want to pivot a full matrix, us

PCPBTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCPBTRSV()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCPTTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCPTTRSV()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDDBTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDDBTRSV()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDDTTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDDTTRSV()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDGBTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDLAHQR()

make sure it's divisible by lcm (we Want even workloads!

PDLASWP()

also note that this routine will only work for k1-k2 being in the
same mb (or nb) block.  if you Want to pivot a full matrix, us

PDPBTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDPBTRSV()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDPTTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDPTTRSV()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDSYEVX()

minimal workspace is supplied and orfac is too small.
if you Want to guarantee orthogonality (at the cos
the following to lwork:

PDSYGVX()

minimal workspace is supplied and orfac is too small.
if you Want to guarantee orthogonality (at the cos
the following to lwork:

PSDBTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSDBTRSV()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSDTTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSDTTRSV()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSGBTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSLAHQR()

make sure it's divisible by lcm (we Want even workloads!

PSLASWP()

also note that this routine will only work for k1-k2 being in the
same mb (or nb) block.  if you Want to pivot a full matrix, us

PSPBTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSPBTRSV()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSPTTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSPTTRSV()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSSYEVX()

minimal workspace is supplied and orfac is too small.
if you Want to guarantee orthogonality (at the cos
the following to lwork:

PSSYGVX()

minimal workspace is supplied and orfac is too small.
if you Want to guarantee orthogonality (at the cos
the following to lwork:

PZDBTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZDBTRSV()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZDTTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZDTTRSV()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZGBTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZHEEVX()

minimal workspace is supplied and orfac is too small.
if you Want to guarantee orthogonality (at the cos
the following to lrwork:

PZHEGVX()

minimal workspace is supplied and orfac is too small.
if you Want to guarantee orthogonality (at the cos
the following to lrwork:

PZLAHQR()

for schur form, use 2x2 blocks
if we don't Want the schur form, use bigger blocks
now the active submatrix is in rows and columns l to i. if

PZLASWP()

also note that this routine will only work for k1-k2 being in the
same mb (or nb) block.  if you Want to pivot a full matrix, us

PZPBTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZPBTRSV()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZPTTRF()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZPTTRSV()

Want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

wanted

PCGESVD()

1, if left(right) singular vectors are wanted
0, otherwise

PDGESVD()

1, if left(right) singular vectors are wanted
0, otherwise

PSGESVD()

1, if left(right) singular vectors are wanted
0, otherwise

PZGESVD()

1, if left(right) singular vectors are wanted
0, otherwise

WANTU

PCGESVD()

wbdtosvd = size*(WANTU*nru + wantvt*ncvt) 
max(wantu*wpcormbrqln, wantvt*wpcormbrprt)),

PDGESVD()

wbdtosvd = size*(WANTU*nru + wantvt*ncvt) 
max(wantu*wpdormbrqln, wantvt*wpdormbrprt)),

PSGESVD()

wbdtosvd = size*(WANTU*nru + wantvt*ncvt) 
max(wantu*wpsormbrqln, wantvt*wpsormbrprt)),

PZGESVD()

wbdtosvd = size*(WANTU*nru + wantvt*ncvt) 
max(wantu*wpzormbrqln, wantvt*wpzormbrprt)),

WANTVT

PCGESVD()

wbdtosvd = size*(wantu*nru + WANTVT*ncvt) 
max(wantu*wpcormbrqln, wantvt*wpcormbrprt)),

PDGESVD()

wbdtosvd = size*(wantu*nru + WANTVT*ncvt) 
max(wantu*wpdormbrqln, wantvt*wpdormbrprt)),

PSGESVD()

wbdtosvd = size*(wantu*nru + WANTVT*ncvt) 
max(wantu*wpsormbrqln, wantvt*wpsormbrprt)),

PZGESVD()

wbdtosvd = size*(wantu*nru + WANTVT*ncvt) 
max(wantu*wpzormbrqln, wantvt*wpzormbrprt)),

WANTZ

CLAREF()

WANTZ   (global input) logica
if .false., then do no additional work on z.

DLAREF()

WANTZ   (global input) logica
if .false., then do no additional work on z.

SLAREF()

WANTZ   (global input) logica
if .false., then do no additional work on z.

ZLAREF()

WANTZ   (global input) logica
if .false., then do no additional work on z.

was

DLASORTE()

eigenvalues and things couldn't be paired or if the input
matrix s was not originally in schur form

DSTEIN2()

if stopping criterion was not satisfied, update info an

PCDBSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PCDTSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PCGBSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PCGBTRF()

if error was found in phase 1, processors jump here
free blacs space used to hold standard-form grid.

PCGEQPF()

on exit, if ipiv(i) = k, the local i-th column of sub( a )*p
was the global k-th column of sub( a ). ipiv is tied to th

PCGERFS()

by pcgetrf. ipiv(i) -> the global row local row i
was swapped with. this array is tied to the distribute

PCGESV()

this array contains the pivoting information.
ipiv(i) -> the global row local row i was swapped with

PCGESVX()

6. if fact = 'e' and equilibration was used, the matrix x i
trans = 't' or 'c') so that it solves the original system

PCGETF2()

this array contains the pivoting information.
ipiv(i) -> the global row local row i was swapped with

PCGETRF()

this array contains the pivoting information.
ipiv(i) -> the global row local row i was swapped with

PCGETRI()

keeps track of the pivoting information. ipiv(i) is the
global row index the local row i was swapped with.  thi

PCGETRS()

this array contains the pivoting information.
ipiv(i) -> the global row local row i was swapped with

PCHEGVX()

send e-mail to scalapack@cs.utk.edu
if (mod(info/16,2).ne.0), then b was not positiv
the smallest minor which is not positive definite.

PCLACON()

the serial version clacon has been contributed by nick higham,
university of manchester. it was originally named sonest, date

PCLAHQR()

the else part of this if needs updated vcopy, this
was not necessary in pslahqr

PCLAPIV()

this array contains the pivoting information. ipiv(i) is the
global row (column), local row (column) i was swapped with
or 'c' and pivroc='r' or 'r', the last piece of this array of

PCLAPV2()

the pivoting information. ipiv(i) is the global row (column),
local row (column) i was swapped with.  the last piece of th
tied to the distributed matrix a.

PCLAQGE()

equed   (global output) character
specifies the form of equilibration that was done
= 'r':  row equilibration, i.e., sub( a ) has been pre-

PCLAQSY()

equed   (output) character*1
specifies whether or not equilibration was done
= 'y':  equilibration was done, i.e., sub( a ) has been re-

PCLATTRS()

compute x(j) := ( x(j) - csumj ) / a(j,j) if 1/a(j,j)
was not used to scale the dotproduct
x( j ) = x( j ) - csumj

PCMAX1()

the serial version was contributed to lapack by nick higham for us

PCPBSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PCPOSVX()

6. if equilibration was used, the matrix x is premultiplied b
equilibration.

PCPTSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PCTREVC()

products q*x and/or q*y, where q is an input unitary
matrix. if t was obtained from the schur factorization of a
right or left eigenvectors of a.

PDDBSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PDDTSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PDGBSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PDGBTRF()

if error was found in phase 1, processors jump here
free blacs space used to hold standard-form grid.

PDGEQPF()

on exit, if ipiv(i) = k, the local i-th column of sub( a )*p
was the global k-th column of sub( a ). ipiv is tied to th

PDGERFS()

by pdgetrf. ipiv(i) -> the global row local row i
was swapped with. this array is tied to the distribute

PDGESV()

this array contains the pivoting information.
ipiv(i) -> the global row local row i was swapped with

PDGESVX()

6. if fact = 'e' and equilibration was used, the matrix x i
trans = 't' or 'c') so that it solves the original system

PDGETF2()

this array contains the pivoting information.
ipiv(i) -> the global row local row i was swapped with

PDGETRF()

this array contains the pivoting information.
ipiv(i) -> the global row local row i was swapped with

PDGETRI()

keeps track of the pivoting information. ipiv(i) is the
global row index the local row i was swapped with.  thi

PDGETRS()

this array contains the pivoting information.
ipiv(i) -> the global row local row i was swapped with

PDLACON()

the serial version dlacon has been contributed by nick higham,
university of manchester. it was originally named sonest, date

PDLAPIV()

this array contains the pivoting information. ipiv(i) is the
global row (column), local row (column) i was swapped with
or 'c' and pivroc='r' or 'r', the last piece of this array of

PDLAPV2()

the pivoting information. ipiv(i) is the global row (column),
local row (column) i was swapped with.  the last piece of th
tied to the distributed matrix a.

PDLAQGE()

equed   (global output) character
specifies the form of equilibration that was done
= 'r':  row equilibration, i.e., sub( a ) has been pre-

PDLAQSY()

equed   (output) character*1
specifies whether or not equilibration was done
= 'y':  equilibration was done, i.e., sub( a ) has been re-

PDPBSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PDPOSVX()

6. if equilibration was used, the matrix x is premultiplied b
equilibration.

PDPTSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PDSTEBZ()

= 3 : range='i', and the gershgorin interval initially
used was incorrect. no eigenvalues were computed
point arithmetic.

PDSYGVX()

send e-mail to scalapack@cs.utk.edu
if (mod(info/16,2).ne.0), then b was not positiv
the smallest minor which is not positive definite.

PDZSUM1()

the serial version of this routine was originally contributed b

PSCSUM1()

the serial version of this routine was originally contributed b

PSDBSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PSDTSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PSGBSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PSGBTRF()

if error was found in phase 1, processors jump here
free blacs space used to hold standard-form grid.

PSGEQPF()

on exit, if ipiv(i) = k, the local i-th column of sub( a )*p
was the global k-th column of sub( a ). ipiv is tied to th

PSGERFS()

by psgetrf. ipiv(i) -> the global row local row i
was swapped with. this array is tied to the distribute

PSGESV()

this array contains the pivoting information.
ipiv(i) -> the global row local row i was swapped with

PSGESVX()

6. if fact = 'e' and equilibration was used, the matrix x i
trans = 't' or 'c') so that it solves the original system

PSGETF2()

this array contains the pivoting information.
ipiv(i) -> the global row local row i was swapped with

PSGETRF()

this array contains the pivoting information.
ipiv(i) -> the global row local row i was swapped with

PSGETRI()

keeps track of the pivoting information. ipiv(i) is the
global row index the local row i was swapped with.  thi

PSGETRS()

this array contains the pivoting information.
ipiv(i) -> the global row local row i was swapped with

PSLACON()

the serial version slacon has been contributed by nick higham,
university of manchester. it was originally named sonest, date

PSLAPIV()

this array contains the pivoting information. ipiv(i) is the
global row (column), local row (column) i was swapped with
or 'c' and pivroc='r' or 'r', the last piece of this array of

PSLAPV2()

the pivoting information. ipiv(i) is the global row (column),
local row (column) i was swapped with.  the last piece of th
tied to the distributed matrix a.

PSLAQGE()

equed   (global output) character
specifies the form of equilibration that was done
= 'r':  row equilibration, i.e., sub( a ) has been pre-

PSLAQSY()

equed   (output) character*1
specifies whether or not equilibration was done
= 'y':  equilibration was done, i.e., sub( a ) has been re-

PSPBSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PSPOSVX()

6. if equilibration was used, the matrix x is premultiplied b
equilibration.

PSPTSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PSSTEBZ()

= 3 : range='i', and the gershgorin interval initially
used was incorrect. no eigenvalues were computed
point arithmetic.

PSSYGVX()

send e-mail to scalapack@cs.utk.edu
if (mod(info/16,2).ne.0), then b was not positiv
the smallest minor which is not positive definite.

PZDBSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PZDTSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PZGBSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PZGBTRF()

if error was found in phase 1, processors jump here
free blacs space used to hold standard-form grid.

PZGEQPF()

on exit, if ipiv(i) = k, the local i-th column of sub( a )*p
was the global k-th column of sub( a ). ipiv is tied to th

PZGERFS()

by pzgetrf. ipiv(i) -> the global row local row i
was swapped with. this array is tied to the distribute

PZGESV()

this array contains the pivoting information.
ipiv(i) -> the global row local row i was swapped with

PZGESVX()

6. if fact = 'e' and equilibration was used, the matrix x i
trans = 't' or 'c') so that it solves the original system

PZGETF2()

this array contains the pivoting information.
ipiv(i) -> the global row local row i was swapped with

PZGETRF()

this array contains the pivoting information.
ipiv(i) -> the global row local row i was swapped with

PZGETRI()

keeps track of the pivoting information. ipiv(i) is the
global row index the local row i was swapped with.  thi

PZGETRS()

this array contains the pivoting information.
ipiv(i) -> the global row local row i was swapped with

PZHEGVX()

send e-mail to scalapack@cs.utk.edu
if (mod(info/16,2).ne.0), then b was not positiv
the smallest minor which is not positive definite.

PZLACON()

the serial version zlacon has been contributed by nick higham,
university of manchester. it was originally named sonest, date

PZLAHQR()

the else part of this if needs updated vcopy, this
was not necessary in pdlahqr

PZLAPIV()

this array contains the pivoting information. ipiv(i) is the
global row (column), local row (column) i was swapped with
or 'c' and pivroc='r' or 'r', the last piece of this array of

PZLAPV2()

the pivoting information. ipiv(i) is the global row (column),
local row (column) i was swapped with.  the last piece of th
tied to the distributed matrix a.

PZLAQGE()

equed   (global output) character
specifies the form of equilibration that was done
= 'r':  row equilibration, i.e., sub( a ) has been pre-

PZLAQSY()

equed   (output) character*1
specifies whether or not equilibration was done
= 'y':  equilibration was done, i.e., sub( a ) has been re-

PZLATTRS()

compute x(j) := ( x(j) - csumj ) / a(j,j) if 1/a(j,j)
was not used to scale the dotproduct
x( j ) = x( j ) - csumj

PZMAX1()

the serial version was contributed to lapack by nick higham for us

PZPBSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PZPOSVX()

6. if equilibration was used, the matrix x is premultiplied b
equilibration.

PZPTSV()

> 0:  if info = k<=nprocs, the submatrix stored on processor
info and factored locally was no
the factorization was not completed.

PZTREVC()

products q*x and/or q*y, where q is an input unitary
matrix. if t was obtained from the schur factorization of a
right or left eigenvectors of a.

SLASORTE()

eigenvalues and things couldn't be paired or if the input
matrix s was not originally in schur form

SSTEIN2()

if stopping criterion was not satisfied, update info an

WATOBD

PCGESVD()

lwork >= 1 + 2*sizeb + max(WATOBD, wbdtosvd)
where sizeb = max(m,n), and watobd and wbdtosvd refer,

PDGESVD()

lwork >= 1 + 6*sizeb + max(WATOBD, wbdtosvd)
where sizeb = max(m,n), and watobd and wbdtosvd refer,

PSGESVD()

lwork >= 1 + 6*sizeb + max(WATOBD, wbdtosvd)
where sizeb = max(m,n), and watobd and wbdtosvd refer,

PZGESVD()

lwork >= 1 + 2*sizeb + max(WATOBD, wbdtosvd)
where sizeb = max(m,n), and watobd and wbdtosvd refer,

way

DLASORTE()

dlasorte sorts eigenpairs so that real eigenpairs are together and
complex are together.  this way one can employ 2x2 shifts easil
this routine does no parallel work.

PCDBSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PCDBTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PCDTSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PCDTTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PCGBSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PCGBTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PCGERFS()

the estimate is as reliable as the estimate for rcond, and
is almost always a slight overestimate of the true error

PCHETTRD()

the traditional
way of computing v (and the one used in pzlatrd.f an
v = tau * v

PCLATTRS()

otherwise, scale column of a by uscal before dot
product.  below is not the best way to do it
do 130 i = 1, j - 1

PCPBSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PCPBTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PCPORFS()

the estimate is as reliable as the estimate for rcond, and
is almost always a slight overestimate of the true error

PCPTSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PCPTTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PCTRRFS()

largest entry in sub( x ).  the estimate is as reliable as
the estimate for rcond, and is almost always a sligh
this array is tied to the distributed matrix x.

PDDBSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PDDBTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PDDTSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PDDTTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PDGBSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PDGBTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PDGERFS()

the estimate is as reliable as the estimate for rcond, and
is almost always a slight overestimate of the true error

PDPBSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PDPBTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PDPORFS()

the estimate is as reliable as the estimate for rcond, and
is almost always a slight overestimate of the true error

PDPTSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PDPTTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PDSYTTRD()

the traditional
way of computing v (and the one used in pzlatrd.f an
v = tau * v

PDTRRFS()

largest entry in sub( x ).  the estimate is as reliable as
the estimate for rcond, and is almost always a sligh
this array is tied to the distributed matrix x.

PSDBSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PSDBTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PSDTSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PSDTTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PSGBSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PSGBTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PSGERFS()

the estimate is as reliable as the estimate for rcond, and
is almost always a slight overestimate of the true error

PSPBSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PSPBTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PSPORFS()

the estimate is as reliable as the estimate for rcond, and
is almost always a slight overestimate of the true error

PSPTSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PSPTTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PSSYTTRD()

the traditional
way of computing v (and the one used in pzlatrd.f an
v = tau * v

PSTRRFS()

largest entry in sub( x ).  the estimate is as reliable as
the estimate for rcond, and is almost always a sligh
this array is tied to the distributed matrix x.

PZDBSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PZDBTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PZDTSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PZDTTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PZGBSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PZGBTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PZGERFS()

the estimate is as reliable as the estimate for rcond, and
is almost always a slight overestimate of the true error

PZHETTRD()

the traditional
way of computing v (and the one used in pzlatrd.f an
v = tau * v

PZLATTRS()

otherwise, scale column of a by uscal before dot
product.  below is not the best way to do it
do 130 i = 1, j - 1

PZPBSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PZPBTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PZPORFS()

the estimate is as reliable as the estimate for rcond, and
is almost always a slight overestimate of the true error

PZPTSV()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PZPTTRS()

parallel. these factors are applied to the matrix creating
fillin, which is stored in a non-inspectable way in auxiliar
the matrix a as p a p^t and then factoring the principal

PZTRRFS()

largest entry in sub( x ).  the estimate is as reliable as
the estimate for rcond, and is almost always a sligh
this array is tied to the distributed matrix x.

SLASORTE()

slasorte sorts eigenpairs so that real eigenpairs are together and
complex are together.  this way one can employ 2x2 shifts easil
this routine does no parallel work.

ways

PDLAED2()

sorted set.  then it tries to deflate the size of the problem.
there are two ways in which deflation can occur:  when two or mor
z vector.  for each such occurrence the order of the related secular

PSLAED2()

sorted set.  then it tries to deflate the size of the problem.
there are two ways in which deflation can occur:  when two or mor
z vector.  for each such occurrence the order of the related secular

WBDTOSVD

PCGESVD()

lwork >= 1 + 2*sizeb + max(watobd, WBDTOSVD)
where sizeb = max(m,n), and watobd and wbdtosvd refer,

PDGESVD()

lwork >= 1 + 6*sizeb + max(watobd, WBDTOSVD)
where sizeb = max(m,n), and watobd and wbdtosvd refer,

PSGESVD()

lwork >= 1 + 6*sizeb + max(watobd, WBDTOSVD)
where sizeb = max(m,n), and watobd and wbdtosvd refer,

PZGESVD()

lwork >= 1 + 2*sizeb + max(watobd, WBDTOSVD)
where sizeb = max(m,n), and watobd and wbdtosvd refer,

WCBDSQR

PCGESVD()

wbdtosvd = size*(wantu*nru + wantvt*ncvt) +
max(WCBDSQR

WDBDSQR

PDGESVD()

wbdtosvd = size*(wantu*nru + wantvt*ncvt) +
max(WDBDSQR

well

CCOMBAMAX1 ()

ccombamax1 finds the element having maximum real part absolute
value as well as its corresponding globl index
arguments

CLAREF()

wantz   (global input) logical
if .true., then apply any column reflections to z as well

DLAREF()

wantz   (global input) logical
if .true., then apply any column reflections to z as well

PCGEEQU()

factors is not guaranteed to reduce the condition number of
sub( a ) but works well in practice
notes

PDGEEQU()

factors is not guaranteed to reduce the condition number of
sub( a ) but works well in practice
notes

PSGEEQU()

factors is not guaranteed to reduce the condition number of
sub( a ) but works well in practice
notes

PZGEEQU()

factors is not guaranteed to reduce the condition number of
sub( a ) but works well in practice
notes

SLAREF()

wantz   (global input) logical
if .true., then apply any column reflections to z as well

ZCOMBAMAX1 ()

zcombamax1 finds the element having maximum real part absolute
value as well as its corresponding globl index
arguments

ZLAREF()

wantz   (global input) logical
if .true., then apply any column reflections to z as well

were

PCDBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCDBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCDTSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCDTTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGBTRF()

zero out any junk entries that were copie

PCGBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEBD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEBRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGECON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEEQU()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEHD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEHRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGELQ2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGELQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGELS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEQL2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEQLF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEQPF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEQR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEQRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGERFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGERQ2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGERQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGESV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGESVD()

assume that its process grid has dimension r x c. locr( k ) denotes
the number of elements of k that a process would receive if k were
locc( k ) denotes the number of elements of k that a process would

PCGESVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGETF2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGETRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGETRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGETRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGGQRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGGRQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHEEV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHEEVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHEGS2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHEGST()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHEGVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHENGST()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHENTRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHETD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHETRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHETTRD()

locp( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locq( k ) denotes the number of elements of k that a

PCLABRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLACGV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLACON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLACONSB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLACP2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLACP3()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLACPY()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLAEVSWP()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLANGE()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLAPIV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLAPV2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLAQGE()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLAQSY()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLARFB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLARFG()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLARFT()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLARZB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLARZT()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLASCL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLASE2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLASET()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLASMSUB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLASSQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLASWP()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLATRA()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLATRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLATRZ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLAUU2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLAUUM()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLAWIL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCMAX1()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPOCON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPOEQU()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPORFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPOSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPOSVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPOTF2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPOTRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPOTRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPOTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPTSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPTTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCSRSCL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCSTEIN()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the r processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCTRCON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCTREVC()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the r processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCTRRFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCTRTI2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCTRTRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCTRTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCTZRZF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNG2L()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNG2R()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNGL2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNGLQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNGQL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNGQR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNGR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNGRQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNM2L()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNM2R()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMBR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMHR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNML2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMLQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMQL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMQR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMR3()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMRQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMRZ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMTR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDDBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDDBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDDTSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDDTTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGBTRF()

zero out any junk entries that were copie

PDGBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEBD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEBRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGECON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEEQU()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEHD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEHRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGELQ2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGELQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGELS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEQL2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEQLF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEQPF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEQR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEQRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGERFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGERQ2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGERQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGESV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGESVD()

assume that its process grid has dimension r x c. locr( k ) denotes
the number of elements of k that a process would receive if k were
locc( k ) denotes the number of elements of k that a process would

PDGESVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGETF2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGETRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGETRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGETRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGGQRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGGRQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLABRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLACON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLACONSB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLACP2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLACP3()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLACPY()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLAEBZ()

= 1 - mmax : the last info intervals did not converge.
= mmax + 1 : more than mmax intervals were generated
=====================================================================

PDLAED2()

on exit, d contains the trailing (n-k) updated eigenvalues
(those which were deflated) sorted into increasing order
drow   (global input) integer

PDLAED3()

on exit, d contains the trailing (n-k) updated eigenvalues
(those which were deflated) sorted into increasing order
drow   (global input) integer

PDLAEVSWP()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLANGE()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLAPIV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLAPV2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLAQGE()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLAQSY()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLARED1D()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLARED2D()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLARFB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLARFG()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLARFT()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLARZB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLARZT()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLASCL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLASE2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLASET()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLASMSUB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLASSQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLASWP()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLATRA()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLATRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLATRZ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLAUU2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLAUUM()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLAWIL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORG2L()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORG2R()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORGL2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORGLQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORGQL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORGQR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORGR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORGRQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORM2L()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORM2R()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMBR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMHR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORML2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMLQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMQL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMQR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMR3()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMRQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMRZ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMTR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPOCON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPOEQU()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPORFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPOSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPOSVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPOTF2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPOTRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPOTRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPOTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPTSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPTTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDRSCL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSTEBZ()

> 0 :  some or all of the eigenvalues failed to converge or
were not computed
these eigenvalues are flagged by a negative block

PDSTEIN()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the r processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYEV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYEVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYGS2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYGST()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYGVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYNGST()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYNTRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYTD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYTRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYTTRD()

locp( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locq( k ) denotes the number of elements of k that a

PDTRCON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDTRRFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDTRTI2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDTRTRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDTRTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDTZRZF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDZSUM1()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSCSUM1()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSDBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSDBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSDTSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSDTTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGBTRF()

zero out any junk entries that were copie

PSGBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEBD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEBRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGECON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEEQU()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEHD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEHRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGELQ2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGELQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGELS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEQL2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEQLF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEQPF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEQR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEQRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGERFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGERQ2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGERQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGESV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGESVD()

assume that its process grid has dimension r x c. locr( k ) denotes
the number of elements of k that a process would receive if k were
locc( k ) denotes the number of elements of k that a process would

PSGESVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGETF2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGETRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGETRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGETRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGGQRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGGRQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLABRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLACON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLACONSB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLACP2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLACP3()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLACPY()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLAEBZ()

= 1 - mmax : the last info intervals did not converge.
= mmax + 1 : more than mmax intervals were generated
=====================================================================

PSLAED2()

on exit, d contains the trailing (n-k) updated eigenvalues
(those which were deflated) sorted into increasing order
drow   (global input) integer

PSLAED3()

on exit, d contains the trailing (n-k) updated eigenvalues
(those which were deflated) sorted into increasing order
drow   (global input) integer

PSLAEVSWP()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLANGE()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLAPIV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLAPV2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLAQGE()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLAQSY()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLARED1D()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLARED2D()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLARFB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLARFG()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLARFT()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLARZB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLARZT()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLASCL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLASE2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLASET()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLASMSUB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLASSQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLASWP()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLATRA()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLATRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLATRZ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLAUU2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLAUUM()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLAWIL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORG2L()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORG2R()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORGL2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORGLQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORGQL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORGQR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORGR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORGRQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORM2L()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORM2R()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMBR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMHR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORML2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMLQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMQL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMQR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMR3()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMRQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMRZ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMTR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPOCON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPOEQU()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPORFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPOSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPOSVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPOTF2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPOTRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPOTRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPOTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPTSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPTTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSRSCL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSTEBZ()

> 0 :  some or all of the eigenvalues failed to converge or
were not computed
these eigenvalues are flagged by a negative block

PSSTEIN()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the r processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYEV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYEVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYGS2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYGST()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYGVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYNGST()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYNTRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYTD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYTRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYTTRD()

locp( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locq( k ) denotes the number of elements of k that a

PSTRCON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSTRRFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSTRTI2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSTRTRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSTRTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSTZRZF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZDBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZDBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZDRSCL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZDTSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZDTTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGBTRF()

zero out any junk entries that were copie

PZGBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEBD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEBRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGECON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEEQU()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEHD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEHRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGELQ2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGELQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGELS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEQL2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEQLF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEQPF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEQR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEQRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGERFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGERQ2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGERQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGESV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGESVD()

assume that its process grid has dimension r x c. locr( k ) denotes
the number of elements of k that a process would receive if k were
locc( k ) denotes the number of elements of k that a process would

PZGESVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGETF2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGETRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGETRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGETRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGGQRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGGRQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHEEV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHEEVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHEGS2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHEGST()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHEGVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHENGST()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHENTRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHETD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHETRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHETTRD()

locp( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locq( k ) denotes the number of elements of k that a

PZLABRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLACGV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLACON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLACONSB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLACP2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLACP3()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLACPY()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLAEVSWP()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLANGE()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLAPIV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLAPV2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLAQGE()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLAQSY()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLARFB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLARFG()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLARFT()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLARZB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLARZT()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLASCL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLASE2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLASET()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLASMSUB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLASSQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLASWP()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLATRA()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLATRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLATRZ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLAUU2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLAUUM()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLAWIL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZMAX1()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPOCON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPOEQU()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPORFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPOSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPOSVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPOTF2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPOTRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPOTRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPOTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPTSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPTTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZSTEIN()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the r processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZTRCON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZTREVC()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the r processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZTRRFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZTRTI2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZTRTRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZTRTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZTZRZF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNG2L()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNG2R()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNGL2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNGLQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNGQL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNGQR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNGR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNGRQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNM2L()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNM2R()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMBR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMHR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNML2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMLQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMQL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMQR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMR3()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMRQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMRZ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMTR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

what

PDSTEBZ()

(only the first nsplit elements will actually be used, but
since the user cannot know a priori what value nsplit wil

PSSTEBZ()

(only the first nsplit elements will actually be used, but
since the user cannot know a priori what value nsplit wil

whatever

PCLACONSB()

processors, the first major loop (10) goes over the tridiagonal
and has each node store whatever values of the 7 it has tha
and can happen in no more than 3 locations per block assuming

PDLACONSB()

processors, the first major loop (10) goes over the tridiagonal
and has each node store whatever values of the 7 it has tha
and can happen in no more than 3 locations per block assuming

PSLACONSB()

processors, the first major loop (10) goes over the tridiagonal
and has each node store whatever values of the 7 it has tha
and can happen in no more than 3 locations per block assuming

PZLACONSB()

processors, the first major loop (10) goes over the tridiagonal
and has each node store whatever values of the 7 it has tha
and can happen in no more than 3 locations per block assuming

when

CDBTF2()

the band storage scheme is illustrated by the following example, when

CLAMSH()

that can be sent through.
clamsh should only be called when there are multiple shifts/bulge
unreduced hessenberg matrix because of two or more consecutive

CLAREF()

these serve the same purpose as itmp1,itmp2 but for z
when wantz is set
vecs    (global input) complex array of size 3*n (matrix size)

DDBTF2()

the band storage scheme is illustrated by the following example, when

DLAMSH ()

that can be sent through.
dlamsh should only be called when there are multiple shifts/bulge
unreduced hessenberg matrix because of two or more consecutive small

DLAREF()

these serve the same purpose as itmp1,itmp2 but for z
when wantz is set
vecs    (global input) double precision array of size 3*n (matrix

PCGELS()

where sub( b ) denotes b( ib:ib+m-1, jb:jb+nrhs-1 ) when trans = 'n
vectors b and solution vectors x can be handled in a single call;

PCGERFS()

this routine temporarily returns when n <= 1
the distributed submatrices op( a ) and op( af ) (respectively

PCGESVD()

to pcunmbr. nru is equal to the local number of rows of
the matrix u when distributed 1-dimensional "column" o
of columns of the matrix vt when distributed across

PCHEEVX()

an approximate eigenvalue is accepted as converged
when it is determined to lie in an interval [a,b

PCHEGVX()

an approximate eigenvalue is accepted as converged
when it is determined to lie in an interval [a,b

PCHENGST()

pchengst calls pchegst when uplo='u', hence pchengst provide

PCHENTRD()

codes (either the serial, chetrd, or the parallel code, pchettrd)
when the workspace provided by the user is adequate

PCLAHQR()

we first hit a border when mod(k1(ki)-1,hbl)=hbl-2 and we hi

PCLAMR1D()

i am not sure that this works correctly when ib and jb are not equa
with 1 used in its place.

PCLANGE()

the number of rows to be operated on i.e the number of rows
of the distributed submatrix sub( a ). when m = 0, pclang

PCLAPIV()

the following restrictions apply when ipiv must be transposed
descip(mb_) must equal desca(nb_)

PCLATTRS()

scale x by (1/abs(x(j)))*abs(a(j,j))*bignum
to avoid overflow when dividing by a(j,j)

PCMAX1()

when the result of a vector-oriented pblas call is a scalar, it wil
being operated on.  let x be a generic term for the input vector(s).

PCPORFS()

pcporfs improves the computed solution to a system of linear
equations when the coefficient matrix is hermitian positive definit
solutions.

PCTRRFS()

this routine temporarily returns when n <= 1
the distributed submatrices sub( x ) and sub( b ) should be

PCUNMBR()

here q and p**h are the unitary distributed matrices determined by
pcgebrd when reducing a complex distributed matrix a(ia:*,ja:*) t
as products of elementary reflectors h(i) and g(i) respectively.

PDGELS()

where sub( b ) denotes b( ib:ib+m-1, jb:jb+nrhs-1 ) when trans = 'n
vectors b and solution vectors x can be handled in a single call;

PDGERFS()

this routine temporarily returns when n <= 1
the distributed submatrices op( a ) and op( af ) (respectively

PDGESVD()

to pdormbr. nru is equal to the local number of rows of
the matrix u when distributed 1-dimensional "column" o
of columns of the matrix vt when distributed across

PDLAEBZ()

abstol  (input) double precision
the minimum (absolute) width of an interval. when an interva
magnitude) endpoint, then it is considered to be sufficiently

PDLAECV()

specifies the criterion for "convergence" of an interval.
= 0 : when an interval is narrower than abstol, or tha
it is considered to have "converged".

PDLAED1()

the first stage consists of deflating the size of the problem
when there are multiple eigenvalues or if there is a zero i
secular equation problem is reduced by one.  this stage is

PDLAED2()

sorted set.  then it tries to deflate the size of the problem.
there are two ways in which deflation can occur:  when two or mor
z vector.  for each such occurrence the order of the related secular

PDLAHQR()

we first hit a border when mod(k1(ki)-1,hbl)=hbl-2 and we hi

PDLAMCH()

t     = number of (base) digits in the mantissa
rnd   = 1.0 when rounding occurs in addition, 0.0 otherwis
rmin  = underflow threshold - base**(emin-1)

PDLAMR1D()

i am not sure that this works correctly when ib and jb are not equa
with 1 used in its place.

PDLANGE()

the number of rows to be operated on i.e the number of rows
of the distributed submatrix sub( a ). when m = 0, pdlang

PDLAPIV()

the following restrictions apply when ipiv must be transposed
descip(mb_) must equal desca(nb_)

PDORMBR()

here q and p**t are the orthogonal distributed matrices determined by
pdgebrd when reducing a real distributed matrix a(ia:*,ja:*) t
as products of elementary reflectors h(i) and g(i) respectively.

PDPORFS()

pdporfs improves the computed solution to a system of linear
equations when the coefficient matrix is symmetric positive definit
solutions.

PDSTEBZ()

will be used, where |t| means the 1-norm of t.
eigenvalues will be computed most accurately when abstol i
note : if eigenvectors are desired later by inverse iteration

PDSYEV()

sizemqrleft = the workspace requirement for pdormtr
when it's side argument is 'l'
with myprowc defined when a new context is created as:

PDSYEVX()

an approximate eigenvalue is accepted as converged
when it is determined to lie in an interval [a,b

PDSYGVX()

an approximate eigenvalue is accepted as converged
when it is determined to lie in an interval [a,b

PDSYNGST()

pdsyngst calls pdhegst when uplo='u', hence pdhengst provide

PDSYNTRD()

codes (either the serial, dsytrd, or the parallel code, pdsyttrd)
when the workspace provided by the user is adequate

PDTRRFS()

this routine temporarily returns when n <= 1
the distributed submatrices sub( x ) and sub( b ) should be

PDZSUM1()

when the result of a vector-oriented pblas call is a scalar, it wil
being operated on.  let x be a generic term for the input vector(s).

PJLAENV()

the following conventions have been used when calling pjlaenv fro
1)  opts is a concatenation of all of the character options to

PSCSUM1()

when the result of a vector-oriented pblas call is a scalar, it wil
being operated on.  let x be a generic term for the input vector(s).

PSGELS()

where sub( b ) denotes b( ib:ib+m-1, jb:jb+nrhs-1 ) when trans = 'n
vectors b and solution vectors x can be handled in a single call;

PSGERFS()

this routine temporarily returns when n <= 1
the distributed submatrices op( a ) and op( af ) (respectively

PSGESVD()

to psormbr. nru is equal to the local number of rows of
the matrix u when distributed 1-dimensional "column" o
of columns of the matrix vt when distributed across

PSLAEBZ()

abstol  (input) real
the minimum (absolute) width of an interval. when an interva
magnitude) endpoint, then it is considered to be sufficiently

PSLAECV()

specifies the criterion for "convergence" of an interval.
= 0 : when an interval is narrower than abstol, or tha
it is considered to have "converged".

PSLAED1()

the first stage consists of deflating the size of the problem
when there are multiple eigenvalues or if there is a zero i
secular equation problem is reduced by one.  this stage is

PSLAED2()

sorted set.  then it tries to deflate the size of the problem.
there are two ways in which deflation can occur:  when two or mor
z vector.  for each such occurrence the order of the related secular

PSLAHQR()

we first hit a border when mod(k1(ki)-1,hbl)=hbl-2 and we hi

PSLAMCH()

t     = number of (base) digits in the mantissa
rnd   = 1.0 when rounding occurs in addition, 0.0 otherwis
rmin  = underflow threshold - base**(emin-1)

PSLAMR1D()

i am not sure that this works correctly when ib and jb are not equa
with 1 used in its place.

PSLANGE()

the number of rows to be operated on i.e the number of rows
of the distributed submatrix sub( a ). when m = 0, pslang

PSLAPIV()

the following restrictions apply when ipiv must be transposed
descip(mb_) must equal desca(nb_)

PSORMBR()

here q and p**t are the orthogonal distributed matrices determined by
psgebrd when reducing a real distributed matrix a(ia:*,ja:*) t
as products of elementary reflectors h(i) and g(i) respectively.

PSPORFS()

psporfs improves the computed solution to a system of linear
equations when the coefficient matrix is symmetric positive definit
solutions.

PSSTEBZ()

will be used, where |t| means the 1-norm of t.
eigenvalues will be computed most accurately when abstol i
note : if eigenvectors are desired later by inverse iteration

PSSYEV()

sizemqrleft = the workspace requirement for psormtr
when it's side argument is 'l'
with myprowc defined when a new context is created as:

PSSYEVX()

an approximate eigenvalue is accepted as converged
when it is determined to lie in an interval [a,b

PSSYGVX()

an approximate eigenvalue is accepted as converged
when it is determined to lie in an interval [a,b

PSSYNGST()

pssyngst calls pshegst when uplo='u', hence pshengst provide

PSSYNTRD()

codes (either the serial, ssytrd, or the parallel code, pssyttrd)
when the workspace provided by the user is adequate

PSTRRFS()

this routine temporarily returns when n <= 1
the distributed submatrices sub( x ) and sub( b ) should be

PZGELS()

where sub( b ) denotes b( ib:ib+m-1, jb:jb+nrhs-1 ) when trans = 'n
vectors b and solution vectors x can be handled in a single call;

PZGERFS()

this routine temporarily returns when n <= 1
the distributed submatrices op( a ) and op( af ) (respectively

PZGESVD()

to pzunmbr. nru is equal to the local number of rows of
the matrix u when distributed 1-dimensional "column" o
of columns of the matrix vt when distributed across

PZHEEVX()

an approximate eigenvalue is accepted as converged
when it is determined to lie in an interval [a,b

PZHEGVX()

an approximate eigenvalue is accepted as converged
when it is determined to lie in an interval [a,b

PZHENGST()

pzhengst calls pzhegst when uplo='u', hence pzhengst provide

PZHENTRD()

codes (either the serial, zhetrd, or the parallel code, pzhettrd)
when the workspace provided by the user is adequate

PZLAHQR()

we first hit a border when mod(k1(ki)-1,hbl)=hbl-2 and we hi

PZLAMR1D()

i am not sure that this works correctly when ib and jb are not equa
with 1 used in its place.

PZLANGE()

the number of rows to be operated on i.e the number of rows
of the distributed submatrix sub( a ). when m = 0, pzlang

PZLAPIV()

the following restrictions apply when ipiv must be transposed
descip(mb_) must equal desca(nb_)

PZLATTRS()

scale x by (1/abs(x(j)))*abs(a(j,j))*bignum
to avoid overflow when dividing by a(j,j)

PZMAX1()

when the result of a vector-oriented pblas call is a scalar, it wil
being operated on.  let x be a generic term for the input vector(s).

PZPORFS()

pzporfs improves the computed solution to a system of linear
equations when the coefficient matrix is hermitian positive definit
solutions.

PZTRRFS()

this routine temporarily returns when n <= 1
the distributed submatrices sub( x ) and sub( b ) should be

PZUNMBR()

here q and p**h are the unitary distributed matrices determined by
pzgebrd when reducing a complex distributed matrix a(ia:*,ja:*) t
as products of elementary reflectors h(i) and g(i) respectively.

SDBTF2()

the band storage scheme is illustrated by the following example, when

SLAMSH ()

that can be sent through.
slamsh should only be called when there are multiple shifts/bulge
unreduced hessenberg matrix because of two or more consecutive small

SLAREF()

these serve the same purpose as itmp1,itmp2 but for z
when wantz is set
vecs    (global input) real array of size 3*n (matrix

ZDBTF2()

the band storage scheme is illustrated by the following example, when

ZLAMSH()

that can be sent through.
zlamsh should only be called when there are multiple shifts/bulge
unreduced hessenberg matrix because of two or more consecutive

ZLAREF()

these serve the same purpose as itmp1,itmp2 but for z
when wantz is set
vecs    (global input) complex*16 array of size 3*n (matrix size)

where

CDTTRF()

a = l * u
where l is a product of unit lower bidiagona
diagonal and first superdiagonal.

CPTTRSV()

u * x = b, or  u**h * x = b,
where l or u is the cholesky factor of a hermitian positiv
a = u**h*d*u or a = l*d*l**h (computed by cpttrf).

CTRMVT()

where x is an n element vector and  t is an n by

DDTTRF()

a = l * u
where l is a product of unit lower bidiagona
diagonal and first superdiagonal.

DPTTRSV()

l**t* x = b, or  l * x = b,
where l is the cholesky factor of a hermitian positiv
a = l*d*l**h (computed by dpttrf).

DSTEQR2()

determine where the matrix splits and choose ql or qr iteratio
element is smaller.

DTRMVT()

where x is an n element vector and  t is an n by

PCDBSV()

where a(1:n, ja:ja+n-1) is an n-by-n comple
matrix with bandwidth bwl, bwu.

PCDBTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PCDBTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n complex

PCDBTRSV()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PCDTSV()

where a(1:n, ja:ja+n-1) is an n-by-n comple
matrix.

PCDTTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PCDTTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n complex

PCDTTRSV()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PCGBSV()

where a(1:n, ja:ja+n-1) is an n-by-n comple
matrix with bandwidth bwl, bwu.

PCGBTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PCGBTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n complex

PCGEBD2()

where nb = mb_a = nb_a, iroffa = mod( ia-1, nb 
iacol = indxg2p( ja, nb, mycol, csrc_a, npcol ),

PCGEBRD()

where nb = mb_a = nb_a
iarow = indxg2p( ia, nb, myrow, rsrc_a, nprow ),

PCGEHD2()

to upper hessenberg form h by an unitary similarity transformation:
q' * sub( a ) * q = h, where

PCGEHRD()

to upper hessenberg form h by an unitary similarity transformation:
q' * sub( a ) * q = h, where

PCGELQ2()

lwork is local input and must be at least
lwork >= nq0 + max( 1, mp0 ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PCGELQF()

lwork is local input and must be at least
lwork >= mb_a * ( mp0 + nq0 + mb_a ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PCGELS()

where sub( b ) denotes b( ib:ib+m-1, jb:jb+nrhs-1 ) when trans = 'n
vectors b and solution vectors x can be handled in a single call;

PCGEQL2()

lwork is local input and must be at least
lwork >= mp0 + max( 1, nq0 ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PCGEQLF()

lwork is local input and must be at least
lwork >= nb_a * ( mp0 + nq0 + nb_a ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PCGEQPF()

where tau is a complex scalar, and v is a complex vector wit
a(ia+i-1:ia+m-1,ja+i-1).

PCGEQR2()

lwork is local input and must be at least
lwork >= mp0 + max( 1, nq0 ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PCGEQRF()

lwork is local input and must be at least
lwork >= nb_a * ( mp0 + nq0 + nb_a ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PCGERQ2()

lwork is local input and must be at least
lwork >= nq0 + max( 1, mp0 ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PCGERQF()

lwork is local input and must be at least
lwork >= mb_a * ( mp0 + nq0 + mb_a ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PCGESV()

where sub( a ) = a(ia:ia+n-1,ja:ja+n-1) is an n-by-n distribute
distributed matrices.

PCGESVD()

where sigma is an m-by-n matrix which is zero except for it
v is an n-by-n orthogonal matrix. the diagonal elements of sigma

PCGESVX()

where a(ia:ia+n-1,ja:ja+n-1) is an n-by-n matrix and x an

PCGETF2()

the factorization has the form sub( a ) = p * l * u, where p is 
elements (lower trapezoidal if m > n), and u is upper triangular

PCGETRF()

the factorization has the form sub( a ) = p * l * u, where p is 
ments (lower trapezoidal if m > n), and u is upper triangular

PCGETRI()

nb_a )
where lcm is the least common multiple of proces
end if

PCGGQRF()

where q is an n-by-n unitary matrix, z is a p-by-p unitary matrix

PCGGRQF()

where q is an n-by-n unitary matrix, z is a p-by-p unitary matrix

PCHEEV()

iarow.eq.izrow )
where

PCHEEVX()

where eps is the machine precision.  if abstol is less tha
where norm(t) is the 1-norm of the tridiagonal matrix

PCHEGVX()

where eps is the machine precision.  if abstol is less tha
where norm(t) is the 1-norm of the tridiagonal matrix

PCHENGST()

where nb = mb_a = nb_a
nq0 = numroc( n, nb, 0, 0, nprow ),

PCHENTRD()

tridiagonal form t by an unitary similarity transformation:
q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1)
features

PCHETD2()

tridiagonal form t by an unitary similarity transformation:
q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1)
notes

PCHETRD()

tridiagonal form t by an unitary similarity transformation:
q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1)
notes

PCHETTRD()

tridiagonal form t by an unitary similarity transformation:
q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1)
notes

PCLABRD()

where tauq and taup are complex scalars, and v and u are comple

PCLACGV()

pclacgv conjugates a complex vector of length n, sub( x ), where
x(ix:ix+n-1,jx) if incx = 1, and

PCLACON()

memory to an array of dimension locr(n+mod(iv-1,mb_v)). on
the final return, v = a*w, where est = norm(v)/norm(w

PCLACONSB()

up and left and a buffer to send right.  each of these buffers
is actually stored in one buffer buf where buf(istr1+1) start
the values are stored, if there are any values that a node

PCLACP2()

distributed matrix b.  no communication is performed, pclacp2
performs a local copy sub( a ) := sub( b ), where sub( a ) denote
pclacp2 requires that only dimension of the matrix operands is

PCLACPY()

distributed matrix b.  no communication is performed, pclacpy
performs a local copy sub( a ) := sub( b ), where sub( a ) denote

PCLAEVSWP()

pclaevswp moves the eigenvectors (potentially unsorted) from
where they are computed, to a scalapack standard block cycli

PCLAHRD()

where tau is a complex scalar, and v is a complex vector wit
a(ia+i+k:ia+n-1,ja+i-1), and tau in tau(ja+i-1).

PCLANGE()

where norm1 denotes the  one norm of a matrix (maximum column sum)
normf denotes the  frobenius norm of a matrix (square root of sum of

PCLAPIV()

ipiv    (local input) integer array, dimension (lipiv) where lipiv i
>= locr( ia+m-1 ) + mb_a      if pivroc='c' or 'c',

PCLARFB()

where lcmq = lcm / npcol with lcm = iclm( nprow, npcol )
iroffv = mod( iv-1, mb_v ), icoffv = mod( jv-1, nb_v ),

PCLARFG()

where alpha is a real scalar, and sub( x ) is an (n-1)-elemen
x(ix,jx:jx+n-2) if incx = descx(m_).  h is represented in the form

PCLARZB()

where lcmq = lcm / npcol with lcm = iclm( nprow, npcol )
iroffv = mod( iv-1, mb_v ), icoffv = mod( jv-1, nb_v ),

PCLASSQ()

where x( i ) = sub( x ) = abs( x( ix+(jx-1)*descx(m_)+(i-1)*incx ) )
ssq will then satisfy

PCLATRD()

where tau is a complex scalar, and v is a complex vector wit
a(ia:ia+i-2,ja+i), and tau in tau(ja+i-1).

PCLATRZ()

where z is an n-by-n unitary matrix and r is an m-by-m uppe

PCLATTRS()

compute grow = 1/g(j), where g(0) = max{x(i), i=1,...,n}

PCLAUU2()

pclauu2 computes the product u * u' or l' * l, where the triangula
the matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1).

PCLAUUM()

pclauum computes the product u * u' or l' * l, where the triangula
the distributed matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1).

PCLAWIL()

m       (global input) integer
on entry, this is where the transform starts (row m.

PCMAX1()

where sub( x ) denotes x(ix:ix+n-1,jx) if incx = 1

PCPBSV()

where a(1:n, ja:ja+n-1) is an n-by-n comple
matrix with bandwidth bw.

PCPBTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PCPBTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n complex

PCPBTRSV()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PCPOSV()

where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is an n-by-
denoting b(ib:ib+n-1,jb:jb+nrhs-1) are n-by-nrhs distributed

PCPOSVX()

where a(ia:ia+n-1,ja:ja+n-1) is an n-by-n matrix and x an

PCPOTF2()

where u is an upper triangular matrix and l is lower triangular
notes

PCPOTRF()

where u is an upper triangular matrix and l is lower triangular
notes

PCPOTRS()

where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is a n-by-
factorization sub( a ) = u**h*u or l*l**h computed by pcpotrf.

PCPTSV()

where a(1:n, ja:ja+n-1) is an n-by-n comple
matrix.

PCPTTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PCPTTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n complex

PCPTTRSV()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PCSRSCL()

where sub( x ) denotes x(ix:ix+n-1,jx:jx), if incx = 1

PCTREVC()

where y' denotes the conjugate transpose of the vector y
if all eigenvectors are requested, the routine may either return the

PCTRTRS()

where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is a triangula
n-by-nrhs distributed matrix denoted by sub( b ). a check is made

PCTZRZF()

where z is an n-by-n unitary matrix and r is an m-by-m uppe

PCUNG2L()

lwork is local input and must be at least
lwork >= mpa0 + max( 1, nqa0 ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PCUNG2R()

lwork is local input and must be at least
lwork >= mpa0 + max( 1, nqa0 ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PCUNGL2()

lwork is local input and must be at least
lwork >= nqa0 + max( 1, mpa0 ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PCUNGLQ()

lwork is local input and must be at least
lwork >= mb_a * ( mpa0 + nqa0 + mb_a ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PCUNGQL()

lwork is local input and must be at least
lwork >= nb_a * ( nqa0 + mpa0 + nb_a ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PCUNGQR()

lwork is local input and must be at least
lwork >= nb_a * ( nqa0 + mpa0 + nb_a ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PCUNGR2()

lwork is local input and must be at least
lwork >= nqa0 + max( 1, mpa0 ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PCUNGRQ()

lwork is local input and must be at least
lwork >= mb_a * ( mpa0 + nqa0 + mb_a ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PCUNM2L()

where q is a complex unitary distributed matrix defined as th

PCUNM2R()

where q is a complex unitary distributed matrix defined as th

PCUNMBR()

where lcmp = lcm / nprow, lcmq = lcm / npcol, wit

PCUNMHR()

where q is a complex unitary distributed matrix of order nq, wit
product of ihi-ilo elementary reflectors, as returned by pcgehrd:

PCUNML2()

where q is a complex unitary distributed matrix defined as th

PCUNMLQ()

where q is a complex unitary distributed matrix defined as th

PCUNMQL()

where q is a complex unitary distributed matrix defined as th

PCUNMQR()

where q is a complex unitary distributed matrix defined as th

PCUNMR2()

where q is a complex unitary distributed matrix defined as th

PCUNMR3()

where q is a complex unitary distributed matrix defined as th

PCUNMRQ()

where q is a complex unitary distributed matrix defined as th

PCUNMRZ()

where q is a complex unitary distributed matrix defined as th

PCUNMTR()

where q is a complex unitary distributed matrix of order nq, wit
product of nq-1 elementary reflectors, as returned by pchetrd:

PDDBSV()

where a(1:n, ja:ja+n-1) is an n-by-n rea
matrix with bandwidth bwl, bwu.

PDDBTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PDDBTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n real

PDDBTRSV()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PDDTSV()

where a(1:n, ja:ja+n-1) is an n-by-n rea
matrix.

PDDTTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PDDTTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n real

PDDTTRSV()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PDGBSV()

where a(1:n, ja:ja+n-1) is an n-by-n rea
matrix with bandwidth bwl, bwu.

PDGBTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PDGBTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n real

PDGEBD2()

where nb = mb_a = nb_a, iroffa = mod( ia-1, nb 
iacol = indxg2p( ja, nb, mycol, csrc_a, npcol ),

PDGEBRD()

where nb = mb_a = nb_a
iarow = indxg2p( ia, nb, myrow, rsrc_a, nprow ),

PDGEHD2()

to upper hessenberg form h by an orthogonal similarity transforma-
tion:  q' * sub( a ) * q = h, where

PDGEHRD()

to upper hessenberg form h by an orthogonal similarity transforma-
tion:  q' * sub( a ) * q = h, where

PDGELQ2()

lwork is local input and must be at least
lwork >= nq0 + max( 1, mp0 ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PDGELQF()

lwork is local input and must be at least
lwork >= mb_a * ( mp0 + nq0 + mb_a ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PDGELS()

where sub( b ) denotes b( ib:ib+m-1, jb:jb+nrhs-1 ) when trans = 'n
vectors b and solution vectors x can be handled in a single call;

PDGEQL2()

lwork is local input and must be at least
lwork >= mp0 + max( 1, nq0 ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PDGEQLF()

lwork is local input and must be at least
lwork >= nb_a * ( mp0 + nq0 + nb_a ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PDGEQPF()

where tau is a real scalar, and v is a real vector with v(1:i-1) =

PDGEQR2()

lwork is local input and must be at least
lwork >= mp0 + max( 1, nq0 ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PDGEQRF()

lwork is local input and must be at least
lwork >= nb_a * ( mp0 + nq0 + nb_a ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PDGERQ2()

lwork is local input and must be at least
lwork >= nq0 + max( 1, mp0 ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PDGERQF()

lwork is local input and must be at least
lwork >= mb_a * ( mp0 + nq0 + mb_a ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PDGESV()

where sub( a ) = a(ia:ia+n-1,ja:ja+n-1) is an n-by-n distribute
distributed matrices.

PDGESVD()

where sigma is an m-by-n matrix which is zero except for it
v is an n-by-n orthogonal matrix. the diagonal elements of sigma

PDGESVX()

where a(ia:ia+n-1,ja:ja+n-1) is an n-by-n matrix and x an

PDGETF2()

the factorization has the form sub( a ) = p * l * u, where p is 
elements (lower trapezoidal if m > n), and u is upper triangular

PDGETRF()

the factorization has the form sub( a ) = p * l * u, where p is 
ments (lower trapezoidal if m > n), and u is upper triangular

PDGETRI()

nb_a )
where lcm is the least common multiple of proces
end if

PDGGQRF()

where q is an n-by-n orthogonal matrix, z is a p-by-p orthogona

PDGGRQF()

where q is an n-by-n orthogonal matrix, z is a p-by-p orthogona

PDLABRD()

where tauq and taup are real scalars, and v and u are real vectors
if m >= n, v(1:i-1) = 0, v(i) = 1, and v(i:m) is stored on exit in

PDLACON()

memory to an array of dimension locr(n+mod(iv-1,mb_v)). on
the final return, v = a*w, where est = norm(v)/norm(w

PDLACONSB()

up and left and a buffer to send right.  each of these buffers
is actually stored in one buffer buf where buf(istr1+1) start
the values are stored, if there are any values that a node

PDLACP2()

distributed matrix b.  no communication is performed, pdlacp2
performs a local copy sub( a ) := sub( b ), where sub( a ) denote
pdlacp2 requires that only dimension of the matrix operands is

PDLACPY()

distributed matrix b.  no communication is performed, pdlacpy
performs a local copy sub( a ) := sub( b ), where sub( a ) denote

PDLAEBZ()

pdlaebz contains the iteration loop which computes the eigenvalues
contained in the input intervals [ intvl(2*j-1), intvl(2*j) ] where
the count of eigenvalues of a symmetric tridiagonal matrix less than

PDLAED1()

where z = q'u, u is a vector of length n with ones in th

PDLAEVSWP()

pdlaevswp moves the eigenvectors (potentially unsorted) from
where they are computed, to a scalapack standard block cycli

PDLAHRD()

where tau is a real scalar, and v is a real vector wit
a(ia+i+k:ia+n-1,ja+i-1), and tau in tau(ja+i-1).

PDLAMCH()

where
eps   = relative machine precision

PDLANGE()

where norm1 denotes the  one norm of a matrix (maximum column sum)
normf denotes the  frobenius norm of a matrix (square root of sum of

PDLAPDCT()

implementation of the sturm sequence loop. this must be at
least max_j |e(j)^2| *safe_min, and at least safe_min, where
without overflow.

PDLAPIV()

ipiv    (local input) integer array, dimension (lipiv) where lipiv i
>= locr( ia+m-1 ) + mb_a      if pivroc='c' or 'c',

PDLARFB()

where lcmq = lcm / npcol with lcm = iclm( nprow, npcol )
iroffv = mod( iv-1, mb_v ), icoffv = mod( jv-1, nb_v ),

PDLARFG()

where alpha is a scalar, and sub( x ) is an (n-1)-element rea
incx = descx(m_).  h is represented in the form

PDLARZB()

where lcmq = lcm / npcol with lcm = iclm( nprow, npcol )
iroffv = mod( iv-1, mb_v ), icoffv = mod( jv-1, nb_v ),

PDLASRT()

lwork = max( n, np * ( nb + nq ))
where
nq = numroc( n, nb, mycol, descq( csrc_ ), npcol )

PDLASSQ()

where  x( i ) = sub( x ) = x( ix+(jx-1)*descx(m_)+(i-1)*incx )
value

PDLATRD()

where tau is a real scalar, and v is a real vector wit
a(ia:ia+i-2,ja+i), and tau in tau(ja+i-1).

PDLATRZ()

where z is an n-by-n orthogonal matrix and r is an m-by-m uppe

PDLAUU2()

pdlauu2 computes the product u * u' or l' * l, where the triangula
the matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1).

PDLAUUM()

pdlauum computes the product u * u' or l' * l, where the triangula
the distributed matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1).

PDLAWIL()

m       (global input) integer
on entry, this is where the transform starts (row m.

PDORG2L()

lwork is local input and must be at least
lwork >= mpa0 + max( 1, nqa0 ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PDORG2R()

lwork is local input and must be at least
lwork >= mpa0 + max( 1, nqa0 ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PDORGL2()

lwork is local input and must be at least
lwork >= nqa0 + max( 1, mpa0 ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PDORGLQ()

lwork is local input and must be at least
lwork >= mb_a * ( mpa0 + nqa0 + mb_a ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PDORGQL()

lwork is local input and must be at least
lwork >= nb_a * ( nqa0 + mpa0 + nb_a ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PDORGQR()

lwork is local input and must be at least
lwork >= nb_a * ( nqa0 + mpa0 + nb_a ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PDORGR2()

lwork is local input and must be at least
lwork >= nqa0 + max( 1, mpa0 ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PDORGRQ()

lwork is local input and must be at least
lwork >= mb_a * ( mpa0 + nqa0 + mb_a ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PDORM2L()

where q is a real orthogonal distributed matrix defined as th

PDORM2R()

where q is a real orthogonal distributed matrix defined as th

PDORMBR()

where lcmp = lcm / nprow, lcmq = lcm / npcol, wit

PDORMHR()

where q is a real orthogonal distributed matrix of order nq, wit
product of ihi-ilo elementary reflectors, as returned by pdgehrd:

PDORML2()

where q is a real orthogonal distributed matrix defined as th

PDORMLQ()

where q is a real orthogonal distributed matrix defined as th

PDORMQL()

where q is a real orthogonal distributed matrix defined as th

PDORMQR()

where q is a real orthogonal distributed matrix defined as th

PDORMR2()

where q is a real orthogonal distributed matrix defined as th

PDORMR3()

where q is a real orthogonal distributed matrix defined as th

PDORMRQ()

where q is a real orthogonal distributed matrix defined as th

PDORMRZ()

where q is a real orthogonal distributed matrix defined as th

PDORMTR()

where q is a real orthogonal distributed matrix of order nq, wit
product of nq-1 elementary reflectors, as returned by pdsytrd:

PDPBSV()

where a(1:n, ja:ja+n-1) is an n-by-n rea
matrix with bandwidth bw.

PDPBTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PDPBTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n real

PDPBTRSV()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PDPOSV()

where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is an n-by-
denoting b(ib:ib+n-1,jb:jb+nrhs-1) are n-by-nrhs distributed

PDPOSVX()

where a(ia:ia+n-1,ja:ja+n-1) is an n-by-n matrix and x an

PDPOTF2()

where u is an upper triangular matrix and l is lower triangular
notes

PDPOTRF()

where u is an upper triangular matrix and l is lower triangular
notes

PDPOTRS()

where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is a n-by-
factorization sub( a ) = u**t*u or l*l**t computed by pdpotrf.

PDPTSV()

where a(1:n, ja:ja+n-1) is an n-by-n rea
matrix.

PDPTTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PDPTTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n real

PDPTTRSV()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PDRSCL()

where sub( x ) denotes x(ix:ix+n-1,jx:jx), if incx = 1

PDSTEBZ()

less.  if abstol is less than or equal to zero, then ulp*|t|
will be used, where |t| means the 1-norm of t
set to the underflow threshold dlamch('u'), not zero.

PDSYEV()

lwork >= 5*n + sizesytrd + 1
where
and is max( nb * ( np +1 ), 3 * nb )

PDSYEVX()

where eps is the machine precision.  if abstol is less tha
where norm(t) is the 1-norm of the tridiagonal matrix

PDSYGVX()

where eps is the machine precision.  if abstol is less tha
where norm(t) is the 1-norm of the tridiagonal matrix

PDSYNGST()

where nb = mb_a = nb_a
nq0 = numroc( n, nb, 0, 0, nprow ),

PDSYNTRD()

tridiagonal form t by an orthogonal similarity transformation:
q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1)
features

PDSYTD2()

tridiagonal form t by an orthogonal similarity transformation:
q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1)
notes

PDSYTRD()

tridiagonal form t by an orthogonal similarity transformation:
q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1)
notes

PDSYTTRD()

tridiagonal form t by an unitary similarity transformation:
q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1)
notes

PDTRTRS()

where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is a triangula
n-by-nrhs distributed matrix denoted by sub( b ). a check is made

PDTZRZF()

where z is an n-by-n orthogonal matrix and r is an m-by-m uppe

PDZSUM1()

where sub( x ) denotes x(ix:ix+n-1,jx:jx), if incx = 1

PSCSUM1()

where sub( x ) denotes x(ix:ix+n-1,jx:jx), if incx = 1

PSDBSV()

where a(1:n, ja:ja+n-1) is an n-by-n rea
matrix with bandwidth bwl, bwu.

PSDBTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PSDBTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n real

PSDBTRSV()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PSDTSV()

where a(1:n, ja:ja+n-1) is an n-by-n rea
matrix.

PSDTTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PSDTTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n real

PSDTTRSV()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PSGBSV()

where a(1:n, ja:ja+n-1) is an n-by-n rea
matrix with bandwidth bwl, bwu.

PSGBTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PSGBTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n real

PSGEBD2()

where nb = mb_a = nb_a, iroffa = mod( ia-1, nb 
iacol = indxg2p( ja, nb, mycol, csrc_a, npcol ),

PSGEBRD()

where nb = mb_a = nb_a
iarow = indxg2p( ia, nb, myrow, rsrc_a, nprow ),

PSGEHD2()

to upper hessenberg form h by an orthogonal similarity transforma-
tion:  q' * sub( a ) * q = h, where

PSGEHRD()

to upper hessenberg form h by an orthogonal similarity transforma-
tion:  q' * sub( a ) * q = h, where

PSGELQ2()

lwork is local input and must be at least
lwork >= nq0 + max( 1, mp0 ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PSGELQF()

lwork is local input and must be at least
lwork >= mb_a * ( mp0 + nq0 + mb_a ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PSGELS()

where sub( b ) denotes b( ib:ib+m-1, jb:jb+nrhs-1 ) when trans = 'n
vectors b and solution vectors x can be handled in a single call;

PSGEQL2()

lwork is local input and must be at least
lwork >= mp0 + max( 1, nq0 ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PSGEQLF()

lwork is local input and must be at least
lwork >= nb_a * ( mp0 + nq0 + nb_a ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PSGEQPF()

where tau is a real scalar, and v is a real vector with v(1:i-1) =

PSGEQR2()

lwork is local input and must be at least
lwork >= mp0 + max( 1, nq0 ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PSGEQRF()

lwork is local input and must be at least
lwork >= nb_a * ( mp0 + nq0 + nb_a ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PSGERQ2()

lwork is local input and must be at least
lwork >= nq0 + max( 1, mp0 ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PSGERQF()

lwork is local input and must be at least
lwork >= mb_a * ( mp0 + nq0 + mb_a ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PSGESV()

where sub( a ) = a(ia:ia+n-1,ja:ja+n-1) is an n-by-n distribute
distributed matrices.

PSGESVD()

where sigma is an m-by-n matrix which is zero except for it
v is an n-by-n orthogonal matrix. the diagonal elements of sigma

PSGESVX()

where a(ia:ia+n-1,ja:ja+n-1) is an n-by-n matrix and x an

PSGETF2()

the factorization has the form sub( a ) = p * l * u, where p is 
elements (lower trapezoidal if m > n), and u is upper triangular

PSGETRF()

the factorization has the form sub( a ) = p * l * u, where p is 
ments (lower trapezoidal if m > n), and u is upper triangular

PSGETRI()

nb_a )
where lcm is the least common multiple of proces
end if

PSGGQRF()

where q is an n-by-n orthogonal matrix, z is a p-by-p orthogona

PSGGRQF()

where q is an n-by-n orthogonal matrix, z is a p-by-p orthogona

PSLABRD()

where tauq and taup are real scalars, and v and u are real vectors
if m >= n, v(1:i-1) = 0, v(i) = 1, and v(i:m) is stored on exit in

PSLACON()

memory to an array of dimension locr(n+mod(iv-1,mb_v)). on
the final return, v = a*w, where est = norm(v)/norm(w

PSLACONSB()

up and left and a buffer to send right.  each of these buffers
is actually stored in one buffer buf where buf(istr1+1) start
the values are stored, if there are any values that a node

PSLACP2()

distributed matrix b.  no communication is performed, pslacp2
performs a local copy sub( a ) := sub( b ), where sub( a ) denote
pslacp2 requires that only dimension of the matrix operands is

PSLACPY()

distributed matrix b.  no communication is performed, pslacpy
performs a local copy sub( a ) := sub( b ), where sub( a ) denote

PSLAEBZ()

pslaebz contains the iteration loop which computes the eigenvalues
contained in the input intervals [ intvl(2*j-1), intvl(2*j) ] where
the count of eigenvalues of a symmetric tridiagonal matrix less than

PSLAED1()

where z = q'u, u is a vector of length n with ones in th

PSLAEVSWP()

pslaevswp moves the eigenvectors (potentially unsorted) from
where they are computed, to a scalapack standard block cycli

PSLAHRD()

where tau is a real scalar, and v is a real vector wit
a(ia+i+k:ia+n-1,ja+i-1), and tau in tau(ja+i-1).

PSLAMCH()

where
eps   = relative machine precision

PSLANGE()

where norm1 denotes the  one norm of a matrix (maximum column sum)
normf denotes the  frobenius norm of a matrix (square root of sum of

PSLAPDCT()

implementation of the sturm sequence loop. this must be at
least max_j |e(j)^2| *safe_min, and at least safe_min, where
without overflow.

PSLAPIV()

ipiv    (local input) integer array, dimension (lipiv) where lipiv i
>= locr( ia+m-1 ) + mb_a      if pivroc='c' or 'c',

PSLARFB()

where lcmq = lcm / npcol with lcm = iclm( nprow, npcol )
iroffv = mod( iv-1, mb_v ), icoffv = mod( jv-1, nb_v ),

PSLARFG()

where alpha is a scalar, and sub( x ) is an (n-1)-element rea
incx = descx(m_).  h is represented in the form

PSLARZB()

where lcmq = lcm / npcol with lcm = iclm( nprow, npcol )
iroffv = mod( iv-1, mb_v ), icoffv = mod( jv-1, nb_v ),

PSLASRT()

lwork = max( n, np * ( nb + nq ))
where
nq = numroc( n, nb, mycol, descq( csrc_ ), npcol )

PSLASSQ()

where  x( i ) = sub( x ) = x( ix+(jx-1)*descx(m_)+(i-1)*incx )
value

PSLATRD()

where tau is a real scalar, and v is a real vector wit
a(ia:ia+i-2,ja+i), and tau in tau(ja+i-1).

PSLATRZ()

where z is an n-by-n orthogonal matrix and r is an m-by-m uppe

PSLAUU2()

pslauu2 computes the product u * u' or l' * l, where the triangula
the matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1).

PSLAUUM()

pslauum computes the product u * u' or l' * l, where the triangula
the distributed matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1).

PSLAWIL()

m       (global input) integer
on entry, this is where the transform starts (row m.

PSORG2L()

lwork is local input and must be at least
lwork >= mpa0 + max( 1, nqa0 ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PSORG2R()

lwork is local input and must be at least
lwork >= mpa0 + max( 1, nqa0 ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PSORGL2()

lwork is local input and must be at least
lwork >= nqa0 + max( 1, mpa0 ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PSORGLQ()

lwork is local input and must be at least
lwork >= mb_a * ( mpa0 + nqa0 + mb_a ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PSORGQL()

lwork is local input and must be at least
lwork >= nb_a * ( nqa0 + mpa0 + nb_a ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PSORGQR()

lwork is local input and must be at least
lwork >= nb_a * ( nqa0 + mpa0 + nb_a ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PSORGR2()

lwork is local input and must be at least
lwork >= nqa0 + max( 1, mpa0 ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PSORGRQ()

lwork is local input and must be at least
lwork >= mb_a * ( mpa0 + nqa0 + mb_a ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PSORM2L()

where q is a real orthogonal distributed matrix defined as th

PSORM2R()

where q is a real orthogonal distributed matrix defined as th

PSORMBR()

where lcmp = lcm / nprow, lcmq = lcm / npcol, wit

PSORMHR()

where q is a real orthogonal distributed matrix of order nq, wit
product of ihi-ilo elementary reflectors, as returned by psgehrd:

PSORML2()

where q is a real orthogonal distributed matrix defined as th

PSORMLQ()

where q is a real orthogonal distributed matrix defined as th

PSORMQL()

where q is a real orthogonal distributed matrix defined as th

PSORMQR()

where q is a real orthogonal distributed matrix defined as th

PSORMR2()

where q is a real orthogonal distributed matrix defined as th

PSORMR3()

where q is a real orthogonal distributed matrix defined as th

PSORMRQ()

where q is a real orthogonal distributed matrix defined as th

PSORMRZ()

where q is a real orthogonal distributed matrix defined as th

PSORMTR()

where q is a real orthogonal distributed matrix of order nq, wit
product of nq-1 elementary reflectors, as returned by pssytrd:

PSPBSV()

where a(1:n, ja:ja+n-1) is an n-by-n rea
matrix with bandwidth bw.

PSPBTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PSPBTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n real

PSPBTRSV()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PSPOSV()

where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is an n-by-
denoting b(ib:ib+n-1,jb:jb+nrhs-1) are n-by-nrhs distributed

PSPOSVX()

where a(ia:ia+n-1,ja:ja+n-1) is an n-by-n matrix and x an

PSPOTF2()

where u is an upper triangular matrix and l is lower triangular
notes

PSPOTRF()

where u is an upper triangular matrix and l is lower triangular
notes

PSPOTRS()

where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is a n-by-
factorization sub( a ) = u**t*u or l*l**t computed by pspotrf.

PSPTSV()

where a(1:n, ja:ja+n-1) is an n-by-n rea
matrix.

PSPTTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PSPTTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n real

PSPTTRSV()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PSRSCL()

where sub( x ) denotes x(ix:ix+n-1,jx:jx), if incx = 1

PSSTEBZ()

less.  if abstol is less than or equal to zero, then ulp*|t|
will be used, where |t| means the 1-norm of t
set to the underflow threshold slamch('u'), not zero.

PSSYEV()

lwork >= 5*n + sizesytrd + 1
where
and is max( nb * ( np +1 ), 3 * nb )

PSSYEVX()

where eps is the machine precision.  if abstol is less tha
where norm(t) is the 1-norm of the tridiagonal matrix

PSSYGVX()

where eps is the machine precision.  if abstol is less tha
where norm(t) is the 1-norm of the tridiagonal matrix

PSSYNGST()

where nb = mb_a = nb_a
nq0 = numroc( n, nb, 0, 0, nprow ),

PSSYNTRD()

tridiagonal form t by an orthogonal similarity transformation:
q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1)
features

PSSYTD2()

tridiagonal form t by an orthogonal similarity transformation:
q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1)
notes

PSSYTRD()

tridiagonal form t by an orthogonal similarity transformation:
q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1)
notes

PSSYTTRD()

tridiagonal form t by an unitary similarity transformation:
q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1)
notes

PSTRTRS()

where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is a triangula
n-by-nrhs distributed matrix denoted by sub( b ). a check is made

PSTZRZF()

where z is an n-by-n orthogonal matrix and r is an m-by-m uppe

PZDBSV()

where a(1:n, ja:ja+n-1) is an n-by-n comple
matrix with bandwidth bwl, bwu.

PZDBTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PZDBTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n complex

PZDBTRSV()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PZDRSCL()

where sub( x ) denotes x(ix:ix+n-1,jx:jx), if incx = 1

PZDTSV()

where a(1:n, ja:ja+n-1) is an n-by-n comple
matrix.

PZDTTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PZDTTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n complex

PZDTTRSV()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PZGBSV()

where a(1:n, ja:ja+n-1) is an n-by-n comple
matrix with bandwidth bwl, bwu.

PZGBTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PZGBTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n complex

PZGEBD2()

where nb = mb_a = nb_a, iroffa = mod( ia-1, nb 
iacol = indxg2p( ja, nb, mycol, csrc_a, npcol ),

PZGEBRD()

where nb = mb_a = nb_a
iarow = indxg2p( ia, nb, myrow, rsrc_a, nprow ),

PZGEHD2()

to upper hessenberg form h by an unitary similarity transformation:
q' * sub( a ) * q = h, where

PZGEHRD()

to upper hessenberg form h by an unitary similarity transformation:
q' * sub( a ) * q = h, where

PZGELQ2()

lwork is local input and must be at least
lwork >= nq0 + max( 1, mp0 ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PZGELQF()

lwork is local input and must be at least
lwork >= mb_a * ( mp0 + nq0 + mb_a ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PZGELS()

where sub( b ) denotes b( ib:ib+m-1, jb:jb+nrhs-1 ) when trans = 'n
vectors b and solution vectors x can be handled in a single call;

PZGEQL2()

lwork is local input and must be at least
lwork >= mp0 + max( 1, nq0 ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PZGEQLF()

lwork is local input and must be at least
lwork >= nb_a * ( mp0 + nq0 + nb_a ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PZGEQPF()

where tau is a complex scalar, and v is a complex vector wit
a(ia+i-1:ia+m-1,ja+i-1).

PZGEQR2()

lwork is local input and must be at least
lwork >= mp0 + max( 1, nq0 ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PZGEQRF()

lwork is local input and must be at least
lwork >= nb_a * ( mp0 + nq0 + nb_a ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PZGERQ2()

lwork is local input and must be at least
lwork >= nq0 + max( 1, mp0 ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PZGERQF()

lwork is local input and must be at least
lwork >= mb_a * ( mp0 + nq0 + mb_a ), where
iroff = mod( ia-1, mb_a ), icoff = mod( ja-1, nb_a ),

PZGESV()

where sub( a ) = a(ia:ia+n-1,ja:ja+n-1) is an n-by-n distribute
distributed matrices.

PZGESVD()

where sigma is an m-by-n matrix which is zero except for it
v is an n-by-n orthogonal matrix. the diagonal elements of sigma

PZGESVX()

where a(ia:ia+n-1,ja:ja+n-1) is an n-by-n matrix and x an

PZGETF2()

the factorization has the form sub( a ) = p * l * u, where p is 
elements (lower trapezoidal if m > n), and u is upper triangular

PZGETRF()

the factorization has the form sub( a ) = p * l * u, where p is 
ments (lower trapezoidal if m > n), and u is upper triangular

PZGETRI()

nb_a )
where lcm is the least common multiple of proces
end if

PZGGQRF()

where q is an n-by-n unitary matrix, z is a p-by-p unitary matrix

PZGGRQF()

where q is an n-by-n unitary matrix, z is a p-by-p unitary matrix

PZHEEV()

iarow.eq.izrow )
where

PZHEEVX()

where eps is the machine precision.  if abstol is less tha
where norm(t) is the 1-norm of the tridiagonal matrix

PZHEGVX()

where eps is the machine precision.  if abstol is less tha
where norm(t) is the 1-norm of the tridiagonal matrix

PZHENGST()

where nb = mb_a = nb_a
nq0 = numroc( n, nb, 0, 0, nprow ),

PZHENTRD()

tridiagonal form t by an unitary similarity transformation:
q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1)
features

PZHETD2()

tridiagonal form t by an unitary similarity transformation:
q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1)
notes

PZHETRD()

tridiagonal form t by an unitary similarity transformation:
q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1)
notes

PZHETTRD()

tridiagonal form t by an unitary similarity transformation:
q' * sub( a ) * q = t, where sub( a ) = a(ia:ia+n-1,ja:ja+n-1)
notes

PZLABRD()

where tauq and taup are complex scalars, and v and u are comple

PZLACGV()

pzlacgv conjugates a complex vector of length n, sub( x ), where
x(ix:ix+n-1,jx) if incx = 1, and

PZLACON()

memory to an array of dimension locr(n+mod(iv-1,mb_v)). on
the final return, v = a*w, where est = norm(v)/norm(w

PZLACONSB()

up and left and a buffer to send right.  each of these buffers
is actually stored in one buffer buf where buf(istr1+1) start
the values are stored, if there are any values that a node

PZLACP2()

distributed matrix b.  no communication is performed, pzlacp2
performs a local copy sub( a ) := sub( b ), where sub( a ) denote
pzlacp2 requires that only dimension of the matrix operands is

PZLACPY()

distributed matrix b.  no communication is performed, pzlacpy
performs a local copy sub( a ) := sub( b ), where sub( a ) denote

PZLAEVSWP()

pzlaevswp moves the eigenvectors (potentially unsorted) from
where they are computed, to a scalapack standard block cycli

PZLAHRD()

where tau is a complex scalar, and v is a complex vector wit
a(ia+i+k:ia+n-1,ja+i-1), and tau in tau(ja+i-1).

PZLANGE()

where norm1 denotes the  one norm of a matrix (maximum column sum)
normf denotes the  frobenius norm of a matrix (square root of sum of

PZLAPIV()

ipiv    (local input) integer array, dimension (lipiv) where lipiv i
>= locr( ia+m-1 ) + mb_a      if pivroc='c' or 'c',

PZLARFB()

where lcmq = lcm / npcol with lcm = iclm( nprow, npcol )
iroffv = mod( iv-1, mb_v ), icoffv = mod( jv-1, nb_v ),

PZLARFG()

where alpha is a real scalar, and sub( x ) is an (n-1)-elemen
x(ix,jx:jx+n-2) if incx = descx(m_).  h is represented in the form

PZLARZB()

where lcmq = lcm / npcol with lcm = iclm( nprow, npcol )
iroffv = mod( iv-1, mb_v ), icoffv = mod( jv-1, nb_v ),

PZLASSQ()

where x( i ) = sub( x ) = abs( x( ix+(jx-1)*descx(m_)+(i-1)*incx ) )
ssq will then satisfy

PZLATRD()

where tau is a complex scalar, and v is a complex vector wit
a(ia:ia+i-2,ja+i), and tau in tau(ja+i-1).

PZLATRZ()

where z is an n-by-n unitary matrix and r is an m-by-m uppe

PZLATTRS()

compute grow = 1/g(j), where g(0) = max{x(i), i=1,...,n}

PZLAUU2()

pzlauu2 computes the product u * u' or l' * l, where the triangula
the matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1).

PZLAUUM()

pzlauum computes the product u * u' or l' * l, where the triangula
the distributed matrix sub( a ) = a(ia:ia+n-1,ja:ja+n-1).

PZLAWIL()

m       (global input) integer
on entry, this is where the transform starts (row m.

PZMAX1()

where sub( x ) denotes x(ix:ix+n-1,jx) if incx = 1

PZPBSV()

where a(1:n, ja:ja+n-1) is an n-by-n comple
matrix with bandwidth bw.

PZPBTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PZPBTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n complex

PZPBTRSV()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PZPOSV()

where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is an n-by-
denoting b(ib:ib+n-1,jb:jb+nrhs-1) are n-by-nrhs distributed

PZPOSVX()

where a(ia:ia+n-1,ja:ja+n-1) is an n-by-n matrix and x an

PZPOTF2()

where u is an upper triangular matrix and l is lower triangular
notes

PZPOTRF()

where u is an upper triangular matrix and l is lower triangular
notes

PZPOTRS()

where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is a n-by-
factorization sub( a ) = u**h*u or l*l**h computed by pzpotrf.

PZPTSV()

where a(1:n, ja:ja+n-1) is an n-by-n comple
matrix.

PZPTTRF()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PZPTTRS()

where a(1:n, ja:ja+n-1) is the matrix used to produce the factor
a(1:n, ja:ja+n-1) is an n-by-n complex

PZPTTRSV()

form a new blacs grid (the "standard form" grid) with only procs
holding part of the matrix, of size 1xnp where np is adjusted

PZTREVC()

where y' denotes the conjugate transpose of the vector y
if all eigenvectors are requested, the routine may either return the

PZTRTRS()

where sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1) and is a triangula
n-by-nrhs distributed matrix denoted by sub( b ). a check is made

PZTZRZF()

where z is an n-by-n unitary matrix and r is an m-by-m uppe

PZUNG2L()

lwork is local input and must be at least
lwork >= mpa0 + max( 1, nqa0 ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PZUNG2R()

lwork is local input and must be at least
lwork >= mpa0 + max( 1, nqa0 ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PZUNGL2()

lwork is local input and must be at least
lwork >= nqa0 + max( 1, mpa0 ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PZUNGLQ()

lwork is local input and must be at least
lwork >= mb_a * ( mpa0 + nqa0 + mb_a ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PZUNGQL()

lwork is local input and must be at least
lwork >= nb_a * ( nqa0 + mpa0 + nb_a ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PZUNGQR()

lwork is local input and must be at least
lwork >= nb_a * ( nqa0 + mpa0 + nb_a ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PZUNGR2()

lwork is local input and must be at least
lwork >= nqa0 + max( 1, mpa0 ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PZUNGRQ()

lwork is local input and must be at least
lwork >= mb_a * ( mpa0 + nqa0 + mb_a ), where
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PZUNM2L()

where q is a complex unitary distributed matrix defined as th

PZUNM2R()

where q is a complex unitary distributed matrix defined as th

PZUNMBR()

where lcmp = lcm / nprow, lcmq = lcm / npcol, wit

PZUNMHR()

where q is a complex unitary distributed matrix of order nq, wit
product of ihi-ilo elementary reflectors, as returned by pzgehrd:

PZUNML2()

where q is a complex unitary distributed matrix defined as th

PZUNMLQ()

where q is a complex unitary distributed matrix defined as th

PZUNMQL()

where q is a complex unitary distributed matrix defined as th

PZUNMQR()

where q is a complex unitary distributed matrix defined as th

PZUNMR2()

where q is a complex unitary distributed matrix defined as th

PZUNMR3()

where q is a complex unitary distributed matrix defined as th

PZUNMRQ()

where q is a complex unitary distributed matrix defined as th

PZUNMRZ()

where q is a complex unitary distributed matrix defined as th

PZUNMTR()

where q is a complex unitary distributed matrix of order nq, wit
product of nq-1 elementary reflectors, as returned by pzhetrd:

SDTTRF()

a = l * u
where l is a product of unit lower bidiagona
diagonal and first superdiagonal.

SPTTRSV()

l**t* x = b, or  l * x = b,
where l is the cholesky factor of a hermitian positiv
a = l*d*l**h (computed by spttrf).

SSTEQR2()

determine where the matrix splits and choose ql or qr iteratio
element is smaller.

STRMVT()

where x is an n element vector and  t is an n by

ZDTTRF()

a = l * u
where l is a product of unit lower bidiagona
diagonal and first superdiagonal.

ZPTTRSV()

u * x = b, or  u**h * x = b,
where l or u is the cholesky factor of a hermitian positiv
a = u**h*d*u or a = l*d*l**h (computed by zpttrf).

ZTRMVT()

where x is an n element vector and  t is an n by

whereas

PCHETTRD()

work required in updating the current column of a.  updating
the block column of a is reasonably load balanced whereas
processor column is involved).

PDSYTTRD()

work required in updating the current column of a.  updating
the block column of a is reasonably load balanced whereas
processor column is involved).

PSSYTTRD()

work required in updating the current column of a.  updating
the block column of a is reasonably load balanced whereas
processor column is involved).

PZHETTRD()

work required in updating the current column of a.  updating
the block column of a is reasonably load balanced whereas
processor column is involved).

whether

CDTTRSV()

uplo    (input) character*1
specifies whether to solve with l or u
trans   (input) character

CPTTRSV()

uplo    (input) character*1
specifies whether the superdiagonal or the subdiagona
factorization:

CTRMVT()

uplo   - character*1.
on entry, uplo specifies whether the matrix is an upper o

DDTTRSV()

uplo    (input) character*1
specifies whether to solve with l or u
trans   (input) character

DSTEQR2()

determine where the matrix splits and choose ql or qr iteration
for each block, according to whether top or bottom diagona

DTRMVT()

uplo   - character*1.
on entry, uplo specifies whether the matrix is an upper o

PCGECON()

norm    (global input) character
specifies whether the 1-norm condition number or th
= '1' or 'o':  1-norm

PCGESVX()

trans = 'c': (diag(r)*a*diag(c))**h *inv(diag(r))*x = diag(c)*b
whether or not the system will be equilibrated depends on th
overwritten by diag(r)*a*diag(c) and b by diag(r)*b (if trans='n')

PCHEEV()

jobz    (global input) character*1
specifies whether or not to compute the eigenvectors
= 'v':  compute eigenvalues and eigenvectors.

PCHEEVD()

uplo    (global input) character*1
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PCHEEVX()

jobz    (global input) character*1
specifies whether or not to compute the eigenvectors
= 'v':  compute eigenvalues and eigenvectors.

PCHENTRD()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PCHETD2()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PCHETRD()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PCHETTRD()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PCLACON()

on an intermediate return, kase will be 1 or 2, indicating
whether x should be overwritten by a * x  or a' * x

PCLAPIV()

pivroc  (global input) character*1
specifies whether ipiv is distributed over a process ro
= 'r' ipiv distributed over a process row

PCLAQSY()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PCLATRD()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u': upper triangular

PCLAUU2()

uplo    (global input) character*1
specifies whether the triangular factor stored in the matri
= 'u':  upper triangular,

PCLAUUM()

uplo    (global input) character*1
specifies whether the triangular factor stored in th
= 'u':  upper triangular

PCPOCON()

uplo    (global input) character
specifies whether the factor stored i
= 'u':  upper triangular

PCPORFS()

uplo    (global input) character*1
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PCPOSVX()

diag(sr) * a * diag(sc) * inv(diag(sc)) * x = diag(sr) * b
whether or not the system will be equilibrated depends on th
overwritten by diag(sr)*a*diag(sc) and b by diag(sr)*b.

PCTRCON()

norm    (global input) character
specifies whether the 1-norm condition number or th
= '1' or 'o':  1-norm;

PCTRTRI()

uplo    (global input) character
specifies whether the distributed matrix sub( a ) is uppe
= 'u':  upper triangular,

PDGECON()

norm    (global input) character
specifies whether the 1-norm condition number or th
= '1' or 'o':  1-norm

PDGESVX()

trans = 'c': (diag(r)*a*diag(c))**h *inv(diag(r))*x = diag(c)*b
whether or not the system will be equilibrated depends on th
overwritten by diag(r)*a*diag(c) and b by diag(r)*b (if trans='n')

PDLACON()

on an intermediate return, kase will be 1 or 2, indicating
whether x should be overwritten by a * x  or a' * x

PDLAEBZ()

ieflag  (input) integer
a flag which indicates whether n(w) should be speeded up b

PDLAPIV()

pivroc  (global input) character*1
specifies whether ipiv is distributed over a process ro
= 'r' ipiv distributed over a process row

PDLAQSY()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PDLATRD()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u': upper triangular

PDLAUU2()

uplo    (global input) character*1
specifies whether the triangular factor stored in the matri
= 'u':  upper triangular,

PDLAUUM()

uplo    (global input) character*1
specifies whether the triangular factor stored in th
= 'u':  upper triangular

PDPOCON()

uplo    (global input) character
specifies whether the factor stored i
= 'u':  upper triangular

PDPORFS()

uplo    (global input) character*1
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PDPOSVX()

diag(sr) * a * diag(sc) * inv(diag(sc)) * x = diag(sr) * b
whether or not the system will be equilibrated depends on th
overwritten by diag(sr)*a*diag(sc) and b by diag(sr)*b.

PDSYEV()

jobz    (global input) character*1
specifies whether or not to compute the eigenvectors
= 'v':  compute eigenvalues and eigenvectors.

PDSYEVD()

uplo    (global input) character*1
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PDSYEVX()

jobz    (global input) character*1
specifies whether or not to compute the eigenvectors
= 'v':  compute eigenvalues and eigenvectors.

PDSYNTRD()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PDSYTD2()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PDSYTRD()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PDSYTTRD()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PDTRCON()

norm    (global input) character
specifies whether the 1-norm condition number or th
= '1' or 'o':  1-norm;

PDTRTRI()

uplo    (global input) character
specifies whether the distributed matrix sub( a ) is uppe
= 'u':  upper triangular,

PSGECON()

norm    (global input) character
specifies whether the 1-norm condition number or th
= '1' or 'o':  1-norm

PSGESVX()

trans = 'c': (diag(r)*a*diag(c))**h *inv(diag(r))*x = diag(c)*b
whether or not the system will be equilibrated depends on th
overwritten by diag(r)*a*diag(c) and b by diag(r)*b (if trans='n')

PSLACON()

on an intermediate return, kase will be 1 or 2, indicating
whether x should be overwritten by a * x  or a' * x

PSLAEBZ()

ieflag  (input) integer
a flag which indicates whether n(w) should be speeded up b

PSLAPIV()

pivroc  (global input) character*1
specifies whether ipiv is distributed over a process ro
= 'r' ipiv distributed over a process row

PSLAQSY()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PSLATRD()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u': upper triangular

PSLAUU2()

uplo    (global input) character*1
specifies whether the triangular factor stored in the matri
= 'u':  upper triangular,

PSLAUUM()

uplo    (global input) character*1
specifies whether the triangular factor stored in th
= 'u':  upper triangular

PSPOCON()

uplo    (global input) character
specifies whether the factor stored i
= 'u':  upper triangular

PSPORFS()

uplo    (global input) character*1
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PSPOSVX()

diag(sr) * a * diag(sc) * inv(diag(sc)) * x = diag(sr) * b
whether or not the system will be equilibrated depends on th
overwritten by diag(sr)*a*diag(sc) and b by diag(sr)*b.

PSSYEV()

jobz    (global input) character*1
specifies whether or not to compute the eigenvectors
= 'v':  compute eigenvalues and eigenvectors.

PSSYEVD()

uplo    (global input) character*1
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PSSYEVX()

jobz    (global input) character*1
specifies whether or not to compute the eigenvectors
= 'v':  compute eigenvalues and eigenvectors.

PSSYNTRD()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PSSYTD2()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PSSYTRD()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PSSYTTRD()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PSTRCON()

norm    (global input) character
specifies whether the 1-norm condition number or th
= '1' or 'o':  1-norm;

PSTRTRI()

uplo    (global input) character
specifies whether the distributed matrix sub( a ) is uppe
= 'u':  upper triangular,

PZGECON()

norm    (global input) character
specifies whether the 1-norm condition number or th
= '1' or 'o':  1-norm

PZGESVX()

trans = 'c': (diag(r)*a*diag(c))**h *inv(diag(r))*x = diag(c)*b
whether or not the system will be equilibrated depends on th
overwritten by diag(r)*a*diag(c) and b by diag(r)*b (if trans='n')

PZHEEV()

jobz    (global input) character*1
specifies whether or not to compute the eigenvectors
= 'v':  compute eigenvalues and eigenvectors.

PZHEEVD()

uplo    (global input) character*1
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PZHEEVX()

jobz    (global input) character*1
specifies whether or not to compute the eigenvectors
= 'v':  compute eigenvalues and eigenvectors.

PZHENTRD()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PZHETD2()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PZHETRD()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PZHETTRD()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PZLACON()

on an intermediate return, kase will be 1 or 2, indicating
whether x should be overwritten by a * x  or a' * x

PZLAPIV()

pivroc  (global input) character*1
specifies whether ipiv is distributed over a process ro
= 'r' ipiv distributed over a process row

PZLAQSY()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PZLATRD()

uplo    (global input) character
specifies whether the upper or lower triangular part of th
= 'u': upper triangular

PZLAUU2()

uplo    (global input) character*1
specifies whether the triangular factor stored in the matri
= 'u':  upper triangular,

PZLAUUM()

uplo    (global input) character*1
specifies whether the triangular factor stored in th
= 'u':  upper triangular

PZPOCON()

uplo    (global input) character
specifies whether the factor stored i
= 'u':  upper triangular

PZPORFS()

uplo    (global input) character*1
specifies whether the upper or lower triangular part of th
= 'u':  upper triangular

PZPOSVX()

diag(sr) * a * diag(sc) * inv(diag(sc)) * x = diag(sr) * b
whether or not the system will be equilibrated depends on th
overwritten by diag(sr)*a*diag(sc) and b by diag(sr)*b.

PZTRCON()

norm    (global input) character
specifies whether the 1-norm condition number or th
= '1' or 'o':  1-norm;

PZTRTRI()

uplo    (global input) character
specifies whether the distributed matrix sub( a ) is uppe
= 'u':  upper triangular,

SDTTRSV()

uplo    (input) character*1
specifies whether to solve with l or u
trans   (input) character

SSTEQR2()

determine where the matrix splits and choose ql or qr iteration
for each block, according to whether top or bottom diagona

STRMVT()

uplo   - character*1.
on entry, uplo specifies whether the matrix is an upper o

ZDTTRSV()

uplo    (input) character*1
specifies whether to solve with l or u
trans   (input) character

ZPTTRSV()

uplo    (input) character*1
specifies whether the superdiagonal or the subdiagona
factorization:

ZTRMVT()

uplo   - character*1.
on entry, uplo specifies whether the matrix is an upper o

which

CDBTRF()

here a11, a21 and a31 denote the current block of jb columns
which is about to be factorized. the number of rows in th
of columns are jb, j2, j3. the superdiagonal elements of a13

CLAHQR2()

i1 and i2 are the indices of the first row and last column of h
to which transformations must be applied. if eigenvalues only ar

DDBTRF()

here a11, a21 and a31 denote the current block of jb columns
which is about to be factorized. the number of rows in th
of columns are jb, j2, j3. the superdiagonal elements of a13

PCDBSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PCDBTRF()

transfer last triangle d_i of local matrix to next processor
which needs it to calculate fillin due to factorization o
overlap the send with the factorization of a_i.

PCDBTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PCDTSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PCDTTRF()

transfer last triangle d_i of local matrix to next processor
which needs it to calculate fillin due to factorization o
overlap the send with the factorization of a_i.

PCDTTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PCGBSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PCGBTRF()

note that for mycol > 0 one has lower triangular blocks!
lm is the number of rows which is usually nb except fo
is nr+bwu where nr is the number of columns on the last processor

PCGBTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PCGEBD2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGEBRD()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGECON()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGEEQU()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGEHD2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGEHRD()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGELQ2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGELQF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGELS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGEQL2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGEQLF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGEQPF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGEQR2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGEQRF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGERFS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGERQ2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGERQF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGESV()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGESVD()

where sigma is an m-by-n matrix which is zero except for it
v is an n-by-n orthogonal matrix. the diagonal elements of sigma

PCGESVX()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGETF2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGETRF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGETRI()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGETRS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGGQRF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCGGRQF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCHEEV()

the columns of a.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCHEEVD()

ia      (global input) integer
a's global row index, which points to the beginning of th

PCHEEVX()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCHEGS2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCHEGST()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCHEGVX()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCHENGST()

pchengst performs the same function as pchegst, but is based on
rank 2k updates, which are faster and more scalable tha

PCHENTRD()

pchentrd is a prototype version of pchetrd which uses tailore
when the workspace provided by the user is adequate.

PCHETD2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCHETRD()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCHETTRD()

distribute the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which th
csrc_a (global) desca( csrc_ ) the process column over which the

PCLABRD()

or lower bidiagonal form by an unitary transformation q' * a * p, and
returns the matrices x and y which are needed to apply the transfor

PCLACGV()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLACON()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLACONSB()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLACP2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLACP3()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLACPY()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLAEVSWP()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLAHQR()

i1 and i2 are the indices of the first row and last column of h
to which transformations must be applied. if eigenvalues only ar

PCLAHRD()

performed by an unitary similarity transformation q' * a * q. the
routine returns the matrices v and t which determine q as a bloc

PCLAMR1D()

ia      (global input) integer
a's global row index, which points to the beginning o

PCLANGE()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLANHE()

can be obtained by adding along row i and column i of the the
triangular matrix, stopping/starting at the diagonal, which i
in the following code, the row sums created by --- rows below are

PCLANSY()

can be obtained by adding along row i and column i of the the
triangular matrix, stopping/starting at the diagonal, which i
in the following code, the row sums created by --- rows below are

PCLAPIV()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLAPV2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLAQGE()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLAQSY()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLARFB()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLARFG()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLARFT()

pclarft forms the triangular factor t of a complex block reflector h
of order n, which is defined as a product of k elementary reflectors
if direct = 'f', h = h(1) h(2) . . . h(k) and t is upper triangular;

PCLARZB()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLARZT()

pclarzt forms the triangular factor t of a complex block reflector
h of order > n, which is defined as a product of k elementar

PCLASCL()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLASE2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLASET()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLASMSUB()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLASSQ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLASWP()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLATRA()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLATRD()

tridiagonal form by an unitary similarity transformation
q' * sub( a ) * q, and returns the matrices v and w which ar

PCLATRZ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLAUU2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLAUUM()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCLAWIL()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCMAX1()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCPBSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PCPBTRF()

transfer last triangle d_i of local matrix to next processor
which needs it to calculate fillin due to factorization o
overlap the send with the factorization of a_i.

PCPBTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PCPOCON()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCPOEQU()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCPORFS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCPOSV()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCPOSVX()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCPOTF2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCPOTRF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCPOTRI()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCPOTRS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCPTSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PCPTTRF()

transfer last triangle d_i of local matrix to next processor
which needs it to calculate fillin due to factorization o
overlap the send with the factorization of a_i.

PCPTTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PCSRSCL()

te the columns of the array.
rsrc_a (global) desca[ rsrc_ ] the process row over which the firs
csrc_a (global) desca[ csrc_ ] the process column over which the

PCSTEIN()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCTRCON()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCTREVC()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCTRRFS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCTRTI2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCTRTRI()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCTRTRS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCTZRZF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCUNG2L()

pcung2l generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a

PCUNG2R()

pcung2r generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a
m

PCUNGL2()

pcungl2 generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined a

PCUNGLQ()

pcunglq generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined a

PCUNGQL()

pcungql generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a

PCUNGQR()

pcungqr generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a
m

PCUNGR2()

pcungr2 generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as th

PCUNGRQ()

pcungrq generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as th

PCUNM2L()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCUNM2R()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCUNMBR()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCUNMHR()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCUNML2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCUNMLQ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCUNMQL()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCUNMQR()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCUNMR2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCUNMR3()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCUNMRQ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCUNMRZ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PCUNMTR()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDDBSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PDDBTRF()

transfer last triangle d_i of local matrix to next processor
which needs it to calculate fillin due to factorization o
overlap the send with the factorization of a_i.

PDDBTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PDDTSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PDDTTRF()

transfer last triangle d_i of local matrix to next processor
which needs it to calculate fillin due to factorization o
overlap the send with the factorization of a_i.

PDDTTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PDGBSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PDGBTRF()

note that for mycol > 0 one has lower triangular blocks!
lm is the number of rows which is usually nb except fo
is nr+bwu where nr is the number of columns on the last processor

PDGBTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PDGEBD2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGEBRD()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGECON()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGEEQU()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGEHD2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGEHRD()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGELQ2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGELQF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGELS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGEQL2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGEQLF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGEQPF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGEQR2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGEQRF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGERFS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGERQ2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGERQF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGESV()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGESVD()

where sigma is an m-by-n matrix which is zero except for it
v is an n-by-n orthogonal matrix. the diagonal elements of sigma

PDGESVX()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGETF2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGETRF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGETRI()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGETRS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGGQRF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDGGRQF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLABAD()

ictxt   (global input) integer
the blacs context handle in which the computation take

PDLABRD()

or lower bidiagonal form by an orthogonal transformation q' * a * p,
and returns the matrices x and y which are needed to apply th

PDLACON()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLACONSB()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLACP2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLACP3()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLACPY()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLAEBZ()

pdlaebz contains the iteration loop which computes the eigenvalue
j = 1,...,minp. it uses and computes the function n(w), which is

PDLAED0()

iq      (global input) integer
q's global row index, which points to the beginning of th

PDLAED1()

id      (global input) integer
q's global row/col index, which points to the beginnin

PDLAED2()

sorted set.  then it tries to deflate the size of the problem.
there are two ways in which deflation can occur:  when two or mor
z vector.  for each such occurrence the order of the related secular

PDLAED3()

add/subtract, or on those binary machines without guard digits
which subtract like the cray x-mp, cray y-mp, cray c-90, or cray-2
without guard digits, but we know of none.

PDLAEDZ()

form z1 which consist of the last row of q

PDLAEVSWP()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLAHQR()

i1 and i2 are the indices of the first row and last column of h
to which transformations must be applied. if eigenvalues only ar

PDLAHRD()

nal similarity transformation q' * a * q. the routine returns the
matrices v and t which determine q as a block reflector i - v*t*v'

PDLAMCH()

ictxt   (global input) integer
the blacs context handle in which the computation take

PDLAMR1D()

ia      (global input) integer
a's global row index, which points to the beginning o

PDLANGE()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLANSY()

can be obtained by adding along row i and column i of the the
triangular matrix, stopping/starting at the diagonal, which i
in the following code, the row sums created by --- rows below are

PDLAPIV()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLAPV2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLAQGE()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLAQSY()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLARED1D()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLARED2D()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLARFB()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLARFG()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLARFT()

pdlarft forms the triangular factor t of a real block reflector h
of order n, which is defined as a product of k elementary reflectors
if direct = 'f', h = h(1) h(2) . . . h(k) and t is upper triangular;

PDLARZB()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLARZT()

pdlarzt forms the triangular factor t of a real block reflector
h of order > n, which is defined as a product of k elementar

PDLASCL()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLASE2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLASET()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLASMSUB()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLASSQ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLASWP()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLATRA()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLATRD()

form by an orthogonal similarity transformation q' * sub( a ) * q,
and returns the matrices v and w which are needed to apply th

PDLATRZ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLAUU2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLAUUM()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDLAWIL()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDORG2L()

pdorg2l generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a

PDORG2R()

pdorg2r generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a
m

PDORGL2()

pdorgl2 generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined a

PDORGLQ()

pdorglq generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined a

PDORGQL()

pdorgql generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a

PDORGQR()

pdorgqr generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a
m

PDORGR2()

pdorgr2 generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as th

PDORGRQ()

pdorgrq generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as th

PDORM2L()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDORM2R()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDORMBR()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDORMHR()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDORML2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDORMLQ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDORMQL()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDORMQR()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDORMR2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDORMR3()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDORMRQ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDORMRZ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDORMTR()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDPBSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PDPBTRF()

transfer last triangle d_i of local matrix to next processor
which needs it to calculate fillin due to factorization o
overlap the send with the factorization of a_i.

PDPBTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PDPOCON()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDPOEQU()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDPORFS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDPOSV()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDPOSVX()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDPOTF2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDPOTRF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDPOTRI()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDPOTRS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDPTSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PDPTTRF()

transfer last triangle d_i of local matrix to next processor
which needs it to calculate fillin due to factorization o
overlap the send with the factorization of a_i.

PDPTTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PDRSCL()

te the columns of the array.
rsrc_a (global) desca[ rsrc_ ] the process row over which the firs
csrc_a (global) desca[ csrc_ ] the process column over which the

PDSTEBZ()

the interval [vl, vu], or the eigenvalues indexed il through iu. a
static partitioning of work is done at the beginning of pdstebz which
eigenvalues.

PDSTEDC()

add/subtract, or on those binary machines without guard digits
which subtract like the cray x-mp, cray y-mp, cray c-90, or cray-2
without guard digits, but we know of none.  see dlaed3 for details.

PDSTEIN()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDSYEV()

the columns of a.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDSYEVD()

ia      (global input) integer
a's global row index, which points to the beginning of th

PDSYEVX()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDSYGS2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDSYGST()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDSYGVX()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDSYNGST()

pdsyngst performs the same function as pdhegst, but is based on
rank 2k updates, which are faster and more scalable tha

PDSYNTRD()

pdsyntrd is a prototype version of pdsytrd which uses tailore
when the workspace provided by the user is adequate.

PDSYTD2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDSYTRD()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDSYTTRD()

distribute the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which th
csrc_a (global) desca( csrc_ ) the process column over which the

PDTRCON()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDTRRFS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDTRTI2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDTRTRI()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDTRTRS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDTZRZF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PDZSUM1()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PJLAENV()

this version provides a set of parameters which should give good
computers.  users are encouraged to modify this subroutine to set

PSCSUM1()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSDBSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PSDBTRF()

transfer last triangle d_i of local matrix to next processor
which needs it to calculate fillin due to factorization o
overlap the send with the factorization of a_i.

PSDBTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PSDTSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PSDTTRF()

transfer last triangle d_i of local matrix to next processor
which needs it to calculate fillin due to factorization o
overlap the send with the factorization of a_i.

PSDTTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PSGBSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PSGBTRF()

note that for mycol > 0 one has lower triangular blocks!
lm is the number of rows which is usually nb except fo
is nr+bwu where nr is the number of columns on the last processor

PSGBTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PSGEBD2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGEBRD()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGECON()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGEEQU()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGEHD2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGEHRD()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGELQ2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGELQF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGELS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGEQL2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGEQLF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGEQPF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGEQR2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGEQRF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGERFS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGERQ2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGERQF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGESV()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGESVD()

where sigma is an m-by-n matrix which is zero except for it
v is an n-by-n orthogonal matrix. the diagonal elements of sigma

PSGESVX()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGETF2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGETRF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGETRI()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGETRS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGGQRF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSGGRQF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLABAD()

ictxt   (global input) integer
the blacs context handle in which the computation take

PSLABRD()

or lower bidiagonal form by an orthogonal transformation q' * a * p,
and returns the matrices x and y which are needed to apply th

PSLACON()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLACONSB()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLACP2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLACP3()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLACPY()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLAEBZ()

pslaebz contains the iteration loop which computes the eigenvalue
j = 1,...,minp. it uses and computes the function n(w), which is

PSLAED0()

iq      (global input) integer
q's global row index, which points to the beginning of th

PSLAED1()

id      (global input) integer
q's global row/col index, which points to the beginnin

PSLAED2()

sorted set.  then it tries to deflate the size of the problem.
there are two ways in which deflation can occur:  when two or mor
z vector.  for each such occurrence the order of the related secular

PSLAED3()

add/subtract, or on those binary machines without guard digits
which subtract like the cray x-mp, cray y-mp, cray c-90, or cray-2
without guard digits, but we know of none.

PSLAEDZ()

form z1 which consist of the last row of q

PSLAEVSWP()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLAHQR()

i1 and i2 are the indices of the first row and last column of h
to which transformations must be applied. if eigenvalues only ar

PSLAHRD()

nal similarity transformation q' * a * q. the routine returns the
matrices v and t which determine q as a block reflector i - v*t*v'

PSLAMCH()

ictxt   (global input) integer
the blacs context handle in which the computation take

PSLAMR1D()

ia      (global input) integer
a's global row index, which points to the beginning o

PSLANGE()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLANSY()

can be obtained by adding along row i and column i of the the
triangular matrix, stopping/starting at the diagonal, which i
in the following code, the row sums created by --- rows below are

PSLAPIV()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLAPV2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLAQGE()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLAQSY()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLARED1D()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLARED2D()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLARFB()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLARFG()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLARFT()

pslarft forms the triangular factor t of a real block reflector h
of order n, which is defined as a product of k elementary reflectors
if direct = 'f', h = h(1) h(2) . . . h(k) and t is upper triangular;

PSLARZB()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLARZT()

pslarzt forms the triangular factor t of a real block reflector
h of order > n, which is defined as a product of k elementar

PSLASCL()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLASE2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLASET()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLASMSUB()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLASSQ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLASWP()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLATRA()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLATRD()

form by an orthogonal similarity transformation q' * sub( a ) * q,
and returns the matrices v and w which are needed to apply th

PSLATRZ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLAUU2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLAUUM()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSLAWIL()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSORG2L()

psorg2l generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a

PSORG2R()

psorg2r generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a
m

PSORGL2()

psorgl2 generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined a

PSORGLQ()

psorglq generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined a

PSORGQL()

psorgql generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a

PSORGQR()

psorgqr generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a
m

PSORGR2()

psorgr2 generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as th

PSORGRQ()

psorgrq generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as th

PSORM2L()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSORM2R()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSORMBR()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSORMHR()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSORML2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSORMLQ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSORMQL()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSORMQR()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSORMR2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSORMR3()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSORMRQ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSORMRZ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSORMTR()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSPBSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PSPBTRF()

transfer last triangle d_i of local matrix to next processor
which needs it to calculate fillin due to factorization o
overlap the send with the factorization of a_i.

PSPBTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PSPOCON()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSPOEQU()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSPORFS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSPOSV()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSPOSVX()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSPOTF2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSPOTRF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSPOTRI()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSPOTRS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSPTSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PSPTTRF()

transfer last triangle d_i of local matrix to next processor
which needs it to calculate fillin due to factorization o
overlap the send with the factorization of a_i.

PSPTTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PSRSCL()

te the columns of the array.
rsrc_a (global) desca[ rsrc_ ] the process row over which the firs
csrc_a (global) desca[ csrc_ ] the process column over which the

PSSTEBZ()

the interval [vl, vu], or the eigenvalues indexed il through iu. a
static partitioning of work is done at the beginning of psstebz which
eigenvalues.

PSSTEDC()

add/subtract, or on those binary machines without guard digits
which subtract like the cray x-mp, cray y-mp, cray c-90, or cray-2
without guard digits, but we know of none.  see slaed3 for details.

PSSTEIN()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSSYEV()

the columns of a.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSSYEVD()

ia      (global input) integer
a's global row index, which points to the beginning of th

PSSYEVX()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSSYGS2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSSYGST()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSSYGVX()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSSYNGST()

pssyngst performs the same function as pshegst, but is based on
rank 2k updates, which are faster and more scalable tha

PSSYNTRD()

pssyntrd is a prototype version of pssytrd which uses tailore
when the workspace provided by the user is adequate.

PSSYTD2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSSYTRD()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSSYTTRD()

distribute the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which th
csrc_a (global) desca( csrc_ ) the process column over which the

PSTRCON()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSTRRFS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSTRTI2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSTRTRI()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSTRTRS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PSTZRZF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZDBSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PZDBTRF()

transfer last triangle d_i of local matrix to next processor
which needs it to calculate fillin due to factorization o
overlap the send with the factorization of a_i.

PZDBTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PZDRSCL()

te the columns of the array.
rsrc_a (global) desca[ rsrc_ ] the process row over which the firs
csrc_a (global) desca[ csrc_ ] the process column over which the

PZDTSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PZDTTRF()

transfer last triangle d_i of local matrix to next processor
which needs it to calculate fillin due to factorization o
overlap the send with the factorization of a_i.

PZDTTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PZGBSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PZGBTRF()

note that for mycol > 0 one has lower triangular blocks!
lm is the number of rows which is usually nb except fo
is nr+bwu where nr is the number of columns on the last processor

PZGBTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PZGEBD2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGEBRD()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGECON()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGEEQU()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGEHD2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGEHRD()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGELQ2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGELQF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGELS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGEQL2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGEQLF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGEQPF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGEQR2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGEQRF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGERFS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGERQ2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGERQF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGESV()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGESVD()

where sigma is an m-by-n matrix which is zero except for it
v is an n-by-n orthogonal matrix. the diagonal elements of sigma

PZGESVX()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGETF2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGETRF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGETRI()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGETRS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGGQRF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZGGRQF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZHEEV()

the columns of a.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZHEEVD()

ia      (global input) integer
a's global row index, which points to the beginning of th

PZHEEVX()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZHEGS2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZHEGST()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZHEGVX()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZHENGST()

pzhengst performs the same function as pzhegst, but is based on
rank 2k updates, which are faster and more scalable tha

PZHENTRD()

pzhentrd is a prototype version of pzhetrd which uses tailore
when the workspace provided by the user is adequate.

PZHETD2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZHETRD()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZHETTRD()

distribute the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which th
csrc_a (global) desca( csrc_ ) the process column over which the

PZLABRD()

or lower bidiagonal form by an unitary transformation q' * a * p, and
returns the matrices x and y which are needed to apply the transfor

PZLACGV()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLACON()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLACONSB()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLACP2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLACP3()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLACPY()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLAEVSWP()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLAHQR()

i1 and i2 are the indices of the first row and last column of h
to which transformations must be applied. if eigenvalues only ar

PZLAHRD()

performed by an unitary similarity transformation q' * a * q. the
routine returns the matrices v and t which determine q as a bloc

PZLAMR1D()

ia      (global input) integer
a's global row index, which points to the beginning o

PZLANGE()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLANHE()

can be obtained by adding along row i and column i of the the
triangular matrix, stopping/starting at the diagonal, which i
in the following code, the row sums created by --- rows below are

PZLANSY()

can be obtained by adding along row i and column i of the the
triangular matrix, stopping/starting at the diagonal, which i
in the following code, the row sums created by --- rows below are

PZLAPIV()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLAPV2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLAQGE()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLAQSY()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLARFB()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLARFG()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLARFT()

pzlarft forms the triangular factor t of a complex block reflector h
of order n, which is defined as a product of k elementary reflectors
if direct = 'f', h = h(1) h(2) . . . h(k) and t is upper triangular;

PZLARZB()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLARZT()

pzlarzt forms the triangular factor t of a complex block reflector
h of order > n, which is defined as a product of k elementar

PZLASCL()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLASE2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLASET()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLASMSUB()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLASSQ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLASWP()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLATRA()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLATRD()

tridiagonal form by an unitary similarity transformation
q' * sub( a ) * q, and returns the matrices v and w which ar

PZLATRZ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLAUU2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLAUUM()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZLAWIL()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZMAX1()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZPBSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PZPBTRF()

transfer last triangle d_i of local matrix to next processor
which needs it to calculate fillin due to factorization o
overlap the send with the factorization of a_i.

PZPBTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PZPOCON()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZPOEQU()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZPORFS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZPOSV()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZPOSVX()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZPOTF2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZPOTRF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZPOTRI()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZPOTRS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZPTSV()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PZPTTRF()

transfer last triangle d_i of local matrix to next processor
which needs it to calculate fillin due to factorization o
overlap the send with the factorization of a_i.

PZPTTRS()

the index in the global array a that points to the start of
the matrix to be operated on (which may be either all of

PZSTEIN()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZTRCON()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZTREVC()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZTRRFS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZTRTI2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZTRTRI()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZTRTRS()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZTZRZF()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZUNG2L()

pzung2l generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a

PZUNG2R()

pzung2r generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a
m

PZUNGL2()

pzungl2 generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined a

PZUNGLQ()

pzunglq generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined a

PZUNGQL()

pzungql generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a

PZUNGQR()

pzungqr generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a
m

PZUNGR2()

pzungr2 generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as th

PZUNGRQ()

pzungrq generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as th

PZUNM2L()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZUNM2R()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZUNMBR()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZUNMHR()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZUNML2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZUNMLQ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZUNMQL()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZUNMQR()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZUNMR2()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZUNMR3()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZUNMRQ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZUNMRZ()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

PZUNMTR()

the columns of the array.
rsrc_a (global) desca( rsrc_ ) the process row over which the firs
csrc_a (global) desca( csrc_ ) the process column over which the

SDBTRF()

here a11, a21 and a31 denote the current block of jb columns
which is about to be factorized. the number of rows in th
of columns are jb, j2, j3. the superdiagonal elements of a13

ZDBTRF()

here a11, a21 and a31 denote the current block of jb columns
which is about to be factorized. the number of rows in th
of columns are jb, j2, j3. the superdiagonal elements of a13

ZLAHQR2()

i1 and i2 are the indices of the first row and last column of h
to which transformations must be applied. if eigenvalues only ar

while

PCDBSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PCDBTRF()

calculate new ja one while dropping off unused processors

PCDBTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PCDBTRSV()

calculate new ja one while dropping off unused processors

PCDTSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PCDTTRF()

calculate new ja one while dropping off unused processors

PCDTTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PCDTTRSV()

calculate new ja one while dropping off unused processors

PCGBSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PCGBTRF()

calculate new ja one while dropping off unused processors

PCGBTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PCLAHQR()

however, because there are many bulges, k1(ki) & k2(ki) might
go past that range while later bulges (ki+1,ki+2,etc..) ar
communication sometimes k1(ki)=hbl-2 & k2(ki)=hbl-1 so both

PCLANHE()

icurcol : process column containing diagonal block
irsc0   : pointer to part of work used to store the rowsums while
irsr0   : pointer to part of work used to store the rowsums after

PCLANSY()

icurcol : process column containing diagonal block
irsc0   : pointer to part of work used to store the rowsums while
irsr0   : pointer to part of work used to store the rowsums after

PCPBSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PCPBTRF()

calculate new ja one while dropping off unused processors

PCPBTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PCPBTRSV()

calculate new ja one while dropping off unused processors

PCPTSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PCPTTRF()

calculate new ja one while dropping off unused processors

PCPTTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PCPTTRSV()

calculate new ja one while dropping off unused processors

PDDBSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PDDBTRF()

calculate new ja one while dropping off unused processors

PDDBTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PDDBTRSV()

calculate new ja one while dropping off unused processors

PDDTSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PDDTTRF()

calculate new ja one while dropping off unused processors

PDDTTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PDDTTRSV()

calculate new ja one while dropping off unused processors

PDGBSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PDGBTRF()

calculate new ja one while dropping off unused processors

PDGBTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PDLAEBZ()

performance. the diagonal entries of t are in the entries
d(1),d(3),...,d(2*n-1), while the squares of the off-diagona
matrix must be scaled so that its largest entry is no greater

PDLAED0()

> 0:  the algorithm failed to compute the info/(n+1) th
eigenvalue while working on the submatrix lying i

PDLAHQR()

however, because there are many bulges, k1(ki) & k2(ki) might
go past that range while later bulges (ki+1,ki+2,etc..) ar

PDLANSY()

icurcol : process column containing diagonal block
irsc0   : pointer to part of work used to store the rowsums while
irsr0   : pointer to part of work used to store the rowsums after

PDLAPDCT()

performance. the diagonal entries of t are in the entries
d(1),d(3),...,d(2*n-1), while the squares of the off-diagona
matrix must be scaled so that its largest entry is no greater

PDPBSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PDPBTRF()

calculate new ja one while dropping off unused processors

PDPBTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PDPBTRSV()

calculate new ja one while dropping off unused processors

PDPTSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PDPTTRF()

calculate new ja one while dropping off unused processors

PDPTTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PDPTTRSV()

calculate new ja one while dropping off unused processors

PDSTEDC()

> 0:  the algorithm failed to compute the info/(n+1) th
eigenvalue while working on the submatrix lying i

PDSYEVD()

> 0:  the algorithm failed to compute the info/(n+1) th
eigenvalue while working on the submatrix lying i

PSDBSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PSDBTRF()

calculate new ja one while dropping off unused processors

PSDBTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PSDBTRSV()

calculate new ja one while dropping off unused processors

PSDTSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PSDTTRF()

calculate new ja one while dropping off unused processors

PSDTTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PSDTTRSV()

calculate new ja one while dropping off unused processors

PSGBSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PSGBTRF()

calculate new ja one while dropping off unused processors

PSGBTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PSLAEBZ()

performance. the diagonal entries of t are in the entries
d(1),d(3),...,d(2*n-1), while the squares of the off-diagona
matrix must be scaled so that its largest entry is no greater

PSLAED0()

> 0:  the algorithm failed to compute the info/(n+1) th
eigenvalue while working on the submatrix lying i

PSLAHQR()

however, because there are many bulges, k1(ki) & k2(ki) might
go past that range while later bulges (ki+1,ki+2,etc..) ar

PSLANSY()

icurcol : process column containing diagonal block
irsc0   : pointer to part of work used to store the rowsums while
irsr0   : pointer to part of work used to store the rowsums after

PSLAPDCT()

performance. the diagonal entries of t are in the entries
d(1),d(3),...,d(2*n-1), while the squares of the off-diagona
matrix must be scaled so that its largest entry is no greater

PSPBSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PSPBTRF()

calculate new ja one while dropping off unused processors

PSPBTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PSPBTRSV()

calculate new ja one while dropping off unused processors

PSPTSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PSPTTRF()

calculate new ja one while dropping off unused processors

PSPTTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PSPTTRSV()

calculate new ja one while dropping off unused processors

PSSTEDC()

> 0:  the algorithm failed to compute the info/(n+1) th
eigenvalue while working on the submatrix lying i

PSSYEVD()

> 0:  the algorithm failed to compute the info/(n+1) th
eigenvalue while working on the submatrix lying i

PZDBSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PZDBTRF()

calculate new ja one while dropping off unused processors

PZDBTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PZDBTRSV()

calculate new ja one while dropping off unused processors

PZDTSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PZDTTRF()

calculate new ja one while dropping off unused processors

PZDTTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PZDTTRSV()

calculate new ja one while dropping off unused processors

PZGBSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PZGBTRF()

calculate new ja one while dropping off unused processors

PZGBTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PZLAHQR()

however, because there are many bulges, k1(ki) & k2(ki) might
go past that range while later bulges (ki+1,ki+2,etc..) ar
communication sometimes k1(ki)=hbl-2 & k2(ki)=hbl-1 so both

PZLANHE()

icurcol : process column containing diagonal block
irsc0   : pointer to part of work used to store the rowsums while
irsr0   : pointer to part of work used to store the rowsums after

PZLANSY()

icurcol : process column containing diagonal block
irsc0   : pointer to part of work used to store the rowsums while
irsr0   : pointer to part of work used to store the rowsums after

PZPBSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PZPBTRF()

calculate new ja one while dropping off unused processors

PZPBTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PZPBTRSV()

calculate new ja one while dropping off unused processors

PZPTSV()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PZPTTRF()

calculate new ja one while dropping off unused processors

PZPTTRS()

the following are restrictions on the input parameters. some of these
are temporary and will be removed in future releases, while other

PZPTTRSV()

calculate new ja one while dropping off unused processors

who

PCLAHQR()

get first transform on node who owns m+2,m+

PDLAHQR()

get first transform on node who owns m+2,m+

PSLAHQR()

get first transform on node who owns m+2,m+

PZLAHQR()

get first transform on node who owns m+2,m+

whole

PCGBTRF()

copy diagonal block to align whole syste

PDGBTRF()

copy diagonal block to align whole syste

PSGBTRF()

copy diagonal block to align whole syste

PZGBTRF()

copy diagonal block to align whole syste

whose

PCGEEQU()

to an array of dimension ( lld_a, locc(ja+n-1) ), the
local pieces of the m-by-n distributed matrix whose

PCHEEVX()

dimension (nprow*npcol)
this array contains the gap between eigenvalues whose
values in this array correspond to the clusters indicated

PCHEGVX()

dimension (nprow*npcol)
this array contains the gap between eigenvalues whose
values in this array correspond to the clusters indicated

PCLACONSB()

(desca(lld_),*)
on entry, the hessenberg matrix whose tridiagonal part i
unchanged on exit.

PCLARFB()

the order of the matrix t (= the number of elementary
reflectors whose product defines the block reflector)
v       (local input) complex pointer into the local memory

PCLARZB()

the order of the matrix t (= the number of elementary
reflectors whose product defines the block reflector)
l       (global input) integer

PCLASMSUB()

a       (global input) complex array, dimension (desca(lld_),*)
on entry, the hessenberg matrix whose tridiagonal part i
unchanged on exit.

PCLATRZ()

the  factorization is obtained by householder's method.  the kth
transformation matrix, z( k ), whose conjugate transpose is used t
the form

PCMAX1()

the global index of the element of the distributed vector
sub( x ) whose real part has maximum absolute value
x       (local input) complex array containing the local

PCPOEQU()

n-by-n hermitian positive definite distributed matrix
sub( a ) whose scaling factors are to be computed.  only th

PCSTEIN()

gap     (global output) real array, dimension (p)
this output array contains the gap between eigenvalues whose
values in this array correspond to the info/(m+1) clusters

PCTZRZF()

the  factorization is obtained by householder's method.  the kth
transformation matrix, z( k ), whose conjugate transpose is used t
the form

PCUNG2L()

k       (global input) integer
the number of elementary reflectors whose product defines th

PCUNG2R()

k       (global input) integer
the number of elementary reflectors whose product defines th

PCUNGL2()

k       (global input) integer
the number of elementary reflectors whose product defines th

PCUNGLQ()

k       (global input) integer
the number of elementary reflectors whose product defines th

PCUNGQL()

k       (global input) integer
the number of elementary reflectors whose product defines th

PCUNGQR()

k       (global input) integer
the number of elementary reflectors whose product defines th

PCUNGR2()

k       (global input) integer
the number of elementary reflectors whose product defines th

PCUNGRQ()

k       (global input) integer
the number of elementary reflectors whose product defines th

PCUNM2L()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PCUNM2R()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PCUNMBR()

if side = 'l', and nq = n otherwise. the vectors which
define the elementary reflectors h(i) and g(i), whose
pcgebrd.

PCUNML2()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PCUNMLQ()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PCUNMQL()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PCUNMQR()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PCUNMR2()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PCUNMR3()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PCUNMRQ()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PCUNMRZ()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PDGEEQU()

to an array of dimension ( lld_a, locc(ja+n-1) ), the
local pieces of the m-by-n distributed matrix whose

PDLACONSB()

(desca(lld_),*)
on entry, the hessenberg matrix whose tridiagonal part i
unchanged on exit.

PDLARED1D()

byall(i) = bycol( numroc(i,desc( nb_ ),myrow,0,nprow ) on the procs
whose myrow == mod((i-1)/desc( nb_ ),nprow
work    (local workspace) double precision dimension (lwork)

PDLARED2D()

byall(i) = byrow( numroc(i,desc( mb_ ),mycol,0,npcol ) on the procs
whose mycol == mod((i-1)/desc( mb_ ),npcol
work    (local workspace) double precision dimension (lwork)

PDLARFB()

the order of the matrix t (= the number of elementary
reflectors whose product defines the block reflector)
v       (local input) double precision pointer into the local memory

PDLARZB()

the order of the matrix t (= the number of elementary
reflectors whose product defines the block reflector)
l       (global input) integer

PDLASMSUB()

(desca(lld_),*)
on entry, the hessenberg matrix whose tridiagonal part i
unchanged on exit.

PDORG2L()

k       (global input) integer
the number of elementary reflectors whose product defines th

PDORG2R()

k       (global input) integer
the number of elementary reflectors whose product defines th

PDORGL2()

k       (global input) integer
the number of elementary reflectors whose product defines th

PDORGLQ()

k       (global input) integer
the number of elementary reflectors whose product defines th

PDORGQL()

k       (global input) integer
the number of elementary reflectors whose product defines th

PDORGQR()

k       (global input) integer
the number of elementary reflectors whose product defines th

PDORGR2()

k       (global input) integer
the number of elementary reflectors whose product defines th

PDORGRQ()

k       (global input) integer
the number of elementary reflectors whose product defines th

PDORM2L()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PDORM2R()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PDORMBR()

if side = 'l', and nq = n otherwise. the vectors which
define the elementary reflectors h(i) and g(i), whose
pdgebrd.

PDORML2()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PDORMLQ()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PDORMQL()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PDORMQR()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PDORMR2()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PDORMR3()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PDORMRQ()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PDORMRZ()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PDPOEQU()

n-by-n symmetric positive definite distributed matrix
sub( a ) whose scaling factors are to be computed.  only th

PDSTEBZ()

(or cluster) is considered to be located if it has been
determined to lie in an interval whose width is abstol o
will be used, where |t| means the 1-norm of t.

PDSTEIN()

gap     (global output) double precision array, dimension (p)
this output array contains the gap between eigenvalues whose
values in this array correspond to the info/(m+1) clusters

PDSYEVX()

dimension (nprow*npcol)
this array contains the gap between eigenvalues whose
values in this array correspond to the clusters indicated

PDSYGVX()

dimension (nprow*npcol)
this array contains the gap between eigenvalues whose
values in this array correspond to the clusters indicated

PSGEEQU()

to an array of dimension ( lld_a, locc(ja+n-1) ), the
local pieces of the m-by-n distributed matrix whose

PSLACONSB()

(desca(lld_),*)
on entry, the hessenberg matrix whose tridiagonal part i
unchanged on exit.

PSLARED1D()

byall(i) = bycol( numroc(i,desc( nb_ ),myrow,0,nprow ) on the procs
whose myrow == mod((i-1)/desc( nb_ ),nprow
work    (local workspace) real dimension (lwork)

PSLARED2D()

byall(i) = byrow( numroc(i,desc( mb_ ),mycol,0,npcol ) on the procs
whose mycol == mod((i-1)/desc( mb_ ),npcol
work    (local workspace) real dimension (lwork)

PSLARFB()

the order of the matrix t (= the number of elementary
reflectors whose product defines the block reflector)
v       (local input) real pointer into the local memory

PSLARZB()

the order of the matrix t (= the number of elementary
reflectors whose product defines the block reflector)
l       (global input) integer

PSLASMSUB()

(desca(lld_),*)
on entry, the hessenberg matrix whose tridiagonal part i
unchanged on exit.

PSORG2L()

k       (global input) integer
the number of elementary reflectors whose product defines th

PSORG2R()

k       (global input) integer
the number of elementary reflectors whose product defines th

PSORGL2()

k       (global input) integer
the number of elementary reflectors whose product defines th

PSORGLQ()

k       (global input) integer
the number of elementary reflectors whose product defines th

PSORGQL()

k       (global input) integer
the number of elementary reflectors whose product defines th

PSORGQR()

k       (global input) integer
the number of elementary reflectors whose product defines th

PSORGR2()

k       (global input) integer
the number of elementary reflectors whose product defines th

PSORGRQ()

k       (global input) integer
the number of elementary reflectors whose product defines th

PSORM2L()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PSORM2R()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PSORMBR()

if side = 'l', and nq = n otherwise. the vectors which
define the elementary reflectors h(i) and g(i), whose
psgebrd.

PSORML2()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PSORMLQ()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PSORMQL()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PSORMQR()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PSORMR2()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PSORMR3()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PSORMRQ()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PSORMRZ()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PSPOEQU()

n-by-n symmetric positive definite distributed matrix
sub( a ) whose scaling factors are to be computed.  only th

PSSTEBZ()

(or cluster) is considered to be located if it has been
determined to lie in an interval whose width is abstol o
will be used, where |t| means the 1-norm of t.

PSSTEIN()

gap     (global output) real array, dimension (p)
this output array contains the gap between eigenvalues whose
values in this array correspond to the info/(m+1) clusters

PSSYEVX()

dimension (nprow*npcol)
this array contains the gap between eigenvalues whose
values in this array correspond to the clusters indicated

PSSYGVX()

dimension (nprow*npcol)
this array contains the gap between eigenvalues whose
values in this array correspond to the clusters indicated

PZGEEQU()

to an array of dimension ( lld_a, locc(ja+n-1) ), the
local pieces of the m-by-n distributed matrix whose

PZHEEVX()

dimension (nprow*npcol)
this array contains the gap between eigenvalues whose
values in this array correspond to the clusters indicated

PZHEGVX()

dimension (nprow*npcol)
this array contains the gap between eigenvalues whose
values in this array correspond to the clusters indicated

PZLACONSB()

(desca(lld_),*)
on entry, the hessenberg matrix whose tridiagonal part i
unchanged on exit.

PZLARFB()

the order of the matrix t (= the number of elementary
reflectors whose product defines the block reflector)
v       (local input) complex*16 pointer into the local memory

PZLARZB()

the order of the matrix t (= the number of elementary
reflectors whose product defines the block reflector)
l       (global input) integer

PZLASMSUB()

a       (global input) complex*16 array, dimension (desca(lld_),*)
on entry, the hessenberg matrix whose tridiagonal part i
unchanged on exit.

PZLATRZ()

the  factorization is obtained by householder's method.  the kth
transformation matrix, z( k ), whose conjugate transpose is used t
the form

PZMAX1()

the global index of the element of the distributed vector
sub( x ) whose real part has maximum absolute value
x       (local input) complex*16 array containing the local

PZPOEQU()

n-by-n hermitian positive definite distributed matrix
sub( a ) whose scaling factors are to be computed.  only th

PZSTEIN()

gap     (global output) double precision array, dimension (p)
this output array contains the gap between eigenvalues whose
values in this array correspond to the info/(m+1) clusters

PZTZRZF()

the  factorization is obtained by householder's method.  the kth
transformation matrix, z( k ), whose conjugate transpose is used t
the form

PZUNG2L()

k       (global input) integer
the number of elementary reflectors whose product defines th

PZUNG2R()

k       (global input) integer
the number of elementary reflectors whose product defines th

PZUNGL2()

k       (global input) integer
the number of elementary reflectors whose product defines th

PZUNGLQ()

k       (global input) integer
the number of elementary reflectors whose product defines th

PZUNGQL()

k       (global input) integer
the number of elementary reflectors whose product defines th

PZUNGQR()

k       (global input) integer
the number of elementary reflectors whose product defines th

PZUNGR2()

k       (global input) integer
the number of elementary reflectors whose product defines th

PZUNGRQ()

k       (global input) integer
the number of elementary reflectors whose product defines th

PZUNM2L()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PZUNM2R()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PZUNMBR()

if side = 'l', and nq = n otherwise. the vectors which
define the elementary reflectors h(i) and g(i), whose
pzgebrd.

PZUNML2()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PZUNMLQ()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PZUNMQL()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PZUNMQR()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PZUNMR2()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PZUNMR3()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PZUNMRQ()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

PZUNMRZ()

k       (global input) integer
the number of elementary reflectors whose product defines th
n >= k >= 0.

widen

PDSTEBZ()

fudge   double precision, default = 2.0
a "fudge factor" to widen the gershgorin intervals.  ideally
arithmetic, this needs to be larger.  the default for

PSSTEBZ()

fudge   real, default = 2.0
a "fudge factor" to widen the gershgorin intervals.  ideally
arithmetic, this needs to be larger.  the default for

width

PCHEEVX()

when it is determined to lie in an interval [a,b]
of width less than or equal t
abstol + eps *   max( |a|,|b| ) ,

PCHEGVX()

when it is determined to lie in an interval [a,b]
of width less than or equal t
abstol + eps *   max( |a|,|b| ) ,

PDLAEBZ()

abstol  (input) double precision
the minimum (absolute) width of an interval. when an interva
magnitude) endpoint, then it is considered to be sufficiently

PDLAECV()

abstol  (input) double precision
the minimum (absolute) width of an interval. when an interva
magnitude) endpoint, then it is considered to be sufficiently

PDSTEBZ()

(or cluster) is considered to be located if it has been
determined to lie in an interval whose width is abstol o
will be used, where |t| means the 1-norm of t.

PDSYEVX()

when it is determined to lie in an interval [a,b]
of width less than or equal t
abstol + eps *   max( |a|,|b| ) ,

PDSYGVX()

when it is determined to lie in an interval [a,b]
of width less than or equal t
abstol + eps *   max( |a|,|b| ) ,

PSLAEBZ()

abstol  (input) real
the minimum (absolute) width of an interval. when an interva
magnitude) endpoint, then it is considered to be sufficiently

PSLAECV()

abstol  (input) real
the minimum (absolute) width of an interval. when an interva
magnitude) endpoint, then it is considered to be sufficiently

PSSTEBZ()

(or cluster) is considered to be located if it has been
determined to lie in an interval whose width is abstol o
will be used, where |t| means the 1-norm of t.

PSSYEVX()

when it is determined to lie in an interval [a,b]
of width less than or equal t
abstol + eps *   max( |a|,|b| ) ,

PSSYGVX()

when it is determined to lie in an interval [a,b]
of width less than or equal t
abstol + eps *   max( |a|,|b| ) ,

PZHEEVX()

when it is determined to lie in an interval [a,b]
of width less than or equal t
abstol + eps *   max( |a|,|b| ) ,

PZHEGVX()

when it is determined to lie in an interval [a,b]
of width less than or equal t
abstol + eps *   max( |a|,|b| ) ,

Wilkinson

CLAHQR2()

prepare to use Wilkinson's shift

PCLAHQR()

copy submatrix of size 2*jblk and prepare to do generalized
Wilkinson shift or an exceptional shif

PDLAHQR()

copy submatrix of size 2*jblk and prepare to do generalized
Wilkinson shift or an exceptional shif

PSLAHQR()

copy submatrix of size 2*jblk and prepare to do generalized
Wilkinson shift or an exceptional shif

PZLAHQR()

copy submatrix of size 2*jblk and prepare to do generalized
Wilkinson shift or an exceptional shif

ZLAHQR2()

prepare to use Wilkinson's shift

will

CDBTF2()

has been completed, but the factor u is exactly
singular, and division by zero will occur if it is use

CDTTRF()

has been completed, but the factor u is exactly
singular, and division by zero will occur if it is use

DDBTF2()

has been completed, but the factor u is exactly
singular, and division by zero will occur if it is use

DDTTRF()

has been completed, but the factor u is exactly
singular, and division by zero will occur if it is use

PCDBSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
nb*(bwl+bwu)+6*max(bwl,bwu)*max(bwl,bwu)

PCDBTRF()

move block into place that it will be expected to be fo

PCDBTRS()

nb*(bwl+bwu)+6*max(bwl,bwu)*max(bwl,bwu)
if laf is not large enough, an error code will be returne

PCDTSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
(12*npcol+3*nb)

PCDTTRF()

move block into place that it will be expected to be fo

PCDTTRS()

2*(nb+2)
if laf is not large enough, an error code will be returne

PCGBSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
(nb+bwu)*(bwl+bwu)+6*(bwl+bwu)*(bwl+2*bwu)

PCGBTRF()

in this case the loop over the levels will not b

PCGBTRS()

(nb+bwu)*(bwl+bwu)+6*(bwl+bwu)*(bwl+2*bwu)
if laf is not large enough, an error code will be returne

PCGESVX()

trans = 'c': (diag(r)*a*diag(c))**h *inv(diag(r))*x = diag(c)*b
whether or not the system will be equilibrated depends on th
overwritten by diag(r)*a*diag(c) and b by diag(r)*b (if trans='n')

PCGETF2()

the factorization has been completed, but the factor u
is exactly singular, and division by zero will occur i

PCGETRF()

the factorization has been completed, but the factor u
is exactly singular, and division by zero will occur i

PCHEEVX()

range   (global input) character*1
= 'a': all eigenvalues will be found
= 'i': the il-th through iu-th eigenvalues will be found.

PCHEGVX()

range   (global input) character*1
= 'a': all eigenvalues will be found
= 'i': the il-th through iu-th eigenvalues will be found.

PCHETTRD()

pchettrd is not intended to be called directly.  all users are
encourage to call pchetrd which will then call pchettrd i
the process grid must be square ( i.e. nprow = npcol ) and

PCLACON()

on the initial call to pclacon, kase should be 0.
on an intermediate return, kase will be 1 or 2, indicatin
on the final return from pclacon, kase will again be 0.

PCLACONSB()

on exit, this yields the starting location of the qr double
shift.  this will satisfy: l <= m  <= i-2
h44

PCLAHQR()

nbulge is the number of bulges that will be attempte

PCLAPIV()

or a column. the pivot vector should be aligned with the distributed
matrix a. this routine will transpose the pivot vector if necessary
sub( a ), pass rowcol='c' and pivroc='c'.

PCLAPV2()

specifies if the rows or columns are to be permuted:
= 'r' rows will be permuted

PCLASMSUB()

on exit, this yields the bottom portion of the unreduced
submatrix.  this will satisfy: l <= m  <= i-1
smlnum  (global input) real

PCLASSQ()

the value of sumsq is assumed to be at least unity and the value of
ssq will then satisf
1.0 .le. ssq .le. ( sumsq + 2*n ).

PCLASWP()

already been broadcast along the process row or column.
also note that this routine will only work for k1-k2 being in th
pclapiv.

PCMAX1()

when the result of a vector-oriented pblas call is a scalar, it will
being operated on.  let x be a generic term for the input vector(s).

PCPBSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
(nb+2*bw)*bw

PCPBTRF()

move block into place that it will be expected to be fo

PCPBTRS()

(nb+2*bw)*bw
if laf is not large enough, an error code will be returne

PCPOSVX()

diag(sr) * a * diag(sc) * inv(diag(sc)) * x = diag(sr) * b
whether or not the system will be equilibrated depends on th
overwritten by diag(sr)*a*diag(sc) and b by diag(sr)*b.

PCPTSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
(12*npcol + 3*nb)

PCPTTRF()

move block into place that it will be expected to be fo

PCPTTRS()

(nb+2)
if laf is not large enough, an error code will be returne

PCSRSCL()

the scalar a which is used to divide each component of
sub( x ).  sa must be >= 0, or the subroutine will divide b

PCSTEIN()

orthogonalized can be stored in one process.
no orthogonalization will be done if orfac equals zero
orfac should be identical on all processes.

PDDBSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
nb*(bwl+bwu)+6*max(bwl,bwu)*max(bwl,bwu)

PDDBTRS()

nb*(bwl+bwu)+6*max(bwl,bwu)*max(bwl,bwu)
if laf is not large enough, an error code will be returne

PDDTSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
(12*npcol+3*nb)

PDDTTRF()

move block into place that it will be expected to be fo

PDDTTRS()

2*(nb+2)
if laf is not large enough, an error code will be returne

PDGBSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
(nb+bwu)*(bwl+bwu)+6*(bwl+bwu)*(bwl+2*bwu)

PDGBTRF()

in this case the loop over the levels will not b

PDGBTRS()

(nb+bwu)*(bwl+bwu)+6*(bwl+bwu)*(bwl+2*bwu)
if laf is not large enough, an error code will be returne

PDGESVX()

trans = 'c': (diag(r)*a*diag(c))**h *inv(diag(r))*x = diag(c)*b
whether or not the system will be equilibrated depends on th
overwritten by diag(r)*a*diag(c) and b by diag(r)*b (if trans='n')

PDGETF2()

the factorization has been completed, but the factor u
is exactly singular, and division by zero will occur i

PDGETRF()

the factorization has been completed, but the factor u
is exactly singular, and division by zero will occur i

PDLACON()

on the initial call to pdlacon, kase should be 0.
on an intermediate return, kase will be 1 or 2, indicatin
on the final return from pdlacon, kase will again be 0.

PDLACONSB()

on exit, this yields the starting location of the qr double
shift.  this will satisfy: l <= m  <= i-2
h44

PDLAEBZ()

the maximum number of intervals that may be generated. if
more than mmax intervals are generated, then pdlaebz will

PDLAECV()

oendpoint f the j-th interval, and intvl(2*j) is the right
endpoint of the j-th interval. the input intervals will
on input, intvl contains the kl-kf input intervals.

PDLAED2()

dlamda (global output) double precision array, dimension (n)
a copy of the first k eigenvalues which will be used b

PDLAED3()

this code makes very mild assumptions about floating point
arithmetic. it will work on machines with a guard digit i
which subtract like the cray x-mp, cray y-mp, cray c-90, or cray-2.

PDLAHQR()

nbulge is the number of bulges that will be attempte

PDLAPDCT()

the innermost loop to avoid overflow and determine the sign of a
floating point number. pdlapdct will be referred to as the "paranoid

PDLAPIV()

or a column. the pivot vector should be aligned with the distributed
matrix a. this routine will transpose the pivot vector if necessary
sub( a ), pass rowcol='c' and pivroc='c'.

PDLAPV2()

specifies if the rows or columns are to be permuted:
= 'r' rows will be permuted

PDLARED1D()

rows and that all process columns contain the same copy of
bycol.  the output array, byall, will be identical on all processe

PDLARED2D()

columns and that all process rows contain the same copy of
byrow.  the output array, byall, will be identical on all processe

PDLASMSUB()

on exit, this yields the bottom portion of the unreduced
submatrix.  this will satisfy: l <= m  <= i-1
smlnum  (global input) double precision

PDLASWP()

already been broadcast along the process row or column.
also note that this routine will only work for k1-k2 being in th
pdlapiv.

PDPBSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
(nb+2*bw)*bw

PDPBTRF()

move block into place that it will be expected to be fo

PDPBTRS()

(nb+2*bw)*bw
if laf is not large enough, an error code will be returne

PDPOSVX()

diag(sr) * a * diag(sc) * inv(diag(sc)) * x = diag(sr) * b
whether or not the system will be equilibrated depends on th
overwritten by diag(sr)*a*diag(sc) and b by diag(sr)*b.

PDPTSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
(12*npcol + 3*nb)

PDPTTRF()

move block into place that it will be expected to be fo

PDPTTRS()

(nb+2)
if laf is not large enough, an error code will be returne

PDRSCL()

the scalar a which is used to divide each component of
sub( x ).  sa must be >= 0, or the subroutine will divide b

PDSTEBZ()

specifies which eigenvalues are to be found.
= 'a': ("all")   all eigenvalues will be found
[vl, vu] will be found.

PDSTEDC()

this code makes very mild assumptions about floating point
arithmetic. it will work on machines with a guard digit i
which subtract like the cray x-mp, cray y-mp, cray c-90, or cray-2.

PDSTEIN()

orthogonalized can be stored in one process.
no orthogonalization will be done if orfac equals zero
orfac should be identical on all processes.

PDSYEVX()

range   (global input) character*1
= 'a': all eigenvalues will be found
= 'i': the il-th through iu-th eigenvalues will be found.

PDSYGVX()

range   (global input) character*1
= 'a': all eigenvalues will be found
= 'i': the il-th through iu-th eigenvalues will be found.

PDSYTTRD()

pdsyttrd is not intended to be called directly.  all users are
encourage to call pdsytrd which will then call pdhettrd i
the process grid must be square ( i.e. nprow = npcol ) and

PDZSUM1()

when the result of a vector-oriented pblas call is a scalar, it will
being operated on.  let x be a generic term for the input vector(s).

PJLAENV()

this routine will not function correctly if it is converted to al

PSCSUM1()

when the result of a vector-oriented pblas call is a scalar, it will
being operated on.  let x be a generic term for the input vector(s).

PSDBSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
nb*(bwl+bwu)+6*max(bwl,bwu)*max(bwl,bwu)

PSDBTRS()

nb*(bwl+bwu)+6*max(bwl,bwu)*max(bwl,bwu)
if laf is not large enough, an error code will be returne

PSDTSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
(12*npcol+3*nb)

PSDTTRF()

move block into place that it will be expected to be fo

PSDTTRS()

2*(nb+2)
if laf is not large enough, an error code will be returne

PSGBSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
(nb+bwu)*(bwl+bwu)+6*(bwl+bwu)*(bwl+2*bwu)

PSGBTRF()

in this case the loop over the levels will not b

PSGBTRS()

(nb+bwu)*(bwl+bwu)+6*(bwl+bwu)*(bwl+2*bwu)
if laf is not large enough, an error code will be returne

PSGESVX()

trans = 'c': (diag(r)*a*diag(c))**h *inv(diag(r))*x = diag(c)*b
whether or not the system will be equilibrated depends on th
overwritten by diag(r)*a*diag(c) and b by diag(r)*b (if trans='n')

PSGETF2()

the factorization has been completed, but the factor u
is exactly singular, and division by zero will occur i

PSGETRF()

the factorization has been completed, but the factor u
is exactly singular, and division by zero will occur i

PSLACON()

on the initial call to pslacon, kase should be 0.
on an intermediate return, kase will be 1 or 2, indicatin
on the final return from pslacon, kase will again be 0.

PSLACONSB()

on exit, this yields the starting location of the qr double
shift.  this will satisfy: l <= m  <= i-2
h44

PSLAEBZ()

the maximum number of intervals that may be generated. if
more than mmax intervals are generated, then pslaebz will

PSLAECV()

oendpoint f the j-th interval, and intvl(2*j) is the right
endpoint of the j-th interval. the input intervals will
on input, intvl contains the kl-kf input intervals.

PSLAED2()

dlamda (global output) real array, dimension (n)
a copy of the first k eigenvalues which will be used b

PSLAED3()

this code makes very mild assumptions about floating point
arithmetic. it will work on machines with a guard digit i
which subtract like the cray x-mp, cray y-mp, cray c-90, or cray-2.

PSLAHQR()

nbulge is the number of bulges that will be attempte

PSLAPDCT()

the innermost loop to avoid overflow and determine the sign of a
floating point number. pslapdct will be referred to as the "paranoid

PSLAPIV()

or a column. the pivot vector should be aligned with the distributed
matrix a. this routine will transpose the pivot vector if necessary
sub( a ), pass rowcol='c' and pivroc='c'.

PSLAPV2()

specifies if the rows or columns are to be permuted:
= 'r' rows will be permuted

PSLARED1D()

rows and that all process columns contain the same copy of
bycol.  the output array, byall, will be identical on all processe

PSLARED2D()

columns and that all process rows contain the same copy of
byrow.  the output array, byall, will be identical on all processe

PSLASMSUB()

on exit, this yields the bottom portion of the unreduced
submatrix.  this will satisfy: l <= m  <= i-1
smlnum  (global input) real

PSLASWP()

already been broadcast along the process row or column.
also note that this routine will only work for k1-k2 being in th
pslapiv.

PSPBSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
(nb+2*bw)*bw

PSPBTRF()

move block into place that it will be expected to be fo

PSPBTRS()

(nb+2*bw)*bw
if laf is not large enough, an error code will be returne

PSPOSVX()

diag(sr) * a * diag(sc) * inv(diag(sc)) * x = diag(sr) * b
whether or not the system will be equilibrated depends on th
overwritten by diag(sr)*a*diag(sc) and b by diag(sr)*b.

PSPTSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
(12*npcol + 3*nb)

PSPTTRF()

move block into place that it will be expected to be fo

PSPTTRS()

(nb+2)
if laf is not large enough, an error code will be returne

PSRSCL()

the scalar a which is used to divide each component of
sub( x ).  sa must be >= 0, or the subroutine will divide b

PSSTEBZ()

specifies which eigenvalues are to be found.
= 'a': ("all")   all eigenvalues will be found
[vl, vu] will be found.

PSSTEDC()

this code makes very mild assumptions about floating point
arithmetic. it will work on machines with a guard digit i
which subtract like the cray x-mp, cray y-mp, cray c-90, or cray-2.

PSSTEIN()

orthogonalized can be stored in one process.
no orthogonalization will be done if orfac equals zero
orfac should be identical on all processes.

PSSYEVX()

range   (global input) character*1
= 'a': all eigenvalues will be found
= 'i': the il-th through iu-th eigenvalues will be found.

PSSYGVX()

range   (global input) character*1
= 'a': all eigenvalues will be found
= 'i': the il-th through iu-th eigenvalues will be found.

PSSYTTRD()

pssyttrd is not intended to be called directly.  all users are
encourage to call pssytrd which will then call pshettrd i
the process grid must be square ( i.e. nprow = npcol ) and

PZDBSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
nb*(bwl+bwu)+6*max(bwl,bwu)*max(bwl,bwu)

PZDBTRF()

move block into place that it will be expected to be fo

PZDBTRS()

nb*(bwl+bwu)+6*max(bwl,bwu)*max(bwl,bwu)
if laf is not large enough, an error code will be returne

PZDRSCL()

the scalar a which is used to divide each component of
sub( x ).  sa must be >= 0, or the subroutine will divide b

PZDTSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
(12*npcol+3*nb)

PZDTTRF()

move block into place that it will be expected to be fo

PZDTTRS()

2*(nb+2)
if laf is not large enough, an error code will be returne

PZGBSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
(nb+bwu)*(bwl+bwu)+6*(bwl+bwu)*(bwl+2*bwu)

PZGBTRF()

in this case the loop over the levels will not b

PZGBTRS()

(nb+bwu)*(bwl+bwu)+6*(bwl+bwu)*(bwl+2*bwu)
if laf is not large enough, an error code will be returne

PZGESVX()

trans = 'c': (diag(r)*a*diag(c))**h *inv(diag(r))*x = diag(c)*b
whether or not the system will be equilibrated depends on th
overwritten by diag(r)*a*diag(c) and b by diag(r)*b (if trans='n')

PZGETF2()

the factorization has been completed, but the factor u
is exactly singular, and division by zero will occur i

PZGETRF()

the factorization has been completed, but the factor u
is exactly singular, and division by zero will occur i

PZHEEVX()

range   (global input) character*1
= 'a': all eigenvalues will be found
= 'i': the il-th through iu-th eigenvalues will be found.

PZHEGVX()

range   (global input) character*1
= 'a': all eigenvalues will be found
= 'i': the il-th through iu-th eigenvalues will be found.

PZHETTRD()

pzhettrd is not intended to be called directly.  all users are
encourage to call pzhetrd which will then call pzhettrd i
the process grid must be square ( i.e. nprow = npcol ) and

PZLACON()

on the initial call to pzlacon, kase should be 0.
on an intermediate return, kase will be 1 or 2, indicatin
on the final return from pzlacon, kase will again be 0.

PZLACONSB()

on exit, this yields the starting location of the qr double
shift.  this will satisfy: l <= m  <= i-2
h44

PZLAHQR()

nbulge is the number of bulges that will be attempte

PZLAPIV()

or a column. the pivot vector should be aligned with the distributed
matrix a. this routine will transpose the pivot vector if necessary
sub( a ), pass rowcol='c' and pivroc='c'.

PZLAPV2()

specifies if the rows or columns are to be permuted:
= 'r' rows will be permuted

PZLASMSUB()

on exit, this yields the bottom portion of the unreduced
submatrix.  this will satisfy: l <= m  <= i-1
smlnum  (global input) double precision

PZLASSQ()

the value of sumsq is assumed to be at least unity and the value of
ssq will then satisf
1.0 .le. ssq .le. ( sumsq + 2*n ).

PZLASWP()

already been broadcast along the process row or column.
also note that this routine will only work for k1-k2 being in th
pzlapiv.

PZMAX1()

when the result of a vector-oriented pblas call is a scalar, it will
being operated on.  let x be a generic term for the input vector(s).

PZPBSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
(nb+2*bw)*bw

PZPBTRF()

move block into place that it will be expected to be fo

PZPBTRS()

(nb+2*bw)*bw
if laf is not large enough, an error code will be returne

PZPOSVX()

diag(sr) * a * diag(sc) * inv(diag(sc)) * x = diag(sr) * b
whether or not the system will be equilibrated depends on th
overwritten by diag(sr)*a*diag(sc) and b by diag(sr)*b.

PZPTSV()

size of user-input workspace work.
if lwork is too small, the minimal acceptable size will b
(12*npcol + 3*nb)

PZPTTRF()

move block into place that it will be expected to be fo

PZPTTRS()

(nb+2)
if laf is not large enough, an error code will be returne

PZSTEIN()

orthogonalized can be stored in one process.
no orthogonalization will be done if orfac equals zero
orfac should be identical on all processes.

SDBTF2()

has been completed, but the factor u is exactly
singular, and division by zero will occur if it is use

SDTTRF()

has been completed, but the factor u is exactly
singular, and division by zero will occur if it is use

ZDBTF2()

has been completed, but the factor u is exactly
singular, and division by zero will occur if it is use

ZDTTRF()

has been completed, but the factor u is exactly
singular, and division by zero will occur if it is use

wing

PCGEHD2()

such a global array has an associated description vector desca.
in the following comments, the character _ should be read a

PDGEHD2()

such a global array has an associated description vector desca.
in the following comments, the character _ should be read a

PSGEHD2()

such a global array has an associated description vector desca.
in the following comments, the character _ should be read a

PZGEHD2()

such a global array has an associated description vector desca.
in the following comments, the character _ should be read a

wise

PDSYEVX()

biggest boost in performance comes for small n, so it
is wise to provide the extra workspace (typically les

PDSYGVX()

biggest boost in performance comes for small n, so it
is wise to provide the extra workspace (typically les

PSSYEVX()

biggest boost in performance comes for small n, so it
is wise to provide the extra workspace (typically les

PSSYGVX()

biggest boost in performance comes for small n, so it
is wise to provide the extra workspace (typically les

with

CDBTF2()

cdbtrf computes an lu factorization of a real m-by-n band matrix a
without using partial pivoting with row interchanges
this is the unblocked version of the algorithm, calling level 2 blas.

CDTTRF()

cdttrf computes an lu factorization of a complex tridiagonal matrix a
using elimination without partial pivoting
the factorization has the form

CDTTRSV()

u * x = b,  u**t * x = b,  or  u**h * x = b,
with factors of the tridiagonal matrix a from the lu factorizatio

CLAHQR2()

ihi to ilo in steps of 1 or 2. each iteration of the loop works
with the active submatrix in rows and columns l to i
h(l,l-1) is negligible so that the matrix splits.

CTRMVT()

t      - complex array of dimension ( ldt, n ).
before entry with  uplo = 'u' or 'u', the leading n by 
triangular matrix and the strictly lower triangular part of

DDBTF2()

ddbtrf computes an lu factorization of a real m-by-n band matrix a
without using partial pivoting with row interchanges
this is the unblocked version of the algorithm, calling level 2 blas.

DDTTRF()

ddttrf computes an lu factorization of a complex tridiagonal matrix a
using elimination without partial pivoting
the factorization has the form

DDTTRSV()

u * x = b,  u**t * x = b,  or  u**h * x = b,
with factors of the tridiagonal matrix a from the lu factorizatio

DSTEIN2()

compute lu factors with partial pivoting  ( pt = lu

DTRMVT()

t      - double precision array of dimension ( ldt, n ).
before entry with  uplo = 'u' or 'u', the leading n by 
triangular matrix and the strictly lower triangular part of

PCDBSV()

banded diagonally dominant-like distributed
matrix with bandwidth bwl, bwu
gaussian elimination without pivoting

PCDBTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCDBTRS()

banded diagonally dominant-like distributed
matrix with bandwidth bwl, bwu
routine pcdbtrf must be called first.

PCDBTRSV()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCDTSV()

gaussian elimination without pivotin
of the matrix into l u.

PCDTTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCDTTRS()

trans   (global input) character
= 'n':  solve with a(1:n, ja:ja+n-1)

PCDTTRSV()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCGBSV()

banded distributed
matrix with bandwidth bwl, bwu
gaussian elimination with pivoting

PCGBTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCGBTRS()

banded distributed
matrix with bandwidth bwl, bwu
routine pcgbtrf must be called first.

PCGEBD2()

the diagonal and the first superdiagonal of sub( a ) are
overwritten with the upper bidiagonal matrix b; the element
unitary matrix q as a product of elementary reflectors, and

PCGEBRD()

the diagonal and the first superdiagonal of sub( a ) are
overwritten with the upper bidiagonal matrix b; the element
unitary matrix q as a product of elementary reflectors, and

PCGEEQU()

the column scale factors, chosen to try to make the largest entry in
each row and column of the distributed matrix b with element

PCGEHD2()

the upper triangle and the first subdiagonal of sub( a ) are
overwritten with the upper hessenberg matrix h, and the ele
sent the unitary matrix q as a product of elementary

PCGEHRD()

the upper triangle and the first subdiagonal of sub( a ) are
overwritten with the upper hessenberg matrix h, and the ele
sent the unitary matrix q as a product of elementary

PCGELQ2()

lower trapezoidal matrix l (l is lower triangular if m <= n);
the elements above the diagonal, with the array tau, repre
reflectors (see further details).

PCGELQF()

lower trapezoidal matrix l (l is lower triangular if m <= n);
the elements above the diagonal, with the array tau, repre
reflectors (see further details).

PCGELS()

where lcmp = lcm / nprow with lcm = ilcm( nprow, npcol )
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PCGEQL2()

the (n-m)-th superdiagonal contain the m by n lower
trapezoidal matrix l; the remaining elements, with th
elementary reflectors (see further details).

PCGEQLF()

the (n-m)-th superdiagonal contain the m by n lower
trapezoidal matrix l; the remaining elements, with th
elementary reflectors (see further details).

PCGEQPF()

pcgeqpf computes a qr factorization with column pivoting of

PCGEQR2()

upper trapezoidal matrix r (r is upper triangular if m >= n);
the elements below the diagonal, with the array tau
reflectors (see further details).

PCGEQRF()

upper trapezoidal matrix r (r is upper triangular if m >= n);
the elements below the diagonal, with the array tau
reflectors (see further details).

PCGERFS()

by pcgetrf. ipiv(i) -> the global row local row i
was swapped with. this array is tied to the distribute

PCGERQ2()

and above the (m-n)-th subdiagonal contain the m by n upper
trapezoidal matrix r; the remaining elements, with the arra
elementary reflectors (see further details).

PCGERQF()

and above the (m-n)-th subdiagonal contain the m by n upper
trapezoidal matrix r; the remaining elements, with the arra
elementary reflectors (see further details).

PCGESV()

the lu decomposition with partial pivoting and row interchanges i
tation matrix, l is unit lower triangular, and u is upper triangular.

PCGESVX()

if equed is not 'n', the matrix
a(ia:ia+n-1,ja:ja+n-1) has been equilibrated with
a(ia:ia+n-1,ja:ja+n-1), af(iaf:iaf+n-1,jaf:jaf+n-1),

PCGETF2()

distributed matrix sub( a ) = a(ia:ia+m-1,ja:ja+n-1) using
partial pivoting with row interchanges
the factorization has the form sub( a ) = p * l * u, where p is a

PCGETRF()

pcgetrf computes an lu factorization of a general m-by-n distributed
matrix sub( a ) = (ia:ia+m-1,ja:ja+n-1) using partial pivoting with

PCGETRI()

keeps track of the pivoting information. ipiv(i) is the
global row index the local row i was swapped with.  thi

PCGETRS()

with a general n-by-n distributed matrix sub( a ) using the l
sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1), op( a ) = a, a**t or a**h

PCGGQRF()

upper trapezoidal matrix r (r is upper triangular if n >= m);
the elements below the diagonal, with the array taua
elementary reflectors (see further details).

PCGGRQF()

and above the (m-n)-th subdiagonal contain the m by n upper
trapezoidal matrix r; the remaining elements, with the arra
elementary reflectors (see further details).

PCHEEV()

different processes.  because of this, it is possible that a
heterogeneous system may return incorrect results without any erro

PCHEEVD()

lwork = n + ( np0 + mq0 + nb ) * nb,
with  np0 = numroc( max( n, nb, 2 ), nb, 0, 0, nprow

PCHEEVX()

set to twice the underflow threshold 2*pslamch('s') not zero.
if this routine returns with ((mod(info,2).ne.0) .or
eigenvectors did not converge, try setting abstol to

PCHEGVX()

set to twice the underflow threshold 2*pslamch('s') not zero.
if this routine returns with ((mod(info,2).ne.0) .or
eigenvectors did not converge, try setting abstol to

PCHENTRD()

matrix t, and the elements above the first superdiagonal,
with the array tau, represent the unitary matrix q as 
and first subdiagonal of sub( a ) are overwritten by the

PCHETD2()

matrix t, and the elements above the first superdiagonal,
with the array tau, represent the unitary matrix q as 
and first subdiagonal of sub( a ) are overwritten by the

PCHETRD()

matrix t, and the elements above the first superdiagonal,
with the array tau, represent the unitary matrix q as 
and first subdiagonal of sub( a ) are overwritten by the

PCHETTRD()

matrix t, and the elements above the first superdiagonal,
with the array tau, represent the unitary matrix q as 
and first subdiagonal of sub( a ) are overwritten by the

PCLABRD()

if m >= n, elements on and below the diagonal in the first nb
columns, with the array tauq, represent the unitar
elements above the diagonal in the first nb rows, with the

PCLACON()

a. reverse communication is used for evaluating matrix-vector
products. x and v are aligned with the distributed matrix a, thi

PCLAEVSWP()

the eigenvectors on output.  the eigenvectors are distributed
in a block cyclic manner in both dimensions, with

PCLAHQR()

ihi to ilo in steps of our schur block size (<=2*iblk). each
iteration of the loop works  with the active submatrix in row
converged. either l = ilo or the global a(l,l-1) is negligible

PCLAHRD()

the k-th subdiagonal in the first nb columns are overwritten
with the corresponding elements of the reduced distribute
array tau, represent the matrix q as a product of elementary

PCLAMR1D()

to 1.  indeed, i suspect that ib should always be set to 1 or ignored
with 1 used in its place
pclamr1d has not been tested except withint the contect of

PCLANGE()

( max(abs(a(i,j))),  norm = 'm' or 'm' with ia <= i <= ia+m-1
(

PCLAPIV()

pivoting. the pivot vector may be distributed across a process row
or a column. the pivot vector should be aligned with the distribute
for example if the row pivots should be applied to the columns of

PCLAPV2()

a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting.  the
pivot vector should be aligned with the distributed matrix a.  fo
process column and replicated over all process rows.  similarly,

PCLAQGE()

r       (local input) real array, dimension locr(m_a)
the row scale factors for sub( a ). r is aligned with th
column. r is tied to the distributed matrix a.

PCLAQSY()

the scale factors for a(ia:ia+m-1,ja:ja+n-1). sr is aligned
with the distributed matrix a, and replicated across ever

PCLARFB()

where lcmq = lcm / npcol with lcm = iclm( nprow, npcol )
iroffv = mod( iv-1, mb_v ), icoffv = mod( jv-1, nb_v ),

PCLARFG()

before entry, the incremented array sub( x ) must contain
the vector x. on exit, it is overwritten with the vector v
ix      (global input) integer

PCLARFT()

the k-by-k triangular factor of the block reflector asso-
ciated with v. if direct = 'f', t is upper triangular

PCLARZB()

where lcmq = lcm / npcol with lcm = iclm( nprow, npcol )
iroffv = mod( iv-1, mb_v ), icoffv = mod( jv-1, nb_v ),

PCLARZT()

it contains the k-by-k triangular factor of the block
reflector associated with v. t is lower triangular
work    (local workspace) complex array,

PCLASSQ()

on entry, the value  scale  in the equation above.
on exit, scale is overwritten with  scl , the scaling facto

PCLATRD()

on exit, if uplo = 'u', the last nb columns have been reduced
to tridiagonal form, with the diagonal elements overwritin
diagonal with the array tau, represent the unitary matrix q

PCLATRZ()

gular matrix r, and elements n-l+1 to n of the first m rows
of sub( a ), with the array tau, represent the unitary matri

PCLAUU2()

on exit, if uplo = 'u', the upper triangle of the distributed
matrix sub( a ) is overwritten with the upper triangle of th
is overwritten with the lower triangle of the product l' * l.

PCLAUUM()

on exit, if uplo = 'u', the upper triangle of the distributed
matrix sub( a ) is overwritten with the upper triangle of th
is overwritten with the lower triangle of the product l' * l.

PCMAX1()

when the result of a vector-oriented pblas call is a scalar, it will
be made available only within the scope which owns the vector(s
then, the processes which receive the answer will be (note that if

PCPBSV()

banded symmetric positive definite distributed
matrix with bandwidth bw
cholesky factorization is used to factor a reordering of

PCPBTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCPBTRS()

banded symmetric positive definite distributed
matrix with bandwidth bw
a(1:n, ja:ja+n-1) = u'*u or l*l' as computed by pcpbtrf.

PCPBTRSV()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCPOEQU()

sub( a ) = a(ia:ia+n-1,ja:ja+n-1) and reduce its condition number
(with respect to the two-norm).  sr and sc contain the scal
buted matrix b with elements b(i,j) = s(i)*a(i,j)*s(j) has ones on

PCPOSV()

ted matrix sub( b ). on exit, if info = 0, sub( b ) is over-
written with the solution distributed matrix x
ib      (global input) integer

PCPOSVX()

if equed = 'y', the matrix a has been equilibrated
with scaling factors given by s.  a and af will no
= 'n':  the matrix a will be copied to af and factored.

PCPTSV()

matrix. globally, du(n) is not referenced, and du must be
aligned with d
factors of the matrix.

PCPTTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCPTTRS()

matrix. globally, du(n) is not referenced, and du must be
aligned with d
factors of the matrix.

PCPTTRSV()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PCSTEIN()

eigenvalues should be grouped by split-off block and ordered
from smallest to largest within the block (the output arra
array should be replicated on all processes.

PCTRRFS()

pctrrfs provides error bounds and backward error estimates for the
solution to a system of linear equations with a triangula

PCTZRZF()

gular matrix r, and elements m+1 to n of the first m rows of
sub( a ), with the array tau, represent the unitary matrix

PCUNG2L()

pcung2l generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a

PCUNG2R()

pcung2r generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a
m

PCUNGL2()

pcungl2 generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined a

PCUNGLQ()

pcunglq generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined a

PCUNGQL()

pcungql generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a

PCUNGQR()

pcungqr generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a
m

PCUNGR2()

pcungr2 generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as th

PCUNGRQ()

pcungrq generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as th

PCUNM2L()

pcunm2l overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PCUNM2R()

pcunm2r overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'            side = 'r'

PCUNMBR()

if vect = 'q', pcunmbr overwrites the general complex distributed
m-by-n matrix sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PCUNMHR()

pcunmhr overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PCUNML2()

pcunml2 overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'          side = 'r'

PCUNMLQ()

pcunmlq overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'          side = 'r'

PCUNMQL()

pcunmql overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PCUNMQR()

pcunmqr overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'            side = 'r'

PCUNMR2()

pcunmr2 overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'          side = 'r'

PCUNMR3()

pcunmr3 overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'          side = 'r'

PCUNMRQ()

pcunmrq overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PCUNMRZ()

pcunmrz overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PCUNMTR()

pcunmtr overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PDDBSV()

banded diagonally dominant-like distributed
matrix with bandwidth bwl, bwu
gaussian elimination without pivoting

PDDBTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDDBTRS()

banded diagonally dominant-like distributed
matrix with bandwidth bwl, bwu
routine pddbtrf must be called first.

PDDBTRSV()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDDTSV()

gaussian elimination without pivotin
of the matrix into l u.

PDDTTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDDTTRS()

trans   (global input) character
= 'n':  solve with a(1:n, ja:ja+n-1)

PDDTTRSV()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDGBSV()

banded distributed
matrix with bandwidth bwl, bwu
gaussian elimination with pivoting

PDGBTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDGBTRS()

banded distributed
matrix with bandwidth bwl, bwu
routine pdgbtrf must be called first.

PDGEBD2()

the diagonal and the first superdiagonal of sub( a ) are
overwritten with the upper bidiagonal matrix b; the element
orthogonal matrix q as a product of elementary reflectors,

PDGEBRD()

the diagonal and the first superdiagonal of sub( a ) are
overwritten with the upper bidiagonal matrix b; the element
orthogonal matrix q as a product of elementary reflectors,

PDGEEQU()

the column scale factors, chosen to try to make the largest entry in
each row and column of the distributed matrix b with element

PDGEHD2()

the upper triangle and the first subdiagonal of sub( a ) are
overwritten with the upper hessenberg matrix h, and the ele
sent the orthogonal matrix q as a product of elementary

PDGEHRD()

the upper triangle and the first subdiagonal of sub( a ) are
overwritten with the upper hessenberg matrix h, and the ele
sent the orthogonal matrix q as a product of elementary

PDGELQ2()

lower trapezoidal matrix l (l is lower triangular if m <= n);
the elements above the diagonal, with the array tau, repre
reflectors (see further details).

PDGELQF()

lower trapezoidal matrix l (l is lower triangular if m <= n);
the elements above the diagonal, with the array tau, repre
reflectors (see further details).

PDGELS()

where lcmp = lcm / nprow with lcm = ilcm( nprow, npcol )
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PDGEQL2()

the (n-m)-th superdiagonal contain the m by n lower
trapezoidal matrix l; the remaining elements, with th
elementary reflectors (see further details).

PDGEQLF()

the (n-m)-th superdiagonal contain the m by n lower
trapezoidal matrix l; the remaining elements, with th
elementary reflectors (see further details).

PDGEQPF()

pdgeqpf computes a qr factorization with column pivoting of

PDGEQR2()

upper trapezoidal matrix r (r is upper triangular if m >= n);
the elements below the diagonal, with the array tau
reflectors (see further details).

PDGEQRF()

upper trapezoidal matrix r (r is upper triangular if m >= n);
the elements below the diagonal, with the array tau
reflectors (see further details).

PDGERFS()

by pdgetrf. ipiv(i) -> the global row local row i
was swapped with. this array is tied to the distribute

PDGERQ2()

and above the (m-n)-th subdiagonal contain the m by n upper
trapezoidal matrix r; the remaining elements, with the arra
elementary reflectors (see further details).

PDGERQF()

and above the (m-n)-th subdiagonal contain the m by n upper
trapezoidal matrix r; the remaining elements, with the arra
elementary reflectors (see further details).

PDGESV()

the lu decomposition with partial pivoting and row interchanges i
tation matrix, l is unit lower triangular, and u is upper triangular.

PDGESVX()

if equed is not 'n', the matrix
a(ia:ia+n-1,ja:ja+n-1) has been equilibrated with
a(ia:ia+n-1,ja:ja+n-1), af(iaf:iaf+n-1,jaf:jaf+n-1),

PDGETF2()

distributed matrix sub( a ) = a(ia:ia+m-1,ja:ja+n-1) using
partial pivoting with row interchanges
the factorization has the form sub( a ) = p * l * u, where p is a

PDGETRF()

pdgetrf computes an lu factorization of a general m-by-n distributed
matrix sub( a ) = (ia:ia+m-1,ja:ja+n-1) using partial pivoting with

PDGETRI()

keeps track of the pivoting information. ipiv(i) is the
global row index the local row i was swapped with.  thi

PDGETRS()

with a general n-by-n distributed matrix sub( a ) using the l
sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1), op( a ) = a or a**t and

PDGGQRF()

upper trapezoidal matrix r (r is upper triangular if n >= m);
the elements below the diagonal, with the array taua
elementary reflectors (see further details).

PDGGRQF()

and above the (m-n)-th subdiagonal contain the m by n upper
trapezoidal matrix r; the remaining elements, with the arra
elementary reflectors (see further details).

PDLABAD()

the log of large is sufficiently large.  this subroutine is intended
to identify machines with a large exponent range, such as the crays
of the values computed by pdlamch.  this subroutine is needed because

PDLABRD()

if m >= n, elements on and below the diagonal in the first nb
columns, with the array tauq, represent the orthogona
elements above the diagonal in the first nb rows, with the

PDLACON()

reverse communication is used for evaluating matrix-vector products.
x and v are aligned with the distributed matrix a, this informatio

PDLAEBZ()

specifies the computation done by pdlaebz
= 0 : find an interval with desired values of n(w) at th
= 1 : find a floating point number contained in the initial

PDLAED0()

work    (local workspace ) double precision array, dimension (lwork)
lwork = 6*n + 2*np*nq, with
nq = numroc( n, nb_q, mycol, iqcol, npcol )

PDLAED1()

where z = q'u, u is a vector of length n with ones in th

PDLAED2()

on entry, q contains the eigenvectors of two submatrices in
the two square blocks with corners at (1,1), (n1,n1
on exit, q contains the trailing (n-k) updated eigenvectors

PDLAED3()

this code makes very mild assumptions about floating point
arithmetic. it will work on machines with a guard digit i
which subtract like the cray x-mp, cray y-mp, cray c-90, or cray-2.

PDLAEVSWP()

the eigenvectors on output.  the eigenvectors are distributed
in a block cyclic manner in both dimensions, with

PDLAHQR()

ihi to ilo in steps of our schur block size (<=2*iblk). each
iteration of the loop works  with the active submatrix in row
converged. either l = ilo or the global a(l,l-1) is negligible

PDLAHRD()

the k-th subdiagonal in the first nb columns are overwritten
with the corresponding elements of the reduced distribute
array tau, represent the matrix q as a product of elementary

PDLAMR1D()

to 1.  indeed, i suspect that ib should always be set to 1 or ignored
with 1 used in its place
pdlamr1d has not been tested except withint the contect of

PDLANGE()

( max(abs(a(i,j))),  norm = 'm' or 'm' with ia <= i <= ia+m-1
(

PDLAPIV()

pivoting. the pivot vector may be distributed across a process row
or a column. the pivot vector should be aligned with the distribute
for example if the row pivots should be applied to the columns of

PDLAPV2()

a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting.  the
pivot vector should be aligned with the distributed matrix a.  fo
process column and replicated over all process rows.  similarly,

PDLAQGE()

r       (local input) double precision array, dimension locr(m_a)
the row scale factors for sub( a ). r is aligned with th
column. r is tied to the distributed matrix a.

PDLAQSY()

the scale factors for a(ia:ia+m-1,ja:ja+n-1). sr is aligned
with the distributed matrix a, and replicated across ever

PDLARFB()

where lcmq = lcm / npcol with lcm = iclm( nprow, npcol )
iroffv = mod( iv-1, mb_v ), icoffv = mod( jv-1, nb_v ),

PDLARFG()

before entry, the incremented array sub( x ) must contain
the vector x. on exit, it is overwritten with the vector v
ix      (global input) integer

PDLARFT()

the k-by-k triangular factor of the block reflector asso-
ciated with v. if direct = 'f', t is upper triangular

PDLARZB()

where lcmq = lcm / npcol with lcm = iclm( nprow, npcol )
iroffv = mod( iv-1, mb_v ), icoffv = mod( jv-1, nb_v ),

PDLARZT()

it contains the k-by-k triangular factor of the block
reflector associated with v. t is lower triangular
work    (local workspace) double precision array,

PDLASSQ()

on entry, the value  scale  in the equation above.
on exit, scale is overwritten with  scl , the scaling facto

PDLATRD()

on exit, if uplo = 'u', the last nb columns have been reduced
to tridiagonal form, with the diagonal elements overwritin
diagonal with the array tau, represent the orthogonal matrix

PDLATRZ()

gular matrix r, and elements n-l+1 to n of the first m rows
of sub( a ), with the array tau, represent the orthogona

PDLAUU2()

on exit, if uplo = 'u', the upper triangle of the distributed
matrix sub( a ) is overwritten with the upper triangle of th
is overwritten with the lower triangle of the product l' * l.

PDLAUUM()

on exit, if uplo = 'u', the upper triangle of the distributed
matrix sub( a ) is overwritten with the upper triangle of th
is overwritten with the lower triangle of the product l' * l.

PDORG2L()

pdorg2l generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a

PDORG2R()

pdorg2r generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a
m

PDORGL2()

pdorgl2 generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined a

PDORGLQ()

pdorglq generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined a

PDORGQL()

pdorgql generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a

PDORGQR()

pdorgqr generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a
m

PDORGR2()

pdorgr2 generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as th

PDORGRQ()

pdorgrq generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as th

PDORM2L()

pdorm2l overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'            side = 'r'

PDORM2R()

pdorm2r overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'            side = 'r'

PDORMBR()

if vect = 'q', pdormbr overwrites the general real distributed m-by-n
matrix sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PDORMHR()

pdormhr overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PDORML2()

pdorml2 overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'          side = 'r'

PDORMLQ()

pdormlq overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'          side = 'r'

PDORMQL()

pdormql overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'            side = 'r'

PDORMQR()

pdormqr overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'            side = 'r'

PDORMR2()

pdormr2 overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'          side = 'r'

PDORMR3()

pdormr3 overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'          side = 'r'

PDORMRQ()

pdormrq overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PDORMRZ()

pdormrz overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PDORMTR()

pdormtr overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PDPBSV()

banded symmetric positive definite distributed
matrix with bandwidth bw
cholesky factorization is used to factor a reordering of

PDPBTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDPBTRS()

banded symmetric positive definite distributed
matrix with bandwidth bw
a(1:n, ja:ja+n-1) = u'*u or l*l' as computed by pdpbtrf.

PDPBTRSV()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDPOEQU()

sub( a ) = a(ia:ia+n-1,ja:ja+n-1) and reduce its condition number
(with respect to the two-norm).  sr and sc contain the scal
buted matrix b with elements b(i,j) = s(i)*a(i,j)*s(j) has ones on

PDPOSV()

ted matrix sub( b ). on exit, if info = 0, sub( b ) is over-
written with the solution distributed matrix x
ib      (global input) integer

PDPOSVX()

if equed = 'y', the matrix a has been equilibrated
with scaling factors given by s.  a and af will no
= 'n':  the matrix a will be copied to af and factored.

PDPTSV()

matrix. globally, du(n) is not referenced, and du must be
aligned with d
factors of the matrix.

PDPTTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDPTTRS()

matrix. globally, du(n) is not referenced, and du must be
aligned with d
factors of the matrix.

PDPTTRSV()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PDSTEBZ()

split-off block (see iblock, isplit) and
ordered from smallest to largest withi
= 'e': ("entire matrix")

PDSTEDC()

this code makes very mild assumptions about floating point
arithmetic. it will work on machines with a guard digit i
which subtract like the cray x-mp, cray y-mp, cray c-90, or cray-2.

PDSTEIN()

eigenvalues should be grouped by split-off block and ordered
from smallest to largest within the block (the output arra
array should be replicated on all processes.

PDSYEV()

the different processes.  because of this, it is possible that a
heterogeneous system may return incorrect results without any erro

PDSYEVD()

the different processes.  because of this, it is possible that a
heterogeneous system may return incorrect results without any erro

PDSYEVX()

set to twice the underflow threshold 2*pdlamch('s') not zero.
if this routine returns with ((mod(info,2).ne.0) .or
eigenvectors did not converge, try setting abstol to

PDSYGVX()

set to twice the underflow threshold 2*pdlamch('s') not zero.
if this routine returns with ((mod(info,2).ne.0) .or
eigenvectors did not converge, try setting abstol to

PDSYNTRD()

matrix t, and the elements above the first superdiagonal,
with the array tau, represent the orthogonal matrix q as 
and first subdiagonal of sub( a ) are overwritten by the

PDSYTD2()

matrix t, and the elements above the first superdiagonal,
with the array tau, represent the orthogonal matrix q as 
and first subdiagonal of sub( a ) are overwritten by the

PDSYTRD()

matrix t, and the elements above the first superdiagonal,
with the array tau, represent the orthogonal matrix q as 
and first subdiagonal of sub( a ) are overwritten by the

PDSYTTRD()

matrix t, and the elements above the first superdiagonal,
with the array tau, represent the unitary matrix q as 
and first subdiagonal of sub( a ) are overwritten by the

PDTRRFS()

pdtrrfs provides error bounds and backward error estimates for the
solution to a system of linear equations with a triangula

PDTZRZF()

gular matrix r, and elements m+1 to n of the first m rows of
sub( a ), with the array tau, represent the orthogonal matri

PDZSUM1()

the serial version of this routine was originally contributed by
nick higham for use with zlacon
notes

PSCSUM1()

the serial version of this routine was originally contributed by
nick higham for use with clacon
notes

PSDBSV()

banded diagonally dominant-like distributed
matrix with bandwidth bwl, bwu
gaussian elimination without pivoting

PSDBTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSDBTRS()

banded diagonally dominant-like distributed
matrix with bandwidth bwl, bwu
routine psdbtrf must be called first.

PSDBTRSV()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSDTSV()

gaussian elimination without pivotin
of the matrix into l u.

PSDTTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSDTTRS()

trans   (global input) character
= 'n':  solve with a(1:n, ja:ja+n-1)

PSDTTRSV()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSGBSV()

banded distributed
matrix with bandwidth bwl, bwu
gaussian elimination with pivoting

PSGBTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSGBTRS()

banded distributed
matrix with bandwidth bwl, bwu
routine psgbtrf must be called first.

PSGEBD2()

the diagonal and the first superdiagonal of sub( a ) are
overwritten with the upper bidiagonal matrix b; the element
orthogonal matrix q as a product of elementary reflectors,

PSGEBRD()

the diagonal and the first superdiagonal of sub( a ) are
overwritten with the upper bidiagonal matrix b; the element
orthogonal matrix q as a product of elementary reflectors,

PSGEEQU()

the column scale factors, chosen to try to make the largest entry in
each row and column of the distributed matrix b with element

PSGEHD2()

the upper triangle and the first subdiagonal of sub( a ) are
overwritten with the upper hessenberg matrix h, and the ele
sent the orthogonal matrix q as a product of elementary

PSGEHRD()

the upper triangle and the first subdiagonal of sub( a ) are
overwritten with the upper hessenberg matrix h, and the ele
sent the orthogonal matrix q as a product of elementary

PSGELQ2()

lower trapezoidal matrix l (l is lower triangular if m <= n);
the elements above the diagonal, with the array tau, repre
reflectors (see further details).

PSGELQF()

lower trapezoidal matrix l (l is lower triangular if m <= n);
the elements above the diagonal, with the array tau, repre
reflectors (see further details).

PSGELS()

where lcmp = lcm / nprow with lcm = ilcm( nprow, npcol )
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PSGEQL2()

the (n-m)-th superdiagonal contain the m by n lower
trapezoidal matrix l; the remaining elements, with th
elementary reflectors (see further details).

PSGEQLF()

the (n-m)-th superdiagonal contain the m by n lower
trapezoidal matrix l; the remaining elements, with th
elementary reflectors (see further details).

PSGEQPF()

psgeqpf computes a qr factorization with column pivoting of

PSGEQR2()

upper trapezoidal matrix r (r is upper triangular if m >= n);
the elements below the diagonal, with the array tau
reflectors (see further details).

PSGEQRF()

upper trapezoidal matrix r (r is upper triangular if m >= n);
the elements below the diagonal, with the array tau
reflectors (see further details).

PSGERFS()

by psgetrf. ipiv(i) -> the global row local row i
was swapped with. this array is tied to the distribute

PSGERQ2()

and above the (m-n)-th subdiagonal contain the m by n upper
trapezoidal matrix r; the remaining elements, with the arra
elementary reflectors (see further details).

PSGERQF()

and above the (m-n)-th subdiagonal contain the m by n upper
trapezoidal matrix r; the remaining elements, with the arra
elementary reflectors (see further details).

PSGESV()

the lu decomposition with partial pivoting and row interchanges i
tation matrix, l is unit lower triangular, and u is upper triangular.

PSGESVX()

if equed is not 'n', the matrix
a(ia:ia+n-1,ja:ja+n-1) has been equilibrated with
a(ia:ia+n-1,ja:ja+n-1), af(iaf:iaf+n-1,jaf:jaf+n-1),

PSGETF2()

distributed matrix sub( a ) = a(ia:ia+m-1,ja:ja+n-1) using
partial pivoting with row interchanges
the factorization has the form sub( a ) = p * l * u, where p is a

PSGETRF()

psgetrf computes an lu factorization of a general m-by-n distributed
matrix sub( a ) = (ia:ia+m-1,ja:ja+n-1) using partial pivoting with

PSGETRI()

keeps track of the pivoting information. ipiv(i) is the
global row index the local row i was swapped with.  thi

PSGETRS()

with a general n-by-n distributed matrix sub( a ) using the l
sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1), op( a ) = a or a**t and

PSGGQRF()

upper trapezoidal matrix r (r is upper triangular if n >= m);
the elements below the diagonal, with the array taua
elementary reflectors (see further details).

PSGGRQF()

and above the (m-n)-th subdiagonal contain the m by n upper
trapezoidal matrix r; the remaining elements, with the arra
elementary reflectors (see further details).

PSLABAD()

the log of large is sufficiently large.  this subroutine is intended
to identify machines with a large exponent range, such as the crays
of the values computed by pslamch.  this subroutine is needed because

PSLABRD()

if m >= n, elements on and below the diagonal in the first nb
columns, with the array tauq, represent the orthogona
elements above the diagonal in the first nb rows, with the

PSLACON()

reverse communication is used for evaluating matrix-vector products.
x and v are aligned with the distributed matrix a, this informatio

PSLAEBZ()

specifies the computation done by pslaebz
= 0 : find an interval with desired values of n(w) at th
= 1 : find a floating point number contained in the initial

PSLAED0()

work    (local workspace ) real array, dimension (lwork)
lwork = 6*n + 2*np*nq, with
nq = numroc( n, nb_q, mycol, iqcol, npcol )

PSLAED1()

where z = q'u, u is a vector of length n with ones in th

PSLAED2()

on entry, q contains the eigenvectors of two submatrices in
the two square blocks with corners at (1,1), (n1,n1
on exit, q contains the trailing (n-k) updated eigenvectors

PSLAED3()

this code makes very mild assumptions about floating point
arithmetic. it will work on machines with a guard digit i
which subtract like the cray x-mp, cray y-mp, cray c-90, or cray-2.

PSLAEVSWP()

the eigenvectors on output.  the eigenvectors are distributed
in a block cyclic manner in both dimensions, with

PSLAHQR()

ihi to ilo in steps of our schur block size (<=2*iblk). each
iteration of the loop works  with the active submatrix in row
converged. either l = ilo or the global a(l,l-1) is negligible

PSLAHRD()

the k-th subdiagonal in the first nb columns are overwritten
with the corresponding elements of the reduced distribute
array tau, represent the matrix q as a product of elementary

PSLAMR1D()

to 1.  indeed, i suspect that ib should always be set to 1 or ignored
with 1 used in its place
pslamr1d has not been tested except withint the contect of

PSLANGE()

( max(abs(a(i,j))),  norm = 'm' or 'm' with ia <= i <= ia+m-1
(

PSLAPIV()

pivoting. the pivot vector may be distributed across a process row
or a column. the pivot vector should be aligned with the distribute
for example if the row pivots should be applied to the columns of

PSLAPV2()

a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting.  the
pivot vector should be aligned with the distributed matrix a.  fo
process column and replicated over all process rows.  similarly,

PSLAQGE()

r       (local input) real array, dimension locr(m_a)
the row scale factors for sub( a ). r is aligned with th
column. r is tied to the distributed matrix a.

PSLAQSY()

the scale factors for a(ia:ia+m-1,ja:ja+n-1). sr is aligned
with the distributed matrix a, and replicated across ever

PSLARFB()

where lcmq = lcm / npcol with lcm = iclm( nprow, npcol )
iroffv = mod( iv-1, mb_v ), icoffv = mod( jv-1, nb_v ),

PSLARFG()

before entry, the incremented array sub( x ) must contain
the vector x. on exit, it is overwritten with the vector v
ix      (global input) integer

PSLARFT()

the k-by-k triangular factor of the block reflector asso-
ciated with v. if direct = 'f', t is upper triangular

PSLARZB()

where lcmq = lcm / npcol with lcm = iclm( nprow, npcol )
iroffv = mod( iv-1, mb_v ), icoffv = mod( jv-1, nb_v ),

PSLARZT()

it contains the k-by-k triangular factor of the block
reflector associated with v. t is lower triangular
work    (local workspace) real array,

PSLASSQ()

on entry, the value  scale  in the equation above.
on exit, scale is overwritten with  scl , the scaling facto

PSLATRD()

on exit, if uplo = 'u', the last nb columns have been reduced
to tridiagonal form, with the diagonal elements overwritin
diagonal with the array tau, represent the orthogonal matrix

PSLATRZ()

gular matrix r, and elements n-l+1 to n of the first m rows
of sub( a ), with the array tau, represent the orthogona

PSLAUU2()

on exit, if uplo = 'u', the upper triangle of the distributed
matrix sub( a ) is overwritten with the upper triangle of th
is overwritten with the lower triangle of the product l' * l.

PSLAUUM()

on exit, if uplo = 'u', the upper triangle of the distributed
matrix sub( a ) is overwritten with the upper triangle of th
is overwritten with the lower triangle of the product l' * l.

PSORG2L()

psorg2l generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a

PSORG2R()

psorg2r generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a
m

PSORGL2()

psorgl2 generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined a

PSORGLQ()

psorglq generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined a

PSORGQL()

psorgql generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a

PSORGQR()

psorgqr generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a
m

PSORGR2()

psorgr2 generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as th

PSORGRQ()

psorgrq generates an m-by-n real distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as th

PSORM2L()

psorm2l overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'            side = 'r'

PSORM2R()

psorm2r overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'            side = 'r'

PSORMBR()

if vect = 'q', psormbr overwrites the general real distributed m-by-n
matrix sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PSORMHR()

psormhr overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PSORML2()

psorml2 overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'          side = 'r'

PSORMLQ()

psormlq overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'          side = 'r'

PSORMQL()

psormql overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'            side = 'r'

PSORMQR()

psormqr overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'            side = 'r'

PSORMR2()

psormr2 overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'          side = 'r'

PSORMR3()

psormr3 overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'          side = 'r'

PSORMRQ()

psormrq overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PSORMRZ()

psormrz overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PSORMTR()

psormtr overwrites the general real m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PSPBSV()

banded symmetric positive definite distributed
matrix with bandwidth bw
cholesky factorization is used to factor a reordering of

PSPBTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSPBTRS()

banded symmetric positive definite distributed
matrix with bandwidth bw
a(1:n, ja:ja+n-1) = u'*u or l*l' as computed by pspbtrf.

PSPBTRSV()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSPOEQU()

sub( a ) = a(ia:ia+n-1,ja:ja+n-1) and reduce its condition number
(with respect to the two-norm).  sr and sc contain the scal
buted matrix b with elements b(i,j) = s(i)*a(i,j)*s(j) has ones on

PSPOSV()

ted matrix sub( b ). on exit, if info = 0, sub( b ) is over-
written with the solution distributed matrix x
ib      (global input) integer

PSPOSVX()

if equed = 'y', the matrix a has been equilibrated
with scaling factors given by s.  a and af will no
= 'n':  the matrix a will be copied to af and factored.

PSPTSV()

matrix. globally, du(n) is not referenced, and du must be
aligned with d
factors of the matrix.

PSPTTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSPTTRS()

matrix. globally, du(n) is not referenced, and du must be
aligned with d
factors of the matrix.

PSPTTRSV()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PSSTEBZ()

split-off block (see iblock, isplit) and
ordered from smallest to largest withi
= 'e': ("entire matrix")

PSSTEDC()

this code makes very mild assumptions about floating point
arithmetic. it will work on machines with a guard digit i
which subtract like the cray x-mp, cray y-mp, cray c-90, or cray-2.

PSSTEIN()

eigenvalues should be grouped by split-off block and ordered
from smallest to largest within the block (the output arra
array should be replicated on all processes.

PSSYEV()

the different processes.  because of this, it is possible that a
heterogeneous system may return incorrect results without any erro

PSSYEVD()

the different processes.  because of this, it is possible that a
heterogeneous system may return incorrect results without any erro

PSSYEVX()

set to twice the underflow threshold 2*pslamch('s') not zero.
if this routine returns with ((mod(info,2).ne.0) .or
eigenvectors did not converge, try setting abstol to

PSSYGVX()

set to twice the underflow threshold 2*pslamch('s') not zero.
if this routine returns with ((mod(info,2).ne.0) .or
eigenvectors did not converge, try setting abstol to

PSSYNTRD()

matrix t, and the elements above the first superdiagonal,
with the array tau, represent the orthogonal matrix q as 
and first subdiagonal of sub( a ) are overwritten by the

PSSYTD2()

matrix t, and the elements above the first superdiagonal,
with the array tau, represent the orthogonal matrix q as 
and first subdiagonal of sub( a ) are overwritten by the

PSSYTRD()

matrix t, and the elements above the first superdiagonal,
with the array tau, represent the orthogonal matrix q as 
and first subdiagonal of sub( a ) are overwritten by the

PSSYTTRD()

matrix t, and the elements above the first superdiagonal,
with the array tau, represent the unitary matrix q as 
and first subdiagonal of sub( a ) are overwritten by the

PSTRRFS()

pstrrfs provides error bounds and backward error estimates for the
solution to a system of linear equations with a triangula

PSTZRZF()

gular matrix r, and elements m+1 to n of the first m rows of
sub( a ), with the array tau, represent the orthogonal matri

PZDBSV()

banded diagonally dominant-like distributed
matrix with bandwidth bwl, bwu
gaussian elimination without pivoting

PZDBTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZDBTRS()

banded diagonally dominant-like distributed
matrix with bandwidth bwl, bwu
routine pzdbtrf must be called first.

PZDBTRSV()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZDTSV()

gaussian elimination without pivotin
of the matrix into l u.

PZDTTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZDTTRS()

trans   (global input) character
= 'n':  solve with a(1:n, ja:ja+n-1)

PZDTTRSV()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZGBSV()

banded distributed
matrix with bandwidth bwl, bwu
gaussian elimination with pivoting

PZGBTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZGBTRS()

banded distributed
matrix with bandwidth bwl, bwu
routine pzgbtrf must be called first.

PZGEBD2()

the diagonal and the first superdiagonal of sub( a ) are
overwritten with the upper bidiagonal matrix b; the element
unitary matrix q as a product of elementary reflectors, and

PZGEBRD()

the diagonal and the first superdiagonal of sub( a ) are
overwritten with the upper bidiagonal matrix b; the element
unitary matrix q as a product of elementary reflectors, and

PZGEEQU()

the column scale factors, chosen to try to make the largest entry in
each row and column of the distributed matrix b with element

PZGEHD2()

the upper triangle and the first subdiagonal of sub( a ) are
overwritten with the upper hessenberg matrix h, and the ele
sent the unitary matrix q as a product of elementary

PZGEHRD()

the upper triangle and the first subdiagonal of sub( a ) are
overwritten with the upper hessenberg matrix h, and the ele
sent the unitary matrix q as a product of elementary

PZGELQ2()

lower trapezoidal matrix l (l is lower triangular if m <= n);
the elements above the diagonal, with the array tau, repre
reflectors (see further details).

PZGELQF()

lower trapezoidal matrix l (l is lower triangular if m <= n);
the elements above the diagonal, with the array tau, repre
reflectors (see further details).

PZGELS()

where lcmp = lcm / nprow with lcm = ilcm( nprow, npcol )
iroffa = mod( ia-1, mb_a ), icoffa = mod( ja-1, nb_a ),

PZGEQL2()

the (n-m)-th superdiagonal contain the m by n lower
trapezoidal matrix l; the remaining elements, with th
elementary reflectors (see further details).

PZGEQLF()

the (n-m)-th superdiagonal contain the m by n lower
trapezoidal matrix l; the remaining elements, with th
elementary reflectors (see further details).

PZGEQPF()

pzgeqpf computes a qr factorization with column pivoting of

PZGEQR2()

upper trapezoidal matrix r (r is upper triangular if m >= n);
the elements below the diagonal, with the array tau
reflectors (see further details).

PZGEQRF()

upper trapezoidal matrix r (r is upper triangular if m >= n);
the elements below the diagonal, with the array tau
reflectors (see further details).

PZGERFS()

by pzgetrf. ipiv(i) -> the global row local row i
was swapped with. this array is tied to the distribute

PZGERQ2()

and above the (m-n)-th subdiagonal contain the m by n upper
trapezoidal matrix r; the remaining elements, with the arra
elementary reflectors (see further details).

PZGERQF()

and above the (m-n)-th subdiagonal contain the m by n upper
trapezoidal matrix r; the remaining elements, with the arra
elementary reflectors (see further details).

PZGESV()

the lu decomposition with partial pivoting and row interchanges i
tation matrix, l is unit lower triangular, and u is upper triangular.

PZGESVX()

if equed is not 'n', the matrix
a(ia:ia+n-1,ja:ja+n-1) has been equilibrated with
a(ia:ia+n-1,ja:ja+n-1), af(iaf:iaf+n-1,jaf:jaf+n-1),

PZGETF2()

distributed matrix sub( a ) = a(ia:ia+m-1,ja:ja+n-1) using
partial pivoting with row interchanges
the factorization has the form sub( a ) = p * l * u, where p is a

PZGETRF()

pzgetrf computes an lu factorization of a general m-by-n distributed
matrix sub( a ) = (ia:ia+m-1,ja:ja+n-1) using partial pivoting with

PZGETRI()

keeps track of the pivoting information. ipiv(i) is the
global row index the local row i was swapped with.  thi

PZGETRS()

with a general n-by-n distributed matrix sub( a ) using the l
sub( a ) denotes a(ia:ia+n-1,ja:ja+n-1), op( a ) = a, a**t or a**h

PZGGQRF()

upper trapezoidal matrix r (r is upper triangular if n >= m);
the elements below the diagonal, with the array taua
elementary reflectors (see further details).

PZGGRQF()

and above the (m-n)-th subdiagonal contain the m by n upper
trapezoidal matrix r; the remaining elements, with the arra
elementary reflectors (see further details).

PZHEEV()

different processes.  because of this, it is possible that a
heterogeneous system may return incorrect results without any erro

PZHEEVD()

lwork = n + ( np0 + mq0 + nb ) * nb,
with  np0 = numroc( max( n, nb, 2 ), nb, 0, 0, nprow

PZHEEVX()

set to twice the underflow threshold 2*pdlamch('s') not zero.
if this routine returns with ((mod(info,2).ne.0) .or
eigenvectors did not converge, try setting abstol to

PZHEGVX()

set to twice the underflow threshold 2*pdlamch('s') not zero.
if this routine returns with ((mod(info,2).ne.0) .or
eigenvectors did not converge, try setting abstol to

PZHENTRD()

matrix t, and the elements above the first superdiagonal,
with the array tau, represent the unitary matrix q as 
and first subdiagonal of sub( a ) are overwritten by the

PZHETD2()

matrix t, and the elements above the first superdiagonal,
with the array tau, represent the unitary matrix q as 
and first subdiagonal of sub( a ) are overwritten by the

PZHETRD()

matrix t, and the elements above the first superdiagonal,
with the array tau, represent the unitary matrix q as 
and first subdiagonal of sub( a ) are overwritten by the

PZHETTRD()

matrix t, and the elements above the first superdiagonal,
with the array tau, represent the unitary matrix q as 
and first subdiagonal of sub( a ) are overwritten by the

PZLABRD()

if m >= n, elements on and below the diagonal in the first nb
columns, with the array tauq, represent the unitar
elements above the diagonal in the first nb rows, with the

PZLACON()

a. reverse communication is used for evaluating matrix-vector
products. x and v are aligned with the distributed matrix a, thi

PZLAEVSWP()

the eigenvectors on output.  the eigenvectors are distributed
in a block cyclic manner in both dimensions, with

PZLAHQR()

ihi to ilo in steps of our schur block size (<=2*iblk). each
iteration of the loop works  with the active submatrix in row
converged. either l = ilo or the global a(l,l-1) is negligible

PZLAHRD()

the k-th subdiagonal in the first nb columns are overwritten
with the corresponding elements of the reduced distribute
array tau, represent the matrix q as a product of elementary

PZLAMR1D()

to 1.  indeed, i suspect that ib should always be set to 1 or ignored
with 1 used in its place
pzlamr1d has not been tested except withint the contect of

PZLANGE()

( max(abs(a(i,j))),  norm = 'm' or 'm' with ia <= i <= ia+m-1
(

PZLAPIV()

pivoting. the pivot vector may be distributed across a process row
or a column. the pivot vector should be aligned with the distribute
for example if the row pivots should be applied to the columns of

PZLAPV2()

a(ia:ia+m-1,ja:ja+n-1), resulting in row or column pivoting.  the
pivot vector should be aligned with the distributed matrix a.  fo
process column and replicated over all process rows.  similarly,

PZLAQGE()

r       (local input) double precision array, dimension locr(m_a)
the row scale factors for sub( a ). r is aligned with th
column. r is tied to the distributed matrix a.

PZLAQSY()

the scale factors for a(ia:ia+m-1,ja:ja+n-1). sr is aligned
with the distributed matrix a, and replicated across ever

PZLARFB()

where lcmq = lcm / npcol with lcm = iclm( nprow, npcol )
iroffv = mod( iv-1, mb_v ), icoffv = mod( jv-1, nb_v ),

PZLARFG()

before entry, the incremented array sub( x ) must contain
the vector x. on exit, it is overwritten with the vector v
ix      (global input) integer

PZLARFT()

the k-by-k triangular factor of the block reflector asso-
ciated with v. if direct = 'f', t is upper triangular

PZLARZB()

where lcmq = lcm / npcol with lcm = iclm( nprow, npcol )
iroffv = mod( iv-1, mb_v ), icoffv = mod( jv-1, nb_v ),

PZLARZT()

it contains the k-by-k triangular factor of the block
reflector associated with v. t is lower triangular
work    (local workspace) complex*16 array,

PZLASSQ()

on entry, the value  scale  in the equation above.
on exit, scale is overwritten with  scl , the scaling facto

PZLATRD()

on exit, if uplo = 'u', the last nb columns have been reduced
to tridiagonal form, with the diagonal elements overwritin
diagonal with the array tau, represent the unitary matrix q

PZLATRZ()

gular matrix r, and elements n-l+1 to n of the first m rows
of sub( a ), with the array tau, represent the unitary matri

PZLAUU2()

on exit, if uplo = 'u', the upper triangle of the distributed
matrix sub( a ) is overwritten with the upper triangle of th
is overwritten with the lower triangle of the product l' * l.

PZLAUUM()

on exit, if uplo = 'u', the upper triangle of the distributed
matrix sub( a ) is overwritten with the upper triangle of th
is overwritten with the lower triangle of the product l' * l.

PZMAX1()

when the result of a vector-oriented pblas call is a scalar, it will
be made available only within the scope which owns the vector(s
then, the processes which receive the answer will be (note that if

PZPBSV()

banded symmetric positive definite distributed
matrix with bandwidth bw
cholesky factorization is used to factor a reordering of

PZPBTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZPBTRS()

banded symmetric positive definite distributed
matrix with bandwidth bw
a(1:n, ja:ja+n-1) = u'*u or l*l' as computed by pzpbtrf.

PZPBTRSV()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZPOEQU()

sub( a ) = a(ia:ia+n-1,ja:ja+n-1) and reduce its condition number
(with respect to the two-norm).  sr and sc contain the scal
buted matrix b with elements b(i,j) = s(i)*a(i,j)*s(j) has ones on

PZPOSV()

ted matrix sub( b ). on exit, if info = 0, sub( b ) is over-
written with the solution distributed matrix x
ib      (global input) integer

PZPOSVX()

if equed = 'y', the matrix a has been equilibrated
with scaling factors given by s.  a and af will no
= 'n':  the matrix a will be copied to af and factored.

PZPTSV()

matrix. globally, du(n) is not referenced, and du must be
aligned with d
factors of the matrix.

PZPTTRF()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZPTTRS()

matrix. globally, du(n) is not referenced, and du must be
aligned with d
factors of the matrix.

PZPTTRSV()

want to find errors with min( ), so if no error, set it to a bi
descriptor multiplier.

PZSTEIN()

eigenvalues should be grouped by split-off block and ordered
from smallest to largest within the block (the output arra
array should be replicated on all processes.

PZTRRFS()

pztrrfs provides error bounds and backward error estimates for the
solution to a system of linear equations with a triangula

PZTZRZF()

gular matrix r, and elements m+1 to n of the first m rows of
sub( a ), with the array tau, represent the unitary matrix

PZUNG2L()

pzung2l generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a

PZUNG2R()

pzung2r generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a
m

PZUNGL2()

pzungl2 generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined a

PZUNGLQ()

pzunglq generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined a

PZUNGQL()

pzungql generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a

PZUNGQR()

pzungqr generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal columns, which is defined a
m

PZUNGR2()

pzungr2 generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as th

PZUNGRQ()

pzungrq generates an m-by-n complex distributed matrix q denoting
a(ia:ia+m-1,ja:ja+n-1) with orthonormal rows, which is defined as th

PZUNM2L()

pzunm2l overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PZUNM2R()

pzunm2r overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'            side = 'r'

PZUNMBR()

if vect = 'q', pzunmbr overwrites the general complex distributed
m-by-n matrix sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PZUNMHR()

pzunmhr overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PZUNML2()

pzunml2 overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'          side = 'r'

PZUNMLQ()

pzunmlq overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'          side = 'r'

PZUNMQL()

pzunmql overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PZUNMQR()

pzunmqr overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'            side = 'r'

PZUNMR2()

pzunmr2 overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'          side = 'r'

PZUNMR3()

pzunmr3 overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'          side = 'r'

PZUNMRQ()

pzunmrq overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PZUNMRZ()

pzunmrz overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

PZUNMTR()

pzunmtr overwrites the general complex m-by-n distributed matrix
sub( c ) = c(ic:ic+m-1,jc:jc+n-1) with
side = 'l'           side = 'r'

SDBTF2()

sdbtrf computes an lu factorization of a real m-by-n band matrix a
without using partial pivoting with row interchanges
this is the unblocked version of the algorithm, calling level 2 blas.

SDTTRF()

sdttrf computes an lu factorization of a complex tridiagonal matrix a
using elimination without partial pivoting
the factorization has the form

SDTTRSV()

u * x = b,  u**t * x = b,  or  u**h * x = b,
with factors of the tridiagonal matrix a from the lu factorizatio

SSTEIN2()

compute lu factors with partial pivoting  ( pt = lu

STRMVT()

t      - real array of dimension ( ldt, n ).
before entry with  uplo = 'u' or 'u', the leading n by 
triangular matrix and the strictly lower triangular part of

ZDBTF2()

zdbtrf computes an lu factorization of a real m-by-n band matrix a
without using partial pivoting with row interchanges
this is the unblocked version of the algorithm, calling level 2 blas.

ZDTTRF()

zdttrf computes an lu factorization of a complex tridiagonal matrix a
using elimination without partial pivoting
the factorization has the form

ZDTTRSV()

u * x = b,  u**t * x = b,  or  u**h * x = b,
with factors of the tridiagonal matrix a from the lu factorizatio

ZLAHQR2()

ihi to ilo in steps of 1 or 2. each iteration of the loop works
with the active submatrix in rows and columns l to i
h(l,l-1) is negligible so that the matrix splits.

ZTRMVT()

t      - complex*16 array of dimension ( ldt, n ).
before entry with  uplo = 'u' or 'u', the leading n by 
triangular matrix and the strictly lower triangular part of

within

CDBTF2()

kl      (input) integer
the number of subdiagonals within the band of a.  kl >= 0
ku      (input) integer

DDBTF2()

kl      (input) integer
the number of subdiagonals within the band of a.  kl >= 0
ku      (input) integer

PCHEEVX()

specifies which eigenvectors should be reorthogonalized.
eigenvectors that correspond to eigenvalues which are within
however, if the workspace is insufficient (see lwork),

PCHEGVX()

specifies which eigenvectors should be reorthogonalized.
eigenvectors that correspond to eigenvalues which are within
however, if the workspace is insufficient (see lwork),

PCHETTRD()

details:  the distinction between lii and ltli (and between
liip1 and ltlip1) is subtle.  within the current processo
on some processors, a( lii, lij ) points to an element

PCLACON()

products. x and v are aligned with the distributed matrix a, this
information is implicitly contained within iv, ix, descv, and descx
notes

PCLARF()

perform the local computation within a process colum

PCLARFC()

perform the local computation within a process colum

PCLARZ()

perform the local computation within a process colum

PCLARZC()

perform the local computation within a process colum

PCMAX1()

when the result of a vector-oriented pblas call is a scalar, it will
be made available only within the scope which owns the vector(s
then, the processes which receive the answer will be (note that if

PCPOEQU()

the  diagonal.  this choice of sr and sc puts the condition number
of b within a factor n of the smallest possible condition numbe

PCSTEIN()

eigenvalues should be grouped by split-off block and ordered
from smallest to largest within the block (the output arra
array should be replicated on all processes.

PDLACON()

x and v are aligned with the distributed matrix a, this information
is implicitly contained within iv, ix, descv, and descx
notes

PDLAHQR()

all rotn row transforms are all complete
through some column tmp.  (loops within 190
are then applied in a block fashion.

PDLARF()

perform the local computation within a process colum

PDLARZ()

perform the local computation within a process colum

PDPOEQU()

the  diagonal.  this choice of sr and sc puts the condition number
of b within a factor n of the smallest possible condition numbe

PDSTEBZ()

split-off block (see iblock, isplit) and
ordered from smallest to largest within
= 'e': ("entire matrix")

PDSTEIN()

eigenvalues should be grouped by split-off block and ordered
from smallest to largest within the block (the output arra
array should be replicated on all processes.

PDSYEVX()

specifies which eigenvectors should be reorthogonalized.
eigenvectors that correspond to eigenvalues which are within
however, if the workspace is insufficient (see lwork),

PDSYGVX()

specifies which eigenvectors should be reorthogonalized.
eigenvectors that correspond to eigenvalues which are within
however, if the workspace is insufficient (see lwork),

PDSYTTRD()

details:  the distinction between lii and ltli (and between
liip1 and ltlip1) is subtle.  within the current processo
on some processors, a( lii, lij ) points to an element

PDZSUM1()

when the result of a vector-oriented pblas call is a scalar, it will
be made available only within the scope which owns the vector(s
then, the processes which receive the answer will be (note that if

PSCSUM1()

when the result of a vector-oriented pblas call is a scalar, it will
be made available only within the scope which owns the vector(s
then, the processes which receive the answer will be (note that if

PSLACON()

x and v are aligned with the distributed matrix a, this information
is implicitly contained within iv, ix, descv, and descx
notes

PSLAHQR()

all rotn row transforms are all complete
through some column tmp.  (loops within 190
are then applied in a block fashion.

PSLARF()

perform the local computation within a process colum

PSLARZ()

perform the local computation within a process colum

PSPOEQU()

the  diagonal.  this choice of sr and sc puts the condition number
of b within a factor n of the smallest possible condition numbe

PSSTEBZ()

split-off block (see iblock, isplit) and
ordered from smallest to largest within
= 'e': ("entire matrix")

PSSTEIN()

eigenvalues should be grouped by split-off block and ordered
from smallest to largest within the block (the output arra
array should be replicated on all processes.

PSSYEVX()

specifies which eigenvectors should be reorthogonalized.
eigenvectors that correspond to eigenvalues which are within
however, if the workspace is insufficient (see lwork),

PSSYGVX()

specifies which eigenvectors should be reorthogonalized.
eigenvectors that correspond to eigenvalues which are within
however, if the workspace is insufficient (see lwork),

PSSYTTRD()

details:  the distinction between lii and ltli (and between
liip1 and ltlip1) is subtle.  within the current processo
on some processors, a( lii, lij ) points to an element

PZHEEVX()

specifies which eigenvectors should be reorthogonalized.
eigenvectors that correspond to eigenvalues which are within
however, if the workspace is insufficient (see lwork),

PZHEGVX()

specifies which eigenvectors should be reorthogonalized.
eigenvectors that correspond to eigenvalues which are within
however, if the workspace is insufficient (see lwork),

PZHETTRD()

details:  the distinction between lii and ltli (and between
liip1 and ltlip1) is subtle.  within the current processo
on some processors, a( lii, lij ) points to an element

PZLACON()

products. x and v are aligned with the distributed matrix a, this
information is implicitly contained within iv, ix, descv, and descx
notes

PZLARF()

perform the local computation within a process colum

PZLARFC()

perform the local computation within a process colum

PZLARZ()

perform the local computation within a process colum

PZLARZC()

perform the local computation within a process colum

PZMAX1()

when the result of a vector-oriented pblas call is a scalar, it will
be made available only within the scope which owns the vector(s
then, the processes which receive the answer will be (note that if

PZPOEQU()

the  diagonal.  this choice of sr and sc puts the condition number
of b within a factor n of the smallest possible condition numbe

PZSTEIN()

eigenvalues should be grouped by split-off block and ordered
from smallest to largest within the block (the output arra
array should be replicated on all processes.

SDBTF2()

kl      (input) integer
the number of subdiagonals within the band of a.  kl >= 0
ku      (input) integer

ZDBTF2()

kl      (input) integer
the number of subdiagonals within the band of a.  kl >= 0
ku      (input) integer

withint

PCLAMR1D()

pclamr1d has not been tested except withint the contect o

PDLAMR1D()

pdlamr1d has not been tested except withint the contect o

PSLAMR1D()

pslamr1d has not been tested except withint the contect o

PZLAMR1D()

pzlamr1d has not been tested except withint the contect o

without

CDBTF2()

cdbtrf computes an lu factorization of a real m-by-n band matrix a
without using partial pivoting with row interchanges
this is the unblocked version of the algorithm, calling level 2 blas.

CDTTRF()

cdttrf computes an lu factorization of a complex tridiagonal matrix a
using elimination without partial pivoting
the factorization has the form

DDBTF2()

ddbtrf computes an lu factorization of a real m-by-n band matrix a
without using partial pivoting with row interchanges
this is the unblocked version of the algorithm, calling level 2 blas.

DDTTRF()

ddttrf computes an lu factorization of a complex tridiagonal matrix a
using elimination without partial pivoting
the factorization has the form

PCDBSV()

gaussian elimination without pivotin
of the matrix into l u.

PCDTSV()

gaussian elimination without pivotin
of the matrix into l u.

PCDTTRS()

dtype_a = 501 or 502 can be used interchangeably
without any other change
tridiagonal matrix be aligned with each other. because of this, a

PCHEEV()

different processes.  because of this, it is possible that a
heterogeneous system may return incorrect results without any erro

PCHEEVX()

and sufficient workspace to compute them.  (see lwork below.)
pcheevx is always able to detect insufficient space without

PCHEGVX()

and sufficient workspace to compute them.  (see lwork below.)
pchegvx is always able to detect insufficient space without

PCLAHQR()

subdiagonal elements, we need to see how many bulges we
can send through without breaking the consecutive smal

PCLASCL()

denoting a(ia:ia+m-1,ja:ja+n-1) by the real scalar cto/cfrom.  this
is done without over/underflow as long as the final resul
sub( a ) may be full, upper triangular, lower triangular or upper

PCPTSV()

dtype_a = 501 or 502 can be used interchangeably
without any other change
tridiagonal matrix be aligned with each other. because of this, a

PCPTTRF()

since there is no element-by-element vector multiplication in
the blas, this loop must be hardwired in without a blas cal

PCPTTRS()

dtype_a = 501 or 502 can be used interchangeably
without any other change
tridiagonal matrix be aligned with each other. because of this, a

PCSRSCL()

pcsrscl multiplies an n-element complex distributed vector
sub( x ) by the real scalar 1/a. this is done without overflow o
underflow.

PDDBSV()

gaussian elimination without pivotin
of the matrix into l u.

PDDTSV()

gaussian elimination without pivotin
of the matrix into l u.

PDDTTRS()

dtype_a = 501 or 502 can be used interchangeably
without any other change
tridiagonal matrix be aligned with each other. because of this, a

PDLAEBZ()

safe_min is at least the smallest number that can divide 1.0
without overflow
sequence loop.

PDLAED3()

arithmetic. it will work on machines with a guard digit in
add/subtract, or on those binary machines without guard digit
it could conceivably fail on hexadecimal or decimal machines

PDLAPDCT()

safe_min is at least the smallest number that can divide 1.0
without overflow
count   (output) integer

PDLASCL()

denoting a(ia:ia+m-1,ja:ja+n-1) by the real scalar cto/cfrom.  this
is done without over/underflow as long as the final resul
sub( a ) may be full, upper triangular, lower triangular or upper

PDPTSV()

dtype_a = 501 or 502 can be used interchangeably
without any other change
tridiagonal matrix be aligned with each other. because of this, a

PDPTTRF()

since there is no element-by-element vector multiplication in
the blas, this loop must be hardwired in without a blas cal

PDPTTRS()

dtype_a = 501 or 502 can be used interchangeably
without any other change
tridiagonal matrix be aligned with each other. because of this, a

PDRSCL()

pdrscl multiplies an n-element real distributed vector sub( x ) by
the real scalar 1/a. this is done without overflow or underflow a

PDSTEDC()

arithmetic. it will work on machines with a guard digit in
add/subtract, or on those binary machines without guard digit
it could conceivably fail on hexadecimal or decimal machines

PDSYEV()

the different processes.  because of this, it is possible that a
heterogeneous system may return incorrect results without any erro

PDSYEVD()

the different processes.  because of this, it is possible that a
heterogeneous system may return incorrect results without any erro

PDSYEVX()

and sufficient workspace to compute them.  (see lwork below.)
pdsyevx is always able to detect insufficient space without

PDSYGVX()

and sufficient workspace to compute them.  (see lwork below.)
pdsygvx is always able to detect insufficient space without

PSDBSV()

gaussian elimination without pivotin
of the matrix into l u.

PSDTSV()

gaussian elimination without pivotin
of the matrix into l u.

PSDTTRS()

dtype_a = 501 or 502 can be used interchangeably
without any other change
tridiagonal matrix be aligned with each other. because of this, a

PSLAEBZ()

safe_min is at least the smallest number that can divide 1.0
without overflow
sequence loop.

PSLAED3()

arithmetic. it will work on machines with a guard digit in
add/subtract, or on those binary machines without guard digit
it could conceivably fail on hexadecimal or decimal machines

PSLAPDCT()

safe_min is at least the smallest number that can divide 1.0
without overflow
count   (output) integer

PSLASCL()

denoting a(ia:ia+m-1,ja:ja+n-1) by the real scalar cto/cfrom.  this
is done without over/underflow as long as the final resul
sub( a ) may be full, upper triangular, lower triangular or upper

PSPTSV()

dtype_a = 501 or 502 can be used interchangeably
without any other change
tridiagonal matrix be aligned with each other. because of this, a

PSPTTRF()

since there is no element-by-element vector multiplication in
the blas, this loop must be hardwired in without a blas cal

PSPTTRS()

dtype_a = 501 or 502 can be used interchangeably
without any other change
tridiagonal matrix be aligned with each other. because of this, a

PSRSCL()

psrscl multiplies an n-element real distributed vector sub( x ) by
the real scalar 1/a. this is done without overflow or underflow a

PSSTEDC()

arithmetic. it will work on machines with a guard digit in
add/subtract, or on those binary machines without guard digit
it could conceivably fail on hexadecimal or decimal machines

PSSYEV()

the different processes.  because of this, it is possible that a
heterogeneous system may return incorrect results without any erro

PSSYEVD()

the different processes.  because of this, it is possible that a
heterogeneous system may return incorrect results without any erro

PSSYEVX()

and sufficient workspace to compute them.  (see lwork below.)
pssyevx is always able to detect insufficient space without

PSSYGVX()

and sufficient workspace to compute them.  (see lwork below.)
pssygvx is always able to detect insufficient space without

PZDBSV()

gaussian elimination without pivotin
of the matrix into l u.

PZDRSCL()

pzdrscl multiplies an n-element complex distributed vector
sub( x ) by the real scalar 1/a. this is done without overflow o
underflow.

PZDTSV()

gaussian elimination without pivotin
of the matrix into l u.

PZDTTRS()

dtype_a = 501 or 502 can be used interchangeably
without any other change
tridiagonal matrix be aligned with each other. because of this, a

PZHEEV()

different processes.  because of this, it is possible that a
heterogeneous system may return incorrect results without any erro

PZHEEVX()

and sufficient workspace to compute them.  (see lwork below.)
pzheevx is always able to detect insufficient space without

PZHEGVX()

and sufficient workspace to compute them.  (see lwork below.)
pzhegvx is always able to detect insufficient space without

PZLAHQR()

subdiagonal elements, we need to see how many bulges we
can send through without breaking the consecutive smal

PZLASCL()

denoting a(ia:ia+m-1,ja:ja+n-1) by the real scalar cto/cfrom.  this
is done without over/underflow as long as the final resul
sub( a ) may be full, upper triangular, lower triangular or upper

PZPTSV()

dtype_a = 501 or 502 can be used interchangeably
without any other change
tridiagonal matrix be aligned with each other. because of this, a

PZPTTRF()

since there is no element-by-element vector multiplication in
the blas, this loop must be hardwired in without a blas cal

PZPTTRS()

dtype_a = 501 or 502 can be used interchangeably
without any other change
tridiagonal matrix be aligned with each other. because of this, a

SDBTF2()

sdbtrf computes an lu factorization of a real m-by-n band matrix a
without using partial pivoting with row interchanges
this is the unblocked version of the algorithm, calling level 2 blas.

SDTTRF()

sdttrf computes an lu factorization of a complex tridiagonal matrix a
using elimination without partial pivoting
the factorization has the form

ZDBTF2()

zdbtrf computes an lu factorization of a real m-by-n band matrix a
without using partial pivoting with row interchanges
this is the unblocked version of the algorithm, calling level 2 blas.

ZDTTRF()

zdttrf computes an lu factorization of a complex tridiagonal matrix a
using elimination without partial pivoting
the factorization has the form

won

DSTEIN2()

copy the matrix t so it won't be destroyed in factorization

SSTEIN2()

copy the matrix t so it won't be destroyed in factorization

words

PCDBSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PCDBTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PCDTSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PCDTTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PCGBSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PCGBTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PCPBSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PCPBTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PCPTSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PCPTTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PDDBSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PDDBTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PDDTSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PDDTTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PDGBSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PDGBTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PDPBSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PDPBTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PDPTSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PDPTTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PDSTEBZ()

since the user cannot know a priori what value nsplit will
have, n words must be reserved for isplit.
work    (local workspace) double precision array,

PSDBSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PSDBTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PSDTSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PSDTTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PSGBSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PSGBTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PSPBSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PSPBTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PSPTSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PSPTTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PSSTEBZ()

since the user cannot know a priori what value nsplit will
have, n words must be reserved for isplit.
work    (local workspace) real array, dimension ( max( 5*n, 7 ) )

PZDBSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PZDBTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PZDTSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PZDTTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PZGBSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PZGBTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PZPBSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PZPBTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PZPTSV()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

PZPTTRS()

of the divide and conquer algorithm as a task-parallel algorithm.
this formula in words is: no processor may have more than on

work

CDBTRF()

the block size must not exceed the limit set by the size of the
local arrays work13 and work31

CLAREF()

if .true., then apply any column reflections to z as well.
if .false., then do no additional work on z
z       (global input/output) complex array, (ldz,*)

DDBTRF()

the block size must not exceed the limit set by the size of the
local arrays work13 and work31

DLAREF()

if .true., then apply any column reflections to z as well.
if .false., then do no additional work on z
z       (global input/output) double precision array, (ldz,*)

DLASORTE()

since every 2nd subdiagonal is guaranteed to be zero.
this routine does no parallel work
arguments

DSTEIN2()

skip all the work if the block size is one

PCDBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCDBTRF()

check worksiz

PCDBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCDTSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCDTTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCGBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCGBTRF()

check worksiz

PCGBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCGEBD2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGEBRD()

work    (local workspace/local output) complex array
on exit, work( 1 ) returns the minimal and optimal lwork.

PCGECON()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGEHD2()

work    (local workspace/local output) complex array
on exit, work( 1 ) returns the minimal and optimal lwork.

PCGEHRD()

work    (local workspace/local output) complex array
on exit, work( 1 ) returns the minimal and optimal lwork.

PCGELQ2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGELQF()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGELS()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGEQL2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGEQLF()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGEQPF()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGEQR2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGEQRF()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGERFS()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGERQ2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGERQF()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGESVD()

a       (local input/workspace) block cyclic comple
global dimension (m, n), local dimension (mp, nq)

PCGESVX()

rcond is less than the machine precision (in particular, if
rcond = 0), the matrix is singular to working precision

PCGETRI()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGGQRF()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGGRQF()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCHEEV()

a       (local input/workspace) block cyclic complex array
locc(ja+n-1) )

PCHEEVD()

a       (local input/workspace) block cyclic complex array
locc(ja+n-1) )

PCHEEVX()

a       (local input/workspace) block cyclic complex array
local dimension ( lld_a, locc(ja+n-1) )

PCHEGVX()

with guaranteed high relative accuracy," by demmel and
kahan, lapack working note #3
see "on the correctness of parallel bisection in floating

PCHENGST()

pchengst also calls pchegst when insufficient workspace i
performance only when lwork >= 2 * np0 * nb + nq0 * nb + nb * nb

PCHENTRD()

codes (either the serial, chetrd, or the parallel code, pchettrd)
when the workspace provided by the user is adequate

PCHETD2()

work    (local workspace/local output) complex array
on exit, work( 1 ) returns the minimal and optimal lwork.

PCHETRD()

work    (local workspace/local output) complex array
on exit, work( 1 ) returns the minimal and optimal lwork.

PCHETTRD()

work    (local workspace) complex array, dimension (lwork

PCLABRD()

work    (local workspace) complex array, dimension (lwork

PCLACONSB()

buf     (local output) complex array of size lwork
lwork   (global input) integer

PCLAHQR()

determine the number of columns we have so we can check workspac

PCLAHRD()

work    (local workspace) complex array, dimension (nb
further details

PCLAMR1D()

i am not sure that this works correctly when ib and jb are not equa
with 1 used in its place.

PCLANGE()

work    (local workspace) real array dimension (lwork
nq0 if norm = '1', 'o' or 'o',

PCLANHE()

icurcol : process column containing diagonal block
irsc0   : pointer to part of work used to store the rowsums whil
irsr0   : pointer to part of work used to store the rowsums after

PCLANSY()

icurcol : process column containing diagonal block
irsc0   : pointer to part of work used to store the rowsums whil
irsr0   : pointer to part of work used to store the rowsums after

PCLARFB()

work    (local workspace) complex array, dimension (lwork
if side = 'l',

PCLARFT()

work    (local workspace) complex array

PCLARZB()

work    (local workspace) complex array, dimension (lwork
if side = 'l',

PCLARZT()

work    (local workspace) complex array

PCLASMSUB()

buf     (local output) complex array of size lwork
lwork   (global input) integer

PCLASWP()

already been broadcast along the process row or column.
also note that this routine will only work for k1-k2 being in th
pclapiv.

PCLATRD()

work    (local workspace) complex array, dimension (nb_a
further details

PCLATRZ()

work    (local workspace) complex array, dimension (lwork

PCPBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCPBTRF()

check worksiz

PCPBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCPOCON()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCPORFS()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCPOSVX()

machine precision (in particular, if rcond = 0), the matrix
is singular to working precision.  this condition i
error bounds are not computed.

PCPTSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCPTTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCSTEIN()

orthogonalize vectors that are on different processes. the extent
of orthogonalization is controlled by the input parameter lwork
process. pcstein decides on the allocation of work among the

PCTRCON()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCTREVC()

work    (local workspace) complex array
additional workspace may be required if pclattrs is updated

PCTRRFS()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCTZRZF()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNG2L()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNG2R()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNGL2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNGLQ()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNGQL()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNGQR()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNGR2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNGRQ()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNM2L()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNM2R()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMBR()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMHR()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNML2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMLQ()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMQL()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMQR()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMR2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMR3()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMRQ()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMRZ()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMTR()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PDDBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDDBTRF()

check worksiz

PDDBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDDTSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDDTTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDGBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDGBTRF()

check worksiz

PDGBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDGEBD2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGEBRD()

work    (local workspace/local output) double precision array
on exit, work( 1 ) returns the minimal and optimal lwork.

PDGECON()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGEHD2()

work    (local workspace/local output) double precision array
on exit, work( 1 ) returns the minimal and optimal lwork.

PDGEHRD()

work    (local workspace/local output) double precision array
on exit, work( 1 ) returns the minimal and optimal lwork.

PDGELQ2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGELQF()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGELS()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGEQL2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGEQLF()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGEQPF()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGEQR2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGEQRF()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGERFS()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGERQ2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGERQF()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGESVD()

a       (local input/workspace) block cyclic double precisio
global dimension (m, n), local dimension (mp, nq)

PDGESVX()

rcond is less than the machine precision (in particular, if
rcond = 0), the matrix is singular to working precision

PDGETRI()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGGQRF()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGGRQF()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDLABRD()

work    (local workspace) double precision array, dimension (lwork

PDLACONSB()

buf     (local output) double precision array of size lwork
lwork   (global input) integer

PDLAED0()

work    (local workspace ) double precision array, dimension (lwork
np = numroc( n, mb_q, myrow, iqrow, nprow )

PDLAED1()

work    (local workspace/output) double precision array

PDLAED3()

this code makes very mild assumptions about floating point
arithmetic. it will work on machines with a guard digit i
which subtract like the cray x-mp, cray y-mp, cray c-90, or cray-2.

PDLAEVSWP()

work    (local workspace) double precision array, dimension (lwork
lwork   (local input) integer dimension of work

PDLAHQR()

determine the number of columns we have so we can check workspac

PDLAHRD()

work    (local workspace) double precision array, dimension (nb
further details

PDLAMR1D()

i am not sure that this works correctly when ib and jb are not equa
with 1 used in its place.

PDLANGE()

work    (local workspace) double precision array dimension (lwork
nq0 if norm = '1', 'o' or 'o',

PDLANSY()

icurcol : process column containing diagonal block
irsc0   : pointer to part of work used to store the rowsums whil
irsr0   : pointer to part of work used to store the rowsums after

PDLARED1D()

work    (local workspace) double precision dimension (lwork

PDLARED2D()

work    (local workspace) double precision dimension (lwork

PDLARFB()

work    (local workspace) double precision array, dimension (lwork
if side = 'l',

PDLARFT()

work    (local workspace) double precision array

PDLARZB()

work    (local workspace) double precision array, dimension (lwork
if side = 'l',

PDLARZT()

work    (local workspace) double precision array

PDLASMSUB()

buf     (local output) double precision array of size lwork
lwork   (global input) integer

PDLASRT()

work    (local workspace/local output) double precision array
lwork   (local or global input) integer

PDLASWP()

already been broadcast along the process row or column.
also note that this routine will only work for k1-k2 being in th
pdlapiv.

PDLATRD()

work    (local workspace) double precision array, dimension (nb_a
further details

PDLATRZ()

work    (local workspace) double precision array, dimension (lwork

PDORG2L()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORG2R()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORGL2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORGLQ()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORGQL()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORGQR()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORGR2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORGRQ()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORM2L()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORM2R()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMBR()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMHR()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORML2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMLQ()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMQL()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMQR()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMR2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMR3()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMRQ()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMRZ()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMTR()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDPBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDPBTRF()

check worksiz

PDPBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDPOCON()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDPORFS()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDPOSVX()

machine precision (in particular, if rcond = 0), the matrix
is singular to working precision.  this condition i
error bounds are not computed.

PDPTSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDPTTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDSTEBZ()

the interval [vl, vu], or the eigenvalues indexed il through iu. a
static partitioning of work is done at the beginning of pdstebz whic
eigenvalues.

PDSTEDC()

this code makes very mild assumptions about floating point
arithmetic. it will work on machines with a guard digit i
which subtract like the cray x-mp, cray y-mp, cray c-90, or cray-2.

PDSTEIN()

orthogonalize vectors that are on different processes. the extent
of orthogonalization is controlled by the input parameter lwork
process. pdstein decides on the allocation of work among the

PDSYEV()

a       (local input/workspace) block cyclic double precision array
locc(ja+n-1) )

PDSYEVD()

a       (local input/workspace) block cyclic double precision array
locc(ja+n-1) )

PDSYEVX()

a       (local input/workspace) block cyclic double precision array
local dimension ( lld_a, locc(ja+n-1) )

PDSYGVX()

with guaranteed high relative accuracy," by demmel and
kahan, lapack working note #3
see "on the correctness of parallel bisection in floating

PDSYNGST()

pdsyngst also calls pdhegst when insufficient workspace i
performance only when lwork >= 2 * np0 * nb + nq0 * nb + nb * nb

PDSYNTRD()

codes (either the serial, dsytrd, or the parallel code, pdsyttrd)
when the workspace provided by the user is adequate

PDSYTD2()

work    (local workspace/local output) double precision array
on exit, work( 1 ) returns the minimal and optimal lwork.

PDSYTRD()

work    (local workspace/local output) double precision array
on exit, work( 1 ) returns the minimal and optimal lwork.

PDSYTTRD()

work  (local workspace) double precision array, dimension (lwork

PDTRCON()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDTRRFS()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDTZRZF()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PSDBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSDBTRF()

check worksiz

PSDBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSDTSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSDTTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSGBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSGBTRF()

check worksiz

PSGBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSGEBD2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGEBRD()

work    (local workspace/local output) real array
on exit, work( 1 ) returns the minimal and optimal lwork.

PSGECON()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGEHD2()

work    (local workspace/local output) real array
on exit, work( 1 ) returns the minimal and optimal lwork.

PSGEHRD()

work    (local workspace/local output) real array
on exit, work( 1 ) returns the minimal and optimal lwork.

PSGELQ2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGELQF()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGELS()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGEQL2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGEQLF()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGEQPF()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGEQR2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGEQRF()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGERFS()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGERQ2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGERQF()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGESVD()

a       (local input/workspace) block cyclic rea
global dimension (m, n), local dimension (mp, nq)

PSGESVX()

rcond is less than the machine precision (in particular, if
rcond = 0), the matrix is singular to working precision

PSGETRI()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGGQRF()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGGRQF()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSLABRD()

work    (local workspace) real array, dimension (lwork

PSLACONSB()

buf     (local output) real array of size lwork
lwork   (global input) integer

PSLAED0()

work    (local workspace ) real array, dimension (lwork
np = numroc( n, mb_q, myrow, iqrow, nprow )

PSLAED1()

work    (local workspace/output) real array

PSLAED3()

this code makes very mild assumptions about floating point
arithmetic. it will work on machines with a guard digit i
which subtract like the cray x-mp, cray y-mp, cray c-90, or cray-2.

PSLAEVSWP()

work    (local workspace) real array, dimension (lwork
lwork   (local input) integer dimension of work

PSLAHQR()

determine the number of columns we have so we can check workspac

PSLAHRD()

work    (local workspace) real array, dimension (nb
further details

PSLAMR1D()

i am not sure that this works correctly when ib and jb are not equa
with 1 used in its place.

PSLANGE()

work    (local workspace) real array dimension (lwork
nq0 if norm = '1', 'o' or 'o',

PSLANSY()

icurcol : process column containing diagonal block
irsc0   : pointer to part of work used to store the rowsums whil
irsr0   : pointer to part of work used to store the rowsums after

PSLARED1D()

work    (local workspace) real dimension (lwork

PSLARED2D()

work    (local workspace) real dimension (lwork

PSLARFB()

work    (local workspace) real array, dimension (lwork
if side = 'l',

PSLARFT()

work    (local workspace) real array

PSLARZB()

work    (local workspace) real array, dimension (lwork
if side = 'l',

PSLARZT()

work    (local workspace) real array

PSLASMSUB()

buf     (local output) real array of size lwork
lwork   (global input) integer

PSLASRT()

work    (local workspace/local output) real array
lwork   (local or global input) integer

PSLASWP()

already been broadcast along the process row or column.
also note that this routine will only work for k1-k2 being in th
pslapiv.

PSLATRD()

work    (local workspace) real array, dimension (nb_a
further details

PSLATRZ()

work    (local workspace) real array, dimension (lwork

PSORG2L()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORG2R()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORGL2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORGLQ()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORGQL()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORGQR()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORGR2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORGRQ()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORM2L()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORM2R()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMBR()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMHR()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORML2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMLQ()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMQL()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMQR()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMR2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMR3()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMRQ()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMRZ()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMTR()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSPBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSPBTRF()

check worksiz

PSPBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSPOCON()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSPORFS()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSPOSVX()

machine precision (in particular, if rcond = 0), the matrix
is singular to working precision.  this condition i
error bounds are not computed.

PSPTSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSPTTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSSTEBZ()

the interval [vl, vu], or the eigenvalues indexed il through iu. a
static partitioning of work is done at the beginning of psstebz whic
eigenvalues.

PSSTEDC()

this code makes very mild assumptions about floating point
arithmetic. it will work on machines with a guard digit i
which subtract like the cray x-mp, cray y-mp, cray c-90, or cray-2.

PSSTEIN()

orthogonalize vectors that are on different processes. the extent
of orthogonalization is controlled by the input parameter lwork
process. psstein decides on the allocation of work among the

PSSYEV()

a       (local input/workspace) block cyclic double precision array
locc(ja+n-1) )

PSSYEVD()

a       (local input/workspace) block cyclic real array
locc(ja+n-1) )

PSSYEVX()

a       (local input/workspace) block cyclic real array
local dimension ( lld_a, locc(ja+n-1) )

PSSYGVX()

with guaranteed high relative accuracy," by demmel and
kahan, lapack working note #3
see "on the correctness of parallel bisection in floating

PSSYNGST()

pssyngst also calls pshegst when insufficient workspace i
performance only when lwork >= 2 * np0 * nb + nq0 * nb + nb * nb

PSSYNTRD()

codes (either the serial, ssytrd, or the parallel code, pssyttrd)
when the workspace provided by the user is adequate

PSSYTD2()

work    (local workspace/local output) real array
on exit, work( 1 ) returns the minimal and optimal lwork.

PSSYTRD()

work    (local workspace/local output) real array
on exit, work( 1 ) returns the minimal and optimal lwork.

PSSYTTRD()

work    (local workspace) real array, dimension (lwork

PSTRCON()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSTRRFS()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSTZRZF()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PZDBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZDBTRF()

check worksiz

PZDBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZDTSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZDTTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZGBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZGBTRF()

check worksiz

PZGBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZGEBD2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGEBRD()

work    (local workspace/local output) complex*16 array
on exit, work( 1 ) returns the minimal and optimal lwork.

PZGECON()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGEHD2()

work    (local workspace/local output) complex*16 array
on exit, work( 1 ) returns the minimal and optimal lwork.

PZGEHRD()

work    (local workspace/local output) complex*16 array
on exit, work( 1 ) returns the minimal and optimal lwork.

PZGELQ2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGELQF()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGELS()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGEQL2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGEQLF()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGEQPF()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGEQR2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGEQRF()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGERFS()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGERQ2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGERQF()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGESVD()

a       (local input/workspace) block cyclic complex*1
global dimension (m, n), local dimension (mp, nq)

PZGESVX()

rcond is less than the machine precision (in particular, if
rcond = 0), the matrix is singular to working precision

PZGETRI()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGGQRF()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGGRQF()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZHEEV()

a       (local input/workspace) block cyclic complex*16 array
locc(ja+n-1) )

PZHEEVD()

a       (local input/workspace) block cyclic complex*16 array
locc(ja+n-1) )

PZHEEVX()

a       (local input/workspace) block cyclic complex*16 array
local dimension ( lld_a, locc(ja+n-1) )

PZHEGVX()

with guaranteed high relative accuracy," by demmel and
kahan, lapack working note #3
see "on the correctness of parallel bisection in floating

PZHENGST()

pzhengst also calls pzhegst when insufficient workspace i
performance only when lwork >= 2 * np0 * nb + nq0 * nb + nb * nb

PZHENTRD()

codes (either the serial, zhetrd, or the parallel code, pzhettrd)
when the workspace provided by the user is adequate

PZHETD2()

work    (local workspace/local output) complex*16 array
on exit, work( 1 ) returns the minimal and optimal lwork.

PZHETRD()

work    (local workspace/local output) complex*16 array
on exit, work( 1 ) returns the minimal and optimal lwork.

PZHETTRD()

work    (local workspace) complex*16 array, dimension (lwork

PZLABRD()

work    (local workspace) complex*16 array, dimension (lwork

PZLACONSB()

buf     (local output) complex*16 array of size lwork
lwork   (global input) integer

PZLAHQR()

determine the number of columns we have so we can check workspac

PZLAHRD()

work    (local workspace) complex*16 array, dimension (nb
further details

PZLAMR1D()

i am not sure that this works correctly when ib and jb are not equa
with 1 used in its place.

PZLANGE()

work    (local workspace) double precision array dimension (lwork
nq0 if norm = '1', 'o' or 'o',

PZLANHE()

icurcol : process column containing diagonal block
irsc0   : pointer to part of work used to store the rowsums whil
irsr0   : pointer to part of work used to store the rowsums after

PZLANSY()

icurcol : process column containing diagonal block
irsc0   : pointer to part of work used to store the rowsums whil
irsr0   : pointer to part of work used to store the rowsums after

PZLARFB()

work    (local workspace) complex*16 array, dimension (lwork
if side = 'l',

PZLARFT()

work    (local workspace) complex*16 array

PZLARZB()

work    (local workspace) complex*16 array, dimension (lwork
if side = 'l',

PZLARZT()

work    (local workspace) complex*16 array

PZLASMSUB()

buf     (local output) complex*16 array of size lwork
lwork   (global input) integer

PZLASWP()

already been broadcast along the process row or column.
also note that this routine will only work for k1-k2 being in th
pzlapiv.

PZLATRD()

work    (local workspace) complex*16 array, dimension (nb_a
further details

PZLATRZ()

work    (local workspace) complex*16 array, dimension (lwork

PZPBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZPBTRF()

check worksiz

PZPBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZPOCON()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZPORFS()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZPOSVX()

machine precision (in particular, if rcond = 0), the matrix
is singular to working precision.  this condition i
error bounds are not computed.

PZPTSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZPTTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZSTEIN()

orthogonalize vectors that are on different processes. the extent
of orthogonalization is controlled by the input parameter lwork
process. pzstein decides on the allocation of work among the

PZTRCON()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZTREVC()

work    (local workspace) complex*16 array
additional workspace may be required if pzlattrs is updated

PZTRRFS()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZTZRZF()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNG2L()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNG2R()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNGL2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNGLQ()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNGQL()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNGQR()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNGR2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNGRQ()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNM2L()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNM2R()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMBR()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMHR()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNML2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMLQ()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMQL()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMQR()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMR2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMR3()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMRQ()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMRZ()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMTR()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

SDBTRF()

the block size must not exceed the limit set by the size of the
local arrays work13 and work31

SLAREF()

if .true., then apply any column reflections to z as well.
if .false., then do no additional work on z
z       (global input/output) real array, (ldz,*)

SLASORTE()

since every 2nd subdiagonal is guaranteed to be zero.
this routine does no parallel work
arguments

SSTEIN2()

skip all the work if the block size is one

ZDBTRF()

the block size must not exceed the limit set by the size of the
local arrays work13 and work31

ZLAREF()

if .true., then apply any column reflections to z as well.
if .false., then do no additional work on z
z       (global input/output) complex*16 array, (ldz,*)

WORK13

CDBTRF()

the block size must not exceed the limit set by the size of the
local arrays WORK13 and work31

DDBTRF()

the block size must not exceed the limit set by the size of the
local arrays WORK13 and work31

SDBTRF()

the block size must not exceed the limit set by the size of the
local arrays WORK13 and work31

ZDBTRF()

the block size must not exceed the limit set by the size of the
local arrays WORK13 and work31

WORK31

CDBTRF()

the block size must not exceed the limit set by the size of the
local arrays work13 and WORK31

DDBTRF()

the block size must not exceed the limit set by the size of the
local arrays work13 and WORK31

SDBTRF()

the block size must not exceed the limit set by the size of the
local arrays work13 and WORK31

ZDBTRF()

the block size must not exceed the limit set by the size of the
local arrays work13 and WORK31

Working

PCGESVX()

rcond is less than the machine precision (in particular, if
rcond = 0), the matrix is singular to Working precision

PCHEEVD()

siam j. sci. comput., 6:20 (1999), pp. 2223--2236.
(see also lapack Working note 132

PCHEEVX()

with guaranteed high relative accuracy," by demmel and
kahan, lapack Working note #3
see "on the correctness of parallel bisection in floating

PCHEGVX()

with guaranteed high relative accuracy," by demmel and
kahan, lapack Working note #3
see "on the correctness of parallel bisection in floating

PCPOSVX()

machine precision (in particular, if rcond = 0), the matrix
is singular to Working precision.  this condition i
error bounds are not computed.

PDGESVX()

rcond is less than the machine precision (in particular, if
rcond = 0), the matrix is singular to Working precision

PDLAED0()

> 0:  the algorithm failed to compute the info/(n+1) th
eigenvalue while Working on the submatrix lying i

PDPOSVX()

machine precision (in particular, if rcond = 0), the matrix
is singular to Working precision.  this condition i
error bounds are not computed.

PDSTEDC()

> 0:  the algorithm failed to compute the info/(n+1) th
eigenvalue while Working on the submatrix lying i

PDSYEVD()

> 0:  the algorithm failed to compute the info/(n+1) th
eigenvalue while Working on the submatrix lying i

PDSYEVX()

with guaranteed high relative accuracy," by demmel and
kahan, lapack Working note #3
see "on the correctness of parallel bisection in floating

PDSYGVX()

with guaranteed high relative accuracy," by demmel and
kahan, lapack Working note #3
see "on the correctness of parallel bisection in floating

PSGESVX()

rcond is less than the machine precision (in particular, if
rcond = 0), the matrix is singular to Working precision

PSLAED0()

> 0:  the algorithm failed to compute the info/(n+1) th
eigenvalue while Working on the submatrix lying i

PSPOSVX()

machine precision (in particular, if rcond = 0), the matrix
is singular to Working precision.  this condition i
error bounds are not computed.

PSSTEDC()

> 0:  the algorithm failed to compute the info/(n+1) th
eigenvalue while Working on the submatrix lying i

PSSYEVD()

> 0:  the algorithm failed to compute the info/(n+1) th
eigenvalue while Working on the submatrix lying i

PSSYEVX()

with guaranteed high relative accuracy," by demmel and
kahan, lapack Working note #3
see "on the correctness of parallel bisection in floating

PSSYGVX()

with guaranteed high relative accuracy," by demmel and
kahan, lapack Working note #3
see "on the correctness of parallel bisection in floating

PZGESVX()

rcond is less than the machine precision (in particular, if
rcond = 0), the matrix is singular to Working precision

PZHEEVD()

siam j. sci. comput., 6:20 (1999), pp. 2223--2236.
(see also lapack Working note 132

PZHEEVX()

with guaranteed high relative accuracy," by demmel and
kahan, lapack Working note #3
see "on the correctness of parallel bisection in floating

PZHEGVX()

with guaranteed high relative accuracy," by demmel and
kahan, lapack Working note #3
see "on the correctness of parallel bisection in floating

PZPOSVX()

machine precision (in particular, if rcond = 0), the matrix
is singular to Working precision.  this condition i
error bounds are not computed.

workloads

PCLAHQR()

make sure it's divisible by lcm (we want even workloads!

PDLAHQR()

make sure it's divisible by lcm (we want even workloads!

PSLAHQR()

make sure it's divisible by lcm (we want even workloads!

PZLAHQR()

make sure it's divisible by lcm (we want even workloads!

works

CLAHQR2()

the main loop begins here. i is the loop index and decreases from
ihi to ilo in steps of 1 or 2. each iteration of the loop works
eigenvalues i+1 to ihi have already converged. either l = ilo, or

PCGEEQU()

factors is not guaranteed to reduce the condition number of
sub( a ) but works well in practice
notes

PCLAHQR()

determine the number of columns we have so we can check workspac

PCLAMR1D()

i am not sure that this works correctly when ib and jb are not equa
with 1 used in its place.

PDGEEQU()

factors is not guaranteed to reduce the condition number of
sub( a ) but works well in practice
notes

PDLAHQR()

determine the number of columns we have so we can check workspac

PDLAMR1D()

i am not sure that this works correctly when ib and jb are not equa
with 1 used in its place.

PSGEEQU()

factors is not guaranteed to reduce the condition number of
sub( a ) but works well in practice
notes

PSLAHQR()

determine the number of columns we have so we can check workspac

PSLAMR1D()

i am not sure that this works correctly when ib and jb are not equa
with 1 used in its place.

PZGEEQU()

factors is not guaranteed to reduce the condition number of
sub( a ) but works well in practice
notes

PZLAHQR()

determine the number of columns we have so we can check workspac

PZLAMR1D()

i am not sure that this works correctly when ib and jb are not equa
with 1 used in its place.

ZLAHQR2()

the main loop begins here. i is the loop index and decreases from
ihi to ilo in steps of 1 or 2. each iteration of the loop works
eigenvalues i+1 to ihi have already converged. either l = ilo, or

worksize

PCDBTRF()

check worksize

PCDTTRF()

check worksize

PCGBTRF()

check worksize

PCPBTRF()

check worksize

PCPTTRF()

check worksize

PDDBTRF()

check worksize

PDDTTRF()

check worksize

PDGBTRF()

check worksize

PDPBTRF()

check worksize

PDPTTRF()

check worksize

PDPTTRSV()

output minimum worksize

PSDBTRF()

check worksize

PSDTTRF()

check worksize

PSGBTRF()

check worksize

PSPBTRF()

check worksize

PSPTTRF()

check worksize

PSPTTRSV()

output minimum worksize

PZDBTRF()

check worksize

PZDTTRF()

check worksize

PZGBTRF()

check worksize

PZPBTRF()

check worksize

PZPTTRF()

check worksize

workspace

PCDBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCDBTRF()

offset to workspace for upper triangular facto

PCDBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCDBTRSV()

offset to workspace for upper triangular facto

PCDTSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCDTTRF()

offset to workspace for upper triangular facto

PCDTTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCDTTRSV()

offset to workspace for upper triangular facto

PCGBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCGBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCGEBD2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGEBRD()

work    (local workspace/local output) complex array
on exit, work( 1 ) returns the minimal and optimal lwork.

PCGECON()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGEHD2()

work    (local workspace/local output) complex array
on exit, work( 1 ) returns the minimal and optimal lwork.

PCGEHRD()

work    (local workspace/local output) complex array
on exit, work( 1 ) returns the minimal and optimal lwork.

PCGELQ2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGELQF()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGELS()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGEQL2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGEQLF()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGEQPF()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGEQR2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGEQRF()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGERFS()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGERQ2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGERQF()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGESVD()

a       (local input/workspace) block cyclic comple
global dimension (m, n), local dimension (mp, nq)

PCGESVX()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGETRI()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGGQRF()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCGGRQF()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCHEEV()

a       (local input/workspace) block cyclic complex array
locc(ja+n-1) )

PCHEEVD()

a       (local input/workspace) block cyclic complex array
locc(ja+n-1) )

PCHEEVX()

a       (local input/workspace) block cyclic complex array
local dimension ( lld_a, locc(ja+n-1) )

PCHEGVX()

space to hold the eigenvectors in z (m .le. descz(n_))
and sufficient workspace to compute them.  (see lwork below.
computation unless range .eq. 'v'.

PCHENGST()

pchengst also calls pchegst when insufficient workspace i
performance only when lwork >= 2 * np0 * nb + nq0 * nb + nb * nb

PCHENTRD()

codes (either the serial, chetrd, or the parallel code, pchettrd)
when the workspace provided by the user is adequate

PCHETD2()

work    (local workspace/local output) complex array
on exit, work( 1 ) returns the minimal and optimal lwork.

PCHETRD()

work    (local workspace/local output) complex array
on exit, work( 1 ) returns the minimal and optimal lwork.

PCHETTRD()

work    (local workspace) complex array, dimension (lwork

PCLABRD()

work    (local workspace) complex array, dimension (lwork

PCLACON()

v       (local workspace) complex pointer into the loca
the final return, v = a*w, where est = norm(v)/norm(w)

PCLAEVSWP()

rwork    (local workspace) real array, dimension (lrwork
lrwork   (local input) integer dimension of rwork

PCLAHQR()

determine the number of columns we have so we can check workspace

PCLAHRD()

work    (local workspace) complex array, dimension (nb
further details

PCLAMR1D()

work    (local workspace) complex*16 array, dimension ( lwork 
lwork   (local input) integer

PCLANGE()

work    (local workspace) real array dimension (lwork
nq0 if norm = '1', 'o' or 'o',

PCLAPIV()

or 'c' and pivroc='r' or 'r', the last piece of this array of
size mb_a (resp. nb_a) is used as workspace. in those cases

PCLAPV2()

local row (column) i was swapped with.  the last piece of the
array of size mb_a (resp. nb_a) is used as workspace. ipiv i

PCLARFB()

work    (local workspace) complex array, dimension (lwork
if side = 'l',

PCLARFT()

work    (local workspace) complex array

PCLARZB()

work    (local workspace) complex array, dimension (lwork
if side = 'l',

PCLARZT()

work    (local workspace) complex array

PCLATRD()

work    (local workspace) complex array, dimension (nb_a
further details

PCLATRZ()

work    (local workspace) complex array, dimension (lwork

PCPBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCPBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCPOCON()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCPORFS()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCPOSVX()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCPTSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCPTTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PCSTEIN()

processes and then calls sstein2 (modified lapack routine) on each
individual process. if insufficient workspace is allocated, th

PCTRCON()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCTREVC()

work    (local workspace) complex array
additional workspace may be required if pclattrs is updated

PCTRRFS()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCTZRZF()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNG2L()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNG2R()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNGL2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNGLQ()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNGQL()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNGQR()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNGR2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNGRQ()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNM2L()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNM2R()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMBR()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMHR()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNML2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMLQ()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMQL()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMQR()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMR2()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMR3()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMRQ()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMRZ()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PCUNMTR()

work    (local workspace/local output) complex array
on exit, work(1) returns the minimal and optimal lwork.

PDDBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDDBTRF()

offset to workspace for upper triangular facto

PDDBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDDBTRSV()

offset to workspace for upper triangular facto

PDDTSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDDTTRF()

offset to workspace for upper triangular facto

PDDTTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDDTTRSV()

offset to workspace for upper triangular facto

PDGBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDGBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDGEBD2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGEBRD()

work    (local workspace/local output) double precision array
on exit, work( 1 ) returns the minimal and optimal lwork.

PDGECON()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGEHD2()

work    (local workspace/local output) double precision array
on exit, work( 1 ) returns the minimal and optimal lwork.

PDGEHRD()

work    (local workspace/local output) double precision array
on exit, work( 1 ) returns the minimal and optimal lwork.

PDGELQ2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGELQF()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGELS()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGEQL2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGEQLF()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGEQPF()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGEQR2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGEQRF()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGERFS()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGERQ2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGERQF()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGESVD()

a       (local input/workspace) block cyclic double precisio
global dimension (m, n), local dimension (mp, nq)

PDGESVX()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGETRI()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGGQRF()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDGGRQF()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDLABRD()

work    (local workspace) double precision array, dimension (lwork

PDLACON()

v       (local workspace) double precision pointer into the loca
the final return, v = a*w, where est = norm(v)/norm(w)

PDLAED0()

work    (local workspace ) double precision array, dimension (lwork
np = numroc( n, mb_q, myrow, iqrow, nprow )

PDLAED1()

work    (local workspace/output) double precision array

PDLAED2()

qbuf   (workspace) double precision array, dimension 3*
ctot   (workspace) integer array, dimension( npcol, 4)

PDLAED3()

qbuf   (workspace) double precision array, dimension 3*

PDLAEVSWP()

work    (local workspace) double precision array, dimension (lwork
lwork   (local input) integer dimension of work

PDLAHQR()

determine the number of columns we have so we can check workspace

PDLAHRD()

work    (local workspace) double precision array, dimension (nb
further details

PDLAMR1D()

work    (local workspace) complex*16 array, dimension ( lwork 
lwork   (local input) integer

PDLANGE()

work    (local workspace) double precision array dimension (lwork
nq0 if norm = '1', 'o' or 'o',

PDLAPIV()

or 'c' and pivroc='r' or 'r', the last piece of this array of
size mb_a (resp. nb_a) is used as workspace. in those cases

PDLAPV2()

local row (column) i was swapped with.  the last piece of the
array of size mb_a (resp. nb_a) is used as workspace. ipiv i

PDLARED1D()

work    (local workspace) double precision dimension (lwork

PDLARED2D()

work    (local workspace) double precision dimension (lwork

PDLARFB()

work    (local workspace) double precision array, dimension (lwork
if side = 'l',

PDLARFT()

work    (local workspace) double precision array

PDLARZB()

work    (local workspace) double precision array, dimension (lwork
if side = 'l',

PDLARZT()

work    (local workspace) double precision array

PDLASRT()

work    (local workspace/local output) double precision array
lwork   (local or global input) integer

PDLATRD()

work    (local workspace) double precision array, dimension (nb_a
further details

PDLATRZ()

work    (local workspace) double precision array, dimension (lwork

PDORG2L()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORG2R()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORGL2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORGLQ()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORGQL()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORGQR()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORGR2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORGRQ()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORM2L()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORM2R()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMBR()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMHR()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORML2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMLQ()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMQL()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMQR()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMR2()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMR3()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMRQ()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMRZ()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDORMTR()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDPBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDPBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDPOCON()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDPORFS()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDPOSVX()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDPTSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDPTTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PDSTEBZ()

work    (local workspace) double precision array

PDSTEDC()

work    (local workspace/output) double precision array
on output, work(1) returns the workspace needed.

PDSTEIN()

processes and then calls dstein2 (modified lapack routine) on each
individual process. if insufficient workspace is allocated, th

PDSYEV()

a       (local input/workspace) block cyclic double precision array
locc(ja+n-1) )

PDSYEVD()

a       (local input/workspace) block cyclic double precision array
locc(ja+n-1) )

PDSYEVX()

a       (local input/workspace) block cyclic double precision array
local dimension ( lld_a, locc(ja+n-1) )

PDSYGVX()

space to hold the eigenvectors in z (m .le. descz(n_))
and sufficient workspace to compute them.  (see lwork below.
computation unless range .eq. 'v'.

PDSYNGST()

pdsyngst also calls pdhegst when insufficient workspace i
performance only when lwork >= 2 * np0 * nb + nq0 * nb + nb * nb

PDSYNTRD()

codes (either the serial, dsytrd, or the parallel code, pdsyttrd)
when the workspace provided by the user is adequate

PDSYTD2()

work    (local workspace/local output) double precision array
on exit, work( 1 ) returns the minimal and optimal lwork.

PDSYTRD()

work    (local workspace/local output) double precision array
on exit, work( 1 ) returns the minimal and optimal lwork.

PDSYTTRD()

work  (local workspace) double precision array, dimension (lwork

PDTRCON()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDTRRFS()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PDTZRZF()

work    (local workspace/local output) double precision array
on exit, work(1) returns the minimal and optimal lwork.

PSDBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSDBTRF()

offset to workspace for upper triangular facto

PSDBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSDBTRSV()

offset to workspace for upper triangular facto

PSDTSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSDTTRF()

offset to workspace for upper triangular facto

PSDTTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSDTTRSV()

offset to workspace for upper triangular facto

PSGBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSGBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSGEBD2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGEBRD()

work    (local workspace/local output) real array
on exit, work( 1 ) returns the minimal and optimal lwork.

PSGECON()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGEHD2()

work    (local workspace/local output) real array
on exit, work( 1 ) returns the minimal and optimal lwork.

PSGEHRD()

work    (local workspace/local output) real array
on exit, work( 1 ) returns the minimal and optimal lwork.

PSGELQ2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGELQF()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGELS()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGEQL2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGEQLF()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGEQPF()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGEQR2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGEQRF()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGERFS()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGERQ2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGERQF()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGESVD()

a       (local input/workspace) block cyclic rea
global dimension (m, n), local dimension (mp, nq)

PSGESVX()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGETRI()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGGQRF()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSGGRQF()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSLABRD()

work    (local workspace) real array, dimension (lwork

PSLACON()

v       (local workspace) real pointer into the loca
the final return, v = a*w, where est = norm(v)/norm(w)

PSLAED0()

work    (local workspace ) real array, dimension (lwork
np = numroc( n, mb_q, myrow, iqrow, nprow )

PSLAED1()

work    (local workspace/output) real array

PSLAED2()

qbuf   (workspace) real array, dimension 3*
ctot   (workspace) integer array, dimension( npcol, 4)

PSLAED3()

qbuf   (workspace) real array, dimension 3*

PSLAEVSWP()

work    (local workspace) real array, dimension (lwork
lwork   (local input) integer dimension of work

PSLAHQR()

determine the number of columns we have so we can check workspace

PSLAHRD()

work    (local workspace) real array, dimension (nb
further details

PSLAMR1D()

work    (local workspace) complex*16 array, dimension ( lwork 
lwork   (local input) integer

PSLANGE()

work    (local workspace) real array dimension (lwork
nq0 if norm = '1', 'o' or 'o',

PSLAPIV()

or 'c' and pivroc='r' or 'r', the last piece of this array of
size mb_a (resp. nb_a) is used as workspace. in those cases

PSLAPV2()

local row (column) i was swapped with.  the last piece of the
array of size mb_a (resp. nb_a) is used as workspace. ipiv i

PSLARED1D()

work    (local workspace) real dimension (lwork

PSLARED2D()

work    (local workspace) real dimension (lwork

PSLARFB()

work    (local workspace) real array, dimension (lwork
if side = 'l',

PSLARFT()

work    (local workspace) real array

PSLARZB()

work    (local workspace) real array, dimension (lwork
if side = 'l',

PSLARZT()

work    (local workspace) real array

PSLASRT()

work    (local workspace/local output) real array
lwork   (local or global input) integer

PSLATRD()

work    (local workspace) real array, dimension (nb_a
further details

PSLATRZ()

work    (local workspace) real array, dimension (lwork

PSORG2L()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORG2R()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORGL2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORGLQ()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORGQL()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORGQR()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORGR2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORGRQ()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORM2L()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORM2R()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMBR()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMHR()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORML2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMLQ()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMQL()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMQR()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMR2()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMR3()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMRQ()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMRZ()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSORMTR()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSPBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSPBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSPOCON()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSPORFS()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSPOSVX()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSPTSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSPTTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PSSTEBZ()

work    (local workspace) real array, dimension ( max( 5*n, 7 ) 
lwork   (local input) integer

PSSTEDC()

work    (local workspace/output) real array
on output, work(1) returns the workspace needed.

PSSTEIN()

processes and then calls sstein2 (modified lapack routine) on each
individual process. if insufficient workspace is allocated, th

PSSYEV()

a       (local input/workspace) block cyclic double precision array
locc(ja+n-1) )

PSSYEVD()

a       (local input/workspace) block cyclic real array
locc(ja+n-1) )

PSSYEVX()

a       (local input/workspace) block cyclic real array
local dimension ( lld_a, locc(ja+n-1) )

PSSYGVX()

space to hold the eigenvectors in z (m .le. descz(n_))
and sufficient workspace to compute them.  (see lwork below.
computation unless range .eq. 'v'.

PSSYNGST()

pssyngst also calls pshegst when insufficient workspace i
performance only when lwork >= 2 * np0 * nb + nq0 * nb + nb * nb

PSSYNTRD()

codes (either the serial, ssytrd, or the parallel code, pssyttrd)
when the workspace provided by the user is adequate

PSSYTD2()

work    (local workspace/local output) real array
on exit, work( 1 ) returns the minimal and optimal lwork.

PSSYTRD()

work    (local workspace/local output) real array
on exit, work( 1 ) returns the minimal and optimal lwork.

PSSYTTRD()

work    (local workspace) real array, dimension (lwork

PSTRCON()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSTRRFS()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PSTZRZF()

work    (local workspace/local output) real array
on exit, work(1) returns the minimal and optimal lwork.

PZDBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZDBTRF()

offset to workspace for upper triangular facto

PZDBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZDBTRSV()

offset to workspace for upper triangular facto

PZDTSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZDTTRF()

offset to workspace for upper triangular facto

PZDTTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZDTTRSV()

offset to workspace for upper triangular facto

PZGBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZGBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZGEBD2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGEBRD()

work    (local workspace/local output) complex*16 array
on exit, work( 1 ) returns the minimal and optimal lwork.

PZGECON()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGEHD2()

work    (local workspace/local output) complex*16 array
on exit, work( 1 ) returns the minimal and optimal lwork.

PZGEHRD()

work    (local workspace/local output) complex*16 array
on exit, work( 1 ) returns the minimal and optimal lwork.

PZGELQ2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGELQF()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGELS()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGEQL2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGEQLF()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGEQPF()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGEQR2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGEQRF()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGERFS()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGERQ2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGERQF()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGESVD()

a       (local input/workspace) block cyclic complex*1
global dimension (m, n), local dimension (mp, nq)

PZGESVX()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGETRI()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGGQRF()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZGGRQF()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZHEEV()

a       (local input/workspace) block cyclic complex*16 array
locc(ja+n-1) )

PZHEEVD()

a       (local input/workspace) block cyclic complex*16 array
locc(ja+n-1) )

PZHEEVX()

a       (local input/workspace) block cyclic complex*16 array
local dimension ( lld_a, locc(ja+n-1) )

PZHEGVX()

space to hold the eigenvectors in z (m .le. descz(n_))
and sufficient workspace to compute them.  (see lwork below.
computation unless range .eq. 'v'.

PZHENGST()

pzhengst also calls pzhegst when insufficient workspace i
performance only when lwork >= 2 * np0 * nb + nq0 * nb + nb * nb

PZHENTRD()

codes (either the serial, zhetrd, or the parallel code, pzhettrd)
when the workspace provided by the user is adequate

PZHETD2()

work    (local workspace/local output) complex*16 array
on exit, work( 1 ) returns the minimal and optimal lwork.

PZHETRD()

work    (local workspace/local output) complex*16 array
on exit, work( 1 ) returns the minimal and optimal lwork.

PZHETTRD()

work    (local workspace) complex*16 array, dimension (lwork

PZLABRD()

work    (local workspace) complex*16 array, dimension (lwork

PZLACON()

v       (local workspace) complex*16 pointer into the loca
the final return, v = a*w, where est = norm(v)/norm(w)

PZLAEVSWP()

rwork    (local workspace) double precision array, dimension (lrwork
lrwork   (local input) integer dimension of rwork

PZLAHQR()

determine the number of columns we have so we can check workspace

PZLAHRD()

work    (local workspace) complex*16 array, dimension (nb
further details

PZLAMR1D()

work    (local workspace) complex*16 array, dimension ( lwork 
lwork   (local input) integer

PZLANGE()

work    (local workspace) double precision array dimension (lwork
nq0 if norm = '1', 'o' or 'o',

PZLAPIV()

or 'c' and pivroc='r' or 'r', the last piece of this array of
size mb_a (resp. nb_a) is used as workspace. in those cases

PZLAPV2()

local row (column) i was swapped with.  the last piece of the
array of size mb_a (resp. nb_a) is used as workspace. ipiv i

PZLARFB()

work    (local workspace) complex*16 array, dimension (lwork
if side = 'l',

PZLARFT()

work    (local workspace) complex*16 array

PZLARZB()

work    (local workspace) complex*16 array, dimension (lwork
if side = 'l',

PZLARZT()

work    (local workspace) complex*16 array

PZLATRD()

work    (local workspace) complex*16 array, dimension (nb_a
further details

PZLATRZ()

work    (local workspace) complex*16 array, dimension (lwork

PZPBSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZPBTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZPOCON()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZPORFS()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZPOSVX()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZPTSV()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZPTTRS()

work    (local workspace/local output
be overwritten in between calls to routines. work must be

PZSTEIN()

processes and then calls dstein2 (modified lapack routine) on each
individual process. if insufficient workspace is allocated, th

PZTRCON()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZTREVC()

work    (local workspace) complex*16 array
additional workspace may be required if pzlattrs is updated

PZTRRFS()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZTZRZF()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNG2L()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNG2R()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNGL2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNGLQ()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNGQL()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNGQR()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNGR2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNGRQ()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNM2L()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNM2R()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMBR()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMHR()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNML2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMLQ()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMQL()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMQR()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMR2()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMR3()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMRQ()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMRZ()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

PZUNMTR()

work    (local workspace/local output) complex*16 array
on exit, work(1) returns the minimal and optimal lwork.

workspaces

PCGESVD()

where wpclange, wpclared1d, wpclared2d, wpcgebrd are the
workspaces required respectively for the subprogram
standard notation

PDGESVD()

where wpdlange, wpdlared1d, wpdlared2d, wpdgebrd are the
workspaces required respectively for the subprogram
standard notation

PSGESVD()

where wpslange, wpslared1d, wpslared2d, wpsgebrd are the
workspaces required respectively for the subprogram
standard notation

PZGESVD()

where wpzlange, wpzlared1d, wpzlared2d, wpzgebrd are the
workspaces required respectively for the subprogram
standard notation

worst

PDSTEBZ()

publicly released versions should be large enough to handle
the worst machine around.  note that this has no effec

PSSTEBZ()

publicly released versions should be large enough to handle
the worst machine around.  note that this has no effec

worth

PCGEEQU()

if rowcnd >= 0.1 and amax is neither too large nor too small,
it is not worth scaling by r(ia:ia+m-1)
colcnd  (global output) real

PCPOEQU()

ia <= i <= ia+n-1 and ja <= j <= ja+n-1. if scond >= 0.1
and amax is neither too large nor too small, it is not worth

PDGEEQU()

if rowcnd >= 0.1 and amax is neither too large nor too small,
it is not worth scaling by r(ia:ia+m-1)
colcnd  (global output) double precision

PDPOEQU()

ia <= i <= ia+n-1 and ja <= j <= ja+n-1. if scond >= 0.1
and amax is neither too large nor too small, it is not worth

PSGEEQU()

if rowcnd >= 0.1 and amax is neither too large nor too small,
it is not worth scaling by r(ia:ia+m-1)
colcnd  (global output) real

PSPOEQU()

ia <= i <= ia+n-1 and ja <= j <= ja+n-1. if scond >= 0.1
and amax is neither too large nor too small, it is not worth

PZGEEQU()

if rowcnd >= 0.1 and amax is neither too large nor too small,
it is not worth scaling by r(ia:ia+m-1)
colcnd  (global output) double precision

PZPOEQU()

ia <= i <= ia+n-1 and ja <= j <= ja+n-1. if scond >= 0.1
and amax is neither too large nor too small, it is not worth

would

CLAHQR2()

determine the effect of starting the double-shift qr
iteration at row m, and see if this would make h(m,m-1

PCDBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCDBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCDTSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCDTTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEBD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEBRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGECON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEEQU()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEHD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEHRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGELQ2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGELQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGELS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEQL2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEQLF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEQPF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEQR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGEQRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGERFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGERQ2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGERQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGESV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGESVD()

assume that its process grid has dimension r x c. locr( k ) denotes
the number of elements of k that a process would receive if k wer
locc( k ) denotes the number of elements of k that a process would

PCGESVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGETF2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGETRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGETRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGETRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGGQRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCGGRQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHEEV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHEEVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHEGS2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHEGST()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHEGVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHENGST()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHENTRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHETD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHETRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCHETTRD()

locp( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locq( k ) denotes the number of elements of k that a

PCLABRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLACGV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLACON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLACONSB()

seeing the effect of starting a double shift qr iteration
given by h44, h33, & h43h34 and see if this would make

PCLACP2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLACP3()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLACPY()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLAEVSWP()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLANGE()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLAPIV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLAPV2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLAQGE()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLAQSY()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLARFB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLARFG()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLARFT()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLARZB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLARZT()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLASCL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLASE2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLASET()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLASMSUB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLASSQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLASWP()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLATRA()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLATRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLATRZ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLAUU2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLAUUM()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCLAWIL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCMAX1()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPOCON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPOEQU()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPORFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPOSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPOSVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPOTF2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPOTRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPOTRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPOTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPTSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCPTTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCSRSCL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCSTEIN()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the r processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCTRCON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCTREVC()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the r processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCTRRFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCTRTI2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCTRTRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCTRTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCTZRZF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNG2L()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNG2R()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNGL2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNGLQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNGQL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNGQR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNGR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNGRQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNM2L()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNM2R()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMBR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMHR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNML2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMLQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMQL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMQR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMR3()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMRQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMRZ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PCUNMTR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDDBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDDBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDDTSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDDTTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEBD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEBRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGECON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEEQU()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEHD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEHRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGELQ2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGELQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGELS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEQL2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEQLF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEQPF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEQR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGEQRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGERFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGERQ2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGERQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGESV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGESVD()

assume that its process grid has dimension r x c. locr( k ) denotes
the number of elements of k that a process would receive if k wer
locc( k ) denotes the number of elements of k that a process would

PDGESVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGETF2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGETRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGETRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGETRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGGQRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDGGRQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLABRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLACON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLACONSB()

seeing the effect of starting a double shift qr iteration
given by h44, h33, & h43h34 and see if this would make

PDLACP2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLACP3()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLACPY()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLAEVSWP()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLANGE()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLAPIV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLAPV2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLAQGE()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLAQSY()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLARED1D()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLARED2D()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLARFB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLARFG()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLARFT()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLARZB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLARZT()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLASCL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLASE2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLASET()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLASMSUB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLASSQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLASWP()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLATRA()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLATRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLATRZ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLAUU2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLAUUM()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDLAWIL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORG2L()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORG2R()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORGL2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORGLQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORGQL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORGQR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORGR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORGRQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORM2L()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORM2R()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMBR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMHR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORML2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMLQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMQL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMQR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMR3()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMRQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMRZ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDORMTR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPOCON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPOEQU()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPORFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPOSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPOSVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPOTF2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPOTRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPOTRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPOTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPTSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDPTTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDRSCL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSTEIN()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the r processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYEV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYEVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYGS2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYGST()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYGVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYNGST()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYNTRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYTD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYTRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDSYTTRD()

locp( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locq( k ) denotes the number of elements of k that a

PDTRCON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDTRRFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDTRTI2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDTRTRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDTRTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDTZRZF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PDZSUM1()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PJLAENV()

into a single character string.  for example, uplo = 'u',
trans = 't', and diag = 'n' for a triangular routine would

PSCSUM1()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSDBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSDBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSDTSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSDTTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEBD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEBRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGECON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEEQU()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEHD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEHRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGELQ2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGELQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGELS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEQL2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEQLF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEQPF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEQR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGEQRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGERFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGERQ2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGERQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGESV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGESVD()

assume that its process grid has dimension r x c. locr( k ) denotes
the number of elements of k that a process would receive if k wer
locc( k ) denotes the number of elements of k that a process would

PSGESVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGETF2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGETRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGETRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGETRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGGQRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSGGRQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLABRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLACON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLACONSB()

seeing the effect of starting a double shift qr iteration
given by h44, h33, & h43h34 and see if this would make

PSLACP2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLACP3()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLACPY()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLAEVSWP()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLANGE()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLAPIV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLAPV2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLAQGE()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLAQSY()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLARED1D()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLARED2D()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLARFB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLARFG()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLARFT()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLARZB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLARZT()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLASCL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLASE2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLASET()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLASMSUB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLASSQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLASWP()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLATRA()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLATRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLATRZ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLAUU2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLAUUM()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSLAWIL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORG2L()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORG2R()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORGL2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORGLQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORGQL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORGQR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORGR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORGRQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORM2L()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORM2R()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMBR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMHR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORML2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMLQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMQL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMQR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMR3()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMRQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMRZ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSORMTR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPOCON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPOEQU()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPORFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPOSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPOSVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPOTF2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPOTRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPOTRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPOTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPTSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSPTTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSRSCL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSTEIN()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the r processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYEV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYEVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYGS2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYGST()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYGVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYNGST()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYNTRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYTD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYTRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSSYTTRD()

locp( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locq( k ) denotes the number of elements of k that a

PSTRCON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSTRRFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSTRTI2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSTRTRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSTRTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PSTZRZF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZDBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZDBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZDRSCL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZDTSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZDTTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEBD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEBRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGECON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEEQU()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEHD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEHRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGELQ2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGELQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGELS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEQL2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEQLF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEQPF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEQR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGEQRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGERFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGERQ2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGERQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGESV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGESVD()

assume that its process grid has dimension r x c. locr( k ) denotes
the number of elements of k that a process would receive if k wer
locc( k ) denotes the number of elements of k that a process would

PZGESVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGETF2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGETRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGETRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGETRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGGQRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZGGRQF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHEEV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHEEVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHEGS2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHEGST()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHEGVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHENGST()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHENTRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHETD2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHETRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZHETTRD()

locp( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locq( k ) denotes the number of elements of k that a

PZLABRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLACGV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLACON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLACONSB()

seeing the effect of starting a double shift qr iteration
given by h44, h33, & h43h34 and see if this would make

PZLACP2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLACP3()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLACPY()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLAEVSWP()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLANGE()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLAPIV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLAPV2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLAQGE()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLAQSY()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLARFB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLARFG()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLARFT()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLARZB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLARZT()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLASCL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLASE2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLASET()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLASMSUB()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLASSQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLASWP()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLATRA()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLATRD()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLATRZ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLAUU2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLAUUM()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZLAWIL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZMAX1()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPBSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPBTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPOCON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPOEQU()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPORFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPOSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPOSVX()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPOTF2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPOTRF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPOTRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPOTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPTSV()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZPTTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZSTEIN()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the r processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZTRCON()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZTREVC()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the r processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZTRRFS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZTRTI2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZTRTRI()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZTRTRS()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZTZRZF()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNG2L()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNG2R()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNGL2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNGLQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNGQL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNGQR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNGR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNGRQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNM2L()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNM2R()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMBR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMHR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNML2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMLQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMQL()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMQR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMR2()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMR3()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMRQ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMRZ()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

PZUNMTR()

locr( k ) denotes the number of elements of k that a process
would receive if k were distributed over the p processes of it
similarly, locc( k ) denotes the number of elements of k that a

ZLAHQR2()

determine the effect of starting the double-shift qr
iteration at row m, and see if this would make h(m,m-1

WPCGEBRD

PCGESVD()

watobd = max(max(wpclange,WPCGEBRD)

WPCLANGE

PCGESVD()

watobd = max(max(WPCLANGE,wpcgebrd)

WPCLARED1D

PCGESVD()

where wpclange, WPCLARED1D, wpclared2d, wpcgebrd are th
pclange, pslared1d, pslared2d, pcgebrd. using the

WPCLARED2D

PCGESVD()

watobd = max(max(wpclange,wpcgebrd),
max(WPCLARED2D,wp(pre)lared1d))
where wpclange, wpclared1d, wpclared2d, wpcgebrd are the

WPCORMBRPRT

PCGESVD()

max(wcbdsqr,
max(wantu*wpcormbrqln, wantvt*WPCORMBRPRT))
where

WPCORMBRQLN

PCGESVD()

max(wcbdsqr,
max(wantu*WPCORMBRQLN, wantvt*wpcormbrprt))
where

WPDGEBRD

PDGESVD()

watobd = max(max(wpdlange,WPDGEBRD)

WPDLANGE

PDGESVD()

watobd = max(max(WPDLANGE,wpdgebrd)

WPDLARED1D

PDGESVD()

where wpdlange, WPDLARED1D, wpdlared2d, wpdgebrd are th
pdlange, pdlared1d, pdlared2d, pdgebrd. using the

PZGESVD()

wpzlange = mp,
WPDLARED1D = nq0
wpzgebrd = nb*(mp + nq + 1) + nq,

WPDLARED2D

PDGESVD()

watobd = max(max(wpdlange,wpdgebrd),
max(WPDLARED2D,wp(pre)lared1d))
where wpdlange, wpdlared1d, wpdlared2d, wpdgebrd are the

PZGESVD()

wpdlared1d = nq0,
WPDLARED2D = mp0

WPDORMBRPRT

PDGESVD()

max(wdbdsqr,
max(wantu*wpdormbrqln, wantvt*WPDORMBRPRT))
where

WPDORMBRQLN

PDGESVD()

max(wdbdsqr,
max(wantu*WPDORMBRQLN, wantvt*wpdormbrprt))
where

WPSGEBRD

PSGESVD()

watobd = max(max(wpslange,WPSGEBRD)

WPSLANGE

PSGESVD()

watobd = max(max(WPSLANGE,wpsgebrd)

WPSLARED1D

PCGESVD()

wpclange = mp,
WPSLARED1D = nq0
wpcgebrd = nb*(mp + nq + 1) + nq,

PSGESVD()

where wpslange, WPSLARED1D, wpslared2d, wpsgebrd are th
pslange, pslared1d, pslared2d, psgebrd. using the

WPSLARED2D

PCGESVD()

wpslared1d = nq0,
WPSLARED2D = mp0

PSGESVD()

watobd = max(max(wpslange,wpsgebrd),
max(WPSLARED2D,wp(pre)lared1d))
where wpslange, wpslared1d, wpslared2d, wpsgebrd are the

WPSORMBRPRT

PSGESVD()

max(wsbdsqr,
max(wantu*wpsormbrqln, wantvt*WPSORMBRPRT))
where

WPSORMBRQLN

PSGESVD()

max(wsbdsqr,
max(wantu*WPSORMBRQLN, wantvt*wpsormbrprt))
where

WPZGEBRD

PZGESVD()

watobd = max(max(wpzlange,WPZGEBRD)

WPZLANGE

PZGESVD()

watobd = max(max(WPZLANGE,wpzgebrd)

WPZLARED1D

PZGESVD()

where wpzlange, WPZLARED1D, wpzlared2d, wpzgebrd are th
pzlange, pdlared1d, pdlared2d, pzgebrd. using the

WPZLARED2D

PZGESVD()

watobd = max(max(wpzlange,wpzgebrd),
max(WPZLARED2D,wp(pre)lared1d))
where wpzlange, wpzlared1d, wpzlared2d, wpzgebrd are the

WPZORMBRPRT

PZGESVD()

max(wzbdsqr,
max(wantu*wpzormbrqln, wantvt*WPZORMBRPRT))
where

WPZORMBRQLN

PZGESVD()

max(wzbdsqr,
max(wantu*WPZORMBRQLN, wantvt*wpzormbrprt))
where

writing

PCDBTRF()

the following method uses more flops than necessary but
does not necessitate the writing of a new blas routine

PCPBTRF()

the following method uses more flops than necessary but
does not necessitate the writing of a new blas routine

PDDBTRF()

the following method uses more flops than necessary but
does not necessitate the writing of a new blas routine

PDPBTRF()

the following method uses more flops than necessary but
does not necessitate the writing of a new blas routine

PSDBTRF()

the following method uses more flops than necessary but
does not necessitate the writing of a new blas routine

PSPBTRF()

the following method uses more flops than necessary but
does not necessitate the writing of a new blas routine

PZDBTRF()

the following method uses more flops than necessary but
does not necessitate the writing of a new blas routine

PZPBTRF()

the following method uses more flops than necessary but
does not necessitate the writing of a new blas routine

written

PCGBTRS()

complex temporary workspace. this space may
be overwritten in between calls to routines. work must b
on exit, work( 1 ) contains the minimal lwork.

PCGESVD()

m-by-n matrix a, optionally computing the left and/or right
singular vectors. the svd is written a
a = u * sigma * transpose(v)

PCGESVX()

scaling of the matrix a, but if equilibration is used, a is
overwritten by diag(r)*a*diag(c) and b by diag(r)*b (if trans='n'

PCHENTRD()

the diagonal and first superdiagonal of sub( a ) are over-
written by the corresponding elements of the tridiagona
with the array tau, represent the unitary matrix q as a

PCHETD2()

the diagonal and first superdiagonal of sub( a ) are over-
written by the corresponding elements of the tridiagona
with the array tau, represent the unitary matrix q as a

PCHETRD()

the diagonal and first superdiagonal of sub( a ) are over-
written by the corresponding elements of the tridiagona
with the array tau, represent the unitary matrix q as a

PCHETTRD()

the diagonal and first superdiagonal of sub( a ) are over-
written by the corresponding elements of the tridiagona
with the array tau, represent the unitary matrix q as a

PCPOSV()

ted matrix sub( b ). on exit, if info = 0, sub( b ) is over-
written with the solution distributed matrix x
ib      (global input) integer

PDGBTRS()

double precision temporary workspace. this space may
be overwritten in between calls to routines. work must b
on exit, work( 1 ) contains the minimal lwork.

PDGESVD()

m-by-n matrix a, optionally computing the left and/or right
singular vectors. the svd is written a
a = u * sigma * transpose(v)

PDGESVX()

scaling of the matrix a, but if equilibration is used, a is
overwritten by diag(r)*a*diag(c) and b by diag(r)*b (if trans='n'

PDPOSV()

ted matrix sub( b ). on exit, if info = 0, sub( b ) is over-
written with the solution distributed matrix x
ib      (global input) integer

PDSYNTRD()

the diagonal and first superdiagonal of sub( a ) are over-
written by the corresponding elements of the tridiagona
with the array tau, represent the orthogonal matrix q as a

PDSYTD2()

the diagonal and first superdiagonal of sub( a ) are over-
written by the corresponding elements of the tridiagona
with the array tau, represent the orthogonal matrix q as a

PDSYTRD()

the diagonal and first superdiagonal of sub( a ) are over-
written by the corresponding elements of the tridiagona
with the array tau, represent the orthogonal matrix q as a

PDSYTTRD()

the diagonal and first superdiagonal of sub( a ) are over-
written by the corresponding elements of the tridiagona
with the array tau, represent the unitary matrix q as a

PSGBTRS()

real temporary workspace. this space may
be overwritten in between calls to routines. work must b
on exit, work( 1 ) contains the minimal lwork.

PSGESVD()

m-by-n matrix a, optionally computing the left and/or right
singular vectors. the svd is written a
a = u * sigma * transpose(v)

PSGESVX()

scaling of the matrix a, but if equilibration is used, a is
overwritten by diag(r)*a*diag(c) and b by diag(r)*b (if trans='n'

PSPOSV()

ted matrix sub( b ). on exit, if info = 0, sub( b ) is over-
written with the solution distributed matrix x
ib      (global input) integer

PSSYNTRD()

the diagonal and first superdiagonal of sub( a ) are over-
written by the corresponding elements of the tridiagona
with the array tau, represent the orthogonal matrix q as a

PSSYTD2()

the diagonal and first superdiagonal of sub( a ) are over-
written by the corresponding elements of the tridiagona
with the array tau, represent the orthogonal matrix q as a

PSSYTRD()

the diagonal and first superdiagonal of sub( a ) are over-
written by the corresponding elements of the tridiagona
with the array tau, represent the orthogonal matrix q as a

PSSYTTRD()

the diagonal and first superdiagonal of sub( a ) are over-
written by the corresponding elements of the tridiagona
with the array tau, represent the unitary matrix q as a

PZGBTRS()

complex*16 temporary workspace. this space may
be overwritten in between calls to routines. work must b
on exit, work( 1 ) contains the minimal lwork.

PZGESVD()

m-by-n matrix a, optionally computing the left and/or right
singular vectors. the svd is written a
a = u * sigma * transpose(v)

PZGESVX()

scaling of the matrix a, but if equilibration is used, a is
overwritten by diag(r)*a*diag(c) and b by diag(r)*b (if trans='n'

PZHENTRD()

the diagonal and first superdiagonal of sub( a ) are over-
written by the corresponding elements of the tridiagona
with the array tau, represent the unitary matrix q as a

PZHETD2()

the diagonal and first superdiagonal of sub( a ) are over-
written by the corresponding elements of the tridiagona
with the array tau, represent the unitary matrix q as a

PZHETRD()

the diagonal and first superdiagonal of sub( a ) are over-
written by the corresponding elements of the tridiagona
with the array tau, represent the unitary matrix q as a

PZHETTRD()

the diagonal and first superdiagonal of sub( a ) are over-
written by the corresponding elements of the tridiagona
with the array tau, represent the unitary matrix q as a

PZPOSV()

ted matrix sub( b ). on exit, if info = 0, sub( b ) is over-
written with the solution distributed matrix x
ib      (global input) integer

WSBDSQR

PSGESVD()

wbdtosvd = size*(wantu*nru + wantvt*ncvt) +
max(WSBDSQR

www

PCHEEVD()

(see also lapack working note 132)
http://www.netlib.org/lapack/lawns/lawn132.p
=====================================================================

PDSTEDC()

(see also lapack working note 132)
http://www.netlib.org/lapack/lawns/lawn132.p
=====================================================================

PDSYEVD()

(see also lapack working note 132)
http://www.netlib.org/lapack/lawns/lawn132.p
=====================================================================

PSSTEDC()

(see also lapack working note 132)
http://www.netlib.org/lapack/lawns/lawn132.p
=====================================================================

PSSYEVD()

(see also lapack working note 132)
http://www.netlib.org/lapack/lawns/lawn132.p
=====================================================================

PZHEEVD()

(see also lapack working note 132)
http://www.netlib.org/lapack/lawns/lawn132.p
=====================================================================

WZBDSQR

PZGESVD()

wbdtosvd = size*(wantu*nru + wantvt*ncvt) +
max(WZBDSQR