Routine: PCHEGVX()  File: SRC\pchegvx.f

 
 
# lines: 836
  # code: 836
  # comment: 0
  # blank:0
# Variables:88
# Callers:0
# Callings:5
# Words:415
# Keywords:260
 

 

..
     .. Array Arguments ..
     ..
  Purpose
  =======
  PCHEGVX computes all the eigenvalues, and optionally,
  the eigenvectors
  of a complex generalized Hermitian-definite eigenproblem, of the form
  sub( A )*x=(lambda)*sub( B )*x,  sub( A )*sub( B )x=(lambda)*x,  or
  sub( B )*sub( A )*x=(lambda)*x.
  Here sub( A ) denoting A( IA:IA+N-1, JA:JA+N-1 ) is assumed to be
  Hermitian, and sub( B ) denoting B( IB:IB+N-1, JB:JB+N-1 ) is assumed
  to be Hermitian positive definite.
  Notes
  =====
  Each global data object is described by an associated description
  vector.  This vector stores the information required to establish
  the mapping between an object element and its corresponding process
  and memory location.
  Let A be a generic term for any 2D block cyclicly distributed array.
  Such a global array has an associated description vector DESCA.
  In the following comments, the character _ should be read as
  "of the global array".
  NOTATION        STORED IN      EXPLANATION
  --------------- -------------- --------------------------------------
  DTYPE_A(global) DESCA( DTYPE_ )The descriptor type.  In this case,
                                 DTYPE_A = 1.
  CTXT_A (global) DESCA( CTXT_ ) The BLACS context handle, indicating
                                 the BLACS process grid A is distribu-
                                 ted over. The context itself is glo-
                                 bal, but the handle (the integer
                                 value) may vary.
  M_A    (global) DESCA( M_ )    The number of rows in the global
                                 array A.
  N_A    (global) DESCA( N_ )    The number of columns in the global
                                 array A.
  MB_A   (global) DESCA( MB_ )   The blocking factor used to distribute
                                 the rows of the array.
  NB_A   (global) DESCA( NB_ )   The blocking factor used to distribute
                                 the columns of the array.
  RSRC_A (global) DESCA( RSRC_ ) The process row over which the first
                                 row of the array A is distributed.
  CSRC_A (global) DESCA( CSRC_ ) The process column over which the
                                 first column of the array A is
                                 distributed.
  LLD_A  (local)  DESCA( LLD_ )  The leading dimension of the local
                                 array.  LLD_A >= MAX(1,LOCr(M_A)).
  Let K be the number of rows or columns of a distributed matrix,
  and assume that its process grid has dimension p x q.
  LOCr( K ) denotes the number of elements of K that a process
  would receive if K were distributed over the p processes of its
  process column.
  Similarly, LOCc( K ) denotes the number of elements of K that a
  process would receive if K were distributed over the q processes of
  its process row.
  The values of LOCr() and LOCc() may be determined via a call to the
  ScaLAPACK tool function, NUMROC:
          LOCr( M ) = NUMROC( M, MB_A, MYROW, RSRC_A, NPROW ),
          LOCc( N ) = NUMROC( N, NB_A, MYCOL, CSRC_A, NPCOL ).
  An upper bound for these quantities may be computed by:
          LOCr( M ) <= ceil( ceil(M/MB_A)/NPROW )*MB_A
          LOCc( N ) <= ceil( ceil(N/NB_A)/NPCOL )*NB_A
  Arguments
  =========
  IBTYPE   (global input) INTEGER
          Specifies the problem type to be solved:
          = 1:  sub( A )*x = (lambda)*sub( B )*x
          = 2:  sub( A )*sub( B )*x = (lambda)*x
          = 3:  sub( B )*sub( A )*x = (lambda)*x
  JOBZ    (global input) CHARACTER*1
          = 'N':  Compute eigenvalues only;
          = 'V':  Compute eigenvalues and eigenvectors.
  RANGE   (global input) CHARACTER*1
          = 'A': all eigenvalues will be found.
          = 'V': all eigenvalues in the interval [VL,VU] will be found.
          = 'I': the IL-th through IU-th eigenvalues will be found.
  UPLO    (global input) CHARACTER*1
          = 'U':  Upper triangles of sub( A ) and sub( B ) are stored;
          = 'L':  Lower triangles of sub( A ) and sub( B ) are stored.
  N       (global input) INTEGER
          The order of the matrices sub( A ) and sub( B ).  N >= 0.
  A       (local input/local output) COMPLEX pointer into the
          local memory to an array of dimension (LLD_A, LOCc(JA+N-1)).
          On entry, this array contains the local pieces of the
          N-by-N Hermitian distributed matrix sub( A ). If UPLO = 'U',
          the leading N-by-N upper triangular part of sub( A ) contains
          the upper triangular part of the matrix.  If UPLO = 'L', the
          leading N-by-N lower triangular part of sub( A ) contains
          the lower triangular part of the matrix.
          On exit, if JOBZ = 'V', then if INFO = 0, sub( A ) contains
          the distributed matrix Z of eigenvectors.  The eigenvectors
          are normalized as follows:
          if IBTYPE = 1 or 2, Z**H*sub( B )*Z = I;
          if IBTYPE = 3, Z**H*inv( sub( B ) )*Z = I.
          If JOBZ = 'N', then on exit the upper triangle (if UPLO='U')
          or the lower triangle (if UPLO='L') of sub( A ), including
          the diagonal, is destroyed.
  IA      (global input) INTEGER
          The row index in the global array A indicating the first
          row of sub( A ).
  JA      (global input) INTEGER
          The column index in the global array A indicating the
          first column of sub( A ).
  DESCA   (global and local input) INTEGER array of dimension DLEN_.
          The array descriptor for the distributed matrix A.
          If DESCA( CTXT_ ) is incorrect, PCHEGVX cannot guarantee
          correct error reporting.
  B       (local input/local output) COMPLEX pointer into the
          local memory to an array of dimension (LLD_B, LOCc(JB+N-1)).
          On entry, this array contains the local pieces of the
          N-by-N Hermitian distributed matrix sub( B ). If UPLO = 'U',
          the leading N-by-N upper triangular part of sub( B ) contains
          the upper triangular part of the matrix.  If UPLO = 'L', the
          leading N-by-N lower triangular part of sub( B ) contains
          the lower triangular part of the matrix.
          On exit, if INFO <= N, the part of sub( B ) containing the
          matrix is overwritten by the triangular factor U or L from
          the Cholesky factorization sub( B ) = U**H*U or
          sub( B ) = L*L**H.
  IB      (global input) INTEGER
          The row index in the global array B indicating the first
          row of sub( B ).
  JB      (global input) INTEGER
          The column index in the global array B indicating the
          first column of sub( B ).
  DESCB   (global and local input) INTEGER array of dimension DLEN_.
          The array descriptor for the distributed matrix B.
          DESCB( CTXT_ ) must equal DESCA( CTXT_ )
  VL      (global input) REAL
          If RANGE='V', the lower bound of the interval to be searched
          for eigenvalues.  Not referenced if RANGE = 'A' or 'I'.
  VU      (global input) REAL
          If RANGE='V', the upper bound of the interval to be searched
          for eigenvalues.  Not referenced if RANGE = 'A' or 'I'.
  IL      (global input) INTEGER
          If RANGE='I', the index (from smallest to largest) of the
          smallest eigenvalue to be returned.  IL >= 1.
          Not referenced if RANGE = 'A' or 'V'.
  IU      (global input) INTEGER
          If RANGE='I', the index (from smallest to largest) of the
          largest eigenvalue to be returned.  min(IL,N) <= IU <= N.
          Not referenced if RANGE = 'A' or 'V'.
  ABSTOL  (global input) REAL
          If JOBZ='V', setting ABSTOL to PSLAMCH( CONTEXT, 'U') yields
          the most orthogonal eigenvectors.
          The absolute error tolerance for the eigenvalues.
          An approximate eigenvalue is accepted as converged
          when it is determined to lie in an interval [a,b]
          of width less than or equal to
                  ABSTOL + EPS *   max( |a|,|b| ) ,
          where EPS is the machine precision.  If ABSTOL is less than
          or equal to zero, then EPS*norm(T) will be used in its place,
          where norm(T) is the 1-norm of the tridiagonal matrix
          obtained by reducing A to tridiagonal form.
          Eigenvalues will be computed most accurately when ABSTOL is
          set to twice the underflow threshold 2*PSLAMCH('S') not zero.
          If this routine returns with ((MOD(INFO,2).NE.0) .OR.
          (MOD(INFO/8,2).NE.0)), indicating that some eigenvalues or
          eigenvectors did not converge, try setting ABSTOL to
          2*PSLAMCH('S').
          See "Computing Small Singular Values of Bidiagonal Matrices
          with Guaranteed High Relative Accuracy," by Demmel and
          Kahan, LAPACK Working Note #3.
          See "On the correctness of Parallel Bisection in Floating
          Point" by Demmel, Dhillon and Ren, LAPACK Working Note #70
  M       (global output) INTEGER
          Total number of eigenvalues found.  0 <= M <= N.
  NZ      (global output) INTEGER
          Total number of eigenvectors computed.  0 <= NZ <= M.
          The number of columns of Z that are filled.
          If JOBZ .NE. 'V', NZ is not referenced.
          If JOBZ .EQ. 'V', NZ = M unless the user supplies
          insufficient space and PCHEGVX is not able to detect this
          before beginning computation.  To get all the eigenvectors
          requested, the user must supply both sufficient
          space to hold the eigenvectors in Z (M .LE. DESCZ(N_))
          and sufficient workspace to compute them.  (See LWORK below.)
          PCHEGVX is always able to detect insufficient space without
          computation unless RANGE .EQ. 'V'.
  W       (global output) REAL array, dimension (N)
          On normal exit, the first M entries contain the selected
          eigenvalues in ascending order.
  ORFAC   (global input) REAL
          Specifies which eigenvectors should be reorthogonalized.
          Eigenvectors that correspond to eigenvalues which are within
          tol=ORFAC*norm(A) of each other are to be reorthogonalized.
          However, if the workspace is insufficient (see LWORK),
          tol may be decreased until all eigenvectors to be
          reorthogonalized can be stored in one process.
          No reorthogonalization will be done if ORFAC equals zero.
          A default value of 10^-3 is used if ORFAC is negative.
          ORFAC should be identical on all processes.
  Z       (local output) COMPLEX array,
          global dimension (N, N),
          local dimension ( LLD_Z, LOCc(JZ+N-1) )
          If JOBZ = 'V', then on normal exit the first M columns of Z
          contain the orthonormal eigenvectors of the matrix
          corresponding to the selected eigenvalues.  If an eigenvector
          fails to converge, then that column of Z contains the latest
          approximation to the eigenvector, and the index of the
          eigenvector is returned in IFAIL.
          If JOBZ = 'N', then Z is not referenced.
  IZ      (global input) INTEGER
          The row index in the global array Z indicating the first
          row of sub( Z ).
  JZ      (global input) INTEGER
          The column index in the global array Z indicating the
          first column of sub( Z ).
  DESCZ   (global and local input) INTEGER array of dimension DLEN_.
          The array descriptor for the distributed matrix Z.
          DESCZ( CTXT_ ) must equal DESCA( CTXT_ )
  WORK    (local workspace/output) COMPLEX array,
             dimension (LWORK)
          WORK(1) returns the optimal workspace.
  LWORK   (local input) INTEGER
          Size of WORK array.  If only eigenvalues are requested:
            LWORK >= N + MAX( NB * ( NP0 + 1 ), 3 )
          If eigenvectors are requested:
            LWORK >= N + ( NP0 + MQ0 + NB ) * NB
          with NQ0 = NUMROC( NN, NB, 0, 0, NPCOL ).
          For optimal performance, greater workspace is needed, i.e.
            LWORK >= MAX( LWORK, N + NHETRD_LWOPT,
              NHEGST_LWOPT )
          Where LWORK is as defined above, and
            NHETRD_LWORK = 2*( ANB+1 )*( 4*NPS+2 ) +
              ( NPS + 1 ) * NPS
            NHEGST_LWOPT =  2*NP0*NB + NQ0*NB + NB*NB
            NB = DESCA( MB_ )
            NP0 = NUMROC( N, NB, 0, 0, NPROW )
            NQ0 = NUMROC( N, NB, 0, 0, NPCOL )
            ICTXT = DESCA( CTXT_ )
            ANB = PJLAENV( ICTXT, 3, 'PCHETTRD', 'L', 0, 0, 0, 0 )
            SQNPC = SQRT( DBLE( NPROW * NPCOL ) )
            NPS = MAX( NUMROC( N, 1, 0, 0, SQNPC ), 2*ANB )
            NUMROC is a ScaLAPACK tool functions;
            PJLAENV is a ScaLAPACK envionmental inquiry function
            MYROW, MYCOL, NPROW and NPCOL can be determined by calling
            the subroutine BLACS_GRIDINFO.
          If LWORK = -1, then LWORK is global input and a workspace
          query is assumed; the routine only calculates the optimal
          size for all work arrays. Each of these values is returned
          in the first entry of the correspondingwork array, and no
          error message is issued by PXERBLA.
  RWORK   (local workspace/output) REAL array,
             dimension max(3,LRWORK)
          On return, RWORK(1) contains the amount of workspace
             required for optimal efficiency
          if JOBZ='N' RWORK(1) = optimal amount of workspace
             required to compute eigenvalues efficiently
          if JOBZ='V' RWORK(1) = optimal amount of workspace
             required to compute eigenvalues and eigenvectors
             efficiently with no guarantee on orthogonality.
             If RANGE='V', it is assumed that all eigenvectors
             may be required when computing optimal workspace.
  LRWORK  (local input) INTEGER
          Size of RWORK
          See below for definitions of variables used to define LRWORK.
          If no eigenvectors are requested (JOBZ = 'N') then
             LRWORK >= 5 * NN + 4 * N
          If eigenvectors are requested (JOBZ = 'V' ) then
             the amount of workspace required to guarantee that all
             eigenvectors are computed is:
             LRWORK >= 4*N + MAX( 5*NN, NP0 * MQ0 ) +
               ICEIL( NEIG, NPROW*NPCOL)*NN
             The computed eigenvectors may not be orthogonal if the
             minimal workspace is supplied and ORFAC is too small.
             If you want to guarantee orthogonality (at the cost
             of potentially poor performance) you should add
             the following to LRWORK:
                (CLUSTERSIZE-1)*N
             where CLUSTERSIZE is the number of eigenvalues in the
             largest cluster, where a cluster is defined as a set of
             close eigenvalues: { W(K),...,W(K+CLUSTERSIZE-1) |
                                  W(J+1) <= W(J) + ORFAC*2*norm(A) }
          Variable definitions:
             NEIG = number of eigenvectors requested
             NB = DESCA( MB_ ) = DESCA( NB_ ) = DESCZ( MB_ ) =
                  DESCZ( NB_ )
             NN = MAX( N, NB, 2 )
             DESCA( RSRC_ ) = DESCA( NB_ ) = DESCZ( RSRC_ ) =
                              DESCZ( CSRC_ ) = 0
             NP0 = NUMROC( NN, NB, 0, 0, NPROW )
             MQ0 = NUMROC( MAX( NEIG, NB, 2 ), NB, 0, 0, NPCOL )
             ICEIL( X, Y ) is a ScaLAPACK function returning
             ceiling(X/Y)
          When LRWORK is too small:
             If LRWORK is too small to guarantee orthogonality,
             PCHEGVX attempts to maintain orthogonality in
             the clusters with the smallest
             spacing between the eigenvalues.
             If LRWORK is too small to compute all the eigenvectors
             requested, no computation is performed and INFO=-25
             is returned.  Note that when RANGE='V', PCHEGVX does
             not know how many eigenvectors are requested until
             the eigenvalues are computed.  Therefore, when RANGE='V'
             and as long as LRWORK is large enough to allow PCHEGVX to
             compute the eigenvalues, PCHEGVX will compute the
             eigenvalues and as many eigenvectors as it can.
          Relationship between workspace, orthogonality & performance:
             If CLUSTERSIZE >= N/SQRT(NPROW*NPCOL), then providing
             enough space to compute all the eigenvectors
             orthogonally will cause serious degradation in
             performance. In the limit (i.e. CLUSTERSIZE = N-1)
             PCSTEIN will perform no better than CSTEIN on 1 processor.
             For CLUSTERSIZE = N/SQRT(NPROW*NPCOL) reorthogonalizing
             all eigenvectors will increase the total execution time
             by a factor of 2 or more.
             For CLUSTERSIZE > N/SQRT(NPROW*NPCOL) execution time will
             grow as the square of the cluster size, all other factors
             remaining equal and assuming enough workspace.  Less
             workspace means less reorthogonalization but faster
             execution.
          If LRWORK = -1, then LRWORK is global input and a workspace
          query is assumed; the routine only calculates the minimum
          and optimal size for all work arrays. Each of these
          values is returned in the first entry of the corresponding
          work array, and no error message is issued by PXERBLA.
  IWORK   (local workspace) INTEGER array
          On return, IWORK(1) contains the amount of integer workspace
          required.
  LIWORK  (local input) INTEGER
          size of IWORK
          LIWORK >= 6 * NNP
          Where:
            NNP = MAX( N, NPROW*NPCOL + 1, 4 )
          If LIWORK = -1, then LIWORK is global input and a workspace
          query is assumed; the routine only calculates the minimum
          and optimal size for all work arrays. Each of these
          values is returned in the first entry of the corresponding
          work array, and no error message is issued by PXERBLA.
  IFAIL   (output) INTEGER array, dimension (N)
          IFAIL provides additional information when INFO .NE. 0
          If (MOD(INFO/16,2).NE.0) then IFAIL(1) indicates the order of
          the smallest minor which is not positive definite.
          If (MOD(INFO,2).NE.0) on exit, then IFAIL contains the
          indices of the eigenvectors that failed to converge.
          If neither of the above error conditions hold and JOBZ = 'V',
          then the first M elements of IFAIL are set to zero.
  ICLUSTR (global output) integer array, dimension (2*NPROW*NPCOL)
          This array contains indices of eigenvectors corresponding to
          a cluster of eigenvalues that could not be reorthogonalized
          due to insufficient workspace (see LWORK, ORFAC and INFO).
          Eigenvectors corresponding to clusters of eigenvalues indexed
          ICLUSTR(2*I-1) to ICLUSTR(2*I), could not be
          reorthogonalized due to lack of workspace. Hence the
          eigenvectors corresponding to these clusters may not be
          orthogonal.  ICLUSTR() is a zero terminated array.
          (ICLUSTR(2*K).NE.0 .AND. ICLUSTR(2*K+1).EQ.0) if and only if
          K is the number of clusters
          ICLUSTR is not referenced if JOBZ = 'N'
  GAP     (global output) REAL array,
             dimension (NPROW*NPCOL)
          This array contains the gap between eigenvalues whose
          eigenvectors could not be reorthogonalized. The output
          values in this array correspond to the clusters indicated
          by the array ICLUSTR. As a result, the dot product between
          eigenvectors correspoding to the I^th cluster may be as high
          as ( C * n ) / GAP(I) where C is a small constant.
  INFO    (global output) INTEGER
          = 0:  successful exit
          < 0:  If the i-th argument is an array and the j-entry had
                an illegal value, then INFO = -(i*100+j), if the i-th
                argument is a scalar and had an illegal value, then
                INFO = -i.
          > 0:  if (MOD(INFO,2).NE.0), then one or more eigenvectors
                  failed to converge.  Their indices are stored
                  in IFAIL.  Send e-mail to scalapack@cs.utk.edu
                if (MOD(INFO/2,2).NE.0),then eigenvectors corresponding
                  to one or more clusters of eigenvalues could not be
                  reorthogonalized because of insufficient workspace.
                  The indices of the clusters are stored in the array
                  ICLUSTR.
                if (MOD(INFO/4,2).NE.0), then space limit prevented
                  PCHEGVX from computing all of the eigenvectors
                  between VL and VU.  The number of eigenvectors
                  computed is returned in NZ.
                if (MOD(INFO/8,2).NE.0), then PCSTEBZ failed to
                  compute eigenvalues.
                  Send e-mail to scalapack@cs.utk.edu
                if (MOD(INFO/16,2).NE.0), then B was not positive
                  definite.  IFAIL(1) indicates the order of
                  the smallest minor which is not positive definite.
  Alignment requirements
  ======================
  The distributed submatrices A(IA:*, JA:*), C(IC:IC+M-1,JC:JC+N-1),
  and B( IB:IB+N-1, JB:JB+N-1 ) must verify some alignment properties,
  namely the following expressions should be true:
     DESCA(MB_) = DESCA(NB_)
     IA = IB = IZ
     JA = IB = JZ
     DESCA(M_) = DESCB(M_) =DESCZ(M_)
     DESCA(N_) = DESCB(N_)= DESCZ(N_)
     DESCA(MB_) = DESCB(MB_) = DESCZ(MB_)
     DESCA(NB_) = DESCB(NB_) = DESCZ(NB_)
     DESCA(RSRC_) = DESCB(RSRC_) = DESCZ(RSRC_)
     DESCA(CSRC_) = DESCB(CSRC_) = DESCZ(CSRC_)
     MOD( IA-1, DESCA( MB_ ) ) = 0
     MOD( JA-1, DESCA( NB_ ) ) = 0
     MOD( IB-1, DESCB( MB_ ) ) = 0
     MOD( JB-1, DESCB( NB_ ) ) = 0
  =====================================================================
     .. Parameters ..

 
Display dynamic version Find AutoScroll Reload FontSize: - + Hide Comments Hide Blanks Frame FullScreen MailPrint

 
001        SUBROUTINE PCHEGVX( IBTYPE , JOBZ , RANGE , UPLO , N , A , IA , JA ,
002       $DESCA , B , IB , JB , DESCB , VL , VU , IL , IU ,
003       $ABSTOL , M , NZ , W , ORFAC , Z , IZ , JZ , DESCZ ,
004       $WORK , LWORK , RWORK , LRWORK , IWORK , LIWORK ,
005       $IFAIL , ICLUSTR , GAP , INFO )
006  
007  *     -- ScaLAPACK routine(version 1.7) --
008  *     University of Tennessee , Knoxville , Oak Ridge National Laboratory ,
009  *     and University of California , Berkeley.
010  *     October 15 , 1999
011  
012  *     .. Scalar Arguments ..
013        CHARACTER JOBZ , RANGE , UPLO
014        INTEGER IA , IB , IBTYPE , IL , INFO , IU , IZ , JA , JB , JZ ,
015       $LIWORK , LRWORK , LWORK , M , N , NZ
016        REAL ABSTOL , ORFAC , VL , VU
017        INTEGER BLOCK_CYCLIC_2D , DLEN_ , DTYPE_ , CTXT_ , M_ , N_ ,
018       $MB_ , NB_ , RSRC_ , CSRC_ , LLD_
019        PARAMETER( BLOCK_CYCLIC_2D = 1 , DLEN_ = 9 , DTYPE_ = 1 ,
020       $CTXT_ = 2 , M_ = 3 , N_ = 4 , MB_ = 5 , NB_ = 6 ,
021       $RSRC_ = 7 , CSRC_ = 8 , LLD_ = 9 )
022        COMPLEX ONE
023        PARAMETER( ONE = 1.0E + 0 )
024        REAL FIVE , ZERO
025        PARAMETER( FIVE = 5.0E + 0 , ZERO = 0.0E + 0 )
026        INTEGER IERRNPD
027        PARAMETER( IERRNPD = 16 )
028  *     ..
029  *     .. Local Scalars ..
030        LOGICAL ALLEIG , INDEIG , LQUERY , UPPER , VALEIG , WANTZ
031        CHARACTER TRANS
032        INTEGER ANB , IACOL , IAROW , IBCOL , IBROW , ICOFFA ,
033       $ICOFFB , ICTXT , IROFFA , IROFFB , LIWMIN , LRWMIN ,
034       $LRWOPT , LWMIN , LWOPT , MQ0 , MYCOL , MYROW , NB ,
035       $NEIG , NHEGST_LWOPT , NHETRD_LWOPT , NN , NP0 ,
036       $NPCOL , NPROW , NPS , NQ0 , SQNPC
037        REAL EPS , SCALE
038  *     ..
039  *     .. Local Arrays ..
040        INTEGER IDUM1( 5 ) , IDUM2( 5 )
041  *     ..
042  *     .. External Functions ..
043        LOGICAL LSAME
044        INTEGER ICEIL , INDXG2P , NUMROC , PJLAENV
045        REAL PSLAMCH
046        EXTERNAL LSAME , ICEIL , INDXG2P , NUMROC , PJLAENV , PSLAMCH
047  *     ..
048  *     .. External Subroutines ..
049        EXTERNAL BLACS_GRIDINFO , CHK1MAT , PCHEEVX , PCHENGST ,
050       $PCHK1MAT , PCHK2MAT , PCPOTRF , PCTRMM , PCTRSM ,
051       $PXERBLA , SGEBR2D , SGEBS2D , SSCAL
052  *     ..
053  *     .. Intrinsic Functions ..
054        INTRINSIC ABS , CMPLX , DBLE , ICHAR , INT , MAX , MIN , MOD ,
055       $REAL , SQRT
056  *     ..
057  *     .. Executable Statements ..
058  *     This is just to keep ftnchek and toolpack / 1 happy
059        IF( BLOCK_CYCLIC_2D*CSRC_*CTXT_*DLEN_*DTYPE_*LLD_*MB_*M_*NB_*N_*
060       $    RSRC_.LT.0 )RETURN
061  
062  *         Get grid parameters
063  
064            ICTXT = DESCA( CTXT_ )
065            CALL BLACS_GRIDINFO( ICTXT , NPROW , NPCOL , MYROW , MYCOL )
066  
067  *         Test the input parameters
068  
069            INFO = 0
070            IF( NPROW.EQ. - 1 ) THEN
071                INFO = - ( 900 + CTXT_ )
072            ELSE IF( DESCA( CTXT_ ).NE.DESCB( CTXT_ ) ) THEN
073                INFO = - ( 1300 + CTXT_ )
074            ELSE IF( DESCA( CTXT_ ).NE.DESCZ( CTXT_ ) ) THEN
075                INFO = - ( 2600 + CTXT_ )
076            ELSE
077  
078  *             Get machine constants.
079  
080                EPS = PSLAMCH( DESCA( CTXT_ ) , 'Precision' )
081  
082                WANTZ = LSAME( JOBZ , 'V' )
083                UPPER = LSAME( UPLO , 'U' )
084                ALLEIG = LSAME( RANGE , 'A' )
085                VALEIG = LSAME( RANGE , 'V' )
086                INDEIG = LSAME( RANGE , 'I' )
087                CALL CHK1MAT( N , 4 , N , 4 , IA , JA , DESCA , 9 , INFO )
088                CALL CHK1MAT( N , 4 , N , 4 , IB , JB , DESCB , 13 , INFO )
089                CALL CHK1MAT( N , 4 , N , 4 , IZ , JZ , DESCZ , 26 , INFO )
090                IF( INFO.EQ.0 ) THEN
091                    IF( MYROW.EQ.0 .AND. MYCOL.EQ.0 ) THEN
092                        RWORK( 1 ) = ABSTOL
093                        IF( VALEIG ) THEN
094                            RWORK( 2 ) = VL
095                            RWORK( 3 ) = VU
096                        ELSE
097                            RWORK( 2 ) = ZERO
098                            RWORK( 3 ) = ZERO
099                        END IF
100                        CALL SGEBS2D( DESCA( CTXT_ ) , 'ALL' , ' ' , 3 , 1 , RWORK ,
101       $                3 )
102                    ELSE
103                        CALL SGEBR2D( DESCA( CTXT_ ) , 'ALL' , ' ' , 3 , 1 , RWORK , 3 ,
104       $                0 , 0 )
105                    END IF
106                    IAROW = INDXG2P( IA , DESCA( MB_ ) , MYROW , DESCA( RSRC_ ) ,
107       $            NPROW )
108                    IBROW = INDXG2P( IB , DESCB( MB_ ) , MYROW , DESCB( RSRC_ ) ,
109       $            NPROW )
110                    IACOL = INDXG2P( JA , DESCA( NB_ ) , MYCOL , DESCA( CSRC_ ) ,
111       $            NPCOL )
112                    IBCOL = INDXG2P( JB , DESCB( NB_ ) , MYCOL , DESCB( CSRC_ ) ,
113       $            NPCOL )
114                    IROFFA = MOD( IA - 1 , DESCA( MB_ ) )
115                    ICOFFA = MOD( JA - 1 , DESCA( NB_ ) )
116                    IROFFB = MOD( IB - 1 , DESCB( MB_ ) )
117                    ICOFFB = MOD( JB - 1 , DESCB( NB_ ) )
118  
119  *                 Compute the total amount of space needed
120  
121                    LQUERY = .FALSE.
122                    IF( LWORK.EQ. - 1 .OR. LIWORK.EQ. - 1 .OR. LRWORK.EQ. - 1 )
123       $                LQUERY = .TRUE.
124  
125                        LIWMIN = 6*MAX( N ,( NPROW*NPCOL ) + 1 , 4 )
126  
127                        NB = DESCA( MB_ )
128                        NN = MAX( N , NB , 2 )
129                        NP0 = NUMROC( NN , NB , 0 , 0 , NPROW )
130  
131                        IF(( .NOT.WANTZ ) .OR.( VALEIG .AND.( .NOT.LQUERY ) ) )
132       $                    THEN
133                            LWMIN = N + MAX( NB*( NP0 + 1 ) , 3 )
134                            LWOPT = LWMIN
135                            LRWMIN = 5*NN + 4*N
136                            IF( WANTZ ) THEN
137                                MQ0 = NUMROC( MAX( N , NB , 2 ) , NB , 0 , 0 , NPCOL )
138                                LRWOPT = 4*N + MAX( 5*NN , NP0*MQ0 )
139                            ELSE
140                                LRWOPT = LRWMIN
141                            END IF
142                            NEIG = 0
143                        ELSE
144                            IF( ALLEIG .OR. VALEIG ) THEN
145                                NEIG = N
146                            ELSE IF( INDEIG ) THEN
147                                NEIG = IU - IL + 1
148                            END IF
149                            MQ0 = NUMROC( MAX( NEIG , NB , 2 ) , NB , 0 , 0 , NPCOL )
150                            LWMIN = N + ( NP0 + MQ0 + NB )*NB
151                            LWOPT = LWMIN
152                            LRWMIN = 4*N + MAX( 5*NN , NP0*MQ0 ) +
153       $                    ICEIL( NEIG , NPROW*NPCOL )*NN
154                            LRWOPT = LRWMIN
155  
156                        END IF
157  
158  *                     Compute how much workspace is needed to use the
159  *                     new TRD and GST algorithms
160  
161                        ANB = PJLAENV( ICTXT , 3 , 'PCHETTRD' , 'L' , 0 , 0 , 0 , 0 )
162                        SQNPC = INT( SQRT( DBLE( NPROW*NPCOL ) ) )
163                        NPS = MAX( NUMROC( N , 1 , 0 , 0 , SQNPC ) , 2*ANB )
164                        NHETRD_LWOPT = 2*( ANB + 1 )*( 4*NPS + 2 ) + ( NPS + 4 )*NPS
165                        NB = DESCA( MB_ )
166                        NP0 = NUMROC( N , NB , 0 , 0 , NPROW )
167                        NQ0 = NUMROC( N , NB , 0 , 0 , NPCOL )
168                        NHEGST_LWOPT = 2*NP0*NB + NQ0*NB + NB*NB
169                        LWOPT = MAX( LWOPT , N + NHETRD_LWOPT , NHEGST_LWOPT )
170  
171  *                     Version 1.0 Limitations
172  
173                        IF( IBTYPE.LT.1 .OR. IBTYPE.GT.3 ) THEN
174                            INFO = - 1
175                        ELSE IF( .NOT.( WANTZ .OR. LSAME( JOBZ , 'N' ) ) ) THEN
176                            INFO = - 2
177                        ELSE IF( .NOT.( ALLEIG .OR. VALEIG .OR. INDEIG ) ) THEN
178                            INFO = - 3
179                        ELSE IF( .NOT.UPPER .AND. .NOT.LSAME( UPLO , 'L' ) ) THEN
180                            INFO = - 4
181                        ELSE IF( N.LT.0 ) THEN
182                            INFO = - 5
183                        ELSE IF( IROFFA.NE.0 ) THEN
184                            INFO = - 7
185                        ELSE IF( ICOFFA.NE.0 ) THEN
186                            INFO = - 8
187                        ELSE IF( DESCA( MB_ ).NE.DESCA( NB_ ) ) THEN
188                            INFO = - ( 900 + NB_ )
189                        ELSE IF( DESCA( M_ ).NE.DESCB( M_ ) ) THEN
190                            INFO = - ( 1300 + M_ )
191                        ELSE IF( DESCA( N_ ).NE.DESCB( N_ ) ) THEN
192                            INFO = - ( 1300 + N_ )
193                        ELSE IF( DESCA( MB_ ).NE.DESCB( MB_ ) ) THEN
194                            INFO = - ( 1300 + MB_ )
195                        ELSE IF( DESCA( NB_ ).NE.DESCB( NB_ ) ) THEN
196                            INFO = - ( 1300 + NB_ )
197                        ELSE IF( DESCA( RSRC_ ).NE.DESCB( RSRC_ ) ) THEN
198                            INFO = - ( 1300 + RSRC_ )
199                        ELSE IF( DESCA( CSRC_ ).NE.DESCB( CSRC_ ) ) THEN
200                            INFO = - ( 1300 + CSRC_ )
201                        ELSE IF( DESCA( CTXT_ ).NE.DESCB( CTXT_ ) ) THEN
202                            INFO = - ( 1300 + CTXT_ )
203                        ELSE IF( DESCA( M_ ).NE.DESCZ( M_ ) ) THEN
204                            INFO = - ( 2200 + M_ )
205                        ELSE IF( DESCA( N_ ).NE.DESCZ( N_ ) ) THEN
206                            INFO = - ( 2200 + N_ )
207                        ELSE IF( DESCA( MB_ ).NE.DESCZ( MB_ ) ) THEN
208                            INFO = - ( 2200 + MB_ )
209                        ELSE IF( DESCA( NB_ ).NE.DESCZ( NB_ ) ) THEN
210                            INFO = - ( 2200 + NB_ )
211                        ELSE IF( DESCA( RSRC_ ).NE.DESCZ( RSRC_ ) ) THEN
212                            INFO = - ( 2200 + RSRC_ )
213                        ELSE IF( DESCA( CSRC_ ).NE.DESCZ( CSRC_ ) ) THEN
214                            INFO = - ( 2200 + CSRC_ )
215                        ELSE IF( DESCA( CTXT_ ).NE.DESCZ( CTXT_ ) ) THEN
216                            INFO = - ( 2200 + CTXT_ )
217                        ELSE IF( IROFFB.NE.0 .OR. IBROW.NE.IAROW ) THEN
218                            INFO = - 11
219                        ELSE IF( ICOFFB.NE.0 .OR. IBCOL.NE.IACOL ) THEN
220                            INFO = - 12
221                        ELSE IF( VALEIG .AND. N.GT.0 .AND. VU.LE.VL ) THEN
222                            INFO = - 15
223                        ELSE IF( INDEIG .AND.( IL.LT.1 .OR. IL.GT.MAX( 1 , N ) ) )
224       $                    THEN
225                            INFO = - 16
226                        ELSE IF( INDEIG .AND.( IU.LT.MIN( N , IL ) .OR. IU.GT.N ) )
227       $                    THEN
228                            INFO = - 17
229                        ELSE IF( VALEIG .AND.( ABS( RWORK( 2 ) - VL ).GT.FIVE*EPS*
230       $                    ABS( VL ) ) ) THEN
231                            INFO = - 14
232                        ELSE IF( VALEIG .AND.( ABS( RWORK( 3 ) - VU ).GT.FIVE*EPS*
233       $                    ABS( VU ) ) ) THEN
234                            INFO = - 15
235                        ELSE IF( ABS( RWORK( 1 ) - ABSTOL ).GT.FIVE*EPS*
236       $                    ABS( ABSTOL ) ) THEN
237                            INFO = - 18
238                        ELSE IF( LWORK.LT.LWMIN .AND. .NOT.LQUERY ) THEN
239                            INFO = - 28
240                        ELSE IF( LRWORK.LT.LRWMIN .AND. .NOT.LQUERY ) THEN
241                            INFO = - 30
242                        ELSE IF( LIWORK.LT.LIWMIN .AND. .NOT.LQUERY ) THEN
243                            INFO = - 32
244                        END IF
245                    END IF
246                    IDUM1( 1 ) = IBTYPE
247                    IDUM2( 1 ) = 1
248                    IF( WANTZ ) THEN
249                        IDUM1( 2 ) = ICHAR( 'V' )
250                    ELSE
251                        IDUM1( 2 ) = ICHAR( 'N' )
252                    END IF
253                    IDUM2( 2 ) = 2
254                    IF( UPPER ) THEN
255                        IDUM1( 3 ) = ICHAR( 'U' )
256                    ELSE
257                        IDUM1( 3 ) = ICHAR( 'L' )
258                    END IF
259                    IDUM2( 3 ) = 3
260                    IF( ALLEIG ) THEN
261                        IDUM1( 4 ) = ICHAR( 'A' )
262                    ELSE IF( INDEIG ) THEN
263                        IDUM1( 4 ) = ICHAR( 'I' )
264                    ELSE
265                        IDUM1( 4 ) = ICHAR( 'V' )
266                    END IF
267                    IDUM2( 4 ) = 4
268                    IF( LQUERY ) THEN
269                        IDUM1( 5 ) = - 1
270                    ELSE
271                        IDUM1( 5 ) = 1
272                    END IF
273                    IDUM2( 5 ) = 5
274                    CALL PCHK2MAT( N , 4 , N , 4 , IA , JA , DESCA , 9 , N , 4 , N , 4 , IB ,
275       $            JB , DESCB , 13 , 5 , IDUM1 , IDUM2 , INFO )
276                    CALL PCHK1MAT( N , 4 , N , 4 , IZ , JZ , DESCZ , 26 , 0 , IDUM1 , IDUM2 ,
277       $            INFO )
278                END IF
279  
280                IWORK( 1 ) = LIWMIN
281                WORK( 1 ) = CMPLX( REAL( LWOPT ) )
282                RWORK( 1 ) = REAL( LRWOPT )
283  
284                IF( INFO.NE.0 ) THEN
285                    CALL PXERBLA( ICTXT , 'PCHEGVX ' , - INFO )
286                    RETURN
287                ELSE IF( LQUERY ) THEN
288                    RETURN
289                END IF
290  
291  *             Form a Cholesky factorization of sub( B ).
292  
293                CALL PCPOTRF ( UPLO , N , B , IB , JB , DESCB , INFO )
294                IF( INFO.NE.0 ) THEN
295                    IWORK( 1 ) = LIWMIN
296                    WORK( 1 ) = CMPLX( REAL( LWOPT ) )
297                    RWORK( 1 ) = REAL( LRWOPT )
298                    IFAIL( 1 ) = INFO
299                    INFO = IERRNPD
300                    RETURN
301                END IF
302  
303  *             Transform problem to standard eigenvalue problem and solve.
304  
305                CALL PCHENGST ( IBTYPE , UPLO , N , A , IA , JA , DESCA , B , IB , JB ,
306       $        DESCB , SCALE , WORK , LWORK , INFO )
307                CALL PCHEEVX ( JOBZ , RANGE , UPLO , N , A , IA , JA , DESCA , VL , VU , IL ,
308       $        IU , ABSTOL , M , NZ , W , ORFAC , Z , IZ , JZ , DESCZ , WORK ,
309       $        LWORK , RWORK , LRWORK , IWORK , LIWORK , IFAIL , ICLUSTR ,
310       $        GAP , INFO )
311  
312                IF( WANTZ ) THEN
313  
314  *                 Backtransform eigenvectors to the original problem.
315  
316                    NEIG = M
317                    IF( IBTYPE.EQ.1 .OR. IBTYPE.EQ.2 ) THEN
318  
319  *                     For sub( A )*x =(lambda)*sub( B )*x and
320  *                     sub( A )*sub( B )*x =(lambda)*x ; backtransform eigenvectors :
321  *                     x = inv(L)'*y or inv(U)*y
322  
323                        IF( UPPER ) THEN
324                            TRANS = 'N'
325                        ELSE
326                            TRANS = 'C'
327                        END IF
328  
329                        CALL PCTRSM( 'Left' , UPLO , TRANS , 'Non - unit' , N , NEIG , ONE ,
330       $                B , IB , JB , DESCB , Z , IZ , JZ , DESCZ )
331  
332                    ELSE IF( IBTYPE.EQ.3 ) THEN
333  
334  *                     For sub( B )*sub( A )*x =(lambda)*x ;
335  *                     backtransform eigenvectors : x = L*y or U'*y
336  
337                        IF( UPPER ) THEN
338                            TRANS = 'C'
339                        ELSE
340                            TRANS = 'N'
341                        END IF
342  
343                        CALL PCTRMM( 'Left' , UPLO , TRANS , 'Non - unit' , N , NEIG , ONE ,
344       $                B , IB , JB , DESCB , Z , IZ , JZ , DESCZ )
345                    END IF
346                END IF
347  
348                IF( SCALE.NE.ONE ) THEN
349                    CALL SSCAL( N , SCALE , W , 1 )
350                END IF
351  
352                IWORK( 1 ) = LIWMIN
353                WORK( 1 ) = CMPLX( REAL( LWOPT ) )
354                RWORK( 1 ) = REAL( LRWOPT )
355                RETURN
356  
357  *             End of PCHEGVX
358  
359            END