Compute the eigenvalues of a symmetric tridiagonal matrix in parallel
SUBROUTINE PSSTEBZ( ICTXT, RANGE, ORDER, N, VL, VU, IL, IU, ABSTOL, D, E, M, NSPLIT, W, IBLOCK, ISPLIT, WORK, LWORK, IWORK, LIWORK, INFO )
CHARACTER ORDER, RANGE
INTEGER ICTXT, IL, INFO, IU, LIWORK, LWORK, M, N, NSPLIT
REAL ABSTOL, VL, VU
INTEGER IBLOCK( * ), ISPLIT( * ), IWORK( * )
REAL D( * ), E( * ), W( * ), WORK( * )
PSSTEBZ computes the eigenvalues of a symmetric tridiagonal matrix in parallel. The user may ask for all eigenvalues, all eigenvalues in the interval [VL, VU], or the eigenvalues indexed IL through IU. A static partitioning of work is done at the beginning of PSSTEBZ which results in all processes finding an (almost) equal number of eigenvalues.
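The idea of a static partition can be illustrated with a short Python sketch. This is not the partitioning code used by PSSTEBZ; the function name partition_indices and the rule "give the first few processes one extra eigenvalue" are assumptions chosen only to show how a range of eigenvalue indices can be divided almost equally.

```python
def partition_indices(il, iu, nprocs):
    """Split the 1-based index range il..iu into nprocs contiguous pieces.

    Hypothetical helper: returns one (first, last) pair per process.
    A pair with last < first means that process received no indices.
    """
    total = iu - il + 1
    base, extra = divmod(total, nprocs)
    ranges = []
    start = il
    for p in range(nprocs):
        # The first `extra` processes take one additional index.
        count = base + (1 if p < extra else 0)
        ranges.append((start, start + count - 1))
        start += count
    return ranges
```

With 10 eigenvalues and 4 processes, no process receives more than one eigenvalue beyond any other.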
NOTE: It is assumed that the user is on an IEEE machine. If the user is not on an IEEE machine, set the compile time flag NO_IEEE to 1 (in SLmake.inc). The features of IEEE arithmetic that are needed for the "fast" Sturm Count are: (a) infinity arithmetic, (b) the sign bit of a double precision floating point number is assumed to be in the 32nd or 64th bit position, and (c) the sign of negative zero.
See W. Kahan "Accurate Eigenvalues of a Symmetric Tridiagonal Matrix", Report CS41, Computer Science Dept., Stanford University, July 21, 1966.
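For readers unfamiliar with the Sturm Count, the following Python sketch shows the textbook recurrence and a plain bisection loop built on it. It is a simplified illustration, not the "fast" IEEE-dependent variant described above. The names sturm_count and bisect_eigenvalue and the guard value pivmin are assumptions made for this example.

```python
def sturm_count(d, e, x, pivmin=1e-300):
    """Return how many eigenvalues of the tridiagonal matrix are less than x.

    d holds the n diagonal entries and e the n-1 off-diagonal entries.
    pivmin is a hypothetical guard that replaces a zero pivot.
    """
    count = 0
    q = 0.0
    for i in range(len(d)):
        if i == 0:
            q = d[0] - x
        else:
            q = d[i] - x - e[i - 1] ** 2 / q
        if abs(q) < pivmin:
            q = -pivmin  # avoid dividing by zero in the next step
        if q < 0.0:
            count += 1  # each negative pivot marks one eigenvalue below x
    return count


def bisect_eigenvalue(d, e, k, lo, hi, abstol):
    """Locate the k-th smallest eigenvalue (k is 1-based) inside [lo, hi].

    Assumes sturm_count(lo) < k <= sturm_count(hi) on entry.
    """
    while hi - lo > abstol:
        mid = 0.5 * (lo + hi)
        if mid <= lo or mid >= hi:
            break  # interval cannot be halved any further
        if sturm_count(d, e, mid) >= k:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)
```

For the 3-by-3 matrix with diagonal 2 and off-diagonal 1, the eigenvalues are 2 - sqrt(2), 2, and 2 + sqrt(2), so the count at x = 3 is 2.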
ICTXT (global input) INTEGER
The BLACS context handle.
RANGE (global input) CHARACTER
Specifies which eigenvalues are to be found.
= 'A': ("All") all eigenvalues will be found.
= 'V': ("Value") all eigenvalues in the interval [VL, VU] will be found.
= 'I': ("Index") the IL-th through IU-th eigenvalues (of the entire matrix) will be found.
ORDER (global input) CHARACTER
Specifies the order in which the eigenvalues and their block numbers are stored in W and IBLOCK.
= 'B': ("By Block") the eigenvalues will be grouped by split-off block (see IBLOCK, ISPLIT) and ordered from smallest to largest within the block.
= 'E': ("Entire matrix") the eigenvalues for the entire matrix will be ordered from smallest to largest.
N (global input) INTEGER
The order of the tridiagonal matrix T. N >= 0.
VL (global input) REAL
If RANGE='V', the lower bound of the interval to be searched for eigenvalues. Eigenvalues less than VL will not be returned. Not referenced if RANGE='A' or 'I'.
VU (global input) REAL
If RANGE='V', the upper bound of the interval to be searched for eigenvalues. Eigenvalues greater than VU will not be returned. VU must be greater than VL. Not referenced if RANGE='A' or 'I'.
IL (global input) INTEGER
If RANGE='I', the index (from smallest to largest) of the smallest eigenvalue to be returned. IL must be at least 1. Not referenced if RANGE='A' or 'V'.
IU (global input) INTEGER
If RANGE='I', the index (from smallest to largest) of the largest eigenvalue to be returned. IU must be at least IL and no greater than N. Not referenced if RANGE='A' or 'V'.
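The meaning of RANGE, VL, VU, IL and IU can be summarized by a small Python sketch that selects from an already computed list of eigenvalues. The function select_eigenvalues is hypothetical and only mirrors the argument rules stated above; it treats the interval as closed, as written in this page.

```python
def select_eigenvalues(eigs, rng, vl=None, vu=None, il=None, iu=None):
    """Return the eigenvalues that a given RANGE setting would request.

    eigs is a list of all eigenvalues; rng is 'A', 'V' or 'I'.
    """
    n = len(eigs)
    ordered = sorted(eigs)
    if rng == 'A':
        return ordered
    if rng == 'V':
        if not vu > vl:
            raise ValueError("VU must be greater than VL")
        return [w for w in ordered if vl <= w <= vu]
    if rng == 'I':
        if not (1 <= il <= iu <= n):
            raise ValueError("need 1 <= IL <= IU <= N")
        # IL and IU are 1-based and inclusive.
        return ordered[il - 1:iu]
    raise ValueError("RANGE must be 'A', 'V' or 'I'")
```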
ABSTOL (global input) REAL
The absolute tolerance for the eigenvalues. An eigenvalue (or cluster) is considered to be located if it has been determined to lie in an interval whose width is ABSTOL or less. If ABSTOL is less than or equal to zero, then ULP*|T| will be used, where |T| means the 1-norm of T. Eigenvalues will be computed most accurately when ABSTOL is set to the underflow threshold SLAMCH('U'), not zero. Note : If eigenvectors are desired later by inverse iteration ( PSSTEIN ), ABSTOL should be set to 2*PSLAMCH('S').
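The fallback tolerance ULP*|T| can be reproduced by hand. The Python sketch below computes the 1-norm of a tridiagonal matrix as the largest absolute column sum and applies the rule for ABSTOL <= 0. The constant ULP_SINGLE = 2**-23 is an assumption (IEEE single precision), and the helper names are hypothetical.

```python
ULP_SINGLE = 2.0 ** -23  # assumed: distance from 1 to the next larger single


def tridiag_one_norm(d, e):
    """1-norm (largest absolute column sum) of a symmetric tridiagonal matrix."""
    n = len(d)
    norm = 0.0
    for j in range(n):
        col = abs(d[j])
        if j > 0:
            col += abs(e[j - 1])  # entry above the diagonal
        if j < n - 1:
            col += abs(e[j])      # entry below the diagonal
        norm = max(norm, col)
    return norm


def effective_abstol(abstol, d, e, ulp=ULP_SINGLE):
    """Tolerance actually used: ABSTOL if positive, otherwise ULP*|T|."""
    if abstol > 0.0:
        return abstol
    return ulp * tridiag_one_norm(d, e)
```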
D (global input) REAL array, dimension (N)
The n diagonal elements of the tridiagonal matrix T. To avoid overflow, the matrix must be scaled so that its largest entry is no greater than overflow**(1/2) * underflow**(1/4) in absolute value, and for greatest accuracy, it should not be much smaller than that.
E (global input) REAL array, dimension (N-1)
The (n-1) off-diagonal elements of the tridiagonal matrix T. To avoid overflow, the matrix must be scaled so that its largest entry is no greater than overflow**(1/2) * underflow**(1/4) in absolute value, and for greatest accuracy, it should not be much smaller than that.
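To get a feel for the scaling limit that applies to D and E, the bound overflow**(1/2) * underflow**(1/4) can be evaluated directly. The Python sketch below assumes IEEE single precision values for the overflow and underflow thresholds; both constants and the helper names are assumptions made for illustration.

```python
OVERFLOW_SINGLE = (2.0 - 2.0 ** -23) * 2.0 ** 127  # assumed largest finite single
UNDERFLOW_SINGLE = 2.0 ** -126                     # assumed smallest normal single


def scaling_bound(overflow=OVERFLOW_SINGLE, underflow=UNDERFLOW_SINGLE):
    """Largest permitted matrix entry: overflow**(1/2) * underflow**(1/4)."""
    return overflow ** 0.5 * underflow ** 0.25


def needs_scaling(d, e, bound=None):
    """True if any entry of D or E exceeds the bound in absolute value."""
    if bound is None:
        bound = scaling_bound()
    largest = max([abs(v) for v in d] + [abs(v) for v in e], default=0.0)
    return largest > bound
```

Under these assumptions the bound is roughly 2**32.5, or about 6.07e9.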
M (global output) INTEGER
The actual number of eigenvalues found. 0 <= M <= N. (See also the description of INFO=2)
NSPLIT (global output) INTEGER
The number of diagonal blocks in the matrix T. 1 <= NSPLIT <= N.
W (global output) REAL array, dimension (N)
On exit, the first M elements of W contain the eigenvalues on all processes.
IBLOCK (global output) INTEGER array, dimension (N)
At each row/column j where E(j) is zero or small, the matrix T is considered to split into a block diagonal matrix. On exit, IBLOCK(i) specifies which block (from 1 to the number of blocks) the eigenvalue W(i) belongs to. NOTE: in the (theoretically impossible) event that bisection does not converge for some or all eigenvalues, INFO is set to 1 and the eigenvalues that failed to converge are identified by a negative block number.
ISPLIT (global output) INTEGER array, dimension (N)
The splitting points, at which T breaks up into submatrices. The first submatrix consists of rows/columns 1 to ISPLIT(1), the second of rows/columns ISPLIT(1)+1 through ISPLIT(2), etc., and the NSPLIT-th consists of rows/columns ISPLIT(NSPLIT-1)+1 through ISPLIT(NSPLIT)=N. (Only the first NSPLIT elements will actually be used, but since the user cannot know a priori what value NSPLIT will have, N words must be reserved for ISPLIT.)
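The relationship between E, ISPLIT and NSPLIT can be shown with a Python sketch. The splitting criterion used here (|E(j)| <= tol, with tol supplied by the caller) is an assumption, since this page only says "zero or small". Both function names are hypothetical, and N >= 1 is assumed.

```python
def find_splits(e, n, tol=0.0):
    """Return the splitting points (1-based) of an n-by-n tridiagonal matrix.

    A split is placed after row/column j when |e[j-1]| <= tol.
    The last entry is always n.
    """
    isplit = [j + 1 for j in range(n - 1) if abs(e[j]) <= tol]
    isplit.append(n)
    return isplit


def block_ranges(isplit, nsplit):
    """Turn ISPLIT(1..NSPLIT) into a list of (first_row, last_row) pairs."""
    ranges = []
    first = 1
    for k in range(nsplit):
        ranges.append((first, isplit[k]))
        first = isplit[k] + 1
    return ranges
```

Only the first NSPLIT entries of ISPLIT are read by block_ranges, which matches the note above about reserving N words.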
WORK (local workspace) REAL array, dimension ( MAX( 5*N, 7 ) )
LWORK (local input) INTEGER
The size of the array WORK. LWORK must be >= MAX( 5*N, 7 ). If LWORK = -1, then LWORK is global input and a workspace query is assumed; the routine only calculates the minimum and optimal size for all work arrays. Each of these values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA.
IWORK (local workspace) INTEGER array, dimension ( MAX( 4*N, 14, NPROCS ) )
LIWORK (local input) INTEGER
The size of the array IWORK. LIWORK must be >= MAX( 4*N, 14, NPROCS ). If LIWORK = -1, then LIWORK is global input and a workspace query is assumed; the routine only calculates the minimum and optimal size for all work arrays. Each of these values is returned in the first entry of the corresponding work array, and no error message is issued by PXERBLA.
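The minimum workspace sizes follow directly from the formulas given for LWORK and LIWORK. A Python sketch of that arithmetic (the function name min_workspace is hypothetical) is:

```python
def min_workspace(n, nprocs):
    """Return (LWORK, LIWORK) minimums for matrix order n and nprocs processes."""
    lwork = max(5 * n, 7)
    liwork = max(4 * n, 14, nprocs)
    return lwork, liwork
```

For small N the constant terms dominate, and on large process grids NPROCS can determine LIWORK.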
INFO (global output) INTEGER
= 0 : successful exit
< 0 : if INFO = -i, the i-th argument had an illegal value
> 0 : some or all of the eigenvalues failed to converge or were not computed:
= 1 : Bisection failed to converge for some eigenvalues; these eigenvalues are flagged by a negative block number. The effect is that the eigenvalues may not be as accurate as the absolute and relative tolerances. This is generally caused by arithmetic which is less accurate than PSLAMCH says.
= 2 : There is a mismatch between the number of eigenvalues output and the number desired.
= 3 : RANGE='I', and the Gershgorin interval initially used was incorrect. No eigenvalues were computed. Probable cause: your machine has sloppy floating point arithmetic. Cure: Increase the PARAMETER "FUDGE", recompile, and try again.
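A caller that wraps this routine typically maps INFO to a readable message. The Python sketch below encodes only the cases listed above; the function name describe_info and the message wording are assumptions.

```python
def describe_info(info):
    """Translate an INFO return code into a short description."""
    if info == 0:
        return "successful exit"
    if info < 0:
        return "argument %d had an illegal value" % (-info)
    messages = {
        1: "bisection failed to converge for some eigenvalues",
        2: "mismatch between eigenvalues output and eigenvalues desired",
        3: "initial Gershgorin interval was incorrect; nothing computed",
    }
    # Any other positive value is not documented on this page.
    return messages.get(info, "undocumented positive INFO value")
```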
RELFAC REAL, default = 2.0
The relative tolerance. An interval [a,b] lies within "relative tolerance" if b-a < RELFAC*ulp*max(|a|,|b|), where "ulp" is the machine precision (distance from 1 to the next larger floating point number.)
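The relative tolerance test can be written out as a one-line predicate. In the Python sketch below, the default ulp = 2**-23 assumes IEEE single precision, and the function name within_relative_tolerance is hypothetical.

```python
def within_relative_tolerance(a, b, relfac=2.0, ulp=2.0 ** -23):
    """True if [a, b] satisfies b - a < RELFAC * ulp * max(|a|, |b|)."""
    return (b - a) < relfac * ulp * max(abs(a), abs(b))
```

Near 1.0 the threshold is about 2.4e-7, so an interval of width 1e-8 passes while one of width 1e-3 does not.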
FUDGE REAL, default = 2.0
A "fudge factor" to widen the Gershgorin intervals. Ideally, a value of 1 should work, but on machines with sloppy arithmetic, this needs to be larger. The default for publicly released versions should be large enough to handle the worst machine around. Note that this has no effect on the accuracy of the solution.
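As an illustration of what widening the Gershgorin intervals means, the Python sketch below computes the Gershgorin interval of a symmetric tridiagonal matrix and enlarges it by a margin proportional to a fudge factor. The margin formula fudge * tnorm * ulp * n is an assumption loosely modeled on common practice in serial bisection codes; this page does not state the exact expression used by PSSTEBZ. N >= 1 is assumed and the function name is hypothetical.

```python
def gershgorin_interval(d, e, fudge=2.0, ulp=2.0 ** -23):
    """Return a widened (lo, hi) interval that contains all eigenvalues."""
    n = len(d)
    lo = float("inf")
    hi = float("-inf")
    for i in range(n):
        # Radius of the i-th Gershgorin disc: sum of off-diagonal magnitudes.
        radius = 0.0
        if i > 0:
            radius += abs(e[i - 1])
        if i < n - 1:
            radius += abs(e[i])
        lo = min(lo, d[i] - radius)
        hi = max(hi, d[i] + radius)
    tnorm = max(abs(lo), abs(hi))
    margin = fudge * tnorm * ulp * n  # assumed widening rule
    return lo - margin, hi + margin
```

Setting fudge to 0 returns the unwidened interval, which makes the effect of the factor easy to compare.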