dgssvx

DGSSVX solves the system of linear equations $A\cdot X = B$ or $A^T\cdot X = B$

It uses the LU factorization from dgstrf. Error bounds on the solution and a condition estimate are also provided. More details are give below.

A is NCformat

If A is stored column-wise (A->Stype = SLU_NC):

If options->Equil = YES, scaling factors are computed to equilibrate the system:

options->Trans = NOTRANS

\text{diag}(R) \cdot A \cdot \text{diag}(C) \cdot \text{diag}(C)^{-1}\cdot X = \text{diag}(R) \cdot B

options->Trans = TRANS:

(\text{diag}(R) \cdot A \cdot \text{diag}(C))^{T} \cdot \text{diag}(R)^{-1} \cdot X = \text{diag}(C) \cdot B

options->Trans = CONJ

(\text{diag}(R) \cdot A \cdot \text{diag}(C))^{H} \cdot \text{diag}(R) \cdot X = \text{diag}(C) \cdot B

note

Whether the system will be equilibrated depends on the scaling of the matrix A, but if equilibration is used, A is overwritten by $\text{diag}(R)\cdot A \cdot \text{diag}(C)$ and B by $\text{diag}(R) \cdot B$ (if options->Trans=NOTRANS) or $\text{diag}(C) \cdot B$ (if options->Trans = TRANS or CONJ).

Permute columns of A, forming A*Pc, where Pc is a permutation matrix that usually preserves sparsity. For more details of this step, see sp_preorder.c.
If options->Fact != FACTORED, the LU decomposition is used to factor the matrix A (after equilibration if options->Equil = YES) as Pr*A*Pc = L*U, with Pr determined by partial pivoting.
Compute the reciprocal pivot growth factor.
If some U(i,i) = 0, so that U is exactly singular, then the routine returns with info = i. Otherwise, the factored form of A is used to estimate the condition number of the matrix A. If the reciprocal of the condition number is less than machine precision, info = A->ncol+1 is returned as a warning, but the routine still goes on to solve for X and computes error bounds as described below.
The system of equations is solved for X using the factored form of A.
If options->IterRefine != NOREFINE, iterative refinement is applied to improve the computed solution matrix and calculate error bounds, and backward error estimates for it.
If equilibration was used, the matrix X is premultiplied by diag(C) (if options->Trans = NOTRANS) or diag(R) (if options->Trans = TRANS or CONJ) so that it solves the original system before equilibration.

A is NRformat

If A is stored row-wise (A->Stype = SLU_NR), apply the above algorithm to the transpose of A:

If options->Equil = YES, scaling factors are computed to equilibrate the system:

options->Trans = NOTRANS: $\text{diag}(R) \cdot A \cdot \text{diag}(C) \cdot \text{diag}(C)^{-1} \cdot X = \text{diag}(R) \cdot B$
options->Trans = TRANS: $(\text{diag}(R) \cdot A \cdot \text{diag}(C))^{T} \cdot \text{diag}(R)^{-1} \cdot X = \text{diag}(C) \cdot B$
options->Trans = CONJ: $(\text{diag}(R) \cdot A \cdot \text{diag}(C))^{H} \cdot \text{diag}(R)^{-1} \cdot X = \text{diag}(C) \cdot B$

note

Whether the system will be equilibrated depends on the scaling of the matrix A, but if equilibration is used, $A^{T}$ is overwritten by $\text{diag}(R) \cdot A^{T} \cdot \text{diag}(C)$ and B by $\text{diag}(R)\cdot B$ (if trans='N') or $\text{diag}(C) \cdot B$ (if trans = 'T' or 'C').

Permute columns of transpose(A) (rows of A), forming transpose(A)*Pc, where Pc is a permutation matrix that usually preserves sparsity. For more details of this step, see sp_preorder.c.
If options->Fact != FACTORED, the LU decomposition is used to factor the transpose(A) (after equilibration if options->Fact = YES) as Pr*transpose(A)*Pc = L*U with the permutation Pr determined by partial pivoting.
Compute the reciprocal pivot growth factor.
If some U(i,i) = 0, so that U is exactly singular, then the routine returns with info = i. Otherwise, the factored form of transpose(A) is used to estimate the condition number of the matrix A. If the reciprocal of the condition number is less than machine precision, info = A->nrow+1 is returned as a warning, but the routine still goes on to solve for X and computes error bounds as described below.
The system of equations is solved for X using the factored form of transpose(A).
If options->IterRefine != NOREFINE, iterative refinement is applied to improve the computed solution matrix and calculate error bounds, and backward error estimates for it.
If equilibration was used, the matrix X is premultiplied by diag(C) (if options->Trans = NOTRANS) or diag(R) (if options->Trans = TRANS or CONJ) so that it solves the original system before equilibration.

! void
! dgssvx(superlu_options_t *options, SuperMatrix*A, int *perm_c, int*perm_r,
!        int *etree, char*equed, double *R, double*C,
!        SuperMatrix *L, SuperMatrix*U, void *work, int lwork,
!        SuperMatrix*B, SuperMatrix *X, double*recip_pivot_growth,
!        double *rcond, double*ferr, double *berr,
!        GlobalLU_t*Glu, mem_usage_t *mem_usage, SuperLUStat_t*stat, int *info )
INTERFACE
  SUBROUTINE dgssvx(options, A, perm_c, perm_r, &
    & etree, equed, R, C, L, U, work, lwork, &
    & B, X, recip_pivot_growth, rcond, ferr, berr, &
    & Glu, mem_usage, stat, info) &
    & BIND(C, name="dgssvx")
    IMPORT :: superlu_options_t, SuperLUStat_t, C_INT, C_PTR, &
      & SuperMatrix, GlobalLU_t, mem_usage_t, C_CHAR, C_DOUBLE
    !
    TYPE(superlu_options_t), INTENT(IN) :: options
    TYPE(SuperMatrix), INTENT(INOUT) :: A
    INTEGER(C_INT), INTENT(INOUT) :: perm_c(*)
    INTEGER(C_INT), INTENT(INOUT) :: perm_r(*)
    INTEGER(C_INT), INTENT(INOUT) :: etree(*)
    CHARACTER(1, kind=C_CHAR), INTENT(inout) :: equed(*)
    REAL(C_DOUBLE), INTENT(inout) :: R(*)
    REAL(C_DOUBLE), INTENT(inout) :: C(*)
    TYPE(SuperMatrix), INTENT(INOUT) :: L
    TYPE(SuperMatrix), INTENT(INOUT) :: U
    TYPE(C_PTR), INTENT(inout) :: work
    INTEGER(C_INT), VALUE, INTENT(IN) :: lwork
    ! lwork   (input) int
    !         Specifies the size of work array in bytes.
    !         = 0:  allocate space internally by system malloc;
    !         > 0:  use user-supplied work array of length lwork in bytes,
    !               returns error if space runs out.
    !         = -1: the routine guesses the amount of space needed without
    !               performing the factorization, and returns it in
    !               mem_usage->total_needed; no other side effects.
    !
    !         See argument 'mem_usage' for memory usage statistics.
    TYPE(SuperMatrix), INTENT(INOUT) :: B
    ! B       (input/output) SuperMatrix*
    !         B has types: Stype = SLU_DN, Dtype = SLU_D, Mtype = SLU_GE.
    !         On entry, the right hand side matrix.
    !         If B->ncol = 0, only LU decomposition is performed, the triangular
    !                         solve is skipped.
    !         On exit,
    !            if equed = 'N', B is not modified; otherwise
    !            if A->Stype = SLU_NC:
    !               if options->Trans = NOTRANS and equed = 'R' or 'B',
    !                  B is overwritten by diag(R)*B;
    !               if options->Trans = TRANS or CONJ and equed = 'C' of 'B',
    !                  B is overwritten by diag(C)*B;
    !            if A->Stype = SLU_NR:
    !               if options->Trans = NOTRANS and equed = 'C' or 'B',
    !                  B is overwritten by diag(C)*B;
    !               if options->Trans = TRANS or CONJ and equed = 'R' of 'B',
    !                  B is overwritten by diag(R)*B.
    TYPE(SuperMatrix), INTENT(INOUT) :: X
    ! X       (output) SuperMatrix*
    !         X has types: Stype = SLU_DN, Dtype = SLU_D, Mtype = SLU_GE.
    !         If info = 0 or info = A->ncol+1, X contains the solution matrix
    !         to the original system of equations. Note that A and B are modified
    !         on exit if equed is not 'N', and the solution to the equilibrated
    !         system is inv(diag(C))*X if options->Trans = NOTRANS and
    !         equed = 'C' or 'B', or inv(diag(R))*X if options->Trans = 'T' or 'C'
    !         and equed = 'R' or 'B'.
    !
    REAL(C_DOUBLE), INTENT(INOUT) :: recip_pivot_growth
    ! recip_pivot_growth (output) double*
    !         The reciprocal pivot growth factor max_j( norm(A_j)/norm(U_j) ).
    !         The infinity norm is used. If recip_pivot_growth is much less
    !         than 1, the stability of the LU factorization could be poor.
    REAL(C_DOUBLE), INTENT(inout) :: rcond
    ! rcond   (output) double*
    !         The estimate of the reciprocal condition number of the matrix A
    !         after equilibration (if done). If rcond is less than the machine
    !         precision (in particular, if rcond = 0), the matrix is singular
    !         to working precision. This condition is indicated by a return
    !         code of info > 0.
    REAL(C_DOUBLE), INTENT(inout) :: ferr(*)
    ! FERR    (output) double*, dimension (B->ncol)
    !         The estimated forward error bound for each solution vector
    !         X(j) (the j-th column of the solution matrix X).
    !         If XTRUE is the true solution corresponding to X(j), FERR(j)
    !         is an estimated upper bound for the magnitude of the largest
    !         element in (X(j) - XTRUE) divided by the magnitude of the
    !         largest element in X(j).  The estimate is as reliable as
    !         the estimate for RCOND, and is almost always a slight
    !         overestimate of the true error.
    !         If options->IterRefine = NOREFINE, ferr = 1.0.
    REAL(C_DOUBLE), INTENT(inout) :: berr(*)
    ! BERR    (output) double*, dimension (B->ncol)
    !         The componentwise relative backward error of each solution
    !         vector X(j) (i.e., the smallest relative change in
    !         any element of A or B that makes X(j) an exact solution).
    !         If options->IterRefine = NOREFINE, berr = 1.0.
    TYPE(GlobalLU_t), INTENT(inout) :: Glu
    ! (input/output) GlobalLU_t *
    ! If options->Fact == SamePattern_SameRowPerm, it is an input;
    !     The matrix A will be factorized assuming that a
    !     factorization of a matrix with the same sparsity pattern
    !     and similar numerical values was performed prior to this one.
    !     Therefore, this factorization will reuse both row and column
    !       scaling factors R and C, both row and column permutation
    !       vectors perm_r and perm_c, and the L & U data structures
    !       set up from the previous factorization.
    ! Otherwise, it is an output.
    TYPE(mem_usage_t), INTENT(inout) :: mem_usage
    ! mem_usage (output) mem_usage_t*
    !         Record the memory usage statistics, consisting of following fields:
    !         - for_lu (float)
    !           The amount of space used in bytes for L\U data structures.
    !         - total_needed (float)
    !           The amount of space needed in bytes to perform factorization.
    !         - expansions (int)
    !           The number of memory expansions during the LU factorization.
    TYPE(SuperLUStat_t), INTENT(INOUT) :: stat
    ! stat   (output) SuperLUStat_t*
    !        Record the statistics on runtime and floating-point operation count.
    !        See slu_util.h for the definition of 'SuperLUStat_t'.
    INTEGER(C_INT), INTENT(INOUT) :: info
    ! info    (output) int*
    !         = 0: successful exit
    !         < 0: if info = -i, the i-th argument had an illegal value
    !         > 0: if info = i, and i is
    !              <= A->ncol: U(i,i) is exactly zero. The factorization has
    !                    been completed, but the factor U is exactly
    !                    singular, so the solution and error bounds
    !                    could not be computed.
    !              = A->ncol+1: U is nonsingular, but RCOND is less than machine
    !                    precision, meaning that the matrix is singular to
    !                    working precision. Nevertheless, the solution and
    !                    error bounds are computed because there are a number
    !                    of situations where the computed solution can be more
    !                    accurate than the value of RCOND would suggest.
    !              > A->ncol+1: number of bytes allocated when memory allocation
    !                    failure occurred, plus A->ncol.
  END SUBROUTINE dgssvx
END INTERFACE
PUBLIC :: dgssvx

options

options (input) superlu_options_t*

The structure defines the input parameters to control how the LU decomposition will be performed and how the system will be solved.

A

A       (input/output) SuperMatrix*

Matrix A in A*X=B, of dimension (A->nrow, A->ncol). The number of the linear equations is A->nrow.

Currently, the type of can be:

Stype = SLU_NC or SLU_NR
Dtype = SLU_D
Mtype = SLU_GE.

In the future, more general A may be handled.

On entry

If options->Fact = FACTORED and equed is not 'N', then A must-have been equilibrated by the scaling factors in R and/or C.

On exit

A is not modified if options->Equil = NO, or if options->Equil = YES but equed = 'N' on exit.

Otherwise, if options->Equil = YES and equed is not 'N', the A is scaled as follows:

If A->Stype = SLU_NC:

equed = 'R', A := diag(R)* A
equed = 'C', A := A *diag(C)
equed = 'B', A := diag(R)* A *diag(C)

If A->Stype = SLU_NR:

equed = 'R', transpose(A) := diag(R)* transpose(A)
equed = 'C', transpose(A) := transpose(A) *diag(C)
equed = 'B', transpose(A) := diag(R)* transpose(A) *diag(C).

`perm_c`

(input/output) int*

If A->Stype = SLU_NC, Column permutation vector of size A->ncol, which defines the permutation matrix Pc; perm_c[i] = j means column i of A is in position j in A*Pc.

On exit, perm_c may be overwritten by the product of the input perm_c and a permutation that postorders the elimination tree of Pc'*A'*A*Pc; perm_c is not changed if the elimination tree is already in postorder.

If A->Stype = SLU_NR, column permutation vector of size A->nrow, which describes permutation of columns of transpose(A) (rows of A) as described above.

`perm_r`

(input/output) int*

If A->Stype = SLU_NC, row permutation vector of size A->nrow, which defines the permutation matrix Pr, and is determined by partial pivoting. perm_r[i] = j means row i of A is in position j in Pr*A.

If A->Stype = SLU_NR, permutation vector of size A->ncol, which determines permutation of rows of transpose(A) (columns of A) as described above.

If options->Fact = SamePattern_SameRowPerm, the pivoting routine will try to use the input perm_r, unless a certain threshold criterion is violated. In that case, perm_r is overwritten by a new permutation determined by partial pivoting or diagonal threshold pivoting. Otherwise, perm_r is output argument.

etree

(input/output) int*,  dimension (A->ncol)

Elimination tree of Pc'*A'*A*Pc. If options->Fact != FACTORED and options->Fact != DOFACT, etree is an input argument, otherwise it is an output argument.

note

etree is a vector of parent pointers for a forest whose vertices are the integers 0 to A->ncol-1; etree[root]==A->ncol.

equed

(input/output) char*

Specifies the form of equilibration that was done.

'N': No equilibration.
'R': Row equilibration, i.e., A was premultiplied by diag(R).
'C': Column equilibration, i.e., A was postmultiplied by diag(C).
'B': Both row and column equilibration, i.e., A was replaced by diag(R)*A*diag(C).

If options->Fact = FACTORED, equed is an input argument, otherwise it is an output argument.

R

(input/output) double*, dimension (A->nrow)

The row scale factors for A or transpose(A).

If equed = 'R' or 'B', A (if A->Stype = SLU_NC) or transpose(A) (if A->Stype = SLU_NR) is multiplied on the left by diag(R).
If equed = 'N' or 'C', R is not accessed.
For options->Fact = FACTORED, R is an input argument, otherwise, R is output.
For options->Fact = FACTORED and equed = 'R' or 'B', each element of R must be positive.

C

(input/output) double*, dimension (A->ncol)

The column scale factors for A or transpose(A).

If equed = 'C' or 'B', A (if A->Stype = SLU_NC) or transpose(A) (if A->Stype = SLU_NR) is multiplied on the right by diag(C).
If equed = 'N' or 'R', C is not accessed.
For options->Fact = FACTORED, C is an input argument, otherwise, C is output.
For options->Fact = FACTORED and equed = 'C' or 'B', each element of C must be positive.

L

(output) SuperMatrix*

The factor L from the factorization

Pr*A*Pc=L*U              (if A->Stype SLU_= NC) or
Pr*transpose(A)*Pc=L*U   (if A->Stype = SLU_NR).

Uses compressed row subscripts storage for supernodes, i.e., L has types:

Stype = SLU_SC, Dtype = SLU_D, Mtype = SLU_TRLU.

U

(output) SuperMatrix*

The factor U from the factorization

Pr*A*Pc=L*U              (if A->Stype = SLU_NC) or
Pr*transpose(A)*Pc=L*U   (if A->Stype = SLU_NR).

Uses column-wise storage scheme, i.e., U has types:

Stype = SLU_NC, Dtype = SLU_D, Mtype = SLU_TRU.

work

(workspace/output) void*, size (lwork) (in bytes)

User supplied workspace, should be large enough to hold data structures for factors L and U. On exit, if fact is not 'F', L and U point to this array.

Glu

(input/output) GlobalLU_t *

If options->Fact == SamePattern_SameRowPerm, it is an input. The matrix A will be factorized assuming that a factorization of a matrix with the same sparsity pattern and similar numerical values was performed prior to this one. Therefore, this factorization will reuse both row and column scaling factors R and C, both row and column sermutation vectors perm_r and perm_c, and the L & U data structures set up from the previous factorization.

Otherwise, it is an output.

A is NCformat​

A is NRformat​

options​

A​

perm_c​

perm_r​

etree​

equed​

R​

C​

L​

U​

work​

Glu​

A is NCformat

A is NRformat

options

A

`perm_c`

`perm_r`

etree

equed

R

C

L

U

work

Glu