Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Summary
1 Introduction
In the last years the insight about structured matrices has received a strong
impulse by several research groups in numerical linear algebra. Nearly ultimate
results have been recently achieved in two main directions.
1. The “horizontal” one, that is the development of effective mathematical
tools to describe and investigate new types of structures (e. g., the concept
of displacement operator, see [8]).
2. The “vertical” one, that is the construction of more and more efficient
algorithms for solving linear systems and least squares problems: in this
field the most attractive approach is the Preconditioned Conjugate Gradi-
ent Method (see [5] for a wide presentation of circulant preconditioning),
for which unifying convergence theories have been proposed very recently
[12, 6].
On the other hand, a wide number of applications take a great advantage from
the description in terms of matrix structures. Image processing is one of them
(perhaps the most popular): many papers of the group of Plemmons (see, e. g.,
[11]) are concerned with the reconstruction of atmospheric images, where the
blurring process can be well described with the help of Toeplitz matrices.
Besides, there are still several image applications for which structured matri-
ces have not been fully exploited. Here we present two different examples, by
discussing how structures can be incorporated in the mathematical models and
by pointing out some new open problems derived from these contexts.
2 A first application:
reconstruction of tomographical images
A numerical approach to the continuous model (2.1) must take into account the
discretization of the data set; leaving the pixel P treated as a continuous variable
leads to the so-called natural pixels operator [4]
R: f 7→ {g(sj ,θk ); j = 1, . . ., q, k = 0, . . . , p − 1}
acting from L2 (Ω) to IRpq . We assume, as is the standard in practice, that both
the θk ’s and the sj ’s are equally spaced, θk = k∆θ and sj = j∆s.
It is evident that R has a nontrivial null space, so that the equation Rf = g
potentially has infinitely many solutions. Among them, the most significant are
the following.
2
• The regularized solution fµ = R∗ (RR∗ + µI)−1 g, where µ is a suitable
parameter that avoids ill-conditioning effects on f † .
• The positive solution
f+ = arg min {kfkL2 : Rf = g, f ≥ 0} , (2.2)
which takes into account the physical meaning of the function f. This case
will be the subject of future work.
The first two choices lead to a linear system of pq equations in pq unknowns.
In fact, the adjoint R∗ maps a vector h = (hjk ) ∈ IRpq to the L2 function
q p−1
X X
(R∗ h)(P ) = hjkK(P ; sj ,θk ); (2.3)
j=1 k=0
3
The following result [9] states that, if rotational invariance occurs, the natural
pixels discretization allows us to find a strong structure in the matrix A.
Theorem 2.1 Let Uθ : IR2 → IR2 be the operator which rotates every point P
by the angle θ. Assume that the kernel K(P ; s,θ) of the integral operator (2.1)
satisfies the invariance property
4
Xk
Theorem 2.2 [3] Given the point P and the indices l and k, define =
Yk
Uθk P and the parameters
L+B L+B
ak = 1 − , bl,k = sl − Xk ,
R0 + L + B + Yk R0 + L + B + Yk
B B
ck = 1 − , dl,k = sl − Xk ,
R0 + L + B + Yk R0 + L + B + Yk
where the constants R0, L, B are the distances explained in Figure 1. Then
q
1 X ∆s bl,k + W/2 dl,k + W/2
K(P ; sj ,θk ) = min sj + , ,
∆s 2 ak ck
l=1
∆s bl,k − W/2 dl,k − W/2
− max sj − , , .
2 ak ck +
An immediate check shows that condition (2.6) is satisfied. Hence, the re-
construction of the body activity from the SPECT data can be accelerated by
exploiting the circulant structure of the normal equations. The efficient compu-
tation of the scalar products appearing in the entries of A is under investigation.
First of all there is a lack in literature about the fast computation of the positive
solution f+ given by (2.2), if we want to exploit the structure of RR∗ .
However, in our opinion the most interesting challenge consists of reducing the
complexity with respect to the number of bins, q: is there any structure “hidden”
in the variable s which makes it possible?
We believe that such a structure exists, even though it is not apparent. Our
“clue” is the success obtained in some tomographic problems by multigrid tech-
niques involving the variable s [7].
This class of methods is based on a sequence of increasing levels of discretiza-
tion, at which the continuous problem can be projected. Our normal equations
represent the finest level; the solution is computed by an iterative method where
each step is the composition of a smoothing technique (e. g., Gauss-Seidel it-
eration) and a recursive call to the same problem defined at the coarser level
(whose dimension is halved).
For the SPECT model, it seems natural to associate each level to the param-
eter ∆s that is doubled when we make the transition to the coarser level [10];
thus, the simplest equation to be solved in the recursive calls corresponds to as-
suming a single bin of width 128∆s (the total length of the detector) collecting
all the emissions passing through the different holes. Remark that the circulant
structure with respect to the angles is retained at all the levels.
We don’t know at yet if a multigrid approach can succeed also for this problem.
5
R0
O P
detector
sj
∆s
collimator
j - th bin
θk
6
g(x,y) = −f(x,y − ∆) + 2f(x,y) − f(x,y + ∆) (3.7)
where ∆ is the chopping amplitude and (x,y) is a generic pixel: this is done
by pointing the telescope to two sky areas close to the region of interest, corre-
sponding to the shifts ±∆.
Here the shift invariance is apparent, resulting in a band Toeplitz matrix A
discretizing (3.7), with many inner diagonals equal to zero: it is given by
(A)m,n = −δm,n + 2δm+K,n − δm+2K,n , m = 1, . . ., N, n = 1, . . . , N + 2K,
where δm,n is the Kronecker symbol; K is related to the ratio between ∆ and
the sampling distance.
A theoretical inspection of the associated system of equations then gives a lot
of information about the expected quality of reconstructions [1].
In particular, the condition number of A only depends on the quotient, q,
between N and K and is asymptotical to q2 (thus it is decreasing with respect
to K); the pseudo-inverse A† has piecewise constant diagonals, which produce
discontinuities in the reconstructions; the generalized solution f † is orthogonal
to the vector of all 1’s and therefore its components sum to zero.
In the light of the last property, for this problem it is not convenient to apply
standard fast methods for the solution of Toeplitz systems, since it is required
the computation of a constrained solution. More precisely, the problem
min kAf − gk
(3.8)
f ≥0
has a unique nonnegative solution of minimal norm, having much more physical
significance than the generalized solution.
Despite the complete theoretical understanding, at present it is not clear how
the structure of A can be exploited to obtain efficient methods for solving (3.8).
The best results are actually provided by the application of the so-called Pro-
jected Landweber method [1], that produces a sequence of nonnegative approx-
imations slowly converging to a solution of (3.8).
4 Conclusions
We presented two examples of image applications where taking into account the
structure of the matrices involved could help in order to reduce the computa-
tional cost of reconstructions.
The first example is concerned with tomography (more precisely SPECT imag-
ing): the problem is computationally very difficult, a matrix structure can be
found with respect to one of the two variables, at present no insight is available
concerning the other variable.
In the second example (infrared astronomy) the theoretical knowledge is com-
plete and the structures are apparent, but very few is known about reconstruction
algorithms exploiting them.
7
We point out that all the numerical methods used in these applications should
take into account that the structured matrices discretize an inverse problem and
therefore some regularization approach is needed.
References