Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Vitor Alex Oliveira Alves, Rodrigo Juliani Correa de Godoy and Claudio Garcia
AbstractThe goal of System Identication is to obtain a
reduced order model capable of reproducing the key features
of the true system behavior. Some system identication methods
require, as a starting point, a high order linear model of the
system. Such model is then adopted as the system most reliable
description and, based on it, the desired reduced order model
is derived. In this scenario, one could ask what is the best
order of the high order model?. The reply to this question is
not addressed in the specialized literature. This paper presents
an optimization strategy that maximizes the t index to deal
with this subject.
I. INTRODUCTION
The identication methods that employ high order models
use them in different ways. For instance, the ASYM method
[1] considers an ARX (Auto-Regressive with eXogenous
inputs) high order model as the true description of the
system. Afterwards, using a frequency domain criterion, this
method reduces the model order to obtain a low order model
suitable for control applications. The two-step joint input-
output identication methods, such as the u-ltering [2], y-
ltering [3] and the Double Filtering [4] methods, employ
FIR (Finite Impulse Response) high order models to describe
the input and output sensitivity functions [5] - or the closed-
loop transfer function (exclusively for the Double Filtering
method [4]). There is a MRI (MPC Relevant Identication)
method, proposed in [6] and [7], that uses a high order
FIR model as the true process description. Another MRI
method proposed in [8] employs, also as the true process
most faithful description, a high order ARX model. The
preference for a high order ARX structure is due to the
fact that such models are practically unbiased [9] and their
parameter estimation has analytical solution.
None of these studies were worried about the determina-
tion of the best high order to be used. Instead, they rely on
xed values, such as 15 or 20 [2], 24 [9], 20 or 25 [10], 30 or
40 [11]. Also, it is not mentioned the number of zeros asso-
ciated with the models. As a rule of thumb, it is considered
equal to the number of poles minus one (see Example 1). It
is common that the obtained high order models are unstable
[specially when the identication method applied is non lin-
ear in the parameters to be estimated, such as the Prediction
Research project sponsored and nancially supported by Petrobras SA
under process 0050.0058968.10.9/2010.
V. A. O. Alves is with Department of Mechanical Engineering, Mau a
Institute of Technology, Praca Mau a 01, S ao Caetano do Sul, SP, Brazil -
CEP 09580-900. vitoralex.alves@maua.br
R. Juliani C. G. and C. Garcia are with Department of Telecommu-
nications and Control Engineering, Polytechnic School of the University
of S ao Paulo, Av. Prof. Luciano Gualberto, 158, trav. 3, Butant a, S ao
Paulo, SP, Brazil - CEP 05508-970. rodrigo.juliani@usp.br,
clgarcia@lac.usp.br
Error Method (PEM) with Output-Error (OE), Box-Jenkins
(BJ) or Auto-Regressive Moving Average with eXogenous
inputs (ARMAX) model structures] or present near pole and
zero cancellations. Besides, in practical situations - given a
(relatively small) experimental dataset - several identication
algorithms exhibit convergence problems when the adopted
model order is too high (or, equivalently, when there is an
elevated number of parameters to be estimated). This can
result in poor models. A frequency domain interpretation of
such fact can be found in [12].
In this paper, an optimization approach is proposed to nd
the best high order for an ARX model (adjusted for a given
input-output dataset), based on the maximization of the t
index [13], dened as:
f it =100
1
y(t) y(t)
y(t) mean(y(t))
= [n
a
, n
b
], but it is completely irrelevant if the order
itself is close to the global optimal n
. It is also relevant
to note that the cross-validation strategy was applied in (5)
because it is not so surprising that a model will be able
to reproduce the estimation data, [...] but the real test is
whether it will be capable of also describing fresh data-sets
from the process [12].
B. Optimization Problem Solution
To optimize (5), subject to (6)-(8), a two-step optimization
strategy is proposed. In the rst step, the problem is solved
with a global optimization algorithm, then, in the second
step, the solution is rened with a direct search algorithm.
Both steps are described next.
1) Global Optimization - Simulated Annealing: The op-
timization problem (5) requires a large computational effort
when solved by exhaustive search. Furthermore, it is a well
known fact that the search domain grows exponentially with
the increase in the dimension (number of inputs and outputs)
of the system to be identied [16]. Thus, it is desirable
to nd a solution that presents a t close to the optimum
in a feasible computational time. The Simulated Annealing
algorithm [17], [18] is employed in this task.
2) Direct Search Optimization - Pattern Search: The
Simulated Annealing algorithm nds a solution that presents
a performance near the optimum but, in most cases, is not
exactly the optimum. Hence, to rene such solution, it is
employed a direct search algorithm: the Pattern Search [19],
[20]. In conclusion, this second step moves the previous
solution closer to the optimum.
These algorithms were selected due to its widely appli-
cation to optimization problems and for being available in
computational tools employed in simulation, identication
and optimization, such as MATLAB