AIM.1999.803173 - TWC NN Modeling
AIM.1999.803173 - TWC NN Modeling
Michele Milano
Institute of Fluid Dynamics, Swiss Federal Institute of Technology (ETH)
Sonneggstrasse 3, 8092 Ziirich, Switzerland
E-mail: milanoQifd.mavt .ethz.ch
Gabriele Serra
Magneti Marelli Divisione Controllo Motore
via Timavo 3, Bologna, Italy
Abstruct- In this paper we present a novel approach sions, where reaction rates depend on concentrations
to the problem of three-way catalytic converter dy- and temperature, should yield a model sufficiently fiex-
namic modeling; one of the main issues related t o mod-
eling these devices is the reaction kinetics submodel, ible to match the real behavior of a large variety of
that has to be at the same time simple and flexible catalyst formulations and washcoats.
enough t o capture all the significant features of the real Unfortunately, the lack of precise kinetic measurr-
system. We propose the use of machine learning tech-
niques to solve this problem: a neural network struc- ments, especially during transient warm-up phase,
ture for the kinetic submodel and a genetic algorithm makes the choice of suitable reaction kinetic expres-
to tune its parameters. In this way the difficulties aris- sions and the tuning of the parameters probably the
ing from the identification of the resulting overall model
are avoided.
most crucial aspects for model reliability.
Thus, there is a need for a method allowing the es-
timation of the kinetic data from more easily available
I. INTRODUCTION experimental data, like pollutant emission measure-
Stricter legislation on engine exhaust gases makes ments at the inlet and the outlet of the convert,er.
significant the optimization of the three-way catalytic Since the reaction kinetics are not straightforward
converter (TWC) dynamical behavior. From this point to model, it is convenient to use a black-box approach
of view, an important task is to predict the transient which allows also an easier compensation for possible
TWC emissions (converter output) given the engine noise that could be present in the input-output data.
output for any driving cycle of interest. Indeed the Neural network (NN) models are particularly suitable
most important part of the cycles, as regards air pol- for the purpose of dealing with these problems [5].
lution and thermal stress, is the warm-up phase. Taking into account the reduced order dynamical
Numerical simulations of the three-way catalytic model of TWC presented in [4], here we implement
converter could advantageously substitute expensive a neural network submodel for the reaction kinetics
experiments with an engine on a test bench. More- deeply embedded in the TWC model structure.
over, the design of warm-up controllers and real time Since both the lack of data available to directly train
application demand for simpler, reduced order models the NN and the model architecture make difficult to
(see, for example, [3], [4]). Simplified reaction scheme define a parametric gradient, necessary for the stan-
and rate expressions [ll],[14] are used in most models dard backpropagation technique, a genetic algorithm
111, [2], [3], [4], [la]. These empirical kinetic expres- is used here to train the NN submodel without the
240
111. THENEURALNETWORKEXTENSION
OF THE as = ( X C O r X C ~ H s , ~ C H ~ , X N 0 , X H 2 , TzsE, ~ m ) T ;
U
TWC MODEL IR7 is the output vector of the first layer; W , E IR7x7
The neural network extension of the model (3) has and W2 E IR5x7 are the network weight matrices;
been developed using a NN representation of the spe- bl E R7 and b2 E R5 are the bias column vectors;
cific reaction rates vector R’ = ( R i ,. . . ,Rb)T (equa- and the tanh(.) operator is intended component-wise.
tion (3b)); in this way the neural network is embebbed The mean gas velocity in monolith U,,, has been in-
into the model equations (3). cluded in the input vector U because previous identifi-
The kinetic model (see Figure 1) has the following cation experiments of the system have shown that this
further information improves the performance of the
structure:
whole model.
n
where D = diag(Xc.0, X C ~ H XC~ ‘, HXNO,
~ , X H ~ )E
R N NE IR’ is the output vector of the NN;T, RNN,1
is the temperature of the reactive surface (substrate)
and Tt is a prefixed threshold of the temperature. RNN, 2
On the basis of the reaction kinetics, according to
RNN, 3
the chemical scheme (2), it is straightforward to obtain
the vector of the rates of the chemical species R =
RNN, 4
( R I , .. . ,R G )(equations
~ (3c)) since it is related t,o
RI .
RNN, 5
dNN I
24 1
the whole model and the measured experimental data:
+
tion result; h refers to the output channel of the T W C
model (temperature and concentration of the chemical
species all evaluated at the outlet of the TWC); VI,, are
i scaling factors; ti, and tfi,, are respectively the initial
and final time instant of the simulation.
The trainidg process employs a Genetic Algorithm
(GA) [lo], obtained by a modification of the Con-
trolled Random Search (CRS) algorithm by Price [13].
, ., ~- The CRS algorithm has been demonstrated to be ef-
fective in solving global optimization problems [9], but,
.Fig. 3. Structure of the system. often with too long convergence time due t o the large
population size necessary to avoid local minima. A
considerable improvement of the CRS algorithm for
solving a system of ordinary differential equations both the convergence time and the reliability of' the
(ODES) [6]. solutions is presented in [lo]. The CRS can be seen as
a Genetic Algorithm with zero mutation probability.
V . THEIDENTIFICATION
PROCEDURE Within the framework of the CRS a variable muta-
tion operator was designed. In virtue of this context-
In the TWC model the neural network is merely a dependent mutation operator, this GA needs a much
simple nonlinear mapping between the input and out- smaller population size than that one necessary to
put quantities of the kinetics submodel. The NN con- CRS to properly converge, so that the convergence
tains a certain number of parameters (weights W and time can be much smaller, too. It was also demon-
biases b ) , whose final values are determinated during strated that the cross-over operator best exploits the
the training period. fitness function regularities, much like a gradient based
To train a static multilayer feed-forward neural net- algorithm, even if the gradient itself is never needed.
work the standard backpropagation algorithm could The identification procedure uses as experimental
be used [ 5 ] . However for the backpropagation algo- data the first 800 seconds of an ECE cycle. The follow-
rithm to work properly, it is necessary to have the ing table gives a better insight into the identification
desired output for the NN available. In our case the procedure:
desired outputs are the reaction rates; since these data
are not directly available, the NN model has to be I Two-layer NN
embedded in the TWC equations (3) and the desired
NN structure 7.7.5
outputs become the outputs of the whole model. This
Number of generations
means that one should take also the model structure
Best fitness
into account when writing down the updating rule for
Number of parameter
the network parameters. This is not an easy task with
Number of population elements
our model, and furthermore, from some previous iden-
tification experience, it turned out that the model it-
self is quite difficult t o be identified using a standard RESULTS
VI. SIMULATION
least-squares technique, even without the presence of The simulation results are shown in Figures 4-8. In
the N N extention. all figures a comparison between real d a t a and the
All these reasons suggested to use a stochastic algo- TWC model output is reported; the outputs result-
rithm, since it performs a full exploration of the whole ing from the validation phase, starting after 800 sec,
space of solutions, thus avoiding local minimum points, are plotted together with the results obtained during
without the need for any gradient information. the training.
In this context, a learning rule is used to adjust the Results show a good fit between the model output,
weights and the biases of the NN in order to mini- and the real behavior of the TWC for the temperature
mize the sum-squared error between the outputs of dynamics, also in the validation phase. As regards the
T i m (3)
2500
,, , ,-
- - . - ,,,,,....,,
*...,..,
,.,1,,1,1,,.*..,,
,",,,i 1/1
I I , I I ~ , I
I I 1 I I , , I
,( I I, I , ~ I1
-05' J -500'
0 200 400 600 800 1000 1200 1400 0 200 400 600 800 1000 1200
Tlme 151 Time [r]
243
ECE cycle (experimental data); [8] Lin, C. C., and L. A. Segel, Mathematics Applied to Deter-
the simulation of 1200 real seconds of the warm- ministic Problems in the Natural Sciences, Society for Indus-
trial & Applied Mathematics, 1988.
up along an ECE cycle takes 5 sec. on a on a PC, 191 Milano L., Barone F., Milano M., ‘Time domain amplitude
Intel Pentium I1 350MHz Processor, 32 Mb,RAM. and frequency detection of gr@vitational waves from coalesc-
ing binaries,’ Physical Review D, vol. 55, n. 8, pp. 4537-4554.
[lo] Barone, F., M. Milano, ‘An Identification Procedure per-
VII. CONCLUSIONS forming Automatic Sensitivity Analysis based on Evolution-
ary Programming,’ submitted t o IEEE ’IYansactions S’ystems
We presented some preliminary results relative to Man and Cybernetics.
the use of machine learning techniques to include a [ll) Montreuil C. N . , S. C. Williams, and A. A. Atlarnczyk,
Modeling Current Generation Catalytic Converters: Labo-
NN reaction kinetics submodel into a TWC model. ratory Experiments and Kinetic Parameter Optimization-
To overcome the problems arising from the resulting Steady State Kinetics,’ SAE paper 920096, 19%.
overall model structure, we used a GA to train the [12] Oh, S. H. and J. C. Cavendish, ‘Mathematical Modeling
of Catalytic Converter Lightoff. Part 111: Prediction of Ve-
NN submodel without the necessity of gradient infor- hicle Exhaust Emissions and Parametric Analysis,’ AICliE
mation. Journal, vol. 31, No. 6 , June 1985, 943-949.
[13] Price, W. L. , ‘A controlled random search procedure for
Further improvements of this model will be explored global optimization,’ The Computer Journal, vol. 20, 11. 4,
by looking at more sophisticated NN architectures. 1976, pp. 367-370.
[14] Voltz, S. E., C. R. Morgan, D. Liederman and S. M. Jacob,
‘Kinetic Study of Carbon Moiloxide and Propylene Oxidation
V I I I . APPENDIX on Platinum Catalysts,’ lrid. EJIE.CXem. Prod. lies. Dev..
J/kg I< specific heat capacity of .gas Vol. 12, No. 4, 1973, pp. 294-301;
J/kg I< specific heat capacity of substrate
mol/inZs specific reaction rate for species i
mol/rn2 s specific i.eaction rate for the chemical reactioii 1
m2/m3 active area/volume ratio of the monolith
W/m’ K convective heat transfer coefficient
(from g i s ’ t o substrate) .
W/m2 K heat transfer coefficient
m/s mass transfer coefficient for species i
m2/m3 external area/volume ratio
K ambient, temperature
m/s mean gas velocity in monolith
J/mol heat of ith-reaction
void fractioii. , .
R.EFERENCES
[l] Abate, G., L. Glielmo, P. Rinaldi, S. Santini, G. Serra, ‘Nu-
merical Simulation and Identification of the Dynaqic Be-
haviour of Three Way Catalytic Converters’, ICE97-3rd In-
ternational Conference on Internal Combustion Engines:,-Ex-
periments and Modelling, Capri, Italy, 1997.
[2] Baba, N., K.. Ohsawa, and S. Sugiura, ‘Numerical Approach
for Improving the ConversiowCharacteristics of Exhaust Cat-
alysts under - Warming-Up Condition,’ SAE paper 9620 7G,
1996.
[3] Glielmo, L., P. Rinaldi, S. Santini, %. Serra, ‘Modelling and
Numerical Simulation of the Dynamic Behaviour of Three
Way Catalytic Converters’, Proc. IEEE Conference on Con-
trol Applications, Trieste, Italy, September 1998.
[4] Glielmo, L., S. Santini, G. Serra, ‘A Two-Time-Scale
Infinite-Adsorption Model of Three Way Catalytic Convert-
ers’, t o be presented at American Control Conference, San
Diego, U.S.A., 1999.
[5] Haykin S., Neural Networks, McMillan College, 1997.
[6] .Jeffrey, A., Quasilinear Hyperbolic Systems and Waves, Pit-
man Publishing, London, 1976.
[7] KokotoviC, P. V., H. K. Khalil and J . O’Reilly, Singular
Perturbation Methods in Control: 4naJ,ysis and Design, Aca-
demic Press, London, 1986.
244