JCSSP 2019 57 66

Journal of Computer Science
Original Research Paper
Performance Optimization of Physics Simulations Through

Genetic Algorithms
1,2
Oksana Shadura, 2Federico Carminati and 1Anatoliy Petrenko
1
National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute", Kiev, Ukraine
2
CERN, Geneve, Switzerland
Article history Abstract: The GeantV R&D approach is revisiting the standard particle
Received: 21-09-2018 transport simulation approach to be able to benefit from “Single Instruction,
Revised: 08-12-2018 Multiple Data” (SIMD) computational architectures or extremely parallel
Accepted: 7-01-2019 systems like coprocessors and GPUs. The goal of this work is to develop a
Corresponding Author:
mechanism for optimizing the programs used for High-Energy Physics
Oksana Shadura (HEP) particle transport simulations using a “black-box” optimization
CERN, Geneve, Switzerland approach. Taking in account that genetic algorithms are among the most
Email: oksana.shadura@cern.ch widely used “black-box” optimization methods, we analyzed a simplified
ksu.shadura@gmail.com model that allows precise mathematical definition and description of the
genetic algorithm. The work done in this article is focused on the studies of
evolutionary algorithms and particularly on stochastic optimization
algorithms and unsupervised machine learning methods for the
optimization of the parameters of the GeantV applications.
Keywords: Genetic Algorithms, Multi-Objective Optimization, Black-Box

Optimization, Simulation of Transport of Particles
Introduction indicates a process of function minimization or

maximization employing algorithms of random
In the past years, simulation toolkit for transport optimization. Even assuming that we use a set of
particles through matter Geant4 (Agostinelli et al., 2003) efficient genetic algorithms for the optimization of a
has been the main application for full detector simulation “black-box” Multi-Objective Problem (MOP) with a
in High Energy Physics (HEP). The complex design of computationally expensive fitness function, we still face
Geant4, with deep class structure hierarchies and calling the problem of finding operators providing efficient
stacks, makes it less than optimally efficient when convergence to the optimal Pareto Front. The genetic
running on the latest computer architectures. In order to algorithms selected for this work are the Non-Dominant
resolve this situation, in 2013 the simulation project Sorting Genetic Algorithms - NSGA-II (Deb et al.,
GeantV (Amadio et al., 2016) was started. The 2002) and NSGA-III (Deb and Jain, 2014).
connected R&D work included the implementation of During research was identified a set of indicators that
instruction level parallelism for increased performance are relevant for the evaluation of GeantV performance.
by leveraging Single Instruction Multiple Data (SIMD) Our selection is: Run time, peak memory consumption
code and improving data locality. For these reasons the during application run and other model-specific dependent
new software applications was designed to be able to run indicators, such as the number of instructions or other
on parallel as well as SIMD architectures and efficiently performance measurements. Introducing extra operators
use system caches (Shadura and Carminati, 2016). for the genetic algorithm is an attempt to remove noise
GeantV expects to achieve significant improvement for from the dataset and introduce a procedure to improve the
event throughput to be ready for the larger simulated genetic algorithm convergence to the “true Pareto front.”
data samples needed by the High-Luminosity Large Optimal individuals selected among the set of the
Hardron Collider (LHC). interesting parameters populate this front. This set of
General idea of work is about tuning of performance parameters is used to apply an orthogonal transformation
of complex High Energy Physics simulations such as and maximize variance to discover strong patterns in data.
GeantV. It depends on a large number of parameters that The objective of this work is to inquire whether
are complex to tune by hand. Stochastic optimization unsupervised machine learning methods are useful as
© 2019 Oksana Shadura, Federico Carminati and Anatoliy Petrenko. This open access article is distributed under a Creative
Commons Attribution (CC-BY) 3.0 license.
Oksana Shadura et al. / Journal of Computer Science 2019, 15 (1): 57.66
DOI: 10.3844/jcssp.2019.57.66
injected “operators” in genetic algorithms and if, thanks A Markov chain is defined by a transition matrix Tq , p
to them, it is possible to speed up the process of finding from the population p to q , where the vector describing

a Pareto front.
the population is defined as:
Materials and Methods
During the work on optimization of computing
{ p = ( p , p ,…, p ) ,
1 2 m
t
0 ≤ pα ≤ 1, ∑
m
α =1 }
pα = 1 , (2)
performance of HEP simulation applications, we

consider particle transport simulation as a heuristic and the vector component pα is the probability of the
parametric model with expensive evaluations in terms of appearance of the α-th individual in the genetic
consumption of computing resources and a complex population. In the sample space Ω = {1,2,3..N} we have
fitness landscape that will be optimized by use of a population of m different types of individuals. We will
stochastic search algorithms. provide a full description of the genetic operators in
As a result, this article is part of a research work on terms of dynamic system in the next section together
the optimization of high-energy physics simulations with the implementation of a new unsupervised machine
interpreted as multi-objective computing applications learning genetic operator.
performance optimization problems. To test our ideas on Here, we introduce the basic definitions used in the
a simpler application, we will try to optimize the theory of genetic algorithms and dynamic systems. A
performance of the Deb-Thiele-Laumanns-Zitzler dynamic system is a model in which a function describes
(DTLZ) benchmarks (Deb et al., 2005) and of a the time dependence of the evolution of a point in
simplified simulation benchmark. The goal is to improve a geometrical space. Dynamic systems can describe the
convergence to the “true Pareto front” while using a evolution of individuals in the space of finite dimension,
combination of genetic algorithms and methods from members of a populations of fixed size m, where m is
unsupervised machine learning. number of measurements during the experiment. While
Considering that genetic algorithms are among the defining genetic algorithms as a discrete dynamical system,
most widely used evolutionary algorithms, we tried to we can discover mathematical entities such as fixed points,
analyze a simplified model that allows precise which are elements of the function's domain that are
mathematical definition and description of the genetic
mapped to themselves by the function. These objects are
algorithm, i.e., the “Simple model of Genetic Algorithm”
(SGA) (Vose, 1999) used for the investigation of the not only relevant in the investigation of simple genetic
typical evolutionary system, describing the genetic algorithms, but in general for all optimization problems
algorithm as an example of dynamical system with a (Shadura and Carminati, 2016).
precise mathematical definitions. The investigation of the convergence properties of
In this approach, genetic algorithm is defined by the SGA as evolution schema was explored in (Rudolph,
Markov chains, which are represented by stochastic 1997). Schmitt and Rothlauf (2001) it has been shown
models with a sets of sequential events, where the that the convergence rate of the genetic algorithm is
probability of each new event depends on the state of the determined by the second largest eigenvalue of the
previous event. The states of the evolutionary system are transition matrix Tq , p .
populations of genetic individuals, while transitions
For Markov chains, it is very complex to determine
between the various states are handled by a set of genetic
the process of evolution along an appropriate direction
operators: Selection, crossover and mutation (Rowe,
2007). These operators are operating in the space of: leading to a faster convergence to equilibrium.
Principal Component Analysis (PCA) could be
t defined as a procedure that is able to inspect the
Λ = ( p1 , p2 ,..., pm ) (1) genetic algorithm population’s sensitivity and the
correlations between parameters of the input data
of the population’s vectors. matrix and the generated population. For this reason,
The mutation operation is an interesting evolutionary- we want to introduce a PCA operator providing such
inspired mechanism that expresses the connection of the functionality, using inverse PCA noise reduction.
Markov chain (through fully connected matrix)and
ensures that it has an exclusive equilibrium distribution Theory of PCA and UPCA-based Operator
over populations, which means that the Markov chain
converges to the stationary distribution.
in GA
In case of Markov chains, the probability of Before embarking in the optimization of the GeantV
generation of a particular population will depend only on simulation, we need to identify a list of optimization
the previous genetic generation and some extraneous parameters significant for the computing performance.
affecting factors. These are user defined parameters such as the number of
58
DOI: 10.3844/jcssp.2019.57.66
threads used in simulation or the size of the vectors of and for the matrix Wi , j = {w j } = ( wi ) j

we have the
transported particles etc. These can be described via a
orthogonality condition:
data matrix of size m × n:
Wˆ t ⋅ Wˆ = Iˆ. (8)
Xˆ = { X α ,i } = {( xα )i } = { xα } , (3)

From (7) we have:

where, xα = {( xα )i } (1 ≤ α ≤ m,1 ≤ i ≤ n ) is the

α-th
individual of the population. In this matrix, the index i Wˆ t ⋅ Tˆ ⋅ Wˆ = ∆ˆ , ∆ i , j = tiδ i , j . (9)
indicates the GeantV tuning parameters (i = 1,…,n) and
the index α indicates the number of measurements of the
{
θ j = (θ j ) } = {x ⋅ w j } is
t
j-th uncentered principal

α
fitness function for the given set of performance α
indicators (α = 1,…,m for m measurements).Comparing component, here α = 1,…,m. If we define

{ } {
Θα , j = θ j = (θ j ) }

the definition in terms of GA, the matrix is described then:
α
through m -samples of data in a n-dimensional space,
where n is the size of the individual and m is the number
of individuals in the generation. Θα , j = X α , iWi , j ,1 ≤ α ≤ m, (10)
Principle Component Analysis (PCA) is used for data
analysis via covariance matrix to diminish complex data which from (8) and (9) satisfies the condition:
set to a lower dimensionality, applying PCA to a
centered data matrix. Θti ,α Θα , j = m∆ i , j = mtiδ i , j . (11)
In this section, we apply PCA to an uncentered data
matrix. We will reference to it as “Uncentred PCA” We do not have a clear description of the relationship
(UPCA) and we will prove that it is particularly useful in between the matrix eigenvalues tj and the variance of the
the case of transformations on constrained genetic j-th uncentered principal component (σθ,j)2 as for the
algorithms populations, as it is the case for a set of PCA case. In our case this property is not fundamental
GeantV parameters. A discussion of the relations for using the PCA algorithm in the GA. We apply
between centered and uncentered data matrix can be instead the “eigenvalue control parameter”
found in the work of Cadima and Jolliffe (2009). approximation (Cadima and Jolliffe, 2009). In simple
As ¨mentioned in the previous section, m-samples of terms, it is the ability to use the PCA algorithm for the
data from an n-dimensional space are elements of the Singular Value Decomposition (SVD) representation of
data matrix X̂ of size m × n (m is the number of the data matrix X̂ (Amadio et al., 2017).
individuals in the generation and n is the size of the
If we define the matrix Θɶ α ,i as:
individual, which is equivalent to the dimension of a
vector of genes x = {xi}(1≤i≤n).

ɶ ∆1/ 2 ,∆1/ 2 = t1/ 2δ .
Θα , j = m Θ (12)
Using the uncentered data matrix X̂ size m × n, α ,i i , j i, j i i, j
defined in (3) we can write the matrix of non-central

second moments for UPCA: from (11) and (12) we obtain:
1 ɶt Θ ɶ
Tˆ = Xˆ t ⋅ Xˆ t . (4) Θ i ,α α , j = δ i , j .
m

if w j

are eigenvectors of the matrix T̂ with the Using (12), (11) and (7) we see that Θɶ α , j = θɶj is the {}
corresponding eigenvalues tj:
matrix of the eigenvectors (θɶj )α of the matrix Kɶ = Xˆ ⋅ Xˆ t
Tˆ ⋅ w j = t j w j ,1 ≤ j ≤ n. (5) of size m × m:

They satisfy the orthonormality condition: ( )

Kɶ α , β θɶj
β
( )
= X α , k X kt , β θɶj
β
( )
= t j θɶj
α
.
wit ⋅ w j = δ i , j ,1 ≤ i, j ≤ n. (6)

From (10) we can derive the representation for the
Then: data matrix X̂ :
wtj ⋅ Tˆ ⋅ w j = t j , (7) X α ,i = Θα , jW jt,i , (13)

59
DOI: 10.3844/jcssp.2019.57.66
and the SVD representation of the data matrix is then selection operator selects genetic individuals from the
obtained: current population via the performance indicators,

f = { fα } ∈ R m , fα = f(α), α∈Ω:
ɶ ∆1/ 2W t .
X α ,i = mΘ (14)
α ,k k , j j ,i
diag ( f ) ⋅ p

F ( p) =

In the case when the matrix of non-central second t ,
f ⋅p
moments T̂ has the (n-q) smallest eigenvalues such that
tj << 1, q +1≤j≤n, we can use the “eigenvalue control
where, diag ( f ) is a diagonal matrix.

parameter” approximation of the data matrix and
approximate it by the output data matrix Xɶ α , j of rank q: The mutation operator
U : Λ → Λ,
ɶ ∆ɶ 1/ 2W t
Xɶ α ,i = mΘ α ,k k , j j ,i
(15)
( ɶ W t + ... + t1/ 2Θ
= m t11/ 2Θα ,1 1, i q
ɶ Wt ,
α , q q ,i ) is an m × m matrix with the (α,β)-th entry uα,β >0 for all
α, β and uα,β indicates the probability of individual β∈Ω
where, the eigenvalue matrix ∆ɶ k , j has rank q(tq+1 = tq+2 = mutating into α∈Ω. (U ⋅ p )α is the probability for an

⋅⋅⋅ = tn = 0). This approximation is the analog of the individual (type α) to appear after the mutation process
Hotelling transformation (Hotelling, 1936). is applied to the population p (Amadio et al., 2017).

We can estimate the mean square error ηq for the The crossover operator is defined as:
approximation (15):
C : Λ → Λ,
1 m n 2
ηq = ∑∑
mn α =1 i =1
( X α ,i − Xɶ α ,i )

(
C ( p ) = pt ⋅ Cˆ1 ⋅ p,..., p t ⋅ Cˆ m ⋅ p

)
2
1 m n  n
ɶ W t  = 1
n
= ∑∑  m ∑ tk Θ α , k k ,i  ∑ .tk where, Cˆ1 ,..., Cˆ m is a sequence of symmetric non-negative
mn α =1 i =1  k = q +1  n k = q +1
N × N real-valued matrices. Cα ( p ) expresses

The minimum error is achieved if the matrix T̂ has probability that an individual α appears after the
crossover process is applied to the population p .

the (n-q) smallest eigenvalues such that tj << 1, q
+1≤j≤n.
At this point having already defined the UPCA G : Λ → Λ,
(16)
operator, we need to describe the genetic algorithm G ( p ) = C U F ( p ).
model for the injection of the operator we introduced
earlier. Recalling ideas presented in (Vose, 1999) and
Having defined G, we can express the stochastic
described in the previous section, we will develop the transition matrix, based on the probability of
optimization process (Amadio et al., 2017). transformation of the population p into q :

The GA operators are operating in the space Λ = (p1,
p2,⋅⋅⋅, pm)t of the population’s vectors.
( G ( p ))
mqα
The genetic operator Gα ( p ) is defined by the

= m!∏α ∈Ω
α
Tq, p
(17)
probability of generating an individual α (starting from ( mqα )!
the previous population p ). We define a map G: Λ→Λ,

where G ( p ) = ∏ α∈Ω Gα ( p ) and G ( p ) ∈ Λ is an heuristic where, Gα ( p ) define probability of the appearance of

function and Ω is the sample space. The map G is individual α in the next generation and mqα is the number
equivalent to the composition of selection, mutation and of individuals α in the population q with size m .

crossover maps. The genetic selection operator:
We can improve the convergence rate of genetic
algorithms (how fast genetic algorithms are converging
F : Λ → Λ,
to the optimum per generation) by adding a new genetic
F ( p ) = ∏α∈Ω Fα ( p )

operator P based on UPCA procedure. We will call this
procedure “UPCA noise cleanup operator and we will
defines the probability of a genetic individual of type α introduce it in the genetic algorithm’s map

(when the process of selection is applied to p ∈ Λ ). A
GP ( p ) = P C U F ( p ) .
60
DOI: 10.3844/jcssp.2019.57.66
Since we have a similar setup as it is done in the SGA on the front to which they belong to. Individuals in the
case (Schmitt and Rothlauf, 2001), we can observe that a first front are given a rank value of one, while
genetic algorithm convergence rate is defined by the individuals in the second front are assigned a rank value
eigenvalues that follow the highest one. That’s why we of two and so on. In addition to the rank value, an
had to apply the proposed operator P on the earlier additional parameter called crowding distance is
eigenvalues. calculated for each individual (Shadura, 2017). The
An interesting observation is that the eigenvector with crowding distance is a measure of how close an individual
the largest eigenvalues defines a subspace of solutions for is to its neighbors. Larger average value of crowding
multi-objective problem populating Pareto front. distances indicates significant population diversity.
In the next section we will assess whether using an NSGA-III (Deb et al., 2005) has a different algorithm
iterative procedure for an uncentered data matrix allows schema, built on the idea of improved reference points
us to reach the subspace of optimal solutions faster than selection and using an already defined set of reference
with a centered matrix. We will perform this test for the points to assure diversity for the population.
standard benchmarking problems used to validate The main problem of genetic algorithms is an
performance of genetic algorithms. absence of operators that enhance the convergence to a
global model optimum. We present an improved
Testing of the UPCA-Improved Genetic algorithm (Fig. 1) based on the combination of NSGA-II
Algorithm on NSGA-II and DTLZ and the UPCA operator defined in the previous section.
The DTLZ benchmarks (Vose, 1999) are a set of
Benchmarks numerical multi-objective problems, used for
We use NSGA-II (Allison et al., 2006) as a part of comparison/validation of GA algorithms. We present the
our testing setup since it is one of the most popular comparison of NSGA-II without and with the UPCA
genetic algorithms. It is based on fast non-dominance noise cleanup operator applied to DTLZ.
sorting for generated populations and provides an In Fig. 2 and 3 we present the tuning parameter
impressive convergence rate to the optimal Pareto set. distribution together with the mean and standard
The genetic algorithm population is initialized and it deviation values with and without the UPCA operator. In
is afterwards sorted into a group of fronts using the non- Fig. 3 and 5 the PCA-based noise-cleanup procedure is
domination principle. The first front is treated as a used, while in Fig. 2 and 4 it is not. We can observe a
completely non-dominant set and the individuals of the significant improvement of the convergence speed to the
first front are dominating the individuals of the second ideal values of the parameters in the first case when we
front only and so on until the last front. For each use the UPCA operator. Figure 5 shows the first images
individual, in each front, a rank value is assigned based of an early Pareto front.
2M×N
Selection Rank-12 Rank-3

M×N
Rank-3
UPCA NDS
Crossover,
Mutation
Transition matrix
Transition matrix
Fig. 1: Updated NSGA-II algorithm used in GeantV
61
DOI: 10.3844/jcssp.2019.57.66
Population distribution PopDist20

z Entries 24000
Mean 0.4814
250 Std Dev 0.1823
200
150
100
50
0
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
TGenes/bins
Fig. 2: Distribution of genetic population in the 20th generation of NSGA-II for DTLZ2 problem
Population distribution
PopDist20
z Entries 12000
Mean 0.482
120 Std Dev 0.1369
100
80
60
40
20
0
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
TGenes/bins
Fig. 3: Distribution of the genetic population at the 20th generation for NSGA-II with UPCA operator for DTLZ2 problem
62
DOI: 10.3844/jcssp.2019.57.66
Y1/Y2/Y3
h3y20
Entries 2000
Mean x 0.5129
Mean y 0.6449
Mean z 0.6434
Std Dev x 0.2695
Std Dev y 0.255
Std Dev z 0.2393
1
1 0.9
\ 0.8
0.7
0.6
0.5
0.4
0.3
0.2 0
0.1 0.1
0.2
0 0.3
1 0.4
0.9 0.8 0.5
0.7 0.6 0.6
0.7
0.5 0.4 0.8
0.3 0.2 0.9
0.1 0 1
Fig. 4: Visualization of the Pareto Front for the 20th generation of NSGA-II for DTLZ2 problem
Y1/Y2/Y3
h3y20
Entries 1000
1 Mean x 0.5359
Mean y 0.5357
0.9 Mean z 0.5286
Std Dev x 0.2869
0.8
Std Dev y 0.27
0.7 Std Dev z 0.2766
0.6
0.5
0.4
0.3
0.2
0.1
0
0.10.2
0.3
0.4 0.5
0.6
0.70.8 0.9 1
0.9 1 0.6 0.7 0.8
0.3 0.4 0.5
0 0.1 0.2
Fig. 5: Visualization of the Pareto Front at the 40th generation of NSGA-II with UPCA operator for DTLZ2 problem
efficient test suite, that can cover different use-cases,

Results with GeantV Simulation Code
providing proper coverage of functionality and including
An important step to evaluate if this kind of interactions between the GeantV core and other sub
optimization is applicable to GeantV, is to have an modules: Physics, geometry and etc. We choose the
63
DOI: 10.3844/jcssp.2019.57.66
GeantV NoviceExampleN03 Implementing PCA operator in the genetic

(https://gitlab.cern.ch/geant/GeantV.git) (Fig. 8.). algorithm’s library, used for tuning GeantV, helped to
The first results for ExampleN03, which represent a increase the speed of convergence of genetic algorithm
sampling calorimeter with Pb absorber layers and liquid by a factor two, allowing to quickly reach the “early”
Ar detection gaps are shown in Fig. 6 and 7. All Pareto front. Early results prove that thanks to the
Electromagnetic (EM) processes and decay are simulated stochastic tuning, the performance of the GeantV
with specific production cuts for γ, e+, e- (used for example on a Core i7-67000 machine, improves by 18%.
shower studies). On an Intel(R) Xeon(R) CPU E5-2695 the CPU time is
The output of the simulation is the detector response reduced by 34%, providing a more stable memory
that includes: Energy deposit and track length both in the consumption and reducing the overall run time of the
detecting gaps and in the inert plates (absorber). batch by up to 27% (Shadura and Carminati, 2016).
1600
1400
1200
1000
×10000
Non optimized Algorithm

800
Optimized Algorithm
600
400
200
0
Number of idle Number Number of Number of
CPU cycles of cycles instructions branches
Fig. 6: Comparison of number of performance indicators for optimized and non optimized GeantV simulation
160
155
150
Thousands
Non optimized Algorithm
145 Optimized Algorithm
140
135
Number of primaries events/second
Fig. 7: Comparison of number primaries events for optimized and non optimized GeantV simulations
64
DOI: 10.3844/jcssp.2019.57.66
Fig. 8: Shower simulation example in the selected test case
We have of course carefully verified that the physics analysis helps us to get faster convergence rate using
output does not change by applying stochastic tuning simple noise cleanup techniques based on an
algorithms. Users will be able to collect data on the orthonormal transformation strategy. The performance
performance of their system and tune it while running enhancement reached with this tuning for particle
simulation jobs. transport simulations can significantly help to economize
In this study we demonstrated a proof of concept computing resources, improve scheduling strategies and
tuning of the performance of a complex multithreaded, provide energy economy for computing resources.
highly vectorized application (GeantV) using
evolutionary computation/genetic algorithms. This opens Acknowledgment
the possibility to use the same method to tune complex
The authors would like to thank the CERN for their
applications on supercomputers and High Performance
support in writing this article.
Computing (HPC) clusters. It could also help to analyze
the performance of compute-intensive jobs, running in
heterogeneous environments, with the objective of Author’s Contributions
reaching better scalability. All authors equally contributed in this work.
Conclusion Ethics
The work we presented is focused on the stochastic This article is original and contains unpublished
optimization of highly parallelized applications for material. The authors confirm that are no conflict of
simulation of radiation transport in complex detectors. interest involved.
During the work we introduced a new genetic operator
able to speedup convergence of the algorithm to the References
true Pareto Front.
We have shown that the idea of merging the classical Agostinelli, S., J. Allison, K.A. Amako, J. Apostolakis
implementation of evolutionary algorithms with and H. Araujo et al., 2003. GEANT4 - a simulation
unsupervised machine learning methods can create a toolkit. Nuclear Instruments Meth. Phys. Res., 506:
powerful symbiosis that can efficiently tune the 250-303. DOI: 10.1016/S0168-9002(03)01368-8
performance of complex algorithms depending on a large Allison, J., K. Amako, J. Apostolakis, H. Araujo and P.
number of correlated parameters and it could be easily Arce Dubois et al., 2006. Geant4 developments and
implemented in any framework. The “mutation” of applications. IEEE Trans. Nuclear Sci., 53: 270-278.
genetic algorithm coupled with principal component DOI: 10.1109/TNS.2006.869826
65
DOI: 10.3844/jcssp.2019.57.66
Amadio, G., A. Ananya, J. Apostolakis, A. Arora and M. Rowe, J.E., 2007. Genetic algorithm theory. Proceedings
Bandieramonte et al., 2016. GeantV: from CPU to of the 9th Annual Conference Companion on
accelerators. J. Phys., 762: 012019-012019. Genetic and Evolutionary Computation, Jul. 07-11,
Amadio, G., J. Apostolakis, M. Bandieramonte, S.P. ACM, London, UK, pp: 3585-3608.
Behera and R. Brun et al., 2017. Stochastic DOI: 10.1145/1274000.1274125
optimization of GeantV code by use of genetic Rudolph, G., 1997. Convergence Properties of
algorithms. J. Phys.: Conf. Series, 898: 042026- Evolutionary Algorithms. 1st Edn., Kovac, ISBN-0:
042026. DOI: 10.1088/1742-6596/898/4/042026 3860645544, pp: 286.
Cadima, J. and I. Jolliffe, 2009. On relationships Schmitt, F. and F. Rothlauf, 2001. On the importance of
between uncentred and column-centred principal
the second largest eigenvalue on the convergence
component analysis. Pak. J. Stat., 25: 473-503.
rate of genetic algorithms. Proceedings of the 3rd
Deb, K. and H. Jain, 2014. An evolutionary many-
Annual Conference on Genetic and Evolutionary
objective optimization algorithm using reference-
point-based nondominated sorting approach, part I: Computation, Jul. 07-11, Morgan Kaufmann
Solving problems with box constraints. IEEE Trans. Publishers Inc., pp: 559-564.
Evolut. Comput., 18: 577-601. Shadura, O. and F. Carminati, 2016. Stochastic
DOI: 10.1109/TEVC.2013.2281535 performance tuning of complex simulation applications
Deb, K., A. Pratap, S. Agarwal and T. Meyarivan, 2002. using unsupervised machine learning. Proceedings of
A fast and elitist multiobjective genetic algorithm: the IEEE Symposium Series on Computational
NSGA-II. IEEE Trans. Evolut. Comput., 6: 182-197. Intelligence, Dec. 6-9, IEEE Xplore Press, Athens,
DOI: 10.1109/4235.996017 Greece, pp: 1-8. DOI: 10.1109/SSCI.2016.7850200
Deb, K., L. Thiele, M. Laumanns and E. Zitzler, 2005. Shadura, O., 2017. Performance tuning and monitoring
Scalable Test Problems for Evolutionary using genetic optimization techniques. GeantV
Multiobjective Optimization. In: Evolutionary Spring Sprint.
Multiobjective Optimization, Abraham, A., L. Vose, M.D., 1999. The Simple Genetic Algorithm:
Jain and R. Goldberg (Eds.), Springer, London, Foundations and Theory. 1st Edn., MIT Press,
pp: 105-145. Cambridge, ISBN-10: 026222058X, pp: 251.
Hotelling, H., 1936. Relations between two sets of
variates. Biometrika, 28: 321-377.
DOI: 10.2307/2333955
66

JCSSP 2019 57 66

Caricato da

Informazioni sul documento

Titolo originale

Copyright

Formati disponibili

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Copyright:

Formati disponibili

JCSSP 2019 57 66

Caricato da

Copyright:

Formati disponibili

Journal of Computer Science

Original Research Paper

Performance Optimization of Physics Simulations Through

Keywords: Genetic Algorithms, Multi-Objective Optimization, Black-Box

Introduction indicates a process of function minimization or

performance of HEP simulation applications, we

From (7) we have:

indicators (α = 1,…,m for m measurements).Comparing component, here α = 1,…,m. If we define

defined in (3) we can write the matrix of non-central

They satisfy the orthonormality condition: ( )

wtj ⋅ Tˆ ⋅ w j = t j , (7) X α ,i = Θα , jW jt,i , (13)

where G ( p ) = ∏ α∈Ω Gα ( p ) and G ( p ) ∈ Λ is an heuristic where, Gα ( p ) define probability of the appearance of

Selection Rank-12 Rank-3

Fig. 1: Updated NSGA-II algorithm used in GeantV

Population distribution PopDist20

efficient test suite, that can cover different use-cases,

GeantV NoviceExampleN03 Implementing PCA operator in the genetic

Non optimized Algorithm

Non optimized Algorithm

145 Optimized Algorithm

Fig. 8: Shower simulation example in the selected test case

Potrebbero piacerti anche