Abstract— Breast cancer has become one of the deadliest cancers among women all over the world. Fortunately, an early diagnosis of this type of cancer can considerably enhance the success of treatment. In this work, we propose a breast cancer classification system based on neural networks. The proposed system is a neural network with a single hidden layer, trained using the extreme learning machine algorithm. The main contribution of this work lies in the use of different activation functions for the hidden neurons and their optimization using a genetic algorithm. To evaluate the performance of the proposed system, tests are carried out on the Wisconsin Diagnostic Breast Cancer database. The obtained results show an important enhancement compared to the conventional extreme learning machine. Moreover, the obtained results are promising compared to other state-of-the-art methods.

Keywords—Extreme learning machine; Breast cancer diagnosis; Classification; Neural networks; Genetic algorithm.

I. INTRODUCTION

Recently, many breast cancer deaths have been prevented. This is due to both improvements in treatment and early detection by mammography [1]. Therefore, given the importance of early detection of breast cancer, a variety of classification techniques have been developed to deal with this problem. The published works in this field include many machine learning and pattern recognition methods.

Some of these works are based on neural networks. For example, in [2] the authors proposed optimizing the neural classifier using a genetic algorithm. In their model, both the structure and the weights of the network are optimized. They introduced different crossover and mutation methods in order to overcome the drawbacks of the conventional operators. In [3], the authors proposed a neural classifier with two learning stages. In the first stage, the input features are learned in an unsupervised way using a deep belief network. Then, in the second stage, the weights of this network are updated using the back-propagation algorithm in a supervised mode.

Other works are based on fuzzy logic. For example, in [4] the authors proposed a classification model performed in three phases: sample selection, feature selection, and classification. In the first stage, i.e. sample selection, a fuzzy-rough method is used to remove unusable or inaccurate samples. In the second stage, i.e. feature selection, a consistency-based feature selection method is combined with a reranking algorithm. In the third stage, the classification is performed using the fuzzy-rough nearest neighbor algorithm. In [5], the authors proposed a fuzzy rule-based reasoning system. This system is developed in two stages. In the first one, the data is clustered into similar groups using Expectation Maximization (EM). Then, the fuzzy rules are generated using Classification and Regression Trees. The authors also incorporated Principal Component Analysis in order to avoid the problem of multicollinearity.

Some other works are based on Support Vector Machines (SVM). For example, in [6] the authors proposed a system based on a combination of the K-means clustering method and SVM. The K-means algorithm is applied separately to benign and malignant tumors in order to recognize possible hidden patterns. These patterns are then considered as new features for the SVM classifier.

Other works include Naïve Bayes classifiers [7], the Adaptive Neuro-Fuzzy Inference System (ANFIS) [8], and data mining techniques [9].

In this work, we propose a neural classifier for breast cancer based on a single hidden neural network trained using the Extreme Learning Machine (ELM) algorithm. The advantages of this training algorithm over standard back-propagation are as follows [10-12]. First, it is faster. Second, it can be used with non-differentiable activation functions. Third, it does not require setting a stopping criterion or a learning rate. Indeed, training only the output weights of a neural network is much simpler than training all weights [13]. Instead of using identical, fixed activation functions, we use different activation functions for the hidden neurons and treat the choice of their parameters as an optimization problem, which we solve using a genetic algorithm. Indeed, the importance of using tunable functions has been widely investigated in the literature, and it has been noted that the success of the network is linked to determining the optimal functions [14][15].

The rest of this paper is organized as follows. In Section 2, we briefly present the formulation of the original ELM. In Section 3, we describe the proposed classification system. In Section 4, we discuss the classification results obtained on the Wisconsin Diagnostic Breast Cancer database. Finally, Section 5 concludes the paper.

II. EXTREME LEARNING MACHINE

The ELM algorithm, introduced by Huang et al. [10], is a learning approach for Single Hidden Neural Networks (SHNN). This algorithm is based on the random initialization of the input weights and biases and the analytic calculation of the output weights. Therefore, the network can be trained in only a few
steps. Fig. 1 illustrates an example of an ELM-based neural network.

Figure 1. An example of ELM-NN architecture with $M$ inputs and $K$ outputs.

For an $M$-dimensional classification problem with $N$ training samples $(x^{(k)}, t^{(k)})$, $k = 1{:}N$, where $x^{(k)} \in \mathbb{R}^{M}$ and $t^{(k)} \in \mathbb{R}^{K}$, an ELM-based FFNN with activation function $f(\cdot)$ and $L$ hidden neurons is given by [10]:

$$\sum_{i=1}^{L} \beta_i \, f\left(w_i \cdot x^{(k)} + b_i\right) = t^{(k)}, \quad k = 1{:}N \qquad (1)$$

where $w_i = [w_{i1} \ldots w_{iM}]^{T}$ is the input weight vector that connects the input neurons to the $i$-th neuron of the hidden layer, $b_i$ is the bias of the $i$-th hidden neuron, and $\beta_i = [\beta_{i1} \ldots \beta_{iK}]^{T}$ is the weight vector that connects the $i$-th neuron of the hidden layer to the output layer. This can be written in matrix form as:

$$H \beta = T \qquad (2)$$

where $H$ is the $N \times L$ hidden-layer output matrix, $\beta$ is the output weight matrix, and $T$ is the target matrix. The output weights are then obtained as:

$$\beta = H^{\dagger} T \qquad (3)$$

where $H^{\dagger}$ is the Moore-Penrose generalized inverse of the matrix $H$.

Step 3: Analytic calculation of the output weights using: $\beta = H^{\dagger} T$.

III. THE PROPOSED METHOD

The aim of this work is to introduce a system for classifying breast cancer based on an enhanced single hidden neural network trained using the ELM algorithm. The proposed enhancement concerns the activation functions. Indeed, the activation functions play an important role in the performance of neural networks. This matter has been discussed in several works [10-15]. It was noted that: “networks with any bounded piecewise of a continuous and non-constant function can approximate any continuous objective functions and can separate arbitrary disconnected regions of any shapes” [10-12]. On the other hand, it was noted that a network with different types of activation functions has better generalization capacities [14][15]. In this work, we use sigmoid activation functions, which are the most commonly used. The sigmoid function is given by:

$$f(x) = \frac{1}{1 + \exp\left(-(a_s \cdot x + b_s)\right)} \qquad (4)$$

In order to enhance the performance of the neural classifier, we propose using different sigmoid functions for the hidden neurons. More precisely, we propose using different values of the parameters $a_s$ and $b_s$. Fig. 2 parts (a) and (b) illustrate the effect of the values of $a_s$ and $b_s$, respectively, on the shape of the sigmoid function.

[Figure 2: curves of f(x) for different values of a_s (panel a: a_s = 1, 1.5, 2) and of b_s (panel b)]
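To make the formulation concrete, the following is a minimal sketch of ELM training with a separate sigmoid per hidden neuron, as described in Sections II and III. It is an illustration under stated assumptions, not the authors' implementation: the function names (`tunable_sigmoid`, `elm_train`, `elm_predict`) are hypothetical, the input weights and biases are drawn uniformly from [-1, 1], the data is a synthetic two-class toy set rather than the Wisconsin Diagnostic Breast Cancer database, and the parameters a_s and b_s are drawn at random here instead of being optimized by a genetic algorithm. The sigmoid is assumed to be applied to each hidden neuron's net input.

```python
import numpy as np

def tunable_sigmoid(z, a_s, b_s):
    """Sigmoid of Eq. (4): f(z) = 1 / (1 + exp(-(a_s * z + b_s))).
    a_s and b_s may be arrays with one entry per hidden neuron."""
    return 1.0 / (1.0 + np.exp(-(a_s * z + b_s)))

def elm_train(X, T, n_hidden, a_s, b_s, rng):
    """ELM training: random input weights and biases, analytic output weights."""
    n_features = X.shape[1]
    # Assumption: uniform initialization in [-1, 1]
    W = rng.uniform(-1.0, 1.0, size=(n_features, n_hidden))
    b = rng.uniform(-1.0, 1.0, size=n_hidden)
    # Hidden-layer output matrix H (one row per sample, one column per neuron)
    H = tunable_sigmoid(X @ W + b, a_s, b_s)
    # Eq. (3): beta = pinv(H) @ T, using the Moore-Penrose pseudoinverse
    beta = np.linalg.pinv(H) @ T
    return W, b, beta

def elm_predict(X, W, b, beta, a_s, b_s):
    """Network output for the samples in X."""
    return tunable_sigmoid(X @ W + b, a_s, b_s) @ beta

# Synthetic two-class toy data (NOT the WDBC database)
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-1.0, 0.5, size=(50, 4)),
               rng.normal(+1.0, 0.5, size=(50, 4))])
labels = np.repeat([0, 1], 50)
T = np.eye(2)[labels]  # one-hot targets

n_hidden = 20
# Hypothetical per-neuron sigmoid parameters; the paper tunes these with a GA
a_s = rng.uniform(0.5, 2.0, size=n_hidden)
b_s = rng.uniform(-1.0, 1.0, size=n_hidden)

W, b, beta = elm_train(X, T, n_hidden, a_s, b_s, rng)
pred = elm_predict(X, W, b, beta, a_s, b_s).argmax(axis=1)
train_accuracy = float((pred == labels).mean())
print(f"training accuracy: {train_accuracy:.2f}")
```

In this sketch, a genetic algorithm would treat the vectors `a_s` and `b_s` as the chromosome and use a validation score of the trained ELM as the fitness; only the output weights `beta` are fitted, and they are obtained in a single pseudoinverse step.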