
Evolving Intelligent System for Classification of Twitter Data

Dr. Yossra Hussain Ali Dr. Nuha Jameel Ibrahim Mohammed Abdul Jaleel
Computer Science Department, University of Technology, Baghdad, Iraq

Abstract
Our time is characterized by tremendous progress in communication technology and by social
networking sites of many types and forms. These tools have emerged as a cultural achievement
created by creative minds through advanced technology. Social media are modern web-based
applications for communication and interaction between people through audio, written, and video
messages. These tools build and sustain living communities around the world, and people use them
to share their interests and activities. Twitter is a social media site where people communicate
through tweets: a service that enables friends, family, and co-workers to stay in touch through the
exchange of quick, frequent messages. People publish tweets on their profiles and send them to
their followers to express their thoughts and opinions about events in the world, so it is important
to study and categorize these tweets. In this research, an evolving intelligent system is used to
address the problem of text classification. The inputs of the classification system are a set of
features extracted from each tweet, and the output is a classification decision: the degree of
relevance of the tweet to a specified event, indicating whether it is relevant or irrelevant. The
results are compared with a keyword search method and a fuzzy logic based method in terms of
incremental rate and correctness rate. The results show that the evolving intelligent system is more
suitable for tweet classification than the fuzzy logic method and the keyword search method.
Keywords- Social media; Text classification; Evolving Intelligent System

‫الخالصة‬
.‫ وصفحات التواصل االجتماعي من مختلف األنواع والنماذج‬،‫يتميز عصرنا الحالي بالتقدم الهائل في تكنولوجيا االتصاالت‬
‫ وسائل اإلعالم االجتماعية هي تطبيقات‬.‫وقد برزت هذه األدوات كإنجاز خلقته العقول المبدعة من خالل التكنولوجيا المتقدمة‬
‫ تقوم هذه‬.‫حديثة تعتمد على الويب للتواصل والتفاعل بين البشر من خالل الرسائل الصوتية والرسائل المكتوبة ورسائل الفيديو‬
،‫ يشارك الناس اهتماماتهم وأنشطتهم في مواقع التواصل االجتماعي‬.‫الوسائل بتفعيل التواصل بين المجتمعات في جميع أنحاء العالم‬
‫ موقع تويتر خدمة تمكن األصدقاء والعائلة وزمالء العمل من التواصل والبقاء على‬.‫حيث يتواصل الناس من خالل التغريدات‬
‫ يقوم األشخاص بنشر تغريداتهم على ملفهم الشخصي وإرسالها الى متابعيه‬.‫اتصال من خالل تبادل التغريدات السريعة والمتكررة‬
‫ في هذا البحث تم استخدم‬.‫ من المهم دراسة هذه التغريدات وتصنيفها‬.‫للتعبير عن أفكارهم وآرائهم حول األحداث في هذا العالم‬
‫ ناتج هذا النظام هو قرار‬.‫ مدخالت النظام هذا عبارة عن مجموعة من الميزات التي تستخرج من كل تغريدة‬.‫نظام ذكي متطور‬
‫ وهي درجة ارتباط كل رسالة بحدث معين حيث يتم تحديد درجة االنتماء إلى الحدث المرغوب فيه إذا‬،‫التصنيف لكل تغريدة‬
‫ طريقة تستند إلى منطق غامض للتمييز‬،‫ النتائج مقارنة مع طريقة البحث عن الكلمات الرئيسية‬.‫كانت ذات صلة أو غير ذات صلة‬
‫ تظهر النتيجة أن هذا النظام أكثر مالءمة لتصنيف التغريدات من طريقة الكلمات‬.‫على أساس شروط معدل تصحيح ومعدل متزايد‬
‫الرئيسية وطريقة المنطق الضبابي‬
1. INTRODUCTION
Social media is a convenient space for people to express their opinions about particular events
and to communicate with each other. Tweets on Twitter carry features related to users' thoughts
and opinions concerning those events, and it is important to categorize and select them. Twitter
also contains noisy data that dilutes the extracted data, decreases its utility, and in turn affects
the classification process [1].
Text classification is an essential problem for numerous applications, such as spam detection,
smart replies, and sentiment analysis. It has been studied widely in the past few years, and various
methods have been used to solve it. Text classification aims to assign documents to one or more
categories. If a document is assigned to only one class, the task is called “single-label”; if a
document can be assigned to more than one class, it is called “multi-label” [2]. Most methods
represent a text as a vector in order to classify it. This vector may contain the frequency of each
word in the text, or it may be more sophisticated and represent several features extracted from
the text [3].
Evolving intelligent systems (EISs) are used to develop online algorithms that are close to the
theoretical optimum, work in real time, and are appropriate for unpredictable environments and
non-stationary problems. EISs are characterized by evolving their structure and adapting their
parameters gradually, online and in real time. They are based on adaptive algorithms that help
raise the ‘intelligence quotient’ of a system [4].
In this work, an evolving intelligent system is designed to classify tweets from Twitter data. A
set of features is extracted from each tweet. These features are the inputs to a classification
procedure that uses fuzzy logic and a genetic algorithm to classify tweets according to their
relevance.

2. RELATED WORKS
In 2014, Caragea et al. [5] proposed a sentiment classification method for users' tweets during
Hurricane Sandy, using a Naïve Bayes classifier and a Support Vector Machine (SVM) combined
with the SentiStrength algorithm, and visualized these sentiments on a geographical map centered
on the hurricane. In 2014, Salari et al. [6] proposed a classification procedure using an artificial
neural network, a genetic algorithm, and the k-Nearest Neighbor algorithm. The purpose of this
procedure is to obtain the best feature vector. First, feature-ranking methods, such as class
separability criteria and the feature discriminant ratio, are used to select features. Second, the
resulting arrays of best-ranked features are used to produce optimal feature arrays as the initial
population of the genetic algorithm. Third, the classification process proceeds with genetic
algorithms that select optimal feature arrays using a modified k-Nearest Neighbor method and an
improved backpropagation neural network. In 2016, Spielhofer et al. [7] suggested that the
problems of removing irrelevant data and reducing noise are similar to email spam filtering. They
trained a Naïve Bayes classifier to detect relevant data. In 2016, Jiang et al. [8] presented an
improved approach called deep feature weighting Naïve Bayes, which uses Maximum Likelihood
Estimation to compute the prior probability and the conditional probability. In 2016, Prusa et al.
[9] used Convolutional Neural Networks (CNNs) together with a new encoding approach for text
classification. CNNs are primarily used in image processing; the new encoding converts text data
into an image-like representation so that a CNN can be used as a text classifier. Because text data
has high feature dimensionality, feature selection is commonly applied in text classification.
Feature selection aims to find the smallest subset of the original data that reduces computation
time and improves text classification. In 2016, Bidi et al. [10] used a Genetic Algorithm (GA) to
perform feature selection. This method pursues two objectives: first, to search for a feature subset
for which classifier performance is optimal; second, to find the feature subset with the smallest
dimensionality that achieves higher classification accuracy. To assess performance, three
classifiers were chosen: Nearest Neighbors, Support Vector Machine, and Naïve Bayes. In 2017,
Sathe et al. [11] proposed a sentiment classification algorithm using fuzzy logic combined with a
neural network.

3. PROPOSED SYSTEM
This section details the evolving intelligent system, which is based on fuzzy logic and a
genetic algorithm. First, part of the collected data is manually classified as training data. The
initial phase is preprocessing, in which each tweet is cleaned of the additions that affect the
classification procedure. Then, in the feature extraction phase, seven features are extracted from
each tweet (extended with four more in Section 3.3) and used as input to the classification
procedure. The classification procedure goes through three steps. Fuzzification converts real inputs
into fuzzy inputs with degrees of membership by means of membership functions. The trapezoidal
membership function is used because it is precise, simple, and widely used. The evolving process
takes place in this step: a genetic algorithm generates new membership degrees based on the
previous ones. Inference is the second step; it maps inputs to outputs and uses IF-THEN rules to
convert the fuzzy input into a fuzzy output. The last step is defuzzification, which produces a real
output. There are numerous defuzzification functions, for example centroid, bisector, mean of
maximum (MOM), and largest of maximum (LOM). Figure (1) shows the block diagram of this
evolving intelligent system.
[Figure (1): Block diagram of the evolving intelligent system. Data collection is split into
training data and test data. Both pass through preprocessing (hashtag processing, tokenization,
stop word removal, stemming, lemmatization, POS) and feature extraction; on the training side the
50 most frequently used words are extracted and further fuzzy rules are derived, alongside fuzzy
rules generated manually. The classification procedure consists of fuzzification (with generation
of new membership degrees), inference, and defuzzification, and yields the classification result.]


3.1 DATA COLLECTION
The underlying data contain more than one million tweets, collected from Twitter through the
Twitter API ("Application Programming Interface"). Twitter is one of the essential social media
tools; it lets users express their views on specific issues and events in tweets of 140 characters
or fewer. The data were gathered during the period from 27 October 2012 to 7 November 2012. Each
record consists of location, timestamp, date, and text. The data are filtered to keep only the text,
which is then preprocessed; features are extracted from it and the tweets are classified.
After data collection, 1000 tweets are chosen at random from the preliminary data as training
data. Tweets are users' thoughts and opinions, so they do not contain contextual data. We
therefore manually label a set of tweets to be used as training data for the classification
procedure. Each tweet is labelled with zero or one to indicate that it is irrelevant or relevant.
Each tweet also has a cumulative score ranging from zero to 15. Four score intervals are defined to
describe the level of relevance of a tweet: irrelevance L1 [0, 5), low relevance L2 [5, 9),
moderate relevance L3 [9, 12), and high relevance L4 [12, 16].
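The four score intervals can be written as a small lookup. The following is only an illustrative sketch: the function name `relevance_level` is hypothetical, and how the cumulative score itself is assigned during manual labelling is not specified in this paper.

```python
def relevance_level(score):
    """Map a cumulative tweet score to its relevance level L1..L4."""
    if not 0 <= score <= 16:
        raise ValueError("score outside the defined intervals")
    if score < 5:
        return "L1"  # irrelevance, [0, 5)
    if score < 9:
        return "L2"  # low relevance, [5, 9)
    if score < 12:
        return "L3"  # moderate relevance, [9, 12)
    return "L4"      # high relevance, [12, 16]


if __name__ == "__main__":
    for s in (0, 5, 11.5, 15):
        print(s, relevance_level(s))
```

Because the intervals are half-open, a score that sits exactly on a boundary (5, 9, or 12) falls into the higher level.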
3.2 DATA PREPROCESSING
Tweets contain content that is unhelpful for text classification, such as URLs, tags, numbers,
and stop words; for instance, 'Tropical storm Sandy! #Hurricane (Bonnier) http:/twittter.com'. It
is important to remove or handle these additions so that they do not affect the classification
procedure. In this system, pattern matching is used to eliminate them. For instance, a URL, which
has a fixed pattern beginning with "http:/", is deleted when it is found, and numbers, punctuation
marks, and special characters are deleted as well. The most important step in preprocessing is the
removal of stop words: these are the most frequently used words in English, and they are rarely
useful in the classification procedure. A hashtag is a word or sequence of words that is not a
familiar dictionary word and is written without spaces, such as #HurricaneSandy. A human reader
understands it directly, but a program cannot recognize the words in it. In this case, the
substrings of the hashtag are compared with actual English words. For example, "Sandy" is
recognized after the fifth operation on #SandyHurricane, because the term is examined as the
prefixes "S", "Sa", "San", "Sand", and "Sandy", in that order. Furthermore, stemming,
lemmatization, and Part of Speech (POS) tagging are applied to each word in the tweet for greater
flexibility and accuracy in classification. The last step in preprocessing is to convert every word
to lower case, so that the same word written in upper case in one place and in lower case in
another is not treated as two different words [12][13].
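The pattern-matching and hashtag steps can be sketched as follows. This is a minimal illustration under stated assumptions, not the system's implementation: `VOCAB` and `STOP_WORDS` are tiny hypothetical word lists standing in for a full English dictionary and stop word list, the function names are invented for the example, and stemming, lemmatization, and POS tagging are left out.

```python
import re

# Hypothetical mini-vocabulary and stop word list, for illustration only.
VOCAB = {"hurricane", "sandy", "storm", "tropical"}
STOP_WORDS = {"the", "a", "is", "to", "and", "of", "in"}


def split_hashtag(tag, vocab=VOCAB):
    """Split a hashtag into known words by growing prefixes one character at a time."""
    text = tag.lstrip("#").lower()
    words, start = [], 0
    while start < len(text):
        # Try "s", "sa", "san", "sand", "sandy", ... until a known word appears.
        for end in range(start + 1, len(text) + 1):
            if text[start:end] in vocab:
                words.append(text[start:end])
                start = end
                break
        else:
            # No known word starts here; keep the remainder unchanged.
            words.append(text[start:])
            break
    return words


def preprocess(tweet):
    """Remove URLs, numbers, marks and stop words; split hashtags; lower-case."""
    tweet = re.sub(r"http\S*", " ", tweet)  # delete anything starting with "http"
    tokens = []
    for raw in tweet.split():
        if raw.startswith("#"):
            tokens.extend(split_hashtag(raw))
        else:
            tokens.append(raw.lower())
    cleaned = [re.sub(r"[^a-z]", "", t) for t in tokens]  # drop digits and marks
    return [t for t in cleaned if t and t not in STOP_WORDS]


if __name__ == "__main__":
    print(split_hashtag("#SandyHurricane"))
    print(preprocess("Tropical storm Sandy! #HurricaneSandy http://t.co/x 2012"))
```

The prefix loop accepts the first known word it meets, which mirrors the "Sandy after the fifth operation" example; a vocabulary that also contained shorter prefixes such as "sand" would need a longest-match or backtracking strategy instead.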
3.3 FEATURE EXTRACTION
Feature extraction is a critical step in the classification process. Some words appear more
often than others. After the four levels L1, L2, L3, and L4 are defined, the tweets that belong to
L2, L3, and L4 are selected from the training data and processed. The frequency of each word in
these tweets is computed, and the 50 most frequently repeated words are picked. For each word i,
the importance $a_i$ is defined as:
$a_i = A_i / B_i$ (1)
where $A_i$ is the number of occurrences of the word in the tweets that belong to L2, L3, and L4,
and $B_i$ is the number of occurrences of the word in all tweets, i.e., those that belong to L1, L2,
L3, and L4; $a_i$ thus represents the importance rate of word i. In the next step, the most
frequently used words are sorted by $a_i$ to form the list D, and the similarity between each word
in D and each word in a tweet is computed. The similarity computation is represented by the
operator $\otimes$. A tweet has n words; $T_i$ denotes the ith word in the tweet, with
$i \in \{1, \dots, n\}$, and $C_k$ denotes the kth word in list D, with $k \in \{1, \dots, 50\}$.
The highest similarity score is chosen as the score $S_i$ of word $T_i$, defined as:
$S_i = \max_k \left( c_k \times T_i \otimes C_k \right), \quad i \in [1, n]$
$S_i$ is the basic value from which the feature vector of each tweet is extracted. The features
are as follows:
1. $G_j$ denotes the highest word score in the jth tweet:
$G_j = \max_i S_i$
2. $K_j$ denotes the score of the jth tweet, i.e., the accumulated score of its words:
$K_j = \sum_i S_i$
3. $N_j$ denotes the length of the tweet:
$N_j = n$
where n is the number of words in the tweet.
4. $M_j$ denotes the number of frequently used words in the tweet, i.e., the number of words in
the tweet that also appear in list L. List L contains the frequently used words and is compared
against all tweets.
5. $W_j$ denotes the weight of the tweet:
$W_j = K_j / N_j$
i.e., the mean score of its words.
6. $X_j$ denotes the weight of frequently used words in the tweet:
$X_j = M_j / N_j$
i.e., the proportion of frequently used words among all words in the tweet.
7. $V_j$ denotes the number of patterns in the jth tweet. Besides the 50 important words in the
list, the training data contain useful expressions that do not appear in the list as single words.
For example, the terms 'not protected' and 'not expected' are more informative than single terms
such as 'not' or 'safe'. $V_j$ therefore denotes the number of such patterns in a tweet [12].
We extracted four extra features to obtain a more precise outcome and to classify more tweets.
The list D is partitioned into three equally sized subgroups, denoted Z1, Z2, and Z3, with
different weights $\Theta_1$, $\Theta_2$, and $\Theta_3$, respectively. These features are defined as:
8. Z1: the most used words in list D
9. Z2: the regularly used words in list D
10. Z3: the less commonly used words in list D
$\Theta_k = \begin{cases} \Theta_1 & \text{for } Z_1,\ k \in [1, 17) \\ \Theta_2 & \text{for } Z_2,\ k \in [17, 33) \\ \Theta_3 & \text{for } Z_3,\ k \in [33, 50] \end{cases}$
11. SW: the number of words that are not found in D but are frequently used in the training data
related to Hurricane Sandy
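The word importance of Equation (1) and the first six features can be illustrated with a short sketch. Several points are assumptions made only for this example: tweets are already tokenized, the similarity operator is taken to be an exact match (1 if the words are equal, 0 otherwise), the weight in the score formula is taken to be the importance of the matched list word, and the same word list serves as both list D and list L. The function names are hypothetical.

```python
from collections import Counter


def word_importance(tweets, levels, top_n=50):
    """Return {word: A_i / B_i} for the top_n most frequent words in L2-L4 tweets.

    tweets: list of token lists; levels: parallel list of "L1".."L4" labels.
    """
    relevant = Counter()  # A_i: occurrences in tweets of levels L2, L3, L4
    overall = Counter()   # B_i: occurrences in all tweets (L1..L4)
    for tokens, level in zip(tweets, levels):
        overall.update(tokens)
        if level != "L1":
            relevant.update(tokens)
    frequent = [word for word, _ in relevant.most_common(top_n)]
    return {word: relevant[word] / overall[word] for word in frequent}


def tweet_features(tokens, importance):
    """Compute features G, K, N, M, W, X for one tokenized tweet."""
    # With exact matching, S_i is the importance of the word if listed, else 0.
    scores = [importance.get(t, 0.0) for t in tokens]
    n = len(tokens)
    m = sum(1 for t in tokens if t in importance)
    k = sum(scores)
    return {
        "G": max(scores, default=0.0),  # highest word score
        "K": k,                         # accumulated word score
        "N": n,                         # tweet length in words
        "M": m,                         # number of frequently used words
        "W": k / n if n else 0.0,       # mean word score
        "X": m / n if n else 0.0,       # share of frequently used words
    }


if __name__ == "__main__":
    tweets = [["sandy", "storm"], ["sandy", "lunch"], ["lunch"]]
    levels = ["L4", "L2", "L1"]
    imp = word_importance(tweets, levels)
    print(imp)
    print(tweet_features(["sandy", "lunch", "today"], imp))
```

In the toy data, "lunch" appears once in a relevant tweet and twice overall, so its importance is 0.5, while "sandy" appears only in relevant tweets and gets 1.0.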
3.4 CLASSIFICATION PROCEDURE
Figure (2) shows the framework of the classification procedure. After feature extraction, the
feature vector contains eleven values for every tweet. These eleven features are the input to the
procedure. The classification procedure passes through three steps: fuzzification, inference, and
defuzzification, as shown in Algorithm (1).

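[Figure (2): Framework of the classification procedure. Inputs pass through fuzzification,
inference, and defuzzification to the output. A genetic algorithm (crossover, mutation,
reproduction) generates new membership degrees for the fuzzification step, and a set of rules
feeds the inference step.]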


Algorithm (1): Steps of classification procedure
Input: Predefined classified training data; feature vector of eleven values for each tweet.
Output: Decision of classification.
Start
Step 1: Generate fuzzy rules from the predefined classified training data.
Step 2: Fuzzification process
   2.1 Select the membership function
   2.2 For each value in the feature vector
         Compute the degree of membership using the membership functions
   2.3 Map the crisp (real) input to a fuzzy set
   2.4 Generate new membership degrees using the genetic algorithm
Step 3: Inference process
   3.1 Write the set of IF-THEN fuzzy rules
   3.2 Make the decision based on these fuzzy rules in addition to the fuzzy rules extracted in
       Step 1
Step 4: Defuzzification process
   4.1 Select the defuzzification function
   4.2 Transform the fuzzy results into a real value
Step 5: Print the decision of classification and the real value of the result
End
3.4.1 FUZZIFICATION
Fuzzification maps crisp (real-valued) inputs to fuzzy sets. The degree of membership of every
feature is calculated using membership functions. For every input and output variable, at least
three membership functions (MFs) are defined, for instance low, moderate, and high. The shape of a
membership function can be triangular, Gaussian, singleton, or trapezoidal. The output of these
functions is always restricted to the range between zero and one [14]. In this research, the
triangular membership function is used because it is precise and widely used. The eleven input
linguistic variables and the output variable are listed in Table 1, together with the parameter
ranges of their linguistic values. For instance, five linguistic values are defined for the
variable G: very low [0 – 2.26], low [0.2 – 0.4], moderate [0.3 – 0.6], high [0.55 – 0.8], and very
high [0.7 – 1]. After the membership degrees are computed, a genetic algorithm generates new
membership degrees by means of genetic operators such as crossover, mutation, and reproduction, as
shown in Algorithm (2).
TABLE 1. INPUT AND OUTPUT PARAMETERS
Variable  No. of feature  Linguistic variable  Range    Linguistic value     Parameter
Input     1               S                    0 - 1    Very Low             0 - 0.36
                                                        Low                  0.16 - 0.46
                                                        Moderate             0.26 - 0.56
                                                        High                 0.5 - 0.75
                                                        Very High            0.65 - 1
          2               F                    0 - 20   Very Low             0 - 2.5
                                                        Low                  2 - 7
                                                        Moderate             4 - 10
                                                        High                 7 - 15
                                                        Very High            10 - 20
          3               M                    0 - 20   Low                  0 - 7
                                                        Moderate             5 - 14
                                                        High                 12 - 20
          4               I                    0 - 10   Low                  0 - 3
                                                        Moderate             2 - 7
                                                        High                 4 - 10
          5               G                    0 - 1    Very Low             0 - 2.26
                                                        Low                  0.2 - 0.4
                                                        Moderate             0.3 - 0.6
                                                        High                 0.55 - 0.8
                                                        Very High            0.7 - 1
          6               E                    0 - 1    Low                  0 - 0.12
                                                        Moderate             0.06 - 0.23
                                                        High                 0.16 - 1
          7               V                    0 - 10   Low                  0 - 4
                                                        Moderate             3 - 7
                                                        High                 6 - 10
          8               Z1                   0 - 20   Low                  0 - 2
                                                        Moderate             1 - 5
                                                        High                 4 - 20
          9               Z2                   0 - 20   Low                  0 - 2
                                                        Moderate             1 - 5
                                                        High                 4 - 20
          10              Z3                   0 - 20   Low                  0 - 2
                                                        Moderate             1 - 5
                                                        High                 4 - 20
          11              SW                   0 - 20   Low                  0 - 2
                                                        Moderate             1 - 5
                                                        High                 4 - 20
Output                    R                    0 - 100  Irrelevance/DK       0 - 40
                                                        Low Relevance        30 - 65
                                                        Moderate Relevance   50 - 85
                                                        High Relevance       75 - 100
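As an illustration of the fuzzification step, the sketch below computes membership degrees for the variable S using the parameter ranges of Table 1. It assumes, purely for the example, that each range is the support of a symmetric triangular function whose peak lies at the midpoint of the range; the table gives only two numbers per linguistic value, so the actual shape parameters may differ. The names `triangular`, `S_SETS`, and `fuzzify` are hypothetical.

```python
def triangular(x, left, right):
    """Triangular membership with support [left, right] and peak at the midpoint."""
    peak = (left + right) / 2.0
    if x <= left or x >= right:
        return 0.0
    if x <= peak:
        return (x - left) / (peak - left)
    return (right - x) / (right - peak)


# Parameter ranges of variable S, copied from Table 1.
S_SETS = {
    "very low": (0.0, 0.36),
    "low": (0.16, 0.46),
    "moderate": (0.26, 0.56),
    "high": (0.5, 0.75),
    "very high": (0.65, 1.0),
}


def fuzzify(x, sets):
    """Return the membership degree of x in every linguistic value."""
    return {label: triangular(x, lo, hi) for label, (lo, hi) in sets.items()}


if __name__ == "__main__":
    for label, degree in fuzzify(0.31, S_SETS).items():
        print(f"{label}: {degree:.3f}")
```

Because neighbouring ranges overlap, a single input usually belongs to two or three linguistic values at once, and these degrees are what the genetic algorithm subsequently adjusts. In this sketch a value lying exactly on the end of a range receives degree 0.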
Algorithm (2): Genetic Algorithm
Input: Initial population of membership degrees
Output: New membership degrees
Evaluate the fitness of the membership degrees
While not (termination conditions) Do
Begin
   Step 1: Select a pair of membership degrees
   Step 2: Crossover between the membership degrees (recombine)
   Step 3: Mutate the membership degrees
   Step 4: Evaluate the fitness of the modified membership degrees
   Step 5: Generate a new population of membership degrees
End
Genetic algorithm (GA)
A genetic algorithm is a search procedure based on the mechanics of natural selection. A
generational GA creates an initial generation G(0) and, for each generation G(t), generates a new
one G(t+1). The GA maintains a population of candidate solutions and makes it evolve by
iteratively applying a set of operators. The most commonly applied genetic operators are:
1- Crossover
Data from two parents (two membership degrees) are combined to form children (new
membership degrees). Crossover takes two chromosomes and swaps all genes after a randomly
selected crossover point to produce new chromosomes. This operator does not add new genetic data
but manipulates the genetic data already present in the population's chromosomes to obtain new,
fitter children. It works as follows:
1- Select two parents from the MP (the best two chromosomes).
2- Randomly choose a position K between two genes in the range (1, M-1), where
M is the length of the chromosome.
3- Swap the genes after K between the two parents. The output is both children or
the fitter one.
2- Mutation
Mutation is the process of adding new genetic data to chromosomes by changing some genes.
It can be applied to:
1- Chromosomes selected from the MD.
2- Chromosomes that have already been subjected to crossover.
3- Reproduction
After the genetic data already present in the MP have been manipulated, the reproduction
operator uses the fitness function to add new genetic data to the population of chromosomes by
combining strong parents with strong children; the aim is to obtain new, fitter children (the best
new membership degrees). Reproduction imitates natural selection.
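The three operators can be put together in a compact sketch. It is an illustration under assumptions rather than the system's implementation: a chromosome is taken to be a list of membership degrees in [0, 1], the fitness function is supplied by the caller (the paper does not state one, so the example uses a hypothetical target vector), and the mutation rate and step size are arbitrary.

```python
import random


def crossover(parent1, parent2, rng):
    """Single-point crossover: swap all genes after a random point K in [1, M-1]."""
    k = rng.randint(1, len(parent1) - 1)
    return parent1[:k] + parent2[k:], parent2[:k] + parent1[k:]


def mutate(chromosome, rng, rate=0.1, step=0.05):
    """Perturb some genes slightly, keeping each value inside [0, 1]."""
    return [
        min(1.0, max(0.0, g + rng.uniform(-step, step))) if rng.random() < rate else g
        for g in chromosome
    ]


def evolve(population, fitness, generations=20, seed=0):
    """Run a small generational GA and return the fittest chromosome found."""
    rng = random.Random(seed)
    pop = [list(c) for c in population]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        child1, child2 = crossover(pop[0], pop[1], rng)  # best two as parents
        children = [mutate(child1, rng), mutate(child2, rng)]
        # Reproduction: keep the strongest of parents and children together.
        pop = sorted(pop + children, key=fitness, reverse=True)[: len(population)]
    return pop[0]


if __name__ == "__main__":
    target = [0.2, 0.8, 0.5]  # hypothetical ideal membership degrees

    def fitness(chromosome):
        return -sum(abs(g - t) for g, t in zip(chromosome, target))

    start = [[0.1, 0.1, 0.1], [0.9, 0.9, 0.9], [0.5, 0.5, 0.5], [0.3, 0.7, 0.2]]
    print(evolve(start, fitness))
```

Since the survivors of each generation are the fittest of parents and children combined, the best fitness in the population never decreases from one generation to the next.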
3.4.2 INFERENCE
After fuzzification, the inference process maps the inputs to the output and gives a
classification decision. Fuzzy rules are a collection of linguistic expressions. The inference
process uses IF-THEN rules to convert the fuzzy input into a fuzzy output. In this system, general
rules written manually are used, and the predefined training data are used to generate another set
of fuzzy rules. As a result, the outcomes are more precise than those of the methods compared
against, and more tweets are classified than with those techniques. Some of these rules are written
as follows:
1) If S: very high ∨ high ∧ I: high ∧ Z1: high → R: high relevance.
2) If S: high ∧ Z3: moderate ∧ L3: low → R: moderate relevance.
3) If L: moderate ∨ E: low ∧ G: low → R: low relevance.
4) If M: high ∧ S: very low ∧ Z1, Z2, Z3 = zero ∧ SW: low → R: irrelevance/DK.
5) If Z1: high ∧ Z2: high ∧ SW: moderate → R: high relevance.
These rules can be explained as follows. When the frequently used words and the word scores in
a tweet are high and the number of its words in list D is high, the tweet has a high degree of
relevance to Hurricane Sandy. A tweet has moderate relevance when the score of its words is high,
its length is low, and a moderate number of its words are among the 50 most used words in list D;
this indicates that the user posted a short tweet with simple, important words. If the weight of
the tweet is low, the weight of its frequently used words is low, and the number of important
words from list D shows that there are few essential words, then the tweet has low relevance to
Sandy. Tweets are classified as irrelevant when important words linked to Hurricane Sandy are not
found. When the proportion of words in list D is high and the number of words that relate to
Hurricane Sandy in the training data but are not in the list of the most important words is
moderate, the tweet is highly relevant to Hurricane Sandy.
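To show how such rules turn membership degrees into a firing strength, the sketch below evaluates rules 1 and 5. It assumes the common min/max interpretation (AND as minimum, OR as maximum, and the maximum over rules that share a conclusion), which this paper does not state explicitly, and it reads rule 1 as "(S is very high or S is high) and I is high and Z1 is high". The function name and the input layout are hypothetical.

```python
def fire_high_relevance(mu):
    """Firing strength of the 'high relevance' conclusion from rules 1 and 5.

    mu maps a variable name to {linguistic value: membership degree}.
    """
    # Rule 1: (S very high OR S high) AND I high AND Z1 high
    rule1 = min(
        max(mu["S"]["very high"], mu["S"]["high"]),
        mu["I"]["high"],
        mu["Z1"]["high"],
    )
    # Rule 5: Z1 high AND Z2 high AND SW moderate
    rule5 = min(mu["Z1"]["high"], mu["Z2"]["high"], mu["SW"]["moderate"])
    # Rules with the same conclusion are combined with the maximum.
    return max(rule1, rule5)


if __name__ == "__main__":
    degrees = {
        "S": {"very high": 0.2, "high": 0.7},
        "I": {"high": 0.6},
        "Z1": {"high": 0.9},
        "Z2": {"high": 0.3},
        "SW": {"moderate": 0.8},
    }
    print(fire_high_relevance(degrees))
```

With these example degrees, rule 1 fires at 0.6 and rule 5 at 0.3, so the "high relevance" output set is activated at 0.6.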
3.4.3 DEFUZZIFICATION
Defuzzification is the process of producing a quantifiable result in crisp logic. It is needed
to convert the fuzzy results into a real value, based on the fuzzy sets and the corresponding
membership degrees. Several defuzzification functions are considered in this research, such as
centroid, Center of Sums (COS), bisector, mean of maximum (MOM), smallest of maximum (SOM), and
First of Maxima (FOM). The output (R) is a single value defuzzified from the aggregated fuzzy set
of output values by the chosen defuzzification function [14].
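The difference between several of these functions can be seen on a sampled output set. The sketch below is a generic illustration, not the implementation used in this research: it assumes the aggregated output fuzzy set is given as sample points `xs` on the 0 to 100 relevance scale with membership degrees `mus`, and it covers only centroid, MOM, SOM, and LOM.

```python
def defuzzify(xs, mus, method="centroid"):
    """Convert a sampled fuzzy set (xs, mus) into a single crisp value."""
    if method == "centroid":
        total = sum(mus)
        # Membership-weighted average of the sample points.
        return sum(x * m for x, m in zip(xs, mus)) / total if total else 0.0
    peak = max(mus)
    maxima = [x for x, m in zip(xs, mus) if m == peak]
    if method == "mom":   # mean of maximum
        return sum(maxima) / len(maxima)
    if method == "som":   # smallest of maximum
        return min(maxima)
    if method == "lom":   # largest of maximum
        return max(maxima)
    raise ValueError(f"unknown method: {method}")


if __name__ == "__main__":
    xs = [0, 25, 50, 75, 100]
    mus = [0.0, 0.2, 0.6, 0.6, 0.2]
    for name in ("centroid", "mom", "som", "lom"):
        print(name, defuzzify(xs, mus, name))
```

Centroid uses the whole shape of the output set, whereas the maximum-based functions look only at the points with the highest membership, which is one reason the functions can give different crisp results for the same fuzzy output.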
4. EXPERIMENTAL RESULTS
Comparison of defuzzification functions
The essential and difficult task is to check the correctness rate. Note that users tend to
express different feelings and opinions, so the results vary. First, we use manually classified
training data composed of 600 tweets, divided into 300 irrelevant and 300 relevant (low, moderate,
and high relevance) tweets. We then examine the method with each defuzzification function and
compare these functions.

TABLE 2. Polar relevance problem's results

Function              Relationship   First dataset   Second dataset   Third dataset
Centroid              Irrelevant     99.8 %          99.7 %           100 %
                      Relevant       99.6 %          95 %             98.6 %
Bisector              Irrelevant     99.6 %          99.7 %           98.6 %
                      Relevant       99.8 %          95 %             97.2 %
Mean of Maximum       Irrelevant     99.8 %          99.8 %           100 %
                      Relevant       99.5 %          98 %             98 %
Smallest of Maximum   Irrelevant     99.7 %          99.8 %           100 %
                      Relevant       98.8 %          95 %             96.4 %
Largest of Maximum    Irrelevant     98.9 %          98.9 %           96 %
                      Relevant       99.7 %          95 %             97.4 %

TABLE 3. Four-degree relevance problem’s results

Function              Relationship   First dataset   Second dataset   Third dataset
Centroid              Irrelevant     99.8 %          99.7 %           100 %
                      Lowly          78.2 %          None             72 %
                      Moderately     72 %            80 %             70 %
                      Highly         100 %           100 %            98.7 %
Bisector              Irrelevant     99.6 %          99.7 %           98.6 %
                      Lowly          70.4 %          None             68.5 %
                      Moderately     95.9 %          79 %             49.6 %
                      Highly         69 %            59.3 %           69.5 %
Mean of Maximum       Irrelevant     99.8 %          99.8 %           100 %
                      Lowly          59.1 %          None             68 %
                      Moderately     59.7 %          79 %             89 %
                      Highly         98.7 %          59.6 %           69 %
Smallest of Maximum   Irrelevant     99.7 %          99.8 %           100 %
                      Lowly          29.8 %          None             35.5 %
                      Moderately     0.09 %          79 %             0.09 %
                      Highly         99.3 %          100 %            98 %
Largest of Maximum    Irrelevant     98.8 %          98.9 %           96 %
                      Lowly          59 %            None             54 %
                      Moderately     60 %            79 %             62.5 %
                      Highly         76 %            68.6 %           66.9 %
Three datasets are taken from the test data, each containing 200 tweets. They differ in the
ratio of irrelevant to relevant tweets, which is 1:9, 1:1, and 9:1, respectively, so the design
covers both balanced and unbalanced data. The defuzzification functions are compared on these
datasets in order to pick the best one. Note that the second dataset contains no low-relevance
example, so the corresponding table entries are marked "None". The results differ between
defuzzification functions, which is natural because the functions generally give different
results. Tables 2 and 3 show that the evolving intelligent system achieves very high accuracy on
the polar relevance problem. It also handles the four-degree relevance problem, though not as well
as the polar relevance problem. According to the results, centroid performs very well, whereas LOM
performs worse. Based on this comparison, we choose the centroid defuzzification function because
it gives better results than the other functions.
Comparison with keyword search method
Earlier studies [14-15] used the keyword search method to extract related tweets from the
primary dataset. The advantage of keyword search is that it is very precise, efficient, and simple
for highly relevant tweets. Its weakness is that it cannot extract a sufficient number of relevant
tweets. The comparison is based on five datasets of tweets from the test data. Table (4) shows the
comparison between the evolving intelligent system and the keyword search method in terms of
correctness rate and incremental rate. The correctness rate is
α = A / B × 100 %
where B is the number of related tweets extracted by each method and A is the number of correctly
categorized tweets in B. A is determined by manual checking and confirmed with the training data.
Likewise, the incremental rate (λ) describes how much more data the evolving intelligent system is
able to exploit than the well-known keyword search method. It is defined as:
λ = (Af – Ak) / Ak × 100 %
where Af is computed for this evolving intelligent system and Ak for the keyword search
method.
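Both rates are simple ratios, and a short worked example makes the definitions concrete. The sketch below uses the counts reported for data set 1 in Table 4 (160 tweets extracted and 158 correct for the evolving intelligent system, 96 correct for keyword search); the function names are hypothetical.

```python
def correctness_rate(correct, extracted):
    """alpha = A / B * 100, with A correct tweets among B extracted tweets."""
    return correct / extracted * 100.0


def incremental_rate(correct_system, correct_baseline):
    """lambda = (Af - Ak) / Ak * 100, relative gain in correctly extracted tweets."""
    return (correct_system - correct_baseline) / correct_baseline * 100.0


if __name__ == "__main__":
    # Data set 1 of Table 4
    print(f"alpha  = {correctness_rate(158, 160):.2f} %")
    print(f"lambda = {incremental_rate(158, 96):.3f} %")
```

For data set 1 this gives a correctness rate of 98.75 % and an incremental rate of about 64.583 %, matching the corresponding table row.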
The comparison between the keyword search method and this evolving intelligent system in terms
of correctness rate and incremental rate is shown in Table 4.
TABLE 4. Comparison results between evolving intelligent system and keyword search
(B: number of extracted tweets; A: number of correctly categorized tweets)
Data Set   Keyword search method        Evolving intelligent system     λ
           B      A      α              B      A      α
1          98     96     97.9 %         160    158    98.75 %           64.583 %
2          103    103    100 %          184    183    99.456 %          77.669 %
3          86     86     100 %          147    146    99.319 %          69.767 %
4          93     92     98.9 %         154    152    98.701 %          65.217 %
5          99     99     100 %          138    136    98.550 %          37.373 %
Manual review and testing of the results show that all the tweets extracted by the keyword
search method also appear in the output of the evolving intelligent system; that is, the result of
the keyword search method is a subset of the result of this system. In Table 4, the values of λ
show that the evolving intelligent system correctly retrieves more tweets than the keyword search
method.
Comparison with fuzzy logic based method
Table 5 shows the difference between this evolving intelligent system and the fuzzy logic method
for text classification [reference number missing].
TABLE 5. Comparison results between fuzzy logic based method and evolving intelligent system
(B: number of extracted tweets; A: number of correctly categorized tweets)
Data Set   Fuzzy logic based method     Evolving intelligent system     λ
           B      A      α              B      A      α
1          141    135    95.7 %         160    158    98.75 %           17.037 %
2          161    157    97.7 %         184    183    99.456 %          16.560 %
3          128    126    98.4 %         147    146    99.319 %          15.873 %
4          137    132    96.3 %         154    152    98.701 %          15.151 %
5          122    118    96.7 %         138    136    98.550 %          15.254 %

In summary, the evolving intelligent system extracts more tweets than the fuzzy logic
method and the keyword search method. In terms of incremental rate, it is more powerful than both.
In terms of correctness rate, the keyword search method performs somewhat better than the fuzzy
logic method, while the evolving intelligent system is better than the fuzzy logic method and
close to the keyword search method. Considering both criteria, the evolving intelligent system is
chosen in this research, where relevant tweets are highly required for the analysis step [14-15].
A high correctness rate together with a high quantity ensures more informative and useful results.
We find that the evolving intelligent system is better than the fuzzy logic based method because
it ensures both a high correctness rate and a larger number of relevant, accurately classified
tweets.

5. CONCLUSION
In this research, an evolving intelligent system is proposed for text classification of Twitter
data. Using a set of training data and test data, eleven features are extracted from every tweet
as inputs to the classification procedure. We compare the evolving intelligent system with two
methods: the keyword search method and the fuzzy logic method for text classification. The results
show that this system is more suitable for classifying tweets as relevant or irrelevant than the
fuzzy logic method and the keyword search method. Additionally, by comparing commonly used
defuzzification functions, we conclude that the centroid function is more effective than the other
functions. In future work, we aim to find a better way to classify text, for example by combining
neural networks with fuzzy logic and by using evolutionary algorithms to generate new membership
degrees in the fuzzification process or additional rules in the inference process.
REFERENCES
[1] C. Chen, D. Neal, and M. Zhou, “Understanding the Evolution of a Disaster A Framework
for Assessing Crisis in a System Environment (FACSE)”, Natural Hazards, vol. 65, no. 1, pp.
407-422, January, 2013.
[2] Rajni Jindal, Ruchika Malhotra, Abha Jain, "Techniques for text classification:
Literature review and current trends", Webology, Volume 12, Number 2, December 2015.

[3] M. Nogueira, O. Rezende, A. Camargo,” On the Use of Fuzzy Rules to Text Document
Classification”, International Conference on Hybrid Intelligent Systems, USA, August 2010.

[4] Plamen Angelov, Dimitar P. Filev, Nikola Kasabov, “Evolving Intelligent Systems:
Methodology and Applications”, IEEE, Ambleside, UK, September 2006.
[5] C. Caragea, A. Squicciarini, S. Stehle, K. Neppalli, and A. Tapia, “Mapping moods: geo-
mapped sentiment analysis during hurricane Sandy,” International Conference on Information
Systems for Crisis Response and Management (ISCRAM), pp. 642-651, May 2014.

[6] N. Salari, S. Shohaimi, F. Najafi, M. Nallappan, Karishnarajah, “A Novel Hybrid
Classification Model of Genetic Algorithms, Modified k-Nearest Neighbor and Developed
Backpropagation Neural Network”, PLOS ONE, November, 2014.

[7] T. Spielhofer, R. Greenlaw, D. Markham, and A. Hahne, “Data mining Twitter during the
UK floods: Investigating the potential use of social media in emergency management”, 3rd
International Conference on Information and Communication Technologies for Disaster
Management (ICT-DM), Vienna, Austria, pp. 1-6, December, 2016.

[8] Q. Jiang, W. Wang, X. Han, S. Zhang, X. Wang and C. Wang, “Deep feature weighting in
Naive Bayes for Chinese text classification”, International Conference on Cloud Computing and
Intelligence Systems (CCIS), Beijing, China, pp. 160-164, August, 2016.

[9] J. D. Prusa and T. M. Khoshgoftaar, “Designing a better data representation for deep neural
networks and text classification”, International Conference on Information Reuse and
Integration (IRI), IEEE, USA, pp. 411-416, July ,2016.

[10] N. Bidi and Z. Elberrichi, “Feature selection for text classification using genetic
algorithms”, International Conference on Modelling, Identification and Control (ICMIC),IEEE,
Algiers, Algeria, pp. 806-810, November, 2016.
[11] J. B. Sathe and M. P. Mali, “A hybrid Sentiment Classification method using Neural Network
and Fuzzy Logic,” IEEE, India, pp. 93-96, January 2017.

[12] KeYuan Wu, MengChu Zhou, Xiaoyu Sean Lu and Li Huang, “A Fuzzy Logic-Based Text
Classification Method for Social Media Data“, International Conference on Systems, IEEE,
October 2017.
[13] A. Kasun, M. Manic, and R. Hruska, “Optimal stop word selection for text mining in
critical infrastructure domain”, Resilience Week (RWS), Philadelphia, pp. 1-6, August 2015.
[14] H. Hellendoorn and C. Thomax, “Defuzzification in fuzzy controllers”, Journal of
Intelligent & Fuzzy Systems, vol. 1, no. 2, pp.109-123, 1993.
