A Data Mining-Based Approach To Predict Strain Situations in Hospital Emergency Department Systems

A DATA MINING-BASED APPROACH TO PREDICT
STRAIN SITUATIONS IN HOSPITAL EMERGENCY

DEPARTMENT SYSTEMS
S Benbelkacem, F Kadri, S Chaabane, B Atmani
To cite this version:

S Benbelkacem, F Kadri, S Chaabane, B Atmani. A DATA MINING-BASED APPROACH
TO PREDICT STRAIN SITUATIONS IN HOSPITAL EMERGENCY DEPARTMENT
SYSTEMS. 10ème Conférence Francophone de Modélisation, Optimisation et Simulation-
MOSIM’14, Nov 2014, Nancy, France. <hal-01081556>
HAL Id: hal-01081556

https://hal.archives-ouvertes.fr/hal-01081556
Submitted on 9 Nov 2014
HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est

archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents
entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non,
lished or not. The documents may come from émanant des établissements d’enseignement et de
teaching and research institutions in France or recherche français ou étrangers, des laboratoires
abroad, or from public or private research centers. publics ou privés.
A DATA MINING-BASED APPROACH TO PREDICT STRAIN SITUATIONS
IN HOSPITAL EMERGENCY DEPARTMENT SYSTEMS
S. BENBELKACEM1, F. KADRI2,3 S.CHAABANE2,3, B. ATMANI1

1
LIO Lab., University of Oran, Oran, Algeria sondes.chaabane@univ-valenciennes.fr
2
Univ. Lille Nord de France, F-59000 Lille, France atmani.baghdad@gmail.com
3
TEMPO Lab., PSI Team, F-59313 Valenciennes, France
sofia.benbelkacem@gmail.com
farid.kadri@univ-valenciennes.fr
ABSTRACT: Nowadays, emergency departments are confronted to exceptional events such as epidemics problems or
health crises. These situations increase the patient flow. The consequence of this influx of patients has resulted in
problems of ED overcrowding which often increases the length of stay of patients (LOS) in EDs and leads to strain
situations. To cope with such situations, ED managers must predict the LOS. In this paper, we propose a model for
predicting the patient length of stay (LOS) in ED using data mining techniques. The used data was collected from the
pediatric emergency department (PED) in Lille regional hospital centre, France. Our target is to illustrate with a real
world case study, how data mining can be benefit to predict LOS and which its limitations.
KEYWORDS: data mining, emergency department, classification, prediction, length of stay.
1 INTRODUCTION The remainder of this paper is organized as follows:

Section 2 describes the problem. Section 3 summarizes
Nowadays, with the growing demand for emergency the current data mining techniques and their applications
medical care, the management of hospital emergency in healthcare systems. Section 4 presents the approach
departments (EDs) has become increasingly important. used in this work applied to our study case. Section 5
The management of patient influx is one of the most discusses obtained results. Finally, section 6 reviews the
crucial problems in EDs throughout the world (Kadri et main conclusions in this work and exposes the perspec-
al., 2014). To deal with this influx of patients, emer- tives of the study.
gency departments require significant human and mate-
rial resources, as well as a high degree of coordination 2 PROBLEM DESCRIPTION
among human and material elements (Kadri et al.,
2014a; Kadri et al., 2013). Unfortunately, these resources Emergency departments are confronted to exceptional
are limited. The consequences of this influx of patients events due to seasonal epidemics (in winter: influenza,
have resulted in problems of ED overcrowding (Boyle et colds, gastroenteritis, bronchiolitis, etc., in summer:
al., 2012; Kadri et al., 2014b). This latter increases the trauma), health crises… These events can cause strain
length of stay of patients in EDs which lead to violence situations when disequilibrium between the care load
of angry patients against staff, patients leave the ED flow (demand) and the care production capacity (supply)
without being treated, reduced access to emergency is observed. The management of patient flow in such
medical services and increase in patient mortality situations is one of the most important problems man-
(Sprivulis et al., 2006). aged by ED managers. To handle this influx of patients,
emergency departments require significant human and
Thanks to information and computerized hospital, the material resources, but these are limited. To confront
data produced by emergency departments (EDs) is still these situations and face theses constraints, emergency
increasing. With the use of data mining techniques it is departments have no choice but to adapt.
possible to extract interesting and useful knowledge and
regularities which may be used as a tool for decision To help manager to resolve this problem (Farid Kadri et
making in such establishments, in order to respond to the al., 2014b) propose a decision support system. One of
needs of ED managers in their daily decision making the principle part is to Prevent and predict strain situa-
activities. The aim of this study is to see how can we use tions with a predictive model. They identify and develop
data mining techniques and particularly classification ten relevant indicators validated by professionals. One of
methods to develop models for prediction of patients’ the important indicators is the length of stay defined as
length of stay (LOS) at the emergency department. the time between when the patient completes their ad-
ministrative registration at the ED and their discharge Walczak and Scharf (2000) used artificial neural network
(departure) from the ED. In strain situations, LOS in- to predict the quantity of transfusion units that are re-
creases considerably and causes violence of angry pa- quired by surgical patients for a specific operation. The
tients. Thus, the estimation and prediction of the patient results show that artificial neural networks are capable of
length of stay (LOS) is important to detect the beginning reducing the overall quantity of blood units ordered for
of strain situations to help systems to react to such situa- operations. Thus, the artificial neural networks offer a
tions. It helps or managing patients and internal re- means to reduce patient costs while maintaining a high
sources in EDs, and predict sufficient hospitalization level of patient care.
capacity downstream.
Chae et al. (2003) used a decision tree to analyze the
Our purpose is to see how data mining models and tech- healthcare quality indicators in order to develop quality
niques can be used to predict LOS indicator and which improvement strategies. The authors identified the im-
limitations of such approaches. portant factors influencing the inpatient mortality: length
of stay, disease classes, discharge departments, and age
3 RELATED WORK groups. The results provided cumulative statistics that
show how the authors found the inpatient mortality.
Koh and Tan (2005) present a data mining as a relatively
recently developed methodology and technology, com- Kraft et al. (2003) are the first to use nursing diagnosis
ing into prominence only in 1994 (Trybula, 1997). It and neural networks to predict the length of stay of pa-
aims to identify valid, novel, potentially useful, and tients with spinal cord injuries. Four types of neural
understandable correlations and patterns in data by networks were developed for building predictive models
combing through copious data sets to sniff out patterns for length of stay: dynamic network, prune network,
that are too subtle or complex for humans to detect. In multilayer perceptron, and radial basis function network.
this paper, authors explored data mining applications The results show that the radial basis function network
within healthcare in diverse areas involving: evaluation model neural network was selected as the best model.
of treatment effectiveness, management of healthcare,
customer relationship management, and detection of Walsh et al. (2004) used artificial neural network ensem-
fraud. This review shows that data mining has been bles to predict the disposition and length of stay in chil-
largely applied in the hospital sector to provide useful dren presenting to the Emergency Department with
information to improve hospital establishment efficien- bronchiolitis. The results show that the neural network
cy. Milovic (2012) mentioned data mining applications ensembles correctly predicted disposition in 81% of test
for a variety of healthcare areas, such as doctors who use cases. The authors concluded that artificial neural net-
patterns by measuring clinical indicators, quality indica- work ensembles can predict disposition for infants and
tors, customer satisfaction and economic indicators, toddlers with bronchiolitis; however, the prediction of
performance of physicians from multiple perspectives to length of hospital stay is not as good.
optimize use of resources, cost efficiency and decision
making based on evidence, identifying high-risk patients To illustrate a data mining application in healthcare, Koh
and intervene proactively, optimize health care, etc. and Tan (2005) used the decision tree to find out how
certain variables are associated with the onset of diabe-
More recently, Esfandiari et al. (2014) presented a re- tes. The purpose of this healthcare data mining applica-
view study, including papers published between 1999 tion is to identify high-risk individuals. The results show
and 2013, in the context of medical data mining applica- that age is the most important factor associated with the
tions. Five data mining approaches were considered: onset of diabetics.
classification, regression, clustering, association and
hybrid. Liu et al. (2006) applied decision trees and naive Bayesi-
an classifiers to a geriatric hospital dataset in order to
In this section, the literature review is focused on the predict inpatient length of stay. Results show that naive
supervised classification approaches. It refers to super- Bayesian models performed better in comparison with
vised methods that determine target class value of un- the C4.5 algorithm of decision tree. So, applying naive
seen data. It is used to identify profiles of classes in Bayesian imputation models to handle a considerable
terms of their attributes. The most popular supervised amount of missing data can greatly increase the classifi-
classification algorithms in medical data mining based cation accuracy of predicting length of stay.
on the literature (Esfandiari et al. 2014) are : decision
tree (DT), multi-layer perception type of artificial neural De Toledo et al. (2009) used decision trees techniques to
network model (MLP), support vector machine (SVM) build a model of outcome prediction for subarachnoid
and naive bayes (NB). hemorrhage that makes the knowledge discovered from
the data explicit and communicable to domain experts.
The algorithms used in this paper were C4.5, fast deci-
sion tree learner, partial decision trees, nearest neighbor
with generalization, and ripple down rule learner. Com- 4 DATA MINING BASED PROCESS : A CASE
pared to other algorithms, the C4.5 obtained the best OF STUDY
results
Lille Regional Hospital Centre (CHR-Lille) serves four
Delen et al. (2009) built a classification model to analyze million inhabitants in Nord-Pas-de-Calais, a region char-
the healthcare coverage using artificial neural networks acterized by one of the largest population densities in
and decision trees. The developed model was used to France (7% of the French population). The paediatric
predict if an individual has healthcare coverage or not emergency department (PED) is open 24 hours a day and
based on the specific information’s about socio- receives 23 900 patients a year on average. Besides its
demographic and lifestyle, and the importance of the internal capacity, the PED shares many resources, such
various factors in the classification model. The results as administrative patient registration and clinical labora-
indicated that the most accurate classifier for this phe- tories, with other hospital departments. In this study the
nomenon was the multi-layer perception type of artificial data mining based approach is presented in the figure 1
neural network model. and applied to predict the patient’s length of stay at the
PED.
Aljumah et al. (2013) used data mining techniques to
discover patterns that identify the best mode of treatment
for diabetes across different age groups. In this study,
predictions on the effectiveness of different treatment
methods were elucidated using a support vector machine
algorithm. These results indicated that dietary treatment
is more effective for patients in the old age group than
patients in the young group.
Different applied methods and application fields are

summarized in table 1.
Method Reference Application field

Chae et al. Identify factors influencing
(2003) inpatient mortality
Find variables associated
Koh and
with onset of diabetes to
DT Tan (2005)
identify high-risk individuals
Build a model of outcome
De Toledo
prediction of subarachnoid
et al. (2009)
hemorrhage
Walczak Predict quantity of
and Scharf transfusion units required by
(2000) surgical patients
Predict length of stay of
Kraft et al.
ANN patients with spinal cord
(2003)
injuries
Predict length of stay in Figure 1: General architecture of the proposed
Walsh et al.
children presenting approach
(2004)
bronchiolitis to ED
Predict if an individual has 4.1 Data Collection
DT Delen et al.
healthcare coverage
ANN (2009)
The first step of the proposed approach is the collection
Aljumah et Predict the effectiveness of and the preparation of the data. This step is composed of
SVM al. (2013) treatments of diabetes several sub-steps: (1) data sources are located, accessed,
and selected. (2) Selected data is put into a tabular for-
Liu et al. Predict inpatient length of mat in which instances and variables take place in rows
NB
(2006) stay of geriatric patients and columns, respectively (Giudici, 2005).
Table 1: Classification algorithms applied in Our study case was conducted utilizing a dataset
healthcare systems extracted from the database of PED of CHR-Lille. As the
target is to predict the patient’s LOS in strain situations,
a winter period from January to March 2012 including • Class c1 (LOS < 289 minutes): represents the
6135 records is selected. short length of stay (LOS);
• Class c2 (289 < LOS < 432 minutes): represents
the average LOS;
4.2 Data set Description • Class c3 (LOS > 432 minutes): represents the
long LOS.
The aim of this step is the description of the dataset
identified in previous step. Let Ω = {ω1, ω2, ..., ωn} The last class is the most interesting for prediction par-
denote the set of objects taken into account for the train- ticularly for to detect strain situations in PED.
ing. Each object ωi is described by a set of variables X1,
X2,…, Xp called descriptive variables or exogenous vari- After the description, the data set is separated in two
ables. The value taken by a descriptive variable Xj is subsets:
called modality or value of the attribute Xj of an individ- • Training data (90% of records): is used to build
ual ωi. An attribute C, called class variable or endoge- a prediction model by applying a data mining
nous variable, is associated with every individual ωi. C methods;
takes its values in the set of classes C = {c1, c2,..., • Testing data (10% of records): is used to evalu-
cm}(Atmani and Beldjilali, 2007). ate the resulting model and to test the accuracy
of predictions.
Consider our case of study to predicting the LOS at the
PED. This problem can be described by 12 exogenous 4.3 Data Preprocessing
variables X1, X2, X3, X4, X5, X6, X7, X8, X9, X10, X11, X12
and detailed in table 2. Real-world data may be incomplete, noisy, and inconsis-
tent, which can disguise useful patterns. Thus, the pre-
Exogenous processing step is required to generate quality data and
Signification Description
variables improve the efficiency of data mining models (Zhang et
Patient arrival time at the al., 2003). First, anomalies are removed and duplicate
X1 Arrival time records are eliminated. Then, numeric attributes are
PED (hour of day)
X2 Arrival mean Arrival mean of patients discretized by cutting its range of values in a finite num-
Orientation of patient at ber of intervals. After that, a classifier subset evaluator
X3 Orientation the end of emergency (Hall, 1999) is used to select attributes. It evaluates the
management predictive ability of attributes preferring sets of attributes
that are highly correlated with the class. The fact that
X4 Age Age of patients many attributes depend on one another often unduly
X5 Sex Sex of patients influences the accuracy of prediction models (Kotsiantis,
Clinical classification of 2007). For our case of study, the open source tool Weka
X6 CCMU emergency patients (Witten and Frank, 2005) to do this step. Weka is a col-
called CCMU (1-6) lection of data mining algorithms and data pre-
If the patient had an processing methods for a wide range of tasks. The result
X7 Echography of this step is very important because we define the set
echography
If the patient had a of data that will be used in the remaining steps.
X8 Scanner
scanner
If the patient had a
X9 Radiology 4.4 Build classification Models
radiology
If the patient had a
X10 Biology In this step, in order to find and select the best models,
biology test
we applied diverse supervised classification
X11 Diagnostic Diagnostic of patients
techniques to the training data. As presented previously,
Multicentric Emergency the most popular classification algorithms of data mining
X12 GEMSA Department Study Group applied in healthcare based on the literature are used:
called GEMSA (1-6) • Naïve Bayes (NB) and BayesNet ;
• Support vector machine learner (SVM) ;
Table 2: Exogenous variables, signification and
description • Decision tree algorithms including: Id3, C4.5,
REPTree, BFTree and PART. Id3 and C4.5. are
the most well-know algorithms in the literature
Each individual is associated with a class C correspond- for building decision trees; REPTree is a fast
ing to the LOS using a descretized method in WEKA decision tree learner whereas PART is a rule
tool. A set of classes C = {c1, c2, c3} is obtained accord- induction algorithm and BFTree is best-first de-
ing bounds: cision tree learning.
5 RESULTS AND DISCUSSION
All these algorithms are available in Weka tool.
The valuation of models is carried out using 10-fold
4.5 Models Evaluation cross validation on the PED data set. It is based on the
partition of the original sample into ten subsamples,
In this step, the testing data is used in order to validate using nine as training data and one for testing data. The
the chosen model(s). The information discovered in the cross validation process is then repeated ten times with
modelling is evaluated to assess its utility and reliability. each of the ten samples, averaging the results from the
To obtain reliable results, the extracted knowledge ten folds to produce a single estimation (De Toledo et
should be evaluated by a comparison of results obtained al., 2009).
with various supervised classification methods and using
several measures (Dunham, 2006). Performance evalua- Table 3 shows the performance of the prediction models
tion of classifiers can be measured by hold-out, random obtained with each classifier in terms of accuracy, preci-
sub-sampling, cross-validation and bootstrap (Esfandiari sion, recall, kappa statistic, and ROC Area.
et al., 2014). The most common method is the cross
validation (Browne, 2000). Furthermore, performance
measures can be used to analyze predictive models. They
Accuracy
Precision
statistic
based on four values of the confusion table (Figure 2):
Kappa
Recall
ROC
Area
true positives (TP), false positives (FP), true negatives
(TN), false negatives (FN).
NB 78.463 0.76 0.785 0.351 0.829

BayesNet 78.286 0.763 0.783 0.360 0.83
SVM 79.942 0.725 0.799 0.237 0.609
Id3 72.291 0.758 0.784 0.3 0.691

Figure 2: Format of confusion table PART 78.576 0.742 0.786 0.298 0.778
C4.5 79.701 0.748 0.797 0.282 0.706
Five performance measures are used: REPTree 78.945 0.724 0.789 0.210 0.728
BFTree 79.170 0.739 0.792 0.245 0.735
• Accuracy: is a rate of correct classification de-
்௉ା்ே
fined by Table 3: Performance evaluation of LOS prediction
்௉ା்ேାி௉ାிே
models
• Precision: is a rate of true positive classification The maximum accuracy score equal to 79.942 is ob-
்௉ tained by SVM and the lowest one equal to 72.291 is
defined by
்௉ାி௉
obtained by Id3. The maximum kappa score belongs to
0.36 with BayesNet while the lowest kappa score be-
• Recall: is the evaluation and ranking of each
்௉ longs to 0.210 with REPTree. The highest precision
sample based on positive class defined by score again belongs to BayesNet (0.763). Furthermore,
்௉ାிே
we can see that the recall was slightly better with SVM.
• Kappa statistic: measures the agreement of pre- The maximum score of ROC Area belongs to 0.83 with
diction with the true class labels. A score value BayesNet and the lowest score is for SVM (0.609). Ac-
of 1.0 signifies complete agreement, and a value cording this result, BayesNet and SVM give the best
greater than 0 means that the classifier is doing results for the performance measures.
better than pure random behavior (Azari et al.,
2012). Let us define the Vindicator (i,j): is the value of the indi-
cator i ∈{Accuracy, Precision, recall, Kappa statistic,
• ROC: gives the trade-off between true positive ROC area} for the model j ∈{NB, BayesNet, SVM, Id3,
்௉
rate defined by (recall) and false positive PART, C4.5, REPtree, BFTree}. Figure 3 gives the com-
்௉ାிே
ி௉ parison of all models according the RateOfMax (i) and
rate defined by for a given model (Han et defined by:
ி௉ା்ே
al., 2011). ܸ݅݊݀݅ܿܽ‫ݎ݋ݐ‬ሺ݅, ݆ሻ
ܴܽ‫ݔܽܯ݂ܱ݁ݐ‬ሺ݅ሻ =
‫ݔܽܯ‬௝∈ௌ௘௧௢௙ெ௢ௗ௘௟௦ ሺܸ݅݊݀݅ܿܽ‫ݎ݋ݐ‬ሺ݅, ݆ሻሻ
If the evaluation results are not enough performing then
the process is resumed from the data pre processing step.
100%
plete this study and response to such question, it is im-
portant to involve professionals in the whole data mining
80% process.
60%
6 CONCLUSION
40%
In this paper, we presented an approach based on super-
20%
vised classification methods in order to predict the pa-
0% tient length of stay at the pediatric emergency depart-
Accuracy Kappa Precision Recall ROC ment in Lille regional centre, France. We described the
statistic Area necessary steps that should be taken to preprocess and
prepare the data set. After that, we tested and compared
NB BayesNet SVM Id3 the performance of experiment results with different
classification methods. The results indicated that
PART C4.5 REPTree BFTree BayesNet and NB give the best results. However deci-
sion tree methods obtain comparable results and can
Figure 3: Rate of maximum successfully be suitable for the prediction of patient’s
length of stay. This latter, may help EDs managers in
their daily decision making activities.
The results show that BayesNet and NB give the best
result according all performance measures. However DT This study illustrates the benefit of classification meth-
models are for the most, more than 80% closer to the ods to make PED management system more proactive
maximum. The conclusion is that decision trees give face to strain situations. However some limitations must
comparable results compared to NB and BayesNet. In be taken into account. Some perspectives can be consid-
this case, we preferred decision trees because they are ered. First, additional data set must be considered and
reputed to be several orders of magnitude faster than new experiments must be conducted. Second, other data
neural networks and support vector machine. mining methods must be tested with a larger dataset in
order to extract knowledge’s and to build models that
As shown by results, this study can be benefit to PED may help PED managers in decision making in the man-
manager to predict LOS and detect the beginning of agement of PED activities. Finally, professionals must
strain situation. This information can be used to make be involved in the process to give the knowledge and
the decision more proactive. However as presented in information about the domain.
Koh and Tan (2005), this study presents some limita-
tions. REFERENCES
First, this study can be limited by the accessibility of Aljumah, A.A., Ahamad, M.G., Siddiqui, M.K., 2013.
data because of heterogeneity of source of data (admin- Application of data mining: Diabetes health
istration, laboratory, PED …). Hence, data must be col- care in young and old patients. J. King Saud
lected and integrated before used. One of solution pro- Univ. - Comput. Inf. Sci. 25, 127–136.
posed in the literature is the use of data warehouse. But it Atmani, B., Beldjilali, B., 2007. Neuro-IG: A Hybrid
is a costly and time consuming project. System for Selection and Elimination of Predic-
tor Variables and Non Relevant Individuals. In-
Secondly, data can be missing, or non-standardized such formatica 18, 163–186.
as pieces of information recorded in different format. For Azari, A., Janeja, V.P., Mohseni, A., 2012. Predicting
example, the variable X3 (Orientation) we can find two Hospital Length of Stay (PHLOS): A Multi-
different records “ RETURN HOME with external con- tiered Data Mining Approach, in: IEEE 12th In-
sultation next 7 days” and “ RETURN HOME with ex- ternational Conference on Data Mining
ternal consultation next 8 days”. This item can be stand- Workshops (ICDMW). pp. 17–24.
ardized as “RETURN HOME” and have another column Boyle, A., Beniuk, K., Higginson, I., Atkinson, P., 2012.
specifying the delay for external consultation. Emergency Department Crowding: Time for In-
terventions and Policy Evaluations. Emerg.
Med. Int. 2012.
Thirdly, “the successful application of data mining re- Browne, M.W., 2000. Cross-validation Methods. J Math
quires the knowledge of the domain area as well as in Psychol 44, 108–132.
data mining methodology and tools” (Koh and Tan Chae, Y.M., Kim, H.S., Tark, K.C., Park, H.J., Ho, S.H.,
2005). This is true in our case, for example the definition 2003. Analysis of healthcare quality indicator
of endogenous variables (classes) C is obtained by using data mining and decision support system.
WEKA tool. The question is how can we trust this re- Expert Syst. Appl. 24, 167–172.
sult? Did professionals have the same bounds? To com-
De Toledo, P., Rios, P.M., Ledezma, A., Sanchis, A., injury population, in: Proceedings of the 36th
Alen, J.F., Lagares, A., 2009. Predicting the Annual Hawaii International Conference on
Outcome of Patients With Subarachnoid He- System Sciences. Presented at the Proceedings
morrhage Using Machine Learning Techniques. of the 36th Annual Hawaii International Confe-
IEEE Trans. Inf. Technol. Biomed. 13, 794– rence on System Sciences.
801. Liu, P., Lei, L., Yin, J., Zhang, W., Naijun, W., El-Darzi,
Delen, D., Fuller, C., McCann, C., Ray, D., 2009. Analy- E., 2006. Healthcare Data Mining: Prediction
sis of Healthcare Coverage: A Data Mining Ap- Inpatient Length of Stay, in: 2006 3rd Interna-
proach. Expert Syst Appl 36, 995–1003. tional IEEE Conference on Intelligent Systems.
Dunham, M.H., 2006. Data Mining: Introductory And Presented at the 2006 3rd International IEEE
Advanced Topics. Pearson Education. Conference on Intelligent Systems, pp. 832–
Esfandiari, N., Babavalian, M.R., Moghadam, A.-M.E., 837.
Tabar, V.K., 2014. Knowledge discovery in Milovic, B., 2012. Prediction and decision making in
medicine: Current issue and future trend. Expert Health Care using Data Mining. Int. J. Public
Syst. Appl. 41, 4434–4463. Health Sci. IJPHS 1.
Giudici, P., 2005. Applied Data Mining: Statistical Me- Sprivulis, P.C., Da Silva, J.-A., Jacobs, I.G., Frazer,
thods for Business and Industry. John Wiley & A.R.L., Jelinek, G.A., 2006. The association
Sons. between hospital overcrowding and mortality
Hall, M.A., 1999. Correlation-based Feature Selection among patients admitted via Western Australian
for Machine Learning. emergency departments. Med. J. Aust. 184,
Han, J., Kamber, M., Pei, J., 2011. Data Mining: Con- 208–212.
cepts and Techniques: Concepts and Tech- Trybula, W.., 1997. Data Mining and Knowledge Disco-
niques. Elsevier. very. Annu. Rev. Inf. Sci. Technol. 32, 197–
Kadri, F., Chaabane, S., Harrou, F., Tahon, C., 2014. 229.
Modélisation et prévision des flux quotidiens Walczak, S., Scharf, J.E., 2000. Reducing surgical pa-
des patients aux urgences hospitalières en utili- tient costs through use of an artificial neural
sant l’analyse de séries chronologiques, in: network to predict transfusion requirements.
7ème Conférence de Gestion et Ingénierie Des Decis. Support Syst. 30, 125–138.
Systèmes Hospitaliers (GISEH), pp. 8. Walsh, P., Cunningham, P., Rothenberg, S.J.,
Kadri, F., Chaabane, S., Harrou, F., Tahon, C., 2014a. O’Doherty, S., Hoey, H., Healy, R., 2004. An
Time series modelling and forecasting of emer- artificial neural network ensemble to predict
gency department overcrowding. J. Med. Syst., disposition and length of stay in children pre-
38(9):107. doi: 10.1007/s10916-014-0107-0 senting with bronchiolitis. Eur. J. Emerg. Med.
Kadri, F., Chaabane, S., Tahon, C., 2014b. A simulation- Off. J. Eur. Soc. Emerg. Med. 11, 259–264.
based decision support system to prevent and Witten, I.H., Frank, E., 2005. Data Mining: Practical
predict strain situations in emergency depart- Machine Learning Tools and Techniques, Se-
ment systems. Simul. Model. Pract. Theory 42, cond Edition. Morgan Kaufmann.
32–52. Zhang, S., Zhang, C., Yang, Q., 2003. Data preparation
Kadri, F., Pach, C., Chaabane, S., Berger, T., Trente- for data mining. Appl. Artif. Intell. 17, 375–
saux, D., Tahon, C., Sallez, Y., 2013. Modelling 381.
and management of the strain situations in hos-
pital systems using un ORCA approach, IEEE
IESM, 28-30 October. RABAT - MOROCCO,
p. 10.
Koh, H.C., Tan, G., 2005. Data mining applications in
healthcare. J. Healthc. Inf. Manag. JHIM 19,
64–72.
Kotsiantis, S.B., 2007. Supervised Machine Learning: A
Review of Classification Techniques, in: Pro-
ceedings of the Conference on Emerging Artifi-
cial Intelligence Applications in Computer En-
gineering: Real Word AI Systems with Applica-
tions in eHealth, HCI, Information Retrieval
and Pervasive Technologies. IOS Press, Ams-
terdam, The Netherlands, The Netherlands, pp.
3–24.
Kraft, M.R., Desouza, K., Androwich, I., 2003. Data
mining in healthcare information systems: case
study of a veterans’ administration spinal cord

A Data Mining-Based Approach To Predict Strain Situations in Hospital Emergency Department Systems

Caricato da

Informazioni sul documento

Titolo originale

Copyright

Formati disponibili

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Copyright:

Formati disponibili

A Data Mining-Based Approach To Predict Strain Situations in Hospital Emergency Department Systems

Caricato da

Copyright:

Formati disponibili

A DATA MINING-BASED APPROACH TO PREDICT

STRAIN SITUATIONS IN HOSPITAL EMERGENCY

To cite this version:

HAL Id: hal-01081556

HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est

S. BENBELKACEM1, F. KADRI2,3 S.CHAABANE2,3, B. ATMANI1

KEYWORDS: data mining, emergency department, classification, prediction, length of stay.

1 INTRODUCTION The remainder of this paper is organized as follows:

Different applied methods and application fields are

Method Reference Application field

NB 78.463 0.76 0.785 0.351 0.829

SVM 79.942 0.725 0.799 0.237 0.609

Id3 72.291 0.758 0.784 0.3 0.691

Potrebbero piacerti anche