Sei sulla pagina 1di 27

Data Mining in Disease Management

(Diabetic patients)
By:
Catherine M. Catamora
097699
What is diabetes?
• a metabolism disorder - the way the body
uses digested food for growth and
energy.

• a chronic disease that occurs either when
the pancreas does not produce enough
insulin or when the body cannot
effectively use the insulin it produces.
Key facts
• According to World Health Organization:
• More than 220 million people
worldwide have diabetes.
• In 2004, an estimated 3.4 million
people died from consequences of
high blood sugar.
• More than 80% of diabetes deaths
occur in low- and middle-income
countries.
• WHO projects that diabetes death
will double between 2005 and
2030.

In the Philippines:
• According to Pediatric endocrinologist
Sioksoan Chan-Cua
• “The numbers of diabetic patients
are still low compared with other
countries”.
• This study aims to use data mining
techniques in learning more about the
instances of diabetic patients under
Bernardo’s clinic in Bulacan.

• The study although focuses on a small


sample of patients, tries to come up with
factors and combination of factors that
results to being diabetic.


Related Studies
• Hyperglycemia and type 2 diabetes among
Filipino women in the Philippines,
Hawaii, and San Diego by the following
authors:
• Maria Rosario G. Araneta,a* Deborah
J. Morton,a Lina Lantion-Ang,b
Andrew Grandinetti,c Mary Anne Lim-
Abrahan,b Healani Chang,c Elizabeth
Barrett-Connor,a Beatrice L.
Rodriguez,d and Deborah L.
Wingarda

Methodology
• Data Gathering
• Based from the patient records of
Bernardo’s Clinic
• Data was inputted in a Excel file, later
converted into a csv file for WEKA
processing.
• WEKA
• Data was processed using Naïve Bayes
classifier
• Classification Technique
• PRISM algorithm was applied
Raw Data Set
Processing applied
• Attributes used
• Age
• Gender
• BMI
• Blood pressure
• Cholesterol level

• Data set was divided into 2 groups


• Male
• Female

Blood
Pressure
Chart
Cholesterol Classification

Diabetic Age Classification


Male Weight Classification

Female Weight Classification


Results and Analysis
• WEKA (Naïve Bayes applied) findings
• Accuracy - 78.333%

Using WEKA ~ NaïveBayes
classifier
Applying Prism Algorithm
• Dataset were divided into two (gender
groups)
Rules generated under the
Female Group:
• First rule: regardless of age, cholesterol
level and body mass index: if a patient
has a blood pressure equivalent to
Moderate Stage, then the patient has
diabetes.

• New Rule: If Blood Pressure = Hypertension High
BP and ?, Then Diagnosis = diabetic

New RULE: if
Blood Pressure = Hypertension High BP
Age Class = Elderly
Cholesterol = High or Cholesterol = Very High
BMI = Overweight or BMI = Obese
Then
Diagnosis = Diabetic
• New Rule: if Cholesterol = Very High and ?, Then
Diagnosis = diabetic

New RULE: if
Cholesterol = Very High
Age Class = Elderly
Blood Pressure = Normal, High Normal, Hypertension High and Moderate Stage
BMI = Normal, Overweight or Obese
then
Diagnosis = Diabetic
• New Rule: if BMI = Overweight and ?, Then
Diagnosis = diabetic

New RULE: if
BMI = Overweight
Age Class = Elderly
Blood Pressure = High Normal, Hypertension High and Moderate Stage
Cholesterol = Borderline High, Very High and Very High
then
Diagnosis = Diabetic
Rules generated under the
Male Group:
• First Rule: Regardless of age, cholesterol
level and body mass index: if a patient
has a blood pressure equivalent to
Moderate Stage and BMI is equal to
Obese then the patient has diabetes.

• New Rule: if Cholesterol = High and ?, Then
Diagnosis = diabetic

New RULE: if
Cholesterol = High
Age Class = Elderly
BMI = Overweight
Blood Pressure = Normal, High Normal or Hypertension BP
then
Diagnosis = Diabetic
• New Rule: if Blood Pressure = Hypertension High BP
and ?, Then Diagnosis = diabetic

New RULE: if
Blood Pressure = Hypertension BP
Age Class = Elderly
Cholesterol = Borderline High, High and Very High
BMI = Overweight or Obese
then
Diagnosis = Diabetic
• New RULE: if Weight = Overweight and ?, Then
Diagnosis = diabetic

New RULE: if
BMI = Overweight
Age Class = Elderly
Blood Pressure = High Normal or Hypertension BP
Cholesterol = Borderline High, High and Very High
then
Diagnosis = Diabetic
Conclusion
• Data mining technologies can have
important utility in diabetes mellitus and
other disease management.
• Fasting blood glucose test is the preferred
test for diagnosing diabetes.
• Diagnosis of diabetes can be made based
on any of the following algorithm results.
• Data mining can be used to manipulate
data especially if it’s in electronic form..

• Data mining approaches can be applied
successfully in data analysis, and
monitoring of diabetic and non-diabetic
patients.
• Diabetes, once diagnosed, a lifetime
treatment is necessary, but with the help
of data mining and scientific researches,
physicians can be equipped with a tool
to prevent the worst case scenarios.
Further Studies
• It is suggested by the researcher that, data
mining techniques for clustering and
association could also be done to
further analyze and study such dataset.
• Taking sample data based from a daily
monitoring or any regular monitoring
scheme of patients’ blood pressure,
cholesterol level and BMI to accurately
produce a good dataset.

• Diabetes management in rural areas such
as Bulacan has a promising research
and application area. Further studies
can also be made to similar locations in
other provinces where small clinics
attend to a huge number of patients on
a regular basis.

Potrebbero piacerti anche