Sei sulla pagina 1di 6

YASEEN TOWETT MELLY

SCT221-C004-0323/2015

ASSIGNMENT

DATA MINING

1)establish the reasons for interest in data mining, how the tools are being applied and
whether the promise of data mining has been realized

A number of factors are driving the use of data mining and predictive analytics. More general
trends include that organizations have a great deal of data to analyze so much data, in fact, that
the term “big data” is now in vogue. Naturally, companies want to capitalize on this valuable
resource by utilizing predictive analytics to gain additional insights they can apply to optimize
their BI, marketing, and various performance management practices. A lot of this data is
unstructured; in particular, Web, contact center, surveys, maintenance logs, sensors, and
consumer social media sites are all contributing to the exploding amounts of unstructured data
that almost every organization in every industry is generating.

Price-performance constraints for big data processing have also declined. Thus, it’s no longer so
prohibitive to implement predictive analytics applications. Trends that are more specific to
predictive analytics include the continual improvement of tools and the increased options
organizations now have for adopting the technology — in particular cloud-based platforms and
software as a service (SaaS) offerings (more on this in a moment).

Data-mining functionality has also advanced to the point where it is increasingly found
embedded in other applications. This has helped to make the technology friendlier, alleviating
some of the need for users to posses an in-depth knowledge of the technology. For example, we
are increasingly seeing predictive analytics functionality “exposed” to the general user via an
intuitive GUI based on the familiar spreadsheet interface (as opposed to the complex interfaces
associated with yesterday’s data-mining workbenches). Such friendly interfaces, when coupled
with built-in workflows, provide the necessary “hand holding” for the less-technical business
user who might otherwise be leery of attempting to utilize the powerful capabilities offered by
predictive analytics.

A more favorable attitude toward text mining and analysis has also emerged, and organizations
now seem to find the technology more approachable. Our research shows that organizations
are no longer content with using text analysis/mining tools in a standalone manner; they now
have advanced their data integration capabilities to the extent that they are blending
unstructured data into their data warehouses to support their corporate data analysis efforts.

The ability to manage and analyze unstructured data is important if organizations want to
integrate data from social media into their BI and data warehousing systems. Today, this remains
mostly in the realm of the major Internet players, such as Google, Yahoo!, and Facebook, etc. as
opposed to more traditional enterprises.

How the tools are being applied

Data Mining is primarily used today by companies with a strong consumer focus — retail,
financial, communication, and marketing organizations, to “drill down” into their transactional
data and determine pricing, customer preferences and product positioning, impact on sales,
customer satisfaction and corporate profits. With data mining, a retailer can use point-of-sale
records of customer purchases to develop products and promotions to appeal to specific
customer segments.

Future Healthcare

Data mining holds great potential to improve health systems. It uses data and analytics to
identify best practices that improve care and reduce costs.

Market Basket Analysis

Market basket analysis is a modelling technique based upon a theory that if you buy a certain
group of items you are more likely to buy another group of items.

Education

There is a new emerging field, called Educational Data Mining, concerns with developing
methods that discover knowledge from data originating from educational Environments.

Manufacturing Engineering

Knowledge is the best asset a manufacturing enterprise would possess. Data mining tools can
be very useful to discover patterns in complex manufacturing process. Data mining can be used
in system-level designing to extract the relationships between product architecture, product
portfolio, and customer needs data.

CRM
Customer Relationship Management is all about acquiring and retaining customers, also
improving customers’ loyalty and implementing customer focused strategies. To maintain a
proper relationship with a customer a business need to collect data and analyse the
information.

Fraud Detection

Billions of dollars have been lost to the action of frauds. Traditional methods of fraud detection
are time consuming and complex. Data mining aids in providing meaningful patterns and
turning data into information.

Has the promise of data mining has been realized

Data mining is an emerging field gaining acceptance in research and industry. This is evidenced
by an increasing number of research publications, conferences, journals and industry initiatives
focused in this field in the recent past. Data mining aims to solve an intricate problem faced by a
number of application domains today with the deluge of data that exists and is continually
collected, typically, in large

electronic databases. That is, to extract useful, meaningful knowledge from these vast data sets.
Human analytical capabilities are limited, especially in its ability to analyse large and complex
data sets. Data mining provides a number of tools and techniques that enables analysis of such
data sets.3

Data mining by definition is exploratory in nature – that is, we are in search for previously

unknown, hidden and interesting patterns in data. The fact that we are in search for unknown,
hidden knowledge makes the outcome of data mining aplication difficult to predict at the onset of
a DM project making it a risky and an uncertain endeavour. “Does interesting, relevant
knowledge exist?“, “What types of knowledge are we looking for?“, “What method should we
consider in order to find what we are looking for?“, “How do we know whether we haven’t
missed any interesting ’knowledge’ in the data set?“ are some of the fundamental questions that
pertain to data mining application. At present, these

questions are answered based on the judgement of the DM team. To assist in these judgements,
the iterative nature of KDDM process allows the DM team to try out certain data mining tasks, if
failed, to re-tract and repeat until satisfactory results are achieved (or in 3the worst case,
resources are exhausted). This approach makes contemporary data mining application, a risky
endeavour and typically follows a trial-and-error process.
2)look for a case study of an organisation such as jkuat, bank dealing with data
mining
INTRODUCTION

The development of information of knowledge of knowledge technology has generated great


deal of
databases and large data in numerous areas. The analysis inknowledge bases and data technology
has given rise to
associate approach to store and manipulate this precious data for more deciding. methoding
could be a process of
extraction of helpful data and patterns from vast knowledge. It’s additionally referred to as data
discovery
method, data mining from knowledge, data extraction or knowledge /pattern analysis.Data
Mining is one amongst the foremost important and motivating space of analysis with the target of
finding significant data from vast knowledge sets. In gift era, data processing is changing into
standard in aid field as a result of there's a necessity of economical analytical methodology for
sleuthing unknown and valuable data in health knowledge. In health business, data processing
provides many advantages like detection of the fraud in insurance, convenience of medical
answer to the patients at lower cost, detection of causes of diseases and identification of medical
treatment ways. It additionally helps the aid researchers for creating economical aid policies,
constructing drug recommendation systems, developing health profiles of people etc. [1]. The
info generated by the health organizations is incredibly huge and complicated thanks to that it's
tough to investigate the info so as to form necessary call concerning patient health.

This knowledge contains details concerning hospitals, patients, medical claims, treatment price
etc. So, there's a
necessity to get a strong tool for analyzing and extracting necessary data from this complicated
knowledge. The
analysis of health knowledge improves the aid by enhancing the performance of patient
management tas
Kansas. the end result of knowledge Mining technologies area unit to produce advantages to aid
organization for
grouping the patients having similar kind of diseases or health problems so aid organization
provides them effective
treatments. It may also helpful for predicting the length of keep of patients in hospital, for
diagnosis and creating set
up for effective data system management. Recent technologies area unit employed in medical
field to
reinforce the medical services in price effective manner.
data processing techniques {are also area unitare accustomed analyze the varied factors that are
accountable
for diseases for instance kind of food, totally different operating surroundings, education level,
living conditions,
convenience of pure water, health care services, cultural, environmen tal and agricultural factors
as shown below
Data mining provides many advantages to aid business.Data processing helps the aid researchers
to form valuable
call. Following area unit the many applications of knowledge Mining in healthcare:

Effective management of Hospital resource: data processing provides support for constructing a
model for
managing the hospital resources that is a vital task in aid.Victimization data processing, it's
potential to notice the
chronic illness and supported the complication of the patient illness rank the patients so they'll
get effective
treatment in timely and correct manner. Fitness report and demographic details of patients is
additionally helpful forutilizing the on the market hospital resources effectively.
Hospital Ranking: totally different data processing approaches area unit accustomed analyze the
variedhospital details so as to see their ranks. Ranking of the hospitals area unit done on the idea
of their capability to
handle the high risk patients. The hospital with seniority handles the high risk patient on its high
priority whereas
the hospital with low status doesn't contemplate the danger issue.
Better client Relation: data processing helps the aid institute to grasp the wants, preferences,
behavior, patterns
and quality of their client so as to form higher relation with them. victimization data processing,
client Potential Management business firm. develops associate index represent the employment
of client aid. This index helps to
notice the influence of client towards explicit aid service.
Hospital Infection Control: A system for examination is made victimization data processing
techniques to get unknown or irregular patterns within the infection management knowledge
[93]. Association rules area unit
accustomed turn out sudden and attention-grabbing data from the general public police work and
hospital
management knowledge. to manage the infection within the hospitals, this data is reviewed more
by associate skilled.
Smarter Treatment Techniques: victimization data processing, physicians and patients will
simply compare
among totally different treatments technique. they'll analyze the effectiveness of obtainable
treatments and
determine that technique is best and price effective.
Improved Patient care: great deal of knowledge is collected with the advancement in electronic
health record. Patient knowledge that is offered in digitized type improve
the aid system quality.
Decrease Insurance Fraud: aid nondepository financial institution develops a model to notice
the fraud and abuse
within the medical claims victimization data processing techniques. This model is useful for
characteristic the
improper prescriptions, irregular or pretend patterns in medical claims created by physicians,
patients, hospitalsetc.
DATA MINING CHALLENGES IN HEALTHCARE
One of the foremost vital challenges of {the knowledge the info the information} mining in aid is
to get the
standard and relevant medical data. it's tough to accumulate the precise and complete aid
knowledge. Health knowledge is complicated and heterogeneous in nature as a result of it's
collected from numerous sources like from the medicalreports of laboratory, from the discussion
with patient or from the review of medico. For aid supplier, it's essential to keep up the standard
{of knowledge information} as a result of this data is beneficial to produce price effective aid
treatments to the patients. Health Care finance
Administration maintains the minimum knowledge set (MDS) that is recorded by all hospitals. In
MDS there area
unit three hundred queries that area unit answered by the patients at arrival time. However this
method is complicated and patients face downside to retort the whole queries. Thanks to this
MDS face some difficulties like
missing data and incorrect entries. While not quality knowledge there are no helpful results. For
fortunate data
processing, complication in medical knowledge is one the numerous hurdle for analyzing
medical knowledge. So, it's essential to keep up the standard and accuracy knowledge for data
processing to creating effective call. Another
problem with aid knowledge is knowledge sharing. Aid organizations area unit unwilling to share
their knowledge
thanks to privacy concern. Most of the patients don't need to disclose their health knowledge. So,
the Health Maintenance Organization and insurance Organization aren't distributing their
knowledge for protective the
privacy of patient.

Potrebbero piacerti anche