HTTPS://SITES.GOOGLE.COM/SITE/JOURNALOFCOMPUTING/
WWW.JOURNALOFCOMPUTING.ORG 34
Abstract— The rapid expansion and adoption of Information and Communication Technologies (ICTs) open new opportunities for growth and development. Data Mining is a well-established approach to discovering knowledge from databases for the purpose of Knowledge Management. A large amount of data and information is generated and collected by the different levels of government, and proper decision making is important for better utilization of all resources. Data Mining can help administrators extract valuable knowledge and practices from this voluminous data, which can be used to strategically reduce costs, increase opportunities for organizational expansion, and detect fraud, waste and abuse. The present investigation uses data on primary education to analyze the status of primary education in Allahabad and in Uttar Pradesh, India. Clustering and Classification methods are used to find similarities and dissimilarities among the districts of Uttar Pradesh. Clustering groups the districts so that the districts in one cluster may be treated together under one policy. The Classification method is based on the reported Gross Enrollment Ratio (GER). Some unusual classifications of districts under this method suggest that Data Mining could also establish the impact of migration from one district to another, once all students are given a unique identification such as a social security number.
Index Terms—Information and Communication Technologies, Knowledge Management, Data Mining, Clustering, Classification
—————————— ——————————
1 INTRODUCTION
Data Mining is a process of Knowledge Discovery that includes methods used to recognize, generate, represent and distribute knowledge for better utilisation of any system. A large amount of data and information is generated and collected by the different levels of government, and proper decision making is important for better utilization of all resources. Data Mining can help administrators extract valuable knowledge and practices from this voluminous data, which can be used to strategically reduce costs, increase opportunities for organizational expansion, and detect fraud, waste and exploitation.

The research work aims to present the potential of data mining in the context of smart techniques for E Governance. Data Mining provides efficient techniques for government agencies to analyze data quickly and at lower cost [1]. The data extraction process reveals interesting hidden patterns. The discovered patterns enable government systems to make better decisions and to plan more effectively for serving citizens [8]. Here we present an E Governance Model based on Data Mining and Data Warehousing to facilitate:
(i) Efficient methods for capturing, storing and handling government data collected from various resources over a period of time.
(ii) Efficient Knowledge Management for improved internal processes, government policies and programs on the basis of historical data stored in its databases.

The present work proposes an E Governance model framework based on Data Mining and Data Warehousing techniques which may be used efficiently by the government at all its administrative levels (National/State/District/Block). The proposed Model serves all possible aspects of E Governance with the help of four basic building blocks:
Administrative Block
Technical Know How Block
Service Block
Stakeholder Block

2 RELATED WORK
There is an extensive range of Data Warehousing and Data Mining applications in the government’s regulatory, developmental and social welfare organizations. The following are some examples reported in the literature.
————————————————
Ms. Sonali Agarwal is with the Indian Institute of Information Technology, Allahabad, U.P., India.
Prof. G.N. Pandey is with the Indian Institute of Information Technology, Allahabad, U.P., India.
JOURNAL OF COMPUTING, VOLUME 2, ISSUE 10, OCTOBER 2010, ISSN 2151-9617

The Total Information Awareness (TIA) project was launched by the US government after the terrorist attacks of 9/11. The objective of TIA was to search large volumes of data and determine associations and patterns related to terrorist activities. The project sought to discover associations among transactions such as work permits, credit cards, airline tickets, passports, visas, rental cars, gun purchases and driver’s licenses, and events such as arrests or suspicious activities [17][15].

[...]ation rules may establish the similarities and differences between customers’ behaviors [6].
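
The kind of association discovery described above for TIA can be illustrated with the two standard measures of an association rule, support and confidence. The following Python sketch is illustrative only: the transaction records, the item names and the helper functions `support` and `confidence` are hypothetical and are not taken from TIA or from any system cited here.

```python
from typing import FrozenSet, List

# Hypothetical records: each set lists the transaction types seen for one person.
transactions: List[FrozenSet[str]] = [
    frozenset({"airline_ticket", "rental_car", "visa"}),
    frozenset({"airline_ticket", "rental_car"}),
    frozenset({"airline_ticket", "passport"}),
    frozenset({"rental_car", "work_permit"}),
]


def support(itemset: FrozenSet[str], records: List[FrozenSet[str]]) -> float:
    """Fraction of records that contain every item in `itemset`."""
    if not records:
        return 0.0
    hits = sum(1 for record in records if itemset <= record)
    return hits / len(records)


def confidence(
    antecedent: FrozenSet[str],
    consequent: FrozenSet[str],
    records: List[FrozenSet[str]],
) -> float:
    """Estimate of P(consequent | antecedent): support(A and C) / support(A)."""
    base = support(antecedent, records)
    if base == 0.0:
        return 0.0
    return support(antecedent | consequent, records) / base


# Candidate rule (hypothetical): airline_ticket -> rental_car
rule_from = frozenset({"airline_ticket"})
rule_to = frozenset({"rental_car"})
rule_support = support(rule_from | rule_to, transactions)
rule_confidence = confidence(rule_from, rule_to, transactions)
```

In this toy data the rule holds in two of the four records, and in two of the three records that contain an airline ticket. A rule is usually kept only when both measures exceed analyst-chosen minimums, which are not specified in the sources cited here.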
CAPPS, the Computer Assisted Passenger Pre-Screening System, is a prescreening system initiated by the US Department of Homeland Security. It is implemented to check all airline passengers against a database of commercially available information, after which it assigns a risk color or status to each passenger. CAPPS collects information provided by the passenger, for example the passenger’s name, permanent address and contact number. These records are then given to commercial data providers to assess the validity of the passenger’s identity and the passenger’s correlation with other events. The commercial data provider returns a numerical score to the owning system, indicating a particular risk level. Passengers with a “green” score are considered normal and safe. Passengers with a “yellow” score have to undergo a second level of screening. Passengers with a “red” score are considered high risk; they may not be allowed to travel and must be questioned further about their identity and purpose of travel [9].

3 BASIC BUILDING BLOCK OF PROPOSED E GOVERNANCE MODEL
The proposed E Governance model covers all important aspects of E Governance in a single model. There are four Basic Building Blocks of the proposed E Governance Model. The lowest block is the Administration Block, which regulates the overall functioning of a country through efficient government.
The proposed model is based on ICT, which may reform organizational structures in either a centralized or a decentralized manner. Each of these approaches to E Government has its own set of advantages and disadvantages.

Centralized Model
Centralized government initiatives are favorable as portals and services because they reduce cost and integration issues. Centralized government initiatives may share technical, financial and human resources. A single access portal is very useful for the end user because all the information is available centrally. The Centralized E Governance model has the following features.
All government processes based on ICTs are centralized in one organizational unit.
Infrastructure and set-up costs are generally limited, but the model is less effective.

Fig.2 : Horizontal and Vertical interconnection for E Governance

Certain important decisions are jointly made and then standardized across the various levels.
Responsibilities as well as capabilities are decentralized at different government departments/levels, with infrastructure and output sharing across the State as a system.
E Governance set-up costs are generally high, but the model is more responsive to stakeholder needs. Higher level committees are formed to manage various Government activities. These committees have authority to control the functioning of a large area.

Intra-department or horizontal and vertical collaborations are essential for the success of any E Governance project. They are necessary to perform governance functions, share information and deliver services to all stakeholders. These collaborations depend on issues such as which types of intra-department collaboration exist in E Governance and why intra-department collaborations are important [4].

3.2 Module 2: Technical Know How
For E Governance, many applications need to be automated. Various departments seek computerization and other technological transformation of their working strategies. It is now necessary to conceptualize the whole approach and develop a standard framework and protocols for the regulation of all E Governance activities. The proposed Model uses Data Mining and Data Warehousing to improve the service performance of the E Governance system.

3.4 Module 4: Stakeholder Block
A stakeholder is an individual, a group of persons or a community having a common area of interest and commonly affected by a system. E Governance has a wide range of stakeholders. The main groups are identified in 3 parts.
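
The categorization of districts by Gross Enrollment Ratio (GER) with a decision tree, named in the caption of Fig. 7, can be illustrated with a small hand-written tree. GER is the total enrollment at a level of education, regardless of age, expressed as a percentage of the population in the official age group for that level. The Python sketch below rests on assumptions: the district figures, the split points (80 and 100) and the category names are hypothetical and are not the values or the tree reported in this paper.

```python
from dataclasses import dataclass


@dataclass
class District:
    name: str
    enrolled: int        # pupils enrolled in primary classes, any age
    age_group_pop: int   # children in the official primary age group


def gross_enrollment_ratio(d: District) -> float:
    """GER in percent: enrollment regardless of age / official age-group population."""
    if d.age_group_pop <= 0:
        raise ValueError("age-group population must be positive")
    return 100.0 * d.enrolled / d.age_group_pop


def categorize(ger: float) -> str:
    """Two-level decision tree with hypothetical split points."""
    if ger < 80.0:
        return "low"
    if ger <= 100.0:
        return "adequate"
    return "over 100 (review)"


# Hypothetical districts, not data from the study.
districts = [
    District("District A", enrolled=72_000, age_group_pop=100_000),
    District("District B", enrolled=95_000, age_group_pop=100_000),
    District("District C", enrolled=118_000, age_group_pop=100_000),
]

categories = {d.name: categorize(gross_enrollment_ratio(d)) for d in districts}
```

A GER above 100 is arithmetically possible because pupils outside the official age group are counted in the numerator. The sketch only flags such districts for a closer look; it does not attribute the excess to any particular cause.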
Fig.7: Categorization of district according to Gross Enrollment Ratio by using Decision Tree

5 CONCLUSION
The Indian scenario is now changing toward an efficient, accountable and transparent society. It is essential that all government functions use ICTs to provide better interfaces or interactions for the public at the state and central levels. This means that appropriate software has to be developed which incorporates common practices related to government functions. Data Warehousing and Data Mining have been established as an excellent option for speeding up reporting and integrating data from the various departments of any government.

The use of Data Mining in government departments presents several potential advantages for better administration, including timely access to data for evaluation. Departments may quickly identify troublesome trends in their functions and evaluate why they are occurring. The various departments may associate this information with trends in their future policies.

The use of Knowledge Discovery in Databases allows an individual department to use this information in making appropriate decisions and enhancing its working methodologies. This, unquestionably, translates into increased efficiency, higher progress rates, and an economical society.

Along with the development of the relatively new E Governance Model based on Data Mining and Data Warehousing, it is also important to determine multiple rules and policies for future implementation and better administration.

In fact, Data Mining with Data Warehousing should be an ongoing process. It should be integrated with the strategic futuristic planning of the entire government. Analysis through Data Mining would clearly establish the strong and weak areas of planning and implementation of the whole government process. However, it would take some time to develop an appropriate Data Warehouse of past data on which to carry out qualitative analysis using Data Mining techniques.

The entire process of Data Warehouse development for any application may be based on the unique identification of the critical species, i.e., the citizen of the nation, with no duplication of the process. Similarly, since the district is the center of implementation, all development actions, regulatory functions of the various departments and social welfare activities should be quantitatively associated with the unique identification of each development activity, so that all developmental activities are completed by the targeted date for utilization by their stakeholders.

REFERENCES
[1]. Junfeng Pan, et al., “Cost-Sensitive Data Preprocessing for Mining Customer Relationship Management Databases”, Intelligent Systems, Volume 22, Issue 1, Jan.-Feb. 2007, pp. 46-51.
[2]. “WEKA 3: Data Mining Software in Java”, Retrieved March 2007 from http://www.cs.waikato.ac.nz/ml/weka/
[3]. Usman Muhammad Anwar, et al., “Multi-Agent Based Semantic E-Government Web Service Architecture”, IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops (2006), pp. 599-604.
[4]. Gregory B. White et al., “Introduction to the 2006 Minitrack on E-Government Security”, Proceedings of the 39th Hawaii International Conference on System Sciences, 0-7695-2507-5/06/$20.00 (C) 2006 IEEE, ieeexplore.ieee.org/iel5/10548/33364/01579445.pdf?arnumber=1579445
[5]. Graham Williams, Data Mining Desktop Survival Guide, http://www.togaware.com/datamining/survivor/Usage2.html
[6]. Ruey-Chyi Wu, Ruey-Shun Chen, Chen, C., “Data mining application in customer relationship management of credit card business”, Computer Software and Applications Conference 2005. COMPSAC 2005. 29th Annual International, Volume 2, 26-28 July 2005, pp. 39-40.
[7]. “About Kiosk”, E Governance of Government of West Bengal, Retrieved December 2006.
[8]. U.S. General Accounting Office (GAO), “Data Mining: Federal Efforts Cover a Wide Range of Uses”, GAO-04-548, http://www.gao.gov/new.items/d04548.pdf
[9]. United States General Accounting Office, Report to Congressional Committees, “Aviation Security, Computer-Assisted Passenger Prescreening Faces Significant Implementation Challenges”, www.gao.gov/new.items/d04385.pdf
[10]. Krouse, William J., CRS Report for Congress, Received through the CRS Web, Order Code RL32536, “The Multi-State Anti-Terrorism Information Exchange (MATRIX) Pilot Project”, www.fas.org/irp/crs/RL32536.pdf
[11]. Salazar, A, Gosalbez, J, Bosch, I, Miralles, R, Vergara, “A case study of knowledge discovery on academic achievement, student desertion and student retention”, Information Technology: Research and Education, 2004. ITRE 2004. 2nd International Conference on, 28 June-1 July 2004, pp. 150-154.
[12]. Thomas Zwahr and Matthias Finger, “Enhancing the e-Governance model: Enterprise Architecture as a potential methodology to build a holistic framework”, Proceedings of the International Conference on Politics and Information System: Technologies and Applications, Orlando, Florida, USA.
[13]. Riley, Thomas B., International Tracking Survey Report ‘03, Number Two, “Knowledge Management and Technology”, http://www.ileyis.com/publications/research_papers/tracking03/intlrackingRpt June03no2.pdf
[14]. Dunham, M.H., “Data mining introductory and advanced topics”, Upper Saddle River, NJ: Pearson Education, Inc.
[15]. Report to Congress, “Terrorism Information Awareness Program”, In response to Consolidated Appropriations Resolution, 2003, Pub. L. No. 108-7, Division M, § 111(b), http://w2.eff.org/Privacy/TIA/TIA-report.html
[16]. Goharian and Grossman, “Data Mining Classification”, Illinois Institute of Technology, http://ir.iit.edu/~nazli/cs422/CS422-Slides/DM-Classification.pdf
[17]. Mack, Gregory, “Total Information Awareness program (TIA)”, System Description Document Version 1.1, http://www.epic.org/privacy/profiling/tia/tiasystemdescription.pdf
[18]. Bob Mann, et al., “Scientific Data Mining, Integration, and Visualization”, UK e-Science Technical Report Series, ISSN 1751-5971.
[19]. Jain A.K., Murty M.N., Flynn P.J., “Data Clustering: A Review”, ACM Computing Surveys, 31, 3:264-323.
[20]. Apte C. & Weiss S.M., “Data Mining with Decision Trees and Decision Rules”, T.J. Watson Research Center, http://www.research.ibm.com/dar/papers/pdf/fgcsapteweiss_with_cover.pdf

[...]versity, Varanasi, and Post Doctoral degree at University of Michigan, USA. He worked as a Reader/Lecturer in Chemical Engineering, Banaras Hindu University, Varanasi, India; Director, Institute of Engineering & Technology, Lucknow, India; and Founder Vice-Chancellor, JRH University, Chitrakoot, India. His research interests include ERP, E Governance, Data Mining and Environmental Science and Engineering. G.N. Pandey is the author of 12 books and more than 200 research papers.