Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Santhosh Baboo
malathi.arunachalam@yahoo.com
Research Scholar
Bharathiar University
Coimbatore, 641 046, India.
santhos2001@sify.com
Abstract
The concern about national security has increased significantly since the 26/11
attacks at Mumbai. However, information and technology overload hinders the
effective analysis of criminal and terrorist activities. Data mining applied in the
context of law enforcement and intelligence analysis holds the promise of
alleviating such problem. In this paper we use a clustering/classify based model
to anticipate crime trends. The data mining techniques are used to analyze the
city crime data from Tamil Nadu Police Department. The results of this data
mining could potentially be used to lessen and even prevent crime for the forth
coming years.
Keywords: Crime, Data Mining, Classification, Cluster and Crime Analysis
1. INTRODUCTION
The concern about national security has increased significantly since the terrorist attacks on
November 26, 2008 at Mumbai. Any intelligent system [15][16][17] as crime analysis tool for
police, it is required to understand Indian Police structure, responsibilities of the police, key
changes and challenges the police is forcing [14].
Intelligence agencies such as the CBI and NCRB (National Crime Record Bureau) are actively
collecting and analyzing information to investigate terrorists activities [12]. Local law enforcement
agencies like SCRB(State Crime Record Bureau) and DCRB(District Crime Record
Bureau)/CCRB (City Crime Record Bureau) have also become more alert to criminal activities in
their own jurisdictions. One challenge to law enforcement and intelligence agencies is the
difficulty of analyzing large volumes of data involved in criminal and terrorist activities. Data
mining holds the promise of making it easy, convenient, and practical to explore very large
databases for organizations and users. Different kinds of crime patterns are clustered together
instead of using geographical clustering. Based on these crime patterns, a classifier model is
applied to predict the crime trend. However, much literature on crime trends focuses only on
violence [13] In this paper, we review data mining techniques applied in the context of law
enforcement and intelligence analysis.
77
Data Attributes
The following yearly attributes were presented and used in the data set for the city crime statistics
[7][8]
78
Dowry Death
Molestation
Sexual Harassment
Cruelty by Husband and her relatives.
Others (sum of next 6 attributes)
Kidnapping & Abduction of others
Criminal Breach of Trust
Arson
Cheating
Counterfeiting
Others IPC crimes
Cluster
0
1
2
3
4
5
6
7
8
9
Percent
1
13
9
8
7
20
13
4
14
12
Mean
46.012
3.488
7.830
10.233
11.444
13.885
24.197
35.477
5.732
1,975
StdDev
3.938
863
894
1.185
0.0012
3.570
3.648
3.521
810
816
TABLE 1: Weka.clusters.EM
79
80
Crime against women includes Rape, Dowry Death, Molestation, Sexual Harassment, Cruelty by
husband or relatives, Kidnapping & Abduction of women & Girls. Crimes against the women have
been going down since 2000, but once again the same thing has been increased since 2004 till
2008, finally it started coming down.
81
Acknowledgements
We are highly indebted to Dr. Sylendra Baboo, Commissioner of Police, Coimbatore. We are
also thankful to him and his team members for sharing their valuable knowledge in this field
6. REFERENCES
[1] Fayyad, U.M., & Uthurusamy, R. (2002). Evolving data mining into solutions for insights.
Communications of the ACM, 45(8), 28-31.
[2] Chau, M., Xu, J., & Chen, H. (2002). Extracting meaningful entities from police narrative
reports. In: Proceedings of the National Conference for Digital Government Research (dg.o
2002), Los Angeles, California, USA.
[3] Classification via Decision Trees in weak, Depaul University,
http://maya.cs.depaul.edu/~classes/ect584/WEKA/classify.html
[4] Frank DellaertThe expectation Maximization Algorithm,
Technical Report, Georgia Institute of Technology, 2002
82
83