Sei sulla pagina 1di 4

Big Data: Concept, Applications, & Challenges

Novan Zulkarnain Muhammad Anshari


Bina Nusantara University Universiti Brunei Darussalam
Indonesia Brunei Darussalam
novan@binus.ac.id anshari.ali@ubd.edu.bn

Abstract—Any organization either public or private rely on study is to examine fundamental concept, applications, and
accurate data analytic to take decisions. The utilization of big challenges that are closely related to big data in organization.
data is pivotal for driving organization extract value from a large This paper is organized as follows: the next section discusses in
amount of data. Big data is a method and technique to retrieve, more detail the literature analysis on big data, Section 3
collect, manage and analyze a very huge volume of both explains research methodology, the discussion is in Section 4,
structured and unstructured data that is difficult to process using and Section 5 is the conclusion.
traditional database which entail new technologies and technique
to analyze them. This paper covers the big data with the II. LITERATURE REVIEW
intentions to find out the concept, its applications and challenges
from literature analysis as well as discussing and reviewing A. Definition
interpretations from these findings along with possible Historically, data is accumulated in organization by
recommendations. entering data into computer system, however, as the Internet of
Things (IoT) evolve users can generate their own data. The
Keywords—big data; big data analytics; big data application; excessive use of Internet will increase by IoT due to the
Social Networks
integration of all objects via embedded systems. This leads to a
I. INTRODUCTION high distribution of network of devices communicating with
people as well as other devices [26]. From these network of
Big data is a term for large and complex sets of data in devices, huge portions of raw data can be generated and stored
which traditional methods of processing data are insufficient. as big data. This data comes from anything and anywhere such
This essay will cover about the big data as a whole, with the as customers, smart mobile devises, sensors, purchase
intentions to find out the origins of the concept, its applications transaction records, social media, digital images, CCTV, GPS,
and challenges from findings resulted from cited sources and audio that can be used to gather specific information. The
publications as well as discussing and reviewing interpretations definition of big data is unclear [8], yet there are attempts to
from these findings along with possible recommendations. Big define it. One definition about big data is that it refers to sets of
data need to be analyzed to gain its value either the trends, data where their size is too advanced, too complex for typical
patterns or behavior anything related to the people or data software tools to use their abilities for analysis or
customers. Though data comes from the rapid growth of management etc. [13]. However the definition of big data is
volume, yet it does rapidly and efficiently processing those intentionally subjective and as it incorporates a non-fixed
data refers to velocity of data processing [20]. Big data analytic definition depending factors such as advancing technology in
leads to more precise analysis thus helps to bring more accurate which would result in the size of datasets that would qualify as
decision-making and better performance. Big data are collected big data to increase [13]considering how as of 2012, big data
either through structured or unstructured data sources (online ranges from a few dozen terabytes to many petabytes of data
or offline). Unstructured data can come from social media [9]. Furthermore, Big Data can be differentiated with 3 key
(Facebook, Instagram, Twitter posts, etc) [25]. While, differences i.e. the 3 V’s. They are Volume, in which
structured data sources can come from internal database of organizations collect data from different sources with different
organization. In businesses, both sources are used to sizes of data considering how in 2012, 2.5 Exabyte are created
understand the patterns of the customers. Indeed, an each day [15]. Velocity, where many applications priorities the
organization nowadays relies the fact that any data could be speed of data creation over its volume which allows real time
analyzed and used to reveal patterns of their customers. In information to be processed and Variety, where big data can
other words, big data will help the organization to understand take in different forms such as messages, updates and even
the behavior of their customers and use it to win a competition. pictures [15].
Even though business organizations are still in early stage Big data is defined as a massive amount of data, very
of perceiving big data as an asset, public agencies are still quickly in processing from many different forms to support
struggling with the issue of open data, whereas science and decision making [9]. Therefore, it is widely known with the
technology are exploring the potentials of big data and its volume, velocity and variety. Other definition is a massive
innovation, yet general public are keep producing a huge volume of structured and unstructured data that is gathered and
amount of data in daily basis poses challenges for all analyzed through new methods that can produce value for the
organizations. An organization faces the fact that the reality of organization [6]. Social media is a part of big data, in which the
big data that can affect their competitiveness. The aim of this

978-1-5090-3352-2/16/$31.00 ©2016 IEEE 16-18 November 2016, Aston Tropicana Hotel, Bandung, Indonesia
2016 International Conference on Information Management and Technology (ICIMTech)
Page 307
number of people using social media are high in number. It had customer service since an organization can understands
shown that 3.419 billion are internet users whereby 2.307 customer’s preferences. For instance, Netflix can detect traffic
billion are active social media users [4]. Analyzing those details for various devices, spot problems in the area and add
amount of data requires a new tools and techniques which is systems that can help the future demand [14]. They are also
beyond a traditional processing tools [1, 17] and exceeds the able to get more vision of their customer’s desire [Ibid.].
processing capacity of conventional database systems. It is Lastly, value is an outcome of big data analytic so that is able
based on the ability which able to harness data in different to make the right decision for his business.
ways to generate effective understanding and utilize values TABLE I. DATA ANALYSIS IN PERIOD
[20]. Big data grows exponentially, accumulates quickly, and
combine multiple data types. Therefore, organization should Source [7]
use advance data analytic to process them [8, 25]. Naming Period Key points
In short, there are many authors defines big data but Decision 1970-1985 Analyzing some structured data to
majority of them has a term for big data and that term is Support support decision making.
explosion of data. Meanwhile, big data as a non-sampled data Executive 1980-1990 Data analysis for senior executives to
which characterized by the invention of databases from various Support take action.
electronic primary sources other than statistical interpretation Online 1990-2000 Application for analyzing
[10]. Whereas, another author has believed that big data is analytical multidimensional data tables.
different because it can generate on a vast scale through social processing
media or not such as online interaction between people, (OLAP)
transactions between people and systems and sensor enabled Business 1989-2005 Applications to support data-driven
machinery [19]. intelligence decisions, with emphasis on reporting.

B. Characteristics & Model Analytic 2005-2010 Statistical and mathematical modelling


analysis for decision.
Big data have brought about by the increased data with a
3Vs model, i.e., the increase of Volume, Velocity, and Variety, Big Data 2010-Present Analysis of structured and unstructured
of very large data, fast-moving data, in
[11]. Gartner and many other enterprises, including IBM [27] short period of time.
and Microsoft still used the “3Vs” model to describe big data
[16]. In the “3Vs” model, Volume means collection of massive
data scale becomes increasingly huge (Figure 1). Velocity The table above shows the evolution of data analysis for
refers to the timeliness of big data, specifically, data collection decision making over the last 45 years. It has transformed from
and processing analysis, must be rapidly and timely conducted decision support, executive support, online analytical
to gain maximum value. Variety means many types of data processing, business intelligence, analytics and now to big data
including structured and unstructured data such as audio, video, (see Table 1).
text, and traditional structured data (CRM, ERP, SCM). Both
unstructured and structured data can be analyzed using C. Big Data Analytics
Hadoop. Hadoop is able to analyze a large volumes of data as Big data analytic is a process of discovering patterns and
well as mining it to reveal the pattern. trends from a large amounts of data to extract its value and
correlations [21, 23]. Based on the report done [22], the stages
that are needed to be done in data retrieval for big data are as
followed: 1) Data acquisition: Data acquired from the medium
where data generation is growing at an exponentially rate.
Although, the data that is being produced continuously mostly
made up of unprocessed data that are useless and due to its
unstructured form, selecting and discarding unneeded data can
be quite challenging. 2) Data extraction: Majority of the
acquired raw data are not useful. Hence, deciding which data
are needed to be kept and which one should be discarded is a
difficult task to perform as well as there is an abundance of it.
3) Data collation: Most of the time, utilizing data from one
Fig. 1. Big Data Model sample is inadequate to be used in an analysis or prediction. So,
data is retrieved from different sources and would be combined
There are many benefits of big data. Firstly, an organization
or superimposed so that a bigger and more detailed picture is
come up to more accurate information as big data can discover
formed. From this point, the data can be analyzed more
value, connections, trends, and pattern. Secondly, it improves
properly. 4) Data structuring: It is important for the analyzed
decision making process since it is richer in term of data
data to be organized in a structured form. This enables the
gathering. Thirdly, big data can reduce the maintenance costs.
retrieval of information to be easier. 5) Data visualization:
A certain type of equipment is likely to wear out after years, so
Usually, case studies would concentrate on certain part of an
it costs a lot to replace every technology, even the important
area or region. The data that is being pulled from these areas
files or documents are left in them. Furthermore, it keeps the
are then analyzed and converted into a more specific and visual
data save by detecting the internal threats. Forth, it improves
format. 6) Data Interpretation: This is where valuable

978-1-5090-3352-2/16/$31.00 ©2016 IEEE 16-18 November 2016, Aston Tropicana Hotel, Bandung, Indonesia
2016 International Conference on Information Management and Technology (ICIMTech)
Page 308
information will be extracted. There are two types of computing becomes trends for any organization either large
information that can be acquired: Retrospective Analysis and corporation or small companies to manage big data storage.
Prospective Analysis. Retrospective involves gaining insights
from the past events and actions. Prospective Analysis is Value Creation
distinguishing patterns and discovering trends for future based The era of big data is coming to bring new opportunities for
off the data that was recorded. discovering new values. It can deliver the valuable aspect in
improving innovation, competitiveness and creation of
III. METHODS business value [24]. Big data creates a high demand for
Referred journal publications were used to find all sources organizations that can analyze and use them [25]. Big data
for referencing, with publications ranging from 2010 to 2015 have an enormous potential to create a better customer services
mostly in English languages. The methods employed of content in any industries like healthcare, education, agriculture, mining,
analysis of the research on big data in any organizations either financial matters, security, communication, and even traffic
public or private. Thirty literature reviews were analyzed from control. In healthcare, according to the Bernard Marr, at the
research of big data published in peer-reviewed journals. Then, hospital, by collecting, and analyzing every heart rate of an
the results were clustered into a thematic of technical, value infant can figure out the symptoms or condition of the new
creation, and possible adoption to an organization. born. Not only that, it can save lives.
Improve Performance
IV. APPLICATIONS Organizations deploy big data in order to enhance their
performance especially in quantify their sales. With the help of
Big data gives tremendous impact on business, ranging big data analytic, organization has better view of their
from consumers to supply chains operation and companies customers based on their online behaviors such clicks, e-
[18]. While, big data applications are management and commerce’s transactions, web visit histories, etc. Big data
processing of distributed data, and it can be a new tool for data analytic helps e-commerce Company to understand what their
analysis and visualization [19]. Big data is useful for business customer’s profile, trends, interests, and personality. Then, the
organizations to help those gaining deeper insights of company uses the result to predict what most likely the
customers’ habits and behaviors. Because big data is a growing customers’ preference their next purchase over product or
concept, big data is present in variety of areas. There are many service so then the company might propagate to their web /
applications of big data despite it being a relatively new thing Apps visit an advertising or offering discounts of products that
in the public. For example, by 2011, 235 terabytes was would likely they will bound in transaction. For instance,
collected by the US Library of Congress and around 30 billion ticketing online will push customers with advertisements based
content is shared in Facebook every month. By 2016, it is on their last visit. Customer who visit the ticketing web for
predicted that around $7.4m will be spend on data-related searching hotel in Bangkok. He can expect some
initiatives and $13.8m in enterprises [13]. advertisements in his free email account or website he visit at
The usage of mobile service provides data that can be used the next day with full advert of hotel in Bangkok. Ticketing
to improve public sector understanding of educational needs company tailors the advertisement based on website history for
and knowledge gaps, permits more targeted and timely each customer. Furthermore, business will then expand by
capability to circulate critical information. Data from e- attracting more customers from reading the positive reviews by
commerce can give a deep understanding into spending and these loyal customers. Businesses may increase their
saving habits across sectors. Online transaction histories performance by understanding better for their customers’ need
provide a credit histories and allow the individuals to make and preferences. Then, big data become ‘a must’ strategy for
loans and other credit-based financial services. Healthcare business to consider in order to forecast what the needs of their
organizations are able to gather information regarding disease customers are. Furthermore, big data in business can allow
trends and treatments for their patients. Big data can be used to updating the details such as transactions’ history and gives an
make a large datasets with treatments and comparison of the accurate predictions and decisions how to utilizing it.
outcomes so that it can be made efficiently and cost effective
manner. In addition, big data can be used to develop the auto
alert products and services. For instance, the usage of data V. CHALLENGES
which is automatically obtained from the sensors embedded in
products can provide after-sales service offerings such as Even though applications and the potential of big data is
proactive maintenance or alert to avoid failures in products, substantial, there are of course challenges that would occur
then the alert is synchronize with the smart mobile device of regarding big data. The main challenge is issue of privacy and
the user. confidentiality. For examples, any online activities like posting
or tweeting on social media are read by the public or person
There are at least three components in order to fully operate who handles the big data. Though, previous studies indicated
big data. First, tools and technologies are important to process that people have very little understanding and concern about
big data. Tools and applications will gather, store, analyze, and how organizations are using big data [5].
display the analytics or results. Hardoop is one of the platform
for big data analytic. Many organizations use Hardoop for their Furthermore, that getting data into the big data platform can
database for storage and data processing. Though, cloud be difficult as different scale and variety of data can overcome
a data specialist who is unprepared in this sort of area [12].

978-1-5090-3352-2/16/$31.00 ©2016 IEEE 16-18 November 2016, Aston Tropicana Hotel, Bandung, Indonesia
2016 International Conference on Information Management and Technology (ICIMTech)
Page 309
Furthermore, according to analyst firm. The United states alone [6] Davis, B. (2013). 10 actual uses of big data. Retrieved from
will face a scarcity of around 140,000-190,000 people that https://econsultancy.com/blog/63594-10-actual-uses-of-big-data/
requires data analytical skills for 1.5 million managers and [7] Davenport, Thomas H. "Analytics 3.0." Harvard Business Review 91,
no. 12 (2013): 64-+
analysts that mastery in using big data analytics’ tool to support
[8] Dumbill, E. (2013). Making sense of big data. Big Data, 1(1), 1-2.
decisions in the organizations [2]. There will be about 44
million IT jobs created globally to support big data by 2015 yet [9] Hashem, Ibrahim Abaker Targio, Ibrar Yaqoob, Nor Badrul Anuar,
Salimah Mokhtar, Abdullah Gani, and Samee Ullah Khan (2015). "The
there is no guarantee of employees filling up these positions rise of “big data” on cloud computing: Review and open research
[3]. issues." Information Systems 47: 98-115
Finally, big data analytic is given to data management using [10] Horrigan, M.W. (2013), Big Data: A Perspective from the BLS, Amstat
News, January 2013, 25-27.
supporting tools to analyze, store, and present results.
[11] Laney, Douglas (2001). 3-d data management: Controlling data volume,
Presenting a comprehensive results yet user friendly from velocity and variety. META Group Research Note, February, 6, 2001.
structured and unstructured data sources are the biggest [12] Loshin, D.(2014) Adressing five emerging challenges of big data.
challenge in big data analytic. It requires a new advancement of Retrieved from https://www.progress.com/docs/default-source/default-
tools and methods in order to gain an expected value. For document-library/Progress/Documents/Papers/Addressing-Five-
instance, it is a big challenge for combining CCTV record, Emerging-Challenges-of-Big-Data.pdf
audio conversation, social networks activities, and CRM record [13] Manyika, J., Chui, M., Brown, B., Bughin, J., Dobbs, R., Roxburgh, C.,
to reveal pattern of a potential customer. It involves technique & Byers, A. H. (2011). Big data: The next frontier for innovation,
competition, and productivity.
to analyze and obtain intelligence from those big data sources
[14] Matteson, Scott (2013). "Big Data Basic Concepts and Benefits
and analytics are closely connected to each other. We can Explained." Web blog, http://www. techrepublic. com/blog/big-data-
expect some advancement in tools and technologies for big analytics/
data analytic. [15] McAfee, A., Brynjolfsson, E., Davenport, T. H., Patil, D. J., & Barton,
D. (2012). Big data. The management revolution. Harvard Bus Rev,
90(10), 61-67.
VI. CONCLUSION [16] Meijer, Erik. (2011). The world according to linq. Communications of
the ACM, 54(10):45–51, 2011.
Business organizations are in need to consider big data to make [17] Ohlhorst, F, J.(2013). Big data analytics. Turning big data into money.
more accurate analysis, leading to a better decision making. Retrieved from http://www.scribd.com/doc/244559565/Big-Data-
Big data can be characterized by three Vs; Volume, Variety, Analytics-Frank-J
and Velocity. Big data is defined as a large volume of data that [18] Pahm, P. (2015). The Impacts of Big Data That You May Not Have
needs to advanced techniques and tools to its data analytics’ Heard Of. Retrieved from
http://www.forbes.com/sites/peterpham/2015/08/28/the-impacts-of-big-
complexity, data size and type. Big data analytic does process data-that-you-may-not-have-heard-of/#1d2c12fbc957
multiple sources of data to present any patterns, trends, or [19] Rodriguez, R.N. (2012), Big Data and Better Data, Amstat News, June
customers’ behavior. By then. Big data can be used to detect 2012, 3-4.
future transactions based on customers’ behavior or discover [20] Schönberger, V. M., & Cukier, K. (2013). Big Data: A revolution that
future business trends due to its capabilities to gather a huge will transform how we live, work, and think. New York, NY: Houghton
data at a speed and come up with the extracted value. Mifflin Harcourt.
[21] Sagiroglu, S., & Sinanc, D. (2013, May). Big data: A review. In
Collaboration Technologies and Systems (CTS), 2013 International
Conference on (pp. 42-47). IEEE.
[22] Toshniwal, R., Dastidar, K. G., & Nath, A. (2015). Big data security
REFERENCES issues and challenges. International Journal of Innovative Research in
[1] Beyer, Mark. "Gartner Says Solving'Big Data'Challenge Involves More Advanced Engineering (IJIRAE), 2(2), 15-20.
Than Just Managing Volumes of Data." Gartner. Archived from the [23] Villars, R.L. et al. (2011) Big Data: What It is and Why You Should
original on 10 (2011). Care. IDC
[2] Brown, Brad, Michael Chui, and James Manyika. "Are you ready for the [24] Williamson, J. (2014). Getting a big data job for dummies. Hoboken,
era of ‘big data’." McKinsey Quarterly 4, no. 2011 (2011): 24-35. New Jersey: John Wiley & Sons, Inc.
[3] Caballero, Ismael, Manuel Serrano, and Mario Piattini. "A Data Quality [25] Watson, H.J. (2014) Tutorial: Big Data Analytics: Concept, technology
in Use Model for Big Data." In Advances in Conceptual Modeling, pp. and application. Communication for the association for information
65-74. Springer International Publishing, 2014. systems, 34(65), 1247-1268. Retrieved from http://aisel.aisnet.org/cais/
[4] Chaffey, D. (2016). Global social media research summary 2016. Smart [26] Xia, F., Yang, L. T., Wang, L., & Vinel, A. (2012). Internet of things.
Insights. Retrieved from http://www.smartinsights.com/social-media- International Journal of Communication Systems, 25(9), 1101-1102.
marketing/social-media-strategy/new-global-social-media-research/
[27] Zikopoulos, P., Chris Eaton, et al. Understanding big data: Analytics for
[5] Clemons, Eric K., James Wilson, and Fujie Jin. "Investigations into enterprise class hadoop and streaming data. McGraw-Hill Osborne
Consumers Preferences Concerning Privacy: An Initial Step towards the Media. 2011.
Development of Modern and Consistent Privacy Protections around the
Globe." In System Sciences (HICSS), 2014 47th Hawaii International
Conference on, pp. 4083-4092. IEEE, 2014.

978-1-5090-3352-2/16/$31.00 ©2016 IEEE 16-18 November 2016, Aston Tropicana Hotel, Bandung, Indonesia
2016 International Conference on Information Management and Technology (ICIMTech)
Page 310

Potrebbero piacerti anche