Sei sulla pagina 1di 60

WWW.DBTA.

COM

The Year Ahead: Data Will 4


Drive the Enterprise in 2019
How to Help Your DBAs 42
Evolve With Automation
Managing Database Performance 55
Multiplatform Data Quality Management
Unison – Speedy, Secure, Scalable

Melissa’s Unison is a data steward’s best friend. It’s the ideal multiplatform solution to establish
and maintain contact data quality at higher speeds – processing 30+ million addresses per hour
– while meeting the most stringent security requirements. With Unison, you can design,
administer and automate data quality routines that cleanse, validate and enrich even your most
sensitive customer information, as data never leaves your organization. Streamline data prep
workflows, reduce analytics busy work, gain more insights and increase efficiency!

• Verify and standardize names, addresses, emails and phone numbers, plus append lat/long
coordinates and Census data.
• On-Premise platform requiring no coding or programming with automatic updates from Melissa.
• Works offline, scalable across multiple servers and allows users to script batch jobs with various
levels of data access.
• Access, manage and visually analyze the quality of your data over time, enjoy project
collaboration and more!

Request a Demo

Melissa.com/dbta-unison
1-800-MELISSA
The Journal of
Information Integration and Management

PUBLISHED BY Unisphere Media—a Division of Information Today, Inc.


Volume 32 | Number 6
DEC 2018/JAN 2019
CONTENTS
EDITORIAL & SALES OFFICE 121 Chanlon Road, New Providence, NJ 07974
CORPORATE HEADQUARTERS 143 Old Marlton Pike, Medford, NJ 08055
FEATURES
RESEARCH@DBTA
Thomas Hogan Jr., Group Publisher Celeste Peterson-Sloss, Lauree Padgett,
609-654-6266; thoganjr@infotoday Editorial Services 2 CLOUD OPENS THE PATH TO DATABASE EXPANSION
Joyce Wells, Editor-in-Chief Tiffany Chamenko, By Joe McKendrick
908-795-3704; Joyce@dbta.com Production Manager
Joseph McKendrick, Lori Rice Flint, FEATURE STORY
Contributing Editor; Joseph@dbta.com Senior Graphic Designer 4 THE YEAR AHEAD: DATA WILL DRIVE
Adam Shepherd, Jackie Crawford, THE ENTERPRISE IN 2019
Advertising and Sales Coordinator Ad Trafficking Coordinator
908-795-3705; ashepherd@dbta.com By Joe McKendrick
Sheila Willison, Marketing Manager,
Stephanie Simone, Managing Editor Events and Circulation
908-795-3520; ssimone@dbta.com 859-278-2223; sheila@infotoday.com TREND-SETTING PRODUCTS > SPECIAL SECTION
Don Zayacz, Advertising Sales Assistant DawnEl Harris, Director of Web Events; 16 INTRODUCTION
908-795-3703; dzayacz@dbta.com dawnel@infotoday.com
18 DBTA TREND-SETTING PRODUCTS
COLUMNISTS FOR 2019
Rob Mandeville, www.solarwinds.com Craig S. Mullins, www.CraigSMullins.com
Guy Harrison, guy@tobacapital.com Todd Schraml, TWSchraml@gmail.com
Kevin Kline, Kevin_Kline@dbta.com

ADVERTISING DEPARTMENTS
Stephen Faig, Business Development Manager, 908-795-3702; Stephen@dbta.com
APPLICATIONS
INFORMATION TODAY, INC. EXECUTIVE MANAGEMENT 42 HOW TO HELP YOUR DBAS EVOLVE WITH AUTOMATION
Thomas H. Hogan, President and CEO Thomas Hogan Jr., Vice President,
Marketing and Business Development By Robert Reeves
Roger R. Bilboul,
Chairman of the Board Bill Spence, Vice President, 44 BLOCKCHAIN FUNDAMENTALS
John C. Yersak, Information Technology
Vice President and CAO Q&A with Paul Tatro, Founder of Blockchain U Online

DATABASE TRENDS AND APPLICATIONS (ISSN: 1547-9897; USPS: 16230) is published


bimonthly (Feb./Mar., Apr./May, Jun./Jul., Aug./Sep., Oct./Nov., and Dec./Jan.) by Unisphere MULTIVALUE SOLUTIONS
Media, a division of Information Today, Inc., 143 Old Marlton Pike, Medford, NJ 08055 USA; 46 MULTIVALUE AND THE CLOUD:
Phone (609) 654-6266; Fax (609) 654-4309; Internet: infotoday.com. Registered in U.S. Patent
& Trademark Office. Periodicals postage paid at Vincentown, NJ, and additional mailing offices. FLEXIBILITY FOR THE FUTURE
© Copyright, 2018 Information Today, Inc. All rights reserved. By Julianna Cammarano
No part of this publication may be reproduced in whole or in part PRINTED IN USA
in any medium without the express permission of the publisher. 48 JBASE HELPS ENCOMPASS SUPPLY CHAIN
POSTMASTER Send address changes to Database Trends and Applications, P.O. Box 3006, STREAMLINE DEVELOPMENT OPERATIONS
Northbrook, IL 60065-3006.
RIGHTS AND PERMISSIONS
Permission to photocopy items is granted by Information Today, Inc. provided that a base fee
of $3.50 plus $0.50 per page is paid directly to Copyright Clearance Center (CCC), or provided
that your organization maintains an appropriate license with CCC. COLUMNS
Visit www.copyright.com to obtain permission to use these materials in academic coursepacks
or for library reserves, interlibrary loans, document delivery services, or as classroom handouts; 49 NEXT-GEN DATA MANAGEMENT > BY ROB MANDEVILLE
for permission to send copies via email or post copies on a corporate intranet or extranet; or
for permission to republish materials in books, textbooks, and newsletters.
ANOMALIES—PREDICTING THE PAST
Contact CCC at 222 Rosewood Drive, Danvers, MA 01923; (978) 750-8400; Fax: (978) 646-8600; 50 EMERGING TECHNOLOGIES > BY GUY HARRISON
www.copyright.com. If you live outside the USA, request permission from your local
Reproduction Rights Organization. (For a list of international agencies, consult www.ifrro.org.) WEB SERVICES MOVE FORWARD WITH GRAPHQL
For all other requests, including making copies for use as commercial reprints or for other 51 SQL SERVER DRILL DOWN > BY KEVIN KLINE
sales, marketing, promotional and publicity uses, contact the publisher in advance of using
the material. For a copy of our Rights and Permissions Request form, contact Lauree Padgett, LOADS OF DATA AND AI ANNOUNCEMENTS
lpadgett@infotoday.com. AT IGNITE 2018
ONLINE ACCESS Visit our website at www.dbta.com
53 IOUG OBSERVATIONS > BY SIMON PANE
Contents also available online under direct licensing arrangements with EBSCO, NewsBank,
ProQuest, and Gale and through redistribution arrangements with information service THE TOOLS THE MODERN DBA NEEDS TO KNOW
providers including, Dow Jones Factiva, LexisNexis, OCLC, STN International, and Westlaw.
SUBSCRIPTION INFORMATION
55 DBA CORNER > BY CRAIG S. MULLINS
Subscriptions are available free to qualified recipients in the U.S. only. Nonqualified MANAGING DATABASE PERFORMANCE
subscribers in the U.S. may purchase a subscription for $74.95 per year. Delivery outside
North America is $140 via surface mail per year. All rates to be prepaid in U.S. funds. 56 DATABASE ELABORATIONS > BY TODD SCHRAML
Subscribe online (circulation@dbta.com) or write Information Today, Inc., 143 Old Marlton Pike,
Medford, NJ 08055-8755. BEWARE THE FRANKENMART!
Back issues: $17 per copy, U.S.; $22 per copy, Canada and Mexico; $27 per copy outside North America;
prepaid only. Missed issues within the U.S. must be claimed within 45 days of publication date.
Change of Address: Mail requests, including a copy of the current address label from a
recent issue and indicating the new address, to DATABASE TRENDS AND APPLICATIONS,
P.O. Box 3006, Northbrook, IL 60065-3006.
Reprints: For quality reprints of 500 copies or more, call (908) 795-3703 or email reprints@dbta.com.
DISCLAIMERS Acceptance of an advertisement does not imply an endorsement by the
publisher. Views expressed by authors and other contributors are entirely their own and MEDIA PARTNER OF THE FOLLOWING USER GROUPS
do not necessarily reflect the views of the publisher. While best efforts to ensure editorial
accuracy of the content are exercised, publisher assumes no liability for any information
contained in this publication. The publisher can accept no responsibility for the return of
unsolicited manuscripts or the loss of photos. The views in this publication are those of the
authors and donot necessarily reflect the views of Information Today, Inc. (ITI) or the editors.
EDITORIAL OFFICE 121 Chanlon Road, New Providence, NJ 07974
List Rental: American List Council. Contact Michael Auriemma, Account Manager,
(914) 524-5238 or email Michael.auriemma@alc.com
DECEMBER 2018/JANUARY 2019

Cloud Opens
By Joe McKendrick
the Path to
How fast and far can databases The survey found that database Other issues also stand in the way of
grow, and how can such growth be environments are sizable and complex growth. Individuals with database skills
sustained? That’s the question faced these days. Respondents manage mul-mul are getting harder to find. Additional
by many data managers these days, tiple databases, and many have data-
data issues are also coming to the fore since
who deal with growing demands from bases scaling into the multi-terabyte the 2016 survey. For example, respon-
respon-
their businesses for real-time, analyt-
analyt stage. A large portion of respondents, dents who cited challenges with finding
ical capabilities, incorporating data- 46%, also reported that their largest the right skills has more than doubled
driven initiatives such as the Internet database exceeds 1TB and range up to since 2016—from 32% to 66% report-report
of Things and artificial intelligence. 25TB in size. ing issues. There also has been a surge
They are responding and keeping up As organizations keep growing and in administration costs and complexity
with these requirements through a expanding their data environments, getting in the way of data environment
combination of cloud resources and they run into obstacles. The survey expansion—cited by 55% of respon-respon
automation. found licensing and support is the dents for an increase of 24 percentage
These are the findings of a recent number-one challenge for organiza-
organiza points. Of the hardware categories, the
survey of 260 data managers, fielded tions seeking to expand the number only category that showed growth was
by Unisphere Research, a division of of Oracle databases and applications. storage cost, but that growth has been
Information Today, Inc., in partner-
partner Data may have become the fuel driving negligible.
ship with VMware and the Indepen-
Indepen today’s and tomorrow’s enterprises, There has been a marked rise in
dent Oracle Users Group. but regardless of where it comes from cloud computing adoption among
The growth of data-driven enter-
enter or where it is stored—managing it database teams, the survey found. For-
For
prises is pressuring data administra-
administra in commercial databases is still tied ty-one percent reported having cloud
tors to deliver high-performing and to the traditional licensing and sup-
sup in production at scale or in limited use,
responsive systems that can scale port model that has been in place for up from 33% in the 2016 survey. Nota-
Nota
with the business. However, many decades. Activating new databases, bly, 28% of respondents have cloud in
enterprises are encumbered by the processor cores, or end user licenses production at scale, well over double
licensing and support issues that means additional costs. More than that of 2 years ago (11%) (see Figure
typically accompany database sys- sys four in five respondents reported that 1). One-third of respondents said
tems, resulting in potentially high it is difficult to grow their data envi-
envi their use of cloud is growing, with 20%
and unexpected costs, as well as skills ronments due to obstacles with licens-
licens reporting their cloud growth as “signifi-
“signifi
shortages. While enterprises are turn-
turn ing and support from their database cant”—again, a rise over just 2 years ago
ing to the cloud and automation solu-
solu vendors—a number that has increased (see Figure 2).
tions to enhance their capabilities in since the initial survey in 2014. The As public cloud adoption grows,
backup and recovery, the challenge is percentage of data managers citing much of this growth is driven by
many data managers subscribing to challenges with licensing and support backup and recovery to support trans-
trans
cloud services are not making licens-
licens costs, 81%, was up 22 percentage action environments. A total of 23%
ing costs enough of a priority. points over the previous survey. respondents delegate a significant
DECEMBER 2018/JANUARY 2019 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S 3

DATABASE
EXPANSION
share (a quarter or more of their capa-
capa
Figure 1: Does your organization use any public cloud services
bilities) of their backup and recovery
for Oracle database and applications?
processes for transactional environ-
environ
ments to the public cloud. For addi-addi 2018 2016
tional processes affecting data environ-
environ Yes, in production at scale 28%
ments, close to one in five rely on cloud 11%
for significant shares or their business Yes, in limited use/ 13%
continuity, monitoring, and provision-
provision- non-production 12%
ing processes. For analytical data envi-
envi
ronments, there is less commitment 20%
Under consideration
to public cloud at this time—at most, 26%
17% of respondents are dedicating a
Considered and rejected
1%
notable portion of their backup and 9%
recovery process workloads to public
No 38%
cloud environments.
43%
Cost reduction is the main benefit
anticipated with cloud, but agility and 0 20 40 60 80 100
capacity are more likely to be realized
in existing deployments. Data man- man
agers and professionals seek the cost Figure 2: How has your use of public cloud services for Oracle
advantages of public cloud—which database and applications changed over the past year?
may form the basis of business cases,
at least initially. Six in 10 foresee the 2018 2016
cost reductions as the improvement
sought with cloud computing. Increased significantly 20%
As deployments mature, however, 6%
the additional agility and on-demand Increased somewhat 13%
resources that the cloud brings to bear 16%
emerge as the leading benefits. In addi-
addi Has not changed 14%
33%
tion, the advantages of public cloud
Decreased somewhat 1%
computing are far more apparent than
1%
2 years ago. When asked about posi- posi
Decreased significantly 1%
tives already seen, three in four said
they have experienced greater agility. A 51%
Do not use cloud
majority cited the on-demand capacity 45%
clouds bring to the table. n
0 20 40 60 80 100
4 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S DECEMBER 2018 /JANUARY 2019

The Year Ahead

DATA
WILL DRIVE THE
ENTERPRISE
in
DECEMBER 2018 /JANUARY 2019 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S 5

By Joe M c Kendrick

T
he year just ending has been an interest-
ing one for data managers. Artificial intel-
ligence (AI) and machine learning took
center stage, which also meant an increas-
ingly glaring spotlight on data sourcing,
management, and viability. The continued rise of the
Internet of Things (IoT) also meant no letting up on
demands for data environments to deliver require-
ments fast and furiously. The year ahead will bring
more of the same—as well as a continuation of the
transformation of information management.
Here are some of the changes and challenges on
the horizon for 2019, as seen by leading industry par-
ticipants and observers.
6 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S DECEMBER 2018 /JANUARY 2019

MASSIVELY DISTRIBUTED DATA with the new sources. Applications and uct management at Infogix. “Suddenly,
GETS EVEN MORE DISTRIBUTED data platform technologies are, and will every organization conducting business in
Things keep moving away from the cen- continuously get infused with, intelligent the European Union needed a well-func-
ter. As Ken Tsai, global VP and head of cloud technologies so business scenarios can be tioning data governance program.”
platform and data management for SAP, put automated by AI and heuristic usage However, Washington continued, “this
it, customers’ data now sits in an average of data, creating a new class of solutions.” new focus on data governance didn’t auto-
six to eight clouds, as well as their own data matically translate to a better understand-
centers. “This is contributing to the rapid DATA GETS EVEN MORE STRATEGIC ing of enterprise data and new analytical
evolution of data processing technologies to In the year ahead, data managers will insights. Although many businesses are
become massively distributed,” he said. He continue seeing their mandates extend now collaborating across departments,
added that “data integration technologies well beyond their original course of there is still a disconnect between the ana-
are shifting from extract-transform-load to action for managing and securing day- lytics team and those that are focused on
a process- and pipeline-driven approach, to-day transactions. It’s now about governance efforts, resulting in various
with data management and governance “leveraging information to make metadata definitions across teams.”
Introducing governance to analytical
models will help businesses “aggregate the
metadata around their models to ensure
In the year ahead, apps and data platforms will all teams have a complete understanding
of their data and can leverage it in analyt-
increasingly be infused with intelligent technologies ical models,” she stated. “More businesses
so business scenarios can be automated by AI and are now successfully cultivating open
communication between various depart-
heuristic usage data. ments because of their data governance
programs. This suggests that many com-
panies are ready to expand their data gov-
capabilities to support both centralized and strategic, operational, and tactical deci- ernance program to a more strategic focus
federated models.” sions that result in increased revenues, beyond just governing data.”
At the same time, Tsai cautioned, this improve operational efficiencies, and
high level of distribution will “increase enhance customer experience,” said Satya MORE CLOUDS ON THE HORIZON
cost and complexity in managing mixed Sachdeva, VP of insights and data for For some time, the idea of relying on
data landscapes and locations and it is dif- Sogeti USA. Systems are evolving with this public cloud services to handle critical or
ficult to know where the data resides, what growing mandate. “Database technologies sensitive data assets was a non-starter.
data is available, as well as how to govern used for information management have Attitudes have changed, and in the year
and trust the data source and accuracy, been rapidly evolving from traditional ahead, public cloud may be the go-to solu-
and monitor its usage and lineage across relational database management systems tion at many enterprise data sites. “Public
this distributed environment.” and OLAP technologies to MPP-based cloud solutions have made resource con-
This will change the way information appliances,” he said. “Data lakes and sumption cheaper, simpler, and dynamic,”
is managed, Tsai added. “Information Hadoop-based data environments for said Gaurav Yadav, founding engineer and
won’t be tethered to just one system and ingestion, wrangling, and analytics are product manager for Hedvig. There’s
will flow freely and be connected—no g a i n i n g fo o t h o l d s a c ro s s m a ny another advantage beyond this, as public
matter how the business needs evolve or organizations.” cloud services offer a degree of standard-
where the data consumer is located, or ization that is needed across global enter-
the device they want to use for access.” DATA GOVERNANCE GAINS prises or partner networks. “Businesses
Enterprises will approach information Data governance will continue to become are struggling to keep up with this extreme
orchestration from raw feeds with intel- even more critical in the year ahead. The pace of data generation and traditional
ligence and real-time analysis on vast EU’s General Data Protection Regulation data analytics tools are not capable of han-
quantities and varieties of data types, he (GDPR) was a major driver for data gover- dling such globally distributed data.”
said. This will “make it much easier to nance programs in 2018, observed Emily While public cloud solutions have
enrich the mission-critical information Washington, senior vice president of prod- long been the preferred option for
DECEMBER 2018 / JANUARY 2019 | DBTA Sponsored Content 7

Multiplatform Data Quality Management—


Unison is Speedy, Secure & Scalable
WHEN IT COMES to data quality, speed aggressive address corrections at automatically to better understand your
and security are paramount. We kept lightning speed. It leverages a Canada data and collaborate on projects.
that in mind when building Unison, an Post® SERP Certified™ and USPS® CASS More than quick processing and setup,
ideal, multiplatform solution to establish Certified™ address engine to match and Unison saves you time by automatically
and maintain contact data quality at correct spellings and naming mistakes distributing updates directly to your IT
higher speeds—processing 30+ million for cities and streets, and to add the team. If your Unison installation has
addresses per hour—while still meeting correct street name suffix, prefix and internet connectivity, updates will prepare
the most stringent security requirements. ZIP+4 information. themselves on your servers, and you’ll get
Designed to be innovative, scalable and Unison can also append latitude and notified once it’s ready.
reliable, Unison is a holistic platform that sets longitude coordinates with Census data,
the standard for data quality management. validate and standardize email addresses Cleanse Sensitive Customer
It brings data standardization, validation and phone numbers, plus validate and Info Securely & Safely
and enrichment together for speedy end- parse full names in record time. We’ve also added an additional layer
to-end data quality. With Unison, you can Data can be managed confidently of security so only authorized users can
even cleanse your most sensitive customer on-site to meet compliance and security access the application. Administrators
information throughout the enterprise, as requirements due to Unison’s ability to can choose to integrate Unison within
data never leaves your organization. work completely offline. Even within the the company’s LDAP system for
platform itself, Docker containerization preexisting logins, or create account
End-to-End Data Quality security features leave assets isolated logins with role-based capabilities and
at Lightning Speed and self-contained, making Unison ideal configurable user rights through the
Unison offers superior data quality for industries with sensitive data that Unison Authentication System.
management in one handy, on-premise must remain onsite.
platform. It was designed to save you Wide Compatibility for
time with simple setup, fast processing Scalability & Flexibility Future Enhancements
and automated updates. Unison supports horizontal scaling, so Unison currently supports Oracle,
Getting started requires no technical you can maximize existing hardware and SQL Server, MySQL database, and a
knowledge, coding or programming – any hardware you choose to add in the variety of delimited flat files to deliver
and development time is completely future. Scaling horizontally allows you to address, email and phone verification,
eliminated. Simply install the web-based spread work across multiple servers and and geocoding for U.S. and Canadian
client-server application for full access begets huge leaps in processing speed. records. In the future, Melissa will
to address, phone and email verification, Docker Swarm tears the roof off any expand Unison’s service offerings
plus geocoding capabilities. performance ceiling by easily spreading past contact data to include the full
The login portal is accessible through Unison containers across as many spectrum of data enrichment services
local intranets in any modern browser processors as you choose to configure. for all data types. Streamline data
and is set up for effective project The only limiting factor will be your prep workflows, reduce analytics busy
collaboration across multiple users. network speed. work, gain more insights and increase
Unison’s elegant interface is organized You also have the ability to connect to efficiency with Unison.
by the steps of the data management multiple RDBMS platforms and schedule Try a Unison product demo! Connect
workflow and is fully customizable. jobs to process during off hours to with members of Melissa’s customer
Once installed, Unison validates U.S. maximize performance, then visualize support team at www.melissa.com or
and Canadian addresses and performs analytics with reports generated call 1-800-MELISSA. n

Melissa n 22382 Avenida Empresa n Rancho Santa Margarita, CA 92688-2112


Phone: 949-858-3000 n Fax: 949-589-5211 n Toll Free: 1-800-635-4772 n www.melissa.com
8 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S DECEMBER 2018 /JANUARY 2019

startups, existing organizations are also However, there will be a some chal- a boom in the use of algorithms throughout
recognizing the advantages of such ser- lenges in achieving these cloud storage the enterprise for a variety of purposes. “This
vices. Most startups find it easy to ramp up benefits, Ankorion cautioned. “Transfer- has the highest opportunity for business pay-
by consuming public cloud solutions at first, ring high data volumes across wide area off, especially for augmenting human intelli-
and slowly migrating data back on-premise networks can result in high latency and gence,” said Suketu Gandhi, partner in the
for cost optimization, Yadav said. However, consume costly bandwidth,” he explained. digital transformation practice at A.T. Kear-
more mature businesses always had data “Enterprises also need to avoid lock-in and ney. He predicted that as a result, “data and
on-premise, and they migrated some of that maintain full data mobility, moving work- organizational intelligence will be discussed at
data to the public cloud in order to reduce loads between clouds and across hybrid the board level, and CEOs will be talking
the number of resources needed to manage architectures based on lessons learned and about it regularly as well.” Gandhi cautioned,
their infrastructure. changing requirements.” however, that data managers need to get ready
“Public cloud and ‘as-a-service’ solu- for the push to an algorithm-driven enter-
tions have transformed the expectations prise. “It shines a light on the state of data—
such that high-availability, simplicity, and both internal and external—and the ability of
security are expected to be built-in for con- Data governance will an enterprise to use it. There is a lot of work
sumer as well as enterprise products. It’s continue to become to get ready for the fundamental reality of
not enough to just fulfill the customer structured versus unstructured, internal ver-
requirements anymore, delighting the cus- even more critical in sus external, real time versus batch,” he added.
tomer is the key,” he noted Business and data executives are conduct-
For the year ahead, expect to see “a huge the year ahead. ing reality checks on the best approaches for
push in data management vendors provid- the employment of algorithms, autonomous
ing deeper public cloud integrations and operations, and AI. “Many businesses are
removing the complexity out of hybrid IT’S HYBRID TIME being smart when it comes to adopting new
cloud data management and analytics fea- Many enterprises have on-premise data technologies,” said Lyndsay Wise, director of
tures,” Yadav said. assets they want to keep within their four market intelligence at Information Builders.
Itamar Ankorion, CMO at Attunity, walls, at least for now. With the rising pop- “Instead of buying into the hype, they are
predicted greater use of major cloud infra- ularity of public cloud, “2019 will be the asking critical questions for garnering the
structure and platform providers—such as year of multi-cloud and hybrid cloud,” strongest ROI, resulting in a delay in broad
Amazon S3, Azure Data Lake Store (ADLS) said Arun Murthy, CPO and co-founder adoption. For instance, organizations are
and ADLS Gen2, and Google Cloud Stor- of Hor tonworks, which recently realizing that strong data management is a
age—for cloud-based data storage, espe- announced a merger with Cloudera. core foundation for predictive and AI tech-
cially as organizations move to analyt- “Cloud providers will, more aggressively, nology and are first focusing efforts on get-
ics-driven strategies. “These converged differentiate among each other in specific ting their data house in order. Others have
platforms host preferred analytics systems areas—such as operational readiness at realized that they don’t have the pool of data
such as the Snowflake and Amazon Red- Amazon, enterprise integrations for Mic- necessary to make the most of predictive
shift data warehouses, and Amazon Ath- rosoft, and AI and machine learning at technologies and are investing in building the
ena,” Ankorion pointed out. “These systems Google,” he predicted. This array of capa- right data streams.”
incorporate familiar SQL structures.” bilities from across vendors will give rise Most organizations by now “have worked
The traditional cloud benefits apply to multi-cloud strategies being put into with data long enough to know when they’re
here, Ankorion continued. “Enterprises motion, bound by “common security, gov- ready for a new trend or significant invest-
gain economic storage, processing and ernance, and data or workload manage- ment,” Wise added. “Organizations that are
management, elastic resource consump- ment strategies. We will also start to see moving forward with predictive analytics,
tion, and the shift of CAPEX to OPEX. enterprises move some always-on work- machine learning, and AI are doing so
They also can reuse storage for new analyt- loads from the cloud back on-prem for a because they’ve dedicated enough time and
ics workloads. All this is changing how hybrid model to optimize economics.” resources to their data management and are
organizations manage data because they confident they have the right amount of data
can more cost-effectively address more STRENGTHENING THE DATA needed. Data comfort is really critical here.
advanced analytics use cases and thereby FOUNDATION You can’t implement data-fueled technolo-
realize a greater return on their data As data becomes the lifeblood of most gies if, as an organization, you’re not comfort-
investments.” business models and strategies, expect to see able with data internally.”
DECEMBER 2018 / JANUARY 2019 | DBTA Sponsored Content 9

Technical Community College Achieves Smarter


Data Integration with Kore Technologies
The Journal of Information Integration and Management
Kourier Integrator bridges the gap between the Colleague
ERP system and custom applications.

SERVING THE CITIZENS of Forsyth and Web Services and REST Gateway, they Building on the data integration with the
Stokes counties in North Carolina since realized that this would be ideal for accessing employee evaluation application, Forsyth
1960, Forsyth Technical Community College data in real time from the Colleague ERP Tech was able to create a new interactive
offers vocational instruction and training in system to their applications. Integration organizational chart in just a few days,
skilled trades as well as college transfer quickly became a key focus. as opposed to months. In addition, there
and two-year degree programs, corporate “All the tools were there from a data are now plans for new projects, including
training, continuing education, and personal warehousing perspective,” said Christopher bi-directional data integration for its custom
enrichment classes. Pearce, associate vice president and CIO of applications as well as integration between
Forsyth Tech. “When we looked at the cost its Colleague ERP system and third-party
THE CHALLENGE and ROI, plus the add-ons the College was applications.”
Forsyth Tech has relied on Colleague considering, we were impressed. It was
by Ellucian on Rocket Software’s UniData everything that the College was looking for at NEXT STEPS
(MultiValue) platform as its central ERP a price point that the College could commit Next on the roadmap is a project to
system for years. The Colleague system is to easily. Once the College found out about tackle better understanding of and reporting
the backbone of the College, handling every- the REST tools, we knew that we had to on student attendance. A new attendance
thing from an administrative perspective, start working on the relationship with Kore.” application will allow the College to provide
including student information, registration, improved attendance information to the
finance, and human resources (HR). When we looked at the cost state, which is tied to its funding. In addi-
Over time, however, Forsyth Tech found and ROI, we were impressed. tion, the application will enable easy access
the need for new capabilities, and pur- to data for use with predictive analytics
chased additional applications. The College tools to quickly identify students who may
THE BENEFIT
also developed its own custom applications, be struggling academically so they can be
which is less expensive and also means Forsyth Tech’s custom apps handle supported with tutors or counseling.
applications meet its own business rules. course substitution, employee evaluation, Reflecting on its use of Kore’s solu-
Integrating data within the core ERP system and tutor requests, and now, because of tions, Pearce said, “the advantages go
with those applications, however, proved to Kore, they can all be updated in real time. beyond data integration. With the use
be challenging. “Up until this point,” said Pearce, “data of new approaches like REST, Forsyth
While there are plenty of solutions transfer from Colleague to its custom appli- Tech is achieving its goals of persistence,
available for relational systems, tools for cations was accomplished using rudimentary engagement, and retention by enabling
reporting and integration for MultiValue tools to get data into a flat-file format, and greater access to information for both
systems are not as plentiful and, in many then the data was passed over to the applica- staff and students to support academic
cases, cost-prohibitive. Moreover, as a tion, which had to import the file before the success. It is all about giving students the
state-funded institution, Forsyth Tech puts community could start using the data. The information they need when they need it
a high priority on cost-effectiveness in process was a ‘management nightmare’ that and removing barriers to furthering their
everything it does in order to concentrate its required writing scripts and scheduling data education.” ■
resources on its primary goal of educating, transfers, and had a level of latency that is
engaging, and retaining students. unacceptable by today’s standards.”
Kore’s Kourier REST solution accelerated CONTACT INFORMATION
THE SOLUTION the process of building secure, real-time
Forsyth Tech initially became familiar with REST APIs for the integration. Beyond
Kore Technologies through a data ware- real-time integration with its current apps,
housing and reporting project it embarked Forsyth Tech has also been able to bet-
on in summer 2017 using Kore’s Kourier ter serve the needs of additional college
5186 Carroll Canyon Road, Suite B
Integrator, SQL Accelerator and Operational departments and expand on its initial set of
San Diego, CA 92121
Data Store for Colleague. However, as the in-house applications. “For example,” said 866-763-5673
Forsyth Tech team members evaluated more Pearce, “each year, the HR department www.koretech.com
of Kore’s product suite, including Kore’s updates the organizational chart to identify info@koretech.com
RESTful (Representational State Transfer) who works where and to whom they report.
10 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S DECEMBER 2018 /JANUARY 2019

tive staff, because they won’t have to dupli-


cate the efforts of past employees—the
A stronger data foundation will help deliver the value information would be captured before the
expected from data-intensive methodologies such as previous employee left and be easily avail-
able,” said Shuman. “An employee who is
predictive analytics, machine learning, and AI. fully engaged with a company and its infor-
mation feels more a part of the team.”
AI technologies are also playing a key
A stronger data foundation will help through interoperability, through visibil- role in enhancing both data and content
deliver the value expected from data-inten- ity—of all the ways value can be created in search. “Coupling technologies such as
sive methodologies such as predictive ana- today’s era of digital transformation,” Ken- machine learning and natural language
lytics, machine learning, and AI, said Wise. ney said. processing with search is helping compa-
“When implemented correctly, the oppor- Organizations aren’t quite ready to fully nies to glean greater value and insight from
tunities are limitless. We can’t really predict embrace an ecosystem-driven approach, all their data, both structured and unstruc-
the full scope of how AI and predictive tech- however. “More work needs to be done in tured,” said Kamran Khan, a managing
nologies will improve organizations, but we how we view and manage all the data inter- director with Accenture Applied Intelli-
know there will be significant improvements actions happening in our ecosystem, but gence. “It increases the ability to find, ana-
in efficiencies, cost savings, customer service it’s achievable,” Kenney said. “It’s important lyze, understand, and present data more
and experience, bias improvement, and to set realistic goals and expectations, and accurately and efficiently and provides a
employee growth—to name a few.” simply mapping out how your intelligence new turbo-charged way of searching that
network has expanded is a great first step helps companies not only gain more insight
ECOSYSTEMS EVOLVE to learning to use the data.” from their data but it also improves the
Success in the data-driven organization user or customer experiences that directly
means more than relying on corporate data- MORE TARGETED SEARCH support many business goals and objec-
bases—innovation and growth now are tied Enterprise search is an area that will tives. Examples of these might be an
to the development of connected ecosystems increasingly become part of organizational intranet or KM application, ecommerce
of partners, customers, and other constitu- information management strategies in the site, or customer service portal.”
encies. Expect to see a greater focus on these year ahead, especially as both data and con- However, AI-driven information tech-
networks in the coming year. “Organizations tent are embraced as strategic assets nologies are still immature, Khan cau-
already have started to pull the camera back enabling business growth. Jill Shuman, tioned. “While technical leadership might
and take a broader look at how their ecosys- director of project engagement of the understand machine learning and NLP,
tems drive their businesses, but it’s about Copyright Clearance Center, foresees the there is still a knowledge gap with regard to
something more than connecting or inte- rise of “a search solution that gathers infor- fully realizing how these technologies work
grating,” said Frank Kenney, director of sales mation from multiple, disparate content and how they can play a role in improving
enablement at Cleo and former Gartner sources all at once and presents results on search and content analytics. These tech-
analyst. “It’s about driving value. In the next a single page.” nologies are also not easy to implement. It
year and beyond, savvy organizations will Ideally, organizations “should consider takes time, money, and a deep understand-
deliberately focus on enabling their ecosys- creating a consolidated place to store both ing of content analytics to bring true cog-
tems to obtain different types of value out of internal and external content, coupled with nitive search into enterprises.”
relationships with customers, partners, part- a single enterprise-wide search function,” AI technologies are changing the way
ners’ customers, and so on. That means they said Shuman. “This allows employees information is managed, said Khan.
are integrating not only their dynamic net- access to everything they need in one basic “Applying NLP and machine learning to
works of people, partners, customers, sys- search effort instead of having to check search applications, for a smarter way of
tems, applications, and things but also the each disparate source individually. This searching and understanding data will
processes and interactions that drive those approach saves time, money, and most absolutely change the way information is
relationships.” importantly, protects organizations from managed. Right now, nearly 80% of all
Managing and capitalizing on ecosys- loss of institutional knowledge, which enterprise data is unstructured content
tems “requires a modern approach to inte- could cost millions of dollars.” which is rarely used in analytics. This num-
gration, one that can enable your business As consolidated enterprise search ber will only continue to grow as each
to take advantage—through functionality, emerges, companies will see “more produc- enterprise’s data continues to increase.” ■
Delphix
PAGE 14
WHY EVERY ENTERPRISE
ORGANIZATION NEEDS
DATAOPS

Aerospike
PAGE 15
MAXIMIZE THE VALUE OF
YOUR OPERATIONAL DATA

Moving to a
MODERN DATA
ARCHITECTURE
Best Practices Series
12 DECEMBER 2018 / JANUARY 2019 | DBTA

9 STEPS
FOR MOVING TO
A MODERN DATA
ARCHITECTURE
Best Practices Series

GETTING TO A MODERN data architecture is managers, fielded by Unisphere and frameworks that are helping to
a long-term journey that involves many Research, a division of Information shape the foundation of modern data
moving parts. Many organizations have Today, Inc., in partnership with architectures, according to a Unisphere
vintage relational database management VMware and the Independent Oracle Research/Oracle survey of more than
systems that perform as required, with Users Group, found a noticeable rise 200 companies. Adoption of data
regular tweaking and upgrades. However, in cloud computing adoption among lakes—a place to store diverse datasets
to meet the needs of a fast-changing database teams. Forty-one percent without having to build a model first—
business environment, data executives, had cloud in production at scale or in continues to rise as data managers seek
DBAs, and analysts need to either build limited use, up from 33% in a similar to develop ways to rapidly capture and
upon that, or re-evaluate whether their survey conducted in 2016. In addition, store data from a multitude of sources
data architecture is structured to support one-third said their use of cloud is in various formats. Overall, 38% of
and grow with their executive leaderships’ growing, with 20% describing their organizations are employing data
ambitions for the digital economy. cloud growth as “significant”—again, a lakes as part of their data architecture,
In terms of technology, a rise over just 2 years ago. Many of these up from 20% in the 2016 survey. In
combination of cloud resources and deployments are in pursuit of greater addition, Apache Spark, an open source
automation is helping data executives automation to deliver backup and analytics engine targeted at large-scale
keep up with emerging requirements— recovery requirements, a vital piece of data processing at real-time speeds, is
artificial intelligence (AI), the Internet all data architectures. being used at one in four organizations
of Things, and real-time customer In addition, there has been notable as part of their data architectures, with
service. A recent survey of 260 data growth in adoption of new systems another 15% considering adoption. One
DECEMBER 2018 / JANUARY 2019 | DBTA 13

in four executives are also employing quants and upper-echelon executives. Automate as much as possible.
machine learning, a part of AI. Key to a self-service culture is effective A modern data architecture has
There are many promising data governance to not only provide to be highly scalable, capable of not
technologies converging to support the “guardrails” of data integration and only streaming and ingesting massive
the development of a modern data consumption, but to also set the course amounts of data but also tagging the
architecture, but organizations need to for the business. data for analytics algorithms and AI
be prepared and ready to capitalize on datasets.
such new capabilities. Here are ways to Get real about real time.
prepare enterprises for this journey: Real-time responsiveness to Encourage a collaborative
customer demands, as well as end- approach.
Define what a modern data user requirements for data, is what Until recent times, data
architecture means to the business. now separates the disrupters from the environments were the domain of data
A “modern data architecture” disrupted in many markets. Real-time executives and DBAs. With data now
can mean many things, and capabilities are reaching commodity an essential component of business
implementations will vary from status with built-in in-memory strategy, managers across the enterprise
organization to organization. For some, processing in major vendor products need to have input on how data is
building and supporting big data with as well as open source frameworks discovered, streamed, and analyzed.
Apache Hadoop clusters is an important such as Apache Spark. A modern data While IT and data managers will
element of a modern data architecture. architecture needs to be able to support still oversee the backend functions
Others may have even moved beyond real-time interactions and calls for data of systems—such as security and
Hadoop and are creating systems where and when it is needed. availability—business end users need to
that respond to real-time events. To be able to provide their input into how
many enterprises these days, a modern Make data analytics part of data is made available.
data architecture means shifting data your digital transformation.
management functions to the cloud. Successful digital transformation— Take a data platform approach.
now being sought by organizations Platform thinking, in which
Make data as widely available as across the board—depends on the participation from across the ecosystem
possible to those who need it. ability to capture and analyze data is encouraged and digital resources are
Analysts, pundits, and vendors have streaming to and from customers, made available, can open up data and
been talking about “data democracy” partners, and employees. Whether it is insights to an ecosystem of partners,
within organizations for close to 2 business intelligence or AI, any digital thereby building on a network effect. This
decades; the technology and know-how transformation strategy requires a maps to the rise of digital enterprises, in
are finally available to make this possible. modern data architecture to succeed. which interactions and transactions take
A data analytics-driven enterprise needs place across a platform, versus traditional
to support informed decisioning all the Bring data closer to the customer. sales channels. This so-called plug-
way down the corporate ranks—for Designers of a modern data and-play business model enables data
example, call center representatives need architecture need to ask one simple producers and data consumers to develop
relevant insights about the customers question: What does the customer need? and engage with services connected with
with whom they are interacting, and Examining what customers—as well as data analytics.
production workers need access to data internal end users—require in terms A modern data architecture
on machine performance. of data and insights helps pave the way is essential to move forward in
to a responsive model and approach today’s digital economy. While the
Build a self-service culture. to data delivery. Exploring customer technology is available and affordable,
Self-service capabilities are the key to and user requirements isn’t limited to organizations need to be ready to take
data democracy. By putting analyses and the first rollout of the architecture— full advantage of a more responsive,
reporting in the hands of end users who data architecture proponents need to real-time, analytical environment. ■
need it when they need it, data-driven keep asking if their environments are
thinking will become a standard part of continuing to meet customer and end-
day-to-day business, not the domain of user needs, and what needs to be changed. —Joe McKendrick
14 DECEMBER 2018 / JANUARY 2019 | DBTA Sponsored Content

Why Every Enterprise Organization


Needs DataOps
DATA IS THE NEW strategic asset in today’s with the heterogeneous enterprise and commercial content to millions of
economy. It is a source of unique insights IT landscape and works with all viewers every day. It relies heavily on
and the engine of modern applications relevant data wherever it exists. No a suite of data warehouses to provide
and machine learning. Winning in this point solutions to stitch together to accurate and up-to-date insights into
new software-driven landscape requires implement a data strategy. business performance and future
aligning people, process, and technology • Frictionless Data Delivery—Data opportunities. Their ability to rapidly
around the secure flow of data to accelerate as a self-service, when and where it refresh this information and manage
innovation and drive differentiation. is needed. An end-to-end platform volume sizes was key to their ongoing
The alignment hinges on the easy access captures the data from where it success and helped target their spend
to any data in any location, without exists, moves it to where it needs to on the screen instead of on technology.
compromising data privacy or compliance. be, prepares it for relevant use cases, They already used Agile and DevOps
As enterprises become more data- such as data pods for SDLC, and models to accelerate delivery of business
driven, the explosion of data demand can lastly, presents masked data to data change, but a lack of automation tools to
sometimes create intense friction with the consumers on a day-to-day basis with optimize data management caused delays
ever-increasing cost, complexity, and risk minimal overhead or delay. to environment deployment into several
of managing, distributing, and securing • Integrated Risk Management—The weeks and drove significant costs in storage
that data. The accelerated adoption platform proactively identifies risk and tooling. These delays caused the team
of different types of databases, like with de-identification, obfuscation, to miss capitalizing on several market
MySQL, MongoDB, PostgreSQL, Amazon and other techniques to mitigate risk trends and burdened them with additional
RDS, and AWS Aurora, in addition to as data flows across the enterprise. capital expenditure. The protracted time
traditional databases, such as Oracle, and DevOps success has come from taken to refresh the circa 18TB of data in
Microsoft SQL, increases this friction breaking down silos between teams, test environments inhibited system and
manifold. bringing speed and agility through business agility, ultimately preventing
Data friction can be solved with close collaboration and automation teams from realizing their goal of regular
DataOps. Like DevOps, DataOps is a via technology platforms. Delphix change and deployment.
people problem, with technology as a key addresses similar technology aspects for Channel 4 now uses Delphix as part
enabler of data flow and reliever of this DataOps. Data remains secure, moves of a DataOps approach to optimize and
friction. DataOps, as seen with a few early fast, and provisions instantly, which automate their application development
adopters, plays an important role within helps organizations to do development process. Changes and modifications to
the enterprise to have an immediate and shift-left testing faster. This results key systems are more efficiently managed
impact on the following: in higher quality software and the ability and the availability of reporting data is
• Software Development—Automated, to release more features. With data pods, improved. The Delphix platform has also
repeatable, and faster provisioning of developers, testers, and CI/CD pipelines enabled the acceleration of data delivery
higher quality data for AppDev, QA, can easily provision and refresh realistic for development, testing, and reporting
and test environments that results in masked production data in a matter to drive faster innovation and improve
higher velocity for new releases and of minutes. Robust branching and operational efficiency. Finally, data pods
feature updates. bookmarking capabilities can be used to have improved the connection between
• Data Privacy and Governance— create a readily available test data catalog DBAs to developers to ensure fresh, secure
Identifying and mitigating risk of production, synthetic, and curated delivery of data. What used to take days
to meet compliance and privacy datasets. now takes just minutes. Today, Channel 4
constraints associated with data flow Let’s take Channel 4 Television as an is able to release applications quickly and
between data managers, like DBAs, example to illustrate how it leverages a with more confidence using the Delphix
and data consumers, like developers. technology platform called Delphix to Dynamic Data Platform. ■
The key to DataOps success lies in a adopt a DataOps approach and drive
technology platform that provides the business success.
following three capabilities: Channel 4 Television, a UK-based,
• Enterprise-Readiness—A platform multibillion-dollar media company, DELPHIX
that seamlessly integrates and scales delivers a high volume of programming www.delphix.com
Sponsored Content DECEMBER 2018 / JANUARY 2019 | DBTA 15

Maximize the Value of Your Operational Data


Modernize and Transform Your Enterprise Via Real-time Transaction and Analysis Processing

OVERVIEW tolerance and near 100% uptime even • Adaptable infrastructure for
It might sound too good to be true: a during upgrades and maintenance. managing varying types of data
database system that processes large volumes How? By capitalizing on proven with minimal effort
of operational data in real time while architectural approaches—such as • Low total cost of ownership (TCO)
delivering exceptional runtime performance, distributed computing and parallelism—
high availability, and cost efficiency while and developing new technologies to KEY FEATURES AND
still keeping your data safe. What if early meet business demands that hadn’t TECHNOLOGIES
adopters in banking, telecommunications, even surfaced when older systems were Aerospike is a shared-nothing
and other industries are already harnessing originally built. Indeed, Aerospike’s database system that operates on a
such a database for achieving results that patented Hybrid Memory Architecture™ cluster of commodity server nodes:
are transforming their businesses in myriad (HMA) drastically reduces traditional I/O • It’s a schema-free, key-value data store.
ways? What if published benchmarks and network communication compared • Aerospike exploits volatile and non-
demonstrate sub-millisecond response times with other approaches; it also uses CPU volatile memory in a distinctive way,
for high throughput read/write workloads resources considerably more efficiently. providing rapid access to index and
over high data volumes with substantial The cumulative impact of these features user data.
cost savings compared with traditional • An intelligent client layer minimizes
(and others) enables Aerospike to deliver
alternatives? costly network “hops” needed to
remarkable speed at scale.
This paper introduces key technologies access data.
that Aerospike clients are using to APPLICATIONS AND USE CASES • Immediate record-level consistency and
modernize their data management Applications that benefit from high availability are guiding principles.
infrastructures and realize such impressive Aerospike typically share some or all of • Access management controls and
(and seemingly impossible) results as: these characteristics: transport encryption protect sensitive
• Service-level agreements (SLAs) that data.
• Rapid read/write speeds without
require sub-millisecond database • Asynchronous replication across data
extensive tuning or a separate data
response times centers provides disaster recovery.
cache
• High throughput for mixed workloads • Ready-made connectors, a publish/
• Substantially smaller footprints than
(e.g., 3–5 million operations per subscribe messaging system, and
popular alternatives, often leading to
second) partner offerings help firms integrate
3-year total cost of ownership (TCO)
• Support for managing billions of Aerospike into their existing IT
savings of $3-5 million per application
business records in databases of infrastructures. ■
• 24x7 availability, including cross-
10s–100s TB
datacenter replication
• High availability and fault tolerance for
• Operational ease during scale-out and FULL REPORT
mission-critical applications
maintenance To get a copy of the full report,
• High scalability for handling
• Interoperation with popular software please go to: www.aerospike.com
unpredictable increases in data
offerings, including Apache Hadoop, /maximize-operational-data .
volumes and transactions
Spark, and Kafka
Sounds unbelievable, right?

THE TECHNOLOGY IN BRIEF


Aerospike provides a distributed, highly
scalable database management system for
demanding read/write workloads involving
operational data. It was designed to deliver
extremely fast—and predictable—response
times for accessing data sets that span
billions of records in databases of 10s – 100s
TB. Other design features address fault
16 DBTA | DECEMBER 2018/JANUARY 2019
DECEMBER 2018/JANUARY 2019 | DBTA 17

YOU CAN CALL IT the new oil, or even the new This year, our list includes newer
electricity, but however it is described, it’s approaches leveraging artificial intelligence
clear that data is now recognized as an essen- (AI), machine learning, and automation as
tial fuel flowing through organizations and well as products in more established catego-
enabling never-before-seen opportunities. ries such as relational and NoSQL database
However, data cannot simply be collected; it management, MultiValue, performance man-
must be handled with care in order to fulfill agement, analytics, and data governance.
the promise of faster, smarter decision making. On the following pages, we present DBTA’s
More than ever, it is critical to have the list of Trend-Setting Products for 2019. We
right tools for the job. Leading IT vendors are encourage you to continue your exploration
coming forward to help customers address by visiting the companies’ websites for addi-
the data-driven possibilities by improving tional information. In addition, in this issue,
self-service access, real-time insights, gov- we include Product Spotlight articles penned
ernance and security, collaboration, high by company executives with explanations of
availability, and more. what makes these products unique.
To help showcase these innovative products And, to stay on top of the latest news, IT
and services each year, Database Trends and trends, and research, go to www.dbta.com,
Applications magazine looks for offerings that and review DBTA’s extensive collection
promise to help organizations derive greater of white papers and research reports at
benefit from their data, make decisions faster, www.dbta.com/DBTA-Downloads/White
and work smarter and more securely. Papers. n
18 DBTA | DECEMBER 2018/JANUARY 2019

ACTIAN ATTIVIO
www.actian.com www.attivio.com
Actian Zen Core Database for Android—a new Attivio Cognitive Search and Insight Platform—
member of the Zen Enterprise Edge Database family that understanding that the key to transforming the productivity
focuses squarely on the needs of edge and IoT application of information workers is improving the relevancy of
developers, providing persistent local and distributed data answers to search queries, Attivio combines natural
across intelligent applications embedded in smart devices language processing, machine learning, and knowledge
graphing for a contextualized search and discovery
AEROSPIKE experience with a secure enterprise platform
www.aerospike.com
Aerospike—an enterprise-class, NoSQL database solution ATTUNITY
for real-time operational applications, Aerospike is offered in www.attunity.com
two versions: the open source Aerospike Community Edition Attunity Replicate—helps to accelerate data integration
and the Aerospike Enterprise Edition, which includes all the for analytics with an intuitive GUI that eliminates the need
features of the Community Edition, plus additional premium for manual coding, and empowers organizations to improve
features
data replication, ingest, and streaming across a range of
heterogeneous databases, data warehouses, and big data
ALTR
platforms
www.altr.com
ALTR Data Security Platform—having emerged after
BACKOFFICE ASSOCIATES
nearly 4 years of stealth, the ALTR platform is designed to
www.boaweb.com
eliminate threats to data through core approaches in private
Information Governance Cloud—a cloud-based SaaS
blockchain, in-line data techniques, real-time alerting, and
solution that defines operational policies and business rules
reporting for business enablement, and offers support for all
for data, aligns them with elements of the business strategy,
major database platforms
and orchestrates their enforcement through any system,
process, and stewardship platform—including the Data
AMAZON WEB SERVICES Stewardship Platform by BackOffice Associates
https://aws.amazon.com
Amazon Aurora Serverless—bringing the power of
BLUEDATA
the MySQL-compatible database built for the cloud to www.bluedata.com
applications with intermittent or cyclical usage patterns, BlueData EPIC (Elastic Private Instant Clusters)
Aurora Serverless is a recently added deployment option software platform—leverages the power of containers to
that automatically starts, scales, and shuts down database help make it easier, faster, and more cost-effective to deploy
capacity with per-second billing for applications big data analytics and machine learning—whether on
premise, in the cloud, or in a hybrid architecture

ARCADIA DATA BLUE MEDORA


www.arcadiadata.com https://bluemedora.com
Arcadia Enterprise—with a data-native and AI-driven BindPlane—offering IT monitoring integration as a service
architecture designed for big data, Arcadia Enterprise and a standardized approach to enterprise monitoring
improves access to analytics and BI for business analysts, integration, BindPlane delivers Dimensional Data, which
enabling them to explore data with recommendations that enhances health and performance metrics with critical
visualize and identify insights, and to derive value with metadata about the inter- and intrarelationships between the
minimal IT overhead metrics in the IT stack
DBTA | DECEMBER 2018/JANUARY 2019 19

BRADMARK TECHNOLOGIES CLOUDERA


www.bradmark.com www.cloudera.com
Bradmark’s Surveillance DB—gives IT professionals Cloudera Data Science Workbench—with Python,
the necessary visibility of overall system health, and offers R, and Scala directly in the web browser, the platform
cross-functional capabilities, including real-time monitor- for collaborative data science delivers a self-service
ing, proactive event management, flashback/time-slicing, experience with connectivity not only to CDH but also to the
alarming and alerting, and SQL performance analysis systems data science teams rely on for analysis

with support across SAP Sybase, MS SQL Server, Oracle,


COGNITIVESCALE
Informix, and IBM DB2 databases
www.cognitivescale.com
Cortex5 software—simplifies the design, development,
CAMBRIDGE SEMANTICS
delivery, and management of enterprise-grade artificial
www.cambridgesemantics.com
intelligence systems that weave knowledge and learning
AnzoGraph—a native, massively parallel processing,
across the enterprise, and helps businesses apply artificial
distributed graph database that provides advanced analytics
intelligence and blockchain technology to solve complex
at big data scale and is available on a standalone basis for business problems in the financial services, healthcare, and
use behind the firewall as well as in the cloud digital commerce markets

CAZENA COLLIBRA
www.cazena.com www.collibra.com
Cazena’s Data Lake as a Service—with the goal of Collibra—designed to address the gamut of data
accelerating enterprises’ ability to leverage analytics, stewardship, governance, and management needs of
machine learning, and artificial intelligence without hiring complex, data-intensive industries, the Collibra configurable
or re-training their workforces, Cazena’s Data Lake as a cloud-based or on-premise solution puts people and
Service, its flagship solution, offers a SaaS-like experience for processes first—automating data governance and
analytics and business teams, with enterprise features for IT management to quickly and securely deliver trusted data

CISCO SYSTEMS COUCHBASE


www.cisco.com www.couchbase.com
Couchbase Server—built on NoSQL technology, the
Cisco Unified Computing System (UCS)—an integrated,
database software platform delivers performance at
scalable, multi-chassis platform for data center
scale, in any cloud, and features such as a memory-first
environments that combines compute, network, storage
architecture, geo-distributed deployments, and workload
access, and virtualization into a system designed to reduce
isolation, plus a SQL-compatible query language (N1QL) to
total cost of ownership and increase business agility and
make migration from RDBMS to Couchbase Server easier
IT staff productivity
DATA DYNAMICS
CLEARDB www.datadynamicsinc.com
https://w2.cleardb.net StorageX Dynamic File Management—helping to free
ClearDB’s MySQL Database-as-a-Service (DBaaS) data from technology lock-in, complexity, and risk, the
solution—delivers a geographically redundant database platform provides enterprises with an intelligent, policy-
infrastructure to help ensure that continuous access to a based, cloud storage management platform to enable data
customer’s MySQL database assets in the cloud is not only portability, usability, and insight for business agility and
resilient within local clusters but across geographies operational efficiency
20 DBTA | DECEMBER 2018/JANUARY 2019

DATABRICKS DELL EMC


https://databricks.com www.dellemc.com
Databricks’ Unified Analytics Platform—a cloud-based Dell EMC PowerEdge MX— high-performance, modular
platform powered by Apache Spark that enables data infrastructure that is designed for the software-defined
science teams to collaborate with data engineering and data center and able to support a variety of traditional
lines of business to build data products; recently added new and emerging data center workloads, including dense
capabilities to lower the barrier for enterprises to innovate virtualization, software-defined storage, software-defined
with AI networking, artificial intelligence, and big data projects

DATASTAX DELPHIX
www.datastax.com www.delphix.com
DataStax Enterprise—powering the “Right-Now Delphix Dyamic Data Platform—helps organizations
Enterprise” with the always-on, distributed cloud database provision lightweight, compressed copies of production data
built on Apache Cassandra and designed for hybrid in minutes, while keeping them in sync; secure sensitive
cloud, DataStax Enterprise helps companies to exceed data in adherence with security policies; and move and
manage data from any environment—on premise, cloud, or
expectations through consumer and enterprise applications
hybrid
that provide responsive and meaningful engagement

DENODO
DATAVAIL
www.denodo.com
www.datavail.com
Denodo Platform—offers the benefits of data virtualization,
Datavail Managed Services—helping clients to collect,
such as the ability to provide real-time access to integrated
manage, and derive value from organizational data, as
data across an organization’s diverse data sources without
well as streamline processes through software integration
replicating any data, and offers broad access to structured
and custom development, Datavail Managed Services are
and unstructured data residing in enterprise, big data, and
provided by a team of more than 1,000 data professionals
cloud sources
including DBAs, developers, project managers, consultants,
and business experts DIAMANTI
https://diamanti.com
DATAWATCH Diamanti’s D10 bare-metal container platform—
www.datawatch.com integrates open source Docker and Kubernetes with
Datawatch Angoss—a suite of big data analytics software purpose-built hardware and container-granular control
solutions and consulting services aimed at providing to provide a full container stack to give infrastructure
customers with a competitive advantage by helping them to architects, IT operations, and application owners the speed,
turn information into actionable business decisions across simplicity, efficiency, and control they need to run stateful
logistics, risk, marketing, and sales containerized applications at scale

DBI SOFTWARE DREMIO


www.dbisoftware.com www.dremio.com
pureFeat V7 Performance Management Suite for IBM Dremio Data-as-a-Service Platform—connects any
Db2 LUW—solutions for database performance analytics, analytical process to any data source and scales from one
tuning, and trending; response time analytics and SLA to 1,000 plus servers, running in the cloud, in Kubernetes, or
attainment, alerting and real time monitoring, along with new in a Hadoop cluster to help analysts and data scientists work
features to help DBAs and management teams to cope with together to discover, curate, and collaborate for diverse
the volatility of agile environments analytical use cases
DBTA | DECEMBER 2018/JANUARY 2019 21

EMPOLIS INFORMATION MANAGEMENT GRIDGAIN SYSTEMS


www.empolis.com www.gridgain.com
Empolis Service Express—a central knowledge platform GridGain In-Memory Computing Platform—sits between
offered as SaaS that speaks the language of its users and the application and data layers to provide in-memory speed
can be used by customers in self-service, first-level service and massive scalability to applications built on disk-based
staff, and experts, giving them direct access to all relevant databases, and works seamlessly with existing application
and data layers including all popular RDBMS, NoSQL, and
information by entering a search query
Hadoop databases
ERWIN
HEWLETT PACKARD ENTEPRISE (HPE)
https://erwin.com
www.hpe.com
erwin EDGE platform—with metadata connectors and
HPE Digital Prescriptive Maintenance Services—a
automated code generation, sensitive data discovery, data
series of AI-enabled industry offerings from HPE Pointnext,
mapping, and cataloging tools, erwin EDGE creates an
that enable problem prevention and increase productivity
“enterprise data governance experience” that facilitates
of industrial equipment—predicting, suggesting, and
collaboration between both IT and the business to unlock automating the right action to fix a problem before it causes
the value of data both at rest and in motion harm

FRANZ HVR
https://allegrograph.com www.hvr-software.com
AllegroGraph—employing a combination of semantic and HVR Platform—a scalable enterprise data integration and
graph technologies that process data with contextual and validation platform to simplify high-volume real-time data
conceptual intelligence, the database technology enables movement that includes built-in efficiency features such as
businesses to extract sophisticated decision insights and log-based change data capture, data compression, as well
predictive analytics from their highly complex, distributed as security features such as encryption and KMS support
data that can’t be answered with conventional databases for AWS

GOOGLE IBM
www.ibm.com
https://cloud.google.com
IBM Cloud Private—a key element of IBM’s hybrid cloud
Cloud SQL—a fully managed database service hosted on
strategy that—using container technology—can be installed
the Google Cloud Platform that offers high performance,
on a wide range of enterprise systems to create a private
scalability, and convenience, helping to make it easy for
cloud with architecture and capabilities consistent with the
users to set up, maintain, manage, and administer relational
public IBM Cloud
PostgreSQL and MySQL databases in the cloud
IDERA
GRIDDABLE www.idera.com
https://griddable.io SQL Diagnostic Manager—provides monitoring and
Griddable.io—provides a SaaS platform for hybrid cloud diagnostics that allow DBAs to detect database performance
data integration that synchronizes data across any topology problems and their root causes quickly, then tune database
or database platform on any cloud, and includes a policy health and performance—and recently introduced a SQL
engine that controls a flexible definition of what data is Query Tuner for optimizing SQL queries and indexes for SQL
filtered, masked, or transformed in transit Server, and new MySQL support
22 DBTA | DECEMBER 2018/JANUARY 2019

IMANISDATA INTERSYSTEMS
www.imanisdata.com www.intersystems.com
Imanis Data Management Platform—big data enterprise IRIS—a unified solution that provides a comprehensive and
data management software for backup, recovery, disaster consistent set of capabilities spanning data management,
recovery, test/dev, and archiving—that is architected for interoperability, transaction processing, and analytics—
petabyte scale, is data-aware, and powered by machine eliminating the need to integrate multiple development
learning to reduce cost, minimize risk, and enable insight to technologies so applications require less code, fewer
all of data system resources, and less maintenance

INFLUXDATA KORE TECHNOLOGIES


www.influxdata.com www.koretech.com
InfluxData Platform—a complete time series platform Kourier Integrator—Kore’s flagship enterprise integration
built specifically for metrics, events, and other time-based and data management suite, providing extract, transform,
data—whether the data comes from humans, sensors, or and load and enterprise application integration capabilities,
machines—that helps developers build next-generation helps organizations extend the value and functionality of
monitoring, analytics, and IoT applications faster and more
enterprise applications by integrating and connecting with
easily
disparate databases and best-in-class applications

INFORMATICA
LICENSEFORTRESS
www.informatica.com
www.licensefortress.com
Informatica Intelligent Data Lake Management—
LicenseFortress—database software license management
products certified to run on popular, on-premise Hadoop
and audit protection services comprised of four levels
distributions or cloud deployment enable data scientists and
(Discovery, Standard, Premier, and Legal) to help
business analysts to quickly find the data they’re looking for
organizations enhance their understanding of software
on a self-service basis using semantic and faceted search,
licensing and pricing practices, reduce risk, and “eliminate
while automatically understanding data lineage and data
the surprise factor”
relationships

INFORMATION BUILDERS LOOKER DATA SCIENCES


https://looker.com
www.informationbuilders.com
WebFOCUS—combines traditional governed BI with Looker Platform—designed to flex and scale as needs
business-led agile analytics, enabling enterprises to for information change and expand, the modern platform
deliver a broad range of reports, dashboards, documents, for data continues to build upon its BI application and adds
and applications while empowering business users to stronger enterprise-class features, enhanced tools, and
create their own data visualizations, charts, graphs, and powerful department-specific plug-and-play applications to
infographics help users access, analyze, and take action

INNOVATIVE ROUTINES INTERNATIONAL (IRI) MAPR TECHNOLOGIES


www.iri.com www.mapr.com
IRI DarkShield—consolidates and multi-threads the MapR Converged Data Platform—a platform for artificial
search, remediation, extraction, and reporting of sensitive intelligence and analytics that enables enterprises to inject
data in unstructured files sitting in multiple formats and analytics into their business processes to increase revenue,
folders across a network, and provides a single, user- reduce costs, and mitigate risks, while addressing the data
friendly interface to do everything at once or in scheduled complexities of high-scale and mission-critical distributed
steps processing
DBTA | DECEMBER 2018/JANUARY 2019 23

MARIADB MICROSTRATEGY
https://mariadb.com www.microstrategy.com
MariaDB TX—an open source database fueled by MicroStrategy—an enterprise analytics platform that offers
out-of-the-box visualizations, intelligent recommendations
community innovation that offers Oracle Database
for content, prompts for dossiers, a native MicroStrategy
compatibility, sharding, temporal tables, and point-in-time
Library app for smartphones, and a new enhanced mapping
rollback, and uses purpose-built storage engines to maximize for conducting geospatial analytics with Mapbox, a leading
the storage efficiency and query performance of applications location data platform for mobile and web applications
with traditional and/or non-traditional workloads
MONGODB
MARKLOGIC www.mongodb.com
www.marklogic.com MongoDB Atlas—engineered by the same team that builds
MarkLogic—an operational and transactional enterprise the MongoDB NoSQL database platform, the MongoDB
Atlas database-as-a-service incorporates best practices
NoSQL database platform designed for speed and scale,
developed from real-world use cases at thousands of
recently introduced the MarkLogic Query Service, a new
customer deployments from startups to the Fortune 100
way to give customers elasticity in the cloud
NUMERIFY
MELISSA www.numerify.com
www.melissa.com/data-integration/talend Numerify—contains a suite of specialized IT applications
Data Quality Components for Talend—offers built-in powered by an underlying analytical platform with each
solutions for Talend Opera Studio for Data Integration, analytical application focusing on a specific business area
of IT—including service, asset, and project management
including Personator, Melissa’s all-in-one ID verification,
data completion, and data-enrichment tool, along with
NUXEO
Global Address Verification to clean, standardize, and www.nuxeo.com
transliterate addresses in more than 240 countries Nuxeo Platform—a modern content services platform that
manages both traditional content and rich media assets to
MEMSQL unlock the full value of all of a company’s digital content—
www.memsql.com anywhere within the company
MemSQL—offers a single database that can easily support
real-time decision making and insight-driven customer
OPENTEXT
www.opentext.com
experiences combining massively scalable ingest, the
OpenText Legal Center—offers a process-centric
ability to scan billions of rows per second across thousands approach to the legal field for client onboarding, external
of users, and a familiar relational structure for user- and sharing and collaboration, and document management,
developer-friendly access and is designed to leverage and extend existing DM
repositories such as OpenText eDOCS, whether
MICROSOFT on-premise or in the cloud
https://azure.microsoft.com
Azure—a set of cloud services that developers and IT ORACLE
www.oracle.com
professionals use to build, deploy, and manage applications
Oracle Autonomous Database Cloud—powered by
through a global network of data centers, with additional
Oracle Database 18c, the next generation of the industry-
support provided by integrated tools, DevOps, and a leading database, Oracle Autonomous Database Cloud
marketplace for building anything from simple mobile apps to offers total automation based on machine learning and
internet-scale solutions eliminates human labor, human error, and manual tuning
24 DBTA | DECEMBER 2018/JANUARY 2019

PAXATA PYTHIAN
www.paxata.com https://pythian.com
Adaptive Information—a big data prep platform aimed Pythian Kick Analytics-as-a-Service—a cloud-native
analytics platform that solves the data silo problem, taking
at democratizing the data preparation process, offering
the hard work out of capturing, curating, and preparing data
new features such as a Adaptive Workload Management for consumption by a range of users and systems
capability, which delivers an elastic resource allocation
service on a number of orchestration frameworks, including QLIKTECH
Microsoft Azure HDInsight, Kubernetes, and Apache Hadoop www.qlik.com
YARN QlikSense—lets users discover, search, and explore
across all their data, pivoting their analysis when new ideas
surface, and provides flexibility with a cloud-ready data
PERCONA
analytics platform that supports the full spectrum of BI use
www.percona.com
cases
Percona Monitoring and Management—combines
best-of-breed tools in a single, virtual appliance, along with QUBOLE
Percona-developed query analytics, administration, API, www.qubole.com
agent, and exporter components, to monitor and provide Qubole Data Service—provides a single platform for ETL,
reporting, ad hoc analysis, stream processing, and machine
performance data for MySQL and MongoDB variants
learning to help data teams be more productive and reduce
the costs of their data initiatives while taking full advantage
PREMIUMSOFT CYBERTECH of the elasticity and scale of the cloud
www.navicat.com
Navicat Premium—a multi-connection database QUEST SOFTWARE
development tool that allows users to connect up to six www.quest.com
databases within a single application, including MySQL, SharePlex—a multi-platform replication and real-time data
integration solution that supports organizations’ increasingly
MariaDB, SQL Server, SQLite, Oracle, and PostgreSQL, to
heterogeneous database environment used by Fortune 500
create access to all databases at once
for migrations and upgrades, load balancing, supporting BI
and analytics, and much more
PROGRESS
www.progress.com RED HAT (acquired by IBM)
Progress OpenEdge—an application development platform www.redhat.com
that helps simplify the delivery of mission-critical business Ansible Automation—simple, agentless IT automation
technology that can improve business processes, migrate
applications with a focus on increasing the ability to always
applications for better optimization, and provide a single
be on, fortify applications through enhanced security, and
language for DevOps practices across the organization
keep accurate data flowing through the organization using a simple, human-readable language that anyone in IT
can understand
PURE STORAGE
www.purestorage.com REDIS LABS
Pure Storage FlashBlade—a petabyte-scale storage https://redislabs.com
system for unstructured and operational data that is Redis Enterprise—combines the advantages of world-
class database technology and the innovation of a vibrant
designed with four key innovations to overcome the
open source community to provide high availability in the
limitations of conventional clustered NAS systems: high- form of Active-Active and Active-Passive geographically
performance storage devices, unified network, distributed distributed architectures, with high performance and built-in
storage operating system, and administrative simplicity search capabilities
DBTA | DECEMBER 2018/JANUARY 2019 25

REDPOINT GLOBAL SAS INSTITUTE


www.redpointglobal.com www.sas.com
RedPoint Customer Engagement Hub—delivers a unified Viya—complements SAS, augmenting the SAS Platform
view of each customer, in-line analytics to determine next- to enable everyone—data scientists, business analysts,
best actions, and intelligent orchestration to personalize developers, and executives alike—to collaborate and
engagement across the enterprise, and recently added new achieve innovative results faster to derive new insights like
master data management capabilities never before

REVELATION SOFTWARE SOFTWARE DIVERSIFIED SERVICES (SDS)


www.sdsusa.com
www.revelation.com
IronSphere—an ISCM solution for the IBM mainframe
OpenInsight 10—Revelation’s flagship product is a NoSQL
that provides complete visibility of system risks, IronSphere
database development suite that provides Windows, Web
continuously monitors the mainframe, automatically
2.0, and .NET tools to develop and deploy mission-critical
performing security scans and looking for system
applications along with a redesigned integrated development
vulnerabilities, insufficient settings, and modified operands
environment, 64-bit architecture integration, new security
updates, and more SEARCH & CONTENT ANALYTICS (part of Accenture Applied Intelligence)
www.searchtechnologies.com
ROBIN SYSTEMS Data Lake Solutions—leverages the company’s expertise
https://robin.io in Hadoop, Cloudera CDH, Cassandra, Elastic Stack, Solr,
ROBIN Hyper Converged Kubernetes Platform—hyper- Hortonworks, Microsoft Azure, Amazon Web Services, and
converged Kubernetes is a software-defined application other big data platforms to offer a wide range of flexibility and
orchestration framework that combines containerized security options for a data lake
storage, networking, compute (Kubernetes), and the
application management layer into a single system, allowing SENTRYONE
users to do self-service deployment of big data, databases www.sentryone.com
and AI/ML, share entire experiments among team members, SQL Sentry for SQL Server—allows organizations to collect
and more and present the most actionable performance metrics and
alerts, see important events, and cross-reference them using
ROCKET SOFTWARE an Outlook-style calendar to resolve issues by doing detailed
www.rocketsoftware.com analysis in the same tool used for monitoring and alerting
UniVerse—a component of the MultiValue application
platform, is a fast, flexible data server for developing
SINEQUA
www.sinequa.com
enterprise app, and its variable length, table-within-
Cognitive Search & Analytics Platform—offers broad
table architecture means speedy data access and low
connectivity to all enterprise data sources and combines
maintenance with user interfaces for Windows, Linux,
disruptive analytical tools, including natural language
and UNIX
processing and machine learning, to increase the enterprise
IQ and continuously improve its ROI
SAP
www.sap.com SISENSE
HANA—combines an ACID-compliant database with www.sisense.com
application services, high-speed analytics, and data Sisense—simplifies business analytics for complex data by
acquisition tools in an in-memory platform with built-in providing a complete solution for preparing, analyzing, and
application services that support the development and visualizing big or disparate datasets with agility that allows
deployment of new business apps to deliver insight from business users with no technical background to get the
big data and IoT accurate intelligence at the very moment it is needed
26 DBTA | DECEMBER 2018/JANUARY 2019

SNAPLOGIC SQREAM
www.snaplogic.com https://sqream.com
SnapLogic Enterprise Integration Cloud—accelerates SQream DB—a fully-featured GPU-accelerated data
digital transformation to connect the application and data warehouse, capable of handling the most complex queries
landscape, and empowers business users and IT—with featuring comprehensive ANSI SQL support to load, store,
intuitive, scalable, and connected integration along with and analyze data by using advanced columnar techniques,
new features such as integration with GitHub and support along with high-throughput processors
for Mesosphere to automate critical elements of continuous
integration and delivery STRIIM
www.striim.com
SNOWFLAKE Striim—a patented, enterprise-grade platform that offers
www.snowflake.com continuous real-time data ingestion, high-speed in-flight
Snowflake Data Warehouse—a new SQL data warehouse stream processing, and sub-second delivery of data to cloud
from the ground up for the cloud designed with a patented
and on-premise endpoints, continuously delivering data
new architecture to handle all aspects of data and analytics
that can be immediately available to high-value operational
to deliver the performance, simplicity, concurrency, and
workloads
affordability not possible with other data warehouses
SYNCSORT
SOLARWINDS
www.syncsort.com
www.solarwinds.com
DMX-h—designed to help users achieve their modern data
Database Performance Analyzer—monitors the entire
strategy objectives, DMX-h features a single interface for
environment on premises, in the cloud, or as a service along
accessing and integrating enterprise data sources—batch
with helping to identify performance issues and eliminate
and streaming—across Hadoop, Spark, Linux, UNIX, or
database bottlenecks using multi-dimensional views to
Windows, whether on premises or in the cloud
answer the who, what, when, where, and why

TABLEAU SOFTWARE
SOUTHBANK SOFTWARE
www.tableau.com
www.dbkoda.com
Tableau—available as desktop, server, and hosted
dbKoda—an open source integrated development
environment for MongoDB includes a performance lab software, the signature platform allows users to connect,

that offers a graphical real-time view of MongoDB server explore, and visualize their data to achieve insights through
performance, allowing users to drill into poorly performing automated table and join recommendations powered by
commands, and an index advisor to help users optimize ML algorithms that simplify the search for the right data for
those commands analysis

SPLUNK TALEND
www.splunk.com www.talend.com
Splunk Enterprise—a platform that enables operational Talend—simplifies and automates big data integration
intelligence from machine-generated data and provides a with graphical tools and wizards that generate native code,
range of search, visualization, and prepackaged content enabling teams to start working with Apache Hadoop,
cases to support organizations in discovering and sharing Apache Spark, Spark Streaming, and NoSQL databases, for
insights easier and faster cloud or on-premises today
DBTA | DECEMBER 2018/JANUARY 2019 27

TERADATA VOLTDB
www.teradata.com www.voltdb.com
Teradata Vantage—enables enterprises to uncover VoltDB—an in-memory, scale-out operational database,
actionable answers to the toughest business questions by built to help organizations create high-velocity applications,
tightly integrating the best analytic functions and engines that offers increased support for the Hadoop ecosystem,
to provide a scalable, agile platform that helps them to drive expanded SQL support, and a new management center
business value
without compromising on ACID requirements

TIGERGRAPH
WHERESCAPE
www.tigergraph.com
www.wherescape.com
TigerGraph—a fast graph analytics platform for the
Wherescape Automation—on-premise, in the cloud, or a
enterprise offers new features that include seamless
combination of both, WhereScape data automation solutions
integration with popular databases and storage systems,
provide out-of-the-box best practices, optimized native code,
support for Docker and Kubernetes containers, availability
on the Amazon Web Services Marketplace and Microsoft and features for popular target data platforms, including

Azure, and a new graph algorithm library Amazon Redshift, Microsoft SQL Server, Microsoft Azure,
Oracle, Snowflake, and Teradata
TRIFACTA
www.trifacta.com YELLOWFIN
Trifacta Wrangler—accelerating data preparation for www.yellowfinbi.com
analytics and machine learning with no coding required, Yellowfin—a BI and analytics platform aimed at solving
Wrangler makes data cleaning, blending, and structuring real enterprise analytics challenges and helping business
more intuitive so that even the most challenging Excel, CSV, people understand not only what happened, but why through
or JSON files can be wrangled in minutes a series of solutions offering data discovery, data prep, and
more
VERTICA
www.vertica.com ZALONI
Vertica Analytics Platform—designed and built on a www.zaloni.com
tested, reliable distributed architecture and columnar
Zaloni Data Lake Management Platform—a
compression to deliver blazingly fast speed for use in data
comprehensive, integrated solution that operationalizes data
warehouses and other big data workloads where speed,
along the entire pipeline, providing automation of repeatable
scalability, simplicity, and openness are crucial to the
management tasks and processes and the ability to centrally
success of analytics
manage all enterprise data sources regardless of location
VMWARE
www.vmware.com
ZOOMDATA
VMware Cloud Foundation—provides integrated www.zoomdata.com
cloud infrastructure (compute, storage, networking, and Hadoop Big Data Visualization—a Hadoop data
security) and cloud management services to run enterprise visualization tool that can connect directly to HDFS as well
applications in both private and public environments as to SQL-on-Hadoop technologies, such as Impala, Hive,
by leveraging a common infrastructure and consistent SparkSQL, and Presto, and connect to big data sources such
operational model as search, streaming, and NoSQL via smart connectors
28 SPONSORED CONTENT DBTA | DECEMBER 2018/JANUARY 2019

PRODUCT SPOTLIGHT

Aerospike
AEROSPIKE IS TRUSTED by leading data, supporting trillions of transactions per month, with
enterprises around the world to sub-millisecond latency.
help them confidently deploy Aerospike is used in Financial Services, Banking,
mission-critical, strategic oper- Telecommunications, Technology, Retail, E-commerce,
ational applications that make Adtech, Martech and Online Gaming. Powered by a pat-
digital transformation possible. ented Hybrid Memory Architecture™ and autonomic
Aerospike’s vision is to make it easy cluster management, the Aerospike database is ideal for
and affordable for companies of all Fraud Prevention, Digital Wallet, Online Brokerage, Telco
Brian Bulkowski,
Founder and CTO sizes to build next-generation data Charging & Billing, Messaging & Chat, Recommendation
systems like those built internally Engines, Real-Time Bidding, and other applications that
by the largest internet-scale com- require the highest possible uptime, performance and
panies like Google and Facebook. scale.
Our enterprise-grade database is deployable any- Aerospike customers include Adobe, Airtel, FlipKart,
where, delivers unmatched uptime, predictable per- Kayak, Nielsen, Nokia, and Snap.
formance, and exceptionally low TCO. Aerospike has
customer deployments that have run for years with no Aerospike
service disruption, handling hundreds of terabytes of www.aerospike.com

BackOffice Associates
ON BEHALF OF the BackOffice Associ- crowdsources contributions and knowledge from business
ates team worldwide, it is an honor and data experts with guidance from artificial intelligence
to accept DBTA’s recognition of our and machine learning to yield trusted data.
Information Governance Cloud Empowering business and IT stakeholders alike, Infor-
solution for its Trend-Setting Prod- mation Governance Cloud offers the ability to compre-
ucts for 2019. As many of today’s hensively set and enforce data policies across an enterprise
leading global enterprises undergo through smart automation. By using Information Gover-
Rex Ahlstrom, digital transformations, they are also nance Cloud, organizations can define and streamline oper-
Chief Strategy and embarking on a data transformation ational policies and business rules for data, align them with
Technology Officer as they shift onto cloud-based busi- the organization’s overall business strategies, and oversee
ness suites and must manage more policy enforcement across all systems including BackOffice
structured and unstructured data than ever before. As the pace Associates’ Data Stewardship Platform™.
and complexity of doing business is accelerating exponentially, As the industry continues to expand, BackOffice Associ-
having reliable, business-ready data available across the full ates remains committed to delivering innovative solutions like
enterprise system landscape is critical to remain competitive. Information Governance Cloud, which harness the power of
With these evolving market dynamics in mind, BackOf- information governance, allowing business users to use their
fice Associates developed Information Governance Cloud data to execute on mission-critical business imperatives.
to help organizations to get more value from their vast
amounts of data by validating its accuracy and relevancy, as
BackOffice Associates
well as using it to drive key business outcomes. The solution www.boaweb.com
DECEMBER 2018/JANUARY 2019 | DBTA 29

PRODUCT SPOTLIGHT

Bradmark Technologies, Inc.


SURVEILLANCE DB™ PERFORMANCE MONITORING & EVENT MANAGEMENT SOLUTION
ENTERPRISES ARE MIXING cloud and resources in data centers, whether it’s in-house or remote
on-premises infrastructure in ways locations.
that make sense for them. Perfor- Access to local, distributed recent history across many
mance monitoring solutions need to sites. Utilizing three-tier, distributed metric data, Sur-
handle this mix of infrastructure, as veillance DB provides quick local access to recent history,
well as the distributed nature (tech- without using WAN bandwidth or having contention on
nical and logical) of many businesses centrally-located resources. So, for industries with multiple
Edward Stangler across geographic zones. store locations or business organizational units—many key
R&D, Product ESSENTIAL VISIBILITY ACROSS YOUR metrics are available from the recent past using data that is
Manager, GROWING IT INFRASTRUCTURE MIX stored locally at each site, rather than having to query a cen-
Surveillance
Bradmark’s Surveillance DB™ tral database repository over the network.
for enterprise environments provides the most comprehen- And to ensure that global infrastructures are managed effi-
sive, real-time monitoring and proactive event management ciently and effectively, our future roadmap highlights alerts
solution available. With support for SAP-Sybase, MS-SQL augmented with machine learning and reactive feedback—as
Server, Oracle, Informix and DB2 databases running on good alerts turn into important details, and important details
UNIX, Linux and Windows servers, Surveillance DB supports turn into faster solutions for problems.
your growing IT infrastructure by providing:
Seamless integration between cloud-accessible moni-
toring and on-premises (or cloud) databases. Quickly pro- Bradmark Technologies, Inc.
vide secure global web access to performance monitoring www.bradmark.com/surveillance

Cambridge Semantics ANZOGRAPH


TODAY’S MOST CHALLENGING analytics to graph OLTP databases like Neo4j, AWS Neptune and
tasks center around the connections other OLTP systems.
in the data. If you’re confronted AnzoGraph offers graph analytic functionality to enable
with a difficult process when trying new levels of data-driven insight to help drive new business
to create corporate knowledge opportunities, minimize costs and increase competitiveness.
graphs, understand buyer intent AnzoGraph is capable of loading, updating, querying and
and influence, detect fraud or build analyzing huge amounts of data at speeds up to 100x faster
Steve Sarsfield, a recommendation engine, a data than other graph databases, and our benchmarks prove it.
VP of Product warehouse style analytics solution AnzoGraph supports both labeled property graphs
that focuses on this type of connected (using W3C RDF* proposed standard) and semantic graphs
analytics is imperative to your success. You need a graph with advanced analytics functionality, including graph
database like AnzoGraph™ from Cambridge Semantics. views, named queries, analytic/data mining/reporting
AnzoGraph is a native, Massively Parallel Processing functions, inferencing and conditional expressions. You
(MPP) distributed Graph OLAP (GOLAP) database, can perform special graph functions like pagerank, shortest
providing fast advanced analytics at big data scale. Data path, all paths and many more. It’s available now for a free
analysts, enterprise architects and application developers download and trial.
can use AnzoGraph to build and execute data warehouse
analytics, graph analytics and inferencing, all in one award-
winning database. Graph OLAP databases offer better AnzoGraph by Cambridge Semantics
performance on deep, long-running queries when compared www.anzograph.com
30 SPONSORED CONTENT DBTA | DECEMBER 2018/JANUARY 2019

PRODUCT SPOTLIGHT

Datavail
DATAVAIL DELTA MONITORING • Availability
EXPANDS TO MYSQL AND ORACLE • Configuration
ON WINDOWS SERVER. • Backups
As the core component of data- • Scheduled Jobs
driven software, your databases • Security and Compliance
need to be continuously managed to • Performance
make your business more agile and • Capacity Planning
less susceptible to risk. This is why With intelligent alerts that dispatch the right level of
Eric Russo, SVP, Datavail developed our custom-built response, Datavail Delta ensures your most pressing issues
Database Services
software, Delta, a world-class database receive immediate attention from database administration
monitoring solution. professionals.
We’re excited to share that our platform has expanded to Find out why 225+ Datavail customers are using Delta to
provide proactive monitoring for MySQL and Oracle. Now, monitor more than 5,000 servers and 125,000 databases. To
DBAs managing databases across cloud and on-premise learn more about Delta, visit our website, or contact Datavail
platforms can set aside tedious, manual database monitoring, today at 866-239-9538.
and rely on Delta to provide alerts for the most pressing issues.
Delta can report on your most critical areas: Datavail
www.datavail.com

Datawatch Angoss
DATAWATCH ANGOSS IS A global STUDIO for Apache Spark to efficiently build analytic
leader in delivering advanced workflows using datasets of all sizes. It effectively lever-
analytics to businesses looking ages Apache Spark’s ability to operate on datasets with
to monetize data with a focus on extremely large numbers of records, while improving
logistics, risk, marketing and sales. SparkSQL and SparkML queries on datasets that have
Datawatch Angoss removes the thousands of columns. Very small datasets can also
complexity inherent to predic- be efficiently included in Spark analytical workflows.
tive analytics and machine learn- Support for Hive, Parquet, and distributed CSV is also
Michael Rowley,
Director Global ing by offering a platform that is featured.
Product Marketing both intuitive to use and rich in Leading global organizations in financial services,
features. These features include insurance, retail and technology rely on Datawatch
intuitive workflows with drag-and-drop functionality Angoss to grow revenue, increase sales productivity and
that train, test and deploy predictive models for quicker improve marketing effectiveness while reducing risk and
time-to-insight in model outputs. cost. Datawatch Angoss solutions include scenario com-
KnowledgeSTUDIO for Apache Spark provides a pro- parison, cloud-based model deployment and real-time
ductivity tool for Spark users, allowing them to inter- scoring, and sentiment data analytics to capture trending
act with Spark via a graphical user interface to generate conversations about consumer brands and product.
error-free code that may be used in production scripts.
Data science teams that are modeling in a Big Data envi- Datawatch Angoss
ronment and outside of it can use Angoss Knowledge- https://www.datawatch.com
DECEMBER 2018/JANUARY 2019 | DBTA 31

PRODUCT SPOTLIGHT

Delphix
TODAY’S MODERN BUSINESSES need age to corporate reputations and customer perceptions.
efficient, secure access to data to Businesses are under tremendous pressure to deliver data
innovate faster to keep pace with for innovation while protecting it from crippling security
consumers demands and avoid breaches.
disruption. At Delphix, our mission is to solve these pain points
An overwhelming majority of Fortune by eliminating data friction. As the critical foundation
1000 executives fear disruption from for any strong DataOps strategy, we provide automated,
nimble, data-driven competitors, secure, self-service access to data for the people who need
Eric Schrock,
CTO and say the inability to be agile and it to drive innovation and business agility. The Delphix
compete on data is the most press- Dynamic Data Platform virtualizes and secures data to
ing challenge facing their business today. Their fear is not make it lightweight and accessible for the people and
unwarranted. Over the last decade, nearly three-quarters teams who need it, regardless of their size.
of Fortune 1000 companies have been replaced, despite Hundreds of the largest and most valuable brands in
aggressive investments in software to fend-off more agile, the world depend on Delphix to accelerate the time to
tech-savvy competitors. market for their key initiatives, and greatly enhance the
Access to data for business is no longer a for- quality of their offerings, including one-third of Fortune
ward-thinking trend, it’s a prerequisite to survival in 100 companies.
today’s fast-moving market. However, data “friction”
can keep companies from moving fast while prioritizing Delphix
security. Just one data breach can cause profound dam- www.delphix.com

Denodo Technologies
ORGANIZATIONS ARE MOVING data to advanced data infrastructures. Beyond hybrid on-premises/
the cloud, driven by efficiencies of cloud environments, multilocation architectures also support
scale; reductions in the total cost of environments that extend across multiple cloud providers.
hardware, software, and the entire IT Such environments may leverage AWS for some applications,
infrastructure; flexible, usage-based Azure for others, and Google Cloud for still others, including
payment structures; and the ability emerging AI applications.
to provision new IT capabilities in The Denodo Platform supports multilocation architecture
minutes rather than months. through data virtualization, which enables real-time data
Ravi Shankar,
CMO Data Migrating data from on-premises integration between on-premises systems and cloud systems,
Virtualization systems requires a certain amount or between different cloud systems, all without moving the
of downtime, however, and many data. Data virtualization establishes a universal data access
organizations are discovering that their employees’, customers’ layer between the sources. To access the data, no matter where it
and partners’ tolerance for downtime is rapidly approaching resides, a user or consuming application simply needs to access
zero. Such organizations need a multilocation architecture, the data virtualization layer, which gets a view of the requested
which is a data architecture that enables access to both data in real time. The Denodo Platform provides the most
on-premises and cloud systems simultaneously. With a advanced data virtualization solution, enabling all the benefits
multilocation architecture, migrations can take place without of a modern multilocation architecture!
downtime, and without users’ knowledge that a migration has
taken place.
Multilocation architectures enable location transparency Denodo Technologies
and data abstraction, and as such they support the world’s most www.denodo.com
32 SPONSORED CONTENT DBTA | DECEMBER 2018/JANUARY 2019

PRODUCT SPOTLIGHT

Empolis
WITH THE CLOUD-BASED solution and easily with their colleagues and always have the full
Empolis Service Express, techni- range of information available online and offline.
cal customer service is successfully Integrated analytics tools instantly make it clear where
digitized immediately, with no problems exist, how customers can be assisted and taps into
large investments. Empolis Ser- the potential for improvement. Thus, information quality is
vice Express provides global, scal- continuously enhanced.
able access to distributed service Empolis Service Express is also available as a mobile service
information for service personnel, app for smartphone or tablet assistance for support field
Stefan Wess,
CEO partners, and customers. Empolis technicians as they conduct their service appointments. Since
Service Express uniquely combines the app has full offline capability, all relevant information and
intelligent search and guided troubleshooting based on arti- the company’s entire service knowledge can be accessed on
ficial intelligence. site, at any time, without an internet connection. All of this,
Magical service moments are created with Empolis in an instant, free of tedious data input, simply by selecting
Service Express, therefore allowing service staff to deliver options or via voice commands, supported by comprehensive
excellent customer service and become real service heroes. ergonomic design and user guidance. The Empolis Service
They have access to all relevant service information with Express mobile app is available in the app store.
a single action, distributed across a wide range of tools.
They find the right information immediately, even in large Empolis
documents. All employees share their experiences quickly www.empolis.com

erwin EDGE
WE DEVELOPED THE erwin EDGE plat- With the erwin EDGE, organizations can:
form to deliver an “enterprise data
governance experience,” so the mod- • Discover data: Identify and integrate metadata from
ern business can accelerate the trans- various data management silos.
formation of mission-critical data • Harvest data: Automate the collection of metadata
from various data management silos and consolidate it
into accurate and actionable insights.
into a single source.
The erwin EDGE ensures collab-
• Structure data: Connect physical metadata to specific
Mariann McDonagh, oration between IT and the business
Chief Marketing
business terms and definitions and reusable design
to discover, understand and unlock
Officer standards.
the value of data both at rest and in • Analyze data: Understand how data relates to the busi-
motion. With data governance at the hub, it brings together ness and what attributes it has.
business process, enterprise architecture, data mapping and • Map data flows: Identify where to integrate data and
data modeling to simplify the complete data management track how it moves and transforms.
and governance lifecycle. And it features the broadest set of • Govern data: Develop a governance model to manage
metadata connectors and automated code generation, data standards and policies and set best practices.
mapping and cataloging tools available today. • Socialize data: Enable stakeholders to see data in one
This single, integrated solution makes it possible to place and in the context of their roles.
gather business intelligence, conduct IT audits, ensure regu-
latory compliance and accomplish any other organizational
objective by fueling an automated, high-quality and real- erwin, Inc.
time data pipeline. www.erwin.com
DECEMBER 2018/JANUARY 2019 | DBTA 33

PRODUCT SPOTLIGHT

Franz Inc. ALLEGROGRAPH—GRAPH DATABASE FOR AI KNOWLEDGE GRAPHS


ARTIFICIAL INTELLIGENCE (AI) is one Semantic Graph Database technology as the foundation. If
of the top investment areas for you really want to develop your corporate Knowledge Graph
companies looking to improve and address complex Artificial Intelligence problems, you
ROI on operations and products, need a data system that goes beyond just data. You have to
and to create customer 360 views. create a system that can link to anything outside your own
Using AI to create “Enterprise predefined parameters—and that can learn from previous
Knowledge” and link it across the experiences. That is where a Semantic Graph Database, like
Jans Aasman, Enterprise to create a “Knowledge AllegroGraph, comes into the picture.
CEO Graph” is a key differentiator for Franz Inc. provides a variety of services as part of its
companies in an ever-increasing Knowledge Graph platform solution: from architectural
competitive landscape. The foun- consulting and technical seminars to training. Franz’s flag-
dation for Knowledge Graphs and ship product, AllegroGraph, provides the necessary power
Artificial Intelligence lies in the facets of semantic technol- and flexibility to address high-security data environments
ogy provided by Franz’s AllegroGraph database. Semantic such as HIPAA access controls, privacy rules for banks, and
Graph databases, such as AllegroGraph, provide the core security models for policing, intelligence, and government.
technology environment to enrich and contextualize the Contact Franz Inc. to unleash the potential of your Com-
understanding of data. The ability to rapidly integrate pany’s Knowledge Graph.
new knowledge is the crux of the Knowledge Graph and
depends entirely on semantic technologies.
An early innovator in Artificial Intelligence, Franz Inc. Franz Inc.
is a leading supplier of Knowledge Graph solutions with https://franz.com/

Griddable
AS ENTERPRISES EMBRACE hybrid Oracle database or re-architecting for microservices, the
cloud as the new long-term reality, Griddable data pipeline efficiently synchronizes to multiple
database architects need a simple destinations with low latency while supporting data cus-
approach to migrate enterprise data tomizations at each destination.
to the cloud, re-engineer for cloud- The Griddable policy engine provides user-friendly
first architectures, and ensure por- controls for modernizing and transforming data in transit.
tability of data across multiple pub- Griddable masks or encrypts any number of individual data
Robin Purohit, lic clouds to avoid lock-in. elements using separate masking algorithms or encryption
CEO Griddable migrates and mod- keys. It also filters and replaces data values, or selectively
ernizes legacy data seamlessly across heterogeneous clouds removes entire rows or columns, with an easily-defined user
and database types. The Griddable platform is a cloud-first policy. Using Griddable.io’s selective filtering and synchro-
product, providing data synchronization with scale up, scale nization, cloud operators can accelerate movement of key
out, high availability features out of the box. Most impor- data to best-fit platforms. Decisions regarding data place-
tantly, Griddable provides an automated user experience ment can be made dynamically during production and
that requires nothing more than connecting the source adjusted to operational parameters.
and target databases to the grid. Once connected, Gridda- To learn more about Griddable.io, download our white
ble automates schema copying, initial load, and subsequent paper https://griddable.io/hybrid-cloud/.
change data capture in a continuous process that requires
no user intervention.
Many use cases today require modernizing data to mul- Griddable
tiple targets simultaneously. Whether replacing a legacy griddable.io
34 SPONSORED CONTENT DBTA | DECEMBER 2018/JANUARY 2019

PRODUCT SPOTLIGHT

InfluxData
TIME SERIES HAS been the fastest metrics and events that deliver insights and competitive
growing database category for the advantage to data-driven organizations.
last two years, according to DB-En- InfluxData provides the leading time series plat-
gines. This growth is being fueled form, built from the ground up, for analyzing metrics
by two major industry trends—the and events for DevOps and IoT applications. Whether
rapid instrumentation of the phys- the data comes from humans, sensors, or machines,
ical world, driven by increasing InfluxData empowers developers to build next-gen-
investment in IoT systems, and an eration monitoring, analytics, and IoT applications
Mark Herring,
CMO explosion in the software world faster, easier, and to scale to deliver real business value,
of cloud-native applications and quickly. Based in San Francisco, InfluxData’s more than
services, all of which are being 450 customers include Cisco, eBay, IBM and Siemens.
instrumented for real-time visibility and control. This
“Age of Instrumentation” has generated major demand for
purpose-built time series platforms that can support the InfluxData
critical requirement for real-time processing of the myriad www.influxdata.com

InterSystems IRIS DATA PLATFORM: INTUITIVE, RELIABLE, INTEROPERABLE, AND SCALABLE


BUSINESSES NEED NEW applications API management, and real-time monitoring and alerting
to meet the growing demands of capabilities to support the full spectrum of integration sce-
society’s advancing technology— narios and requirements.
applications that are smarter, faster, Data Analytics: A powerful, open analytics platform
and that can scale more quickly supports a wide range of analytics and is able to analyze real-
and cost-effectively. The solution? time and batch data simultaneously at scale. Developers can
InterSystems IRIS Data Platform™, embed analytic processing into business processes and trans-
Carlos Kuhl Nogueira, a single, unified platform that pro- actional applications, enabling sophisticated programmatic
General Manager, Data vides all of the following capabilities decisions based on real-time analyses. Natural language
Platform Initiatives to efficiently accommodate increas- processing capabilities allow developers to extract meaning
ing workloads and data sizes: and sentiment from unstructured text.
Data Management: InterSystems IRIS is an ultra-high perfor- Cloud Deployment: Automated “cloud-first” deployment
mance, horizontally scalable, multi-model database. It stores options simplify public cloud, private cloud, on premise, and
and accesses data modeled as objects, schema-free data, rela- virtual machine deployments and updates.
tional data, and multi-dimensional arrays in a single, highly
efficient representation. It simultaneously processes both trans- InterSystems IRIS redefines high performance for application
actional and analytic workloads in a single database at very high developers, systems integrators and end-user organizations who
scale, eliminating latencies between event, insight, and action. develop and deploy data-rich and mission-critical solutions.

Interoperability: A comprehensive, integration platform pro-


vides application integration, data coordination, business InterSystems
process orchestration, composite application development, InterSystems.com/IRIS
DECEMBER 2018/JANUARY 2019 | DBTA 35

PRODUCT SPOTLIGHT

Kore Technologies
KOURER INTEGRATOR IS Kore’s flag- as server). Plus, now you can integrate to other third-party
ship product for advanced Enter- applications via their REST endpoints (REST as a client).
prise Application Integration (EAI) Using Kourier’s new application “Connectors” you can use
and Extract, Transform and Load REST to talk to other applications from your MultiValue
(ETL) solutions. system (e.g., CRM, Service Management and eCommerce)
Kore clients, partners and VARs just as easily as they can talk to your system.
have been using Kourier for years to Integration can be hard, so we endeavor to make it easier
extend the value and functionality using our “clicks not code” approach. Developers are more
Mark Dobransky,
Co-Founder and of their enterprise applications by productive creating REST-based integrations using Kouri-
Managing Partner creating near real-time data ware- er’s template-based APIs and application connectors.
houses from disparate data sources Kore understands the challenges faced by developers and
and through asynchronous integration with best-in-class technology is constantly changing. We strive to provide effi-
applications. cient and easy-to-use software and to react quickly to the
However, the speed of business today often requires needs of our partners and clients so that we continue be a
more synchronous, real-time integration between appli- trend-setting solution provider.
cations. RESTful Web Services are now the technology of
choice and Kourier is evolving to meet these demands.
Our latest release of Kourier continues to improve on Kore Technologies
its ability to create REST APIs that give third-party appli- www.koretech.com
cations secure, rated access to your MultiValue data (REST Or call: 866-763-5673

Melissa TIRED OF THE ETL GRIND? WORK FASTER AND SMARTER WITH MELISSA AND TALEND
IN THE ERA of big data, DBAs spend demographics such as consumer or business name, address,
much of their time transferring, phone or email, gender, household income, FIPS Code,
migrating, and cleaning disparate deceased info, and much more.
data. Melissa makes this more effi- We also offer our Global Address Verification com-
cient by adding data quality into the ponent to clean, standardize, transliterate, and verify
workflow to help DBAs work faster deliverability of addresses in over 240 countries.
and smarter with clean data. We’ve The combined solution brings agility in the data
Bud Walker, teamed up with Talend, a leading pipeline as data processing and data quality controls
VP of Enterprise data integrations firm, to offer a can be applied inflight, on the cloud, or on-premise in
Sales & Strategy faster, easier way to connect and real-time or batch.
clean data with over 900 built-in Boost your productivity and easily clean and integrate all
connectors, advanced string manipulations, and a unified your data with the combined power of Melissa’s data quality
metadata repository. solutions and Talend’s robust open source ETL tool. Free
Through the Talend Open Studio platform, Melissa offers trials, unlimited tech support, and a 120-day ROI guarantee
built-in data quality components including Personator, our are available.
ID verification tool. Personator matches name to address,
email, and phone number, and verifies national ID and age, Melissa
in real-time. The tool also enriches your contact data with www.melissa.com/dbta-etl
36 SPONSORED CONTENT DBTA | DECEMBER 2018/JANUARY 2019

PRODUCT SPOTLIGHT

Navicat
NAVICAT IS AN industry-leading and With more than 16 years of providing database manage-
award-winning database management ment solutions and more than 40% of Fortune 500 compa-
and development solution. Our first nies counting on Navicat every day, we will continue to offer
product—Navicat for MySQL—was world-class customer support and exciting new features. We
launched in 2002. To help our users stay are always working on new features and innovations to help
competitive in today’s business world, your business gain a competitive edge.
we continue to improve our products Download a free 14-day free trial of :
and add new database support. Today, Navicat Premium:
our top-rated prod-uct, Navicat Pre- https://www.navicat.com/download/navicat-premium
mium, supports 7 databases within a single application, includ- Navicat for MongoDB:
ing MySQL, MariaDB, MongoDB, SQL Server, SQLite,Oracle https://www.navicat.com/en/download
and PostgreSQL, and supports 12 languages. /navicat-for-mongodb
Navicat Monitor is our new product line. “We know how Navicat Monitor:
hard and important it is to monitor your database perfor- https://www.navicat.com/en/download
mance,” said Ken Lin, CEO at PremiumSoft. “With Navicat /navicat-monitor
Monitor, we want our users to be able to keep track of how
their database is used and alerted if there are any thresh-
Navicat
old breaches so as to ensure their database performs to the
www.navicat.com
highest standards. Now, Navicat Monitor supports MySQL Navicat is the choice of over 3 million database users all
and MariaDB databases. We will continue to add new data- around the world. Over 160,000 registered customers across
base support and new features in the future.” 7 continents and 138 countries have chosen our products.

Percona
PERCONA MONITORING AND MANAGE- • MySQL query plan information via EXPLAIN in table
MENT (PMM) is a free, open source and JSON format so you can evaluate the level of
database monitoring and perfor- optimization.
mance optimi- zation tool for MySQL, Metrics Monitor displays database activity over time,
MongoDB, and PostgreSQL. PMM including:
builds on other open source proj- • MySQL, MongoDB, and PostgreSQL: Queries Per
ects including Grafana, Prometheus, Second (QPS), replication, storage engine-specific
Michael Coburn, and Consul and is deployed in your (InnoDB, WiredTiger, MyRocks, RocksDB, MMAPv1,
Product Manager environment on your equipment for MyISAM, In-Memory, TokuDB, and Aria engines)
for Percona maximum security and reliability. • System-level resources: CPU, Load, Memory, Network
Monitoring and
Management PMM gives you two views on your Other important features of PMM include:
database performance: Query Analyt- • High Availability support via a ProxySQL Overview
ics provides information about queries running in your data- dashboard
base, and Metrics Monitor plots database metrics over time. • Amazon RDS and Amazon Aurora for MySQL and
Query Analytics helps you optimize database perfor- PostgreSQL—PMM consumes and displays Cloud-
mance by identifying those queries in MySQL and Mon- Watch metrics in order to provide a full picture of
goDB that consume the most amount of resources, provid- database activity in RDS
ing clarity on which queries deserve attention first. Query • External Exporters—Graph any service using PMM!
Analytics highlights the following query attributes: Download your free copy of PMM at percona.com/pmm.
• Query response time, locks, rows sent, rows examined
• Percona Server for MySQL only: InnoDB operations: Percona
reads, waits, Temporary Table usage percona.com
DECEMBER 2018/JANUARY 2019 | DBTA 37

PRODUCT SPOTLIGHT

Pythian
KICK AAAS OVERVIEW: HOW PYTH- things such as automated ETL and data science sandboxes.
IAN’S CLOUD-NATIVE ANALYTICS PLATFORM Kick AaaS features work together like building blocks so
SOLVES THE DATA SILO PROBLEM you can easily add the features you need or remove the
Data-driven companies under- ones you don’t. And because it’s built in the cloud—where
stand that the traditional data much of your big data lives anyway—Kick AaaS grows with
warehouse isn’t up to today’s com- your needs, allowing you to continually add data sources,
plex big data challenges. and enable self-service analytics and better insights using
Ron Kennedy, Pythian’s Kick AaaS was devel- more data sources.
Director of Product oped to answer a specific problem The keys to the success of Kick AaaS lie in its ability to
Management experienced by our clients: that of work on any of the top cloud platforms (Google Cloud
finally opening up the possibili- Platform, Microsoft Azure or AWS), its independence
ties locked within their disparate array of data sets siloed from any particular BI tool, its agility to add new data at
in different databases, and governed by disconnected business speed, and its unique ability to clean, deduplicate
departments. and unify data at scale for better insights across the orga-
Kick AaaS is the cloud-native analytics platform that nization. This means any user can access more data using
solves the data silo problem. It takes the hard work out any BI or visualization tool.
of capturing, curating and preparing data for consump- Find out how Kick AaaS can help you break down data
tion by a range of users and systems. It’s enterprise-wide, silos to drive better business value.
cloud-native data integration and sharing made simple.
The platform scales as your needs evolve. It starts with
software that provides lightning-fast provisioning of data Pythian
infrastructure, then builds on that foundation to enable pythian.com

Redis Labs
WITH EVERY ENTERPRISE managing data structures and versatile modules that adapt to each
and operating in the cloud, deliv- enterprise’s data needs.
ering an instant experience has Redis Enterprise is available as a fully managed data-
become crucial, impacting enter- base-as-a-service, in the cloud or virtual private cloud,
prises operating in every indus- and is downloadable for any cloud platform or private
try including financial, entertain- data center deployment. Powering global companies of
ment, transportation, ecommerce, every size and industry, Redis Enterprise is used for a
Ofer Bengal. and more. Redis Labs is the home wide range of use cases including high speed transac-
Co-Founder of Redis, the world’s most popular tions, job and queue management, user session manage-
and CEO in-memory database, and com- ment, real time data ingest, notifications, content cach-
mercial provider of Redis Enterprise. Combining the ing, and time-series data.
vibrancy of the open source community and world-class Redis Enterprise is a true multi-model database which
engineering, Redis Enterprise delivers an extremely fast offers the quickest time to market, highest levels of per-
multi-model database that scales easily and delivers ideal formance and cost efficiency to be the primary database
functionality with great simplicity. platform for the cloud-native world where access to
Because Redis is the fastest database in the industry, information has to be everywhere, all the time, and in
companies are able to deliver on consumers’ expecta- an instant.
tion for instantaneous responses from applications that
touch every part of people’s lives. Redis combines the Redis Labs
best of in-memory, schema-less design with optimized https://redislabs.com/
38 SPONSORED CONTENT DBTA | DECEMBER 2018/JANUARY 2019

PRODUCT SPOTLIGHT

RedPoint Global
CONSUMERS ARE MORE empowered an open garden approach and utilization of content from
than ever before and are willing NoSQL and document databases, RedPoint provides an
to leave brands that fail to meet agile, open, connected architecture that can leverage
expectations. Providing a consis- existing marketing technology and easily incorporate
tent, highly personalized experi- new technology advancements.
ence across every customer touch- RedPoint CEH integrates with the vast ecosystem of
point is what consumers expect, marketing technologies and offers marketers continu-
Dale Renner, and has become a strategic imper- ously updated insights into how each customer transacts
CEO ative for brands, financial institu- with your organization, including preferences, behaviors,
tions, consumer goods, hospitality offers, purchases, and latest interactions across every
organizations, and retailers alike. touchpoint. This enables organizations to drive higher
The RedPoint Global Customer Engagement Hub™ revenue and lifetime customer value while lowering
(CEH) enables business to overcome data silos, while interaction costs, truly engaging the customer where,
leveraging business rules and processes to personalize when and how each individual prefers.
information for the customer in real time. It does this
by recognizing the customer in context, utilizing AI and
machine learning to build a detailed customer profile,
determine the next-best action, and manage delivery and RedPoint Global
intelligent orchestration of offer management. Through www.redpointglobal.com

Revelation Software
OPENINSIGHT 10 MIGHT be the most Do you want to program in a green screen editor? You
visually appealing, intuitive, Mul- can. Or would you prefer a newer one, with keyword color-
tiValue development tool on the ing, code tips, and interactive, context sensitive help? You
market today. Whether you and can do that as well.
your staff have been developing These editors, as well as Form Designers, Reporting
since Dick Pick was still cutting Tools, Database Tools, are all contained in OpenInsight’s
code, or you have new hires just IDE: Integrated Design Environment. You’ll recognize the
Mike Ruane, coming out of school, OpenInsight look and feel of this tool and will pick it up easily and dis-
President and CEO will let you leverage your skills and cover it’s the exact thing needed to bring your MultiValue
expertise to create brand new appli- application up to customers’ current expectations. A
cations, or update and modernize your current offerings. graphical, intuitive, easy to use application.
Other vendors and products offer tools promising sim- Why learn a new language and environment, such as
ilar outcomes. With OpenInsight, you don’t need to learn Python, when there’s an application development tool that
a new language like Java or Python. OpenInsight uses will give you the same end result, but leverages your exist-
BASIC—the same programming language you’re famil- ing experience and skillset? Give OpenInsight a try. You’re
iar with, with new functions and calls. So, the commands going to love it.
you know, the functions, the case-insensitivity of the pro-
grams, IF/THEN/ELSE structures instead of spaces: they’re Revelation Software
all available in OpenInsight. www.revelation.com
DECEMBER 2018/JANUARY 2019 | DBTA 39

PRODUCT SPOTLIGHT

Robin Systems
ROBIN IS THE HYPER-CONVERGED cation to a simple managed service-like experience, and
Kubernetes platform for big data, automates deployment and lifecycle management of appli-
databases, and AI/ML. Robin helps cations and data, leading to higher agility and lower cost.
enterprises achieve faster reali- For DevOps and IT architects, Robin brings agility to
zation of critical IT and business react to LoB and CXO asks, higher DevOps productivity, as
initiatives like containerization, well as infrastructure utilization, and extends Kubernetes
cloud migration, multi-cloud benefits to big data and databases without new staffing or
Premal Buch, strategy, data analytics, and cost complex projects. Robin provides 1-click deploy, snapshot,
CEO consolidation. clone, scale, and upgrade for big data and databases. Robin
Enterprise customers today enables 1-click cloud migration for applications including
have to develop custom workflows to deploy and manage data, and guarantees QoS and SLA while consolidating big
each application in their big data/AI/ML pipelines and data and database workloads.
operational databases, and repeat that for each on-prem- Robin enables CIOs to execute a faster roll-out of
ise and cloud installation, leading to high cost, complex- critical IT initiatives (containerization, cloud migration,
ity, and time-to-value. With Hyper-converged Kubernetes, multi-cloud strategy, cost-consolidation) and business ini-
Robin is the only solution that embeds application lifecy- tiatives (AI/ML, analytics projects), while empowering the
cle management into an integrated storage, network, and staff with self-service infrastructure and accountability.
cloud infrastructure stack.
As a result, only Robin makes these applications agnos- Robin Systems
tic of infrastructure choices, elevates every data appli- www.robin.io

Rocket Software
ROCKET UNIVERSE, A CORE component • Never lose a transaction,
of the Rocket MultiValue Application • Reduce the risk of file corruption,
Platform, is the backbone of critical • Expedite restoration when required.
business applications around the R&D test results show that high-volume ATM transaction
world. As such, UniVerse must speeds were 10% faster, and data entry processing was 60%
not only meet the demands of faster. Key performance benefits include:
today’s applications, but also exceed • Intelligent queue management combines all record
expectations for performance and updates into one,
Julianna Cammarano,
Director, reliability. • Field-level updates eliminates the need to write the entire
MultiValue and The latest release of UniVerse record in replication, audit logging, and within your own
Business Intelligence rearchitects the core database application,
Product Marketing
processing engine to support a high- • Intelligent query optimization evaluates SQL statements
efficiency, high-availability, recoverable file system (RFS). and refines the order for optimal processing,
UniVerse now maintains data integrity while providing the • Performance monitoring for deep diagnostics pinpoints
accelerated transaction throughput speeds that high-volume trouble areas so you can fine-tune your system.
business applications require. Rocket UniVerse not only meets the demands of modern
RFS, a complement to HA/DR, maintains a persistent applications, but exceeds expectations for performance and
change log (journal) so that when unexpected outages occur reliability for critical business applications.
due to fire, flood, other natural disasters or plain-old network
failure, the file system can quickly restore to the last complete Rocket Software
transaction. RFS users will: www.rocketsoftware.com
40 SPONSORED CONTENT DBTA | DECEMBER 2018/JANUARY 2019

PRODUCT SPOTLIGHT

TigerGraph
TIGERGRAPH IS THE WORLD’S fastest them to uncover connections that were previously too
graph analytics platform. Founded impractical to reach or too cumbersome to express.
in 2012, TigerGraph is designed to TigerGraph supports applications such as Artificial
unleash the power of intercon- Intelligence (AI), machine learning, Anti-Fraud, Cus-
nected data for deeper insights tomer 360 and Internet of Things (IoT) to make sense of
and better outcomes. TigerGraph ever-changing big data.
can load 100-200 GB of data per TigerGraph’s proven technology is used by customers
Yu Xu, CEO machine per hour, while the other including Uber, VISA, Intuit, Zillow, State Grid Corpo-
leading graph analytics solutions ration of China and Alipay. It scales up to 100,000+ que-
require 24 hours or more. ries per second across datasets with 100+ billion vertices
TigerGraph is delivering the next stage in the evolution (orders, payments, customers etc.) and over a trillion
of the graph database: the first system capable of real- edges or relationships. TigerGraph provides pre-built
time analytics on web-scale data. TigerGraph’s Native data schemas, queries and algorithms you can tailor to
Parallel Graph™ (NPG) design supports Hybrid Transac- your business needs.
tional/Analytical Processing (HTAP), offering real-time Download TigerGraph Developer Edition at TigerGraph
ACID transactions (OLTP) and superfast multi-hop ana- .com/developer to experience the world’s fastest graph ana-
lytics (OLAP), no matter how large or complex the data- lytics platform.
set. TigerGraph’s SQL-like graph query language GSQL
provides for ad-hoc exploration and interactive analysis
of Big Data. With GSQL’s expressive capabilities and NPG TigerGraph
speed, users can perform Deep Link Analytics, allowing www.tigergraph.com
FEATURING THESE SPECIAL EVENTS

MAY 21–22, 2019 REGISTRATION


IS OPEN!
Use code
PRECONFERENCE WORKSHOPS DBT19
MONDAY, MAY 20 to SAVE
$100 NOW.

HYATT REGENCY
BOSTON
BOSTON, MA
dbta.com/datasummit
STEP OUT of everyday execution mode this May and join us for 3 days BROUGHT TO YOU BY
of practical advice, inspiring thought leadership, and in-depth training.
Join your peers to learn, share, and celebrate the trends and technologies
shaping the future of data. See where the world of Big Data and data science
is going and how to get there first.

ORGANIZED AND PRODUCED BY CONNECT: #DataSummit


42 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S DECEMBER 2018/JANUARY 2019

APPLICATIONS

How to Help
Your DBAs
Evolve With
Automation
By Robert Reeves

EVERYTHING CHANGES—ESPECIALLY when be a candidate for automation—even tem administrators automated much of
we seek to automate tasks. Automation is those once considered sacred and per- their jobs, just not in a way that enabled
leading to amazing consumer benefits— formed by the DBAs, e.g., SQL script self-service. That automation was limited
from vehicles to clothing to voice-acti- reviews and deployments. Automation to making the system administrator’s life
vated devices—and making our lives will inevitably impact everything from easier, not the end user’s. Now with sys-
better. In the workplace, as automa- code development and application test- tem provisioning automation, end users
tion becomes applied to repetitive ing to database deployment. Evolution is can request systems that meet corporate
tasks that are being handled manually, needed to respond to change—and here standards without having to go through
people become concerned about their is why that is a good thing. a ticket system or procurement. This has
livelihoods. Will they lose their jobs? Let me be clear from the start: Auto- provided benefits to the end user and
How will they provide for themselves mation replaces tasks, not people. Auto- the business but also the system admin-
and their family? How will automation mation will change how DBAs perform istrators. The system administrator role
impact them personally? These are nat- their tasks, but it will not change the has shifted to roles such as site reliability
ural and healthy questions. But they value they can provide their company. engineers or other more interesting and
do not change reality, especially in the We have seen this before with Infrastruc- strategically beneficial roles. It’s time for
world of software development, where ture as Code (IaC) and tools such as Pup- DBAs, and their companies, to share the
sooner or later every repetitive task will pet and Chef. Prior to adopting IaC, sys- same benefit.
DECEMBER 2018/JANUARY 2019 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S 43

APPLICATIONS

Today, the DBA role is insular and course, the need for control and safety need to sit in a classroom for 2 weeks and
focused on protecting data. (Data is, after has not disappeared, but the way it is take a test to become certified. There are
all, the most valuable asset a company applied must change. Just as IaC usage endless and, often, free resources online
owns.) However, tomorrow requires data provides a menu of system options for to take advantage of, mostly from the ven-
to be available and managed in a more end users, DBAs must provide options dors themselves.
decentralized fashion. DBAs must bal- for their end users that allow them to But this evolution will not happen in
ance data protection with data availability reach their goals in the manner they a vacuum. Today’s DBAs are the most
to better serve application development
and testing teams. Tomorrow is a “You
build it, you run it” world that requires
DBAs must start thinking in terms of
tasks such as database schema changes to ‘as a service’ instead of bespoke, one-off,
be made just as easily as application code
changes. Moreover, DBAs are often siloed discrete tasks. If your only entry point to the
DBA is a ticket system, then you and your
by the database platform they support.
With the advent of more specialized data
platforms, being an Oracle DBA alone is
not enough for some companies; they are
company are not evolving.
demanding more from their DBAs.
And there’s no time to waste, as the deem best. This is very much similar to overworked group in IT. To demand that
adoption rate for new technology has offering choices to a child. As my son DBAs add yet another list of tasks to
increased dramatically during the past matured and sought more screen time their already-overflowing email inbox is
century. Electricity was made commer- for video games, I didn’t hand him a a fool’s errand. Management must make
cially available in 1873, and it took 46 game controller. We started gradually this a priority and help. IT management
years for one-quarter of Americans to offering choices such as at bath time: “Do must understand recent computer sci-
use it; the internet first became available you want to stay in the tub or get out ence grads are not becoming DBAs. Pro-
in 1991, and it took just 7 years to reach now and play video games for a few extra grams must be created and funded to
the same level of adoption. The rush to minutes?” Sometimes he chose now and retain DBAs and help them evolve. If not,
containers by cloud providers and devel- other times he chose to stay. Not to liken those DBAs will find more progressive
opment teams is happening even faster, software developers to children, but they companies and take the knowledge and
which makes it all the more pressing for do not have the experience and resulting experience between their ears with them.
DBAs to evolve at a pace that can keep up pattern recognition skills that a seasoned I have seen firsthand what compa-
with these changes. DBA has. But, the solution cannot be, nies are able to accomplish when they
DBAs must start to think about their “Let me see your SQL script,” followed make this change a priority. These com-
jobs differently. Instead of simply focus- by a lengthy manual review. DBAs must panies can deliver compelling services
ing on data control and safety (which is balance safety with speed. to their customers far faster than their
important), DBAs must also think about To that end, DBAs must enhance their competition … and their stock perfor-
other business imperatives, such as appli- skill set and evolve to meet these demands. mance reflects the results. Furthermore,
cation delivery speed, platform flexibility, Learning a programming language other employees are happier, more engaged,
and data accessibility. To balance what than SQL should be the first step. I rec- and able to deliver better results for the
may seem to be competing interests, DBAs ommend Python, as IEEE has once again management team. Remember: Compa-
should consider evolving their practices put it at the top of its list as it continues nies that are high IT performers deliver
and incorporating automation. This will to expand its lead on C++, C, and Java. better stock returns than their lower-per-
help meet the demands of the business. All modern tools have a Python API, and forming peers. And, that is exactly what
Automation is key to this necessary database automation tools are no excep- management’s compensation is tied to.
evolution. DBAs must start thinking in tion. (If not, you’re using the wrong tool.) Bottom line: Help your DBAs evolve and
terms of “as a service” instead of bespoke, Furthermore, experience with a database you are going to make your bonus. ■
one-off, discrete tasks. If your only entry platform other than the one you are certi-
point to the DBA is a ticket system, then fied on is a must. Find out which platform Robert Reeves is co-founder and CTO
you and your company are not evolv- your CTO has identified as the company’s of Datical, a provider of agile database
ing. Self-service must be the priority. Of future and learn that. Today, we no longer automation solutions.
44 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S DECEMBER 2018/JANUARY 2019

APPLICATIONS

Blockchain Fundamentals

Q&A With Paul Tatro, Founder of Blockchain U Online


BLOCKCHAIN, THE DISTRIBUTED ledger tech- transaction—that is where the possibility is valid, you have effectively taken the pro-
nology, has the potential to impact a comes from. cess for validation and codified it so the
diverse range of industries from agricul- process integrity is ensured. The computer
ture to accounting to healthcare. Sup- And the “trust layer for the internet”? software will run the same every time. The
porting integrity and trust among entities, The internet is becoming almost like other thing is the immutability of it. It is
blockchain can help track the movement electricity. If you don’t have it, you can’t virtually impossible to make changes to a
of goods and services—and data itself. function; but we also know that the inter- blockchain once a block has been added.
Recently Paul Tatro, founder of Blockchain net is full of traps, so to have something That ensures that if data is in a blockchain,
U Online, talked about where blockchain that can put trust into an application that it has not been changed. When you com-
is used now and what’s ahead in 2019. is available in a distributed fashion across bine the immutability of blockchain with
the internet is something that is very the resistance to hacker attacks due to its
What is the big advantage of unique and powerful, and it is what gives distributed nature, I think it yields the
blockchain? blockchain its appeal. highest degree of integrity that you will
By delivering records that convey offer find with any application.
and acceptance, a blockchain is a value What are the key benefits of
exchange protocol that provides a trust blockchain for the enterprise? What are some of the areas that
layer for the internet and then digitally First off, it provides the highest degree need to be strengthened in order for
records the data in a shared distributed of accountability in any application envi- blockchain to take a more prominent
ledger in packages called blocks. If you ronment. This is where aspects such as place in the enterprise?
look at the idea of “offer and acceptance,” consensus and the value exchange proto- There are a couple of things that have
that is a fundamental element of any col come in. These are part of the account- to get sorted out before people jump in
transaction so that gives you the idea of ability of the application and so third par- with both feet. One is the idea of regu-
the ubiquitous potential of blockchain ties in the middle are not needed. lations. Until governments decide how a
because it can be involved anywhere that Blockchain also guarantees the valid- truly distributed application is enforced
you have an offer-and-acceptance type of ity of transactions by recording them in or governed, it is going to be tricky.
scenario. several places. It is a distributed type of Let’s say there is a blockchain spanning
application so there isn’t a single point jurisdictions and geographic boundar-
What is the significance of a “value of failure. If there are 100,000 nodes par- ies—such as Bitcoin. Which rules apply?
exchange protocol”? ticipating in a blockchain, then there are Should it be U.S. rules? Maybe there was
Through a blockchain, you are able to 100,000 copies of the transaction, so to an issue that took place in Dubai. Who
not only say, “We are going to do a deal speak. It makes it more difficult for hack- really is involved? There is an old say-
together” but also settle on the terms, ers, and the integrity of what happened ing, “When more than one person is in
conditions, and the value of that deal as is guaranteed because it is recorded in so charge, one is to blame.” That needs to be
part of the transaction being completed many different places. clarified because companies don’t want
by the blockchain. The concept of a to find themselves vulnerable to laws they
value exchange protocol is very powerful How are the other advantages? didn’t anticipate. This could happen with
and this is where this idea of eliminating Because you are not relying on some- a public or private blockchain due to the
trusted third parties in the middle of a one looking over forms and deciding what distributed nature.
DECEMBER 2018/JANUARY 2019 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S 45

APPLICATIONS

What else? rency like Bitcoin. Having an alternative of smart contracts was not a part of it, and
A second concern is standards. There currency has a greater potential impact it did one thing—it transferred Bitcoin
need to be standards about how block- than many people realize, especially when between parties. Version 2 was Ethereum,
chains work. Currently, there are more you understand that more than 70% of the which introduced the first smart contract
than 25 different consensus methods world is unbanked. Bitcoin, particularly, language that enabled business logic or
in blockchain technologies. Each one has the potential to bring those people in rules to be put into a program and stored
works differently because of the consen- as participants to the worldwide economy, in a blockchain.
sus method. Bitcoin uses Proof of Work. and that is the big buzz around those kinds
Ethereum used to use Proof of Work but of currencies. What is version 3 of blockchain?
is shifting to a specialized Proof of Stake This is where additional services over
called Casper. EOS has a specialized Proof Are there other areas? and above smart contracts are becoming
of Stake called Delegated Proof-of-Stake. There was an announcement recently part of the blockchain. This will allow cor-
There are other protocols as well, such by IBM and Maersk about a blockchain porations to see what security and database
as Proof of Burn, and it goes on and on. shipping solution, so that is an example of components are available. These are things
To me, this adds to the confusion around another area. I think identity management that people have taken for granted in the
blockchain. The concepts are straightfor- is going to be yet another area. There are evolution of information technology over
ward, but when you get into some of the companies, such as one called uPort, that the years, and are now coming to block-
details, it gets to be a confusing ball of are advancing the concept of self-sover- chain. Governance over what happens
yarn. This is where some standardization eign identity so instead of some big com- in a blockchain environment is another
needs to come into play. pany controlling who looks at an individ- huge issue that needs to be considered. An
ual’s information and controlling what additional aspect is the network factor. If
How widespread is the use of information is dished out, the individual a company wants to introduce a shipping
blockchain in the enterprise now? can control who looks at their information application, in a blockchain world, it has
In corporate America, everyone is kind and what information is provided. The to get all the players involved. The dock
of looking at it, but I read a McKinsey arti- notion gets more interesting when you workers, the loaders, truck drivers—all
cle that estimated that 90% of the block- consider attestations of who you are and the companies involved—have to become
chain projects being undertaken today will what you have done. participants in the blockchain network for
never see the light of day. In other words, it to work effectively. So, to me, this idea of
companies are learning, experimenting, What do you see on the horizon for the network effect will be one of the big-
and trying to figure out what the impact 2019? gest challenges. ■
will be and what the process will be. We are effectively in what I call version
Obviously, one of the key things com- 3 of blockchain. Version 1 was basically This interview was conducted, edited, and
ing out of blockchain is an alternative cur- Bitcoin, a simple blockchain. The concept condensed by Joyce Wells
46 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S DECEMBER 2018/JANUARY 2019

MV SOLUTIONS

MULTIVALUE AND THE


CLOUD: FLEXIBILITY
FOR THE FUTURE
By Julianna Cammarano
WITH THE EVER-CHANGING business land- alerting rules around critical business func- use cloud-based orchestration tools to
scape, many companies that rely on tions. High availability and disaster recov- bring up test machines, deploy a new ver-
MultiValue platforms are looking for ery, required for business continuity, is also sion of an application to a small group,
ways to integrate existing systems with much easier to set up in the cloud. and then switch over to production once
new technologies—and specifically, fig- Many organizations are achieving it’s ready for general availability.
ure out how to leverage the capabilities dramatic improvements in overall per- MultiValue in the cloud also provides
of the cloud. formance by deploying MultiValue in the ability to employ automation for areas
Organizations often find themselves the cloud. This approach makes available such as load balancing. For example, a retail
torn between the platforms they’ve built a wide range of services such as Azure’s customer has been using cloud-friendly
their critical business applications on products to stay connected to his hybrid
and user demands for modern, intuitive architecture, which is a mix of on-premise
interfaces that provide anytime, anywhere The cloud provides a and cloud-deployed instances.
access to web and mobile applications. Plus, your MV application is always
In today’s evolving technology space, we low barrier to entry for available, even during a maintenance
have arrived at a tipping point. It is easier event. Recently, a large financial services
than ever to embrace the range of capabil- organizations looking firm tested accessibility of its cloud-based
ities by hosting application development MV application during just such a main-
platforms such as MultiValue in the cloud. to access powerful tenance session. Despite running repeated
This approach can deliver value to organi-
zations in many different ways, including
capabilities associated replication processes, pausing, and then
re-directing traffic to the replicated server,
improved performance, more overall busi- with MultiValue. the MV instance worked flawlessly without
ness elasticity, and new levels of automa- any issues.
tion as well as support for DevOps. Simply put, MultiValue in the cloud
To be fair, replacing a MultiValue system Application Insights for Logging. Applica- provides peace of mind. With an MV appli-
can be expensive, risky, and time-consum- tion Insights lets users open a dashboard cation development platform in the cloud,
ing, which is why companies that depend and quickly identify what’s running in the companies don’t have to worry about
on MultiValue platforms sometimes feel cloud, making it much easier to monitor running out of disk space or data being
limited by their options. But now is a per- activity as well as search for and extract deleted or whether they are running the
fect time for companies who rely on the useful information and insights. most up-to-date version of an application.
power of MultiValue to modernize and Another key advantage of hosting Mul- It is all managed behind the scenes.
rejuvenate existing applications by moving tiValue in the cloud is the level of overall The ultimate goal, of course, is to make
them to the cloud. business elasticity that it provides. Orga- it easier for businesses to modernize and
For starters, the cloud today provides a nizations now have the freedom to “pay as grow by leveraging current technologies
low barrier to entry for organizations look- you go,” reducing the need to purchase and while also building in flexibility for the
ing to access powerful capabilities associated support a complex system that requires a future. ■
with MultiValue. If a company moves its major capital expense and could represent
application, it can now quickly and easily use increased risk. MultiValue in the cloud Julianna Cammarano is director of product
tools that deliver a wide range of function- frees up funding that can then be spent on marketing for business intelligence,
ality and are resident in the cloud. Examples more strategic business imperatives. analytics, and the MultiValue Application
include AWS CloudWatch or Azure Applica- We are seeing an increase in running Platform at Rocket Software (www
tion Insights for monitoring and setting up DevOps in the cloud. Businesses can now .rocketsoftware.com).
United States Postal Service | Statement of Ownership, Management and Circulation | (Requester Publications Only)

1. Publication Title: Database Trends and Applications. 2. Publication Number. 16-230. 3. Filing Each Issue During Preceding 12 Months, 4,654; No.Copies of Single Issue Published Nearest
Date: 10/1/18. 4. Issue Frequency: Bi-Monthly; Dec/Jan., Feb/Mar., Apr/May, June/Jul., Aug/ to Filing Date, 4,242. d. Non-requested Distribution (By Mail and Outside the Mail) (1) Outside
Sept., Oct/Nov. 5. Number of issues Published Annually: 6. 6. Annual Subscription Price: 0. 7. County Non-requested Copies Stated on PS Form 3541. (Include Sample copies, Requests Over
Complete Mailing Address of Known Office of Publication: Information Today, Inc., 143 Old 3 years old, Requests induced by a Premium, Bulk Sales and Requests including Association
Marlton Pike, Medford, Burlington County, NJ 08055. 8. Complete Mailing Address of Requests, Names obtained from Business Directories, Lists, and other sources): Average No.
Headquarters or General Business Office of Publisher: Unisphere Media a Division of Copies Each Issue During Preceding 12 Months, 0; No.Copies of Single Issue Published
Information Today, Inc., 143 Old Marlton Pike, Medford, NJ 08055. 9. Full Names and Complete Nearest to Filing Date, 0. (2) In-County Non-requested Copies Stated on PS Form 3541 (include
Mailing Addresses of Publisher, Editor, and Managing Editor: Publisher: Thomas Hogan, Jr. Sample copies, Requests Over 3 years old, Requests induced by a Premium, Bulk Sales and
Group Publisher, 143 Old Marlton Pike, Medford, NJ 08055-8750. Editor: None. Managing Requests including Association Requests, Names obtained from Business Directories, Lists,
Editor: Joyce Wells, 121 Chanlon Road, New Providence, NJ 07974. 10. Owner: Information and other sources): Average No. Copies Each Issue During Preceding 12 Months, 0; No. Copies
Today, Inc., 143 Old Marlton Pike, Medford, NJ 08055, Thomas H. Hogan, 143 Old Marlton of Single Issue Published Nearest to Filing Date, 0; (3) Non-requested Copies Distributed
Pike, Medford, NJ 08055, Roger R. Bilboul, 22 Earls Terrace, London W8 6LP, England. 11. Through the USPS by Other Classes of Mail (e.g., First-Class Mail, Non-requestor Copies
Known Bondholders, Mortgagees, and Other Security Holders Owning or Holding 1 percent or mailed in excess of 10% Limit mailed at Standard Mail or Package Services Rates): Average
More of Total Amount of Bonds, Mortgages or Other Securities If none, check box: None. 12. No. CopiesEach issue During Preceding 12 Months, 0; No. Copies of Single issue Published
Has Not Changed. 13. Publication Title: Database Trends and Applications. 14. Issue Date for Nearest Filing Date, 0. (4) Non-requested Copies Distributed Outside the Mail (Include Pickup
Circulation Data Below: Oct/Nov. 2018. 15. Extent and Nature of Circulation: a. Total Number Stands, Trade Shows, Showrooms and Other Sources): Average No. Copies Each Issue During
of Copies (Net press run): Average No. Copies Each Issue During Preceding 12 Months, 4,846; Preceding 12 Months, 96; No.Copies of Single Issue Published Nearest to Filing Date, 25. e.
No.Copies of Single Issue Published Nearest to Filing Date; 4,370. b. Legitimate Paid and/or Total Non-requested Distribution (Sum of 15d (1),(2), and (3)): Average No. Copies Each Issue
Requested Distribution. (1) Outside-County Paid/Requested Mail Subscriptions Stated on During Preceding 12 Months, 96; No.Copies of Single Issue Published Nearest to Filing Date;
Form 3541 (Include direct written request from recipient, telemarketing and Internet requests 25. f. Total Distribution (Sum of 15c and e): Average No. Copies Each Issue During Preceding
from recipient, paid subscriptions including nominal rate subscriptions, employer requests, 12 Months, 4,750; No.Copies of Single Issue Published Nearest to File Date; 4,267. g. Copies
advertiser’s proof copies, and exchange copies): Average No. Copies Each Issue During not Distributed: Average No. Copies Each issue During Preceding 12 Months, 96; No. Copies
Preceding 12 Months, 4,654; No.Copies of Single Issue Published Nearest to Filing Date, of Single Issue Published Nearest to Filing Date, 103. h. Total (Sum of 15f and g): Average No.
4,242. (2) In-County Paid/Requested Mail SubscriptionsStated on Form 3541 (Include direct Copies Each Issue During Preceding 12 Months, 4,846; No.Copies of Single Issue Published
written request from recipient, telemarketing and Internet requests from recipient, paid Nearest to Filing Date, 4,370. i. Percent Paid and/or Requested Circulation (15c divided by f
subscriptions including nominal rate subscriptions, employer requests, advertiser’s proof times 100): Average No. Copies Each Issue During Preceding 12 Months, 97.97%; No.Copies
copies, and exchange copies): Average No. Copies Each Issue During Preceding 12 Months, 0; of Single Issue Published Nearest to Filing Date, 99.41%. 16. Publication of Statement of
No.Copies of Single Issue Published Nearest to Filing Date, 0. (3) Sales Through Dealers and Ownership for a Requestor Publication is required and will be printed in the December 2018/
Carriers, Street Vendors, Counter Sales, and Other Paid or Requested Distribution Outside January 2019 issue of this publication. 17. Signature and Title of Editor, Publisher, Business
USPS: Average No. Copies Each Issue During Preceding 12 Months, 0; No.Copies of Single Manager or Owner: John C. Yersak, Vice President & CAO. Date: 10/1/18. I certify that all
Issue Published Nearest to Filing Date, 0. (4) Requested Copies Distributed by Other Mail information furnished on this form is true and complete. I understand that anyone who
Classes Through the USPS (e.g., First Class Mail): Average No. Copies Each Issue During furnishes false or misleading information on this form or who omits material or information
Preceding 12 Months, 0 ; No. Copies of Single Issue Published Nearest to Filing Date, 0. c. requested on the form may be subject to criminal sanctions (including fines and imprisonment)
Total Paid and/or Requested Circulation (Sum of 15b (1), (2), (3), and (4): Average No. Copies and/or civil sanctions (including civil penalties).
48 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S DECEMBER 2018/JANUARY 2019

MV SOLUTIONS

jBASE Helps Encompass Supply Chain


Streamline Development Operations
ZUMASYS, A PROVIDER OF NoSQL database advantage of improved reliability and supported or updated. Over time, the
software for business-critical PICK appli- performance for batch processing and database became slow and unreliable,
cations, has announced that Encompass web apps. resulting in downtime for the company’s
Supply Chain Solutions, Inc. has migrated As Encompass continues to grow ERP system and ecommerce applications.
its custom ERP system to jBASE. its business and service offerings, it is Encompass ruled out moving to
jBASE, Zumasys’ flagship product, is a critical to have the right systems in place SAP or Oracle, as switching database
NoSQL database developed nearly 30 years to run its operations and scale with platforms would have been too costly,
ago and used by the largest international confidence, said Encompass president risky, and time-consuming. The company
banks in the world. and CEO Robert Coolidge. “We are very also considered additional MultiValue
Encompass also transferred its pleased with Zumasys’ ability to develop database solutions before deciding on
infrastructure from a co-location facility a strong, dependable platform to provide jBASE from Zumasys.
to the Zumasys cloud, now owned and exceptional uptime and support for our With jBASE, Encompass could retain
operated by NexusTek. customers.” its custom features with a clear path
The move has helped Encompass According to Encompass, its business for the future. Zumasys also offered a
eliminate costly downtime and streamline success depends on its custom-developed truly cloud-ready database solution. “By
development for the company’s primary ERP application, which feeds off its PICK moving to the cloud, we can now focus
business applications. MultiValue database platform. As the our resources on development instead of
With its ERP application running company’s database approached its end running hardware,” added Brent Blair,
on jBASE, Encompass can now take of life, it was no longer being actively Encompass VP of IT. ■
DECEMBER 2018/JANUARY 2019 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S 49

ROB MANDEVILLE
Rob Mandeville is a senior DBA at SolarWinds,

Anomalies—
experienced with major RDBMS vendor engines.

Predicting the Past


DEFINITION OF ANOMALY: something that Example
deviates from what is standard, normal, or Consider two workers—both of their jobs
expected.1 have them starting at 8:00 a.m. One always
If you are a modern database professional, you arrives a few minutes before 8:00 a.m. and
have likely heard about or looked into anomalies one always arrives right at 8:30 a.m. As long as
and how to detect them. At a very simple level, their arrival patterns do not vary, there will be
anomaly detection looks at historical values and no anomaly. If the 8:00 a.m. worker arrived at
predicts future values based on them. An anomaly 8:05 a.m., would we call that an anomaly? Or
occurs when we miss the prediction with enough how about 8:15 a.m. or 8:30 a.m.? What if the
significance. second worker arrived at 8:00 a.m. one day? 7:30 a.m.? Would
In a very literal sense, we are predicting the past. we call those anomalies even though the behavior is better
than the norm? In the strictest sense of anomaly detection,
Innate Anomaly Detection we would call everything out that was considered an outlier.
We come with an innate ability to pick out anomalies. In fact, Then, depending on whether the outlier was below or above
part of our success as a species can be attributed to anomaly our expected value, we would be ready to place a judgment.
detection. The idea in this case is to identify outliers for more
rigorous inspection or analysis. The Judgment
• That spider doesn’t look similar to other ones I’ve encountered. At this point, we are ready to assign a good or bad deter-
• The branch wasn’t broken before. mination to our anomaly. In our example, the second worker
• This creek is much lower than usual for this time of year. comes in late each and every day. The fact that one day she
Noticing these things can help with survival. It could be argued came in at 8:00 a.m. or earlier was a good thing. In fact, it was
that’s a good thing. So, the fact that one of the hot focuses of data the normal that was bad. Now measuring something this sim-
science is anomaly detection should come as no surprise. However, plistic can of course be corrected by also comparing data
anomalies by themselves are not good or bad; they are just differ- values w ith an absolute or threshold, but just
ent from the norm. No judgment. Additionally, the absence of an using anomaly detection, this might create a flawed model. In
anomaly doesn’t imply good or bad—it just implies normal. There fact, check out this post (www.dbta.com/Columns/Next-Gen
still has to be the human element to help define the good and bad. -Data-Management/NEXT-GEN-DATA-MANAGEMENT
---Dangers-of-Statistical-Modeling-127922.aspx), where I discuss
The Science statistical bias and how it can impact results in quite an uninten-
If we are going to try to automate anomaly detection program- tional, yet negative, way.
matically, we need to decide on an algorithm that fits best. This is
likely an algorithm that best predicts future datapoints based on The Wrap Up
historically observed datapoints and gets better over time with more Using anomaly detection to surface deviations from the norm
data. The solution would also likely account for seasonality (same can be a powerful tool when automating monitoring for just
day of the week, same month of the year, etc.). On the surface, anom- about any statistical value. However, it will not uncover the bad
aly detection sounds great; and it can be, as long as you don’t base behavior that may be happening all the time and is only part of
too much judgment on it without the investigation and analysis as the story. Combining data from anomaly detection plus data that
to why the anomaly occurred. can surface good behavior versus bad behavior when it occurs as
Before we get to judgment, there’s still work to be done—we’re not the norm starts to get into more intelligent analysis. The hype is
quite done with how we detect anomalies with our algorithm yet. We legit. Anomaly detection can be a very powerful tool—just not in
have to define how far off from expected the new data point is to call it complete isolation.
significant (standard deviation being a popular choice). Once we have I did not cover the concept of confidence interval intention-
put some math behind defining how our algorithm appears and how ally—saving that for a later date. n
far off any next value needs to be before we call it abnormal, we can
start looking for anomalies in an automated fashion. 1
Cited from Google Dictionary using search term anomaly definition
50 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S DECEMBER 2018/JANUARY 2019

GUY HARRISON
Guy Harrison, a software professional with more than
20 years of experience, is a partner at Toba Capital and the
author of Next Generation Databases (Apress). Contact him at
guy@tobacapital.com.

Web Services Move Forward With GraphQL


IT’S GRATIFYING TO BE in the business long enough to see have been ongoing attempts to design a “better than REST”
new paradigms transition from lofty visions to every- platform for web services.
day standards. Such is the case with web services. When I first GraphQL has emerged as the favorite alternative to REST
started writing a column for DBTA, the evolution of web ser- for modern web API design. GraphQL was first used inter-
vices was well underway, although a battle for nally at Facebook before being open sourced
dominance between competing standards was in 2015. GraphQL is described as a data query
incomplete. In particular, there was competi- and manipulation language which at first glance
tion between a set of committee-defined stan- might suggest it has more in common with
dards (WS-this and WS-that) and a looser set SQL than with REST. However, in reality, both
of techniques based around simple protocols. GraphQL and REST are applicable across a very
As is so often the case in the software indus- similar range of web API scenarios.
try, developers continued to program while the EMERGING GraphQL implements a friendlier syntax than
standards bodies deliberated. By and large, TECHNOLOGIES REST and provides significant efficiency gains.
programmers rejected standards-based mech- REST calls typically return all data related to a
anisms in favor of a far more flexible approach: resource, while a GraphQL query can request
implementing web services by transmitting subsets of data, reducing network overhead.
XML documents across HTTP. Programmers discovered that GraphQL also allows a request to return related data with-
they could use HTTP directives and URL constructs to elimi- out having to create specialized API endpoints. For instance, a
nate a lot of the complexity required by SOAP and its cousins. GraphQL request could request details of a customer and all of
Eventually a widespread standard emerged for this simplis- that customers’ orders, implicitly navigating the relationships
tic approach: REST (Representational State Transfer). between those entities in the server. In REST, such a request
Strictly speaking, REST is an architectural style, not a plat- would have to be anticipated and explicitly created or would
form. To be “RESTful,” an API must adhere to several core have to be resolved using multiple REST calls.
principles first articulated by Roy Fielding. These principles GraphQL also provides a strongly typed schema which
define how a web application navigates through network of allows clients to navigate the API endpoints more effectively.
webpages or web resources. REST defines how requests iden- This allows for ad hoc requests and intelligent API browsers.
tify resources and navigate to related resources. For instance, a GraphQL is unlikely to completely overturn REST but
REST request might first identify a specific customer account, appears to take web services one further step forward toward
then use the results from that request to create a request that maturity. GraphQL is likely to further expedite the transition
navigates to that customers profile. from monolithic applications to loosely coupled microservices.
REST gained popularity throughout the 2000s, and today, The transition to microservices is generally regarded as a
REST using HTTP and JSON has created an almost universal good thing. However, there is a risk that tomorrow’s unmain-
language for web APIs. In so doing, REST has done more than tainable applications will be a confusing mess of poorly defined
any other technology to bring the vision of web services to reality. microservices. We’ll need strong, self-describing protocols to
However, similar to all practical general-purpose technol- avoid this scenario, and GraphQL seems a step in the right
ogies, REST is not perfectly suited to all scenarios, and there direction. n

Best Practices Series


Managing the Hybrid Future:
From Databases to Clouds FEB/MAR
2019
For sponsorship details contact Stephen Faig, stephen@dbta.com, or 908-795-3702.
DECEMBER 2018/JANUARY 2019 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S 51

KEVIN KLINE
Kevin Kline, a longtime Microsoft SQL Server MVP, is a founder and

Loads of Data and AI


former president of PASS and the author of SQL in a Nutshell. Kline
tweets at @kekline and blogs at https://blogs.sentryone.com/kevinkline.

Announcements at Ignite 2018


THE 2018 MICROSOFT IGNITE conference was overflowing with • Want your apps to include a conversational personal intel-
attendees this year, as user enthusiasm continues to grow ligent assistant (PIA) that knows how to complete specific
with the advent of CEO Satya Nadella. Since space is short and corporate tasks? Check out the new Cortana Skills Kit for
the announcements are many, let’s get straight to the details of the Enterprise. Underpinned by Azure Bot Services and Azure
product innovations available for data professions. Language Understanding, imagine scenarios such as build-
ing a Cortana skill that lets employees ask the PIA to sched-
New in AI ule their vacation time, saving 30 to 60 minutes of intranet
If you’re interested in voice and natural language process- surfing to figure out how to make the arrangements them-
ing, then here are a couple of announcements selves. This program is by invitation only and not
just for you: all features have been announced. See https://bit.
• Not long ago, Microsoft had several distinct ly/2ArAIEz for more information.
artificial intelligence (AI) speech capabilities, My personal interests in AI mostly focus
including speech recognition, speech transla- on machine learning. Here, Microsoft further
tion, and customized models to create a unique advances its machine learning capabilities and
voice for your app. Those have been combined also puts more AI into its own products, such as
into Speech Service, now generally available. In in Microsoft 365, Excel, and Dynamix 365.
public preview is Human Parity Text to Speech, • New in Azure Machine Learning is the auto-
which utilizes the latest in deep neural network technology mated machine learning feature set which enables users to auto-
to make computer voices nearly indistinguishable from real mate: data transformations at speed, model selection for more
humans. Details are at https://bit.ly/2PtWT5r. efficient algorithms, and hyperparameter tuning to quickly learn
• Furthering the goal of making your apps behave even more the accuracy of a given pipeline’s predictions. The new SDK for
similar to that of a human being, the Microsoft Bot Frame- Python includes features for distributed deep learning, allow-
work v4 SDK is now generally available. You can program in ing massive clusters of GPUs, access to field programmable gate
C#, Java, Python, and JavaScript. It includes many tools for arrays (FPGA) for amazing speed at image processing, and easier
simplifying and building bots with a modular and extensible integration with IDEs such as Visual Studio Code, Jupyter note-
architecture, allowing easy selection of the components and books, Azure Databricks notebooks, and PyCharm. Read more
services you need. Details are at https://bit.ly/2OSpJwR. at https://bit.ly/2PSp2Qp.
52 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S DECEMBER 2018/JANUARY 2019

New in the Data Platform ibility (and potentially lower cost) when deploying apps to the
Microsoft used Ignite 2018 to announce the public preview of cloud, such as the new Azure SQL Database Hyperscale, a way to
SQL Server 2019, with a special focus on big data features. First, deploy a single but highly scalable database that can grow from a
Spark and Hadoop distributed file system (HDFS) are now built few to hundreds of terabytes in size. Azure SQL Data Warehouse
into SQL Server to accelerate ingestion, storage, and analysis of also provides a new, lower-entry point to help customers get
data at petabyte scale. There are also new direct query connectors started more quickly. Details are at https://bit.ly/2NKNOVw.
to Oracle, Teradata, and MongoDB. As with every release, there Microsoft’s NoSQL offerings have also grown:
are many security, usability, and optimization improvements. But • The Spark-like Azure Databricks product now provides
I will cover those in a separate, more detailed article. Azure Databricks Delta as public preview, improving data
Details are at https://bit.ly/2CFG8x1. reliability, simplified data pipelines, and improved job and
query performance. More information is at https://bit.
ly/2NRBtz5.
Not long ago, Microsoft had several • Azure Cosmos DB provides multi-master support for high
availability with single millisecond latency and better con-
distinct artificial intelligence speech flict resolution. The Reserve Capacity feature reduces costs
capabilities, including speech recognition, for using Cosmos DB. And the Cassandra API is further
enhanced for users familiar with that NoSQL platform.
speech translation, and customized models Details are at https://bit.ly/2RgbMoR.
to create a unique voice for your app. • Azure Data Explorer is in public preview. This is a new index-
ing and querying service for interacting, lightning-fast ad
Those have been combined into Speech hoc data exploration from data that originates in apps, serv-
ers, and edge devices. Details are at https://bit.ly/2yuQPzN.
Service, now generally available.
Other News for Data Professionals
There’s more than one way to learn what’s new, starting with
Azure SQL Database offers improved query performance fea- the new Microsoft Learn and the role-based Microsoft Certi-
tures under the Intelligent Query Processing moniker. This feature fications Microsoft Learn. This is free, interactive web-based
set includes row-mode memory grant feedback, approximate training with a tutorial approach to teach Azure and Business
query processing, and table variable deferred compilation. All Applications. There is cool progress tracking and gamification
three features work together to make SQL processing faster, more features such as achievements, while using free Azure resources
responsive to memory usage issues, and improve long-standing for hands-on learning. There’s also a new role-based set of Mic-
SQL coding issues. And, as happens with every release, Microsoft rosoft Certifications aligned to job roles.
has announced more pricing and performance tiers for great flex- Go to https://bit.ly/2N8hGWL to learn more. n
DECEMBER 2018/JANUARY 2019 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S 53

SIMON PANE
Simon Pane is a principal consultant working for Pythian and an
IOUG board member. As an Oracle ACE, he is a regular writer on
the Oracle blog (https://blog.pythian.com/author/pane), an editor
for IOUG SELECT Journal, and frequently speaks at major Oracle
conferences.

The Tools the Modern DBA Needs to Know


I OFTEN WORK WITH student or junior DBAs who ask about via Secure Shell (SSH) protocol. There’s no software to install
the skills they will need to be effective in the modern on the database servers—all that’s required is connectivity via
technical landscape. And, as the technology changes, of course SSH. In addition, targets can be easily grouped into an unlim-
so do the required skill sets. ited number of permutations. For example, “All”; “All prod”;
Sometimes I hear questions such as: “Should I learn Python and “All application X”—whatever is required.
or PHP to become an effective DBA?” My answer is usually, There are many online resources and videos to help DBAs
“Neither.” And to expand, I usually recommend that they don’t get started with Ansible. Once they see it in action and under-
focus on languages but rather on tools. I don’t mean learn- stand its simplicity and power, DBAs usually start to quickly
ing the intricacies of tool commands—those can be looked see opportunities for how to use it in their day-to-day jobs.
up—but more importantly tool concepts and
fundamentals. Git
The next logical question is about which The next logical tool that DBAs should
tools the modern DBA needs to know. In my understand is the concept of Git. Typically, this
opinion, the tool landscape that the DBA needs is in the form of GitHub (the largest online Git
to understand is moving toward the DevOps service, now owned by Microsoft). But GitHub
realm. That doesn’t mean that DBAs need to be isn’t the only Git implementation. GitLab, Atlas-
DevOps experts—they’re still different roles. sian Bitbucket, and local Git implementations
But rather, there is some overlap and, at the very are other similar options.
least, the DBAs need to be familiar with and, in The idea behind learning Git for the DBA
some cases, use these tools. is script management. Recognizing that administrators are
In the remainder of this article, I’ll explore the tools I per- not developers, they will always have some number of scripts
sonally think are most relevant to today’s DBA. which they’ve developed or must maintain. And maybe/hope-
fully, some of those are Ansible scripts!
Ansible Traditionally, DBAs would store these in a shared file sys-
If there were only one DevOps tool that the modern DBA tem or maybe an NFS mount or even a Windows shared drive.
should know, it would be Ansible. Often, I see DBA tasks that However, in the modern world, DBAs should be storing all
need to be performed against multiple databases in the envi- of their tools and scripts in a Git repository such as GitHub.
ronment. Maybe against all instances in the estate, maybe This provides a central repository, change history, and version
against just production or non-production, or maybe against comparisons, as well as the ability to easily share externally if
a named group of related databases. When environments are desired (repositories can be private or shared—functionality
large, I’ve seen a “divide and conquer” approach where a DBA and costs depends on the service used). DBAs typically use Git
team might split up the work, saying that person A will handle in a simplified workflow, although proper development teams
this group of database; person B, this other group; and so on. will use it in a more sophisticated manner that includes forks,
However, Ansible is a much better solution for these situations. branches, and other such advanced features.
Whenever the modern DBA needs to execute the same In fact, DBAs are probably already recognizing that GitHub
tasks against multiple databases, the answer should always be has become the standard location for external sharing of DBA
Ansible. Ansible is ideal for “automating manual tasks.” It is a tools and utilities. It is likely that their favorite public domain
free scripting language/tool but part of the beauty is that it’s tuning script is already being hosted in and shared via GitHub.
completely agentless—meaning that the Ansible software itself The beauty of DBAs learning Git is not just that it allows them
is only installed on one central computer such as a shared DBA to do their own internal script management, but that it also
machine or even the DBA’s desktop/workstation. From there, it allows them to provide feedback, suggestions, and updates to
performs all of the remote commands against target machines these public domain tools.
54 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S DECEMBER 2018/JANUARY 2019

Documentation Systems cases such as these, I tend to recommend Vagrant. Vagrant could
Another important recommendation from my perspective almost be considered “a scripting tool for automated deployments
is to use a proper wiki-style documentation system. If you’re of virtual machines.” This means that Vagrant allows for the easy
still documenting run books and other how-to types of docu- creation and destruction of VM environments, including those
mentation in Microsoft Word (and/or Excel), then I’m afraid for Oracle Virtualbox. Vagrant can start with publicly available
you’re already a dinosaur. Such documents are very difficult to VM images (called “boxes”) or you can create your own custom
keep up-to-date, version properly, and even find. The solution ones as a starting point. From there, Vagrant scripts can be used
is a proper wiki-style documentation system. to customize the machines upon creation.
My personal preference is Atlassian Confluence. It’s a very
easy-to-learn WYSIWYG wiki-ish system which allows for Other Honorable Mentions
easy editing, collaboration, and sharing. With a little CSS cus- Terraform
tomization, it even does a reasonably decent job of exporting If DBAs are responsible for provisioning cloud resources,
content to PDF. then DBAs should also familiarize themselves with Terraform.
Even if Confluence isn’t an option for you, there are other Terraform is from the same maker as and has some similar
wiki tools, including ones that are installed locally and others concepts as Vagrant. However, Vagrant is for the provisioning
that are online services. Regardless, the point is that any docu- of virtual machines whereas Terraform is for the provisioning
mentation including run books and how-to manuals should be of other infrastructure using code.
in a system that is easy to update, provides revision history, is In my experience though, provisioning of cloud infrastruc-
easily shared, is easily searchable, and ideally also provides col- ture is usually handled by system administrators, site reliabil-
laboration (i.e., commenting functionality). Document editors ity engineers, or DevOps teams and less commonly by DBAs.
and spreadsheets do not meet these requirements. Consequently, it’s good for DBAs to familiarize themselves
with Terraform, but they should leave the actual implementa-
Vagrant and Docker tions to those other experts.
Often DBAs want “test systems.” It’s easy to provision virtual Puppet
machines either with or without software pre-installed online I sometimes hear comments such as, “DBAs can automate
from cloud providers, but sometimes administrators are afraid their software installations and database creations with Pup-
of cloud costs or for other reasons want to do this locally. pet.” While technically true and I’ve seen this done in real life,
So how can we bring some of the ease and speed of pro- it’s not really the best tool for the job.
visioning that we see with online cloud systems to our local Puppet is best for enforcing consistent state. For example,
environment? The answer is tools such as Vagrant and Docker. for ensuring that a Linux iptables file is configured as per inter-
Both allow us to quickly spin up virtualized environments with nal standards. If anyone changes the file, Puppet will revert
consistency and automation. But which to use? it back. Unlike Ansible, Puppet also requires some additional
Deciding which is best, in my opinion, depends on your components: a main server and agents on target machines.
needs. Docker is based on the concept of containers and may So, for many reasons, I think Puppet is best left in the
be best when you need multiple copies of similar systems. For domain of system administrators. It is definitely useful but
example, maybe you need eight environments that are almost less so for DBA purposes. Most similar DBA-type activities
exactly the same so you can test scalability, replication, or that people may consider using Puppet for can often be imple-
failover options between them. In a case such as this, Docker mented instead with Ansible.
is quick and efficient—and usually using containers is signifi-
cantly more resource-efficient on host systems than full virtual Many Options
machines are. Modern DBAs should be excited to see how easily they can
But in other cases, DBAs need a mix of environments. They learn and adopt modern DevOps tools and integrate them into
may need to easily create (and soon destroy and re-create) a their day-to-day activities. The landscape of options is broad,
variety of systems such as different flavors of Linux alongside a but the technology stack discussed here can provide a valuable
Solaris system and a Windows server so they can test how their toolkit to help DBAs increase their efficiency and productivity
platform-agnostic tools or scripts will work against all of these. In both locally and in the cloud. ■
DECEMBER 2018/JANUARY 2019 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S 55

CRAIG S. MULLINS
Craig S. Mullins is president of Mullins Consulting, Inc. He’s an IBM
Gold Consultant and the author of two best-selling books, DB2 Devel-
oper’s Guide and Database Administration: The Complete Guide to DBA
Practices & Procedures. Website: www.mullinsconsulting.com

Managing Database Performance


MANAGING THE PERFORMANCE of database systems and appli- Database performance tools can identify bottlenecks and
cations is a significant job responsibility for DBAs. From a points of contention, monitor workload and throughput,
database perspective, there are three basic performance compo- review SQL performance and optimization, monitor storage
nents that must be performed: space and fragmentation, and view and man-
1. Monitoring the database management age your system and DBMS resource usage. Of
system and the applications accessing it to course, a single tool is unlikely to perform all
find problems as they arise. This is typically of these tasks, so you likely will need multi-
referred to as performance monitoring. ple tools (perhaps integrated into a functional
2. Analyzing performance data (logs, trace suite) to perform all of your required database
records, reports, etc.) from the system to performance management tasks.
determine the root cause of problems. Without proactive tools that can identify
3. Assembling a corrective action to imple- problems as they occur, database performance
ment a fix to the problems. problems are most commonly brought to the
There are database performance software products that can aid attention of the DBA by end users. The phone rings and the
with all three of these components. But you must be careful to fully DBA hears a complaint that is usually vague and a bit diffi-
understand the capabilities of any database performance manage- cult to interpret, such as, “My system is slow today” or, “My
ment solution, as some simply monitor, others just analyze data or
screen isn’t as fast as it used to be.” To resolve such issues,
provide fixes for problems, and others deliver functionality combin-
DBAs need tools that can help uncover the exact problem and
ing all of these tasks.
identify a solution. Database performance management tools
You can also break down database performance management
can be used to find the root cause of such problems as well as
software by the category of performance issues it addresses. Database
to deploy a solution to fix the problem.
performance problems can occur in any of the following three areas:
Furthermore, many organizations use multiple DBMS
• The DBMS itself, which must interact with other system soft-
products in production, and the same DBA team (and some-
ware and hardware, requiring proper configuration to ensure
times even the same exact DBA) will have to ensure the per-
it functions accurately and performs satisfactorily. Addi-
tionally, there are many database system parameters used to formance of more than one DBMS (such as Oracle and SQL
configure the behavior of the DBMS and the resources it has Server or Db2 and PostgreSQL). But each DBMS has different
available to it. This includes criteria such as memory capacity, interfaces, parameters, and settings that affect how it per-
I/O throughput, and locking of data pages. forms. Database performance tools can mitigate this com-
• The database design and schema, including database param- plexity with intelligent interfaces that make disparate compo-
eters, table designs, and indexing, can all impact database nents and settings look and feel similar from DBMS to DBMS.
performance. How the data is organized must also be man- There are many providers of database performance man-
aged; as data is modified in the database, its efficiency will agement tools, including the DBMS vendors (such as IBM,
degrade. Reorganization and defragmentation are required Microsoft, and Oracle), large ISVs (such as BMC and CA),
to periodically remedy disorganized data. and a wide array of niche vendors that focus on DBA and
• Finally, the SQL and application code itself can cause per- database performance software (for example, Quest, IDERA,
formance issues. Coding efficient SQL statements can be and Navicat).
complicated because there are many different ways to write The exact database performance management solutions
SQL that return the same results. But the efficiency and per- you should use depend upon the database systems you utilize,
formance of each formulation can vary significantly. DBAs the size of your organization, the amount of data managed,
need tools that can monitor the SQL code that’s being run, your service level agreements, and your budget. But managing
show the access paths it uses, and provide guidance on how production databases without performance tools is a recipe
to improve the code. for failure. n
56 D ATA B A S E T R E N D S A N D A P P L I C AT I O N S DECEMBER 2018/JANUARY 2019

TODD SCHRAML
Todd Schraml has more than 20 years of IT management, project
development, business analysis, and database design experience across
many industries from telecommunications to healthcare. He can be
reached at TWSchraml@gmail.com.

Beware the Frankenmart!


NOVICE DEVELOPERS ARE often main concepts. One, writing SQL to
confused about database access data correctly means having
design. And when they mature a detailed understanding of all the
into experienced personnel, they pieces. Consequently, there may be
unknowingly pass on bad habits a lengthy learning curve for a new DEC 2018/JAN 2019
to the next generation of develop- querier to become functional. And Ad Index
ers. The problem at hand is a very next, because of the numerous dif- Kore Technologies . . . . . . . . . . . . . . . . . 9, 48
human one. When putting together fering approaches contained within Melissa. . . . . . . . . . . . . . . . . . . . . Cover 2, 7
data marts, particularly data marts the single solution, scaling may not Revelation Software . . . . . . . . . . . . Cover 4
intended to be dimensional, some developers tend be straightforward. The simplest of changes to the SHARE . . . . . . . . . . . . . . . . . . . . . . . Cover 3
to think the way a typical end user might think. solution’s requirements might force massive, or even Wisconsin-Madison . . . . . . . . . . . . . . . . . . 47
End users are very creative. If end users have complete, refactoring that could prove quite costly.
a need, a function, a task at hand that the given Unlike our end user, data mart builders must Best Practices Sponsors
application does not directly support, they will understand what they are working to accom- Aerospike . . . . . . . . . . . . . . . . . . . . . . . . . . 14
find a way to instantiate that missing functionality. plish. The DBMS is not going to magically guide Delphix . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
It may require very unexpected uses of the exist- them to a solution. The builder is responsible for
ing solution, but the need will win out somehow. knowing how dimensional techniques work, why Trend-Setting Products
Developers may or may not understand the tenets they work, and what options may exist within the Aerospike . . . . . . . . . . . . . . . . . . . . . . . . . . 28
and approaches that comprise a “dimensional dimensional framework. The DBMS has a lot of BackOffice Associates . . . . . . . . . . . . . . . . 28
data mart”; and often those data marts are imple- functions for serving all sorts of purposes. Archi- Bradmark . . . . . . . . . . . . . . . . . . . . . . . . . . 29
mented inside relational DBMSs. Therefore, the tecture means in part that there is a place for the Cambridge Semantics . . . . . . . . . . . . . . . . 29
developer often feels that anything the DBMS will elements used, and the elements used are all in Datavail . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
do is “fair game” for their data mart. their proper place. The person serving as the data Datawatch . . . . . . . . . . . . . . . . . . . . . . . . . 30
A relational DBMS is a very generic tool, whereas architect must be the enforcer of the rules, or else Delphix . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
a multidimensional data mart is a specific kind of there will be none. A Frankenmart has pieces put Denodo . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
approach. Therefore, including “anything” that is together, but likely not in their proper place, and Empolis. . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
relational actually spans a great many features that that is the ultimate weakness. Simple is almost erwin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
are not multidimensional in nature. The result of always best. If the target is a multidimensional Franz . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
such a composite approach likely results in a data data mart, then work toward exactly that. Excep- Griddable . . . . . . . . . . . . . . . . . . . . . . . . . . 33
mart that has dimensional nomenclature applied to tions may arise, but those exceptions should be InfluxData . . . . . . . . . . . . . . . . . . . . . . . . . . 34
it, but that in practice is really an amalgamation of very rare and should be very well-documented. InterSystems . . . . . . . . . . . . . . . . . . . . . . . 34
third normal form, denormalized, and dimensional Therefore, the best results are most often simple Kore Technologies . . . . . . . . . . . . . . . . . . . 35
structures. Most optimistically, one could think of star schemas, with no snowflaking and no strange Melissa. . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
these data marts as a pastiche, a bricolage, or to take normalized components dancing around facts. Navicat . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
a little poetic license, a data-based portmanteau There should be no expectation of direct fact Percona. . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
morph, containing the best of all possible elements table-to-fact table joins. And ideally, even though Pythian . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
needed to deliver a solution. In practice, results are they are “legal moves,” there should be an absolute Redis Labs . . . . . . . . . . . . . . . . . . . . . . . . . 37
often less lofty. The data mart may be a complete minimum of bridge and outrigger tables. Enforce- RedPoint Global . . . . . . . . . . . . . . . . . . . . . 38
pile of junk, or a Frankenstein’s monster comprised ment is completely in the hands of the data archi- Revelation . . . . . . . . . . . . . . . . . . . . . . . . . 38
of pieces from here and there that function, sort of. tect in charge, and these individuals need to have Robin Systems . . . . . . . . . . . . . . . . . . . . . . 39
These Frankenmarts exist in many places. The lim- the fortitude and belief to imprint their will on the Rocket. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
itations of these beasts often are associated with two solution’s database design. ■ TigerGraph . . . . . . . . . . . . . . . . . . . . . . . . . 40
Software architects need database
development tools that evolve with their
rapidly changing business landscape.
We are Revelation Software, creators of
the OpenInsight Development Suite,
bringing you one of the best browser-
based, mobile computing and robust
reporting toolkits on the market. Go to
revelation.com and start inventing your
next great software solution today.

Potrebbero piacerti anche