CONSULTING SOLUTIONS OUTSOURCING

Unlock Big Data's Potential
in Financial Services
Kurt Lueck – Pactera – US ITS Director of BI & Analytics
Chris Hackett – Hortonworks – Enterprise Account Manager
Ajay Singh – Hortonworks – Director of Technical Channels

PARTNER FOR A NEW
ERA
Topics

1. Pactera & Hortonworks Intro
2. The Hortonworks Approach
3. Smart Banking Requires a Polyglot Approach
4. Catching the Christmas Grinch (Fraud Detection in 2013)
5. 360 Degree View of a Customer
6. Next Steps

© Pactera. Confidential. All Rights Reserved.

Global Footprint and Flexible Delivery Capabilities
Pactera is a global company strategically headquartered in China, enabling 360 partnerships
with global brands seeking to expand in one of the world’s largest and fastest-growing markets.

Global FTE: 24,000

Hortonworks Approach to Enterprise Hadoop

Community Driven Enterprise Apache Hadoop
Identify and introduce enterprise requirements into
the public domain

Work with the community to advance and incubate
open source projects

Apply Enterprise Rigor to provide the most stable
and reliable distribution

Hortonworks: The Value of “Open” for You
Connect With the Hadoop Community
We employ a large number of Apache project committers & innovators so that
you are represented in the open source community

Avoid Vendor Lock-In
Hortonworks Data Platform remains as close to the open source trunk as possible
and is developed 100% in the open so you are never locked in

The partners you rely on, rely on Hortonworks
We work with partners to deeply integrate Hadoop with data center technologies
so you can leverage existing skills and investments

Certified for the Enterprise
We engineer, test and certify the Hortonworks Data Platform at scale to ensure
reliability and stability you require for enterprise use

Support from the experts
We provide the highest quality of support for deploying at scale. You are
supported by hundreds of years of Hadoop experience

Our Mission:
Enable your Modern Data Architecture by
delivering One Enterprise Hadoop

Headquarters: Palo Alto, CA
Employees: 240+ and growing
Customers: 120+ and growing
Investors: Benchmark, Index, Yahoo, Dragoneer, Tenaya

Our Commitment

Innovate in the Open
We employ the core architects and operators of Hadoop and drive
innovation through open source Apache Foundation projects to
avoid vendor lock-in

Certify for the Enterprise
We engineer, test and certify the Hortonworks Data Platform for
enterprise usage and deliver the highest quality of support

Interoperate with the Ecosystem
We work with partners to deeply integrate Hadoop with key
technologies so you can leverage existing skills and investments

Trusted Partners with: [partner logos not reproduced]

© Hortonworks Inc. 2013 - Confidential

A Modern Data Architecture

[Diagram layers: APPLICATIONS (Custom Applications, Business Analytics, Packaged Applications); DEV & DATA TOOLS (Build & Test); OPERATIONAL TOOLS (Manage & Monitor); DATA SYSTEM (Repositories: RDBMS, EDW, MPP); SOURCES (Existing Sources: CRM, ERP, Clickstream, Logs; Emerging Sources: Sensor, Sentiment, Geo, Unstructured)]

Goal: Interoperable and Familiar

[Diagram layers: APPLICATIONS (BusinessObjects BI); DEV & DATA TOOLS; OPERATIONAL TOOLS; DATA SYSTEM (RDBMS, HANA, EDW, MPP); INFRASTRUCTURE; SOURCES (Existing Sources: CRM, ERP, Clickstream, Logs; Emerging Sources: Sensor, Sentiment, Geo, Unstructured)]

Betting on Hortonworks…

HDInsight & HDP for Windows
• Only Hadoop Distribution for Windows Azure & Windows Server
• Native integration with SQL Server, Excel, and System Center
• Extends Hadoop to .NET community

Teradata Portfolio for Hadoop
• Seamless data access between Teradata and Hadoop (SQL-H)
• Simple management & monitoring with Viewpoint integration
• Flexible deployment options

Instant Access + Infinite Scale
• SAP can assure their customers they are deploying an SAP HANA + Hadoop architecture fully supported by SAP
• Enables analytics apps (BOBJ) to interact with Hadoop

[Figure labels: Complete Portfolio for Hadoop; UDA Diagram; Appliances]

HDP: Enterprise Hadoop Platform
Hortonworks Data Platform (HDP)
• The ONLY 100% open source and complete platform
• Integrates full range of enterprise-ready services
• Certified and tested at scale
• Engineered for deep ecosystem interoperability

[Diagram: HORTONWORKS DATA PLATFORM (HDP). Layers: OPERATIONAL SERVICES, DATA SERVICES, HADOOP CORE, PLATFORM SERVICES. Components shown: AMBARI, FALCON*, OOZIE, FLUME, SQOOP, PIG, HIVE & HCATALOG, HBASE, LOAD & EXTRACT, NFS, WebHDFS, KNOX*, MAP REDUCE, TEZ, YARN, HDFS. Enterprise Readiness: High Availability, Disaster Recovery, Rolling Upgrades, Security and Snapshots. Deployment targets: OS/VM, Cloud, Appliance]

Transferring Hadoop Expertise
The expert source for Apache Hadoop
training & certification
• World class training programs
– Designed to help you learn fast
– Role-based hands on classes with 50% lab time
• Hadoop Certification demonstrates expertise in
Development & Administration
• Expert consulting services
• Programs designed to transfer knowledge
• Industry leading Hadoop Sandbox
– Free download
– Fastest way to learn Apache Hadoop
– Personal, portable Hadoop environment

BI in Financial Markets
A Polyglot Approach

Why Big Data
What Can You Not Do Today?

Store More for Less
“Data Lake”

• Fraud Detection
• 360 Degree View of Customer
• Account Risk Analysis
• Social Media Analysis
Many Aspects of Smart Banking

Polyglot approach
[Diagram: multi-channel data sources feed several persistence and processing technologies, which in turn serve Analytics, Massive Process, Transactional Applications, and Real Time BI.]

Multi-channel Data Sources
• ftp/ftps
• MQ/ESB
• WS Clients
• Connectors
• Streams Tools for stream data
• SQL for direct loading

Loading
• ETL – Transform while loading; ETL Tools (DataStage, Informatica, Flume, Sqoop, etc.)
• ELT – Transform after loading
• No Transform

Process
• Indexing, Clustering
• Interrupt processing
• Time sharing processing
• A new way of data processing, one technology of MPP (Massively Parallel Processing)

Persistence

RDBMS
• Traditional database for OLTP and OLAP
• ACID
• Scale up and scale out
• New MPP support
• Memory, RAC, Cache

NoSQL
• Key Value DB / Key Value Stores
• Large Column DB
• Document-oriented DB
• Graph DB

Hadoop
• Parallel data storage model
• BASE
• HDFS/GPFS

In-Memory Computing
• SAP HANA
• Software AG Terracotta
• Designed for real-time analytics and transactions
• Column-based compression
• Computing near persistence
• Large Memory, Disk Persistence

In-Database Computing
• SAS

Access
• SQL
• Map Reduce
• Data Mining
• CEP
• JDBC/MDX
• API/WS
Big Data is part of the Ecosystem
[Diagram labels. SOURCE DATA: clickstream, social, server logs, geo-location, sensor, text. Ingest: Flume, Sqoop. Big Data (compute & storage on YARN): BATCH (Map Reduce, Hive, Pig for data processing), INTERACTIVE (Hive/SQL), ONLINE (HBase), STREAMING (Storm), with HCatalog (table metadata). USE (compute & storage): DB, EDW, MPP, connected through ETL]

Fraud Detection in 2013
Catching the Christmas Grinch

Fraud Story Line


Old School

Fighting Fraud – Using Rules & Known Patterns

Charlotte, NC: -$500
Atlanta, GA: -$500
Dallas, TX: -$500
Hong Kong: -$500

Balance = $2000
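
A rule of this kind can be sketched in a few lines of Python. This is only an illustration, not the rule set from the presentation: the cities, amounts, and starting balance come from the slide, while the timestamps, the six-hour window, the two-city limit, and the function name `flag_city_hopping` are all assumptions made for the example.

```python
from datetime import datetime, timedelta

# Cities and amounts come from the slide; the timestamps are hypothetical.
TRANSACTIONS = [
    ("Charlotte, NC", -500, datetime(2013, 12, 24, 9, 0)),
    ("Atlanta, GA", -500, datetime(2013, 12, 24, 10, 30)),
    ("Dallas, TX", -500, datetime(2013, 12, 24, 12, 0)),
    ("Hong Kong", -500, datetime(2013, 12, 24, 13, 0)),
]


def flag_city_hopping(transactions, window=timedelta(hours=6), max_cities=2):
    """Flag each transaction that raises the number of distinct cities
    seen within `window` above `max_cities` (an assumed, illustrative rule)."""
    flagged = []
    ordered = sorted(transactions, key=lambda t: t[2])
    for i, (city, amount, when) in enumerate(ordered):
        # Distinct cities used up to and including this transaction, within the window.
        recent_cities = {c for c, _, ts in ordered[: i + 1] if when - ts <= window}
        if len(recent_cities) > max_cities:
            flagged.append((city, amount, when))
    return flagged


flagged = flag_city_hopping(TRANSACTIONS)
remaining_balance = 2000 + sum(amount for _, amount, _ in TRANSACTIONS)

for city, amount, when in flagged:
    print(f"Suspicious: {amount} in {city} at {when:%H:%M}")
print(f"Remaining balance: ${remaining_balance}")
```

Under these assumed timestamps the rule only fires on the third withdrawal, which shows the limitation of fixed rules: they catch patterns someone has already thought of, and only after part of the balance is gone.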
Fighting Fraud - Anomaly Detection
We have a very simple data model. Each credit card
transaction contains the following 4 attributes:
1. Transaction ID
2. Time of the day
3. Money spent
4. Vendor type

Here are some examples. The last one is an outlier, injected
into the data set.
YX66AJ9U  1025  20.47    Drug store
98ZCM6B1  1910  55.50    Restaurant
XXXX7362  0100  1875.40  Jeweler store
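
As a rough illustration of how such an outlier could be picked out, the sketch below scores the "money spent" attribute with a median-based modified z-score. The presentation does not say which anomaly detection method was used, so the scoring technique, the 3.5 threshold, and the function names are assumptions; only the three records are taken from the slide.

```python
from statistics import median

# The three example records from the slide:
# (transaction ID, time of day, money spent, vendor type).
RECORDS = [
    ("YX66AJ9U", "1025", 20.47, "Drug store"),
    ("98ZCM6B1", "1910", 55.50, "Restaurant"),
    ("XXXX7362", "0100", 1875.40, "Jeweler store"),
]


def modified_z_scores(values):
    """Score each value by its distance from the median, scaled by the
    median absolute deviation (MAD). Robust to the outliers themselves."""
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        return [0.0 for _ in values]
    return [0.6745 * (v - med) / mad for v in values]


def find_outliers(records, threshold=3.5):
    """Return the IDs of transactions whose amount scores above the threshold."""
    scores = modified_z_scores([amount for _, _, amount, _ in records])
    return [rec[0] for rec, score in zip(records, scores) if abs(score) > threshold]


outlier_ids = find_outliers(RECORDS)
print(outlier_ids)
```

A production model would look at all four attributes together (a large jewelry purchase at 01:00 is more unusual than the amount alone suggests) and would be trained on far more than three records.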
Fighting Fraud - Predictive Analytics

Predictive
Descriptive
Decision
*Predictive analytics is an area of statistical
analysis that deals with extracting
information from data and using it to predict
future trends and behavior patterns.
* Wikipedia
Fighting Fraud - Social Network Analysis

Additional Use Cases of Big
Data in Financial Services

6 Key Hadoop DATA TYPES
1. Sentiment
Understand how your customers feel about your
brand and products – right now

2. Clickstream
Capture and analyze website visitors’ data trails
and optimize your website

3. Sensor/Machine
Discover patterns in data streaming automatically
from remote sensors and machines

4. Geographic
Analyze location-based data to manage
operations where they occur

5. Server Logs
Research logs to diagnose process failures and
prevent security breaches

6. Text
Understand patterns in text across millions of web
pages, emails, and documents
Big Data in Financial Services

Financial Services

• Insurance Underwriting
• 360 Degree View of the Customer
• Website optimization

• Brand sentiment
• New Account Risk Screening

• Accelerate Loan Processing
Insurance Underwriting

Financial Services
Data: Geo, Text

Business Problem
• Insurance companies hold massive amounts of unstructured, text-based claim data
• Without analyzing both structured and unstructured data, insurance
companies have an incomplete view of risk
• Data scarcity leads to adverse selection: companies sell to risky
customers while safer individuals stay out of the market

Solution
• HDP gives underwriters more statistical confidence
• Store and use more data, from more sources, for longer
• Sensor and geographic data at large scale give real underwriting info
for car, home, crop and cargo insurance

Website Optimization

Financial Services
Data: Clickstream,

Business Problem
• Online bankers leave a long trail of clickstream data
• Clickstream data can reveal which product pages customers visit and
what interests them
• The huge volume of unstructured weblogs is difficult to store, refine
and analyze for insight
• Storing log data in relational databases is too expensive

Solution
• HDP stores all web logs, for years, at a low cost
• Banks use that to understand user paths, do basket analysis, run A/B
tests and prioritize site updates
• Improve customer service & reduce expense
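
To make "understand user paths" concrete, here is a minimal sketch that counts page-to-page transitions per session. The sessions and page names are hypothetical sample data, not taken from the presentation, and `count_transitions` is an illustrative helper. At the volumes described above, the same counting would typically run as a Hive or Pig job over the stored web logs rather than in a single Python process.

```python
from collections import Counter

# Hypothetical clickstream events in visit order: (session ID, page).
CLICKS = [
    ("s1", "/home"), ("s1", "/mortgages"), ("s1", "/apply"),
    ("s2", "/home"), ("s2", "/mortgages"),
    ("s3", "/home"), ("s3", "/cards"), ("s3", "/apply"),
]


def count_transitions(clicks):
    """Count how often visitors move from one page directly to another
    within the same session."""
    last_page = {}          # session ID -> most recent page seen
    transitions = Counter()
    for session, page in clicks:
        if session in last_page:
            transitions[(last_page[session], page)] += 1
        last_page[session] = page
    return transitions


transitions = count_transitions(CLICKS)
top_path, top_count = transitions.most_common(1)[0]
print(f"Most common step: {top_path[0]} -> {top_path[1]} ({top_count} sessions)")
```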
360° View of the Customer

Financial Services
Data: Clickstream, Text

Business Problem
• Banks interact with customers across multiple channels
• Customer interactions and product subscriptions are often siloed
• Few banks can correlate customer interactions with marketing
campaigns and online browsing behavior
• Merging data in relational databases is expensive
Solution
• HDP gives banks a 360° view of customer behavior
• Store data longer & track phases of the customer lifecycle
• Gain competitive advantage: increase sales, reduce service expense
and retain the best customers

Next Steps

Pactera Big Data Capability
Big Data Solution Architecture
• In-Memory Solutions
• Scalable Distributed Platforms

Next Generation Analytics
• Models, Algorithms, and Simulations
• Visualization

Improving Operational Ability
• Help companies drive more operational efficiencies from existing investments.
• Moving from the realm of data scientists into everyday business transactions and encounters.

New Business Processes
• Impact on both customer intelligence and operational efficiency by making everything immediately actionable.
• Armed with immediate decision-making capability and intelligence, companies will be able to implement new business processes that will change how business is done.
• We ask the Right Questions
How Pactera can help with Big Data
Executive Workshop
Strategies, Planning, and Expectations
• Big Data strategy on what tomorrow will look like
• Using Big Data to establish market dominance
• Big Data project takeaways
• Roadblocks to implementing Big Data analytics
• Defining an ROI for Big Data
• Getting the right ROI on Big Data

Technical Workshop
End-To-End Management
• System tuning/auto-tuning and configuration management
• Dealing with both structured and unstructured data
• Monitoring, diagnosis, and automated behavior detection
Solution Architecture
• Processor, memory, and system architectures for data analysis
• Benchmarks, metrics, and workload characterization for big data
• Availability, fault tolerance and recovery issues
• Data management and analytics for vast amounts of unstructured data

[Diagram stages: Workshop (4 Hours); POC (2-4 Weeks); Pilot Concept (2-4 Weeks); Benchmark and Monitoring; Implementation and Architecture]

Projects:
• Benchmark & Monitoring
• Integrations & Migrations
• Implementation & Architecture
• Project Management
• Analytics
• Reporting

Thank You
Kurt Lueck, Managing Director of BI & Analytics
Kurt.lueck@pactera.com
Chris Hackett
chackett@hortonworks.com
Ajay Singh
ajaysingh@hortonworks.com
© Pactera. Confidential. All Rights Reserved.

More Related Content

What's hot

Contexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti
 
Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...
Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...
Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...StampedeCon
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaCloudera, Inc.
 
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and HortonworksHow to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and HortonworksHortonworks
 
Complex Analytics using Open Source Technologies
Complex Analytics using Open Source TechnologiesComplex Analytics using Open Source Technologies
Complex Analytics using Open Source TechnologiesDataWorks Summit
 
Hadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyHadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyDataWorks Summit
 
Big Data IDEA 101 2019
Big Data IDEA 101 2019Big Data IDEA 101 2019
Big Data IDEA 101 2019Adam Doyle
 
Big Data Everywhere Chicago: The Big Data Imperative -- Discovering & Protect...
Big Data Everywhere Chicago: The Big Data Imperative -- Discovering & Protect...Big Data Everywhere Chicago: The Big Data Imperative -- Discovering & Protect...
Big Data Everywhere Chicago: The Big Data Imperative -- Discovering & Protect...BigDataEverywhere
 
Perspectives on Ethical Big Data Governance
Perspectives on Ethical Big Data GovernancePerspectives on Ethical Big Data Governance
Perspectives on Ethical Big Data GovernanceCloudera, Inc.
 
The Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyThe Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyCloudera, Inc.
 
Who, What, Where and How: Why You Want to Know
 Who, What, Where and How: Why You Want to Know Who, What, Where and How: Why You Want to Know
Who, What, Where and How: Why You Want to KnowEric Kavanagh
 
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemInformatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemCapgemini
 
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?Harald Erb
 
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoopDr. Wilfred Lin (Ph.D.)
 
IBM Watson Content Analytics: Discover Hidden Value in Your Unstructured Data
IBM Watson Content Analytics: Discover Hidden Value in Your Unstructured DataIBM Watson Content Analytics: Discover Hidden Value in Your Unstructured Data
IBM Watson Content Analytics: Discover Hidden Value in Your Unstructured DataPerficient, Inc.
 
Record manager 8.0 presentation
Record manager 8.0  presentationRecord manager 8.0  presentation
Record manager 8.0 presentationAndrey Karpov
 
Hilton's enterprise data journey
Hilton's enterprise data journeyHilton's enterprise data journey
Hilton's enterprise data journeyDataWorks Summit
 

What's hot (19)

Contexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to Production
 
Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...
Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...
Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
 
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and HortonworksHow to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
 
Complex Analytics using Open Source Technologies
Complex Analytics using Open Source TechnologiesComplex Analytics using Open Source Technologies
Complex Analytics using Open Source Technologies
 
Hadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyHadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata Company
 
Big Data IDEA 101 2019
Big Data IDEA 101 2019Big Data IDEA 101 2019
Big Data IDEA 101 2019
 
Big Data Everywhere Chicago: The Big Data Imperative -- Discovering & Protect...
Big Data Everywhere Chicago: The Big Data Imperative -- Discovering & Protect...Big Data Everywhere Chicago: The Big Data Imperative -- Discovering & Protect...
Big Data Everywhere Chicago: The Big Data Imperative -- Discovering & Protect...
 
Oracle big data discovery 994294
Oracle big data discovery   994294Oracle big data discovery   994294
Oracle big data discovery 994294
 
Perspectives on Ethical Big Data Governance
Perspectives on Ethical Big Data GovernancePerspectives on Ethical Big Data Governance
Perspectives on Ethical Big Data Governance
 
The Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyThe Five Markers on Your Big Data Journey
The Five Markers on Your Big Data Journey
 
Who, What, Where and How: Why You Want to Know
 Who, What, Where and How: Why You Want to Know Who, What, Where and How: Why You Want to Know
Who, What, Where and How: Why You Want to Know
 
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemInformatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake Ecosystem
 
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
 
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop
 
IBM Watson Content Analytics: Discover Hidden Value in Your Unstructured Data
IBM Watson Content Analytics: Discover Hidden Value in Your Unstructured DataIBM Watson Content Analytics: Discover Hidden Value in Your Unstructured Data
IBM Watson Content Analytics: Discover Hidden Value in Your Unstructured Data
 
Record manager 8.0 presentation
Record manager 8.0  presentationRecord manager 8.0  presentation
Record manager 8.0 presentation
 
Big Data
Big DataBig Data
Big Data
 
Hilton's enterprise data journey
Hilton's enterprise data journeyHilton's enterprise data journey
Hilton's enterprise data journey
 

Viewers also liked

Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Pactera_US
 
Hortonworks Technical Workshop: HDP everywhere - cloud considerations using...
Hortonworks Technical Workshop:   HDP everywhere - cloud considerations using...Hortonworks Technical Workshop:   HDP everywhere - cloud considerations using...
Hortonworks Technical Workshop: HDP everywhere - cloud considerations using...Hortonworks
 
Hortonworks technical workshop operations with ambari
Hortonworks technical workshop   operations with ambariHortonworks technical workshop   operations with ambari
Hortonworks technical workshop operations with ambariHortonworks
 
Siebel to Salesforce
Siebel to Salesforce Siebel to Salesforce
Siebel to Salesforce Pactera_US
 
Icons and Stencils for Hadoop
Icons and Stencils for HadoopIcons and Stencils for Hadoop
Icons and Stencils for HadoopHortonworks
 
Double Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSenseDouble Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSenseHortonworks
 

Viewers also liked (6)

Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data
 
Hortonworks Technical Workshop: HDP everywhere - cloud considerations using...
Hortonworks Technical Workshop:   HDP everywhere - cloud considerations using...Hortonworks Technical Workshop:   HDP everywhere - cloud considerations using...
Hortonworks Technical Workshop: HDP everywhere - cloud considerations using...
 
Hortonworks technical workshop operations with ambari
Hortonworks technical workshop   operations with ambariHortonworks technical workshop   operations with ambari
Hortonworks technical workshop operations with ambari
 
Siebel to Salesforce
Siebel to Salesforce Siebel to Salesforce
Siebel to Salesforce
 
Icons and Stencils for Hadoop
Icons and Stencils for HadoopIcons and Stencils for Hadoop
Icons and Stencils for Hadoop
 
Double Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSenseDouble Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSense
 

Similar to Unlock Big Data's Potential in Financial Services with Hortonworks

WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEnWCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEnWCIT 2014
 
Webinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalWebinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalHortonworks
 
Webinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalWebinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalHortonworks
 
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...Hortonworks
 
Open Analytics 2014 - Pedro Alves - Innovation though Open Source
Open Analytics 2014 - Pedro Alves - Innovation though Open SourceOpen Analytics 2014 - Pedro Alves - Innovation though Open Source
Open Analytics 2014 - Pedro Alves - Innovation though Open SourceOpenAnalytics Spain
 
Come fare business con i big data in concreto
Come fare business con i big data in concretoCome fare business con i big data in concreto
Come fare business con i big data in concretoHP Enterprise Italia
 
How advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sectorHow advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sectorMichael Haddad
 
Cloudian 451-hortonworks - webinar
Cloudian 451-hortonworks - webinarCloudian 451-hortonworks - webinar
Cloudian 451-hortonworks - webinarHortonworks
 
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseCreate your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseJeff Kelly
 
SIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess QlikSIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess QlikBardess Group
 
Predicting Customer Behavior With Big Data
Predicting Customer Behavior With Big Data Predicting Customer Behavior With Big Data
Predicting Customer Behavior With Big Data Pactera_US
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneySai Paravastu
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopInside Analysis
 
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...Hortonworks
 
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...Big Data Week
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US InformationJulian Tong
 
Tusker Corporate Profile
Tusker Corporate ProfileTusker Corporate Profile
Tusker Corporate ProfilePrashant Kumar
 

Similar to Unlock Big Data's Potential in Financial Services with Hortonworks (20)

WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEnWCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
 
Webinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalWebinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_final
 
Webinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalWebinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_final
 
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
 
Open Analytics 2014 - Pedro Alves - Innovation though Open Source
Open Analytics 2014 - Pedro Alves - Innovation though Open SourceOpen Analytics 2014 - Pedro Alves - Innovation though Open Source
Open Analytics 2014 - Pedro Alves - Innovation though Open Source
 
Big Data for BI - Beyond the Hype - Pentaho
Big Data for BI - Beyond the Hype - PentahoBig Data for BI - Beyond the Hype - Pentaho
Big Data for BI - Beyond the Hype - Pentaho
 
Come fare business con i big data in concreto
Unlock Big Data's Potential in Financial Services with Hortonworks

  • 1. CONSULTING SOLUTIONS OUTSOURCING Unlock Big Data's Potential in Financial Services Kurt Lueck – Pactera – US ITS Director of BI & Analytics Chris Hackett – Hortonworks – Enterprise Account Manager Ajay Singh – Hortonworks – Director of Technical Channels PARTNER FOR A NEW ERA
  • 2. Topics 1 Pactera & Hortonworks Intro 2 The Hortonworks Approach 3 Smart Banking Requires a Polyglot Approach 4 Catching the Christmas Grinch (Fraud Detection in 2013) 5 360 Degree View of a Customer 6 Next Steps © Pactera. Confidential. All Rights Reserved. 2
  • 3. Global Footprint and Flexible Delivery Capabilities: Pactera is a global company strategically headquartered in China, enabling 360 partnerships with global brands seeking to expand in one of the world’s largest and fastest-growing markets. Global FTE: 24,000.
  • 4. Hortonworks Approach to Enterprise Hadoop: Community Driven Enterprise Apache Hadoop. Identify and introduce enterprise requirements into the public domain. Work with the community to advance and incubate open source projects. Apply Enterprise Rigor to provide the most stable and reliable distribution.
  • 5. Hortonworks: The Value of “Open” for You. Connect With the Hadoop Community: We employ a large number of Apache project committers & innovators so that you are represented in the open source community. Avoid Vendor Lock: Hortonworks Data Platform remains as close to the open source trunk as possible and is developed 100% in the open so you are never locked in. The partners you rely on, rely on Hortonworks: We work with partners to deeply integrate Hadoop with data center technologies so you can leverage existing skills and investments. Certified for the Enterprise: We engineer, test and certify the Hortonworks Data Platform at scale to ensure reliability and stability you require for enterprise use. Support from the experts: We provide the highest quality of support for deploying at scale. You are supported by hundreds of years of Hadoop experience.
  • 6. Our Mission: Enable your Modern Data Architecture by delivering One Enterprise Hadoop. Headquarters: Palo Alto, CA. Employees: 240+ and growing. Customers: 120+ and growing. Investors: Benchmark, Index, Yahoo, Dragoneer, Tenaya. Our Commitment. Innovate in the Open: We employ the core architects and operators of Hadoop and drive innovation through open source Apache Foundation projects to avoid vendor lock-in. Certify for the Enterprise: We engineer, test and certify the Hortonworks Data Platform for enterprise usage and deliver the highest quality of support. Interoperate with the Ecosystem: We work with partners to deeply integrate Hadoop with key technologies so you can leverage existing skills and investments. Trusted Partners with: (partner logos). © Hortonworks Inc. 2013 - Confidential
  • 7. A Modern Data Architecture (diagram). Applications: Custom Applications, Business Analytics, Packaged Applications. Dev & Data Tools: Build & Test. Operational Tools: Manage & Monitor. Data System: RDBMS, EDW, MPP, Repositories. Sources: Existing Sources (CRM, ERP, Clickstream, Logs); Emerging Sources (Sensor, Sentiment, Geo, Unstructured).
  • 8. Goal: Interoperable and Familiar (diagram). Applications: BusinessObjects BI. Dev & Data Tools. Operational Tools. Data System: RDBMS, HANA, EDW, MPP. Infrastructure. Sources: Existing Sources (CRM, ERP, Clickstream, Logs); Emerging Sources (Sensor, Sentiment, Geo, Unstructured).
  • 9. Betting on Hortonworks… HDInsight & HDP for Windows: • Only Hadoop Distribution for Windows Azure & Windows Server • Native integration with SQL Server, Excel, and System Center • Extends Hadoop to .NET community. Teradata Portfolio for Hadoop (Complete Portfolio for Hadoop; UDA diagram; appliances): • Seamless data access between Teradata and Hadoop (SQL-H) • Simple management & monitoring with Viewpoint integration • Flexible deployment options. Instant Access + Infinite Scale: • SAP can assure their customers they are deploying an SAP HANA + Hadoop architecture fully supported by SAP • Enables analytics apps (BOBJ) to interact with Hadoop.
  • 10. HDP: Enterprise Hadoop Platform. Hortonworks Data Platform (HDP): • The ONLY 100% open source and complete platform • Integrates full range of enterprise-ready services • Certified and tested at scale • Engineered for deep ecosystem interoperability. Diagram labels: Operational Services (Ambari, Falcon*, Oozie); Data Services (Flume, Sqoop, Pig, Hive & HCatalog, HBase; Load & Extract: NFS, WebHDFS, Knox*); Hadoop Core (MapReduce, Tez, YARN, HDFS); Platform Services (Enterprise Readiness: High Availability, Disaster Recovery, Rolling Upgrades, Security and Snapshots); deployment on OS/VM, Cloud, or Appliance.
  • 11. Transferring Hadoop Expertise: The expert source for Apache Hadoop training & certification. • World class training programs – Designed to help you learn fast – Role-based hands on classes with 50% lab time • Hadoop Certification demonstrates expertise in Development & Administration • Expert consulting services • Programs designed to transfer knowledge • Industry leading Hadoop Sandbox – Free download – Fastest way to learn Apache Hadoop – Personal, portable Hadoop environment
  • 12. BI in Financial Markets: A Polyglot Approach
  • 13. Why Big Data. What Can You Not Do Today? • Fraud Detection • 360 Degree View of Customer • Account Risk Analysis • Social Media Analysis. Store More for Less: “Data Lake”
  • 14. Many Aspects of Smart Banking
  • 15. Polyglot approach (diagram). RDBMS: • Traditional database for OLTP and OLAP • ACID • Scale up and scale out • New MPP support. NoSQL: • Key Value DB / Key Value Stores • Large Column DB • Document-oriented DB • Graphic DB. Hadoop: • Parallel data storage model • BASE. In-Memory Computing: • SAP HANA • Software AG Terracotta • Designed for real time analytics and transaction • Column based compressing • Computing near persistence. In-Database Computing: • SAS. Other bullets: • Indexing, Clustering • Interrupt processing • Time sharing processing • A new way of data processing, one technology of MPP (Massive Parallel Processing). Other diagram labels: Analytics; Massive Process; Transactional Applications; Real Time BI; Process; Persistence; Transform; No Transform; Source; HDFS/GPFS; ftp/ftps; CEP; Data Mining; SQL; Map Reduce; Memory; RAC; Cache after loading; Streams; Tools for stream data; MQ/ESB; Connectors; ELT – Transform; ETL – Transform while loading; ETL Tools (datastage, informatica, flume, sqoop, etc.); Large Memory; Disk Persistence; SQL for direct loading; WS Clients; JDBC/MDX; API/WS; Multi-channels; Data Sources.
  • 16. Big Data is part of the Ecosystem (diagram). Source data: clickstream, social, DB, server logs, geo-location, sensor, text. Loading tools: Flume, Sqoop. Batch: Map Reduce, Hive ETL, Pig (data processing). Interactive: Hive/SQL. Online: HBase. Streaming: Storm. Other labels: HCatalog (table metadata), YARN, compute & storage, EDW, MPP, Use.
  • 17. Fraud Detection in 2013: Catching the Christmas Grinch
  • 18. Fraud Story Line: Old School
  • 19.
  • 20. Fighting Fraud – Using Rules & Known Patterns. Balance = $2000. Withdrawals: Charlotte, NC -$500; Atlanta, GA -$500; Dallas, TX -$500; Hong Kong -$500.
  • 21. Fighting Fraud - Anomaly Detection. We have a very simple data model. Each credit card transaction contains the following 4 attributes: 1. Transaction ID 2. Time of the day 3. Money spent 4. Vendor type. Here are some examples. The last one is an outlier, injected into the data set: YX66AJ9U, 1025, 20.47, Drug store; 98ZCM6B1, 1910, 55.50, Restaurant; XXXX7362, 0100, 1875.40, Jeweler store.
  • 22. Fighting Fraud - Predictive Analytics: Predictive, Descriptive, Decision. Predictive analytics is an area of statistical analysis that deals with extracting information from data and using it to predict future trends and behavior patterns (source: Wikipedia).
  • 23. Fighting Fraud - Social Network Analysis
  • 24.
  • 25. Additional Use Cases of Big Data in Financial Services
  • 26. 6 Key Hadoop DATA TYPES. 1. Sentiment: Understand how your customers feel about your brand and products – right now. 2. Clickstream: Capture and analyze website visitors’ data trails and optimize your website. 3. Sensor/Machine: Discover patterns in data streaming automatically from remote sensors and machines. 4. Geographic: Analyze location-based data to manage operations where they occur. 5. Server Logs: Research logs to diagnose process failures and prevent security breaches. 6. Text: Understand patterns in text across millions of web pages, emails, and documents.
  • 27. Big Data in Financial Services: • Insurance Underwriting • 360 Degree View of the Customer • Website optimization • Brand sentiment • New Account Risk Screening • Accelerate Loan Processing
  • 28. Insurance Underwriting (Financial Services). Data: Geo, Text. Business Problem: • Insurance companies hold massive amounts of unstructured, text-based claim data • Without analyzing both structured and unstructured data, insurance companies have an incomplete view of risk • Data scarcity leads to moral hazard: companies sell to risky customers, safer individuals stay out of the market. Solution: • HDP gives underwriters more statistical confidence • Store and use more data, from more sources, for longer • Sensor and geographic data at large scale give real underwriting info for car, home, crop and cargo insurance. © Hortonworks Inc. 2012
  • 29. Website Optimization (Financial Services). Data: Clickstream. Business Problem: • Online bankers leave a long trail of clickstream data • Clickstream data can tell which product pages customers visit and what interests them • The huge volume of unstructured weblogs is difficult to store, refine and analyze for insight • Storing log data in relational databases is too expensive. Solution: • HDP stores all web logs, for years, at a low cost • Banks use that to understand user paths, do basket analysis, run A/B tests and prioritize site updates • Improve customer service & reduce expense.
  • 30. 360° View of the Customer (Financial Services). Data: Clickstream, Text. Business Problem: • Banks interact with customers across multiple channels • Customer interaction and product subscription is often siloed • Few banks can correlate customer interactions with marketing campaigns and online browsing behavior • Merging data in relational databases is expensive. Solution: • HDP gives banks a 360° view of customer behavior • Store data longer & track phases of the customer lifecycle • Gain competitive advantage: increase sales, reduce service expense and retain the best customers.
  • 31. Next Steps
  • 32. Pactera Big Data Capability. Big Data Solution Architecture: • In-Memory Solutions • Scalable Distributed Platforms. Next Generation Analytics: • Models, Algorithms, and Simulations • Visualization. Improving Operational Ability: • Help companies drive more operational efficiencies from existing investments • Moving from the realm of data scientists into everyday business transactions and encounters. New Business Processes: • Impact on both customer intelligence and operational efficiency by making everything immediately actionable • Armed with immediate decision-making capability and intelligence, companies will be able to implement new business processes that will change how business is done • We ask the Right Questions.
  • 33. How Pactera can help with Big Data. Executive Workshop: Strategies, Planning, and Expectations: • Big Data strategy on what tomorrow will look like • Using Big Data to establish market dominance • Big Data project takeaways • Roadblocks to implementing Big Data analytics • Defining an ROI for Big Data • Getting the right ROI on Big Data. Technical Workshop: End-To-End Management: • System tuning/auto-tuning and configuration management • Dealing with both structured and unstructured data • Monitoring, diagnosis, and automated behavior detection. Solution Architecture: • Processor, memory, and system architectures for data analysis • Benchmarks, metrics, and workload characterization for big data • Availability, fault tolerance and recovery issues • Data management and analytics for vast amounts of unstructured data. Stages shown: Workshop (4 Hours); POC (2-4 Weeks); Pilot Concept (2-4 Weeks); Projects: • Benchmark & Monitoring • Integrations & Migrations • Implementation & Architecture • Project Management • Analytics • Reporting.
  • 34. Thank You. Kurt Lueck, Managing Director of BI & Analytics, Kurt.lueck@pactera.com; Chris Hackett, chackett@hortonworks.com; Ajay Singh, ajaysingh@hortonworks.com.
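
The rule on slide 20 (four $500 withdrawals from distant cities) can be sketched as an "impossible travel" check. This is only an illustration, not the presenters' implementation: the timestamps, the coordinates, the `Withdrawal` type and the 900 km/h speed ceiling are all assumptions introduced here. It compares consecutive withdrawals and flags any pair that would require travelling faster than a commercial flight, so a Charlotte to Atlanta hop stays legitimate while a jump to Hong Kong half an hour later does not.

```python
from dataclasses import dataclass
from datetime import datetime
from math import asin, cos, radians, sin, sqrt

# Assumed ceiling on plausible travel speed (roughly a commercial jet), in km/h.
MAX_SPEED_KMH = 900.0


@dataclass
class Withdrawal:
    city: str
    lat: float
    lon: float
    when: datetime
    amount: float


def distance_km(a, b):
    """Great-circle (haversine) distance between two withdrawals, in km."""
    dlat = radians(b.lat - a.lat)
    dlon = radians(b.lon - a.lon)
    h = sin(dlat / 2) ** 2 + cos(radians(a.lat)) * cos(radians(b.lat)) * sin(dlon / 2) ** 2
    return 2 * 6371.0 * asin(sqrt(h))


def impossible_travel(withdrawals):
    """Return consecutive city pairs that would require exceeding MAX_SPEED_KMH."""
    ordered = sorted(withdrawals, key=lambda w: w.when)
    flagged = []
    for prev, cur in zip(ordered, ordered[1:]):
        hours = (cur.when - prev.when).total_seconds() / 3600
        if distance_km(prev, cur) > MAX_SPEED_KMH * hours:
            flagged.append((prev.city, cur.city))
    return flagged


# Hypothetical timestamps and approximate coordinates for the four slide locations.
withdrawals = [
    Withdrawal("Charlotte, NC", 35.23, -80.84, datetime(2013, 12, 24, 9, 0), 500),
    Withdrawal("Atlanta, GA", 33.75, -84.39, datetime(2013, 12, 24, 10, 30), 500),
    Withdrawal("Dallas, TX", 32.78, -96.80, datetime(2013, 12, 24, 13, 30), 500),
    Withdrawal("Hong Kong", 22.32, 114.17, datetime(2013, 12, 24, 14, 0), 500),
]
print(impossible_travel(withdrawals))
```

With these assumed times, only the Dallas to Hong Kong pair is flagged. A fixed speed threshold is the simplest possible rule; it says nothing about whether a plausible but unusual itinerary is fraudulent, which is the gap the other detection methods in the deck aim to cover.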

Editor's Notes

  1. Good afternoon, and good morning on the West Coast.
  2. Pactera is a very large systems integrator with over 23k employees across 35 offices globally. Our services range from Advisory Services, BI & Analytics (which includes Big Data), CRM, and Digital Media to QA/Testing and Localization. We are an end-to-end consulting firm, both on-shore and off-shore. We are listed on the NASDAQ under the symbol PACT. My role in the organization is to lead the North America BI & Analytics practice.
  3. Make Hadoop an enterprise data platform: innovate core platform, data, & operational services; integrate deeply with the enterprise ecosystem; provide world-class enterprise support. Drive 100% open source software development and releases through the core Apache projects: address enterprise needs in community projects; establish Apache foundation projects as “the standard”; promote open community vs. vendor control / lock-in. Enable the Hadoop market to function: make it easy for enterprises to deploy at scale; be the best at enabling deep ecosystem integration; create a pull market with key strategic partners.
  5. We’re a plus one. We are here to interoperate and to help you get additional value out of your existing systems.
  6. This is like Red Hat.
  7. Additionally, we are a leading provider of Hadoop support through our Hortonworks University, with courses for both development and operations. If required, we can also provide expert consulting services, either ourselves or through our System Integrator partners. And for anyone looking to get their hands on Hadoop, we have recently introduced the Hadoop Sandbox program, which enables users to download a full instance of HDP together with guided tutorials covering both development and administration topics.
  8. Thanks, Chris. Let's look at Big Data in Financial Markets and how we approach projects.
  9. The first question, and one that I still get asked surprisingly often, is: why do I need Big Data? I have the answer down to two reasons: reduce cost, and do something you could not do before. For many large organizations, simply reducing cost, or at least holding it at the current level, was the deciding factor. One more large vendor appliance to store data was simply too expensive to continue. The more interesting projects are around doing things that organizations simply could not do, or were definitely struggling to do: things like a 360 Degree View of the Customer and Fraud Detection, both of which we will cover in detail in this webinar.
  10. Yes, I know adding Smart in front of something does not actually make you smart. But it is a great marketing ploy. Here at Pactera we are branding our industry solutions with the term Smart: Smart Commerce, Smart City, Smart Banking, and so on. The idea is that current solutions and technology will need a refresh. Big Data is such a game changer that current technology and business processes must be reviewed. The items highlighted in yellow are areas that we feel should be carefully reviewed for enhanced capabilities with Big Data technology. For example, we feel that new data models will emerge that incorporate our old way of storing data with new methods.
  11. Now, before you think that we have lost our minds: Big Data will not solve the world. I know even the Hortonworks team that is on the line will agree with me that Big Data is part of the solution, but there are many other existing and new technologies that are also part of the solution. I believe that in the next few years the lines will be blurred between “Big Data” and traditional database technologies. We believe that every business problem should be addressed with the right technology. Whenever a new technology springs up, there are those that try to use it for everything. Don’t. Look for technology vendors like Hortonworks that co-exist and play well with your existing vendors. At Pactera we strive to know the technologies beyond the hype. Take a polyglot approach. Use the best technology for the problem.
  12. OK, so for some of you this may be a new slide. Big Data has a lot of new and frankly kind of funny terms. The basic element is HDFS, which is the heart of Big Data. It is basically the storage of the data, and I think it is best understood by thinking of it in the same terms as your laptop. You take files and place them into a folder. You don’t care what is in the file, and you don’t build a structure before you put them into the folder. It is the exact same concept with Big Data. Now a quick run-through of some of the tools that are used to manipulate data. Flume: a tool to ingest files. Sqoop: a tool to get data from or put data into databases like Oracle or Microsoft. Hive: a tool for people like myself who want to get data using SQL-style queries (HiveQL). Pig: a scripting language much like T-SQL or PL/SQL or even Python, which can be extended with Java, Python, and other languages. YARN is a new concept in Hadoop 2.0, but I will leave that for another webinar. Just know that it makes Hadoop scalable and flexible. Alright, so let's move into our first use case.
  13. Perhaps there really is no such thing as easy money. Based on declining bank robbery statistics, criminals seem to be realizing that it’s hard to make a living by following in the footsteps of Bonnie and Clyde. In 2009, there were no fewer than 22 bank robberies in a trio of counties centered on Augusta, Georgia. “It felt like we were the bank robbery capital of the world that year,” Capt. Troy Elwell, of the Aiken County Sheriff’s department, recently told the Augusta Chronicle. Last year, however, there were “just” eight bank robberies reported in the same area. In fact, the paper noted, the number of bank robberies around the country has been falling steadily for years: according to the FBI, bank holdups have dropped nearly every year since 2003, when nearly 7,500 robberies were reported nationwide with $77 million taken. In 2011 – the last complete year for data – about 5,000 banks reported robberies with $38 million stolen. So where are they all going? You guessed it: electronic and quite sophisticated. Easier money, and the sentencing is much shorter. There are many ways a bank can be defrauded, but let's focus our discussion on a commonly understood but difficult to solve problem: credit card or ATM fraud.
  14. So, moving across the top, there are four buckets of methods to detect fraud: rules-based detection, anomaly detection, predictive analysis, and social network analysis. Why is Big Data part of the solution? The main reason: more data enables more analysis, both in real time and over time. If you are thinking “I thought Big Data was too slow for this type of application,” you are somewhat correct. On its own, Hadoop is a bit slow for something real-time, but with projects like Stinger and hybrid in-memory approaches this is a reality today. Which brings me to the final comment on this page. Financial institutions must take a hybrid approach to fraud, which may start by enhancing your data types. Ultimately, all financial institutions will need to build Big Data solutions into their current IT ecosystem. Let’s break down these 4 types of fraud detection and look at how Big Data can help.
  15. Rules-based fraud detection is the simplest to understand and implement. Every bank has some form of this in place. Simple rules. For example, a rule that states that you cannot simultaneously take out $500 from 4 different locations, especially if there is no way that you could be in all 4 locations at the same time. You could have several such rules. But this problem is a bit more tricky. What if I took a flight from CLT to ATL? That is 45 minutes from one airport to another, so a withdrawal at each end is a very logical pair of transactions. I could then board another flight and within 2 hours or so make another withdrawal. Am I doing something wrong, or am I simply a world traveller taking the longest possible way to China?
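To make the "could you really be in both places?" rule concrete, here is a minimal sketch of one way such a check could be written. It is not taken from the slides: the `Withdrawal` type, the function names, and the 900 km/h speed ceiling (roughly airliner cruising speed) are all assumptions made for illustration.

```python
from dataclasses import dataclass
from datetime import datetime
from math import asin, cos, radians, sin, sqrt

# Assumed ceiling: roughly the cruising speed of a commercial airliner.
MAX_SPEED_KMH = 900.0


@dataclass
class Withdrawal:
    """Hypothetical record of one ATM withdrawal."""
    when: datetime
    lat: float
    lon: float


def distance_km(a: Withdrawal, b: Withdrawal) -> float:
    """Great-circle (haversine) distance between two withdrawals, in km."""
    dlat = radians(b.lat - a.lat)
    dlon = radians(b.lon - a.lon)
    h = (sin(dlat / 2) ** 2
         + cos(radians(a.lat)) * cos(radians(b.lat)) * sin(dlon / 2) ** 2)
    return 2 * 6371.0 * asin(sqrt(h))


def is_impossible_travel(prev: Withdrawal, curr: Withdrawal,
                         max_speed_kmh: float = MAX_SPEED_KMH) -> bool:
    """Flag the pair if the implied travel speed exceeds the ceiling."""
    hours = (curr.when - prev.when).total_seconds() / 3600
    km = distance_km(prev, curr)
    if hours <= 0:
        # No time elapsed (or records out of order): flag anything
        # more than about 1 km apart.
        return km > 1.0
    return km / hours > max_speed_kmh
```

With approximate coordinates for CLT and ATL, two withdrawals 10 minutes apart would be flagged, while the same pair 2 hours apart would pass. That is exactly why the traveller case is tricky: a fixed rule like this can only say a trip is physically possible, not that it is legitimate.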
  16. The next item is something that we are all familiar with. Why are we familiar? Because it is not working well enough... YET. Hence the fact that we all get our cards rejected. So let’s look at this basic example here. We have a number of transactions, and then the 3rd is out of the ordinary. We are looking for outliers: data that do not conform to the normal and expected patterns. The criteria for what constitutes an outlier depend on the problem domain. Big Data is needed for the back-end processing because:
      - It typically involves large amounts of data. Think millions upon millions of credit card transactions.
      - Much of the data may be unstructured.
      Some anomalies are easy to detect: size of transaction, location, time. Outlier detection can operate on two kinds of data:
      - Instance data, where the outlier detection algorithm operates on an individual instance of data, e.g., a particular credit transaction involving a large amount of money spent on an unusual product.
      - Sequence data with a temporal or spatial relationship, where the goal of outlier detection is to find an unusual sequence, e.g., intrusion detection and cyber security.
      A quick discussion of how this works: Hadoop is used to continually build your “normal.” Your normal is then stored in an in-memory type of solution that active transactions can be bounced against. Non-normal means a shutdown on your credit card and a series of events that usually involves a phone call. But this leads us to our 3rd example.
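The "build your normal, then bounce live transactions against it" idea can be sketched in a few lines. This is only an illustration under stated assumptions: a plain dictionary stands in for the in-memory store, the batch step that would run on Hadoop is an ordinary function, and a simple z-score on the transaction amount stands in for whatever outlier test a real system would use. The names `build_normal` and `is_outlier` and the threshold of 3 standard deviations are hypothetical.

```python
from statistics import mean, stdev


def build_normal(history):
    """Batch step: summarize each card's past amounts as (mean, stdev).

    `history` maps a card id to a list of past transaction amounts.
    Cards with fewer than two transactions get no baseline yet.
    """
    return {card: (mean(amounts), stdev(amounts))
            for card, amounts in history.items()
            if len(amounts) >= 2}


def is_outlier(profiles, card, amount, threshold=3.0):
    """Live step: compare one transaction against the stored baseline."""
    if card not in profiles:
        return False  # no baseline to compare against
    mu, sigma = profiles[card]
    if sigma == 0:
        return amount != mu  # every past amount was identical
    return abs(amount - mu) / sigma > threshold


# Usage: build the baseline once, then check incoming amounts against it.
profiles = build_normal({"card-1": [20, 25, 22, 30, 18, 27]})
print(is_outlier(profiles, "card-1", 24))
print(is_outlier(profiles, "card-1", 900))
```

A single amount-based score is the simplest possible instance check. Location, time of day, and merchant type would each need their own baseline, and sequence data needs a different kind of model altogether.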
  17. The next level is predictive analytics. When someone goes from mundane purchases to high-priced dinners and gifts, are they in love, or is the card stolen? Using anomaly techniques we have been able to detect the outlier. But how do we know whether it’s a fraudulent transaction or an emerging buying pattern? Your credit card may have been compromised and someone is using it. Or you have fallen in love and decided to shower him or her with expensive, high-ticket items. We can’t really tell the difference, except that once there are enough data points for this emerging behavior, we won’t be getting these false positives from our analysis. This leads to the 3rd bucket, which is predictive analytics.
      Predictive models: Predictive models analyze past performance to assess how likely a customer is to exhibit a specific behavior in the future in order to improve marketing effectiveness. This category also encompasses models that seek out subtle data patterns to answer questions about customer performance, such as fraud detection models. Predictive models often perform calculations during live transactions, for example, to evaluate the risk or opportunity of a given customer or transaction, in order to guide a decision. With advancement in computing speed, individual agent modeling systems can simulate human behavior or reaction to given stimuli or scenarios. The new term for animating data specifically linked to an individual in a simulated environment is avatar analytics.
      Descriptive models: Descriptive models quantify relationships in data in a way that is often used to classify customers or prospects into groups. Unlike predictive models that focus on predicting a single customer behavior (such as credit risk), descriptive models identify many different relationships between customers or products. Descriptive models do not rank-order customers by their likelihood of taking a particular action the way predictive models do. Descriptive models can be used, for example, to categorize customers by their product preferences and life stage. Descriptive modeling tools can be utilized to develop further models that can simulate large numbers of individualized agents and make predictions.
      Decision models: Decision models describe the relationship between all the elements of a decision (the known data, including results of predictive models, the decision, and the forecast results of the decision) in order to predict the results of decisions involving many variables. These models can be used in optimization, maximizing certain outcomes while minimizing others. Decision models are generally used to develop decision logic or a set of business rules that will produce the desired action for every customer or circumstance.
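As a rough illustration of how a decision model sits on top of a predictive model, the sketch below picks whichever action has the lower expected cost. Everything in it is an assumption for the sake of the example: the fraud probability is taken as given (it would come from a predictive model that is not shown), and the cost of declining a genuine customer (`friction_cost`) is a made-up figure.

```python
def expected_costs(p_fraud, amount, friction_cost=5.0):
    """Expected cost of each action for one transaction.

    p_fraud:       fraud probability from a predictive model (assumed given)
    amount:        transaction amount at risk if fraud is approved
    friction_cost: hypothetical cost of declining a genuine customer
    """
    return {
        "approve": p_fraud * amount,
        "decline": (1 - p_fraud) * friction_cost,
    }


def decide(p_fraud, amount, friction_cost=5.0):
    """Business rule: choose the action with the lowest expected cost."""
    costs = expected_costs(p_fraud, amount, friction_cost)
    return min(costs, key=costs.get)


# Usage: a low-risk small purchase versus a coin-flip large one.
print(decide(0.01, 50))
print(decide(0.5, 1000))
```

The point of the design is that the same predicted probability can lead to different actions depending on what is at stake, which is what distinguishes decision logic from the prediction itself.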
  18. Knowledge discovery through associative link analysis. You may think this is a bit futuristic, but I actually stole this graphic from something that was done in 2002. What if I could store everything possible about you: your known business relationships, your friends, etc.? What if I picked up the fact that you were just indicted for fraud? I then blacklist you. BUT I also build a list of your known acquaintances and put them all on a list of highly monitored individuals. In other words, I now EXPECT them to try something, so anything even close to out of the ordinary is shut down immediately. Far-fetched? Not at all. Does this require Big Data? Yes.
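The blacklist-plus-watch-list idea is a graph traversal at heart. Here is a minimal sketch, assuming relationships are already available as a dictionary mapping each person to the set of people they are linked to (with every link stored in both directions). The function name `watch_list`, the `max_hops` parameter, and the sample names are hypothetical.

```python
from collections import deque


def watch_list(links, flagged, max_hops=1):
    """Return everyone within `max_hops` links of the flagged person.

    links:    dict mapping a person to the set of their known associates
    flagged:  the blacklisted person (not included in the result)
    max_hops: how far out the monitoring should reach
    """
    seen = {flagged}
    watched = set()
    queue = deque([(flagged, 0)])
    while queue:
        person, hops = queue.popleft()
        if hops == max_hops:
            continue  # do not expand beyond the hop limit
        for other in links.get(person, ()):
            if other not in seen:
                seen.add(other)
                watched.add(other)
                queue.append((other, hops + 1))
    return watched


# Usage with a tiny hypothetical relationship graph.
links = {
    "alice": {"bob", "carol"},
    "bob": {"alice", "dave"},
    "carol": {"alice"},
    "dave": {"bob"},
}
print(sorted(watch_list(links, "alice")))
print(sorted(watch_list(links, "alice", max_hops=2)))
```

On a graph this small the traversal is trivial. The Big Data part is building and maintaining the links themselves across millions of customers and many data sources.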
  19. What does a Big Data architecture look like to support these 4 fraud detection methods? This is a sample. As you can see, moving from left to right, we are ingesting a wide variety of data, in large volume, at high velocity. We need several different methods of data ingestion. On the far right we have a variety of tools to put the data to use, ranging from investigation to visual analytics. Do you notice the data hubs running along the middle? These are going to be used by real-time engines to validate transactions. Alright, there is lots more that we could talk about on this slide, but we need to move on to another topic, probably the hottest topic within many industries: the elusive 360 degree view of the customer. Ajay, all yours.
  20. Early on in the presentation, Hortonworks explained the value that they can provide. Hortonworks has some fantastic training classes. I know because I have attended some of them. Check out their website under training and education for more details. Pactera provides a full set of services within this space. We have Hortonworks-certified resources who can help you with any of your projects. Our service offerings range from architecture and installation to projects and maintenance.
  21. Pactera offers a complete lifecycle solution within your organization. We offer a free 4-hour executive and technical workshop within your organization. We just ask you to fill out a 1-page questionnaire to help us understand your expectations. The executive workshop covers strategy, planning, and your current and future goals. The technical workshop is a deep dive involving end-to-end management and a proper solution architecture based on your current and upcoming goals. Once the workshops are complete, we will provide you with an assessment of the outcome. Many of our clients initially engage us for a 2 to 4 week pilot to ensure their project is put into action. And finally, we offer full lifecycle services in the following areas:
      - Benchmark & Monitoring
      - Integrations & Migrations
      - Implementation & Architecture
      - Project Management
      - Analytics
      - Reporting
      We can perform these efforts both on-shore and off-shore.