Submit Search
Upload
Investigative Analytics Tools For Multi-Structured Big Data
•
6 likes
•
2,010 views
AI-enhanced title
Data Science London
Follow
Mike Ferguson CEO Intelligent Business Strategies talk at Data Science London @ds_ldn
Read less
Read more
Technology
Business
Report
Share
Report
Share
1 of 16
Recommended
Left Brain, Right Brain: How to Unify Enterprise Analytics
Left Brain, Right Brain: How to Unify Enterprise Analytics
Inside Analysis
01 im overview high level
01 im overview high level
James Findlay
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
Ajay Ohri
Business Intelligence with Microsoft SQL 2014 - Presented by Atidan
Business Intelligence with Microsoft SQL 2014 - Presented by Atidan
David J Rosenthal
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Mark Heid
Powering Next Generation Data Architecture With Apache Hadoop
Powering Next Generation Data Architecture With Apache Hadoop
Hortonworks
Big Data Forum - Phoenix
Big Data Forum - Phoenix
Krishnan Parasuraman
Building the enterprise data architecture
Building the enterprise data architecture
Costa Pissaris
Recommended
Left Brain, Right Brain: How to Unify Enterprise Analytics
Left Brain, Right Brain: How to Unify Enterprise Analytics
Inside Analysis
01 im overview high level
01 im overview high level
James Findlay
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
Ajay Ohri
Business Intelligence with Microsoft SQL 2014 - Presented by Atidan
Business Intelligence with Microsoft SQL 2014 - Presented by Atidan
David J Rosenthal
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Mark Heid
Powering Next Generation Data Architecture With Apache Hadoop
Powering Next Generation Data Architecture With Apache Hadoop
Hortonworks
Big Data Forum - Phoenix
Big Data Forum - Phoenix
Krishnan Parasuraman
Building the enterprise data architecture
Building the enterprise data architecture
Costa Pissaris
IBM Stream au Hadoop User Group
IBM Stream au Hadoop User Group
Modern Data Stack France
Big data ibm keynote d advani presentation
Big data ibm keynote d advani presentation
MassTLC
IP&A109 Next-Generation Analytics Architecture for the Year 2020
IP&A109 Next-Generation Analytics Architecture for the Year 2020
Anjan Roy, PMP
Data Architecture Process in a BI environment
Data Architecture Process in a BI environment
Sasha Citino
IBM-Why Big Data?
IBM-Why Big Data?
Kun Le
All Together Now: A Recipe for Successful Data Governance
All Together Now: A Recipe for Successful Data Governance
Inside Analysis
Enterprise Master Data Architecture
Enterprise Master Data Architecture
Boris Otto
Enterprise Master Data Architecture: Design Decisions and Options
Enterprise Master Data Architecture: Design Decisions and Options
Boris Otto
Analyse prédictive en assurance santé par Julien Cabot
Analyse prédictive en assurance santé par Julien Cabot
Modern Data Stack France
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]
Rhapsody Technologies, Inc.
RFT for Business Intelligence and Data Strategy
RFT for Business Intelligence and Data Strategy
SustainableEnergyAut
Big Data World Forum
Big Data World Forum
bigdatawf
Data-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse Strategies
DATAVERSITY
The New Enterprise Data Platform
The New Enterprise Data Platform
Krishnan Parasuraman
Teradata Overview
Teradata Overview
Teradata
DATAWEEK KEYNOTE: LARGE SCALE SEARCH, DISCOVERY AND ANALYSIS IN ACTION
DATAWEEK KEYNOTE: LARGE SCALE SEARCH, DISCOVERY AND ANALYSIS IN ACTION
ivan provalov
Big Data Whitepaper - Streams and Big Insights Integration Patterns
Big Data Whitepaper - Streams and Big Insights Integration Patterns
Mauricio Godoy
Data-Ed Online: Data Architecture Requirements
Data-Ed Online: Data Architecture Requirements
DATAVERSITY
Record manager 8.0 presentation
Record manager 8.0 presentation
Andrey Karpov
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Denodo
Research at last.fm
Research at last.fm
Data Science London
LA TECNOLOGIA COMO APOYO EN EL PROCESO ENSEÑANZA-APRENDIZAJE
LA TECNOLOGIA COMO APOYO EN EL PROCESO ENSEÑANZA-APRENDIZAJE
superveromena
More Related Content
What's hot
IBM Stream au Hadoop User Group
IBM Stream au Hadoop User Group
Modern Data Stack France
Big data ibm keynote d advani presentation
Big data ibm keynote d advani presentation
MassTLC
IP&A109 Next-Generation Analytics Architecture for the Year 2020
IP&A109 Next-Generation Analytics Architecture for the Year 2020
Anjan Roy, PMP
Data Architecture Process in a BI environment
Data Architecture Process in a BI environment
Sasha Citino
IBM-Why Big Data?
IBM-Why Big Data?
Kun Le
All Together Now: A Recipe for Successful Data Governance
All Together Now: A Recipe for Successful Data Governance
Inside Analysis
Enterprise Master Data Architecture
Enterprise Master Data Architecture
Boris Otto
Enterprise Master Data Architecture: Design Decisions and Options
Enterprise Master Data Architecture: Design Decisions and Options
Boris Otto
Analyse prédictive en assurance santé par Julien Cabot
Analyse prédictive en assurance santé par Julien Cabot
Modern Data Stack France
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]
Rhapsody Technologies, Inc.
RFT for Business Intelligence and Data Strategy
RFT for Business Intelligence and Data Strategy
SustainableEnergyAut
Big Data World Forum
Big Data World Forum
bigdatawf
Data-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse Strategies
DATAVERSITY
The New Enterprise Data Platform
The New Enterprise Data Platform
Krishnan Parasuraman
Teradata Overview
Teradata Overview
Teradata
DATAWEEK KEYNOTE: LARGE SCALE SEARCH, DISCOVERY AND ANALYSIS IN ACTION
DATAWEEK KEYNOTE: LARGE SCALE SEARCH, DISCOVERY AND ANALYSIS IN ACTION
ivan provalov
Big Data Whitepaper - Streams and Big Insights Integration Patterns
Big Data Whitepaper - Streams and Big Insights Integration Patterns
Mauricio Godoy
Data-Ed Online: Data Architecture Requirements
Data-Ed Online: Data Architecture Requirements
DATAVERSITY
Record manager 8.0 presentation
Record manager 8.0 presentation
Andrey Karpov
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Denodo
What's hot
(20)
IBM Stream au Hadoop User Group
IBM Stream au Hadoop User Group
Big data ibm keynote d advani presentation
Big data ibm keynote d advani presentation
IP&A109 Next-Generation Analytics Architecture for the Year 2020
IP&A109 Next-Generation Analytics Architecture for the Year 2020
Data Architecture Process in a BI environment
Data Architecture Process in a BI environment
IBM-Why Big Data?
IBM-Why Big Data?
All Together Now: A Recipe for Successful Data Governance
All Together Now: A Recipe for Successful Data Governance
Enterprise Master Data Architecture
Enterprise Master Data Architecture
Enterprise Master Data Architecture: Design Decisions and Options
Enterprise Master Data Architecture: Design Decisions and Options
Analyse prédictive en assurance santé par Julien Cabot
Analyse prédictive en assurance santé par Julien Cabot
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]
RFT for Business Intelligence and Data Strategy
RFT for Business Intelligence and Data Strategy
Big Data World Forum
Big Data World Forum
Data-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse Strategies
The New Enterprise Data Platform
The New Enterprise Data Platform
Teradata Overview
Teradata Overview
DATAWEEK KEYNOTE: LARGE SCALE SEARCH, DISCOVERY AND ANALYSIS IN ACTION
DATAWEEK KEYNOTE: LARGE SCALE SEARCH, DISCOVERY AND ANALYSIS IN ACTION
Big Data Whitepaper - Streams and Big Insights Integration Patterns
Big Data Whitepaper - Streams and Big Insights Integration Patterns
Data-Ed Online: Data Architecture Requirements
Data-Ed Online: Data Architecture Requirements
Record manager 8.0 presentation
Record manager 8.0 presentation
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Viewers also liked
Research at last.fm
Research at last.fm
Data Science London
LA TECNOLOGIA COMO APOYO EN EL PROCESO ENSEÑANZA-APRENDIZAJE
LA TECNOLOGIA COMO APOYO EN EL PROCESO ENSEÑANZA-APRENDIZAJE
superveromena
EFOW/ LERCPA: Leaders of Energy without Borders. On our way to 100% renewables.
EFOW/ LERCPA: Leaders of Energy without Borders. On our way to 100% renewables.
Energy for One World
"Telling Stories about People and Their Influence" Ferenc Huszár @ds_ldn
"Telling Stories about People and Their Influence" Ferenc Huszár @ds_ldn
Data Science London
Gradle_ToursJUG
Gradle_ToursJUG
Gregory Boissinot
Utilización y selección
Utilización y selección
weticsblog
ortodoncia
ortodoncia
JULY PAILINA SALDARRIAGA VELEZ
"Human Cloning: The Data Scientist Bottleneck Resolved" Dr. Alex Farquhar @ds...
"Human Cloning: The Data Scientist Bottleneck Resolved" Dr. Alex Farquhar @ds...
Data Science London
M2 actividad2 10
M2 actividad2 10
Escuela de educación media superior
Mais cultura
Mais cultura
EDUCADOR EM HISTÓRIA...
Autonomous Discovery: The New Interface?
Autonomous Discovery: The New Interface?
Data Science London
Fico success story
Fico success story
Michal Friedrich
Sarwat Jahan_cv
Sarwat Jahan_cv
Sarwat Jahan
(Inter)national Facades: International Facade Master: WHY? by Arie Bergsma (2...
(Inter)national Facades: International Facade Master: WHY? by Arie Bergsma (2...
Jasper Moelker
Super-Fast Clustering Report in MapR
Super-Fast Clustering Report in MapR
Data Science London
PetersonSierra_Interface_SustainabilityPoster
PetersonSierra_Interface_SustainabilityPoster
Sierra Peterson
Lines and angles ( Class 6-7 )
Lines and angles ( Class 6-7 )
romilkharia
aparatologia ortodontica
aparatologia ortodontica
JULY PAILINA SALDARRIAGA VELEZ
JENKINS_BreizhJUG_20111003
JENKINS_BreizhJUG_20111003
Gregory Boissinot
Solucionario del primer examen con ingreso directo de la PRE SAN MARCOS ciclo...
Solucionario del primer examen con ingreso directo de la PRE SAN MARCOS ciclo...
Mery Lucy Flores M.
Viewers also liked
(20)
Research at last.fm
Research at last.fm
LA TECNOLOGIA COMO APOYO EN EL PROCESO ENSEÑANZA-APRENDIZAJE
LA TECNOLOGIA COMO APOYO EN EL PROCESO ENSEÑANZA-APRENDIZAJE
EFOW/ LERCPA: Leaders of Energy without Borders. On our way to 100% renewables.
EFOW/ LERCPA: Leaders of Energy without Borders. On our way to 100% renewables.
"Telling Stories about People and Their Influence" Ferenc Huszár @ds_ldn
"Telling Stories about People and Their Influence" Ferenc Huszár @ds_ldn
Gradle_ToursJUG
Gradle_ToursJUG
Utilización y selección
Utilización y selección
ortodoncia
ortodoncia
"Human Cloning: The Data Scientist Bottleneck Resolved" Dr. Alex Farquhar @ds...
"Human Cloning: The Data Scientist Bottleneck Resolved" Dr. Alex Farquhar @ds...
M2 actividad2 10
M2 actividad2 10
Mais cultura
Mais cultura
Autonomous Discovery: The New Interface?
Autonomous Discovery: The New Interface?
Fico success story
Fico success story
Sarwat Jahan_cv
Sarwat Jahan_cv
(Inter)national Facades: International Facade Master: WHY? by Arie Bergsma (2...
(Inter)national Facades: International Facade Master: WHY? by Arie Bergsma (2...
Super-Fast Clustering Report in MapR
Super-Fast Clustering Report in MapR
PetersonSierra_Interface_SustainabilityPoster
PetersonSierra_Interface_SustainabilityPoster
Lines and angles ( Class 6-7 )
Lines and angles ( Class 6-7 )
aparatologia ortodontica
aparatologia ortodontica
JENKINS_BreizhJUG_20111003
JENKINS_BreizhJUG_20111003
Solucionario del primer examen con ingreso directo de la PRE SAN MARCOS ciclo...
Solucionario del primer examen con ingreso directo de la PRE SAN MARCOS ciclo...
Similar to Investigative Analytics Tools For Multi-Structured Big Data
Simplifying Big Data Analytics for the Business
Simplifying Big Data Analytics for the Business
Teradata Aster
When Worlds Collide: Intelligence, Analytics and Operations
When Worlds Collide: Intelligence, Analytics and Operations
Inside Analysis
Teradata Big Data London Seminar
Teradata Big Data London Seminar
Hortonworks
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Calpont Corporation
Big Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the Future
Odinot Stanislas
Enterprise Services Solutions
Enterprise Services Solutions
Karya Technologies
A Strategic View of Enterprise Reporting and Analytics: The Data Funnel
A Strategic View of Enterprise Reporting and Analytics: The Data Funnel
Inside Analysis
Hortonworks roadshow
Hortonworks roadshow
Accenture
Intersection of Business Intelligence and CRM vsr12
Intersection of Business Intelligence and CRM vsr12
David J Rosenthal
Evaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics Platforms
Teradata Aster
Ibm big dataibm marriage of hadoop and data warehousing
Ibm big dataibm marriage of hadoop and data warehousing
DataWorks Summit
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent Enterprise
Databricks
ActuateOne for Utility Analytics
ActuateOne for Utility Analytics
katsoulis
Analyze This! Best Practices For Big And Fast Data
Analyze This! Best Practices For Big And Fast Data
EMC
An Overview of BigData
An Overview of BigData
Valarmathi V
Unlocking value in your (big) data
Unlocking value in your (big) data
Oscar Renalias
The Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information Architecture
Inside Analysis
SAP Explorer Visual Intelligence
SAP Explorer Visual Intelligence
Eric Molner
Big data and you
Big data and you
IBM
The New Normal: Predictive Power on the Front Lines
The New Normal: Predictive Power on the Front Lines
Inside Analysis
Similar to Investigative Analytics Tools For Multi-Structured Big Data
(20)
Simplifying Big Data Analytics for the Business
Simplifying Big Data Analytics for the Business
When Worlds Collide: Intelligence, Analytics and Operations
When Worlds Collide: Intelligence, Analytics and Operations
Teradata Big Data London Seminar
Teradata Big Data London Seminar
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Big Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the Future
Enterprise Services Solutions
Enterprise Services Solutions
A Strategic View of Enterprise Reporting and Analytics: The Data Funnel
A Strategic View of Enterprise Reporting and Analytics: The Data Funnel
Hortonworks roadshow
Hortonworks roadshow
Intersection of Business Intelligence and CRM vsr12
Intersection of Business Intelligence and CRM vsr12
Evaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics Platforms
Ibm big dataibm marriage of hadoop and data warehousing
Ibm big dataibm marriage of hadoop and data warehousing
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent Enterprise
ActuateOne for Utility Analytics
ActuateOne for Utility Analytics
Analyze This! Best Practices For Big And Fast Data
Analyze This! Best Practices For Big And Fast Data
An Overview of BigData
An Overview of BigData
Unlocking value in your (big) data
Unlocking value in your (big) data
The Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information Architecture
SAP Explorer Visual Intelligence
SAP Explorer Visual Intelligence
Big data and you
Big data and you
The New Normal: Predictive Power on the Front Lines
The New Normal: Predictive Power on the Front Lines
More from Data Science London
Standardizing +113 million Merchant Names in Financial Services with Greenplu...
Standardizing +113 million Merchant Names in Financial Services with Greenplu...
Data Science London
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Data Science London
Real-Time Queries in Hadoop w/ Cloudera Impala
Real-Time Queries in Hadoop w/ Cloudera Impala
Data Science London
Nowcasting Business Performance
Nowcasting Business Performance
Data Science London
Numpy, the Python foundation for number crunching
Numpy, the Python foundation for number crunching
Data Science London
Python pandas workshop iPython notebook (163 pages)
Python pandas workshop iPython notebook (163 pages)
Data Science London
Big Practical Recommendations with Alternating Least Squares
Big Practical Recommendations with Alternating Least Squares
Data Science London
Bringing back the excitement to data analysis
Bringing back the excitement to data analysis
Data Science London
Survival Analysis of Web Users
Survival Analysis of Web Users
Data Science London
ACM RecSys 2012: Recommender Systems, Today
ACM RecSys 2012: Recommender Systems, Today
Data Science London
Beyond Accuracy: Goal-Driven Recommender Systems Design
Beyond Accuracy: Goal-Driven Recommender Systems Design
Data Science London
Machine Learning and Hadoop: Present and Future
Machine Learning and Hadoop: Present and Future
Data Science London
Data Science for Live Music
Data Science for Live Music
Data Science London
Music and Data: Adding Up the UK Music Industry
Music and Data: Adding Up the UK Music Industry
Data Science London
Scientific Article Recommendations with Mahout
Scientific Article Recommendations with Mahout
Data Science London
Simple Matrix Factorization for Recommendation in Mahout
Simple Matrix Factorization for Recommendation in Mahout
Data Science London
Going Real-Time with Mahout, Predicting gender of Facebook Users
Going Real-Time with Mahout, Predicting gender of Facebook Users
Data Science London
Practical Magic with Incanter
Practical Magic with Incanter
Data Science London
Understanding Cause & Effect in Customer Behaviour
Understanding Cause & Effect in Customer Behaviour
Data Science London
Bootstrapping Data Science
Bootstrapping Data Science
Data Science London
More from Data Science London
(20)
Standardizing +113 million Merchant Names in Financial Services with Greenplu...
Standardizing +113 million Merchant Names in Financial Services with Greenplu...
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Real-Time Queries in Hadoop w/ Cloudera Impala
Real-Time Queries in Hadoop w/ Cloudera Impala
Nowcasting Business Performance
Nowcasting Business Performance
Numpy, the Python foundation for number crunching
Numpy, the Python foundation for number crunching
Python pandas workshop iPython notebook (163 pages)
Python pandas workshop iPython notebook (163 pages)
Big Practical Recommendations with Alternating Least Squares
Big Practical Recommendations with Alternating Least Squares
Bringing back the excitement to data analysis
Bringing back the excitement to data analysis
Survival Analysis of Web Users
Survival Analysis of Web Users
ACM RecSys 2012: Recommender Systems, Today
ACM RecSys 2012: Recommender Systems, Today
Beyond Accuracy: Goal-Driven Recommender Systems Design
Beyond Accuracy: Goal-Driven Recommender Systems Design
Machine Learning and Hadoop: Present and Future
Machine Learning and Hadoop: Present and Future
Data Science for Live Music
Data Science for Live Music
Music and Data: Adding Up the UK Music Industry
Music and Data: Adding Up the UK Music Industry
Scientific Article Recommendations with Mahout
Scientific Article Recommendations with Mahout
Simple Matrix Factorization for Recommendation in Mahout
Simple Matrix Factorization for Recommendation in Mahout
Going Real-Time with Mahout, Predicting gender of Facebook Users
Going Real-Time with Mahout, Predicting gender of Facebook Users
Practical Magic with Incanter
Practical Magic with Incanter
Understanding Cause & Effect in Customer Behaviour
Understanding Cause & Effect in Customer Behaviour
Bootstrapping Data Science
Bootstrapping Data Science
Recently uploaded
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
DianaGray10
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
Sergiu Bodiu
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
LoriGlavin3
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
hariprasad279825
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
mohitsingh558521
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
Dilum Bandara
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
Fwdays
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
BookNet Canada
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
gvaughan
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
LoriGlavin3
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
Nathaniel Shimoni
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
Fwdays
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
BookNet Canada
Training state-of-the-art general text embedding
Training state-of-the-art general text embedding
Zilliz
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
LoriGlavin3
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
Fwdays
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
BkGupta21
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
ScyllaDB
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
LoriGlavin3
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
Curtis Poe
Recently uploaded
(20)
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Training state-of-the-art general text embedding
Training state-of-the-art general text embedding
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
Investigative Analytics Tools For Multi-Structured Big Data
1.
26/04/2012
Investigative Analytics - What's In The Data Scientists’ Toolkit Mike Ferguson Managing Director Intelligent Business Strategies Data Science London April 2012 About Mike Ferguson Mike Ferguson is Managing Director of Intelligent Business Strategies Limited. As an analyst and consultant he specializes in business intelligence, data management and enterprise business integration. With over 30 years of IT experience, Mike has consulted for dozens of companies, spoken at events all over the world and written numerous articles. He is an expert on the B-EYE-Network. Formerly he was a principal and co-founder of Codd and Date Europe Limited – the inventors of the Relational Model, a Chief Architect at Teradata on the Teradata www.intelligentbusiness.biz DBMS and European Managing Director of mferguson@intelligentbusiness.biz DataBase Associates. Twitter: @mikeferguson1 Tel/Fax (+44)1625 520700 2 Copyright © Intelligent Business Strategies 2012 - All Rights Reserved 1
2.
26/04/2012
Topics Big Data Workloads Data science tools for near real-time analytics Data science tools for investigative analytics of multi-structured data Data science tools for investigative analytics of structured data Trends in a fast moving Big Data marketplace Governance of data science projects 3 The Application Processing Spectrum - Big Data Is Pushing Storage Options Towards Optimized Systems Source: BI-Research Copyright © BI-Research, 2011 4 Copyright © Intelligent Business Strategies 2012 - All Rights Reserved 2
3.
26/04/2012
Big Data Processing – The Number of Data Stores Optimized for Operational or Analytical Workloads Is Growing • ACID support missing in some NoSQL DBMSs Analytical RDBMS • Can you live with losing a transaction? • OK for sensor data for example OLTP RDBMS NoSQL DBMS NoSQL 5 Data Science Tools – Different Analytical Workloads Need Different Tools Some tools work across multiple platforms Analytical Analytical Analytical Analytical tools tools tools tools streaming data Data Data Data Data management management management management tools tools tools tools CRM ERP SCM Machine generated, markets data, sensors RDF/OWL 6 Copyright © Intelligent Business Strategies 2012 - All Rights Reserved 3
4.
26/04/2012
Data Science Tools – Near Real-Time Analytics On Data In Motion Stream analytics / CEP Workload analytical Near real-time automated characteristics analytics on text or semi- structured data Data characteristics Highly volatile data-in-motion, streaming data large volumes Product Examples IBM InfoSphere Streams, Stream Informatica RulePoint analytics / Trends CEP vendors moving to analyse CEP text as well as structured data Some CEP vendors may get acquired Machine generated, markets data, sensors 7 Trends – Streaming Event Data Can Also Be Stored In Hadoop or DW Appliance Analytical tools streaming data Data management tools Machine generated, markets data, sensors 8 Copyright © Intelligent Business Strategies 2012 - All Rights Reserved 4
5.
26/04/2012
Data Science Tools - Investigative Analytics on Multi- Structured Data In Hadoop (various distributions) Workload analytical Investigative analysis characteristics Analytical Data characteristics Up to very large volumes of tools multi-structured data (Variety) Data management E.g. Informatica HParser, tools Pentaho ETL, Pervasive, Talend ETL Studio Analysis Batch analytics: Custom MapReduce apps with Data Mahout and R management BI Tools (MapReduce) tools Karmasphere, Datameer IBM Cognos Content Analytics, BI Tools (Search Based) Connexica, Quid BI Tools (Hive interface) JasperSoft, MicroStrategy, Tableau…. 9 Data Deluge - Data Is Arriving Faster Than We Can Consume It – How Good Is Your Filter? Enterprise F DI A L Enterprise systems TT AE R 10 Copyright © Intelligent Business Strategies 2012 - All Rights Reserved 5
6.
26/04/2012
Data Management Tools Are Being Extended To Embrace And Exploit Massively Parallel Hadoop Clusters Approaches: • Custom code • Data Management tools suites: e.g. IBM InfoSphere Datastage and Smart Consolidation (uses InfoSphere Blueprint Director), Informatica, Pervasive, Petaho, Talend Extract Data from Hadoop Invoke Custom Analytics on Hadoop Transform & Cleanse Data in Hadoop (MapReduce) Parse & Prepare Data in Hadoop (MapReduce) Data management Discover data in Hadoop tools Load Data into Hadoop Trends: Expect MUCH more from data management tool vendors including generation of MapReduce code to clean and transform data 11 Processing Text Is A Key Part of Hadoop Based Analysis What Is Text Analytics?– deriving data from unstructured content Popular data sources include • Social media, email, news articles, on-line forums Requires pre-processing prior to analysis • Parsing, correction, phase extraction, semantic grouping 12 Copyright © Intelligent Business Strategies 2012 - All Rights Reserved 6
7.
26/04/2012
Tools Are Appearing To Make It Easier To Parse Data In Hadoop To Make It Easier To Analyse Product Example: Informatica HParser Source: Informatica 13 Big Data Integration - Talend Open Studio for Big Data Enhancing a big data job with Data Quality Several data quality components are included in Source: Talend the open source version 14 Copyright © Intelligent Business Strategies 2012 - All Rights Reserved 7
8.
26/04/2012
Accelerating Custom Data Integration and Preparation Pervasive DataRush for Hadoop • Syncsort DMExpress Hadoop Edition: • Call DataRush from MapReduce • Move data in and out of HDFS • MapReduce runs faster • Create jobs using the DMExpress GUI and run them within the • Less code to write Hadoop Of interest to Map Reduce • Shift transformations to the developers DMExpress engine • Invokes high performance compression Hadoop Distributed File System Mapper Mapper Mapper Mapper DataRush DataRush DataRush DataRush DMX DMX DMX DMX Reducer Reducer DataRush DataRush Hadoop Acceleration 15 Leveraging Hadoop For Data Integration On Massive Volumes Of Data To Bring Additional Insights Into A DW Hundreds of Cloud Data e.g. Deriving insight from huge terabytes up volumes of social web content on to petabytes sites like twitter, facebook. Digg, mySpace, tripAdvisor, Linkedin….for sentiment analytics Operational systems Extract D DW Cloud Data Transform I Map/ Reduce apps HDFS relevant e.g. PIG, IBM JAQL insight 16 Copyright © Intelligent Business Strategies 2012 - All Rights Reserved 8
9.
26/04/2012
Product Example – Pentaho Enterprise Data Services Suite Support For Hadoop Source: Pentaho 17 In-Hadoop Analytics – Example Technologies Analytical tools Hadoop MapReduce programs with custom analytics Hadoop MapReduce programs with Hadoop Mahout • Several analytical algorithms for use in batch analysis Pervasive DataRush For Hadoop Analytics Engine Radoop (UI on RapidMiner) Data management Revolution Analytics RevoScaleR tools …. 18 Copyright © Intelligent Business Strategies 2012 - All Rights Reserved 9
10.
26/04/2012
New Big Data Analytics Technologies Are Emerging On Hadoop – E.g. Radoop Radoop interfaces RapidMiner (open source) with Hadoop and integrates with Hive and Mahout providing a UI for Hadoop based analytics Source: “Radoop – It’s Like Yahoo Pipes for Hadoop” http://siliconangle.com/blog/2011/08/11/radoop-its-like-yahoo-pipes-for-hadoop/ 19 Revolution Analytics RevoScaleR for Distributed Computing Clusters Scaling R for Big Data Analytics • Portions of the data source are made available to each Compute compute node Data Node Partition (RevoScaleR) • RevoScaleR on the master node assigns a task to each Compute Data Node compute node Partition (RevoScaleR) Master • Each compute node Node Compute (RevoScaleR) independently processes its Data Node Partition (RevoScaleR) data, and returns it’s intermediate results back to Compute the master node Data Node Partition (RevoScaleR) • master node aggregates all of the intermediate results from each compute node and produces the final result Source: Revolution Analytics 2020 Copyright © Intelligent Business Strategies 2012 - All Rights Reserved 10
11.
26/04/2012
Analysing Hadoop Data – Multiple Options Batch analytics: • Custom MapReduce applications • Analytical Tools generating MapReduce Analytical • Karmasphere, Datameer, • IBM Cognos Content Analytics tools Search Based Tools (built on Lucene) Connexica, Quid BI Tools using Hive QL JasperSoft, MicroStrategy, Tableau Data e.g. Log files Social networks management Clickstream tools Source: Datameer 21 Big Data Analysis - Exploratory Analysis of Multi-Structured Data In Hadoop via Search e.g. IBM BigIndex (part of IBM BigInsights) File Use massively parallel Map Reduce servers to build a partitioned search index index partitions Web sites BI Tools, Applications, email Mashups CMS LOAD index index Index Image partition server Collab tools Useful for analysing un-modelled semi-structured Web content that is not well understood feeds 22 Copyright © Intelligent Business Strategies 2012 - All Rights Reserved 11
12.
26/04/2012
Search Based Analytical Tools For Big Data - E.g. Connexica (runs on top of Lucene indexes) Connexica Venn Diagrams Connexica Dashboard 23 Data Warehouse Appliances – Analytical Workloads on Structured Data using ADBMSs and BI Tools Analytical IBM Cognos, IBI WebFocus MicroStrategy, tools Oracle BIEE, SAP BusinessObjects, SAS, Pentaho, Jaspersoft, QlikView, Tableau MPP analytical DBMS, in-database analytics, Columnar and row storage IBM InfoSphere DataStage, Informatica Data PowerCentre, Microsoft SSIS, Oracle Data management Integrator, Pervasive, Pentaho ETL tools Talend ETL CRM ERP SCM Workload analysis characteristics Historical reporting and analysis, investigative analytics Data characteristics Medium and large volumes, structured data 24 Copyright © Intelligent Business Strategies 2012 - All Rights Reserved 12
13.
26/04/2012
In-Database Analytics – E.g. SAS Has Completely Re-Written Analytics to Exploit Parallelism E.g. SAS High Performance Analytics and Teradata Runs ‘alongside’ the ADBMS as peers in the same MPP nodes • In-memory passing of data between DBMS and analytic models within every node without data movement • Highly parallel, in-memory execution of analytics delivered across a distributed computing environment – Linear regression and variable selection with classical and modern methods – Nonlinear regression and maximum likelihood – Correlation analysis In-Database vs. Alongside-DBMS – Logistic regression – Neural nets – Linear mixed models – Optimization GA Q4 2011 25 Trends in Data Science Tooling – Tools Are Broadening Their Reach Analytical Analytical tools tools streaming data Data Data management tools management tools CRM ERP SCM Machine generated, markets data, RDF/OWL sensors 26 Copyright © Intelligent Business Strategies 2012 - All Rights Reserved 13
14.
26/04/2012
Microsoft Big Data Solution – SQL Server 2012 Hive ODBC Driver & Hive Add-in For Excel and PowerPivot Source: Microsoft 27 Front End Tools Interfacing With Hadoop And Analytical RDBMS e.g. Karmasphere Datameer, IBM Cognos Content Analytics e.g.Connexica, Quid BI tools platform & Map Reduce Search based Custom data visualisation tools BI tools BI tools Map Reduce applications SAP BO, SQL IBM Cognos, Oracle BIEE, Indexes MicroStrategy, JasperSoft, MPP RDBMS Pentaho, MS Excel Polymorphic table function 28 Copyright © Intelligent Business Strategies 2012 - All Rights Reserved 14
15.
26/04/2012
Tools To Govern Data Science Projects – Data Sources, Sandboxes, People, Results governance governance governance Sandbox MPP Analytical RDBMS Graph DBMS DW governance governance Social graph data Unstructured / semi-structured content clickstream Files RDBMS Web logs governance 29 Governance: Big Data Projects Need To Be Managed – E.g. EMC GreenPlum Chorus Workspaces, sandboxes, people and data sources can all be governed Source: EMC GreenPlum 30 Copyright © Intelligent Business Strategies 2012 - All Rights Reserved 15
16.
26/04/2012
Architectures – Integrating Big Data Analytics Into The Enterprise users Business analysts BI tools platform & Map Reduce Search based data visualisation tools BI tools BI tools developers actions SQL Custom real-time Indexes MR apps Stream processing MPP RDBMS Graph DBMS Polymorphic table function(s) Event Social streams graph data OLTP data Unstructured / semi-structured content Information Management and Services XML, clickstream JSON Cloud Data Files web services RDBMS Cubes Web logs office web content docs 31 Thank You! www.intelligentbusiness.biz mferguson@intelligentbusiness.biz Twitter: @mikeferguson1 Tel/Fax (+44)1625 520700 32 Copyright © Intelligent Business Strategies 2012 - All Rights Reserved 16