SlideShare a Scribd company logo
1 of 12
Roadmap & Release Plan
10-12-2010
Cloudera Roadmap & Release Plan - DISCLAIMER
 The information in this document is proprietary to Cloudera. No part of this document may be reproduced,
copied or transmitted in any form for any purpose without the express prior written permission of Cloudera.
 This document is not subject to any agreement with Cloudera. This document contains only intended
strategies, developments and functionalities of Cloudera products and is not intended to be binding upon
Cloudera to any particular course of business, product strategy and/or development. Please note that this
document is subject to change and may be changed by Cloudera at any time without notice.
 This document is provided without a warranty of any kind, either express or implied, including but not limited
to the implied warranties of merchantability, fitness for a particular purpose or non-infringement.
 Cloudera shall have no liability for damages of any kind including without limitation direct, special, indirect or
consequential damages that may result from the use of these materials. The limitation shall not apply in
cases of gross negligence.
Goals for this session
• Give you visibility into where Cloudera is
going next
• Help explain why Cloudera is investing
where it is
• Get you to be a part of it
Copyright 2010 Cloudera Inc.
Cloudera’s product strategy
• Provide the reference distribution for the Hadoop
platform
• Functionally complete
• Performant and secure
• Integrated & tested
• Easy to trial & consume
• 100% Apache licensed
• Open to partners and the extended IT
ecosystem
• Provide a commercial solution to helps enterprises
run Hadoop in production
• Software & services
• Increase transparency, consistency & reliability
• Lower the cost & complexity of administration
• Improved compliance to policies & processes
Copyright 2010 Cloudera Inc.
Cloudera’s
Distribution
for Hadoop
Cloudera
Enterprise
Cloudera’s release strategy
• The platform release
• Releases annually
• Public beta
• Comprised of several open source software
component versions & patches
• The applications release
• Releases semi-annually
• Private beta
• Has dependencies to specific platform (CDH)
version(s)
Copyright 2010 Cloudera Inc.
Cloudera’s
Distribution
for Hadoop
Cloudera
Enterprise
CDH themes
• CDH3 – move from kernel to platform
• Provide the features of a platform - expansion of the functional
footprint & take ownership of combining & integrating a single
platform versus the current industry practice of “roll your own”
• Enable others to build on the platform - better functionality for
integration of RDBMS and BI
• Centralize basic functions of the platform - configuration and
service management (more on this…)
• Incremental enhancements to existing components
• CDH4 – consistency and performance
• Extend high availability & authorization throughout the platform
• Rationalize duplicate functions across the platform
• Improve performance throughout
• Incremental enhancements to existing components
Copyright 2010 Cloudera Inc.
What’s CMF?
Datanodes
Region
Servers
Collectors
Flume
Processors
MastersJob
Trackers
Name
Node
Secondary
NN
Task
Trackers
Zookeep
er Quora
Workflow
Servers
Hue
CMF CMF Agent CMF Agent CMF Agent
CMF AgentCMF AgentCMF Agent
CMF Agent CMF AgentCMF Agent
CMF Agent
CMF Agent
Systems Monitor
(Hyperic, Zenoss,
Nagios, etc.)
Configuration
Management
(Puppet, Chef,
cfEngine)
Server / VM
Provisioning
(Bladelogic,
HP, IBM,
Eucalyptus)
• Service &
process mgt
• Unified
config
• Monitor
• Binary
distribution
(optional)
CMF Agent
A framework that helps organizations to operate Hadoop services and resources as a
unified system
In scope Out of scope
• Governance of distributed services and
individual daemons (start, stop, restart, flag)
• Service configuration
• “Movement” of services across physical hosts
• Change management database
• Cross-system issues (e.g. dev – test – prod)
• Operating system and / or JVM
configuration management
• Resource (e.g. server, network, VM)
provisioning
Cloudera Enterprise
• Reduces the risks of running Hadoop in production
• Improves consistency, compliance and administrative overhead
Management applications
• Authorization mgmt &
provisioning
• Monitoring
• Resource mgmt
• System lifecycle
(planned)
• Production support for CDH & certified integrations (Oracle,
Netezza, Teradata, Greenplum, Aster Data)
8Copyright 2010 Cloudera Inc. All rights reserved
Applications
Enterprise management applications themes
• Enterprise 3.0 – cover some immediate enterprise needs
• Extend authorization management & administration to meet the
needs of more complex organizations
• Track the usage of scarce cluster resources
• Monitor incoming data via Flume
• Enterprise 3.5 – improve transparency & automation
• Real-time activity monitor (more on this…)
• Expand file browser to show provenance & ownership of data
including multi-parameter search
• Extended authorization management & administration
• Enhancement to existing components
Copyright 2010 Cloudera Inc.
Why an activity monitor?
• SLA’s are typically measured as
activity completion time or slot
availability rate
• Four different trackers to log
into (Hive, Pig, Oozie,
MapReduce) all with different
and incomplete metrics
• No means of setting policies to
correct or fix misbehaving
activities
• Currently no data available to
drive continuous improvement
Copyright 2010 Cloudera Inc.
• Frustrating that ops can’t reliably
measure what they are supposed
to be measured on!
• Incomplete and inconsistent
metrics, measured differently by
activity
• Even with proper use of the
scheduler, misbehaving activities
can drag down a cluster
• “Just add more boxes?”
Get involved!
• The best Cloudera products and features are built in
partnership with customers
• Contact charles@cloudera.com if you are interested
Copyright 2010 Cloudera Inc.
Appendix

More Related Content

More from Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxCloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 

More from Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 

Recently uploaded

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 

Recently uploaded (20)

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 

Cloudera - Charles Zedlewski - Hadoop World 2010

  • 1. Roadmap & Release Plan 10-12-2010
  • 2. Cloudera Roadmap & Release Plan - DISCLAIMER  The information in this document is proprietary to Cloudera. No part of this document may be reproduced, copied or transmitted in any form for any purpose without the express prior written permission of Cloudera.  This document is not subject to any agreement with Cloudera. This document contains only intended strategies, developments and functionalities of Cloudera products and is not intended to be binding upon Cloudera to any particular course of business, product strategy and/or development. Please note that this document is subject to change and may be changed by Cloudera at any time without notice.  This document is provided without a warranty of any kind, either express or implied, including but not limited to the implied warranties of merchantability, fitness for a particular purpose or non-infringement.  Cloudera shall have no liability for damages of any kind including without limitation direct, special, indirect or consequential damages that may result from the use of these materials. The limitation shall not apply in cases of gross negligence.
  • 3. Goals for this session • Give you visibility into where Cloudera is going next • Help explain why Cloudera is investing where it is • Get you to be a part of it Copyright 2010 Cloudera Inc.
  • 4. Cloudera’s product strategy • Provide the reference distribution for the Hadoop platform • Functionally complete • Performant and secure • Integrated & tested • Easy to trial & consume • 100% Apache licensed • Open to partners and the extended IT ecosystem • Provide a commercial solution to helps enterprises run Hadoop in production • Software & services • Increase transparency, consistency & reliability • Lower the cost & complexity of administration • Improved compliance to policies & processes Copyright 2010 Cloudera Inc. Cloudera’s Distribution for Hadoop Cloudera Enterprise
  • 5. Cloudera’s release strategy • The platform release • Releases annually • Public beta • Comprised of several open source software component versions & patches • The applications release • Releases semi-annually • Private beta • Has dependencies to specific platform (CDH) version(s) Copyright 2010 Cloudera Inc. Cloudera’s Distribution for Hadoop Cloudera Enterprise
  • 6. CDH themes • CDH3 – move from kernel to platform • Provide the features of a platform - expansion of the functional footprint & take ownership of combining & integrating a single platform versus the current industry practice of “roll your own” • Enable others to build on the platform - better functionality for integration of RDBMS and BI • Centralize basic functions of the platform - configuration and service management (more on this…) • Incremental enhancements to existing components • CDH4 – consistency and performance • Extend high availability & authorization throughout the platform • Rationalize duplicate functions across the platform • Improve performance throughout • Incremental enhancements to existing components Copyright 2010 Cloudera Inc.
  • 7. What’s CMF? Datanodes Region Servers Collectors Flume Processors MastersJob Trackers Name Node Secondary NN Task Trackers Zookeep er Quora Workflow Servers Hue CMF CMF Agent CMF Agent CMF Agent CMF AgentCMF AgentCMF Agent CMF Agent CMF AgentCMF Agent CMF Agent CMF Agent Systems Monitor (Hyperic, Zenoss, Nagios, etc.) Configuration Management (Puppet, Chef, cfEngine) Server / VM Provisioning (Bladelogic, HP, IBM, Eucalyptus) • Service & process mgt • Unified config • Monitor • Binary distribution (optional) CMF Agent A framework that helps organizations to operate Hadoop services and resources as a unified system In scope Out of scope • Governance of distributed services and individual daemons (start, stop, restart, flag) • Service configuration • “Movement” of services across physical hosts • Change management database • Cross-system issues (e.g. dev – test – prod) • Operating system and / or JVM configuration management • Resource (e.g. server, network, VM) provisioning
  • 8. Cloudera Enterprise • Reduces the risks of running Hadoop in production • Improves consistency, compliance and administrative overhead Management applications • Authorization mgmt & provisioning • Monitoring • Resource mgmt • System lifecycle (planned) • Production support for CDH & certified integrations (Oracle, Netezza, Teradata, Greenplum, Aster Data) 8Copyright 2010 Cloudera Inc. All rights reserved Applications
  • 9. Enterprise management applications themes • Enterprise 3.0 – cover some immediate enterprise needs • Extend authorization management & administration to meet the needs of more complex organizations • Track the usage of scarce cluster resources • Monitor incoming data via Flume • Enterprise 3.5 – improve transparency & automation • Real-time activity monitor (more on this…) • Expand file browser to show provenance & ownership of data including multi-parameter search • Extended authorization management & administration • Enhancement to existing components Copyright 2010 Cloudera Inc.
  • 10. Why an activity monitor? • SLA’s are typically measured as activity completion time or slot availability rate • Four different trackers to log into (Hive, Pig, Oozie, MapReduce) all with different and incomplete metrics • No means of setting policies to correct or fix misbehaving activities • Currently no data available to drive continuous improvement Copyright 2010 Cloudera Inc. • Frustrating that ops can’t reliably measure what they are supposed to be measured on! • Incomplete and inconsistent metrics, measured differently by activity • Even with proper use of the scheduler, misbehaving activities can drag down a cluster • “Just add more boxes?”
  • 11. Get involved! • The best Cloudera products and features are built in partnership with customers • Contact charles@cloudera.com if you are interested Copyright 2010 Cloudera Inc.