SlideShare a Scribd company logo
1 of 30
Warehouse-Scale
Computers
CS4342 Advanced Computer Architecture
Dilum Bandara
Dilum.Bandara@uom.lk
Slides adapted from “Computer Architecture, A Quantitative Approach” by John L.
Hennessy and David A. Patterson, 5th Edition, 2012, MK Publishers and
The Datacenter as a Computer:An Introduction to the Design of Warehouse-Scale
Machines by Luiz André Barroso & Urs Hölzle
Outline
 Programming model & workloads
 Architectures
 Cloud computing
2
Warehouse-Scale Computers (WSC)
3
www.laserfocusworld.com/articles/print/volume-48/issue-
12/features/optical-technologies-scale-the-datacenter.html http://www.slashgear.com/google-data-center-hd-photos-
hit-where-the-internet-lives-gallery-17252451/
WSC (Cont.)
4
WSC Layout
5
Source: http://bnrg.cs.berkeley.edu/~randy/Courses/CS294.F07/
Main Components of a WSC
6
Warehouse-Scale Computer (WSC)
 Provides Internet services
 Search, social networking, online maps, video sharing,
online shopping, email, cloud computing, etc.
 Differences with HPC clusters
 Clusters use higher performance processors & network
 Clusters emphasize thread-level parallelism, WSCs
emphasize request/task-level parallelism
 Differences with datacenters
 Datacenters consolidate different machines & software
into a single location
 Datacenters emphasize virtual machines & hardware
heterogeneity to serve varied customers 7
Design Factors for WSC
 Cost-performance
 Small savings add up
 Energy efficiency
 Affects power distribution & cooling
 Work per joule
 Operational costs count
 Power consumption is a primary constraint when
designing a system
 Dependability via redundancy
 Many low-cost components
8
Design Factors (Cont.)
 Network I/O
 Interactive & batch processing workloads
 Web search – interactive
 Web indexing – batch
 Ample computational parallelism isn’t important
 Most jobs are totally independent, “Request-level
parallelism”
 Scale – Its opportunities & problems
 Can afford to build customized systems as WSC
require volume purchase
 Frequent failures
9
Failure Example
 Consider a WSC with 50,000 nodes. MTTF of a node is 5
years. How many failures be there for a day?
MTTF in days = 5 x 365 = 1,825
Failure rate = 1/1,825 per day
No of failures per day = 50,000/1,825 = 27.4
 Consider a WSC with 50,000 nodes & each node with 4
hard disks. Suppose a annual failure rate of a disk is 4%.
What is the time for a disk failure?
No of disks = 50,000 x 4 = 200,000
No of failures per year = 200,000 x 0.04 = 8,000
Time for failure = 365 x 24 / 8,000 = 1.095 hours/failure 10
Programming Models & Workloads
 Batch processing framework
– MapReduce
 Map
 Applies a programmer-
supplied function to each
logical input record
 Runs on thousands of
computers
 Provides new set of (key,
value) pairs as intermediate
values
 Reduce
 Collapses values using
another function 11
http://www.cbsolution.net/techniques/ontarget/mapredu
ce_vs_data_warehouse
MapReduce Execution
12
Source: Dean et. al.,
“MapReduce, OSDI, 2004
Programming Models & Workloads
(Cont.)
13
www.datanami.com/datanami/2012-07-
16/top_5_challenges_for_hadoop_mapreduce
_in_the_enterprise.html
Programming Models & Workloads
(Cont.)
 MapReduce runtime environment schedules
map & reduce task to WSC nodes
 Availability
 Use replicas of data across different servers
 Use relaxed consistency
 No need for all replicas to always agree
 Workload demands
 Often vary considerably
14
Computer Architecture of WSC
 Often uses a hierarchy of networks for
interconnection
 Each 19” rack holds 48 1U servers connected to
a rack switch
 Rack switches are uplinked to a switch(es)
higher in hierarchy
 Uplink has 48/n times lower bandwidth –
Oversubscription
 n – No of uplink ports
 Goal is to maximize locality of communication relative
to the rack
15
Hierarchy of Switches
16
Network Hierarchy
17
Source: www.laserfocusworld.com/articles/print/volume-48/issue-12/features/optical-
technologies-scale-the-datacenter.html
Storage Hierarchy
18
Infrastructure & Costs
 Location
 Proximity to Internet backbones, electricity cost, property tax rates,
low risk from earthquakes, floods, & hurricanes
 Power distribution
19
Power Usage
20
U.S. EPA Report 2007 – 1.5% of total U.S.
power consumption used by data centers
which has more than doubled since 2000 &
costs $4.5 billion
How Many Nodes can a WSC Support?
 Each node
 “Nameplate power rating” gives maximum power
consumption
 To get actual, measure power under actual workloads
 Oversubscribe cumulative nodes power by 40%,
but monitor power closely
21
Cooling
22
Typically operate around 18 – 22 0C
Cooling (Cont.)
23
Cooling system also uses water (evaporation & spills)
e.g. 70,000 to 200,000 gallons per day for an 8 MW facility
Efficiency
 Power Utilization Effectiveness (PUE)
= Total facility power / IT equipment power
 ≥ 1
 Median PUE on 2006 study was 1.69
24
Source: http://hightech.lbl.gov/benchmarking-guides/data-a1.html
Performance
 Latency is important metric because it is seen by
users
 Bing study
 Users will use search less as response time
increases
 Service Level Objectives (SLOs) & Service Level
Agreements (SLAs)
 Typically given at application level
 e.g., 99% of requests be below 100 ms
 In clouds typically given only for static resources
 CPU speed, no of cores, & memory
25
Cost
 Capital expenditures (CAPEX)
 Cost to build a WSC
 Hardware cost dominates
 Operational expenditures (OPEX)
 Cost to operate a WSC
 Power for nodes & cooling dominates
26
Cloud Computing
27
Clients
Other
Cloud Services
Govt.
Cloud Services
Private
Cloud
Cloud
Manager
Public Cloud
Green Cloud Computing by Dr. Rajkumar Buyya
Cloud Computing (Cont.)
 WSCs offer economies of scale that can’t be
achieved with a datacenter
 5.7 times reduction in storage costs
 7.1 times reduction in administrative costs
 7.3 times reduction in networking costs
 This has given rise to cloud services such as Amazon
Web Services
 “Utility Computing”
 Based on using open source virtual machine & operating
system software
28
Amazon Web Services
 Virtual machines
 XEN
 Very low cost
 $ 0.10 per hour per instance
 Primary rely on open source software
 No (initial) service guarantees
 No contract required
 Amazon S3
 Simple Storage Service
 Amazon EC2
 Elastic Computer Cloud 29
Amazon Web Services – Example
30
http://www.ryhug.com/free-art-available-on-amazon-amazon-web-services-that-is/

More Related Content

Similar to Introduction to Warehouse-Scale Computers

The Cloud & Its Impact on IT
The Cloud & Its Impact on ITThe Cloud & Its Impact on IT
The Cloud & Its Impact on ITAnand Haridass
 
Desktop to Cloud Transformation Planning
Desktop to Cloud Transformation PlanningDesktop to Cloud Transformation Planning
Desktop to Cloud Transformation PlanningPhearin Sok
 
Paolo Merialdo, Cloud Computing and Virtualization: una introduzione
Paolo Merialdo, Cloud Computing and Virtualization: una introduzionePaolo Merialdo, Cloud Computing and Virtualization: una introduzione
Paolo Merialdo, Cloud Computing and Virtualization: una introduzioneInnovAction Lab
 
Cloud Roundtable at Microsoft Switzerland
Cloud Roundtable at Microsoft Switzerland Cloud Roundtable at Microsoft Switzerland
Cloud Roundtable at Microsoft Switzerland mictc
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)ijceronline
 
Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...
Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...
Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...jsvetter
 
E source energy managers conf 4 24-13-final
E source energy managers conf 4 24-13-finalE source energy managers conf 4 24-13-final
E source energy managers conf 4 24-13-finaljosh whitney
 
Optimising Service Deployment and Infrastructure Resource Configuration
Optimising Service Deployment and Infrastructure Resource ConfigurationOptimising Service Deployment and Infrastructure Resource Configuration
Optimising Service Deployment and Infrastructure Resource ConfigurationRECAP Project
 
Disaster Recovery Experience at CACIB: Hardening Hadoop for Critical Financia...
Disaster Recovery Experience at CACIB: Hardening Hadoop for Critical Financia...Disaster Recovery Experience at CACIB: Hardening Hadoop for Critical Financia...
Disaster Recovery Experience at CACIB: Hardening Hadoop for Critical Financia...DataWorks Summit
 
Green cloud computing
Green cloud computingGreen cloud computing
Green cloud computingNalini Mehta
 
Performance Improvement of Cloud Computing Data Centers Using Energy Efficien...
Performance Improvement of Cloud Computing Data Centers Using Energy Efficien...Performance Improvement of Cloud Computing Data Centers Using Energy Efficien...
Performance Improvement of Cloud Computing Data Centers Using Energy Efficien...IJAEMSJORNAL
 
AI Sustainability Mascots 23-f.pptx
AI Sustainability Mascots 23-f.pptxAI Sustainability Mascots 23-f.pptx
AI Sustainability Mascots 23-f.pptxTamar Eilam
 
Cloudsim & Green Cloud
Cloudsim & Green CloudCloudsim & Green Cloud
Cloudsim & Green CloudNeda Maleki
 
Taking High Performance Computing to the Cloud: Windows HPC and
Taking High Performance Computing to the Cloud: Windows HPC and Taking High Performance Computing to the Cloud: Windows HPC and
Taking High Performance Computing to the Cloud: Windows HPC and Saptak Sen
 

Similar to Introduction to Warehouse-Scale Computers (20)

The Cloud & Its Impact on IT
The Cloud & Its Impact on ITThe Cloud & Its Impact on IT
The Cloud & Its Impact on IT
 
Scheduling in CCE
Scheduling in CCEScheduling in CCE
Scheduling in CCE
 
Desktop to Cloud Transformation Planning
Desktop to Cloud Transformation PlanningDesktop to Cloud Transformation Planning
Desktop to Cloud Transformation Planning
 
Paolo Merialdo, Cloud Computing and Virtualization: una introduzione
Paolo Merialdo, Cloud Computing and Virtualization: una introduzionePaolo Merialdo, Cloud Computing and Virtualization: una introduzione
Paolo Merialdo, Cloud Computing and Virtualization: una introduzione
 
Cloud Roundtable at Microsoft Switzerland
Cloud Roundtable at Microsoft Switzerland Cloud Roundtable at Microsoft Switzerland
Cloud Roundtable at Microsoft Switzerland
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Cloud Computing
Cloud ComputingCloud Computing
Cloud Computing
 
Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...
Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...
Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...
 
E source energy managers conf 4 24-13-final
E source energy managers conf 4 24-13-finalE source energy managers conf 4 24-13-final
E source energy managers conf 4 24-13-final
 
Optimising Service Deployment and Infrastructure Resource Configuration
Optimising Service Deployment and Infrastructure Resource ConfigurationOptimising Service Deployment and Infrastructure Resource Configuration
Optimising Service Deployment and Infrastructure Resource Configuration
 
Disaster Recovery Experience at CACIB: Hardening Hadoop for Critical Financia...
Disaster Recovery Experience at CACIB: Hardening Hadoop for Critical Financia...Disaster Recovery Experience at CACIB: Hardening Hadoop for Critical Financia...
Disaster Recovery Experience at CACIB: Hardening Hadoop for Critical Financia...
 
Green cloud computing
Green cloud computingGreen cloud computing
Green cloud computing
 
Summer Intern Report
Summer Intern ReportSummer Intern Report
Summer Intern Report
 
Performance Improvement of Cloud Computing Data Centers Using Energy Efficien...
Performance Improvement of Cloud Computing Data Centers Using Energy Efficien...Performance Improvement of Cloud Computing Data Centers Using Energy Efficien...
Performance Improvement of Cloud Computing Data Centers Using Energy Efficien...
 
AI Sustainability Mascots 23-f.pptx
AI Sustainability Mascots 23-f.pptxAI Sustainability Mascots 23-f.pptx
AI Sustainability Mascots 23-f.pptx
 
What Makes Software Green?
What Makes Software Green?What Makes Software Green?
What Makes Software Green?
 
Cloudsim & Green Cloud
Cloudsim & Green CloudCloudsim & Green Cloud
Cloudsim & Green Cloud
 
Taking High Performance Computing to the Cloud: Windows HPC and
Taking High Performance Computing to the Cloud: Windows HPC and Taking High Performance Computing to the Cloud: Windows HPC and
Taking High Performance Computing to the Cloud: Windows HPC and
 
cloud computing
cloud computing cloud computing
cloud computing
 
Benefits of Hadoop as Platform as a Service
Benefits of Hadoop as Platform as a ServiceBenefits of Hadoop as Platform as a Service
Benefits of Hadoop as Platform as a Service
 

More from Dilum Bandara

Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningDilum Bandara
 
Time Series Analysis and Forecasting in Practice
Time Series Analysis and Forecasting in PracticeTime Series Analysis and Forecasting in Practice
Time Series Analysis and Forecasting in PracticeDilum Bandara
 
Introduction to Dimension Reduction with PCA
Introduction to Dimension Reduction with PCAIntroduction to Dimension Reduction with PCA
Introduction to Dimension Reduction with PCADilum Bandara
 
Introduction to Descriptive & Predictive Analytics
Introduction to Descriptive & Predictive AnalyticsIntroduction to Descriptive & Predictive Analytics
Introduction to Descriptive & Predictive AnalyticsDilum Bandara
 
Introduction to Concurrent Data Structures
Introduction to Concurrent Data StructuresIntroduction to Concurrent Data Structures
Introduction to Concurrent Data StructuresDilum Bandara
 
Hard to Paralelize Problems: Matrix-Vector and Matrix-Matrix
Hard to Paralelize Problems: Matrix-Vector and Matrix-MatrixHard to Paralelize Problems: Matrix-Vector and Matrix-Matrix
Hard to Paralelize Problems: Matrix-Vector and Matrix-MatrixDilum Bandara
 
Introduction to Map-Reduce Programming with Hadoop
Introduction to Map-Reduce Programming with HadoopIntroduction to Map-Reduce Programming with Hadoop
Introduction to Map-Reduce Programming with HadoopDilum Bandara
 
Embarrassingly/Delightfully Parallel Problems
Embarrassingly/Delightfully Parallel ProblemsEmbarrassingly/Delightfully Parallel Problems
Embarrassingly/Delightfully Parallel ProblemsDilum Bandara
 
Introduction to Thread Level Parallelism
Introduction to Thread Level ParallelismIntroduction to Thread Level Parallelism
Introduction to Thread Level ParallelismDilum Bandara
 
CPU Memory Hierarchy and Caching Techniques
CPU Memory Hierarchy and Caching TechniquesCPU Memory Hierarchy and Caching Techniques
CPU Memory Hierarchy and Caching TechniquesDilum Bandara
 
Data-Level Parallelism in Microprocessors
Data-Level Parallelism in MicroprocessorsData-Level Parallelism in Microprocessors
Data-Level Parallelism in MicroprocessorsDilum Bandara
 
Instruction Level Parallelism – Hardware Techniques
Instruction Level Parallelism – Hardware TechniquesInstruction Level Parallelism – Hardware Techniques
Instruction Level Parallelism – Hardware TechniquesDilum Bandara
 
Instruction Level Parallelism – Compiler Techniques
Instruction Level Parallelism – Compiler TechniquesInstruction Level Parallelism – Compiler Techniques
Instruction Level Parallelism – Compiler TechniquesDilum Bandara
 
CPU Pipelining and Hazards - An Introduction
CPU Pipelining and Hazards - An IntroductionCPU Pipelining and Hazards - An Introduction
CPU Pipelining and Hazards - An IntroductionDilum Bandara
 
High Performance Networking with Advanced TCP
High Performance Networking with Advanced TCPHigh Performance Networking with Advanced TCP
High Performance Networking with Advanced TCPDilum Bandara
 
Introduction to Content Delivery Networks
Introduction to Content Delivery NetworksIntroduction to Content Delivery Networks
Introduction to Content Delivery NetworksDilum Bandara
 
Peer-to-Peer Networking Systems and Streaming
Peer-to-Peer Networking Systems and StreamingPeer-to-Peer Networking Systems and Streaming
Peer-to-Peer Networking Systems and StreamingDilum Bandara
 
Wired Broadband Communication
Wired Broadband CommunicationWired Broadband Communication
Wired Broadband CommunicationDilum Bandara
 

More from Dilum Bandara (20)

Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Time Series Analysis and Forecasting in Practice
Time Series Analysis and Forecasting in PracticeTime Series Analysis and Forecasting in Practice
Time Series Analysis and Forecasting in Practice
 
Introduction to Dimension Reduction with PCA
Introduction to Dimension Reduction with PCAIntroduction to Dimension Reduction with PCA
Introduction to Dimension Reduction with PCA
 
Introduction to Descriptive & Predictive Analytics
Introduction to Descriptive & Predictive AnalyticsIntroduction to Descriptive & Predictive Analytics
Introduction to Descriptive & Predictive Analytics
 
Introduction to Concurrent Data Structures
Introduction to Concurrent Data StructuresIntroduction to Concurrent Data Structures
Introduction to Concurrent Data Structures
 
Hard to Paralelize Problems: Matrix-Vector and Matrix-Matrix
Hard to Paralelize Problems: Matrix-Vector and Matrix-MatrixHard to Paralelize Problems: Matrix-Vector and Matrix-Matrix
Hard to Paralelize Problems: Matrix-Vector and Matrix-Matrix
 
Introduction to Map-Reduce Programming with Hadoop
Introduction to Map-Reduce Programming with HadoopIntroduction to Map-Reduce Programming with Hadoop
Introduction to Map-Reduce Programming with Hadoop
 
Embarrassingly/Delightfully Parallel Problems
Embarrassingly/Delightfully Parallel ProblemsEmbarrassingly/Delightfully Parallel Problems
Embarrassingly/Delightfully Parallel Problems
 
Introduction to Thread Level Parallelism
Introduction to Thread Level ParallelismIntroduction to Thread Level Parallelism
Introduction to Thread Level Parallelism
 
CPU Memory Hierarchy and Caching Techniques
CPU Memory Hierarchy and Caching TechniquesCPU Memory Hierarchy and Caching Techniques
CPU Memory Hierarchy and Caching Techniques
 
Data-Level Parallelism in Microprocessors
Data-Level Parallelism in MicroprocessorsData-Level Parallelism in Microprocessors
Data-Level Parallelism in Microprocessors
 
Instruction Level Parallelism – Hardware Techniques
Instruction Level Parallelism – Hardware TechniquesInstruction Level Parallelism – Hardware Techniques
Instruction Level Parallelism – Hardware Techniques
 
Instruction Level Parallelism – Compiler Techniques
Instruction Level Parallelism – Compiler TechniquesInstruction Level Parallelism – Compiler Techniques
Instruction Level Parallelism – Compiler Techniques
 
CPU Pipelining and Hazards - An Introduction
CPU Pipelining and Hazards - An IntroductionCPU Pipelining and Hazards - An Introduction
CPU Pipelining and Hazards - An Introduction
 
High Performance Networking with Advanced TCP
High Performance Networking with Advanced TCPHigh Performance Networking with Advanced TCP
High Performance Networking with Advanced TCP
 
Introduction to Content Delivery Networks
Introduction to Content Delivery NetworksIntroduction to Content Delivery Networks
Introduction to Content Delivery Networks
 
Peer-to-Peer Networking Systems and Streaming
Peer-to-Peer Networking Systems and StreamingPeer-to-Peer Networking Systems and Streaming
Peer-to-Peer Networking Systems and Streaming
 
Mobile Services
Mobile ServicesMobile Services
Mobile Services
 
Wired Broadband Communication
Wired Broadband CommunicationWired Broadband Communication
Wired Broadband Communication
 
Mobile IP
Mobile IPMobile IP
Mobile IP
 

Recently uploaded

AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 

Recently uploaded (20)

AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 

Introduction to Warehouse-Scale Computers

  • 1. Warehouse-Scale Computers CS4342 Advanced Computer Architecture Dilum Bandara Dilum.Bandara@uom.lk Slides adapted from “Computer Architecture, A Quantitative Approach” by John L. Hennessy and David A. Patterson, 5th Edition, 2012, MK Publishers and The Datacenter as a Computer:An Introduction to the Design of Warehouse-Scale Machines by Luiz André Barroso & Urs Hölzle
  • 2. Outline  Programming model & workloads  Architectures  Cloud computing 2
  • 3. Warehouse-Scale Computers (WSC) 3 www.laserfocusworld.com/articles/print/volume-48/issue- 12/features/optical-technologies-scale-the-datacenter.html http://www.slashgear.com/google-data-center-hd-photos- hit-where-the-internet-lives-gallery-17252451/
  • 7. Warehouse-Scale Computer (WSC)  Provides Internet services  Search, social networking, online maps, video sharing, online shopping, email, cloud computing, etc.  Differences with HPC clusters  Clusters use higher performance processors & network  Clusters emphasize thread-level parallelism, WSCs emphasize request/task-level parallelism  Differences with datacenters  Datacenters consolidate different machines & software into a single location  Datacenters emphasize virtual machines & hardware heterogeneity to serve varied customers 7
  • 8. Design Factors for WSC  Cost-performance  Small savings add up  Energy efficiency  Affects power distribution & cooling  Work per joule  Operational costs count  Power consumption is a primary constraint when designing a system  Dependability via redundancy  Many low-cost components 8
  • 9. Design Factors (Cont.)  Network I/O  Interactive & batch processing workloads  Web search – interactive  Web indexing – batch  Ample computational parallelism isn’t important  Most jobs are totally independent, “Request-level parallelism”  Scale – Its opportunities & problems  Can afford to build customized systems as WSC require volume purchase  Frequent failures 9
  • 10. Failure Example  Consider a WSC with 50,000 nodes. MTTF of a node is 5 years. How many failures be there for a day? MTTF in days = 5 x 365 = 1,825 Failure rate = 1/1,825 per day No of failures per day = 50,000/1,825 = 27.4  Consider a WSC with 50,000 nodes & each node with 4 hard disks. Suppose a annual failure rate of a disk is 4%. What is the time for a disk failure? No of disks = 50,000 x 4 = 200,000 No of failures per year = 200,000 x 0.04 = 8,000 Time for failure = 365 x 24 / 8,000 = 1.095 hours/failure 10
  • 11. Programming Models & Workloads  Batch processing framework – MapReduce  Map  Applies a programmer- supplied function to each logical input record  Runs on thousands of computers  Provides new set of (key, value) pairs as intermediate values  Reduce  Collapses values using another function 11 http://www.cbsolution.net/techniques/ontarget/mapredu ce_vs_data_warehouse
  • 12. MapReduce Execution 12 Source: Dean et. al., “MapReduce, OSDI, 2004
  • 13. Programming Models & Workloads (Cont.) 13 www.datanami.com/datanami/2012-07- 16/top_5_challenges_for_hadoop_mapreduce _in_the_enterprise.html
  • 14. Programming Models & Workloads (Cont.)  MapReduce runtime environment schedules map & reduce task to WSC nodes  Availability  Use replicas of data across different servers  Use relaxed consistency  No need for all replicas to always agree  Workload demands  Often vary considerably 14
  • 15. Computer Architecture of WSC  Often uses a hierarchy of networks for interconnection  Each 19” rack holds 48 1U servers connected to a rack switch  Rack switches are uplinked to a switch(es) higher in hierarchy  Uplink has 48/n times lower bandwidth – Oversubscription  n – No of uplink ports  Goal is to maximize locality of communication relative to the rack 15
  • 19. Infrastructure & Costs  Location  Proximity to Internet backbones, electricity cost, property tax rates, low risk from earthquakes, floods, & hurricanes  Power distribution 19
  • 20. Power Usage 20 U.S. EPA Report 2007 – 1.5% of total U.S. power consumption used by data centers which has more than doubled since 2000 & costs $4.5 billion
  • 21. How Many Nodes can a WSC Support?  Each node  “Nameplate power rating” gives maximum power consumption  To get actual, measure power under actual workloads  Oversubscribe cumulative nodes power by 40%, but monitor power closely 21
  • 23. Cooling (Cont.) 23 Cooling system also uses water (evaporation & spills) e.g. 70,000 to 200,000 gallons per day for an 8 MW facility
  • 24. Efficiency  Power Utilization Effectiveness (PUE) = Total facility power / IT equipment power  ≥ 1  Median PUE on 2006 study was 1.69 24 Source: http://hightech.lbl.gov/benchmarking-guides/data-a1.html
  • 25. Performance  Latency is important metric because it is seen by users  Bing study  Users will use search less as response time increases  Service Level Objectives (SLOs) & Service Level Agreements (SLAs)  Typically given at application level  e.g., 99% of requests be below 100 ms  In clouds typically given only for static resources  CPU speed, no of cores, & memory 25
  • 26. Cost  Capital expenditures (CAPEX)  Cost to build a WSC  Hardware cost dominates  Operational expenditures (OPEX)  Cost to operate a WSC  Power for nodes & cooling dominates 26
  • 27. Cloud Computing 27 Clients Other Cloud Services Govt. Cloud Services Private Cloud Cloud Manager Public Cloud Green Cloud Computing by Dr. Rajkumar Buyya
  • 28. Cloud Computing (Cont.)  WSCs offer economies of scale that can’t be achieved with a datacenter  5.7 times reduction in storage costs  7.1 times reduction in administrative costs  7.3 times reduction in networking costs  This has given rise to cloud services such as Amazon Web Services  “Utility Computing”  Based on using open source virtual machine & operating system software 28
  • 29. Amazon Web Services  Virtual machines  XEN  Very low cost  $ 0.10 per hour per instance  Primary rely on open source software  No (initial) service guarantees  No contract required  Amazon S3  Simple Storage Service  Amazon EC2  Elastic Computer Cloud 29
  • 30. Amazon Web Services – Example 30 http://www.ryhug.com/free-art-available-on-amazon-amazon-web-services-that-is/

Editor's Notes

  1. 1U - A rack unit (abbreviated U or RU) is a unit of measure defined as 44.50 mm (1.75 in)
  2. computer room air conditioning (CRAC)
  3. DCiE = 1/PUE
  4. S3 - Simple Storage Service EC2 - Elastic Compute Cloud